
Sumit Aryal
Computer Engineer
About Me
I am a Machine Learning Engineer specializing in NLP and LLM, with expertise in architecting production-grade AI systems including RAG workflows, document processing pipelines, and recommendation engines.
I have proven ability to design, containerize, and scale solutions using cloud native stacks (AWS/Kubernetes/Docker). I am passionate about bridging research and operations, from developing novel NLP models for low-resource languages to implementing MLOps practices for AI product delivery.
I also aim to contribute to AI research by leveraging my expertise in Natural Language Processing, Computer Vision, and Large Language Models to create impactful solutions for real-world challenges.
Work Experience
Machine Learning Engineer
Root Level AIJanuary 2025 - Present
Kathmandu, Nepal
- Document Processing Architecture: Led pipeline redesign replacing legacy OCR with agent-based document processing system, and improved data extraction accuracy and throughput.
- RAG System Optimization: Deployed agentic RAG workflow with hybrid search (dense+sparse vectors) and cross-encoder reranking, increasing retriever recall by 20% and reducing token costs by 15%.
- ML Infrastructure: Architected and scaled end-to-end ML solutions using FastAPI, LlamaIndex for document retrieval pipelines, and Qdrant for vector operations. Orchestrated Kubernetes clusters (EC2/ECR) with Docker containers, implemented serverless data workflows via Lambda, and managed data lifecycle using S3 for storage, Redis for real-time caching, and CloudWatch for performance monitoring.
Machine Learning Engineer
DoriITApril 2024 - January 2025
Kathmandu, Nepal
- LLM Integration: Created RAG assistants using OpenAI and Gemini APIs.
- MLOps: Established CI/CD pipelines using GitHub Actions, Docker containerization, and FastAPI deployment.
- Text Analytics Pipeline: Developed sentiment analysis and NER systems using Hugging Face transformers.
- Team Development: Mentored 2 interns through the complete NLP project lifecycle, from dataset creation to BERT fine-tuning and evaluation.
AI Fellow
FusemachinesJanuary 2023 - August 2023
Kathmandu, Nepal
- Completed intensive ML/DL curriculum, focusing on techniques like regression models, neural networks, and transformer-based architectures such as BERT.
- Built sentiment-analysis and text-classification models
- Applied image processing techniques, including detection and segmentation.
QA Trainee
Bajra TechnologiesSeptember 2022 - December 2022
Kathmandu, Nepal
- Developed and executed test cases for web applications, identifying and reporting over 50 bugs.
- Automated end-to-end testing using Cypress, reducing manual testing efforts by 15%.
- Conducted API and load testing using Postman and JMeter.
Education
Pulchowk Campus, IOE, Tribhuvan University
Lalitpur, Kathmandu
Bachelors in Computer Engineering
November 2019 - April 2024
Publications
C = Conference, J = Journal, S = In Submission, T = Thesis
BERT-Based Nepali Grammatical Error Detection and Correction Leveraging a New Corpus
Sumit Aryal, et al. (2024). Presented at IEEE INSPECT-2024, ABV-IIITM, Gwalior, India, December 07-08, 2024.
Nepali Grammar Correction
Sumit Aryal, et al. (2024). Undergraduate Thesis, Pulchowk Engineering Campus, Institute of Engineering, Tribhuvan University.
Projects
Nepali Grammatical Error Correction
As a part of major project thesis of my bachelor's degree, developed a pipeline utilizing BERT. Curated a large parallel corpus for Nepali Grammar Correction task. Developed a system which takes Nepali text as input, checks its grammar, and suggests corrections if necessary.
Chat with Multiple PDFs
Project implemented using Langchain and Huggingface Transformers for RAG framework. Users upload PDFs and ask questions. The PDF is segmented, converted to vectors, and stored. Questions are converted to embeddings, and a semantic search yields ranked results.
HTML Parser using LLM
Developed an API for extracting e-commerce attributes from HTML content. Uses `meta-llama/Meta-Llama-3-8B-Instruct` from Hugging Face's Inference API. Extracts attributes like name, price, description, and images from HTML.
Travel Recommendation System
Developed a travel recommendation web application that generates personalized itineraries for travelers to Nepal based on their preferences and budget using React, Django, Python, and Flask. Implemented collaborative filtering to enhance recommendations.
ML and DL Repository
Developed and maintained a repository of machine learning and deep learning algorithms, including CNN, Linear and Logistic Regression, Decision Trees, and advanced applications like Image Segmentation and Reconstruction.
8 Puzzle Visualizer
Implemented and visualized different algorithms, such as A*, BFS, DFS, IDDFS, and Greedy to solve the 8-puzzle problem using Python and Tkinter.
Bachiyo Game
Mario-like platformer game with various levels and sound effects created using C++ and SFML.
Image Compression
Compressed images using Huffman Tree Algorithm in C++.
Stadium Modeling
Modeled a stadium using Python, Pygame, and Blender.
Skills
Programming Languages
Data Science & Machine Learning
Applied ML Frameworks
Web Technologies
Cloud & DevOps
Specialized Areas
Mathematical & Statistical Tools
Soft Skills
Honors & Awards
Best Project Award
Pulchowk Campus, IOE
December 2024
Recognized for excellence in developing "Nepali Grammatical Error Detection and Correction System", an innovative NLP system that addresses the significant challenge of automated grammar correction in the Nepali language using BERT-based models and a novel corpus.
Certifications
Professional Memberships
Nepal Engineering Council
Professional Engineering Body
October 2024 - Present
Active member of Nepal's premier engineering professional body, committed to maintaining high standards of engineering practice and professional development.
Resume / CV
Having trouble viewing the resume? Download it or open it in a new tab using the buttons below.
Contact Me
Let's Connect
Feel free to reach out for collaborations, opportunities or just to say hello!
Send a Message
I'll get back to you as soon as possible.