Hello, I am

Sumit Aryal

Computer Engineer

About Me

I am a Machine Learning Engineer specializing in NLP and LLM, with expertise in architecting production-grade AI systems including RAG workflows, document processing pipelines, and recommendation engines.

I have proven ability to design, containerize, and scale solutions using cloud native stacks (AWS/Kubernetes/Docker). I am passionate about bridging research and operations, from developing novel NLP models for low-resource languages to implementing MLOps practices for AI product delivery.

I also aim to contribute to AI research by leveraging my expertise in Natural Language Processing, Computer Vision, and Large Language Models to create impactful solutions for real-world challenges.

Machine Learning
NLP
Python
Large Language Models
FastAPI
Computer Vision
PyTorch

Work Experience

Machine Learning Engineer

Root Level AI

January 2025 - Present

Kathmandu, Nepal

  • Engineered Conversational AI Platform: Architected and deployed production-ready WebSocket-based chat system. Built an Agentic RAG pipeline using LlamaIndex with hybrid vector search (dense+sparse embeddings) and cross-encoder reranking. Implemented intelligent conversation handoff between AI agents and human operators, reducing customer wait times while maintaining service quality standards.
  • Revolutionized Document Processing Pipeline: Redesigned the document ingestion pipeline, replacing the legacy OCR with an agent-based document processing system, and improved the accuracy and throughput of data extraction.
  • ML Infrastructure: Architected and deployed end-to-end ML solutions using FastAPI for API, LlamaIndex for RAG pipeline, and Qdrant for vector operations. Orchestrated Kubernetes clusters (EC2/ECR) with Docker containers for seamless deployment and management. Implemented serverless data workflows using AWS Lambda for cost-effective processing. Managed the data lifecycle with S3 for storage, Redis for real-time caching, and PostgreSQL for relational data management. Incorporated DynamoDB for high-performance NoSQL storage and utilized CloudWatch for comprehensive performance monitoring.
Python FastAPI LlamaIndex Docker Kubernetes AWS Qdrant PostgreSQL DynamoDB Redis WebSocket

Machine Learning Engineer

DoriIT

April 2024 - January 2025

Kathmandu, Nepal

  • LLM Integration: Created RAG assistants using OpenAI and Gemini APIs.
  • Text Analytics Pipeline: Developed sentiment analysis and NER systems using Hugging Face transformers.
  • Team Development: Mentored 2 interns through the complete NLP project lifecycle, from dataset creation to BERT fine-tuning and evaluation.
Python FastAPI LlamaIndex Docker OpenAI API Gemini API

AI Fellow

Fusemachines

January 2023 - August 2023

Kathmandu, Nepal

  • Completed intensive ML/DL curriculum, focusing on techniques like regression models, neural networks, and transformer-based architectures such as BERT.
  • Built sentiment-analysis and text-classification models
  • Applied image processing techniques, including detection and segmentation.
Python Machine Learning NLP Computer Vision BERT Transformers

QA Trainee

Bajra Technologies

September 2022 - December 2022

Kathmandu, Nepal

  • Developed and executed test cases for web applications, identifying and reporting over 50 bugs.
  • Automated end-to-end testing using Cypress, reducing manual testing efforts by 15%.
  • Conducted API and load testing using Postman and JMeter.
Cypress Postman JMeter QA

Education

Pulchowk Campus, IOE, Tribhuvan University

Lalitpur, Kathmandu

Bachelors in Computer Engineering

November 2019 - April 2024

Publications

C = Conference, J = Journal, S = In Submission, T = Thesis

C.1

BERT-Based Nepali Grammatical Error Detection and Correction Leveraging a New Corpus

Sumit Aryal, et al. (2024). Presented at IEEE INSPECT-2024, ABV-IIITM, Gwalior, India, December 07-08, 2024.

IEEE Conference First Author
View Paper
T.1

Nepali Grammar Correction

Sumit Aryal, et al. (2024). Undergraduate Thesis, Pulchowk Engineering Campus, Institute of Engineering, Tribhuvan University.

Thesis Best Project Award
View Thesis

Selected Open Source Contributions

2025

  • Qdrant Sharding: Enabled distributed Qdrant deployments for LlamaIndex with configurable sharding and replication; write consistency controls in QdrantVectorStore; custom shard-key across create/add/delete/query; cluster-aware fixtures; sync/async tests. PR #19652.
  • Payload field indexes: Implemented payload_indexes creation in both sync and async flows with fixtures and tests. PR #19743.
  • Memory Component: Added missing synchronous wrapper for put_messages in the memory component to align sync and async interfaces. PR #19746.
Qdrant Integration Memory Testing

2025

  • Python Agents SDK example: Ready-to-run example setting up an agent with local gpt-oss models, connecting a filesystem MCP server, defining a custom weather tool, and streaming agent responses. PR #14.
Agents MCP Python

Projects

Nepali Grammatical Error Correction

As part of my bachelor thesis I built a BERT-based Nepali grammar correction (Nepali GEC) pipeline. I curated a large parallel corpus for the Nepali grammatical error correction task and implemented a system that ingests Nepali text, detects grammatical errors, and proposes context-aware corrections.

Python BERT PyTorch NLP HuggingFace Transformers

Chat with Multiple PDFs

Project implemented using Langchain and Huggingface Transformers for RAG framework. Users upload PDFs and ask questions. The PDF is segmented, converted to vectors, and stored. Questions are converted to embeddings, and a semantic search yields ranked results.

Python LangChain RAG HuggingFace

HTML Parser using LLM

Developed an API for extracting e-commerce attributes from HTML content. Uses `meta-llama/Meta-Llama-3-8B-Instruct` from Hugging Face's Inference API. Extracts attributes like name, price, description, and images from HTML.

Python HuggingFace LLMs FastAPI

Travel Recommendation System

Developed a travel recommendation web application that generates personalized itineraries for travelers to Nepal based on their preferences and budget using React, Django, Python, and Flask. Implemented collaborative filtering to enhance recommendations.

React Django Python Flask Pandas

ML and DL Repository

Developed and maintained a repository of machine learning and deep learning algorithms, including CNN, Linear and Logistic Regression, Decision Trees, and advanced applications like Image Segmentation and Reconstruction.

Python PyTorch TensorFlow

8 Puzzle Visualizer

Implemented and visualized different algorithms, such as A*, BFS, DFS, IDDFS, and Greedy to solve the 8-puzzle problem using Python and Tkinter.

Python Tkinter A* Algorithm BFS/DFS

Bachiyo Game

Mario-like platformer game with various levels and sound effects created using C++ and SFML.

C++ SFML Game Dev

Image Compression

Compressed images using Huffman Tree Algorithm in C++.

C++ Huffman Coding Algorithms

Stadium Modeling

Modeled a stadium using Python, Pygame, and Blender.

Python Pygame Blender 3D Modeling

Skills

Core Languages

Python Rust C/C++ Shell Scripting

ML/DL Stack

PyTorch HuggingFace Transformers Scikit-learn TensorFlow Pandas NumPy

Natural Language Processing

LlamaIndex LangChain/Graph Fine-tuning Agentic Workflow Text Analytics

ML Infrastructure & MLOps

Docker Kubernetes (EC2/ECR) AWS (EC2, S3, Lambda, ECR) CI/CD Pipelines

Data Systems

Qdrant PostgreSQL SQLAlchemy DynamoDB Redis

API & Deployment

FastAPI RESTful Design Streamlit Serverless Architectures

Specializations

NLP RAG Low-Resource Language Processing Recommendation Systems Computer Vision

Soft Skills

Communication Problem Solving Teamwork Quick Learning Mentoring

Honors & Awards

Best Project Award

Pulchowk Campus, IOE

December 2024

Recognized for excellence in developing "Nepali Grammatical Error Detection and Correction System", an innovative NLP system that addresses the significant challenge of automated grammar correction in the Nepali language using BERT-based models and a novel corpus.

Department Honor Team Achievement

Certifications

Deep Learning Specialization

DeepLearning.AI

December 2023

View Certificate

Generative AI with Large Language Models

DeepLearning.AI

October 2023

View Certificate

Microdegree in Deep Learning

Fusemachines Inc

August 2023

View Certificate

Coding and Programming

Samsung

May 2023

View Certificate

Microdegree in Machine Learning

Fusemachines Inc

May 2023

View Certificate

Machine Learning Specialization

DeepLearning.AI

November 2022

View Certificate

Professional Memberships

Nepal Engineering Council

Professional Engineering Body

October 2024 - Present

Active member of Nepal's premier engineering professional body, committed to maintaining high standards of engineering practice and professional development.

Resume / CV

Having trouble viewing the resume? Download it or open it in a new tab using the buttons below.

Contact Me

Let's Connect

Feel free to reach out for collaborations, opportunities or just to say hello!

Location

Tokha-7, Kathmandu, Nepal

Social

Available for opportunities

Send a Message

I'll get back to you as soon as possible.