Projects
NLP · low-resource language
Sentiment Analysis of Nepali COVID-19 Tweets
Transformer-based sentiment classification on 35K+ Nepali tweets with PyTorch and Hugging Face Transformers (0.73 F1). Includes a preprocessing pipeline for Nepali text and benchmarks for pandemic-related social media discourse.
NLP · named entity recognition
EverestNER & DanfeNER
Transformer-based NER systems for Nepali (F1 0.85 / 0.80). Two benchmark datasets with annotation guidelines: EverestNER (24,587 entities, news) and DanfeNER (4,966 entities, tweets), the first large-scale Nepali NER resources.
Classification · text
SMS Spam Detection
Binary spam vs. ham classification on 5,574 English SMS messages with Random Forest and Naive Bayes, using NLTK-based features and CountVectorizer / TF-IDF representations.
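The bag-of-words plus Naive Bayes idea can be sketched in a few lines. This is a minimal illustration, not the project's code: it uses only the Python standard library as a stand-in for CountVectorizer and a library Naive Bayes classifier, and the `tokenize`, `train`, and `predict` helpers and the four toy messages are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data; the real project used 5,574 labeled SMS messages.
TOY_MESSAGES = [
    ("win a free prize now", "spam"),
    ("free cash claim now", "spam"),
    ("are we meeting for lunch", "ham"),
    ("see you at lunch tomorrow", "ham"),
]

def tokenize(text):
    """Lowercase and split on whitespace (a crude stand-in for NLTK tokenization)."""
    return text.lower().split()

def train(messages):
    """Count words per label, documents per label, and collect the vocabulary."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    vocab = set()
    for text, label in messages:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        label_counts[label] += 1
        vocab.update(tokens)
    return word_counts, label_counts, vocab

def predict(model, text):
    """Return the label with the highest log-posterior under multinomial Naive Bayes."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    tokens = [t for t in tokenize(text) if t in vocab]  # ignore unseen words
    best_label, best_score = None, float("-inf")
    for label, doc_count in label_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(doc_count / total_docs)  # log prior
        for token in tokens:
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log((word_counts[label][token] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(TOY_MESSAGES)
print(predict(model, "claim your free prize"))
print(predict(model, "lunch tomorrow"))
```

Raw word counts are used here for brevity; swapping in TF-IDF weights would change the feature values but not the overall train-then-score structure.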
Full-stack · web
UofM Art Marketplace (E-commerce)
Ruby on Rails, MySQL, and a responsive HTML/CSS/Bootstrap front end. Authentication, catalog, and cart for students and faculty buying and selling artwork.
Intelligent tutoring · assessment
Student Answer Assessment in Tutorial Dialogue
Ensemble models (Decision Trees, Random Forest, SVR) with NLP features for short-answer grading in intelligent tutoring systems (ITS). Evaluated on hundreds of dialogue responses with ~80% accuracy.
Search · information retrieval
UofM Local Search Engine
Campus-focused search engine that applies NLP to student queries, then ranks and retrieves links to streamline access to University of Memphis resources.