Hello, I am

Utsabi
Dangol

Data Engineer & ML Researcher

4+ years building production ELT pipelines with Python, Airflow & Snowflake. Pursuing MS Computer Science at USD (May 2026), researching LLMs, RAG and uncertainty-aware Reinforcement Learning.

4+

Years Experience

4.0

GPA

40+

Customer Interviews

What I work with

Technical
Skills

Data & ETL
Python
SQL
Airflow
dbt
Data Platforms
Snowflake
BigQuery
PostgreSQL
MongoDB
Cloud
GCP
AWS
Docker
Analytics & BI
Tableau
Looker
AI & ML
PyTorch
TensorFlow
Web Dev
React
Node.js

Where I have worked

Experience

Full-time

Data Engineer

LIS Nepal Pvt. Ltd.

Sep 2021 – Aug 2024Lalitpur, Nepal
  • Led offshore team of 9 delivering Python/Airflow/Snowflake ELT pipelines, improving runtimes by 40% and costs by 30%
  • Integrated 10+ data sources into Snowflake and GCP, enabling cross-channel attribution and ML-ready analytics
  • Architected data marts and automated quality checks, reducing pipeline failures by 35% supporting 50+ dashboards
  • Implemented RBAC and PII masking to secure 100% of sensitive customer fields, ensuring data privacy compliance
  • Mentored 25+ junior engineers on best practices, improving code quality and delivery velocity
Full-time

Associate Software Engineer

Impacters

Apr 2021 – Mar 2022Kathmandu, Nepal
  • Built edge AI system (Project Saathi) using CNNs and transfer learning for real-time crop health monitoring
  • Deployed ML models on solar-powered edge devices with offline capability for resource-constrained hardware
Full-time

Associate Software Engineer

Asterdio Inc.

Jul 2020 – Mar 2021Kathmandu, Nepal
  • Implemented customer segmentation using clustering algorithms to identify patterns and inform product strategy
  • Optimized Node.js/MongoDB data pipelines and built analytics dashboards, improving query performance by 40%

Leadership

Leadership

Entrepreneurial Lead

NSF I-Corps Hub Great Plains

Jun 2025 – Jul 2025South Dakota, USA
  • Conducted 40 customer discovery interviews to refine AI product concepts including legal research assistant (LLMs) and music transcription tool
  • Applied lean startup methodology to validate problem-solution fit across both cohorts

Selected work

Projects

01

Policy Shaping with Uncertainty-Aware LLM for Multi-Task RL

Aug – Dec 2025

BERT-based LLM guidance combined with PPO using MC Dropout for uncertainty estimation. Achieved 99.2% success rate and 2009.96 reward AUC, outperforming Q-learning and DQN baselines.

BERTPyTorchPPOPython
02

LLM-Based PDF Question Answering using RAG

Feb 2026

RAG pipeline for interactive PDF QA using recursive text chunking, Watsonx embeddings, and ChromaDB. Gradio interface for document upload and similarity-based retrieval.

LangChainIBM WatsonxChromaDBGradio
03

E-Commerce Analytics: Product Success & Price Prediction

Aug – Dec 2025

82.98% classification accuracy for product ratings using Amazon Reviews metadata with SHAP interpretability. RAG-based price prediction with FAISS & BM25 hybrid retrieval.

PythonFAISSBM25SHAP
04

Detecting Bias in Social Networks & News

Jan 2026

Transformer pipeline for media bias detection — fine-tuned BERT, RoBERTa, ELECTRA, GPT-2 on MBIB dataset. ELECTRA achieved best F1 of 0.769.

BERTELECTRARoBERTaGPT-2
05

Fault-Tolerant PostgreSQL with Raft Consensus

Nov 2024

Priority-based Raft consensus algorithm reducing leader election latency in distributed PostgreSQL clusters across 3–7 node Azure deployments.

PythonPostgreSQLAzureRaft
06

Snowflake / Airflow ELT Pipelines

LIS Nepal · 2021–2024

Production ELT pipelines integrating 10+ data sources. Improved runtimes 40%, cut costs 30%, and powered 50+ dashboards with automated quality checks.

PythonAirflowSnowflakeGCP