Projects

My work spans spatio-temporal machine learning, network science, and computational social systems. I build models that learn from data distributed across space, time, and network structures, and apply them to real-world systems such as cities, elections, mobility networks, and scientific collaboration. Selected projects below highlight research contributions, open-source implementations, and production-ready systems built for scientific and policy-driven applications.

Research Projects

BrainLab OS: Learning from Minimal Data under Distribution Shift

BrainLab OS: Learning from Minimal Data under Distribution Shift

Master's thesis developing theoretical and practical frameworks for robust machine learning under distribution shift and limited supervision. Building neuroanatomically-inspired cognitive architecture that learns efficiently from minimal labeled data across multiple domains.

Key Highlights

  • Developing theoretical foundations for generalization under distribution shift
  • Implementing neuroscience-inspired learning mechanisms for data-efficient training
  • Investigating robustness and adaptation in low-data regimes

Technologies

PythonPyTorchNumPyNeurosciencePAC LearningRobustness Theory
View Project →
Neural Time Capsule: Urban Growth Prediction

Neural Time Capsule: Urban Growth Prediction

ConvLSTM-based deep learning system for multi-decadal urban expansion forecasting using satellite imagery. Dual-channel architecture processes built-up surface density and road network data to predict city growth patterns 25+ years into the future.

Key Highlights

  • Achieves 0.000218 MSE with 67% improvement over U-Net baseline
  • Trained on 2,313 tiles spanning 1975-2000 across diverse U.S. regions
  • Lightweight 470K parameter model trains in 6 hours on CPU

Technologies

TensorFlowKerasNumPyMatplotlibGDALScikit-learn
View Project →
Disaster Risk Monitoring Using Satellite Imagery

Disaster Risk Monitoring Using Satellite Imagery

End-to-end machine learning system for disaster risk assessment using satellite imagery. Jupyter notebook pipeline covering data preprocessing, efficient model training, and deployment for inference with UNOSAT flood event case study.

Key Highlights

  • Automated satellite imagery preprocessing pipeline
  • Efficient model training for large-scale disaster monitoring
  • Real-world UNOSAT flood event case study validation

Technologies

PythonTensorFlowJupyterSatellite ImageryGDAL
View Project →
Bihar 2025: Real-Time Election Forecasting

Bihar 2025: Real-Time Election Forecasting

Production-grade election forecasting system combining Bayesian hierarchical models, ensemble learners, and transformers for constituency-level predictions. Ingests live polls, social media sentiment, and news feeds with 5-minute update latency using streaming architecture.

Key Highlights

  • Multi-task ensemble for seat-level win probability and vote margins
  • Transformer-based NLP for real-time social media sentiment analysis
  • Streaming data pipeline with sub-5 minute prediction updates

Technologies

PyTorchXGBoostKafkaRoBERTaNGBoostBayesian Models

Selected Engineering Projects

COVID-19 X-Ray Classification with HPC

COVID-19 X-Ray Classification with HPC

High-performance computing implementation of CNN-based COVID-19 detection from chest X-rays. Mac-optimized training pipeline leveraging Accelerate framework for efficient model training on Apple Silicon.

Key Highlights

  • CNN architecture for COVID-19 detection from chest X-rays
  • HPC-optimized training pipeline for Mac M-series chips
  • Automated preprocessing and data augmentation

Technologies

PyTorchAccelerateNumPyscikit-learnMatplotlib
View Project →
CLAD-PV: Cyber-Physical Security for Solar Systems

CLAD-PV: Cyber-Physical Security for Solar Systems

Physics-guided intrusion detection system for solar inverter networks using SunSpec/Modbus protocols. Mac-native implementation combines power system physics knowledge with anomaly detection to identify cyberattacks on distributed solar infrastructure.

Key Highlights

  • Physics-guided IDS for solar inverter security
  • SunSpec/Modbus protocol implementation
  • Detects power injection attacks and data manipulation

Technologies

PythonSunSpecModbuspyModbusNumPyMatplotlib
View Project →
CPS Threat Modeling for Solar Inverters

CPS Threat Modeling for Solar Inverters

Comprehensive threat modeling framework for grid-tied solar inverter networks. STRIDE/DREAD risk analysis with architecture diagrams, gap analysis, and security recommendations for cyber-physical solar systems.

Key Highlights

  • Complete architecture diagram for solar inverter networks
  • STRIDE/DREAD threat matrix and risk assessment
  • Gap analysis with security recommendations

Technologies

Threat ModelingSTRIDEDREADDrawIOSecurity Analysis
Global Product Catalog: Multi-Region Distributed System

Global Product Catalog: Multi-Region Distributed System

Production-grade distributed product catalog using DynamoDB Global Tables for multi-region replication. Terraform-managed infrastructure with comprehensive load testing demonstrating global consistency and sub-50ms latencies.

Key Highlights

  • DynamoDB Global Tables for multi-region active-active replication
  • Infrastructure-as-Code with Terraform
  • k6 load testing with 1000+ concurrent users

Technologies

PythonFastAPIDynamoDBTerraformAWSk6
View Project →
Customer Churn Prediction System

Customer Churn Prediction System

Machine learning system for predicting customer churn in subscription services. Dockerized deployment with comprehensive data preprocessing and multiple model architectures for production-ready churn forecasting.

Key Highlights

  • Ensemble ML models for churn prediction
  • Dockerized deployment pipeline
  • Feature engineering for subscription data

Technologies

PythonScikit-learnpandasDockerXGBoost
View Project →
Aether: AI Therapist Web Application

Aether: AI Therapist Web Application

Emotionally-aware AI therapist web app using Hugging Face Transformers for mental health support. FastAPI backend with transformer models provides empathetic, context-aware conversations for emotional well-being.

Key Highlights

  • Transformer-based conversational AI for mental health
  • Emotionally-aware response generation
  • FastAPI backend with web frontend

Technologies

FastAPIHugging FaceTransformersJavaScriptHTML/CSS
View Project →
Tripsy: AI-Powered Travel Planning

Tripsy: AI-Powered Travel Planning

Intelligent travel planning system with backend API, frontend interface, and infrastructure automation. Features AI-powered itinerary generation, geospatial mapping, and user management.

Key Highlights

  • AI-powered travel itinerary generation
  • Interactive maps with Leaflet integration
  • Multi-component architecture (backend, frontend, infra)

Technologies

PythonReactNext.jsLeafletDockerEtcd
View Project →
Global Airport Dashboard

Global Airport Dashboard

Interactive web-based visualization dashboard for global airport data. D3.js-powered interface with geospatial mapping, statistical analysis, and filtering capabilities for exploring worldwide airport infrastructure.

Key Highlights

  • Interactive geospatial visualization of global airports
  • D3.js-based charts and statistical analysis
  • Real-time filtering and search capabilities

Technologies

JavaScriptD3.jsHTML/CSSGeospatial Data
View Project →
NASA Logs Billing Dashboard

NASA Logs Billing Dashboard

Credit-based billing system that processes NASA HTTP logs as billable events with real-time usage visualization. Dockerized Flask application with Plotly dashboards for usage analytics and invoice generation.

Key Highlights

  • Real-time event ingestion and billing processing
  • Interactive usage dashboard with Plotly visualizations
  • Credit-based billing with date-range queries

Technologies

PythonFlaskPlotlyDockerTailwind CSS
View Project →
LocalMapReduce: Distributed Computing Framework

LocalMapReduce: Distributed Computing Framework

Local MapReduce simulation using Python and LevelDB for distributed data processing. Implements map-reduce paradigm with persistent storage using Plyvel for efficient key-value operations.

Key Highlights

  • MapReduce implementation with Python
  • LevelDB for persistent key-value storage
  • Modular architecture with map and reduce phases

Technologies

PythonLevelDBPlyvelDistributed Systems
View Project →
Haven: AI-Powered Personal Finance Assistant

Haven: AI-Powered Personal Finance Assistant

Privacy-first AI personal finance platform with expense categorization, budget recommendations, and financial insights. Local-first architecture ensures user data never leaves their device while providing intelligent financial guidance.

Key Highlights

  • ML-powered expense categorization with 90%+ accuracy
  • Personalized budget recommendations using spending patterns
  • Local-first architecture for maximum privacy

Technologies

FastAPIReactScikit-learnpandasSQLiteJWT