M.S., Data Science
University of North Texas
Data Science · AI/ML · Data Engineering
Data Engineer | AI/ML Enthusiast | Building Agentic AI Systems
Data Science graduate with hands-on experience building scalable data pipelines, ETL workflows, and AI/ML systems. Skilled in Python, SQL, Spark, Airflow, and cloud platforms (GCP, Azure, AWS)—turning raw multi-source data into actionable insights and shipping solutions cross-functionally.
I am a data-focused engineer passionate about building intelligent systems that combine rigorous data analysis with modern AI. I thrive on real-world, messy datasets and on designing end-to-end pipelines that turn raw inputs into decisions people can trust.
I care about clarity, reproducibility, and collaboration—whether that means solid data engineering, thoughtful modeling, or agentic workflows that make analysis faster and more reliable.
University of North Texas
Sree Vidyanikethan Engineering College
Spring Boot, Node.js microservices, React, and cloud deployment with CI/CD.
Personal project
Automates end-to-end data analysis from raw datasets to insights using AI-driven reasoning.
Tech stack: Python · Pandas · NumPy · Plotly · Streamlit · Groq (LLaMA 3)
Research project · Advisor: Dr. Clifford Whitworth, University of North Texas · Mar 2026 – May 2026
Production-grade Retrieval-Augmented Generation system with modular, role-based architecture for grounded interview workflows.
Tools used: Python · LangChain · LangGraph · ChromaDB · Groq · LLMs · Streamlit · vector databases
Personal project · Sep 2025 – Dec 2025
Time-series forecasting on 1M+ records with 34 features using predictive modeling for smart-grid energy use.
Tools used: Python · Pandas · scikit-learn · XGBoost · LightGBM · time-series forecasting · feature engineering
Personal project · Aug 2025 – Dec 2025
ML pipeline on the NYC TLC dataset to predict fares from trip-level and engineered distance and temporal features.
Tools used: Python · Pandas · scikit-learn · Streamlit · feature engineering
Personal project · May 2025 – Jul 2025
End-to-end analytics platform ingesting data from APIs (e.g., World Bank, Google Trends) and visualizing real-time metrics across 150+ countries.
Tools used: Python · SQL · Pandas · PostgreSQL · Streamlit · Airflow · Tableau · Google Cloud Scheduler
Personal project · Feb 2025 – Apr 2025
Reinforcement-style game agent that learns to play 2048 from experience, using reward feedback to improve policy over time.
Tools used: Python · TensorFlow · Keras · Pandas · NumPy
Advisor: Dr. Sahara Ali, College of Information, University of North Texas · Oct 2024 – Dec 2024
Relational database for an event-management catering workflow—food supply chain, events, staff, and payroll—with less redundancy and clearer operations.
Tools used: MySQL · Python
Advisor: Prof. Narendra Kumar Rao, Sree Vidyanikethan Engineering College · Dec 2022 – Apr 2023
Hybrid recommender combining collaborative filtering with deep learning for personalized product suggestions on simulated e-commerce data.
Tools used: Python · Pandas · NumPy · TensorFlow · Keras · scikit-learn
I'm open to Data Engineering, Data Science, and AI/ML roles, with a focus on building scalable data pipelines and AI agents.
Or reach me directly