Open to opportunities

Krishna Sathvik Mantripragada

Senior Data Engineer

Senior Data Engineer with 6+ years of experience building scalable data infrastructure and analytics solutions across healthcare and enterprise environments. Skilled in Azure, AWS, Python, and the modern data stack (Databricks, Synapse, Power BI). Proven track record of delivering predictive analytics, automated reporting, and actionable insights that improve operational efficiency, forecasting accuracy, and decision-making across large-scale data ecosystems. Passionate about solving real-world problems with data, cloud, and automation, and about exploring the cosmos through astrophotography.

Featured Project

Production-Ready Application

ApplyTrak: From Job Search Struggle to Production App

Turned my own job search frustration into a production-ready application that's helped hundreds of job seekers organize their applications and land offers.

The Problem: Job applications scattered across multiple spreadsheets, no reliable way to keep track of them, and zero visibility into progress

The Solution: Built a comprehensive job tracking app with analytics, goal setting, and cloud sync

The Impact: 6 months of development, hundreds of users, and real stories of job search success

React 19, TypeScript, Supabase, IndexedDB, Vercel

Key Features

Unlimited Application Tracking
Document Uploads (50 MB)
Goal Setting & Progress Tracking
Real-Time Analytics Dashboard
Cloud Sync with Privacy
Offline-First PWA

Skills

Data Engineering & ETL

dbt, Azure Data Factory, Airflow, Data Orchestration, ETL Pipelines, Apache Kafka, Apache Spark

AI & Machine Learning

Predictive Modeling, Forecasting, Anomaly Detection, Azure ML, Deep Learning, LSTM, RAG, LLM, OpenAI API, LangChain, Vector Databases, Prompt Engineering

Cloud & Databases

Azure, AWS, Snowflake, Redshift, BigQuery, PostgreSQL, MySQL, Oracle, Cassandra

DevOps & Automation

Workflow Automation, CI/CD for Data, Data Quality Monitoring, Docker, Azure DevOps

BI & Visualization

Power BI, Tableau, Looker, Streamlit, Chart.js, Recharts

Programming & Analysis

Python, Java, SQL, R, JavaScript, TypeScript, Market Basket Analysis, Association Rule Mining, FP-Growth, Statistical Analysis

Professional Experience

Data Engineer

Walgreens Boots Alliance

Feb 2022 – Present

  • Process 10+ TB of healthcare retail data monthly with sub-second query performance by building scalable pipelines in Azure Databricks, Azure SQL Database, and Azure Data Factory for real-time executive dashboards

  • Reduce query execution costs by 22% while integrating 15+ data sources by architecting optimized ETL workflows in Azure Synapse Analytics across DEV, SIT, UAT, and PROD environments

  • Eliminate 30+ hours of manual work weekly and improve workflow efficiency by 30% through reusable automation frameworks that enable real-time analytics

  • Accelerate deployment cycles by 40% through CI/CD pipeline implementation in Azure DevOps, enhancing KPI tracking and data governance across healthcare analytics platforms

Analytics Engineer

CVS Health

Oct 2020 – Dec 2021

  • Designed and deployed ETL workflows processing 7+ million sales and financial records, increasing data accuracy by 16–35% and enhancing reliability of executive reporting and ML models

  • Automated integration workflows between on-premises systems and Oracle Cloud, reducing manual processing by 31% and enabling real-time insights in Power BI and Tableau dashboards across multiple business units

  • Re-engineered legacy queries and migrated business logic, improving scalability, performance, and data consistency across 8+ enterprise databases

  • Built Azure ML forecasting models leveraging 5+ years of historical trends, increasing budget and inventory forecast accuracy by 16% and supporting enterprise-wide resource allocation decisions

Data Science Intern

McKesson Corporation

Mar 2020 – Sep 2020

  • Developed scripts to clean, preprocess, and analyze 2+ million prescription and sales records, improving data readiness by 22% for downstream analytics, forecasting models, and business intelligence initiatives

  • Created and tested Azure ML models to predict prescription demand, achieving a 14% improvement in forecast accuracy over baseline methods and supporting supply chain and inventory management teams

  • Designed and deployed Tableau dashboards visualizing patient behavior and prescription trends, enabling targeted decision-making for pilot business units and improving stakeholder engagement with data insights

  • Partnered with the data engineering team to integrate model outputs into ETL workflows, establishing automation foundations later scaled at CVS to support enterprise-wide forecasting and analytics pipelines

Software Developer

Inditek Pioneer Solutions

Jun 2018 – Dec 2019

  • Built scalable backend APIs using Django, integrating 10+ user interfaces with optimized database queries processing 100K+ daily transactions, improving overall system performance by 20% and reducing downtime by 15% through critical production fixes

  • Developed a content aggregation and tracking system with advanced database optimization and caching, accelerating data retrieval speed by 71% and enhancing platform responsiveness for 1,000+ users

Projects

React, Firebase, Leaflet, Cloud Functions, Push, SEO

National Parks Explorer

Explore all 63 U.S. National Parks with map integration, travel tips, and community reviews.

Python, Kafka, Spark, Redis, MLflow, Docker

Real-time Fraud Detection System

ML-powered fraud detection pipeline processing millions of transactions with sub-second latency.

Python, pandas, SQLite, Data Analysis, ETL, Visualization

Finance Tracker Pipeline

Process, transform, and analyze expense data using Python and pandas.

Jupyter, scikit-learn, Keras, LSTM, Machine Learning, Deep Learning

Stock Price Prediction

Use ML and deep learning to predict stock market trends.

Python, pandas, Streamlit, mlxtend, FP-Growth, Association Rules

Market Basket Analysis

Analyze purchasing patterns with association rule mining and FP-Growth.

Kafka, Spark, Cassandra, Streamlit, Python, Docker

Vehicle Telemetry Pipeline

Kafka and Spark streaming pipeline with Cassandra and Streamlit for real-time anomaly detection.

Education

Master of Science in Computer Science

University of North Texas • Denton, TX • 2021

Focus: Data Science & Analytics, Machine Learning, Big Data Processing

Relevant Coursework: Advanced Data Mining, Machine Learning, Database Systems, Software Engineering

Bachelor of Technology in Information Technology

GITAM University • Visakhapatnam, India • 2019

Focus: Software Development, Database Management, Web Technologies

Relevant Coursework: Data Structures, Algorithms, Database Design, Web Development, Software Engineering

Publications

AI for Electricity Market Design

Book Chapter — Handbook of Smart Energy Systems, Springer (2023)

Published chapter on artificial intelligence applications in electricity market design and optimization.

AI, Energy Systems, Optimization

Data Engineering & Cloud Architecture Articles

Blog Series — Medium • Multiple articles

Regular content on data engineering best practices, cloud architecture, and real-world data solutions.

Data Engineering, Cloud Architecture, Best Practices

Certifications

Get in touch

Prefer email? krishnasathwikm@gmail.com