About

I am a results-driven Data Analyst with 3 years of experience in transforming complex data into strategic assets. My expertise lies in designing and building end-to-end data solutions—from engineering automated ETL pipelines to developing interactive BI dashboards.

My goal is to bridge the gap between technical data operations and strategic business objectives, enabling organizations to make smarter, data-driven decisions. I am passionate about leveraging the modern data stack to solve challenging problems and uncover actionable insights.

Areas of Expertise

  • Data Analysis & BI icon

    Data Analysis & BI

    Developing insightful dashboards in Power BI and performing deep-dive analysis to uncover key business trends.

  • ETL & Data Pipelines icon

    ETL & Data Pipelines

    Building and orchestrating robust, automated data pipelines using Python, SQL, and modern tools like Airflow.

  • Advanced Analytics icon

    Advanced Analytics

    Applying statistical modeling and machine learning techniques to generate predictive insights and forecasts.

  • Cloud & Automation icon

    Cloud & Automation

    Leveraging cloud platforms like GCP and Azure for scalable data storage, processing, and automation.

Tech Stack

PROJECTS

More are coming soon as I am documenting my past projects and building new ones!

Recent Projects

  • Customer Churn Dashboard Screenshot

    ETL Churn Analytics PowerBI

    2024

    An end-to-end analytics project identifying key drivers of customer churn. Features a full ETL process, a predictive ML model, and a comprehensive BI dashboard. Lorem, ipsum dolor sit amet consectetur adipisicing elit. Consequatur perferendis repellendus nesciunt dicta aperiam saepe quia, quidem distinctio odio facilis explicabo corporis, culpa, earum et rem impedit. Repudiandae, illo vero?

    SQL Python Scikit-learn Power BI
  • Unified Investment Portfolio Dashboard Screenshot

    Unified Investment Portfolio Dashboard

    2024

    A fully automated data pipeline and Power BI dashboard to track a diversified investment portfolio (Stocks, MFs, Crypto). Aggregates data via APIs for daily updates.

    Python REST APIs Pandas Power BI Docker
  • Netflix Snowflake DBT Architecture

    Netflix Data Pipeline using DBT, Snowflake, and AWS S3

    2025

    This is an end-to-end cloud-based data engineering and analytics project that demonstrates the modern ELT (Extract-Load-Transform) pipeline using popular cloud tools: Amazon S3 for data storage, Snowflake as the data warehouse, and DBT (Data Build Tool) for data transformation, testing, documentation, and orchestration.

    Python SQL AWS S3 dbt Snowflake Jinja
  • NYC Taxi Databricks DBT Architecture

    NYC Taxi Data Pipeline using DBT, Databricks, and GCS

    2025

    This project demonstrates the design and implementation of a modern, end-to-end data platform on the cloud. It ingests raw NYC Taxi trip data, transforms it using a robust analytics engineering workflow, models it according to the Medallion Architecture, and orchestrates the entire pipeline for production, making it ready for business intelligence and analysis.

    Python SQL GCS dbt Databricks Jinja
  • Coming Soon Illustration

    New Project In Progress...

    Currently documentating and updating more projects while building different end-to-end data pipelines and dashboards with Airflow, dbt, and Spark. Check back soon for updates!

Past Projects

  • Network Intrusion Detection System

    A Binary and Multi-Class Classification Problem solved with the help of many machine learning algorithms. Lorem ipsum dolor sit amet consectetur adipisicing elit. Deserunt porro provident voluptatum cum accusamus nulla quas repellat beatae harum placeat, magnam nostrum voluptate, assumenda ex, explicabo iusto. Quo, magnam quibusdam!

    Python NumPy Pandas Matplotlib Scikit-learn Keras
  • Emotion Detection OpenCV Pytorch ResNet9

    A ResNet9 architecture based model was trained on a Dataset containing 28000+ images spreaded across 7 different classes of emotions. And then the Model was used in a Real Time Application made using Tkinter.

    Python NumPy Pandas Seaborn Scikit-learn PyTorch
  • Health Care Case Study

    Solving series of Business Problems related to Health Care Domain using Descriptive and Predictive Analysis.

    Python NumPy Pandas Matplotlib Scikit-learn XGBoost
  • NLP Analyzing Online Job Postings

    Providing solutions to various Business Problems using Supervised and Unsupervised Learning on Text Data

    Python Matplotlib Scikit-learn imblearn NLTK Keras
  • Thyroid Disease Detection

    Predictive model that estimates a patient’s risk of thyroid dysfunction (hyperthyroidism or hypothyroidism) from clinical and laboratory features to support early detection.

    Python Matplotlib Scikit-learn Keras Flask CSS HTML
  • Text Mining Bank Reviews Complaints Analysis

    Analyzing Bank Reviews using Supervised and Unsupervised Learning with the help of Natural Language Processing.

    Python NumPy Pandas Matplotlib Scikit-learn NLTK
  • Segmentation of Credit Card Customers

    Performing Unsupervised Learning on a Credit Card Data to Segment Customers into different clusters using KMeans Clustering.

    Python NumPy Pandas Matplotlib Scikit-learn
  • Walmart Store Sales Forecasting

    A regression based problem solved through sophisticated machine learning algorithms.

    Python NumPy Pandas Matplotlib Scikit-learn Keras
  • Predicting Credit Card Spend Identifying Key Drivers

    A regression based problem solved via statistical modelling.

    Python Matplotlib Scikit-learn SciPy Statsmodels
  • title

    description

    Python NumPy Pandas Matplotlib Scikit-learn