Skip to content
View SharadChoudhury's full-sized avatar

Block or report SharadChoudhury

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
SharadChoudhury/README.md

๐Ÿ’ซ About Me:

I am a Senior Data Analytics Engineer at ANZ specializing in building robust data transformation pipelines and scalable ETL processes.
My expertise lies in the AWS and Hadoop ecosystems, where I leverage PySpark, SQL, Python, Hive to drive data-driven optimized solutions.

Core Tech: Python, SQL, PySpark, Kafka, Airflow and AWS (S3, Redshift, EMR).

I am currently exploring on:
GenAI & ML: Exploring RAG, LangChain, and Prompt Engineering to enhance data workflows.

๐Ÿ’ป Tech Stack:

Apache Spark Python MySQL Apache Hadoop Apache Airflow AWS Apache Kafka InfluxDB Pandas Git Docker

๐Ÿ“Š GitHub Stats:



Feel free to reach out via https://www.linkedin.com/in/sharadchoudhury27/.

Let's make data-driven decisions together and unlock new possibilities!

Pinned Loading

  1. ATM-Transactions-Batch-ETL ATM-Transactions-Batch-ETL Public

    Batch ETL pipeline using Apache Sqoop, Apache PySpark, Amazon S3 and Amazon RedShift to analyze ATM withdrawl behaviours to optimally manage the refill frequency.

    Jupyter Notebook

  2. Credit-Card-Fraud-Detection Credit-Card-Fraud-Detection Public

    Credit card fraud detection of real time transaction data

    Python

  3. Hadoop-Project Hadoop-Project Public

    NYC Taxi data analysis using Mapreduce

    Python

  4. Lead-Scoring-CaseStudy Lead-Scoring-CaseStudy Public

    Scoring Leads for an Ed-Tech company to enable higher leads conversion

    Jupyter Notebook

  5. Bike-Sharing-prediction Bike-Sharing-prediction Public

    Multiple Regression model building with Sklearn and statsmodels and analysis of relevant predictors using P-values and VIF

    Jupyter Notebook

  6. Credit-EDA-Case-Study Credit-EDA-Case-Study Public

    Extensive EDA Case study of Loan applications of customers based on various factors and identifying the trends in Defaulters and Non Defaulters

    Jupyter Notebook