NanoPrompt/etl-project

Universal Crypto Market ETL Pipeline

A lightweight, automated end-to-end ETL (Extract, Transform, Load) data pipeline that extracts real-time cryptocurrency market data, enriches it with volatility risk metrics, and concurrently distributes the cleaned payload to two destinations: a local file tracked in Git for GitHub, and a GitLab repository via its REST API.

🚀 Architecture Overview

This project implements a Universal Distribution Architecture. Instead of configuring two Git remotes and pushing to both, the core pipeline script itself commits the data to the cloud repository through the hosting platform's authenticated REST API.
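As a rough illustration of committing through an API rather than a Git remote, the sketch below builds a request for GitLab's Commits endpoint (`POST /projects/:id/repository/commits`) using only the standard library. It is not taken from `market_etl_pipeline.py`: the function names, the default commit message, the use of gitlab.com rather than a self-hosted instance, and the `update` action (which presumes the file already exists on the branch) are all assumptions.

```python
import json
import urllib.parse
import urllib.request

GITLAB_API = "https://gitlab.com/api/v4"  # assumption: gitlab.com, not self-hosted


def build_commit_request(project, token, branch, file_path, content,
                         message="Update market data", action="update"):
    """Build (but do not send) a POST to GitLab's Commits API.

    All parameter names here are hypothetical; only the endpoint, the
    PRIVATE-TOKEN header, and the body fields come from GitLab's API.
    """
    # A project path such as "group/repo" must be URL-encoded ("group%2Frepo").
    project_ref = urllib.parse.quote(str(project), safe="")
    body = {
        "branch": branch,
        "commit_message": message,
        "actions": [
            {"action": action, "file_path": file_path, "content": content}
        ],
    }
    return urllib.request.Request(
        url=f"{GITLAB_API}/projects/{project_ref}/repository/commits",
        data=json.dumps(body).encode("utf-8"),
        headers={"PRIVATE-TOKEN": token, "Content-Type": "application/json"},
        method="POST",
    )


def push_commit(request, timeout=30):
    """Send the request and return the parsed JSON describing the new commit."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without network access. In a CI job the token would typically come from a masked variable rather than being written into the script.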

  • Extract: Pulls live market metrics (ticker, closing price, volume, market cap, price change percentage) for the top 50 cryptocurrencies using the TradingView Screener API.
  • Transform: Cleans schemas, normalizes data formats, and adds a custom Risk_Profile column that flags assets with large price fluctuations.
  • Load (Dual Destination):
    • Local Destination: Writes a structured JSON payload to the local directory for traditional Git versioning (GitHub).
    • Cloud Destination: Calls the GitLab Commits API directly to instantly sync and version the live data within the cloud repository.
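The Transform and Local Destination steps above can be sketched in plain Python. This is a minimal illustration, not the project's actual logic: the input field names (`ticker`, `close`, `change_pct`), the 5% threshold, and the `High Volatility` / `Stable` labels are assumptions, since the README does not state how Risk_Profile is computed.

```python
import json
from pathlib import Path

HIGH_VOLATILITY_PCT = 5.0  # assumption: the real cutoff is not stated in the README


def transform(rows, threshold=HIGH_VOLATILITY_PCT):
    """Normalize raw rows and attach a Risk_Profile label (hypothetical schema)."""
    cleaned = []
    for row in rows:
        # Drop records that are missing any field this sketch relies on.
        if any(row.get(key) is None for key in ("ticker", "close", "change_pct")):
            continue
        change = float(row["change_pct"])
        cleaned.append({
            "ticker": str(row["ticker"]).strip().upper(),
            "close": float(row["close"]),
            "change_pct": round(change, 2),
            # Flag on magnitude so sharp drops count as well as sharp gains.
            "Risk_Profile": "High Volatility" if abs(change) >= threshold else "Stable",
        })
    return cleaned


def load_local(records, path="cleaned_crypto_market_data.json"):
    """Write the cleaned records as indented JSON for ordinary Git tracking."""
    target = Path(path)
    target.write_text(json.dumps(records, indent=2), encoding="utf-8")
    return target
```

Calling `load_local(transform(rows))` on a list of raw row dictionaries would produce the JSON file listed in the project structure. The same JSON string could then be handed to the cloud destination, so both targets receive an identical payload.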

🛠️ Project Structure

etl-project/
│
├── .gitlab-ci.yml                  # GitLab CI/CD pipeline automation config
├── cleaned_crypto_market_data.json # The generated/transformed target payload
├── market_etl_pipeline.py          # Main Python ETL script execution logic
└── requirements.txt                # Project dependencies
