Data-Continuum is a production-grade data pipeline sandbox designed to simulate a high-velocity Smart Logistics environment. It demonstrates the seamless flow of data from diverse sources (SQL and NoSQL) through an orchestration layer into a unified API and Machine Learning service.
- Databases: PostgreSQL (Relational) and MongoDB (NoSQL)
- Orchestration: Apache Airflow
- Microservices: FastAPI for API and ML services
- Observability: Prometheus, Grafana, MLflow
- Docker & Docker Compose
- Python 3.10+ (optional, for local development)
uvfor fast package management
You can spin up the entire Data-Continuum ecosystem using Docker Compose:
docker-compose up --build -d- React UI Dashboard:
http://localhost:5173 - Airflow UI:
http://localhost:8080(admin/admin) - Unified API (Swagger):
http://localhost:8000/docs - MLflow Tracking Server:
http://localhost:5000 - Prometheus:
http://localhost:9090 - Grafana:
http://localhost:3000(admin/admin)
Full documentation is available via MkDocs. To view it locally:
uv pip install -e .[docs]
mkdocs serveFor local development and testing, install the development dependencies:
uv pip install -e .[dev]
pre-commit installUnit tests are written using pytest for Python and vitest for the React UI.
Python Tests:
PYTHONPATH=$PYTHONPATH:. pytest tests/React UI Tests:
cd ui
npm run testThe repository also includes a GitHub Actions workflow (.github/workflows/unit-tests.yml) that automatically runs these tests on every push and pull request to the main branch.