MicroServices

Micro-Services Architecture for an ETL data pipeline handling 600 GB+ and 30 million+ data entities.

This repo outlines the architecture of an ETL data pipeline and the microservices architecture that implements it.

The pod specs and diagram of the system are included in kubernetes directory, however the actual python code base, proto schema for Google Protobuf data structures are propietary and not included.

Target System Architecure

The entire system runs on Linux-Fedora24, data replication is implemented using Gluster File System that maintain data bricks and replicas of each brick on each node of the system.

The cluster consists of a single master which host the Kubernetes critical services, and any number of nodes which can be scaled horizontally to decrease processing time.

MicroService Architecture

Implemented using Kubernetes Pods and Services which create a reliable consistent point of contact for pods to interact without hardcoded IP addresses needed.

Replication controllers were not included, although they would have increased resilience of the system by starting pods that went down.

The diagram depicts the ETL architecture, but in short:

Market data was accessed via query of an 3rd party API. The pod running this service did so in a simulated Ubuntu environment running a windows emulator and the 3rd party software IQFEED

Fetch Data pod extracts market data from IQFEED based on query arguments including options, stocks, and dividends. Serializes data into SSDB-FEED (level DB like database) data persists not in the pod but in an attached Gluster Volume and replicated on all hosts.

PostProcessing Pod pulled data from SSDB-FEED and transformed further by retrieving detailed options data, adding libor interest rate, and pairing stock option trades with the nearest underlying stock value. Data is serialized into SSDB-POST (underlying gluster volume) and similarly replicated.

XLServer Pod is a server programmed to respond to queries from user Excel sheets, query SSDB-FEED and SSDB-POST and perform additional transformations and data comparisons.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
bash		bash
docker		docker
kubernetes		kubernetes
proto		proto
python		python
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
publish.sh		publish.sh
system_diagram.jpg		system_diagram.jpg
xl_server.jpg		xl_server.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MicroServices

Target System Architecure

MicroService Architecture

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MicroServices

Target System Architecure

MicroService Architecture

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages