be-unkind/big-data-ucu-project

Project for Big Data course (UCU)


Authors:

  • Anastasiia Havryliv
  • Yaroslav Romanus

Aim of the project

The aim of this project is to implement a system that ingests the Wikipedia data stream and processes it so that the results can be accessed through REST API endpoints. The project uses Kafka, Cassandra, Spark (for batch processing), Spark Streaming and FastAPI.
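
The overall flow (ingest stream messages, process them, store the results, serve them on request) can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's code: the event fields `domain` and `is_bot`, the sample messages, and all function names are hypothetical. A list stands in for a Kafka topic and a dictionary stands in for a Cassandra table.

```python
import json
from collections import Counter

# Hypothetical sample messages; the real stream schema is not described in this README.
RAW_EVENTS = [
    '{"domain": "en.wikipedia.org", "is_bot": false}',
    '{"domain": "uk.wikipedia.org", "is_bot": true}',
    '{"domain": "en.wikipedia.org", "is_bot": false}',
    'not valid json',  # malformed message that must be skipped
]


def parse_event(raw):
    """Return the decoded event dict, or None if the message is malformed."""
    try:
        event = json.loads(raw)
    except json.JSONDecodeError:
        return None
    # Keep only JSON objects that carry the (assumed) "domain" field.
    if isinstance(event, dict) and "domain" in event:
        return event
    return None


def count_events_by_domain(raw_events):
    """Processing step: count valid events per domain."""
    counts = Counter()
    for raw in raw_events:
        event = parse_event(raw)
        if event is not None:
            counts[event["domain"]] += 1
    return dict(counts)


def query_domain_count(store, domain):
    """Read path: what a REST endpoint handler could return for one domain."""
    return store.get(domain, 0)


# The dictionary plays the role of the stored, precomputed result.
domain_counts = count_events_by_domain(RAW_EVENTS)
```

In the real system each stand-in would be replaced by the corresponding component: a Kafka consumer for the list, Spark jobs for the aggregation, and a Cassandra query behind a FastAPI route for the lookup.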


System Design (Description)

The system design diagram and a detailed description of all components can be found here


Project results

Results from endpoint requests, along with a demonstration picture of the system running in Docker (working containers), can be found here


How to test the system (how to run and shut it down)

To run the system (write in terminal):

docker-compose up

To shut the system down (write in terminal):

docker-compose down
