GitHub - be-unkind/big-data-ucu-project

Project for Big Data course (UCU)

Authors:

Anastasiia Havryliv
Yaroslav Romanus

Aim of the project

The aim of this project is to implement a system, which will process Wikipedia stream data and process it for further access to it from REST API endpoints. Several technologies were used in the project, including Kafka, Cassandra, Spark (for batch processing), Spark Streaming and FastAPI.

System Design (Description)

System design diagram and detailed description of all of the components can be found here

Project results

Results from endpoint requests along with demonstation picture of running system in Docker (working containers) can be found here

{ more detailed urls:

How to test the system (how to run and shut it down)

To run the system (write in terminal):

docker-compose up

To shut the system down (write in terminal):

docker-compose down

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
cql_scripts		cql_scripts
rest_api		rest_api
results		results
source_data_reader		source_data_reader
spark_batch		spark_batch
spark_stream		spark_stream
.gitignore		.gitignore
README.md		README.md
System-Description.pdf		System-Description.pdf
docker-compose.yaml		docker-compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project for Big Data course (UCU)

Authors:

Aim of the project

System Design (Description)

Project results

How to test the system (how to run and shut it down)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project for Big Data course (UCU)

Authors:

Aim of the project

System Design (Description)

Project results

How to test the system (how to run and shut it down)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages