TonyQ2k3/reddit-sentiment-analysis

Sentiment analysis in brand monitoring using Apache Spark and Kafka

Built with: Docker, Kubernetes, Apache Spark, Apache Kafka, Selenium

👨‍💻 About

This repository contains the code and Kubernetes manifests for deploying a brand-monitoring sentiment analysis pipeline built on Apache Kafka and Apache Spark.

📑 Workflow diagram

[Figure: workflow diagram]

  • Dashboard Application: The app that users interact with
  • Sentiment Analyzer: Handles on-demand sentiment analysis of products
  • Database: Stores and caches data
  • Monitoring (WIP): Monitors cluster performance

📁 Repository info

The repo structure is as follows:

  • /kubernetes: Contains the YAML manifests for deploying resources on Kubernetes (K8s). These include:
    • crawler-server
    • kafka
    • spark
    • redis
    • dashboard
  • /spark: Contains the code for the tasks submitted to Apache Spark. See the folder's README for more information.

Deploy cluster: first point kubectl at the EKS cluster by updating the local kubeconfig:

aws eks update-kubeconfig --name devops-eks-cluster --region us-east-1
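
Once kubectl points at the cluster, the manifests under /kubernetes could be applied one component at a time. The following is a minimal sketch, not the repository's documented procedure. It assumes each component listed above has its own subfolder under kubernetes/ (for example kubernetes/redis), and the deployment order is a guess. The helper names `run` and `deploy_all` and the `APPLY` variable are hypothetical. By default the script only prints the commands it would run.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Components listed under /kubernetes in this README.
# ASSUMPTION: the order (data stores first, apps last) is a guess.
COMPONENTS="redis kafka spark crawler-server dashboard"

# Run a command only when APPLY=1 is set; otherwise print it (dry run).
run() {
  if [ "${APPLY:-0}" = "1" ]; then
    "$@"
  else
    echo "$@"
  fi
}

# Apply every manifest found under kubernetes/<component>.
# ASSUMPTION: each component is a folder; -R makes kubectl recurse into it.
deploy_all() {
  for component in $COMPONENTS; do
    run kubectl apply -R -f "kubernetes/$component"
  done
}

deploy_all
```

Running the script as-is prints the five `kubectl apply` commands for review. Setting `APPLY=1` in the environment executes them against the current kubeconfig context, which `kubectl config current-context` can confirm beforehand.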
