You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Requirements
java 17 python 3.8.10
ps: if you have higher version of either things pyspark might probably not get installed corrected and even if it does it will not work correctly :)
creating user and granting permission inside the psql shell
-- Create a database
CREATE DATABASE tweedbt;
-- Create a user with a password
CREATE USER <username> WITH PASSWORD '<password>'; -- these values should go into your .env
-- Grant privileges
GRANT ALL PRIVILEGES ON DATABASE mydb TO <username>;
applying the schema
cd DB/
psql -U <username> -d tweedbt -f schema.sql
accessing psql
psql -U <username> -d tweedbt
\l to list all dbs
\d to list all relations
How to run Pyspark, Kafka integration
Make sure that each time the old data is flushed
bash reset-kafka.sh
Run each of the following commands in seperate terminals and in same order