GitHub - shashuat/deception-detection

Install Instructions

Download the data

Download and unzip DOLOS.zip
Download and unzip data.zip to avoid recomputing all of the data processing (~7GB of data)

You can find the data on this drive.

You can find the demo video on this drive.

Setup python environment

Install the requirements

pip install -r requirements.txt

Create config.py if needed to indicate where to find DOLOS (DOLOS_PATH) and data (DATA_PATH)
Run setup.py to check the setup
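As an illustration of the config.py step above, a minimal sketch might look like the following. Only the names DOLOS_PATH and DATA_PATH come from this README; the default locations are taken from the example paths in the download command further down, and the environment-variable override and the `missing_paths` helper are assumptions added for illustration, not part of the repository.

```python
# config.py: hypothetical sketch; only DOLOS_PATH and DATA_PATH come from the README.
import os
from pathlib import Path

# Assumed defaults, based on the example paths used in the download command.
# The environment-variable override is an illustrative convenience.
DOLOS_PATH = Path(os.environ.get("DOLOS_PATH", "/Data/dec/DOLOS"))
DATA_PATH = Path(os.environ.get("DATA_PATH", "/Data/dec/data"))


def missing_paths():
    """Return the names of configured paths that do not exist on disk."""
    paths = {"DOLOS_PATH": DOLOS_PATH, "DATA_PATH": DATA_PATH}
    return [name for name, path in paths.items() if not path.is_dir()]
```

Calling `missing_paths()` before training gives a quick hint about which location still needs to be downloaded or corrected.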

Dataset Preprocessing

Run YT_video_downloader2.py to get video chunks in code/data_preprocess/data/downloaded

Download Video Chunks

Run our_yt_video_downloader.py to get video chunks

python YT_video_downloader2.py /Data/dec/data/downloaded /Data/dec/DOLOS/dolos_timestamps.csv --cookies /Data/dec/code/data_preprocess/utils/youtube.txt

The downloaded files are stored in /Data/dec/data/downloaded. The --cookies argument supplies a cookie file so that requests made through the yt_dlp package are not treated as coming from a bot.
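For readers who want to see how a cookie file reaches yt_dlp when it is used as a library, the sketch below builds an options dictionary for `yt_dlp.YoutubeDL`. It is not the repository's downloader: the function name `build_ydl_opts` and the output template are assumptions, while `outtmpl`, `quiet` and `cookiefile` are standard `YoutubeDL` options.

```python
from pathlib import Path


def build_ydl_opts(out_dir, cookies=None):
    """Build an options dict for yt_dlp.YoutubeDL (a sketch, not the repo's code)."""
    opts = {
        # Save each download as <out_dir>/<video id>.<extension>.
        "outtmpl": str(Path(out_dir) / "%(id)s.%(ext)s"),
        "quiet": True,
    }
    if cookies is not None:
        # Netscape-format cookie file, the library counterpart of --cookies.
        opts["cookiefile"] = str(cookies)
    return opts
```

With yt_dlp installed, the dictionary would be used as `yt_dlp.YoutubeDL(opts).download([url])`.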

Extract RGB frames
Extract Face Frames

After preprocessing, the data directory should have the following structure:

/data/
├── audio_files/          # .wav audio files 
├── face_frames/          # Extracted face frames as JPG files
│   ├── clip_name1/
│   │   ├── frame_0001.jpg
│   │   ├── frame_0002.jpg
│   │   └── ...
│   └── ...
└── rgb_frames/           # Full video frames (optional)
    ├── clip_name1/
    └── ...

/DOLOS/protocols/
├── train_fold1.csv
├── test_fold1.csv
├── train_fold2.csv
└── ...
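A quick way to sanity-check this layout is to count the extracted face frames per clip. The sketch below assumes the `face_frames/<clip_name>/*.jpg` layout shown above; the function names are hypothetical and do not exist in the repository.

```python
from pathlib import Path


def count_face_frames(data_path):
    """Map each clip directory under face_frames/ to its number of JPG frames."""
    face_root = Path(data_path) / "face_frames"
    return {
        clip.name: len(list(clip.glob("*.jpg")))
        for clip in sorted(face_root.iterdir())
        if clip.is_dir()
    }


def clips_without_frames(data_path):
    """Return clip names whose face_frames directory holds no JPG files."""
    return [name for name, n in count_face_frames(data_path).items() if n == 0]
```

Clips reported by `clips_without_frames` are likely candidates for the samples that the dataset validation later flags as problematic.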

Code directory

/code
├── data_preprocess/  # downloading data, frame
├── dataloader/       # contains the torch dataset class
├── models/           # torch model
├── train_test.py     # current training script
├── ...

wandb config

Create a .env file containing WANDB_TOKEN=e58e... (your own token). The login itself is handled in the init.py file.
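The sketch below shows one way a token could be read from a .env file and passed to wandb. It is an assumption about the mechanism, not a copy of the repository's init.py: the helper names are hypothetical, and the parser only handles simple KEY=VALUE lines. `wandb.login(key=...)` is the standard wandb call.

```python
from pathlib import Path


def read_env_value(env_path, key):
    """Return the value of `key` from a KEY=VALUE file, or None if absent."""
    for raw in Path(env_path).read_text().splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        if name.strip() == key:
            return value.strip().strip('"').strip("'")
    return None


def login_to_wandb(env_path=".env"):
    """Log in to wandb with the token from the .env file (requires wandb)."""
    import wandb  # imported lazily so the parser works without wandb installed

    token = read_env_value(env_path, "WANDB_TOKEN")
    if token is None:
        raise RuntimeError("WANDB_TOKEN not found in " + str(env_path))
    return wandb.login(key=token)
```

Keeping the token in .env, and the .env file out of version control, avoids committing credentials alongside the code.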

Running the model

Configure the model in train_test.py by modifying the config dictionary if needed
Run this command from ~/code to train the model

python -m train_test

Results

Trained models and logs will be saved in the logs directory

Current Warnings

... and 136 more issues
warnings.warn(f" - ... and {len(problem_files) - 5} more issues")
Dataset validated: 407 valid samples out of 548 total.
Training samples: 857, Test samples: 407
Model created: DOLOS_fusion_Encoders_4_Adapter_True_type_efficient_conv_fusion_cross2
Starting training for 20 epochs

Some terminal outputs (just for reference)

Epoch 3/20
Batch 0, Loss: 0.69411
Batch 10, Loss: 0.71668
Batch 20, Loss: 0.66646
Batch 30, Loss: 0.70222
Batch 40, Loss: 0.69655
Batch 50, Loss: 0.68353
Training completed in 86.14 seconds
Epoch 3 Results:
Train - Loss: 0.68658, Acc: 0.55309, F1: 0.66550, AUC: 0.52947
Valid - Loss: 0.67329, Acc: 0.59214, F1: 0.67063, AUC: 0.57660
New best model saved to logs/best_model_train_fold3_test_fold3.pt

Epoch 4/20
Batch 0, Loss: 0.67514
Batch 10, Loss: 0.66677
Batch 20, Loss: 0.63368
Batch 30, Loss: 0.75176
Batch 40, Loss: 0.61897
Batch 50, Loss: 0.63325
Training completed in 87.09 seconds
Epoch 4 Results:
Train - Loss: 0.68696, Acc: 0.57060, F1: 0.63780, AUC: 0.55928
Valid - Loss: 0.67313, Acc: 0.56265, F1: 0.66160, AUC: 0.54251

Epoch 5/20
Batch 0, Loss: 0.70214
Batch 10, Loss: 0.76371

...

Training completed in 83.71 seconds
Epoch 20 Results:
Train - Loss: 0.63875, Acc: 0.63827, F1: 0.64932, AUC: 0.63988
Valid - Loss: 0.67102, Acc: 0.60688, F1: 0.70696, AUC: 0.58302

Training completed. Best Results (Epoch 16) - Acc: 0.61671, F1: 0.64545, AUC: 0.61417

During the first epochs the precision and recall values may be NaN, so a warning at that stage is expected.
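One likely reason for such NaN values, stated here as an assumption rather than a confirmed diagnosis of this code base, is that an untrained model can predict a single class for every sample, which makes the denominator of precision (or recall) zero. The stand-alone sketch below reproduces that situation; `precision_recall` is a hypothetical helper, not the metric code used in train_test.py.

```python
import math


def precision_recall(y_true, y_pred):
    """Compute precision and recall for binary labels, NaN when undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Precision is undefined when nothing was predicted positive.
    precision = tp / (tp + fp) if (tp + fp) > 0 else math.nan
    # Recall is undefined when there are no positive labels.
    recall = tp / (tp + fn) if (tp + fn) > 0 else math.nan
    return precision, recall
```

For example, `precision_recall([1, 0, 1], [0, 0, 0])` yields a NaN precision because no sample was predicted positive, while recall is a well-defined 0.0.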
