Skip to content

danielchyeh/Sequence_Labeling

Repository files navigation

Sequence_Labeling

– Bidirectional LSTM & CNN layers are used for training the model, which predicts the associated phone sequences given the acoustic signals.

Project Link: https://www.csie.ntu.edu.tw/~yvchen/f106-adl/A1

Dataset

TIMIT dataset, which has MFCC features (39 dims) and FBank features (69 dims) for each frame is used in this task. It also has 48 different kinds of phone.

TIMIT dataset can be downloaded from kaggle https://www.kaggle.com/c/hw1-timit/data (created by class MLDS2017, NTU)

Quick start

Run the shell script

./hw1_best.sh [input directory] [output filename]

or any of the following: hw1_cnn.sh, hw1_rnn.sh

[input directory] should be TIMIT dataset which you download from link above

[output filename] a .csv file which shows the result of prediction

About

Keras implementation of Bidirectional LSTM & CNN layers, which predicts the associated phone sequences given the acoustic signals.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors