Speech commands classification with Transformers on the Speech Commands Dataset from the TensorFlow Speech Recognition Challenge.
We use the Speech Commands Dataset from the TensorFlow Speech Recognition Challenge hosted on the kaggle platform as our dataset to train and test our models on. It consists of 30 different classes and includes 65,000 samples in total. The samples are 1- second audio snippets with a sampling rate of 16,000 Hz.