Adapting the Audio Spectrogram Transformer (AST) for human language classification.
Class project for Stanford CS 224S (Spoken Language Processing), spring 2024.
Please refer to the below links to see how AST was finetuned and evaluated for various spoken language tasks.