Disfluency Detection, Removal and Correction: Increase Apparent Public Speaking Fluency By Speech Augmentation
-
To run:
python audio_receiver_remover.py 7777
7777
is the <port_number> to host the API- Open the
index.html
in Chrome/Firefox - Navigate to the tool & use the samples!
-
If you change the <port_number>
- Go to
/js/vis.js
and replace allhttp://localhost:7777/
withhttp://localhost:<new_port_number>/
- Go to
-
Required Packages:
- web.py
- librosa
- tensorflow-gpu (>=1.5,<2)
- pydub
- pyAudioAnalysis
- soundfile
- h5py
- scipy
- numpy
- sklearn
- matplotlib
- hmmlearn
If you use our tool please consider citing our paper:
@inproceedings{das2019increase,
title={Increase Apparent Public Speaking Fluency by Speech Augmentation},
author={Das, Sagnik and Gandhi, Nisha and Naik, Tejas and Shilkrot, Roy},
booktitle={ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={6890--6894},
year={2019},
organization={IEEE}
}
If you have any questions with the code please open an issue.