Skip to content

Latest commit

 

History

History
49 lines (31 loc) · 988 Bytes

README.md

File metadata and controls

49 lines (31 loc) · 988 Bytes

Dual-signal Transformation LSTM Network

input

audio file(16kHz)

LibriSpeech ASR corpus
http://www.openslr.org/12
1221-135766-0000.wav

Output

Audio file with noise removed

Usage

Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.

For the sample wav,

$ python3 dtln.py

If you want to specify the audio, put the file path after the --input option.

$ python3 dtln.py --input AUDIO_FILE

If you run by onnxruntime instead of ailia, you use --onnx option.

Reference

Framework

Pytorch

Model Format

ONNX opset=11

Netron