-
Audio file
https://github.com/QwenLM/Qwen-Audio/blob/main/assets/audio/1272-128104-0000.flac
-
Prompt
what does the person say?
The person says: "mister quilter is the apostle of the middle classes and we are glad to welcome his gospel".
This model requires additional module.
pip3 install transformers
pip3 install tiktoken
pip3 install librosa
Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.
For the sample wav,
$ python3 qwen_audio.py
If you want to specify the audio, put the file path after the --input
option.
$ python3 qwen_audio.py --input AUDIO_FILE
If you want to specify the prompt, put the prompt after the --prompt
option.
$ python3 qwen_audio.py --prompt PROMPT
Pytorch
ONNX opset=17
Qwen-Audio-Chat_encode.onnx.prototxt
Qwen-Audio-Chat.onnx.prototxt