Merge pull request #40 from tokk-nv/dev-audiocraft
Add AudioCraft tutorial
Showing 5 changed files with 89 additions and 0 deletions.
@@ -0,0 +1,87 @@
# Tutorial - AudioCraft

Let's run Meta's [AudioCraft](https://github.com/facebookresearch/audiocraft) to produce high-quality audio and music on Jetson!

!!! abstract "What you need"

    1. One of the following Jetson devices:

        <span class="blobDarkGreen4">Jetson AGX Orin (64GB)</span>
        <span class="blobDarkGreen5">Jetson AGX Orin (32GB)</span>
        <span class="blobLightGreen4">Jetson Orin Nano (8GB)</span>

    2. Running one of the following [JetPack 5.x](https://developer.nvidia.com/embedded/jetpack) versions:

        <span class="blobPink1">JetPack 5.1.2 (L4T r35.4.1)</span>
        <span class="blobPink2">JetPack 5.1.1 (L4T r35.3.1)</span>
        <span class="blobPink3">JetPack 5.1 (L4T r35.2.1)</span>

    3. Sufficient storage space (preferably on an NVMe SSD).

        - `10.7 GB` for the `audiocraft` container image
        - Space for checkpoints

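The requirements above can be checked with a short script before pulling the image. The following is a minimal sketch and not part of the tutorial itself. It assumes the L4T version can be read from the first line of `/etc/nv_tegra_release` in the form `# R35 (release), REVISION: 4.1, ...`, and it treats 1 GB as 10^9 bytes. The helper names `parse_l4t_release` and `has_room_for_image` are hypothetical.

```python
import re
import shutil

# L4T releases named in the list above (JetPack 5.1, 5.1.1 and 5.1.2).
SUPPORTED_L4T = {"35.2.1", "35.3.1", "35.4.1"}
# Size of the audiocraft container image quoted above.
IMAGE_SIZE_GB = 10.7


def parse_l4t_release(text):
    """Return a version such as '35.4.1' from an nv_tegra_release line, or None.

    Assumption: the line looks like '# R35 (release), REVISION: 4.1, ...'.
    """
    match = re.search(r"R(\d+)\s+\(release\),\s+REVISION:\s+([\d.]+)", text)
    if match is None:
        return None
    return f"{match.group(1)}.{match.group(2)}"


def has_room_for_image(path=".", needed_gb=IMAGE_SIZE_GB):
    """Return True if the filesystem holding `path` has at least `needed_gb` free."""
    free_gb = shutil.disk_usage(path).free / 1e9
    return free_gb >= needed_gb


if __name__ == "__main__":
    try:
        with open("/etc/nv_tegra_release") as release_file:
            release = parse_l4t_release(release_file.readline())
    except FileNotFoundError:
        release = None  # not a Jetson, or the file lives elsewhere
    status = "listed above" if release in SUPPORTED_L4T else "not in the list above"
    print(f"L4T release: {release} ({status})")
    print(f"Room for the container image: {has_room_for_image()}")
```

The free-space check covers only the container image; checkpoints need additional space, which the tutorial does not quantify.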
## Clone and set up `jetson-containers`

```
git clone https://github.com/dusty-nv/jetson-containers
cd jetson-containers
sudo apt update; sudo apt install -y python3-pip
pip3 install -r requirements.txt
```
## How to start

Use the `run.sh` and `autotag` scripts to automatically pull or build a compatible container image.

```
cd jetson-containers
./run.sh $(./autotag audiocraft)
```

The container has a default run command (`CMD`) that automatically starts the Jupyter Lab server.

Open your browser and go to `http://<IP_ADDRESS>:8888`, where `<IP_ADDRESS>` is the IP address of your Jetson.

> The default password for Jupyter Lab is `nvidia`.

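
If the page does not load, it can help to first confirm that something is listening on port 8888. The following is a minimal sketch using only the Python standard library. `jupyter_reachable` is a hypothetical helper name, and a successful TCP connection only shows that the port is open, not that Jupyter Lab is the service behind it.

```python
import socket


def jupyter_reachable(host, port=8888, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unresolvable host names.
        return False


if __name__ == "__main__":
    # 127.0.0.1 applies when this runs on the Jetson itself; from another
    # machine, pass the Jetson's IP address instead.
    print("Port 8888 open:", jupyter_reachable("127.0.0.1"))
```
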
## Run Jupyter notebooks

The AudioCraft repo comes with demo Jupyter notebooks.

In the Jupyter Lab navigation pane on the left, double-click the `demos` folder.

![](./images/audiocraft_jupyterlab_demo.png)

### AudioGen demo

For "**Text-conditional Generation**", you should get something like this.

<audio controls>
  <source src="./assets/subway.wav" type="audio/wav">
  Your browser does not support the audio element.
</audio>

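Outside the notebook, text-conditional generation can be sketched in a few lines of Python. This is a minimal sketch based on AudioCraft's public Python API, not a copy of the demo notebook: the checkpoint name `facebook/audiogen-medium`, the function name `generate_clips`, the output file names and the example prompt are all assumptions chosen for illustration.

```python
def generate_clips(descriptions, duration_s=5.0, out_prefix="clip"):
    """Generate one audio clip per text description and save each as a WAV file."""
    # Imported here so the function can be defined on machines without
    # AudioCraft; inside the audiocraft container these imports should succeed.
    from audiocraft.data.audio import audio_write
    from audiocraft.models import AudioGen

    # Assumption: this checkpoint name; the demo notebook may use another one.
    model = AudioGen.get_pretrained("facebook/audiogen-medium")
    model.set_generation_params(duration=duration_s)

    # Returns a batch of waveforms, one per description.
    wavs = model.generate(descriptions)

    saved = []
    for idx, wav in enumerate(wavs):
        stem = f"{out_prefix}_{idx}"
        # audio_write adds the ".wav" suffix and applies loudness normalization.
        audio_write(stem, wav.cpu(), model.sample_rate, strategy="loudness")
        saved.append(f"{stem}.wav")
    return saved


# Example call (hypothetical prompt; downloads the checkpoint on first use):
# generate_clips(["subway train arriving at a station"])
```

The first call downloads the model checkpoint, which is why the requirements list reserves space for checkpoints on top of the container image.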
!!! info

    You may see a message like the following when executing the first cell. It is safe to keep going.

    ```
    A matching Triton is not available, some optimizations will not be enabled.
    Error caught was: No module named 'triton'
    ```

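The message above says the optional `triton` module could not be imported, so some optimizations are skipped while the notebook carries on. As a minimal sketch, assuming you want to check this yourself from a notebook cell, the standard library can report whether a module is importable without importing it. `optional_module_available` is a hypothetical helper name.

```python
import importlib.util


def optional_module_available(name):
    """Return True if the top-level module `name` can be found, without importing it."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    if optional_module_available("triton"):
        print("triton found: the message above should not appear")
    else:
        print("triton not found: expect the message above; you can keep going")
```
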
!!! warning

    When running the 5th cell of `audiogen_demo.ipynb`, you may run into a "**Failed to load audio**" `RuntimeError`.

### MusicGen demo

For "**Text-conditional Generation**", you should get something like this.

<audio controls>
  <source src="./assets/80s-pop.wav" type="audio/wav">
  Your browser does not support the audio element.
</audio>

!!! warning

    When running the 5th cell of `musicgen_demo.ipynb`, you may run into a "**Failed to load audio**" `RuntimeError`.