Merge pull request #40 from tokk-nv/dev-audiocraft

Add AudioCraft tutorial
NVIDIA-AI-IOT · Nov 7, 2023 · 4b2e136 · 4b2e136
2 parents eedcce8 + 1930d67
commit 4b2e136
Show file tree

Hide file tree

Showing 5 changed files with 89 additions and 0 deletions.
diff --git a/docs/assets/80s-pop.wav b/docs/assets/80s-pop.wav
diff --git a/docs/assets/subway.wav b/docs/assets/subway.wav
diff --git a/docs/images/audiocraft_jupyterlab_demo.png b/docs/images/audiocraft_jupyterlab_demo.png
diff --git a/docs/tutorial_audiocraft.md b/docs/tutorial_audiocraft.md
@@ -0,0 +1,87 @@
+# Tutorial - AudioCraft
+
+Let's run Meta's [AudioCraft](https://github.com/facebookresearch/audiocraft), to produce high-quality audio and music on Jetson!
+
+!!! abstract "What you need"
+
+    1. One of the following Jetson:
+
+        <span class="blobDarkGreen4">Jetson AGX Orin 64GB</span>
+        <span class="blobDarkGreen5">Jetson AGX Orin (32GB)</span>
+        <span class="blobLightGreen4">Jetson Orin Nano Orin (8GB)</span>
+
+    2. Running one of the following [JetPack.5x](https://developer.nvidia.com/embedded/jetpack)
+
+        <span class="blobPink1">JetPack 5.1.2 (L4T r35.4.1)</span>
+        <span class="blobPink2">JetPack 5.1.1 (L4T r35.3.1)</span>
+        <span class="blobPink3">JetPack 5.1 (L4T r35.2.1)</span>
+
+    3. Sufficient storage space (preferably with NVMe SSD).
+
+        - `10.7 GB` for `audiocraft` container image
+        - Space for checkpoints
+
+## Clone and set up `jetson-containers`
+
+```
+git clone https://github.com/dusty-nv/jetson-containers
+cd jetson-containers
+sudo apt update; sudo apt install -y python3-pip
+pip3 install -r requirements.txt
+```
+## How to start
+
+Use `run.sh` and `autotag` script to automatically pull or build a compatible container image.
+
+```
+cd jetson-containers
+./run.sh $(./autotag audiocraft)
+```
+
+The container has a default run command (`CMD`) that will automatically start the Jupyter Lab server.
+
+Open your browser and access `http://<IP_ADDRESS>:8888`.
+
+> The default password for Jupyter Lab is `nvidia`.
+
+## Run Jupyter notebooks
+
+AudioCraft repo comes with demo Jupyter notebooks.
+
+On Jupyter Lab navigation pane on the left, double-click `demos` folder. 
+
+![](./images/audiocraft_jupyterlab_demo.png)
+
+### AudioGen demo
+
+For "**Text-conditional Generation**", you should get something like this.
+
+<audio controls>
+  <source src="./assets/subway.wav" type="audio/wav">
+Your browser does not support the audio element.
+</audio>
+
+!!! info
+
+    You may encounter an error message like the following when executing the first cell, but you can keep going.
+    ```
+    A matching Triton is not available, some optimizations will not be enabled.
+    Error caught was: No module named 'triton'
+    ```
+
+!!! warning
+
+    When running the 5-th cell of `audiogen_demo.ipynb`, you may run into "**Failed to load audio**" RuntimeError.
+
+### MusicGen demo
+
+For "**Text-conditional Generation**", you should get something like this.
+
+<audio controls>
+  <source src="./assets/80s-pop.wav" type="audio/wav">
+Your browser does not support the audio element.
+</audio>
+
+!!! warning
+
+    When running the 5-th cell of `musicgen_demo.ipynb`, you may run into "**Failed to load audio**" RuntimeError.
diff --git a/mkdocs.yml b/mkdocs.yml
@@ -87,6 +87,8 @@ nav:
     # - NanoOWL: tutorial_nanoowl.md
     - Vector Database:
       - NanoDB: tutorial_nanodb.md
+    - Audio:
+      - Audiocraft: tutorial_audiocraft.md
     # - Tools:
     #   - LangChain: tutorial_distillation.md
     - Tips: