Update: SingVisio citations and resources links #274

yuantuo666 · 2024-09-19T07:26:39Z

✨ Description

Update: SingVisio citations and resources links

🚧 Related Issues

None

👨‍💻 Changes Proposed

SingVisio README.md: citations and links.
Amphion README.md: citations and links.

🧑‍🤝‍🧑 Who Can Review?

@lmxue @RMSnow

🛠 TODO

None

✅ Checklist

Code has been reviewed
Code complies with the project's code standards and best practices
Code has passed all tests
Code does not affect the normal use of existing features
Code has been commented properly
Documentation has been updated (if applicable)
Demo/checkpoint has been attached (if applicable)

RMSnow · 2024-09-19T14:51:25Z

README.md

To update the SingVisio Doc's link

RMSnow · 2024-09-19T14:52:00Z

egs/visualization/SingVisio/README.md

To update the SingVisio Doc's link

yuantuo666 · 2024-09-19T14:53:17Z

README.md

@@ -160,7 +160,7 @@ Amphion is under the [MIT License](LICENSE). It is free for both research and co
 @inproceedings{amphion,
    author={Zhang, Xueyao and Xue, Liumeng and Gu, Yicheng and Wang, Yuancheng and Li, Jiaqi and He, Haorui and Wang, Chaoren and Song, Ting and Chen, Xi and Fang, Zihao and Chen, Haopeng and Zhang, Junan and Tang, Tze Ying and Zou, Lexiao and Wang, Mingxuan and Han, Jun and Chen, Kai and Li, Haizhou and Wu, Zhizheng},
    title={Amphion: An Open-Source Audio, Music and Speech Generation Toolkit},
-    booktitle={Proc.~of SLT},
+    booktitle={{IEEE} Spoken Language Technology Workshop, {SLT} 2024},


@RMSnow Do we need to update this citation on the Emilia README.md page? Since it was also Proc.~of SLT.

lmxue

Please update the information mentioned in the comments.

lmxue · 2024-09-19T15:01:15Z

README.md

@@ -29,13 +29,13 @@
 In addition to the specific generation tasks, Amphion includes several **vocoders** and **evaluation metrics**. A vocoder is an important module for producing high-quality audio signals, while evaluation metrics are critical for ensuring consistent metrics in generation tasks. Moreover, Amphion is dedicated to advancing audio generation in real-world applications, such as building **large-scale datasets** for speech synthesis.

 ## 🚀 News
- **2024/09/01**: [Amphion](https://arxiv.org/abs/2312.09911) and [Emilia](https://arxiv.org/abs/2407.05361) got accepted by IEEE SLT 2024! 🤗
+- **2024/09/01**: [Amphion](https://arxiv.org/abs/2312.09911) and [Emilia](https://arxiv.org/abs/2407.05361) got accepted by IEEE SLT 2024! [SingVisio](https://arxiv.org/abs/2402.12660) got accepted by Computers & Graphics! 🤗


Update the accepted date.

lmxue · 2024-09-19T15:02:00Z

README.md

 - **2024/08/28**: Welcome to join Amphion's [Discord channel](https://discord.com/invite/ZxxREr3Y) to stay connected and engage with our community!
 - **2024/08/27**: *The Emilia dataset is now publicly available!* Discover the most extensive and diverse speech generation dataset with 101k hours of in-the-wild speech data now at [![hf](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-Dataset-yellow)](https://huggingface.co/datasets/amphion/Emilia-Dataset) or [![OpenDataLab](https://img.shields.io/badge/OpenDataLab-Dataset-blue)](https://opendatalab.com/Amphion/Emilia)! 👑👑👑
 - **2024/07/01**: Amphion now releases **Emilia**, the first open-source multilingual in-the-wild dataset for speech generation with over 101k hours of speech data, and the **Emilia-Pipe**, the first open-source preprocessing pipeline designed to transform in-the-wild speech data into high-quality training data with annotations for speech generation! [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2407.05361) [![hf](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-Dataset-yellow)](https://huggingface.co/datasets/amphion/Emilia) [![demo](https://img.shields.io/badge/WebPage-Demo-red)](https://emilia-dataset.github.io/Emilia-Demo-Page/) [![readme](https://img.shields.io/badge/README-Key%20Features-blue)](preprocessors/Emilia/README.md)
 - **2024/06/17**: Amphion has a new release for its **VALL-E** model! It uses Llama as its underlying architecture and has better model performance, faster training speed, and more readable codes compared to our first version. [![readme](https://img.shields.io/badge/README-Key%20Features-blue)](egs/tts/VALLE_V2/README.md)
 - **2024/03/12**: Amphion now support **NaturalSpeech3 FACodec** and release pretrained checkpoints. [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2403.03100) [![hf](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-model-yellow)](https://huggingface.co/amphion/naturalspeech3_facodec) [![hf](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-demo-pink)](https://huggingface.co/spaces/amphion/naturalspeech3_facodec) [![readme](https://img.shields.io/badge/README-Key%20Features-blue)](models/codec/ns3_codec/README.md)
- **2024/02/22**: The first Amphion visualization tool, **SingVisio**, release. [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2402.12660) [![openxlab](https://cdn-static.openxlab.org.cn/app-center/openxlab_app.svg)](https://openxlab.org.cn/apps/detail/Amphion/SingVisio) [![Video](https://img.shields.io/badge/Video-Demo-orange)](https://github.com/open-mmlab/Amphion/assets/33707885/0a6e39e8-d5f1-4288-b0f8-32da5a2d6e96) [![readme](https://img.shields.io/badge/README-Key%20Features-blue)](egs/visualization/SingVisio/README.md)
+- **2024/02/22**: The first Amphion visualization tool, **SingVisio**, release. [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2402.12660) [![openxlab](https://cdn-static.openxlab.org.cn/app-center/openxlab_app.svg)](https://openxlab.org.cn/apps/detail/Amphion/SingVisio) [![Introduction](https://img.shields.io/badge/Docs-Intro-orange)](https://speechteam.feishu.cn/wiki/KrIIwpjIVi7MhtkiCcXcjFI2nib) [![readme](https://img.shields.io/badge/README-Key%20Features-blue)](egs/visualization/SingVisio/README.md)


Update the new video demo.

lmxue · 2024-09-19T15:03:22Z

egs/visualization/SingVisio/README.md

@@ -2,17 +2,19 @@

 [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2402.12660)
 [![openxlab](https://cdn-static.openxlab.org.cn/app-center/openxlab_app.svg)](https://openxlab.org.cn/apps/detail/Amphion/SingVisio)
-[![Video](https://img.shields.io/badge/Video-Demo-orange)](https://github.com/open-mmlab/Amphion/assets/33707885/0a6e39e8-d5f1-4288-b0f8-32da5a2d6e96)
+[![Introduction](https://img.shields.io/badge/Docs-Intro-orange)](https://speechteam.feishu.cn/wiki/KrIIwpjIVi7MhtkiCcXcjFI2nib)


Update the new video demo.

lmxue · 2024-09-19T15:04:15Z

README.md

@@ -87,7 +87,7 @@ Amphion provides a comprehensive objective evaluation of the generated audio. Th

 Amphion provides visualization tools to interactively illustrate the internal processing mechanism of classic models. This provides an invaluable resource for educational purposes and for facilitating understandable research.

-Currently, Amphion supports [SingVisio](egs/visualization/SingVisio/README.md), a visualization tool of the diffusion model for singing voice conversion. [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2402.12660) [![openxlab](https://cdn-static.openxlab.org.cn/app-center/openxlab_app.svg)](https://openxlab.org.cn/apps/detail/Amphion/SingVisio) [![Video](https://img.shields.io/badge/Video-Demo-orange)](https://github.com/open-mmlab/Amphion/assets/33707885/0a6e39e8-d5f1-4288-b0f8-32da5a2d6e96)
+Currently, Amphion supports [SingVisio](egs/visualization/SingVisio/README.md), a visualization tool of the diffusion model for singing voice conversion. [![arXiv](https://img.shields.io/badge/arXiv-Paper-COLOR.svg)](https://arxiv.org/abs/2402.12660) [![openxlab](https://cdn-static.openxlab.org.cn/app-center/openxlab_app.svg)](https://openxlab.org.cn/apps/detail/Amphion/SingVisio) [![Introduction](https://img.shields.io/badge/Docs-Intro-orange)](https://speechteam.feishu.cn/wiki/KrIIwpjIVi7MhtkiCcXcjFI2nib)


Update the new video demo.

yuantuo666 added 2 commits September 19, 2024 15:14

Update: SingVisio docs

e283a05

Fix: README.md links

d36ce95

yuantuo666 requested review from RMSnow and lmxue September 19, 2024 07:26

Update bibtex

d9c1326

RMSnow requested changes Sep 19, 2024

View reviewed changes

yuantuo666 commented Sep 19, 2024

View reviewed changes

lmxue reviewed Sep 19, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update: SingVisio citations and resources links #274

Update: SingVisio citations and resources links #274

yuantuo666 commented Sep 19, 2024

RMSnow Sep 19, 2024

RMSnow Sep 19, 2024

yuantuo666 Sep 19, 2024

lmxue left a comment

lmxue Sep 19, 2024

lmxue Sep 19, 2024

lmxue Sep 19, 2024

lmxue Sep 19, 2024

Update: SingVisio citations and resources links #274

Are you sure you want to change the base?

Update: SingVisio citations and resources links #274

Conversation

yuantuo666 commented Sep 19, 2024

✨ Description

🚧 Related Issues

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

🛠 TODO

✅ Checklist

RMSnow Sep 19, 2024

Choose a reason for hiding this comment

RMSnow Sep 19, 2024

Choose a reason for hiding this comment

yuantuo666 Sep 19, 2024

Choose a reason for hiding this comment

lmxue left a comment

Choose a reason for hiding this comment

lmxue Sep 19, 2024

Choose a reason for hiding this comment

lmxue Sep 19, 2024

Choose a reason for hiding this comment

lmxue Sep 19, 2024

Choose a reason for hiding this comment

lmxue Sep 19, 2024

Choose a reason for hiding this comment