Add PicoAudio Model #249

Merged
merged 1 commit into from
Jan 2, 2025

Conversation

@zeyuxie29 (Contributor) commented Jul 19, 2024

✨ Description

This PR adds PicoAudio to the Amphion toolkit.

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

🚧 Related Issues

[List the issue numbers related to this PR]

👨‍💻 Changes Proposed

  • Added the dataloader and model implementation of PicoAudio to models/temporally_controllable_tta
  • Added the training and inference scripts of PicoAudio to models/temporally_controllable_tta

🧑‍🤝‍🧑 Who Can Review?

@zhizhengwu @HeCheng0625

🛠 TODO

✅ Checklist

  • Code has been reviewed
  • Code complies with the project's code standards and best practices
  • Code has passed all tests
  • Code does not affect the normal use of existing features
  • Code has been commented properly
  • Documentation has been updated (if applicable)
  • Demo/checkpoint has been attached (if applicable)

@jiaqili3 (Collaborator) left a comment

lgtm, could you format the code using black? thanks!

@jiaqili3 merged commit 83591d2 into open-mmlab:main on Jan 2, 2025
1 check failed