-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Created SER tutorial #201
base: main
Are you sure you want to change the base?
Created SER tutorial #201
Conversation
Thanks @wilke0818 for the tutorial! It’s helpful and does a good job of addressing real challenges that users might face. I like it, and I’m curious to see how actual users respond. A few things are still missing to make this more complete:
def is_colab():
try:
import google.colab
return True
except ImportError:
return False
if is_colab():
%pip install senselab
else:
print("Not running on Colab. Skipping installation.")
Let me know if you need help with any of these points! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @wilke0818 . I have commented some required changes
hi, @wilke0818 ! Did you have any time to work on this? |
Nope. Added the tutorial change that you gave (might not have updated). Need to refactor for the API per the other issue on this topic. It is unclear to me what you would want for documentation. The functionalities themselves are documented already and this tutorial provides the information about the task a user might need (it is pretty akin to the link you sent). |
This reverts commit 9749f39ec273cac1016fb3461e5cf1344f340073.
Every task has a documentation page that explains what the task is, how it's commonly evaluated, what are the popular datasets and models. You can see the doc: https://sensein.group/senselab/senselab/audio/tasks/text_to_speech.html |
I mean that makes sense but do we want this to be an SER task or a generic audio classification task (which is what the HuggingFace pipeline is) which doesn't have a specific task/dataset but where SER is just an example usage of the task? |
Following #197, both. I would implement both a classification task (as HuggingFace has) and a SER task and would make it so that SER exploits the classification interfaces and employs some checks before and after (e.g., outputs should be emotion-related) |
Description
Creates a tutorial using Senselab for SER.
Related Issue(s)
#197
Motivation and Context
We were lacking a tutorial using the functionality for audio classification, which currently has one specific implementation, speech emotion recognition, and so now novice users can better understand how to Senselab might be useful for this task.
How Has This Been Tested?
Through Colab