From 8fc62917c37ac0428950c5764f603bfcfac13263 Mon Sep 17 00:00:00 2001
From: Simon Devauchelle
Date: Thu, 13 Jul 2023 17:35:32 +0200
Subject: [PATCH] Minor fixes

---
 README.md | 48 +++++++++++++++++++++++++++++-------------------
 1 file changed, 29 insertions(+), 19 deletions(-)

diff --git a/README.md b/README.md
index fd3489d..dda8987 100644
--- a/README.md
+++ b/README.md
@@ -108,7 +108,7 @@ Class constructor accept 3 optional arguments:
 #### _VBx-based_ models
 Another gender segmentation system, which utilizes **x-vectors** (https://github.com/BUTSpeechFIT/VBx), is also available and can be specified in the class constructor for usage:
-* vbx_based (default:False): if set to True, performs gender segmentation using _vbx-based_ system on speech segment. Computation time is longer but gender detection can be better depending on your use (see scoring tables).
+* vbx_based (default: False): if set to True, performs gender segmentation using _vbx-based_ system on speech segment. Computation time is longer but gender detection can be better depending on your use (see scoring tables).
 Warning : 'detect_gender' argument must be set to True.
@@ -116,7 +116,17 @@ Warning : 'detect_gender' argument must be set to True.
 **Frame-level evaluation** (collar = 500ms)
-
+
+
+
+
+
+
+
+
+
+
+
@@ -138,53 +148,53 @@ Warning : 'detect_gender' argument must be set to True.
 Recall Female
 96,88
-98,97
+98,97
 95,86
-97,00
+97,00
 96,15
-97,82
+97,82
 Male
 99,26
-99,38
-98,57
+99,38
+98,57
 97,90
-98,55
+98,55
 95,60
 Precision Female
 97,53
-97,54
+97,54
 90,05
-94,95
-95,07
+94,95
+95,07
 94,47
 Male
 97,30
-99,61
+99,61
 98,97
-99,35
+99,35
 96,47
-96,85
+96,85
 F1-score
 97,74
-98,87
+98,87
 95,82
-98,87
-96,55
+98,87
+96,55
 96,16
-\* **DATA-INA-FR** : a new annotated corpus of French audiovisual archives has been annotated.
+\* **DATA-INA-FR** : a new corpus of French audiovisual archives has been annotated.
 It represents 285 minutes of content from French television and radio channels. This corpus is much noisier, but more representative of the reality of an audiovisual stream. We recommend setting vbx_based to False if you are processing such data.
@@ -228,7 +238,7 @@ Details on the speech detection submodule can be found bellow:
 }
 ```
-If you use the VBx-Based Voice Femininity Scoring, you can cite its publication accepted in the 24th INTERSPEECH Conference (2023) in Dublin, Ireland:
+If you use the Voice Femininity Scoring, you can cite its publication accepted in the 24th INTERSPEECH Conference (2023) in Dublin, Ireland:
 ```bibtex
 @inproceedings{ddoukhaninterspeech2023,
   author = {Doukhan, David and Devauchelle, Simon and Girard-Monneron Lucile and Wagner, Isabelle and Rilliard Albert.},