how I set up an annotation campaign #216
-
Indeed. And this is a significant drawback, since the energy sampler may be slow... (even though you can just sample everything and retain only the segments for the recordings you're targeting).
Good question. segments_path = os.path.join(destination, 'segments_{}.csv'.format(date))
-
We probably want a fully public sample file to train annotators with. I suggest we use the first recording of Anae in the Paris corpus:
-
Process adapted to the Paris minicorpus:
then inside samples/selrecs.csv I put:
A few variants of the sampling command give me a concatenation error:
Notice that the last one asks for a rather low threshold, no skipping, and no spacing.
-
TODO
My goals
In this project, I want to select sections to be annotated by humans in the lab, with the end goal of having more exemplars of "other child" (OCH) and "male adult" (MAL). My annotators speak French, so I've decided to draw samples from the Lyon corpus. I've looked at prior transcriptions and determined that the children with the most OCH and MAL are: GAL, DUN, GOE2, FRH1, CUM, COF.
Prep work
source ~/ChildProjectVenv/bin/activate
pip3 install git+https://github.com/LAAC-LSCP/ChildProject.git
Whenever I get back to this project after a while, I always do these checks:
source ~/ChildProjectVenv/bin/activate
pip3 install git+https://github.com/LAAC-LSCP/ChildProject.git
git pull
and then datalad update
git status
then check which is the latest branch on https://gin.g-node.org/EL1000/lyon, and if needed switch with something like git checkout eaf-corrections
datalad get annotations/eaf/mc/converted annotations/its/converted/
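For convenience, here is that refresh routine collected into one block (just the commands above, in order; the branch checkout is only needed when a newer branch exists on the GIN repository):
# refresh the environment and the dataset before resuming work
source ~/ChildProjectVenv/bin/activate
pip3 install git+https://github.com/LAAC-LSCP/ChildProject.git
git pull
datalad update
git status
git checkout eaf-corrections    # only if a newer branch exists on gin.g-node.org/EL1000/lyon
datalad get annotations/eaf/mc/converted annotations/its/converted/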
The sampler phase
I don't want our annotations to be too biased by our current algorithms' performance on OCH and MAL; but I also don't want to have a lot of silence. Among the sampling types mentioned in the sampler docs (currently: periodic, random-vocalizations, high-volubility, energy-detection), the most appropriate to avoid silence while not biasing selection by our voice type classifier is energy-detection.
Among the options for the energy-based sampler, I need to choose the window length, spacing, offset, and count, the energy threshold, and the frequency band to consider.
This leads me to the command (the first two parameters are the path to the dataset, and the path to the folder where segments will be stored):
child-project sampler . samples/och_mal/ --recordings samples/selrecs.csv energy-detection --windows-length 30000 --windows-spacing 300000 --windows-offset 1800000 --windows-count 40 --threshold .75 --low-freq 50 --high-freq 3000 --by recording_filename
For one sample child, I got:
TODO --profile converted
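To sanity-check what the sampler produced, a quick look at the output file helps (a sketch; the segments_{date}.csv naming comes from the reply above, and the exact columns may differ across ChildProject versions):
# peek at the first few sampled segments and count how many were drawn
head -n 5 samples/och_mal/segments_*.csv
wc -l samples/och_mal/segments_*.csv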
Creating a template
This annotation will have a first-pass check by humans, who listen to 30s and decide whether or not they'll annotate that segment. Then there is a second pass, for which we'll use a template we created for this purpose; it is a variant of ACLEW's, and it differs in that:
To that end, we need to create a specific ELAN template. This is done inside ELAN, following these instructions.
exelang-template.zip
Building my .eafs
The first parameter is the destination. (Notice that I don't need to provide the path to the project for this one.) The segments file is the one output by the previous step.
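For reference, a sketch of what that call can look like; the flag names are my reading of the eaf-builder options (double-check with child-project eaf-builder --help), and the destination, segments file name, and template name are placeholders:
# sketch only: destination first, then the segments CSV produced by the sampler step
child-project eaf-builder --destination samples/och_mal/eafs/ --segments samples/och_mal/segments_YYYYMMDD.csv --eaf-type random --template exelang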
TODO check with LG: --eaf-type random
output:
Using the seated scribe for selecting sections to annotate
TODO I'm adding/removing the .wav extension -- we should fix this upstream
Setting up files for annotators to access them
to be discussed
In this step, we would take the sections marked "yes" and set them up to be annotated. But perhaps it is simpler to create a template with everything, and then, during import, designate this as the section that has been coded?
todo incorporate this, in order to split the sound files into the different recordings contained in a single wav file:
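Until that is incorporated, here is a hypothetical sketch of doing the split by hand with sox, assuming the onset and duration (in seconds) of each recording within the long wav are known (file names and times below are placeholders):
# cut individual recordings out of a longer session wav (placeholder values)
sox long_session.wav recording_01.wav trim 0 3600
sox long_session.wav recording_02.wav trim 3600 3600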