YODA repo to align vandam corpus using Montreal Forced Aligner.

Martin Frébourg e867d77c1f modified annotations input in compare.py 3 years ago
.datalad c11e500a15 [DATALAD] new dataset 3 years ago
.vscode 385198801d corrected with lucas' review 3 years ago
code 65a6385203 added compare.py 3 years ago
inputs 2cb46a39ea cleaned unnecessary files 3 years ago
outputs e867d77c1f modified annotations input in compare.py 3 years ago
.gitattributes c84fb1c33e Apply YODA dataset setup 3 years ago
.gitmodules 1d2b1741b5 [DATALAD] Recorded changes 3 years ago
CHANGELOG.md c84fb1c33e Apply YODA dataset setup 3 years ago
README.md 2820fd10b6 modified readme 3 years ago

README.md

Project

Dataset structure

  • All inputs (i.e. building blocks from other sources) are located in inputs/.
  • All custom code is located in code/.

Steps to generate aligned .csv from vandam-data .cha annotations

  1. Run code/csv2grid with annotations/cha/converted as input (converts the original .csv files to .TextGrid).
  2. Run MFA Align on the output files of the previous step (requires inputs/mfa-models/acoustic and inputs/mfa-models/dictionary).
  3. Run code/grid2csv on the outputs of the previous step to convert the aligned .TextGrid files back to .csv.
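
As a rough illustration of what steps 1 and 2 above involve, here is a minimal Python sketch. It is not the repository's code/csv2grid script: the column names `segment_onset`, `segment_offset` (in milliseconds) and `transcription`, the tier name, and the directory names passed to the command builder are all assumptions made for the example. The sketch also assumes segments do not overlap. The `mfa align` argument order follows the MFA 2.x command line (corpus, dictionary, acoustic model, output) and should be checked against the installed version.

```python
import csv
import io


def rows_to_textgrid(rows, tier_name="utterances"):
    """Build a long-format Praat TextGrid string with a single interval tier.

    `rows` is an iterable of dicts with the assumed keys `segment_onset`,
    `segment_offset` (both in milliseconds) and `transcription`.
    Segments are assumed not to overlap.
    """
    segments = sorted(
        (float(r["segment_onset"]) / 1000.0,
         float(r["segment_offset"]) / 1000.0,
         r["transcription"])
        for r in rows
    )
    intervals = []
    cursor = 0.0
    for start, end, text in segments:
        if start > cursor:
            # Fill the gap before this segment with an empty interval.
            intervals.append((cursor, start, ""))
        intervals.append((start, end, text))
        cursor = end
    xmax = cursor
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        "",
        "xmin = 0",
        f"xmax = {xmax}",
        "tiers? <exists>",
        "size = 1",
        "item []:",
        "    item [1]:",
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        "        xmin = 0",
        f"        xmax = {xmax}",
        f"        intervals: size = {len(intervals)}",
    ]
    for i, (start, end, text) in enumerate(intervals, start=1):
        escaped = text.replace('"', '""')  # Praat doubles embedded quotes
        lines += [
            f"        intervals [{i}]:",
            f"            xmin = {start}",
            f"            xmax = {end}",
            f'            text = "{escaped}"',
        ]
    return "\n".join(lines) + "\n"


def mfa_align_command(corpus_dir, dictionary, acoustic_model, output_dir):
    """Return the argument list for `mfa align` (MFA 2.x positional order)."""
    return ["mfa", "align", corpus_dir, dictionary, acoustic_model, output_dir]


# Hypothetical input resembling a converted annotation file.
sample_csv = (
    "segment_onset,segment_offset,transcription\n"
    "500,1500,hello\n"
    '2000,3000,"say ""hi"""\n'
)
textgrid = rows_to_textgrid(csv.DictReader(io.StringIO(sample_csv)))

# Hypothetical directory names; the model paths are the ones named in step 2.
command = mfa_align_command(
    "outputs/textgrids",
    "inputs/mfa-models/dictionary",
    "inputs/mfa-models/acoustic",
    "outputs/aligned",
)
```

The command list could then be passed to `subprocess.run(command, check=True)` once the corpus directory contains both the audio and the generated .TextGrid files.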

Steps for comparison of aligned segments with human annotator

  1. Use the child-project sampler (high-volubility) to generate five 1-minute segments and write them to outputs/.
  2. Use child-project eaf-builder with the files generated in the previous step and the templates in inputs/eaf_templates.
  3. Annotate the segments by hand in ELAN.
  4. Create a .csv dataframe with each segment in outputs/fivesegments-eaf.
  5. Import that .csv with child-project import-annotations.
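
Once both annotation sets are imported, the comparison itself can be sketched as follows. This is not the repository's code/compare.py, whose logic is not shown here; it is one plausible approach, assuming each segment is an `(onset, offset)` pair in milliseconds. Each human segment is paired with the aligned segment that overlaps it most, and the onset and offset differences are reported.

```python
def overlap(a, b):
    """Length of the overlap between two (onset, offset) segments, in ms."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


def boundary_errors(aligned, human):
    """Pair each human segment with its best-overlapping aligned segment.

    Returns a list of (onset_error, offset_error) tuples, computed as
    aligned minus human. Human segments with no overlapping aligned
    segment are skipped.
    """
    errors = []
    for h in human:
        best = max(aligned, key=lambda a: overlap(a, h), default=None)
        if best is None or overlap(best, h) == 0:
            continue
        errors.append((best[0] - h[0], best[1] - h[1]))
    return errors


# Hypothetical segments, in milliseconds.
aligned_segments = [(0, 1000), (1200, 2000)]
human_segments = [(100, 950), (1250, 2100), (5000, 6000)]
errors = boundary_errors(aligned_segments, human_segments)
```

A negative onset error means the aligner placed the boundary earlier than the human annotator did; averaging the absolute errors gives a single summary figure per recording.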