mfrebo/align-vandam: YODA repo to align vandam corpus using Montreal Forced Aligner. @ 1fd0ae8758b51b06f8937dfe9209b59bcf093739

Martin Frébourg 1fd0ae8758 edited comaprison.md		3 years ago
.datalad	c11e500a15 [DATALAD] new dataset	3 years ago
.vscode	385198801d corrected with lucas' review	3 years ago
code	dbd9efcf3f updated compare.py + confusion matrices	3 years ago
inputs	2cb46a39ea cleaned unnecessary files	3 years ago
outputs	dbd9efcf3f updated compare.py + confusion matrices	3 years ago
.gitattributes	c84fb1c33e Apply YODA dataset setup	3 years ago
.gitmodules	1d2b1741b5 [DATALAD] Recorded changes	3 years ago
CHANGELOG.md	c84fb1c33e Apply YODA dataset setup	3 years ago
Comparison-summary.md	1fd0ae8758 edited comaprison.md	3 years ago
README.md	2820fd10b6 modified readme	3 years ago

Steps to generate aligned .csv from vandam-data .cha annotations

Run code/csv2grid with annotations/cha/converted as input (this converts the original .csv files to .TextGrid).

Run MFA align on the output files of the previous step, using the acoustic model and dictionary in inputs/mfa-models/acoustic and inputs/mfa-models/dictionary.

Run code/grid2csv on the outputs of the previous step to convert the aligned .TextGrid files back to .csv.
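As a rough illustration of the alignment step, the sketch below assembles an `mfa align` call in Python. It assumes the MFA 2.x command line, where the positional arguments are the corpus directory, the dictionary, the acoustic model, and the output directory, in that order. The directories `outputs/textgrids` and `outputs/aligned` are hypothetical placeholders, since this README does not say where code/csv2grid writes its files; only the two inputs/mfa-models paths come from the steps above.

```python
import shlex


def build_mfa_align_command(corpus_dir, dictionary, acoustic_model, output_dir):
    """Return the argument list for an `mfa align` call.

    Assumed positional order (MFA 2.x): corpus directory, dictionary,
    acoustic model, output directory.
    """
    return [
        "mfa",
        "align",
        str(corpus_dir),
        str(dictionary),
        str(acoustic_model),
        str(output_dir),
    ]


command = build_mfa_align_command(
    corpus_dir="outputs/textgrids",  # hypothetical: TextGrids from code/csv2grid
    dictionary="inputs/mfa-models/dictionary",
    acoustic_model="inputs/mfa-models/acoustic",
    output_dir="outputs/aligned",  # hypothetical: where aligned TextGrids go
)

# Show the command as it would be typed in a shell.
print(shlex.join(command))

# To actually launch the aligner (requires MFA to be installed):
# import subprocess
# subprocess.run(command, check=True)
```

Building the command as a list rather than a single string avoids quoting problems if any of the paths contain spaces.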

Steps for comparison of aligned segments with human annotator

Use the child-project sampler to generate five 1-minute segments (high-volubility) and write them to outputs/.

Use the child-project eaf-builder with the files generated in the previous step and the templates in inputs/eaf_templates.

Annotate the segments by hand in ELAN.

Create a .csv dataframe with each segment in outputs/fivesegments-eaf.

Import that .csv with child-project import-annotations.
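The dataframe for the import step could be assembled along these lines. This is a minimal sketch: the column names (`set`, `recording_filename`, `time_seek`, `range_onset`, `range_offset`, `raw_filename`, `format`) reflect an assumption about the ChildProject annotation-import format and should be checked against the ChildProject documentation. The recording name, set name, segment times, and .eaf file names are hypothetical.

```python
import csv
import io

# Assumed column layout of the import dataframe; verify against ChildProject.
FIELDS = [
    "set",
    "recording_filename",
    "time_seek",
    "range_onset",
    "range_offset",
    "raw_filename",
    "format",
]


def build_import_rows(segments, annotation_set):
    """Turn (recording, onset_ms, offset_ms, eaf_file) tuples into import rows."""
    rows = []
    for recording, onset, offset, eaf_file in segments:
        if offset <= onset:
            raise ValueError(f"empty or reversed segment: {onset}-{offset}")
        rows.append(
            {
                "set": annotation_set,
                "recording_filename": recording,
                "time_seek": 0,
                "range_onset": onset,
                "range_offset": offset,
                "raw_filename": eaf_file,
                "format": "eaf",
            }
        )
    return rows


def rows_to_csv_text(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical example: two of the five 1-minute segments, times in milliseconds.
segments = [
    ("recording.wav", 120000, 180000, "segment_1.eaf"),
    ("recording.wav", 600000, 660000, "segment_2.eaf"),
]
rows = build_import_rows(segments, annotation_set="eaf")
csv_text = rows_to_csv_text(rows)
print(csv_text)
```

The resulting text can be saved to a file and passed to child-project import-annotations.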
