LAAC-LSCP/align-vandam: some default @ 2820fd10b6030fd43b21b77c6d451e35feaf9062

some default

2 Větve

Martin Frébourg 2820fd10b6 modified readme		před 3 roky
.datalad	c11e500a15 [DATALAD] new dataset	před 3 roky
.vscode	385198801d corrected with lucas' review	před 3 roky
code	34bc9b8856 modified grid2csv.py	před 3 roky
inputs	2cb46a39ea cleaned unnecessary files	před 3 roky
outputs	34bc9b8856 modified grid2csv.py	před 3 roky
.gitattributes	c84fb1c33e Apply YODA dataset setup	před 3 roky
.gitmodules	1d2b1741b5 [DATALAD] Recorded changes	před 3 roky
CHANGELOG.md	c84fb1c33e Apply YODA dataset setup	před 3 roky
README.md	2820fd10b6 modified readme	před 3 roky

		
				README.md
			
				Project 

Dataset structure

All inputs (i.e. building blocks from other sources) are located in
inputs/.
All custom code is located in code/.

Steps to generate aligned .csv from vandam-data .cha annotations

Run code/csv2grid with annotations/cha/converted as input (converts the original .csv to .TextGrid)
Run MFA Align with output files of previous step as input (with inputs/mfa-models/acoustic & inputs/mfa-models/dictionary as required)
Run code/grid2csv to convert .TextGrids to .csv with outputs of previous step as input.

Steps for comparison of aligned segments with human annotator

Use child-project sampler to generate 5x 1 minute segments (high-volubility) and outputs them in outputs/
Use child-project eaf-builder with files generated at previous step and templates at inputs/eaf_templates
Annotate segments by hand on ELAN
Create csv dataframe with each segment in outputs/fivesegments-eaf
Import that .csv with child-project import-annotations