yaya-sy 24feb5e9fd opensubtitles downloader only phonemes 2 years ago
.gitattributes 2702adb24e Apply YODA dataset setup 2 years ago
README.md 2702adb24e Apply YODA dataset setup 2 years ago
download_childes_corpora.py b4fa96e143 add rmarkdown code 2 years ago
download_opensubtitles_corpora.py 24feb5e9fd opensubtitles downloader only phonemes 2 years ago
evaluate_language_models.py 202ad44f37 importing conda environment 2 years ago
get_most_probable_phonemes.py b4fa96e143 add rmarkdown code 2 years ago
make_noiser.py b4fa96e143 add rmarkdown code 2 years ago
one_utterance_per_line_to_json.py b4fa96e143 add rmarkdown code 2 years ago
test_on_all_languages.py f7c6ced59f add plot 2 years ago
train_language_models.sh 7ac48a81ab update readme 2 years ago
utterances_cleaner.py b4fa96e143 add rmarkdown code 2 years ago

README.md

All custom code goes into this directory. Scripts should be written so that they can be executed from the root of the dataset and use only relative paths, which keeps the dataset portable.
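As a minimal sketch of this convention, the script below builds every path relative to the dataset root and rejects absolute paths. The directory names `data/raw` and `data/processed` and the helper functions are hypothetical, chosen only for illustration; they are not taken from this repository.

```python
from pathlib import Path

# Hypothetical directories, given relative to the dataset root.
# These names are assumptions for illustration, not paths from this repository.
INPUT_DIR = Path("data/raw")
OUTPUT_DIR = Path("data/processed")


def require_relative(path: Path) -> Path:
    """Return the path unchanged, or raise if it is absolute."""
    if path.is_absolute():
        raise ValueError(f"expected a path relative to the dataset root: {path}")
    return path


def output_path_for(input_file: Path) -> Path:
    """Map an input file to a .json file of the same name under OUTPUT_DIR."""
    require_relative(input_file)
    return OUTPUT_DIR / input_file.with_suffix(".json").name


if __name__ == "__main__":
    # Relative paths are resolved against the current working directory,
    # so the script is expected to be launched from the dataset root.
    for input_file in sorted(INPUT_DIR.glob("*.txt")):
        print(input_file, "->", output_path_for(input_file))
```

Because no path is anchored to a particular machine, the same command works wherever the dataset is cloned, provided it is run from the dataset root.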

datacite.yml
Title Unsupervised metrics of child language development
Authors Sy,Yaya;LSCP;ORCID:0000-0002-0292-451X
Description Code to extract metrics of child language development from language models based on n-grams.
License Creative Commons CC 4.0 By (https://creativecommons.org/licenses/by/4.0/)
References Sy, Y. (2022, July 18). Vers des métriques non supervisées des compétences langagières des enfants. [doi:10.31219/osf.io/4pe2u] (IsSupplementTo)
Funding ANR, ANR-17-EURE-0017
J. S. McDonnell Foundation, Understanding Human Cognition Scholar Award
ERC, European Union’s Horizon 2020 research and innovation programme (ExELang, Grant agreement No. 101001095)
Keywords Language acquisition
Language models
Development
Children
n-gram
Metrics
Entropy
Resource Type Software