The EL1000 dataset is a collection of corpora of annotations derived from child-centered longform audio recordings in a naturalistic environment.