This repository contains the data and the code necessary to reproduce the results of the paper "Establishing the reliability of measures extracted from long-form recordings using LENA and the ACLEW pipeline".
This repository is structured as follows:
all-analyses.R
using the data found in DATA
Some readers may want to check our materials for reproducibility. To regenerate our supplementary materials, you will need RStudio. For further information on using Rmd for transparent (knittable) analyses, see Mike Frank & Chris Hartgerink's tutorial.
If you simply want to check the reproducibility of the paper analyses, you can download a zipped version from our GIN repo, by clicking on the button that looks like a downward pointing arrow, near the top right of the page (under Publications; see to-download-zip.jpg).
If R reports that a package was not found, install it with:
install.packages("LIBRARYNAME")
(replace LIBRARYNAME with the name of the package that R reported as not found). If the missing package is papaja, please follow the instructions here.
Dependencies can be quickly installed by issuing the following commands in RStudio:
list.of.packages <- c("lme4","performance","ggplot2","ggthemes","ggpubr","kableExtra","psych","dplyr","tidyr","stringr","car","ggbeeswarm")
new.packages <- list.of.packages[!(list.of.packages %in% installed.packages()[,"Package"])]
if(length(new.packages)) install.packages(new.packages)
If anything looks different, please double-check that you are using the same versions of all packages by comparing them with the capture of the environment at the very end of the .pdf file.
It is also possible to generate the supplementary materials from the command line (without opening RStudio) in a single instruction:
Rscript -e 'library(rmarkdown); rmarkdown::render("CODE/SM.Rmd", "pdf_document", output_file = "SM.pdf")'
Raw data access is NOT necessary for you to reproduce the supplementary analyses, and thus the numbers and figures in the manuscript. At present, the raw data is only accessible with additional ethics and security approval.
You will first need to install the ChildProject package for Python (optional) as well as DataLad. Instructions to install these packages can be found here.
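Before going further, it can help to confirm that the required tools are actually on your PATH. The following is a minimal sketch, not part of the repository: `missing_tools` is a hypothetical helper name, and the list of tools to check (git, git-annex, datalad) is an assumption based on DataLad depending on git and git-annex.

```shell
# Hypothetical helper (not part of this repository).
# Print the name of every tool given as an argument that is not on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Example call (assumed tool list; DataLad relies on git and git-annex):
# missing_tools git git-annex datalad
```

If the call prints nothing, all listed tools were found; otherwise each missing tool is printed on its own line.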
This step should only be done once:
Create an account on [GIN](https://gin.g-node.org/) if you don't have one already.
Copy your SSH public key to your clipboard (usually located in ~/.ssh/id_rsa.pub). If you don't have one, please create one following these instructions.
In your browser, go to GIN > Your parameters > SSH keys
Click on the blue "Add a key" button, then paste the content of your public key in the Content field, and submit
Your key should now appear in your list of SSH keys - you can add as many as necessary.
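The key lookup in the steps above can be sketched in the shell. This is only an illustration: `find_public_key` is a hypothetical helper, and the file names it looks for are the usual OpenSSH defaults, which may not match your setup.

```shell
# Hypothetical helper (not part of this repository).
# Print the path of the first public key found in the given directory
# (default: ~/.ssh). Returns non-zero if no key is found.
find_public_key() {
    dir="${1:-$HOME/.ssh}"
    # Usual OpenSSH default file names; yours may differ.
    for name in id_ed25519.pub id_ecdsa.pub id_rsa.pub; do
        if [ -f "$dir/$name" ]; then
            printf '%s\n' "$dir/$name"
            return 0
        fi
    done
    return 1
}

# Example: show the key so it can be pasted into the Content field,
# or create a new key pair if none exists.
# if key=$(find_public_key); then cat "$key"; else ssh-keygen -t ed25519; fi
```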
The next step is to clone the dataset:
datalad install -r git@gin.g-node.org:/LAAC-LSCP/RELIVAL.git
cd RELIVAL
You can get data from a dataset using the datalad get command, e.g.:
datalad get CODE/* # download scripts
datalad get DATA/* # download data
Or:
datalad get . # get everything
You can download many files in parallel using the -J or --jobs parameters:
datalad get . -J 4 # get everything, with 4 parallel transfers
For more help with using DataLad, please refer to our cheatsheet or DataLad's own cheatsheet. If this is not enough, check DataLad's documentation and Handbook.
If you are notified of changes to the data, please retrieve them by issuing the following commands:
datalad update --merge
datalad get .
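If you refresh the data regularly, the two commands above could be wrapped in a small function. This is a sketch under stated assumptions: `run`, `refresh_dataset` and the `DRY_RUN` variable are hypothetical names introduced here, and the function is meant to be called from the root of the cloned dataset.

```shell
# Hypothetical helpers (not part of this repository).
# Run a command, or only print it when DRY_RUN is set to a non-empty value.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Merge upstream changes, then fetch file content.
# The second command runs only if the first one succeeds.
refresh_dataset() {
    run datalad update --merge && run datalad get .
}

# Example: preview the commands without executing them.
# DRY_RUN=1 refresh_dataset
```

Chaining with `&&` means that file content is only fetched when the merge succeeded, so a failed update is not hidden by a later successful download.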
It is important that you delete the data once your project is complete.
This can be done with datalad remove:
datalad remove -r path/to/your/dataset
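Because removal is destructive, you may want to check that the path really points at a DataLad dataset before running the command. The sketch below assumes that a dataset root can be recognised by its `.datalad` directory; `is_datalad_dataset` is a hypothetical helper, and the path is the same placeholder as above.

```shell
# Hypothetical helper (not part of this repository).
# Succeed only if the path looks like the root of a DataLad dataset,
# i.e. it contains a .datalad directory.
is_datalad_dataset() {
    [ -d "$1/.datalad" ]
}

# Example (placeholder path, to be replaced with your own):
# is_datalad_dataset path/to/your/dataset && datalad remove -r path/to/your/dataset
```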