Motor evoked potentials for multiple sclerosis: A multiyear follow-up dataset.

Jan Yperman 227d024b6b Add information for publishing with DataCite		пре 3 година
LICENSE	fc835d36a9 Initial commit	пре 3 година
README.md	2226146156 Added progress bar for timeseries linking; Added introduction to README	пре 3 година
apen_cache.p	813f4c2f30 Added cache for the ApEn computation	пре 3 година
create_df_from_portable_dataset.py	2226146156 Added progress bar for timeseries linking; Added introduction to README	пре 3 година
datacite.yml	227d024b6b Add information for publishing with DataCite	пре 3 година
mep_dataset.zip	8de7561f3a Added the MEP dataset	пре 3 година
sample_use_case.ipynb	2226146156 Added progress bar for timeseries linking; Added introduction to README	пре 3 година

Motor evoked potentials for multiple sclerosis: A multiyear follow-up dataset.

Introduction

Multiple sclerosis (MS) is a chronic disease affecting millions of people worldwide. The signal conduction through the central nervous system of MS patients deteriorates. Evoked potential measurements allow clinicians to monitor the degree of deterioration and are used for decision support. We share a dataset that contains motor evoked potential (MEP) measurements, in which the brain is stimulated and the resulting signal is measured in the hands and feet. This results in time series of 100 milliseconds long. Typically, both hands and feet are measured in one hospital visit. The dataset consists of 5586 visits of 963 patients, performed in day-to-day clinical care over a period of 6 years. The dataset consists of approximately 100,000 MEP. Clinical metadata such as the expanded disability status scale, sex, and age is also available. This dataset can be used to explore the role of evoked potentials in MS research and patient care. It may also be used as a real-world benchmark for machine learning techniques for time series analysis and predictive modelling.

Usage

Downloading the dataset

There are a few ways to download the dataset (mep_dataset.zip). Since it is a fairly small filesize (~300MB), it can just be downloaded through the web interface. Or from the commandline:

wget https://gin.g-node.org/JanYperman/motor_evoked_potentials/raw/master/mep_dataset.zip

Alternatively, you may clone the repository to your local machine, which will also include the dataset:

git clone https://gin.g-node.org/JanYperman/motor_evoked_potentials.git

For more ways of accessing the data, please refer to GIN's FAQ.

Structure

The dataset itself is stored in mep_dataset.zip. The general structures is as follows:

patient.csv: Contains the records for the various patients.
visit.csv: Contains the records for the various visits.
test.csv: Contains the records for the various tests.
measurement.csv: Contains the records for the various measurements.
edss.csv: Contains the records for the various edss measurements.

Besides these files the dataset also contains textfiles for each of the actual time series. The filenames of these files contain a unique identifier which can be used to link back to the column "timeseries" in the measurement.csv file. Some code to automate this linking (in Python) is included in create_df_from_portable_dataset.py.

More details about specifics fields can be found in the dataset descriptor.

Getting started

It is highly recommended to have a look at the included jupyter notebook to familiarize oneself with the dataset. It includes a sample use case and goes over how to work with the dataset.

To run the jupyter notebook a few Python packages are required:

Pandas
Numpy
Matplotlib
Scipy
Scikit-learn
Tqdm
Jupyter

For example in anaconda this could be achieved using:

conda create --name mep python=3 pandas numpy matplotlib scipy scikit-learn tqdm jupyter

which creates an environment called "mep" that contains the required packages.

datacite.yml
Title	Example Title
Authors	FamilyName1,GivenName1;Affiliation1;ORCID:0000-0001-2345-6789 FamilyName2,GivenName2;Affiliation2;ResearcherID:X-1234-5678 FamilyName3,GivenName3
Description	Example description that can contain linebreaks but has to maintain indentation.
License	Creative Commons CC0 1.0 Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
References	Citation1 [doi:10.xxx/zzzz] (IsSupplementTo) Citation2 [arxiv:mmmm.nnnn] (IsSupplementTo) Citation3 [pmid:nnnnnnnn] (IsReferencedBy)
Funding	DFG, AB1234/5-6 EU, EU.12345
Keywords	Neuroscience Keyword2 Keyword3
Resource Type	Dataset

README.md

Motor evoked potentials for multiple sclerosis: A multiyear follow-up dataset.

Introduction

Usage

Downloading the dataset

Structure

Getting started