README.md 3.2 KB

Dataset Structure

We provide the imagenet scans in the .zip format. Unzipping them will create two folders data and meta.

  • data: Includes the variables that were recorded during the experiment. The experimental variables are saved as a collection of numpy arrays. Each numpy array contains the value of that variable for a specific image presentation (i.e. trial). Note that the name of the files does not contain any information about the order or time at which the trials took place in experimental time. They are randomly ordered.
    • images: This directory contains NumPy arrays where each single X.npy contains the image that was shown to the mouse in trial X.
    • responses: This directory contains NumPy arrays where each single X.npy contains the deconvolved calcium traces (i.e. responses) recorded from mouse V1 in trial X in response to the particular presented image.
    • behavior: Behavioral variables include pupil dilation, the derivative of the pupil size, and locomotion speed. The directory contains NumPy arrays (of size 1 x 3) where each single X.npy contains the behavioral variables (in the same order that was mentioned earlier) for trial X.
    • pupil_center: The eye position of the mouse, estimated as the center of the pupil. The directory contains NumPy arrays (of size 1 x 2) for horizontal and vertical eye positions.
  • meta: Includes meta data of the experiment

    • neurons: This directory contains neuron-specific information. Below are a list of important variables in this directory
      • area.npy: Contains the area of each neuron.
      • cell_motor_coordinates.npy: Contains the position (x, y, z) of each neuron in the recording field, given in microns.
      • layer.npy: Contains the cortical layer the neuron was recorded in.
      • unit_ids.npy: Contains a unique id for each neuron.
    • statistics: This directory contains statistics (i.e. mean, median, etc.) of the experimental variables (i.e. behavior, images, pupil_center, and responses).

    • trials: This directory contains trial-specific meta data. This includes single 1-d NumPy arrays for each trial variable.

      How to relate these meta data to the neuronal data (images, responses, ...)?

      The indices of these arrays correspond to the .npy files in data. For example:

      # get meta data array
      image_ids = np.load('./meta/trials/colorframeprojector_image_id.npy')
            
      # relate meta data with neuronal data
      trial_image_id = image_ids[0]
      corresponding_image = np.load('./data/images/0.npy')
      corresponding_neuronal_response = np.load('./data/responses/0.npy')
      

      Below are a list of important variables in this directory.

      • colorframeprojector_image_id.npy: Contains unique image id. If the image is presented multiple times (which is the case in the test set) this image ID will be present multiple times.
      • tiers.npy: Contains labels that are used to split the data into train, validation, and test set.
      • trial_idx.npy: Contains the index for each trial. It corresponds to the actual order of image presentations to the mouse.