Sin descripción

Robin Patrick Schenk 195568f5e1 Merge branch '7_pipeline_run_page' of RSchenk/MriSequencingPipeline into master hace 6 meses
backend 64a1a6fa0a [7] Minor code optimisations hace 6 meses
documentation 440f046d93 [13] Added documentation flder hace 11 meses
frontend 605090d9ef [7] Implemented step download hace 6 meses
pipelines 5145312923 [7] Updated pipelines hace 6 meses
testing_resources a7783ca03f [7] Fixed experiment file input path hace 11 meses
.gitignore 6ee1a2d9bd [0] Sample application setup hace 2 años
LICENSE 71b98eea85 Initial commit hace 2 años
README.md 2b76e21289 [7] Embedded logo into readme hace 11 meses

README.md

Encapsulated Sequencing Pipeline Engine

Design philosophy

This encapsulated sequencing pipeline engine (ESPE) is intended as a simple, streamlined, well documented and tested solution for common sequencing data analysis setups. Since transformation and analysis of sequencing data is typically a time and resource consuming task, the main focus is not placed on scalability of a centralised processing pipeline for e.g. a core facility. Rather an engine is provided to allow easy setup of the pipelines of choice within a specific workgroup. The software itself is self-contained, so all processed data and associated information (database etc.) is stored locally with the software. While this severely limits the scalability it allows for an easy setup, documentation and improves general usability. For the same reasons execution of pipelines themselves is not paralellised. Instead each pipeline step uses paralellisation whenever possible. Additonally, each pipeline step can be configured as desired by the user and runs inside a container to prevent data and package conflicts, allow for running arbitrary pipeline steps without the need to install additional software and mitigate difficulties during setup.

Architecture

Frontend

Vue.js with the Quasar framework is used for easy implementation of the frontend.

Backend

An actix-Rust-backend is used as a statically typed, fast and reliable backend.

Database

SQLite is used as it provides a simple, self-contained and fast database. Diesel is used for automated database migration.

Containerisation

Docker was selected as tool for running and managing containers due to its ease of use and widespread adoption.

Usage

TODO: Create and add link to full user guide.

Enviroment variables

Also see backend/.env.

Variable Description
CONTEXT_FOLDER the context folder of the application where all relevant data is stored
DATABASE_URL the database location
LOG_LEVEL the minimum log level of the application (debug, info, warn, error)
PIPELINE_FOLDER the folder storing pipeline definitions
SERVER_ADDRESS the address of the server
SERVER_PORT the port of the server

Build (for developers)

Prerequisites:

# Install Rust.
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
# Install Node.
curl -fsSL https://deb.nodesource.com/setup_lts.x | sudo bash -
sudo apt update
sudo apt install nodejs
# Install the requirements for the openssl crate.
sudo apt install libssl-dev

Conventional release build:

cd backend
cargo build --release

Building for development:

cd backend
cargo build
cd ../frontend
npm install
npm run build

Run (for developers)

Prerequisites:

# Install SQLite and the Diesel dependency.
# The version requirement for SQLite is >= 3.35.
sudo apt install sqlite3 libsqlite3-dev
# Install Diesel.
cargo install diesel_cli --no-default-features --features sqlite
# Initialise the database.
cd backend
mkdir application # without the sub-folder Diesel will fail to create the database
diesel setup
# Install docker for containerisation of the pipeline.
sudo apt install docker.io

Running a release build:

cd backend
cargo run --release

Running for development:

cd backend
cargo run
cd ../frontend
npm run build:dev

Testing

Backend testing:

cd backend
cargo test

Frontend testing:

cd frontend
npm run test:unit