1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
# MDverse data analysis

## Setup your environment

Clone the repository:

```bash
git clone https://github.com/MDverse/mdda.git
```

Move to the new directory:

```bash
cd mdda
```

Install [miniconda](https://docs.conda.io/en/latest/miniconda.html).

Install [mamba](https://github.com/mamba-org/mamba):

```bash
conda install mamba -n base -c conda-forge
```

Create the `mdda` conda environment:
```
mamba env create -f binder/environment.yml
```

Load the `mdda` conda environment:
```
conda activate mdda
```

Note: you can also update the conda environment with:

```bash
mamba env update -f binder/environment.yml
```

To deactivate an active environment, use

```
conda deactivate
```

## Get data

Run this command to download required data files:

```bash
bash scripts/get_data.sh
```

Files are downloaded in the `data` directory:

```bash
$ tree data
data/
├── data_model_parquet.md
├── datasets.parquet
├── files.parquet
├── gromacs_gro_files.parquet
├── gromacs_mdp_files.parquet
└── gromacs_xtc_files.parquet
```

The data model is described in the codebook `data/data_model_parquet.md`

## Analyse data

Use this script to run all Jupyter notebooks in batch mode:

```bash
bash make_results.sh
```

The directory `results` contains Jupyter notebooks exported in html, figures as .png and .svg images and a couple of TSV files. All these files are also bundled in the `results.zip` file.