1. Course material

1.1. List of Tools

Data quality control and preprocessing
Taxonomic classification of shotgun sequencing data
Visualization of taxonomic classification
R packages for data analysis
Other tools:

1.2. Datasets to download

In the links below download the material needed for this course:

  • Linux cheatsheet - summary of the most used commands in Linux.

  • Fastq files 1 - paired-end sequencing dataset of dental calculus from chimpanzees.

  • Fastq files 2 - paired-end sequencing dataset of dental calculus from ancient humans (for further practice).

  • Kraken2 2Gb database - built from complete genomes of bacteria, archaea, fungi and protozoa in the NCBI RefSeq.

  • Scripts used during the course.

To practice some of the most used command lines in Linux we will go through a tutorial by Kristian Rother (Academis). Download and follow the instructions step-by-step to find the hidden word.