Homework #1: DataCamp Intro to R course (~2hrs) is due today!
In this class, we’ll finally get down to the business of using Kallisto for memory-efficient mapping of raw reads to a reference transcriptome. You’ll carry out this mapping in class, right on your laptop, while we discuss what’s happening ‘under the hood’ with Kallisto and how this compares to more traditional alignment methods. You’ll be introduced to using command line software and will learn about automation and reproducibility through shell scripts.
- Discuss the course dataset.
- Download and examine a reference transcriptome from Ensembl.
- Use Kallisto to construct an index from this reference file.
- Use Kallisto to map our raw reads to this index
- Talk a bit about how an index is built and facilitates read alignment
Part 1 - Step-by-step walkthrough of using FastQC, Kallisto and MultiQC on our course dataset.
Part 2 - A discussion of traditional and alignment-free (pseudoalignment) methods for quantifying gene expression
papers and blogs posts on Kallisto
2016 Nature Biotech paper from Lior Pachter’s lab describing Kallisto
2017 Nature Methods paper from Lior Pachter’s lab describing Sleuth
blog post on pseudoalignments - helps understand how Kallisto maps reads to transcripts
Did you notice that Kallisto is using ‘Expectation Maximization (EM)’ during the alignment? You can read more about what this is here
General info about ultra lightweight methods for transcript quantification
2014 Nature Biotech paper - describes Sailfish, which implimented the first lightweight method for quantifying transcript expression.
Not quite alignments - Rob Patro, the first author of the Sailfish paper, wrote a nice blog post comparing and contrasting alignment-free methods used by Sailfish, Salmon and Kallisto.
2011 Nature Biotechnology - Great primer to better understand what de Bruijn graph is.
Greg Grant’s recent paper comparing different aligners. This should be a helpful guide in choosing alignment software outside of what we used in class.
Harold Pimentel’s talk on alignment (20 min)
Lior Pachter’s talk at CSHL (45 min)