tfcb_2021

Lectures 18: Introduction to single-cell RNA-seq

In this lecture, we will take analyze a single-cell RNA-seq data using scanpy. The lecture will introduce Anndata objects, plotting and interacting witn single-cell RNA-seq, QC and analysis of data and as time permits, batch correction.

We will use two PBMC datasets made available by 10X Genomics. Please download the following to the data/ directory:

Learning Objectives

Class materials

Data Download

Download the following datasets and copy it a folder called data/

Environment setup

We will be using cellxgene, and scanpy for Lectures 18 and 19, and Homework 8. All the packages and dependencies can be installed using conda.

Using conda, the following commands can be used to install all the required dependencies. The environment is the same one you used for the bulk RNA-seq analysis.

# Activate conda environment
conda activate tfcb2021_rna


# Scanpy installation 
conda install seaborn scikit-learn statsmodels numba pytables
conda install -c conda-forge python-igraph leidenalg
pip install scanpy


# cellxgene installation 
pip install cellxgene

# harmonypy installation
pip install harmonypy

# umap version
pip install umap-learn==0.5.1

# jupyter/ipython installation 
conda install -c conda-forge jupyterlab