In this lecture, we will take analyze a single-cell RNA-seq data using scanpy. The lecture will introduce Anndata
objects, plotting and interacting witn single-cell RNA-seq, QC and analysis of data and as time permits, batch correction.
We will use two PBMC datasets made available by 10X Genomics. Please download the following to the data/
directory:
git pull
to obtain the materials for this class, it is likely because you have a conflict between Lecture19-scRNA-seq-analysis.ipynb)
and the version in the public GitHub repo. You can resolve this by making a copy of that markdown (naming it something different, like my_Lecture19-scRNA-seq-analysis.ipynb)
) and then discarding changes to the original markdown file.Download the following datasets and copy it a folder called data/
We will be using cellxgene
, and scanpy
for Lectures 18 and 19, and Homework 8. All the packages and dependencies can be installed using conda
.
Using conda
, the following commands can be used to install all the required dependencies. The environment is the same one you used for the bulk RNA-seq analysis.
# Activate conda environment
conda activate tfcb2021_rna
# Scanpy installation
conda install seaborn scikit-learn statsmodels numba pytables
conda install -c conda-forge python-igraph leidenalg
pip install scanpy
# cellxgene installation
pip install cellxgene
# harmonypy installation
pip install harmonypy
# umap version
pip install umap-learn==0.5.1
# jupyter/ipython installation
conda install -c conda-forge jupyterlab