In this lecture, we will take analyze a single-cell RNA-seq data using scanpy. The lecture will introduce Anndata objects, plotting and interacting witn single-cell RNA-seq, QC and analysis of data and as time permits, batch correction.
We will use two PBMC datasets made available by 10X Genomics. Please download the following to the data/ directory:
git pull to obtain the materials for this class, it is likely because you have a conflict between Lecture19-scRNA-seq-analysis.ipynb) and the version in the public GitHub repo. You can resolve this by making a copy of that markdown (naming it something different, like my_Lecture19-scRNA-seq-analysis.ipynb)) and then discarding changes to the original markdown file.Download the following datasets and copy it a folder called data/
We will be using cellxgene, and scanpy for Lectures 18 and 19, and Homework 8. All the packages and dependencies can be installed using conda.
Using conda, the following commands can be used to install all the required dependencies. The environment is the same one you used for the bulk RNA-seq analysis.
# Activate conda environment
conda activate tfcb2021_rna
# Scanpy installation
conda install seaborn scikit-learn statsmodels numba pytables
conda install -c conda-forge python-igraph leidenalg
pip install scanpy
# cellxgene installation
pip install cellxgene
# harmonypy installation
pip install harmonypy
# umap version
pip install umap-learn==0.5.1
# jupyter/ipython installation
conda install -c conda-forge jupyterlab