Day 5. Unified Analytical Group Projects

On the final day of this bootcamp, we will group up with the other members of our table and embark on a joint effort to take what we have learned this past week and use it to explore a large-scale biological data set in a collaborative fashion. We will make use of the publicly available digital expression matrix from single-cell RNA-seq of ~2,700 periphral blood mononuclear cells (PBMCs) produced by 10x genomics. We will explore, pre-process, summarize, and visualize the dataset and perform basic statistical analysis to identify cell-type-specific genes.

Typically, a research project can take a very long time to generate the data and analyze the results. For the purposes of this bootcamp, we will be using a small subset of these data and will attempt to recreate the published results over these regions. Our goal is to give you a taste of what types of data exploration are now available to you with the simple yet powerful biocomputing tools you have learned and to serve as a foundation for your future research endeavors.


Schedule:

Session Time Topics
I 9:00-10:15 AM Introduction to scRNA-seq and Overview of Project
  10:15-10:30 AM Coffee Break
II 10:30-12:00 AM Analysis of scRNA-seq dataset with python
  12:00-1:00 PM Lunch
III 1:00-2:15 PM Analysis of scRNA-seq dataset with R
  2:15-2:30 PM Coffee Break
IV 2:30-4:00 PM Group Presentations and Discussion


Instructors:

Hyun Min Kang (HMK) Hui Jiang (HJ)


Class Questionnaire

Please help us improve this course by completing this questionnaire.


Data Sets:


Project Resources


Analysis notebooks

Python notebooks

RStudio notebooks