css.file | dirs.to.publish | |
---|---|---|
_fileherd/markdown7.css |
|
STAT 540 is a 3 credit course with a mandatory computing seminar.
Cross-listed as
- STAT 540
- BIOF 540
- GSAT 540
-
Jennifer Bryan, Course coordinator (email is [email protected]), Statistics and Michael Smith Labs
-
Gabriela Cohen-Freue, Statistics
-
Paul Pavlidis, CHiBi and Psychiatry
Vast majority of course content, including source for this website, can be found here:
https://github.com/jennybc/stat540_2014
photoRec
data
- It's in the course repository
- it's in the examples/photoRec directory
- It's in the course website.
- it's in the examples/photoRec directory
06 January 2014 - 07 April 2014
Time : Mon Wed 9:30 - 11am
Location : ESB 4192
Time: officially runs Wed 12pm - 1pm; unofficially students are welcome to come after class around 11am and begin a ~1 hour guided analysis with TA support; TA will remain in the lab until 1pm to help those who start as 12pm and for general office hours.
Location: ESB 1042 and 1046
Officially none BUT here in reality ...
-
Statistics: you should have already taken university level introductory statistics course.
-
Biology: No requirements, but you are expected to learn things like the difference between a DNA and RNA and a gene and a genome.
-
R: no experience required but be prepared to do a lot of self-guided learning. Go ahead and start now by installing R and the HIGHLY RECOMMENDED "integrated development environment" (IDE) RStudio! Students are expected to run R on their own computer or a computer they have plenty of access to and control over. The best set-up, if possible, is to bring your own laptop to the computing seminars.
-
Homework. Two assignments worth 25 points each. Homework #1 was due Thurs Feb 27. Homework #2 was due Fri March 28. Instructions for how to submit homework.
-
Group project. Groups formed and projects conceived during January/February. Primary deliverable is a poster, presented in last class meeting. Each student also produces a short report. 40 points. More more information, go here.
-
10 points for "other", e.g. participation in class, seminars, and the Google group, engagement with small computing exercises.
seminar 00 | R, RStudio Set Up & Basics, borrowed from STAT 545A. Students complete/attempt on their own in advance. Bring any difficulties to first seminar.
lecture 01 | Introduction to high dimensional biology and the course (PP) | Mon Jan 06 | slides as PDF
lecture 02 | Overview / review of probability and statistical inference, 1 of 2 (JB) | Wed Jan 08 | slides as PDF
seminar 01 | R basics and exploring a small gene expression dataset | Wed Jan 08
- R stuff 11am - 12pm (or later, if necessary)
- Molecular biology/genetics 101 (LL), 12pm - 1pm | slides as PDF
lecture 03 | Overview / review of probability and statistical inference, 2 of 2 (JB) | Mon Jan 13 | slides as PDF
lecture 04 | Exploratory analysis (PP) | Wed Jan 15 | slides as PDF
seminar 02 | Learn more R while reviewing probability (LL) | Wed Jan 15
lecture 05 | Data QC and preprocessing (JB for GC-F) | Mon Jan 20 | slides as PDF
lecture 06 | Statistical inference: two group comparisons, e.g. differential expression analysis (JB) | Wed Jan 22 | slides as PDF
seminar 03 | R graphics AND knitr
, R markdown, and git(hub) | Wed Jan 22
- Introduction to
knitr
, R markdown, and git(hub) 11:15am - 12pm SJ will run a guided, hands-on tutorial in one of the labs - R graphics (LL) content is ready to work through in the other lab, from 12 - 1pm, or on your own.
Fri Jan 24: Project groups should be formed.
lecture 07 | Statistical inference: more than two groups --> linear models (JB prep, GCF deliver) | Mon Jan 27 | slides as PDF
lecture 08 | Statistical inference: linear models with 2 categorical covariates, greatest hits of linear models inference (JB prep, GCF deliver) | Wed Jan 29 | slides as PDF
seminar 04 | Two group testing and data aggregation (SJ) | Wed Jan 29
Fri Jan 31: Initial project proposals due.
lecture 09 | Statistical inference: linear models including a quantitative covariate, fitting many linear models at once (JB) | Mon Feb 03 | slides as PDF
lecture 10 | Large scale inference: Empirical Bayes, limma (JB) | Wed Feb 05 | slides as PDF
seminar 05 | Fitting and interpretting linear models (low volume) (SJ) | Wed Feb 05
Fri Feb 07: Homework #1 assignment is posted. Due Thurs Feb 27.
Mon Feb 10 is Family Day; no class
lecture 11 | Large scale inference: multiple testing (JB) | Wed Feb 12 | slides as PDF
seminar 06 | Fitting and interpretting linear models (high volume), limma package | Wed Feb 12
Fri Feb 14: feedback to groups re: initial project proposals. Each group will be assigned an instructor or TA + instructor pair for extra support.
(Mon Feb 17 UBC closed for mid-term break)
(Wed Feb 19 UBC closed for mid-term break)
lecture 12 | Analysis of RNA-Seq data (PP), 1 of 2 | Mon Feb 24 | slides as PDF
lecture 13 | Analysis of RNA-Seq data (PP), 2 of 2 | Wed Feb 26 | slides as PDF
seminar 07 | RNA-Seq analysis (SJ) | Wed Feb 26
Thurs Feb 27: Homework #1 due.
lecture 14 | Analysis of epigenetic data, focus on methylation (Elodie Portales-Casamar) | Mon Mar 03 | slides as PDF
Wed Mar 05: final project proposals due.
lecture 15 | Principal component analysis (PP) | Wed Mar 05 | slides as PDF
seminar 08 | Methylation analysis (LL) | Wed Mar 05
Fri Mar 07: Homework #2 assigned
lecture 16 | Cluster analysis (GC-F) | Mon Mar 10 | slides as PDF
lecture 17 | Classification (GC-F) | Wed Mar 12 | slides as PDF
seminar 09 | Clustering and PCA (LL) | Wed Mar 12
lecture 18 | Model and variable selection: cross validation and regularization (GC-F) | Mon Mar 17 | slides as PDF
lecture 19 | Regularization cont'd, Proteomics and missingness (GC-F) | Wed Mar 19 | slides as PDF: variable selection, proteomics and missing values
seminar 10 | Supervised learning, cross validation, variable selection (SJ) | Wed Mar 19
lecture 20 | Analysis of gene function, 1 of 2: Gene set analysis (PP) | Mon Mar 24 | slides as PDF
lecture 21 | Analysis of gene function, 2 of 2 (PP) | Wed Mar 26 | slides as PDF
seminar 11 | TA office hours during seminar time ... group project work | Wed Mar 26
Fri Mar 28: Homework #2 due.
lecture 22 | Resampling and the bootstrap (JB)| Mon Mar 31 | slides as PDF
lecture 23 | Guest lecture by STAT540 alum Dr. Sohrab Shah | Wed Apr 02
seminar 12 | TA office hours during seminar time ... group project work | Wed Apr 02
lecture 24 | Poster session 9:30am - 12:00pm | Wed Apr 09 <-- NOTE THIS IS ON WEDNESDAY, instead of Monday. Location: Room 101, aka the multi-purpose room, on ground floor of the Michael Smith Building.
We will borrow some material from STAT 545A Exploratory Analysis, in addition to using content specific to STAT 540.
- seminar 00 | R, RStudio Set Up & Basics, borrowed from STAT 545A.
- seminar 01 Wed Jan 08
- Basics of R/RStudio, workspaces, and projects, borrowed from STAT 545A.
- Basic care and feeding of data in R, borrowed from STAT 545A.
- R objects (beyond data.frames) and indexing, borrowed from STAT 545A.
- Explore a small gene expression dataset
- Prep work for later use of Git, GitHub, Rpubs, etc.
- seminar 02 Wed Jan 15 Pick (at least) one tutorial to work through. The latter two options get into control flow, writing functions, and base R graphics, which might be overwhelming for novices.
- Gentlest, written by Jenny: Play with probability distributions and simulations
- More advanced, revision by Gloria: Introduction To Simulation
- More advanced, original 2013 version by Andy: Introduction To Simulation
- seminar 03 Wed Jan 22
- Introduction to
knitr
, R markdown, and git(hub) (SJ) | slides as PDF - The R graphics landscape, borrowed from STAT 545A.
- Exploration of
lattice
graphics (LL) - NEW! Exploration of
ggplot2
graphics (LL) - Optional Using colors in R, read only if you're curious.
- Optional Writing figures to file, you'll eventually need this but it's not wildly urgent.
- Introduction to
- seminar 04 Wed Jan 29
- Two group testing and data aggregation
- Data aggregation, borrowed from STAT 545A. Certain sections are linked to from the seminar.
- seminar 05 Wed Feb 05
- seminar 06 Wed Feb 12
- seminar 07 Wed Feb 26 RNA-Seq analysis
- optional material Getting read counts
- Using read counts for differential expression analysis
- seminar 08 Wed Mar 05 Methylation analysis
- seminar 09 Wed Mar 12
- seminar 10 Wed Mar 19