Trascriptomics and AI approaches for genome analysis
BIO 37105 / 77105 • SPRING 2026
🧬 Course Overview
Explore the cutting-edge intersection of transcriptomics and artificial intelligence. Learn to predict cellular responses to perturbations, analyze single-cell RNA sequencing data, and build virtual cell models that simulate biological behavior without traditional wet-lab experiments.
Credits
3 Credits (Lecture)
Level
300 + 700
Format
Online - synchronous
Tuesdays 2:30-5:00pm
Class Size
Limited to 35 students
Prerequisites: BIO 203
📚 Course Syllabus
Weeks 1-2: Transcriptomics Fundamentals Central dogma review, gene expression and regulation, RNA biogenesis, alternative splicing, RNA-seq technologies (bulk and single-cell), library preparation methods, sequencing platforms, and transcriptome data structure
Weeks 3-4: Single-Cell Data Analysis Quality control, normalization, dimensionality reduction, clustering, and differential expression analysis
Weeks 5-6: Perturbation Approaches in Genomics Gene perturbation strategies, Perturb-seq methods, linking genotype to phenotype, high-throughput functional screens, and interpreting perturbation data
Weeks 7-9: AI for Biology Machine learning fundamentals, deep learning architectures, transformer models, protein language models (ESM2), biological embeddings, attention mechanisms, and introduction to the Arc Institute’s STATE model
Weeks 10-11: Virtual Cell Modeling Predicting perturbation responses, context generalization across cell types, causal vs. correlational modeling
Week 12: Large-Scale Datasets & Resources Arc Virtual Cell Atlas, CELL×GENE, scBaseCount, Tahoe-100M dataset exploration
Weeks 13-14: Model Evaluation & Benchmarking Perturbation discrimination, differential expression metrics, MAE, and Virtual Cell Challenge framework
Week 15: Final Exam [.description]##
🎯 Learning Outcomes
Students will be able to:
Analyze scRNA-seq Data Build Predictive Models Design Perturbation Experiments Interpret AI Predictions Use Large Scale Datasets Apply Virtual Cell Models
🔬 Hands-On Experience
Learn to work with real transcriptomic datasets including single-cell RNA-seq profiles. Master quality control pipelines, normalization techniques, and batch effect correction. Perform dimensionality reduction (PCA, UMAP, t-SNE), cell type identification, and trajectory analysis. Apply differential expression analysis across conditions and cell types. Explore virtual cell models using the Arc Institute’s STATE architecture and the Virtual Cell Challenge framework.
Instructor: Assoc. Prof. Konstantinos Krampis
Contact: kk104@hunter.cuny.edu