Literature Hub

文献与动态集

将经典方法论文与平台内的分析流程、软件算法和数据库教程相互关联，形成可追溯的证据链。

papers

linked

Nature Biotechnology/2024

Dictionary learning for integrative, multimodal and scalable single-cell analysis

This work presents dictionary learning strategies used in Seurat v5 for integrative and scalable analysis of multimodal single-cell data. The study connects computational representation learning with practical single-cell tasks such as cross-dataset integration, modality alignment and annotation transfer across complex biological atlases.

Yuhan Hao, Tim Stuart, Madeline H. Kowalskiet al.

Nature Communications/2019

CUT&Tag for efficient epigenomic profiling of small samples and single cells

This paper describes CUT&Tag, an enzyme-tethering approach for highly efficient and sensitive epigenomic profiling from small samples and single cells. The method uses protein-A-Tn5 transposase fusion targeted by specific antibodies to perform tagmentation directly on chromatin in permeabilized cells or nuclei, eliminating the need for traditional ChIP-seq library preparation steps.

Hatice S. Kaya-Okur, Steven J. Wu, Christine A. Codomoet al.

Nature Communications/2017

Massively parallel digital transcriptional profiling of single cells

This paper describes a high-throughput droplet-based single-cell RNA-seq system that enabled large-scale digital transcriptional profiling. It is a key experimental foundation for 10x Genomics workflows and motivates the preprocessing steps performed by tools such as Cell Ranger.

Grace X. Y. Zheng, Jessica M. Terry, Phillip Belgraderet al.

Science/2016

Visualization and analysis of gene expression in tissue sections by spatial transcriptomics

This landmark study introduced spatial transcriptomics, a method that enables visualization and quantitative analysis of the transcriptome with spatial resolution in individual tissue sections. By placing tissue sections on arrayed reverse transcription primers with unique positional barcodes, the method captures both global gene expression patterns and the spatial organization of the tissue microenvironment.

Patrik L. Stahl, Fredrik Salmen, Sanja Vickovicet al.

Bioinformatics/2014

featureCounts: an efficient general purpose program for assigning sequence reads to genomic features

This paper describes featureCounts, a highly efficient general-purpose read summarization program that counts mapped reads for genomic features such as genes, exons, promoters, and genomic bins. The method implements highly efficient chromosome hashing and feature blocking techniques to achieve read counting speeds substantially faster than existing methods while maintaining accuracy.

Yang Liao, Gordon K. Smyth, Wei Shi

Bioinformatics/2013

STAR: ultrafast universal RNA-seq aligner

This paper introduces STAR (Spliced Transcripts Alignment to a Reference), an ultrafast RNA-seq aligner that achieves high alignment accuracy by using uncompressed suffix arrays together with a novel sequential maximum mappable seed search strategy. STAR can align both short and long reads at speeds an order of magnitude faster than contemporary tools while maintaining splice junction discovery capabilities.

Alexander Dobin, Carrie A. Davis, Felix Schlesingeret al.

OMICS: A Journal of Integrative Biology/2012

clusterProfiler: an R package for comparing biological themes among gene clusters

This paper presents clusterProfiler, an R package that implements methods for analyzing and visualizing functional profiles of gene clusters. The package supports over-representation analysis and gene set enrichment analysis of Gene Ontology and KEGG pathways, providing statistical analysis and rich visualization for interpreting functional enrichment results from high-throughput experiments.

Guangchuang Yu, Li-Gen Wang, Yanyan Hanet al.

Genome Research/2012

ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia

This comprehensive paper provides guidelines and best practices for ChIP-seq experiments as developed by the ENCODE and modENCODE consortia. It addresses antibody validation, library complexity, sequencing depth, replication, and data analysis standards that have become foundational references for the chromatin biology community.

Stephen G. Landt, Georgi K. Marinov, Anshul Kundajeet al.

Bioinformatics/2009

Fast and accurate short read alignment with Burrows-Wheeler transform

This landmark paper introduced BWA, a Burrows-Wheeler transform based short-read aligner designed for efficient mapping against large reference genomes. It established the algorithmic foundation for many WGS and WES variant calling pipelines, including workflows that later adopted BWA-MEM for longer reads and paired-end sequencing data.

Heng Li, Richard Durbin

BMC Bioinformatics/2008

WGCNA: an R package for weighted correlation network analysis

This paper presents the WGCNA R package, a comprehensive collection of functions for performing weighted gene co-expression network analysis. WGCNA identifies gene modules based on pairwise correlations, relates modules to external sample traits, and calculates module membership measures. It has become an essential tool for systems biology and co-expression network studies.

Peter Langfelder, Steve Horvath