Martinez et al. 2019 (PRJNA515538)

General Details

Title Accurate annotation of human protein-coding small open reading frames
Organism
Number of Samples 16
Release Date 2019/01/16 00:00
Sequencing Types
Protocol Details

Study Links

Repository Details

SRA SRP179798
ENA SRP179798
GEO GSE125218
BioProject PRJNA515538

Publication

Title
Authors Martinez TF,Chu Q,Donaldson C,Tan D,Shokhirev MN,Saghatelian A
Journal Nature chemical biology
Publication Date 2020 Apr
Abstract Functional protein-coding small open reading frames (smORFs) are emerging as an important class of genes. However, the number of translated smORFs in the human genome is unclear because proteogenomic methods are not sensitive enough, and, as we show, Ribo-seq strategies require additional measures to ensure comprehensive and accurate smORF annotation. Here, we integrate de novo transcriptome assembly and Ribo-seq into an improved workflow that overcomes obstacles with previous methods, to more confidently annotate thousands of smORFs. Evolutionary conservation analyses suggest that hundreds of smORF-encoded microproteins are likely functional. Additionally, many smORFs are regulated during fundamental biological processes, such as cell stress. Peptides derived from smORFs are also detectable on human leukocyte antigen complexes, revealing smORFs as a source of antigens. Thus, by including additional validation into our smORF annotation workflow, we accurately identify thousands of unannotated translated smORFs that will provide a rich pool of unexplored, functional human genes.
PMC PMC7085969
PMID 31819274
DOI
Run Accession Study Accession Scientific Name Cell Line Library Type Treatment GWIPS-viz Trips-Viz Reads BAM BigWig (F) BigWig (R)
SRR8449577 PRJNA515538 Homo sapiens HeLa Ribo-Seq Cycloheximide
SRR8449566 PRJNA515538 Homo sapiens HEK293 Ribo-Seq Cycloheximide
SRR8449567 PRJNA515538 Homo sapiens HEK293 Ribo-Seq Cycloheximide
SRR8449568 PRJNA515538 Homo sapiens HEK293 Ribo-Seq Cycloheximide
SRR8449569 PRJNA515538 Homo sapiens HEK293 Ribo-Seq
SRR8449570 PRJNA515538 Homo sapiens HEK293 Ribo-Seq
SRR8449571 PRJNA515538 Homo sapiens HEK293 Ribo-Seq tunicamycin
SRR8449572 PRJNA515538 Homo sapiens HEK293 Ribo-Seq tunicamycin
SRR8449573 PRJNA515538 Homo sapiens HEK293 Ribo-Seq thapsigargin
SRR8449574 PRJNA515538 Homo sapiens HEK293 Ribo-Seq thapsigargin
SRR8449575 PRJNA515538 Homo sapiens HeLa Ribo-Seq Cycloheximide
SRR8449576 PRJNA515538 Homo sapiens HeLa Ribo-Seq Cycloheximide
SRR8449578 PRJNA515538 Homo sapiens HeLa Ribo-Seq Cycloheximide
SRR8449579 PRJNA515538 Homo sapiens K562 Ribo-Seq Cycloheximide
SRR8449580 PRJNA515538 Homo sapiens K562 Ribo-Seq Cycloheximide
SRR8449581 PRJNA515538 Homo sapiens K562 Ribo-Seq Cycloheximide
Run Accession Study Accession Scientific Name Cell Line Library Type Treatment GWIPS-viz Trips-Viz Reads BAM BigWig (F) BigWig (R)

ⓘ For more Information on the columns shown here see: About