Skip to main navigation Skip to search Skip to main content

Low complexity (A/C)GG repeats and m1A methylation sites in 5' UTRs regulate gene

Zachery W Dickson, Megan Bilodeau, Daniel L Ruiz, Geoffrey Brian Golding

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Repetitive and compositionally biased low-complexity (LC) motifs appear in biological sequences where they interact with the machinery controlling the abundance of their host molecules. They can have significant impacts on physiological function, and act as raw material for evolution of regulatory motifs. The extent to which LC motifs affect abundance is not known. Even definitions of LC sequences are not well established, let alone which motifs exists in LC sequences, and which of those are abundance associated. To fill these knowledge gaps for post-transcriptional impacts of LC motifs, we integrated data from the GTEx project, PaxDb, and the IGSR. We establish definitions for LC motifs in both RNA and protein sequences. We observed that the presence of LC motifs in the 5′ UTR were positively associated with transcript abundance. We present a method to de novo identify abundance associated motifs and identified trinucleotide repeats of (A/C)GG as most strongly abundance associated. We observed that m1A methylation sites were strongly associated with both LC motifs and abundance, an effect which is amplified as methylation signatures from unspecialized RNA-seq increased. Together, our results demonstrate that LC motifs play important roles in regulating gene expression.
Original languageEnglish
Article numberPMID 8704544
JournalGenome
Volume69
Pages (from-to)1-12
Number of pages12
ISSN0831-2796
DOIs
Publication statusPublished - 9 Jan 2026
MoE publication typeA1 Journal article-refereed

Bibliographical note

PMID: 41512817

Fields of Science

  • Low-complexity motifs
  • Post-transcriptional regulation
  • Transcript abundance
  • Trinucleotide repeats
  • m1A methylation sites

Cite this