Saved in:
| Main Authors: | Yang, Hexiong, Chen, Mingrui, Huang, Huaibo, Duan, Junxian, Cao, Jie, Zhou, Zhen, He, Ran |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.20836 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Extending Sequence Length is Not All You Need: Effective Integration of Multimodal Signals for Gene Expression Prediction
by: Yang, Zhao, et al.
Published: (2026)
by: Yang, Zhao, et al.
Published: (2026)
Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences
by: Ramesh, Krithik, et al.
Published: (2025)
by: Ramesh, Krithik, et al.
Published: (2025)
Contrastive Deep Learning for Variant Detection in Wastewater Genomic Sequencing
by: Chinda, Adele, et al.
Published: (2025)
by: Chinda, Adele, et al.
Published: (2025)
Enhancing Downstream Analysis in Genome Sequencing: Species Classification While Basecalling
by: Kodra, Riselda, et al.
Published: (2025)
by: Kodra, Riselda, et al.
Published: (2025)
Genome-Factory: A Library for Tuning, Deploying, and Interpreting Genomic Foundation Models
by: Wu, Weimin, et al.
Published: (2025)
by: Wu, Weimin, et al.
Published: (2025)
Absorb & Escape: Overcoming Single Model Limitations in Generating Genomic Sequences
by: Li, Zehui, et al.
Published: (2024)
by: Li, Zehui, et al.
Published: (2024)
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model
by: Duan, Qihao, et al.
Published: (2025)
by: Duan, Qihao, et al.
Published: (2025)
Cancer-inspired Genomics Mapper Model for the Generation of Synthetic DNA Sequences with Desired Genomics Signatures
by: Lazebnik, Teddy, et al.
Published: (2023)
by: Lazebnik, Teddy, et al.
Published: (2023)
How Private Are DNA Embeddings? Inverting Foundation Model Representations of Genomic Sequences
by: Ouaari, Sofiane, et al.
Published: (2026)
by: Ouaari, Sofiane, et al.
Published: (2026)
DNAMotifTokenizer: Towards Biologically Informed Tokenization of Genomic Sequences
by: Zhou, Xiaoxiao, et al.
Published: (2025)
by: Zhou, Xiaoxiao, et al.
Published: (2025)
SPACE: Your Genomic Profile Predictor is a Powerful DNA Foundation Model
by: Yang, Zhao, et al.
Published: (2025)
by: Yang, Zhao, et al.
Published: (2025)
A Misclassification Network-Based Method for Comparative Genomic Analysis
by: He, Wan, et al.
Published: (2024)
by: He, Wan, et al.
Published: (2024)
GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models
by: Liu, Zicheng, et al.
Published: (2024)
by: Liu, Zicheng, et al.
Published: (2024)
A Phylogenetic Approach to Genomic Language Modeling
by: Albors, Carlos, et al.
Published: (2025)
by: Albors, Carlos, et al.
Published: (2025)
Robust Machine Learning for Regulatory Sequence Modeling under Biological and Technical Distribution Shifts
by: Yang, Yiyao
Published: (2026)
by: Yang, Yiyao
Published: (2026)
GenomeQA: Benchmarking General Large Language Models for Genome Sequence Understanding
by: Long, Weicai, et al.
Published: (2026)
by: Long, Weicai, et al.
Published: (2026)
Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization
by: Zhang, Weiliang, et al.
Published: (2024)
by: Zhang, Weiliang, et al.
Published: (2024)
Whole-Genome Phenotype Prediction with Machine Learning: Open Problems in Bacterial Genomics
by: James, Tamsin, et al.
Published: (2025)
by: James, Tamsin, et al.
Published: (2025)
Machine Learning-Based Genomic Linguistic Analysis (Gene Sequence Feature Learning): A Case Study on Predicting Heavy Metal Response Genes in Rice
by: Yang, Ruiqi, et al.
Published: (2025)
by: Yang, Ruiqi, et al.
Published: (2025)
MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
by: Li, Siyuan, et al.
Published: (2025)
by: Li, Siyuan, et al.
Published: (2025)
Multimodal 3D Genome Pre-training
by: Yang, Minghao, et al.
Published: (2025)
by: Yang, Minghao, et al.
Published: (2025)
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling
by: Schiff, Yair, et al.
Published: (2024)
by: Schiff, Yair, et al.
Published: (2024)
Statistical Linear Models in Virus Genomic Alignment-free Classification: Application to Hepatitis C Viruses
by: Remita, Amine M., et al.
Published: (2019)
by: Remita, Amine M., et al.
Published: (2019)
Celler:A Genomic Language Model for Long-Tailed Single-Cell Annotation
by: Zhao, Huan, et al.
Published: (2025)
by: Zhao, Huan, et al.
Published: (2025)
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
by: Qiao, Lifeng, et al.
Published: (2024)
by: Qiao, Lifeng, et al.
Published: (2024)
PlantBiMoE: A Bidirectional Foundation Model with SparseMoE for Plant Genomes
by: Lin, Kepeng, et al.
Published: (2025)
by: Lin, Kepeng, et al.
Published: (2025)
DNA Sequence Classification with Compressors
by: Ozan, Şükrü
Published: (2024)
by: Ozan, Şükrü
Published: (2024)
Entropy, Disagreement, and the Limits of Foundation Models in Genomics
by: Rochkoulets, Maxime, et al.
Published: (2026)
by: Rochkoulets, Maxime, et al.
Published: (2026)
D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation
by: Yang, Zhao, et al.
Published: (2026)
by: Yang, Zhao, et al.
Published: (2026)
Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth
by: Chen, Mingrui, et al.
Published: (2026)
by: Chen, Mingrui, et al.
Published: (2026)
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
by: Li, Siyuan, et al.
Published: (2024)
by: Li, Siyuan, et al.
Published: (2024)
EvoLen: Evolution-Guided Tokenization for DNA Language Model
by: Huang, Nan, et al.
Published: (2026)
by: Huang, Nan, et al.
Published: (2026)
Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity
by: Li, Zhufeng, et al.
Published: (2024)
by: Li, Zhufeng, et al.
Published: (2024)
Quantifying Memorization and Privacy Risks in Genomic Language Models
by: Nemecek, Alexander, et al.
Published: (2026)
by: Nemecek, Alexander, et al.
Published: (2026)
A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models
by: Tsuruta, Hirofumi, et al.
Published: (2024)
by: Tsuruta, Hirofumi, et al.
Published: (2024)
scMamba: A Pre-Trained Model for Single-Nucleus RNA Sequencing Analysis in Neurodegenerative Disorders
by: Oh, Gyutaek, et al.
Published: (2025)
by: Oh, Gyutaek, et al.
Published: (2025)
A Chromosome-level Assembly and Functional Genomic Resources for the Model Annelid Capitella teleta.
by: Davies, Billie E, et al.
Published: (2026)
by: Davies, Billie E, et al.
Published: (2026)
Unlocking the Power of Multi-institutional Data: Integrating and Harmonizing Genomic Data Across Institutions
by: Chen, Yuan, et al.
Published: (2024)
by: Chen, Yuan, et al.
Published: (2024)
Explainable AI in Genomics: Transcription Factor Binding Site Prediction with Mixture of Experts
by: Tripathi, Aakash, et al.
Published: (2025)
by: Tripathi, Aakash, et al.
Published: (2025)
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
by: Zhan, Huixin, et al.
Published: (2024)
by: Zhan, Huixin, et al.
Published: (2024)
Similar Items
-
Extending Sequence Length is Not All You Need: Effective Integration of Multimodal Signals for Gene Expression Prediction
by: Yang, Zhao, et al.
Published: (2026) -
Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences
by: Ramesh, Krithik, et al.
Published: (2025) -
Contrastive Deep Learning for Variant Detection in Wastewater Genomic Sequencing
by: Chinda, Adele, et al.
Published: (2025) -
Enhancing Downstream Analysis in Genome Sequencing: Species Classification While Basecalling
by: Kodra, Riselda, et al.
Published: (2025) -
Genome-Factory: A Library for Tuning, Deploying, and Interpreting Genomic Foundation Models
by: Wu, Weimin, et al.
Published: (2025)