Saved in:
| Main Author: | Sun, Simeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.25073 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How much do contextualized representations encode long-range context?
by: Sun, Simeng, et al.
Published: (2024)
by: Sun, Simeng, et al.
Published: (2024)
Suri: Multi-constraint Instruction Following for Long-form Text Generation
by: Pham, Chau Minh, et al.
Published: (2024)
by: Pham, Chau Minh, et al.
Published: (2024)
SugarTextNet: A Transformer-Based Framework for Detecting Sugar Dating-Related Content on Social Media with Context-Aware Focal Loss
by: Wang, Lionel Z., et al.
Published: (2025)
by: Wang, Lionel Z., et al.
Published: (2025)
TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023)
by: Pham, Chau Minh, et al.
Published: (2023)
L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution
by: Sun, Simeng, et al.
Published: (2025)
by: Sun, Simeng, et al.
Published: (2025)
The optimality of word lengths. Theoretical foundations and an empirical study
by: Petrini, Sonia, et al.
Published: (2022)
by: Petrini, Sonia, et al.
Published: (2022)
Comparing large language models and human programmers for generating programming code
by: Hou, Wenpin, et al.
Published: (2024)
by: Hou, Wenpin, et al.
Published: (2024)
HYBRIDMIND: Meta Selection of Natural Language and Symbolic Language for Enhanced LLM Reasoning
by: Han, Simeng, et al.
Published: (2024)
by: Han, Simeng, et al.
Published: (2024)
Combining psychoanalysis and computer science: an empirical study of the relationship between emotions and the Lacanian discourses
by: Gadalla, Minas, et al.
Published: (2024)
by: Gadalla, Minas, et al.
Published: (2024)
RULER: What's the Real Context Size of Your Long-Context Language Models?
by: Hsieh, Cheng-Ping, et al.
Published: (2024)
by: Hsieh, Cheng-Ping, et al.
Published: (2024)
Learning to Reason via Mixture-of-Thought for Logical Reasoning
by: Zheng, Tong, et al.
Published: (2025)
by: Zheng, Tong, et al.
Published: (2025)
Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics
by: Lee, Jinu, et al.
Published: (2025)
by: Lee, Jinu, et al.
Published: (2025)
Multilingual Generative Retrieval via Cross-lingual Semantic Compression
by: Huang, Yuxin, et al.
Published: (2025)
by: Huang, Yuxin, et al.
Published: (2025)
Are most sentences unique? An empirical examination of Chomskyan claims
by: Ring, Hiram
Published: (2025)
by: Ring, Hiram
Published: (2025)
Assessing Pause Thresholds for empirical Translation Process Research
by: Bandaru, Devi Sri, et al.
Published: (2026)
by: Bandaru, Devi Sri, et al.
Published: (2026)
Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-Problems
by: Miner, Stephen, et al.
Published: (2024)
by: Miner, Stephen, et al.
Published: (2024)
Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu
by: Liu, Chang, et al.
Published: (2025)
by: Liu, Chang, et al.
Published: (2025)
Optimizing Language Model's Reasoning Abilities with Weak Supervision
by: Tong, Yongqi, et al.
Published: (2024)
by: Tong, Yongqi, et al.
Published: (2024)
SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
by: Puvvada, Krishna C., et al.
Published: (2025)
by: Puvvada, Krishna C., et al.
Published: (2025)
Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data
by: Benzoni, Claudio, et al.
Published: (2025)
by: Benzoni, Claudio, et al.
Published: (2025)
The effect of source disclosure on evaluation of AI-generated messages: A two-part study
by: Lim, Sue, et al.
Published: (2023)
by: Lim, Sue, et al.
Published: (2023)
Transformer Layers as Painters
by: Sun, Qi, et al.
Published: (2024)
by: Sun, Qi, et al.
Published: (2024)
Probabilistic energy profiler for statically typed JVM-based programming languages
by: Nyholm, Joel, et al.
Published: (2025)
by: Nyholm, Joel, et al.
Published: (2025)
Fact-checking AI-generated news reports: Can LLMs catch their own lies?
by: Yao, Jiayi, et al.
Published: (2025)
by: Yao, Jiayi, et al.
Published: (2025)
ATEB: Evaluating and Improving Advanced NLP Tasks for Text Embedding Models
by: Han, Simeng, et al.
Published: (2025)
by: Han, Simeng, et al.
Published: (2025)
Statistical investigations into the geometry and homology of random programs
by: Sporring, Jon, et al.
Published: (2024)
by: Sporring, Jon, et al.
Published: (2024)
Towards an empirical understanding of MoE design choices
by: Fan, Dongyang, et al.
Published: (2024)
by: Fan, Dongyang, et al.
Published: (2024)
Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation
by: Chirkova, Nadezhda, et al.
Published: (2023)
by: Chirkova, Nadezhda, et al.
Published: (2023)
Controllable Text Generation with Residual Memory Transformer
by: Zhang, Hanqing, et al.
Published: (2023)
by: Zhang, Hanqing, et al.
Published: (2023)
Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
by: Sun, Ke, et al.
Published: (2026)
by: Sun, Ke, et al.
Published: (2026)
Automata Extraction from Transformers
by: Zhang, Yihao, et al.
Published: (2024)
by: Zhang, Yihao, et al.
Published: (2024)
Aura: Universal Multi-dimensional Exogenous Integration for Aviation Time Series
by: Lin, Jiafeng, et al.
Published: (2026)
by: Lin, Jiafeng, et al.
Published: (2026)
Tracers for debugging and program exploration
by: Chiplunkar, Shardul, et al.
Published: (2026)
by: Chiplunkar, Shardul, et al.
Published: (2026)
Mixture of Hidden-Dimensions Transformer
by: Chen, Yilong, et al.
Published: (2024)
by: Chen, Yilong, et al.
Published: (2024)
Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation
by: Hou, Jingrui, et al.
Published: (2023)
by: Hou, Jingrui, et al.
Published: (2023)
SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
by: Si, Xiaonan, et al.
Published: (2025)
by: Si, Xiaonan, et al.
Published: (2025)
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
by: Wang, Boshi, et al.
Published: (2024)
by: Wang, Boshi, et al.
Published: (2024)
H$^{2}$MT: Semantic Hierarchy-Aware Hierarchical Memory Transformer
by: Haghifam, Maryam, et al.
Published: (2026)
by: Haghifam, Maryam, et al.
Published: (2026)
Evaluation of GPT-based large language generative AI models as study aids for the national licensure examination for registered dietitians in Japan
by: Nagamori, Yuta, et al.
Published: (2025)
by: Nagamori, Yuta, et al.
Published: (2025)
Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets
by: Cheekati, Shravan
Published: (2024)
by: Cheekati, Shravan
Published: (2024)
Similar Items
-
How much do contextualized representations encode long-range context?
by: Sun, Simeng, et al.
Published: (2024) -
Suri: Multi-constraint Instruction Following for Long-form Text Generation
by: Pham, Chau Minh, et al.
Published: (2024) -
SugarTextNet: A Transformer-Based Framework for Detecting Sugar Dating-Related Content on Social Media with Context-Aware Focal Loss
by: Wang, Lionel Z., et al.
Published: (2025) -
TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023) -
L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution
by: Sun, Simeng, et al.
Published: (2025)