Library Catalog

Bibliographic Details
Main Author:	Sun, Simeng
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2509.25073

Similar Items

How much do contextualized representations encode long-range context?
by: Sun, Simeng, et al.
Published: (2024)

Suri: Multi-constraint Instruction Following for Long-form Text Generation
by: Pham, Chau Minh, et al.
Published: (2024)

SugarTextNet: A Transformer-Based Framework for Detecting Sugar Dating-Related Content on Social Media with Context-Aware Focal Loss
by: Wang, Lionel Z., et al.
Published: (2025)

TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023)

L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution
by: Sun, Simeng, et al.
Published: (2025)

The optimality of word lengths. Theoretical foundations and an empirical study
by: Petrini, Sonia, et al.
Published: (2022)

Comparing large language models and human programmers for generating programming code
by: Hou, Wenpin, et al.
Published: (2024)

HYBRIDMIND: Meta Selection of Natural Language and Symbolic Language for Enhanced LLM Reasoning
by: Han, Simeng, et al.
Published: (2024)

Combining psychoanalysis and computer science: an empirical study of the relationship between emotions and the Lacanian discourses
by: Gadalla, Minas, et al.
Published: (2024)

RULER: What's the Real Context Size of Your Long-Context Language Models?
by: Hsieh, Cheng-Ping, et al.
Published: (2024)

Learning to Reason via Mixture-of-Thought for Logical Reasoning
by: Zheng, Tong, et al.
Published: (2025)

Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics
by: Lee, Jinu, et al.
Published: (2025)

Multilingual Generative Retrieval via Cross-lingual Semantic Compression
by: Huang, Yuxin, et al.
Published: (2025)

Are most sentences unique? An empirical examination of Chomskyan claims
by: Ring, Hiram
Published: (2025)

Assessing Pause Thresholds for empirical Translation Process Research
by: Bandaru, Devi Sri, et al.
Published: (2026)

Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-Problems
by: Miner, Stephen, et al.
Published: (2024)

Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu
by: Liu, Chang, et al.
Published: (2025)

Optimizing Language Model's Reasoning Abilities with Weak Supervision
by: Tong, Yongqi, et al.
Published: (2024)

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
by: Puvvada, Krishna C., et al.
Published: (2025)

Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data
by: Benzoni, Claudio, et al.
Published: (2025)

The effect of source disclosure on evaluation of AI-generated messages: A two-part study
by: Lim, Sue, et al.
Published: (2023)

Transformer Layers as Painters
by: Sun, Qi, et al.
Published: (2024)

Probabilistic energy profiler for statically typed JVM-based programming languages
by: Nyholm, Joel, et al.
Published: (2025)

Fact-checking AI-generated news reports: Can LLMs catch their own lies?
by: Yao, Jiayi, et al.
Published: (2025)

ATEB: Evaluating and Improving Advanced NLP Tasks for Text Embedding Models
by: Han, Simeng, et al.
Published: (2025)

Statistical investigations into the geometry and homology of random programs
by: Sporring, Jon, et al.
Published: (2024)

Towards an empirical understanding of MoE design choices
by: Fan, Dongyang, et al.
Published: (2024)

Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation
by: Chirkova, Nadezhda, et al.
Published: (2023)

Controllable Text Generation with Residual Memory Transformer
by: Zhang, Hanqing, et al.
Published: (2023)

Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
by: Sun, Ke, et al.
Published: (2026)

Automata Extraction from Transformers
by: Zhang, Yihao, et al.
Published: (2024)

Aura: Universal Multi-dimensional Exogenous Integration for Aviation Time Series
by: Lin, Jiafeng, et al.
Published: (2026)

Tracers for debugging and program exploration
by: Chiplunkar, Shardul, et al.
Published: (2026)

Mixture of Hidden-Dimensions Transformer
by: Chen, Yilong, et al.
Published: (2024)

Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation
by: Hou, Jingrui, et al.
Published: (2023)

SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
by: Si, Xiaonan, et al.
Published: (2025)

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
by: Wang, Boshi, et al.
Published: (2024)

H$^{2}$MT: Semantic Hierarchy-Aware Hierarchical Memory Transformer
by: Haghifam, Maryam, et al.
Published: (2026)

Evaluation of GPT-based large language generative AI models as study aids for the national licensure examination for registered dietitians in Japan
by: Nagamori, Yuta, et al.
Published: (2025)

Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets
by: Cheekati, Shravan
Published: (2024)