Saved in:
| Main Authors: | Foodeei, Darius, Fan, Simin, Jaggi, Martin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.17296 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DoGE: Domain Reweighting with Generalization Estimation
by: Fan, Simin, et al.
Published: (2023)
by: Fan, Simin, et al.
Published: (2023)
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
by: Fan, Dongyang, et al.
Published: (2025)
by: Fan, Dongyang, et al.
Published: (2025)
Towards an empirical understanding of MoE design choices
by: Fan, Dongyang, et al.
Published: (2024)
by: Fan, Dongyang, et al.
Published: (2024)
Toward Cross-Lingual Quality Classifiers for Multilingual Pretraining Data Selection
by: Turki, Yassine, et al.
Published: (2026)
by: Turki, Yassine, et al.
Published: (2026)
Periodic RoPE for Infinite Context LLMs
by: Huo, Simin
Published: (2026)
by: Huo, Simin
Published: (2026)
A cross-species neural foundation model for end-to-end speech decoding
by: Zhang, Yizi, et al.
Published: (2025)
by: Zhang, Yizi, et al.
Published: (2025)
Aligning Multilingual Reasoning with Verifiable Semantics from a High-Resource Expert Model
by: Faisal, Fahim, et al.
Published: (2025)
by: Faisal, Fahim, et al.
Published: (2025)
Recent advancements in LLM Red-Teaming: Techniques, Defenses, and Ethical Considerations
by: Raheja, Tarun, et al.
Published: (2024)
by: Raheja, Tarun, et al.
Published: (2024)
Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding
by: Olak, Michal, et al.
Published: (2026)
by: Olak, Michal, et al.
Published: (2026)
NeuralGrok: Accelerate Grokking by Neural Gradient Transformation
by: Zhou, Xinyu, et al.
Published: (2025)
by: Zhou, Xinyu, et al.
Published: (2025)
Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs
by: Bossy, Thierry, et al.
Published: (2025)
by: Bossy, Thierry, et al.
Published: (2025)
LLM-Agnostic Semantic Representation Attack
by: Lian, Jiawei, et al.
Published: (2026)
by: Lian, Jiawei, et al.
Published: (2026)
Benchmark of stylistic variation in LLM-generated texts
by: Milička, Jiří, et al.
Published: (2025)
by: Milička, Jiří, et al.
Published: (2025)
Gender Bias in LLM-generated Interview Responses
by: Kong, Haein, et al.
Published: (2024)
by: Kong, Haein, et al.
Published: (2024)
Learning to Summarize from LLM-generated Feedback
by: Song, Hwanjun, et al.
Published: (2024)
by: Song, Hwanjun, et al.
Published: (2024)
Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models
by: Chattopadhyay, Souradeep, et al.
Published: (2026)
by: Chattopadhyay, Souradeep, et al.
Published: (2026)
Digital Twin Ecosystem for Oncology Clinical Operations
by: Pandey, Himanshu, et al.
Published: (2024)
by: Pandey, Himanshu, et al.
Published: (2024)
LLM Self-Explanations Fail Semantic Invariance
by: Szeider, Stefan
Published: (2026)
by: Szeider, Stefan
Published: (2026)
An overview of model uncertainty and variability in LLM-based sentiment analysis. Challenges, mitigation strategies and the role of explainability
by: Herrera-Poyatos, David, et al.
Published: (2025)
by: Herrera-Poyatos, David, et al.
Published: (2025)
On the Semantics of Large Language Models
by: Schuele, Martin
Published: (2025)
by: Schuele, Martin
Published: (2025)
Where meaning lives: Layer-wise accessibility of psycholinguistic features in encoder and decoder language models
by: Tikhomirova, Taisiia, et al.
Published: (2026)
by: Tikhomirova, Taisiia, et al.
Published: (2026)
SemBench: A Universal Semantic Framework for LLM Evaluation
by: Zubillaga, Mikel, et al.
Published: (2026)
by: Zubillaga, Mikel, et al.
Published: (2026)
A decoder-only foundation model for time-series forecasting
by: Das, Abhimanyu, et al.
Published: (2023)
by: Das, Abhimanyu, et al.
Published: (2023)
The Scaling Laws of Skills in LLM Agent Systems
by: Chen, Charles, et al.
Published: (2026)
by: Chen, Charles, et al.
Published: (2026)
Multi-Agent LLM Judge: automatic personalized LLM judge design for evaluating natural language generation applications
by: Cao, Hongliu, et al.
Published: (2025)
by: Cao, Hongliu, et al.
Published: (2025)
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
by: Chen, Simin, et al.
Published: (2025)
by: Chen, Simin, et al.
Published: (2025)
When Safety Blocks Sense: Measuring Semantic Confusion in LLM Refusals
by: Anonto, Riad Ahmed, et al.
Published: (2025)
by: Anonto, Riad Ahmed, et al.
Published: (2025)
CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections
by: Kim, Keuntae, et al.
Published: (2025)
by: Kim, Keuntae, et al.
Published: (2025)
LLM-as-a-qualitative-judge: automating error analysis in natural language generation
by: Chirkova, Nadezhda, et al.
Published: (2025)
by: Chirkova, Nadezhda, et al.
Published: (2025)
Beyond speculation: Measuring the growing presence of LLM-generated texts in multilingual disinformation
by: Macko, Dominik, et al.
Published: (2025)
by: Macko, Dominik, et al.
Published: (2025)
Not too long do read: Evaluating LLM-generated extreme scientific summaries
by: Lyu, Zhuoqi, et al.
Published: (2025)
by: Lyu, Zhuoqi, et al.
Published: (2025)
Small Language Model Helps Resolve Semantic Ambiguity of LLM Prompt
by: Huang, Zhenzhen, et al.
Published: (2026)
by: Huang, Zhenzhen, et al.
Published: (2026)
UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI
by: Shumailov, Ilia, et al.
Published: (2024)
by: Shumailov, Ilia, et al.
Published: (2024)
Unused information in token probability distribution of generative LLM: improving LLM reading comprehension through calculation of expected values
by: Zawistowski, Krystian
Published: (2024)
by: Zawistowski, Krystian
Published: (2024)
LLM-KG-Bench 3.0: A Compass for SemanticTechnology Capabilities in the Ocean of LLMs
by: Meyer, Lars-Peter, et al.
Published: (2025)
by: Meyer, Lars-Peter, et al.
Published: (2025)
Can LLMs Generate Visualizations with Dataless Prompts?
by: Coelho, Darius, et al.
Published: (2024)
by: Coelho, Darius, et al.
Published: (2024)
ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)
by: Huang, Fan
Published: (2026)
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety
by: Yang, Junxiao, et al.
Published: (2026)
by: Yang, Junxiao, et al.
Published: (2026)
Cross-cultural Inspiration Detection and Analysis in Real and LLM-generated Social Media Data
by: Ignat, Oana, et al.
Published: (2024)
by: Ignat, Oana, et al.
Published: (2024)
The "LLM World of Words" English free association norms generated by large language models
by: Abramski, Katherine, et al.
Published: (2024)
by: Abramski, Katherine, et al.
Published: (2024)
Similar Items
-
DoGE: Domain Reweighting with Generalization Estimation
by: Fan, Simin, et al.
Published: (2023) -
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
by: Fan, Dongyang, et al.
Published: (2025) -
Towards an empirical understanding of MoE design choices
by: Fan, Dongyang, et al.
Published: (2024) -
Toward Cross-Lingual Quality Classifiers for Multilingual Pretraining Data Selection
by: Turki, Yassine, et al.
Published: (2026) -
Periodic RoPE for Infinite Context LLMs
by: Huo, Simin
Published: (2026)