| Main authors: | Zhu, Yubo; Liu, Dongrui; Lin, Zecheng; Tong, Wei; Zhong, Sheng; Shao, Jing |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online access: | https://arxiv.org/abs/2509.12886 |
Similar items
Loop as a Bridge: Can Looped Transformers Truly Link Representation Space and Natural Language Outputs?
by: Chen, Guanxu, et al.
Published: (2026)
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)
Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning
by: Qian, Chen, et al.
Published: (2025)
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
by: Yang, Ruihan, et al.
Published: (2024)
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
by: Shao, Shuai, et al.
Published: (2025)
ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging
by: Yang, Junyao, et al.
Published: (2026)
Subtoxic Questions: Dive Into Attitude Change of LLM's Response in Jailbreak Attempts
by: Zhang, Tianyu, et al.
Published: (2024)
REEF: Representation Encoding Fingerprints for Large Language Models
by: Zhang, Jie, et al.
Published: (2024)
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
by: Zhou, Tianyi, et al.
Published: (2026)
LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked
by: Phute, Mansi, et al.
Published: (2023)
Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering
by: Liu, Jiaxiang, et al.
Published: (2024)
NLP Methods May Actually Be Better Than Professors at Estimating Question Difficulty
by: Zotos, Leonidas, et al.
Published: (2025)
CaRT: Teaching LLM Agents to Know When They Know Enough
by: Liu, Grace, et al.
Published: (2025)
RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
by: Chen, Xin, et al.
Published: (2025)
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
by: Ma, Qianli, et al.
Published: (2025)
UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions
by: Rogoz, Ana-Cristina, et al.
Published: (2024)
RvB: Automating AI System Hardening via Iterative Red-Blue Games
by: Huang, Lige, et al.
Published: (2026)
Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models
by: Chen, Guanxu, et al.
Published: (2025)
CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering
by: Yang, Hao, et al.
Published: (2026)
Unleashing LLM Reasoning Capability via Scalable Question Synthesis from Scratch
by: Ding, Yuyang, et al.
Published: (2024)
Rhetorical Questions in LLM Representations: A Linear Probing Study
by: Yao, Louie Hong, et al.
Published: (2026)
Question Difficulty Ranking for Multiple-Choice Reading Comprehension
by: Raina, Vatsal, et al.
Published: (2024)
Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design
by: Liu, Zhihan, et al.
Published: (2024)
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
by: Orgad, Hadas, et al.
Published: (2024)
Asynchronous LLM Function Calling
by: Gim, In, et al.
Published: (2024)
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
by: Qian, Chen, et al.
Published: (2024)
Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering
by: Xu, Yao, et al.
Published: (2024)
Good Idea or Not, Representation of LLM Could Tell
by: Xu, Yi, et al.
Published: (2024)
REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
by: Zhong, Haitian, et al.
Published: (2025)
Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use
by: Pang, Renning, et al.
Published: (2026)
LLM Agents Already Know When to Call Tools -- Even Without Reasoning
by: Sun, Chung-En, et al.
Published: (2026)
SEAR: Schema-Based Evaluation and Routing for LLM Gateways
by: Zhang, Zecheng, et al.
Published: (2026)
Common 7B Language Models Already Possess Strong Math Capabilities
by: Li, Chen, et al.
Published: (2024)
SnapKV: LLM Knows What You are Looking for Before Generation
by: Li, Yuhong, et al.
Published: (2024)
Tackling the Inherent Difficulty of Noise Filtering in RAG
by: Liu, Jingyu, et al.
Published: (2026)
INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems
by: Zhou, Yijin, et al.
Published: (2026)
CuriousLLM: Elevating Multi-Document Question Answering with LLM-Enhanced Knowledge Graph Reasoning
by: Yang, Zukang, et al.
Published: (2024)
Efficient LLM-Jailbreaking via Multimodal-LLM Jailbreak
by: Ji, Haoxuan, et al.
Published: (2024)
Do Before You Judge: Self-Reference as a Pathway to Better LLM Evaluation
by: Lin, Wei-Hsiang, et al.
Published: (2025)
Improving Data Efficiency via Curating LLM-Driven Rating Systems
by: Pang, Jinlong, et al.
Published: (2024)