:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Yang, Shu, Wu, Junchao, Wu, Xuansheng, Wong, Derek, Liu, Ninhao, Wang, Di
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computation and Language
Online-Zugang:	https://arxiv.org/abs/2506.19492
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Investigating CoT Monitorability in Large Reasoning Models
von: Yang, Shu, et al.
Veröffentlicht: (2025)

Understanding and Mitigating Political Stance Cross-topic Generalization in Large Language Models
von: Zhang, Jiayi, et al.
Veröffentlicht: (2025)

Understanding Aha Moments: from External Observations to Internal Mechanisms
von: Yang, Shu, et al.
Veröffentlicht: (2025)

Rethinking Prompt-based Debiasing in Large Language Models
von: Yang, Xinyi, et al.
Veröffentlicht: (2025)

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
von: Yang, Junxiao, et al.
Veröffentlicht: (2025)

Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
von: Wu, Junchao, et al.
Veröffentlicht: (2024)

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
von: Wu, Junchao, et al.
Veröffentlicht: (2023)

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements
von: Yang, Shu, et al.
Veröffentlicht: (2025)

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
von: Chen, Xin, et al.
Veröffentlicht: (2025)

Asking LLMs to Verify First is Almost Free Lunch
von: Wu, Shiguang, et al.
Veröffentlicht: (2025)

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
von: Wu, Junchao, et al.
Veröffentlicht: (2024)

Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation
von: Li, Tong, et al.
Veröffentlicht: (2025)

Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning Potential
von: Wu, Xuansheng, et al.
Veröffentlicht: (2025)

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models
von: Chen, Xin, et al.
Veröffentlicht: (2026)

CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
von: Li, Siyi, et al.
Veröffentlicht: (2026)

Self-Regularization with Sparse Autoencoders for Controllable LLM-based Classification
von: Wu, Xuansheng, et al.
Veröffentlicht: (2025)

Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders
von: Shu, Dong, et al.
Veröffentlicht: (2025)

Learnable Assessment Skills for LLM-based Automated Scoring: Rubric Construction via Iterative Optimization
von: Wang, Yun, et al.
Veröffentlicht: (2026)

Let LRMs Break Free from Overthinking via Self-Braking Tuning
von: Zhao, Haoran, et al.
Veröffentlicht: (2025)

Benchmarking the Detection of LLMs-Generated Modern Chinese Poetry
von: Wang, Shanshan, et al.
Veröffentlicht: (2025)

Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs
von: Zhang, Xuan, et al.
Veröffentlicht: (2025)

From Leaky Thoughts to Private Reasoning: Controlling What LRMs Say to Themselves
von: Puerto, Haritz, et al.
Veröffentlicht: (2026)

Unlocking Fine-Grained Translation Quality Estimation in LRMs through Synergistically Evolving Implicit and Explicit Reasoning
von: Dang, Renfei, et al.
Veröffentlicht: (2026)

Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty
von: Yu, Zewei, et al.
Veröffentlicht: (2026)

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
von: Wu, Xuansheng, et al.
Veröffentlicht: (2025)

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging
von: Wu, Han, et al.
Veröffentlicht: (2025)

Is It a Free Lunch for Removing Outliers during Pretraining?
von: Liao, Baohao, et al.
Veröffentlicht: (2024)

BRIEF-Pro: Universal Context Compression with Short-to-Long Synthesis for Fast and Accurate Multi-Hop Reasoning
von: Gu, Jia-Chen, et al.
Veröffentlicht: (2025)

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs
von: Li, Haoran, et al.
Veröffentlicht: (2026)

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation
von: Wang, Yun, et al.
Veröffentlicht: (2026)

Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge
von: Arora, Ansh, et al.
Veröffentlicht: (2024)

No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
von: Hu, Mengxuan, et al.
Veröffentlicht: (2024)

Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering
von: Zhao, Haiyan, et al.
Veröffentlicht: (2025)

Your thoughts tell who you are: Characterize the reasoning patterns of LRMs
von: Chen, Yida, et al.
Veröffentlicht: (2025)

Short Chains, Deep Thoughts: Balancing Reasoning Efficiency and Intra-Segment Capability via Split-Merge Optimization
von: Gui, Runquan, et al.
Veröffentlicht: (2026)

AutoSCORE: Enhancing Automated Scoring with Multi-Agent Large Language Models via Structured Component Recognition
von: Wang, Yun, et al.
Veröffentlicht: (2025)

A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models
von: Shu, Dong, et al.
Veröffentlicht: (2025)

DetectRL-X: Towards Reliable Multilingual and Real-World LLM-Generated Text Detection
von: Wu, Junchao, et al.
Veröffentlicht: (2026)

Applying Large Language Models and Chain-of-Thought for Automatic Scoring
von: Lee, Gyeong-Geon, et al.
Veröffentlicht: (2023)

OCR Error Post-Correction with LLMs in Historical Documents: No Free Lunches
von: Kanerva, Jenna, et al.
Veröffentlicht: (2025)