:: Library Catalog

Bibliographic Details
Main Authors:	Li, Jiaming; Ye, Haoran; Chen, Yukun; Li, Xinyue; Zhang, Lei; Alinejad-Rokny, Hamid; Peng, Jimmy Chih-Hsien; Yang, Min
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Machine Learning
Online Access:	https://arxiv.org/abs/2506.07691

Similar Items

CollectiveSFT: Scaling Large Language Models for Chinese Medical Benchmark with Collective Instructions in Healthcare
by: Zhu, Jingwei, et al.
Published: (2024)

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
by: Wang, Qiyao, et al.
Published: (2026)

Small Language Model as Data Prospector for Large Language Model
by: Ni, Shiwen, et al.
Published: (2024)

RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning
by: Chen, Yukun, et al.
Published: (2026)

STORYTELLER: An Enhanced Plot-Planning Framework for Coherent and Cohesive Story Generation
by: Li, Jiaming, et al.
Published: (2025)

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination
by: Wang, Qiyao, et al.
Published: (2026)

Automatic Paper Reviewing with Heterogeneous Graph Reasoning over LLM-Simulated Reviewer-Author Debates
by: Li, Shuaimin, et al.
Published: (2025)

PersonaMath: Boosting Mathematical Reasoning via Persona-Driven Data Augmentation
by: Luo, Jing, et al.
Published: (2024)

Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation
by: Chen, Dingwei, et al.
Published: (2025)

FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration
by: Wang, Qiyao, et al.
Published: (2026)

EVADE-Bench: Multimodal Benchmark for Evaluating and Enhancing Evasive Content Detection
by: Xu, Ancheng, et al.
Published: (2025)

PLOT: Enhancing Preference Learning via Optimal Transport
by: Zhu, Liang, et al.
Published: (2026)

Lower Layers Matter: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
by: Chen, Dingwei, et al.
Published: (2024)

ToolRM: Towards Agentic Tool-Use Reward Modeling
by: Li, Renhao, et al.
Published: (2025)

OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
by: Luo, Run, et al.
Published: (2025)

CLaSp: In-Context Layer Skip for Self-Speculative Decoding
by: Chen, Longze, et al.
Published: (2025)

xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
by: Lee, Sunbowen, et al.
Published: (2025)

ETAGE: Enhanced Test Time Adaptation with Integrated Entropy and Gradient Norms for Robust Model Performance
by: Shamsi, Afshar, et al.
Published: (2024)

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
by: Li, Jiaming, et al.
Published: (2025)

Time-Aware Feature Selection: Adaptive Temporal Masking for Stable Sparse Autoencoder Training
by: Li, T. Ed, et al.
Published: (2025)

AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents
by: Chen, Guhong, et al.
Published: (2024)

Model Unlearning via Sparse Autoencoder Subspace Guided Projections
by: Wang, Xu, et al.
Published: (2025)

Act-Adaptive Margin: Dynamically Calibrating Reward Models for Subjective Ambiguity
by: Fang, Feiteng, et al.
Published: (2025)

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders
by: Jing, Yi, et al.
Published: (2026)

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning
by: Li, Xiaochuan, et al.
Published: (2024)

SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics
by: Zahedi, Roxana, et al.
Published: (2025)

CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
by: Li, Siyi, et al.
Published: (2026)

AlignSAE: Concept-Aligned Sparse Autoencoders
by: Yang, Minglai, et al.
Published: (2025)

Sparse Autoencoder Features for Classifications and Transferability
by: Gallifant, Jack, et al.
Published: (2025)

InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
by: Wei, Zhepei, et al.
Published: (2024)

AutoPatent: A Multi-Agent Framework for Automatic Patent Generation
by: Wang, Qiyao, et al.
Published: (2024)

Evaluating Adversarial Robustness of Concept Representations in Sparse Autoencoders
by: Li, Aaron J., et al.
Published: (2025)

Quantification of Large Language Model Distillation
by: Lee, Sunbowen, et al.
Published: (2025)

Model Directions, Not Words: Mechanistic Topic Models Using Sparse Autoencoders
by: Zheng, Carolina, et al.
Published: (2025)

SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders
by: Yu, Zhuohao, et al.
Published: (2025)

InstructPro: Natural Language Guided Ligand-Binding Protein Design
by: Song, Zhenqiao, et al.
Published: (2025)

EdgeInfinite-Instruct: Bridging SFT-Based Optimization and NPU-Level Efficiency for Edge Devices
by: Chen, Jiyu, et al.
Published: (2025)

Sparse Autoencoders Enable Scalable and Reliable Circuit Identification in Language Models
by: O'Neill, Charles, et al.
Published: (2024)

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability
by: Karvonen, Adam, et al.
Published: (2025)

Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
by: Yamashita, Tomoya, et al.
Published: (2025)