:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	He, Weilei, Ju, Feng, Fan, Zhiyuan, Min, Rui, Cheng, Minhao, Fung, Yi R.
Formato:	Preprint
Publicado:	2026
Materias:	Machine Learning
Acceso en línea:	https://arxiv.org/abs/2601.03198
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Scalable Token-Level Hallucination Detection in Large Language Models
por: Min, Rui, et al.
Publicado: (2026)

TextSquare: Scaling up Text-Centric Visual Instruction Tuning
por: Tang, Jingqun, et al.
Publicado: (2024)

CREAM: Consistency Regularized Self-Rewarding Language Models
por: Wang, Zhaoyang, et al.
Publicado: (2024)

Evaluating Large Language Models at Evaluating Instruction Following
por: Zeng, Zhiyuan, et al.
Publicado: (2023)

Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
por: Min, Rui, et al.
Publicado: (2024)

Improving Your Model Ranking on Chatbot Arena by Vote Rigging
por: Min, Rui, et al.
Publicado: (2025)

Hierarchical Multi-Label Generation with Probabilistic Level-Constraint
por: Chen, Linqing, et al.
Publicado: (2025)

MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs
por: Barrios, Wayner, et al.
Publicado: (2025)

Sparsity Forcing: Reinforcing Token Sparsity of MLLMs
por: Chen, Feng, et al.
Publicado: (2025)

M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
por: AI, Inclusion, et al.
Publicado: (2025)

Learning Interactive World Model for Object-Centric Reinforcement Learning
por: Feng, Fan, et al.
Publicado: (2025)

Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
por: Suo, Yucheng, et al.
Publicado: (2025)

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
por: Myers, Vivek, et al.
Publicado: (2023)

Environment Scaling for Interactive Agentic Experience Collection: A Survey
por: Huang, Yuchen, et al.
Publicado: (2025)

Community-Centric Graph Unlearning
por: Li, Yi, et al.
Publicado: (2024)

SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models
por: He, Zirui, et al.
Publicado: (2025)

CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
por: Yang, Tianyu, et al.
Publicado: (2024)

Agglomerative Federated Learning: Empowering Larger Model Training via End-Edge-Cloud Collaboration
por: Wu, Zhiyuan, et al.
Publicado: (2023)

Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
por: Lee, JoonHo, et al.
Publicado: (2024)

Input Snapshots Fusion for Scalable Discrete-Time Dynamic Graph Neural Networks
por: Qi, QingGuo, et al.
Publicado: (2024)

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
por: Choi, Yumin, et al.
Publicado: (2025)

Visual Latents Know More Than They Say: Unsilencing Latent Reasoning in MLLMs
por: Zhang, Xin, et al.
Publicado: (2026)

Task-Centric Policy Optimization from Misaligned Motion Priors
por: Zheng, Ziang, et al.
Publicado: (2026)

Growing Visual Generative Capacity for Pre-Trained MLLMs
por: Wang, Hanyu, et al.
Publicado: (2025)

Invisible Backdoor Attacks on Diffusion Models
por: Li, Sen, et al.
Publicado: (2024)

Off-Policy Selection for Initiating Human-Centric Experimental Design
por: Gao, Ge, et al.
Publicado: (2024)

One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts
por: Wang, Ruochen, et al.
Publicado: (2024)

Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge
por: Chen, Yan-Lun, et al.
Publicado: (2025)

VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs
por: Zheng, Naishan, et al.
Publicado: (2025)

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
por: Li, Zhiyuan, et al.
Publicado: (2024)

Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash Function
por: Yin, Shuo, et al.
Publicado: (2025)

Learning Variable-Length Tokenization for Generative Recommendation
por: Wang, Minhao, et al.
Publicado: (2026)

SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read
por: Peng, Yibo, et al.
Publicado: (2026)

Enhancing and Assessing Instruction-Following with Fine-Grained Instruction Variants
por: Yang, Jiuding, et al.
Publicado: (2024)

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
por: Liu, Ziyu, et al.
Publicado: (2024)

Achieving Constant Regret in Linear Markov Decision Processes
por: Zhang, Weitong, et al.
Publicado: (2024)

From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities
por: Jiang, Shixin, et al.
Publicado: (2024)

MLLMs-Augmented Visual-Language Representation Learning
por: Liu, Yanqing, et al.
Publicado: (2023)

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model
por: Zhou, Hengguang, et al.
Publicado: (2025)

Financial Instruction Following Evaluation (FIFE)
por: Matlin, Glenn, et al.
Publicado: (2025)