| Main authors: | He, Weilei; Ju, Feng; Fan, Zhiyuan; Min, Rui; Cheng, Minhao; Fung, Yi R. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online access: | https://arxiv.org/abs/2601.03198 |
Similar items
Scalable Token-Level Hallucination Detection in Large Language Models
by: Min, Rui, et al.
Published: (2026)
TextSquare: Scaling up Text-Centric Visual Instruction Tuning
by: Tang, Jingqun, et al.
Published: (2024)
CREAM: Consistency Regularized Self-Rewarding Language Models
by: Wang, Zhaoyang, et al.
Published: (2024)
Evaluating Large Language Models at Evaluating Instruction Following
by: Zeng, Zhiyuan, et al.
Published: (2023)
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
by: Min, Rui, et al.
Published: (2024)
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
by: Min, Rui, et al.
Published: (2025)
Hierarchical Multi-Label Generation with Probabilistic Level-Constraint
by: Chen, Linqing, et al.
Published: (2025)
MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs
by: Barrios, Wayner, et al.
Published: (2025)
Sparsity Forcing: Reinforcing Token Sparsity of MLLMs
by: Chen, Feng, et al.
Published: (2025)
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
by: AI, Inclusion, et al.
Published: (2025)
Learning Interactive World Model for Object-Centric Reinforcement Learning
by: Feng, Fan, et al.
Published: (2025)
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
by: Suo, Yucheng, et al.
Published: (2025)
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
by: Myers, Vivek, et al.
Published: (2023)
Environment Scaling for Interactive Agentic Experience Collection: A Survey
by: Huang, Yuchen, et al.
Published: (2025)
Community-Centric Graph Unlearning
by: Li, Yi, et al.
Published: (2024)
SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models
by: He, Zirui, et al.
Published: (2025)
CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
by: Yang, Tianyu, et al.
Published: (2024)
Agglomerative Federated Learning: Empowering Larger Model Training via End-Edge-Cloud Collaboration
by: Wu, Zhiyuan, et al.
Published: (2023)
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
by: Lee, JoonHo, et al.
Published: (2024)
Input Snapshots Fusion for Scalable Discrete-Time Dynamic Graph Neural Networks
by: Qi, QingGuo, et al.
Published: (2024)
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
by: Choi, Yumin, et al.
Published: (2025)
Visual Latents Know More Than They Say: Unsilencing Latent Reasoning in MLLMs
by: Zhang, Xin, et al.
Published: (2026)
Task-Centric Policy Optimization from Misaligned Motion Priors
by: Zheng, Ziang, et al.
Published: (2026)
Growing Visual Generative Capacity for Pre-Trained MLLMs
by: Wang, Hanyu, et al.
Published: (2025)
Invisible Backdoor Attacks on Diffusion Models
by: Li, Sen, et al.
Published: (2024)
Off-Policy Selection for Initiating Human-Centric Experimental Design
by: Gao, Ge, et al.
Published: (2024)
One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts
by: Wang, Ruochen, et al.
Published: (2024)
Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge
by: Chen, Yan-Lun, et al.
Published: (2025)
VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs
by: Zheng, Naishan, et al.
Published: (2025)
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
by: Li, Zhiyuan, et al.
Published: (2024)
Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash Function
by: Yin, Shuo, et al.
Published: (2025)
Learning Variable-Length Tokenization for Generative Recommendation
by: Wang, Minhao, et al.
Published: (2026)
SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read
by: Peng, Yibo, et al.
Published: (2026)
Enhancing and Assessing Instruction-Following with Fine-Grained Instruction Variants
by: Yang, Jiuding, et al.
Published: (2024)
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
by: Liu, Ziyu, et al.
Published: (2024)
Achieving Constant Regret in Linear Markov Decision Processes
by: Zhang, Weitong, et al.
Published: (2024)
From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities
by: Jiang, Shixin, et al.
Published: (2024)
MLLMs-Augmented Visual-Language Representation Learning
by: Liu, Yanqing, et al.
Published: (2023)
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model
by: Zhou, Hengguang, et al.
Published: (2025)
Financial Instruction Following Evaluation (FIFE)
by: Matlin, Glenn, et al.
Published: (2025)