Saved in:
| Main Authors: | Kong, Zicheng, Ma, Dehua, Xu, Zhenbo, Yang, Alven, Ru, Yiwei, Wang, Haoran, Zhou, Zixuan, Bie, Fuqing, Xiang, Liuyu, Wu, Huijia, Zhao, Jian, He, Zhaofeng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00846 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OmniPlay: Benchmarking Omni-Modal Models on Omni-Modal Game Playing
by: Bie, Fuqing, et al.
Published: (2025)
by: Bie, Fuqing, et al.
Published: (2025)
Rethinking Class-Incremental Learning from a Dynamic Imbalanced Learning Perspective
by: Wang, Leyuan, et al.
Published: (2024)
by: Wang, Leyuan, et al.
Published: (2024)
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
by: Jin, Zhuoran, et al.
Published: (2025)
by: Jin, Zhuoran, et al.
Published: (2025)
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding
by: Guan, Yiran, et al.
Published: (2026)
by: Guan, Yiran, et al.
Published: (2026)
Dynamic Generation of Personalities with Large Language Models
by: Liu, Jianzhi, et al.
Published: (2024)
by: Liu, Jianzhi, et al.
Published: (2024)
Select-Then-Decompose: From Empirical Analysis to Adaptive Selection Strategy for Task Decomposition in Large Language Models
by: Liu, Shuodi, et al.
Published: (2025)
by: Liu, Shuodi, et al.
Published: (2025)
Unlocking the Address Book: Dissecting the Sparse Semantic Structure of LLM Key-Value Caches via Sparse Autoencoders
by: Ma, Qingsen, et al.
Published: (2025)
by: Ma, Qingsen, et al.
Published: (2025)
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering
by: Jia, Yiduo, et al.
Published: (2026)
by: Jia, Yiduo, et al.
Published: (2026)
OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
by: Yao, Jiali, et al.
Published: (2025)
by: Yao, Jiali, et al.
Published: (2025)
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
by: Peng, Haosong, et al.
Published: (2025)
by: Peng, Haosong, et al.
Published: (2025)
OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
by: Chen, Junzhe, et al.
Published: (2025)
by: Chen, Junzhe, et al.
Published: (2025)
ChronusOmni: Improving Time Awareness of Omni Large Language Models
by: Chen, Yijing, et al.
Published: (2025)
by: Chen, Yijing, et al.
Published: (2025)
AVID: A Benchmark for Omni-Modal Audio-Visual Inconsistency Understanding via Agent-Driven Construction
by: Chen, Zixuan, et al.
Published: (2026)
by: Chen, Zixuan, et al.
Published: (2026)
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
by: Zhao, Xiangyu, et al.
Published: (2025)
by: Zhao, Xiangyu, et al.
Published: (2025)
Visual Preference Optimization with Rubric Rewards
by: Yu, Ya-Qi, et al.
Published: (2026)
by: Yu, Ya-Qi, et al.
Published: (2026)
RRM: Robust Reward Model Training Mitigates Reward Hacking
by: Liu, Tianqi, et al.
Published: (2024)
by: Liu, Tianqi, et al.
Published: (2024)
OmniACBench: A Benchmark for Evaluating Context-Grounded Acoustic Control in Omni-Modal Models
by: Kim, Seunghee, et al.
Published: (2026)
by: Kim, Seunghee, et al.
Published: (2026)
Beyond Darkness: Thermal-Supervised 3D Gaussian Splatting for Low-Light Novel View Synthesis
by: Ma, Qingsen, et al.
Published: (2025)
by: Ma, Qingsen, et al.
Published: (2025)
OmniQuality-R: Advancing Reward Models Through All-Encompassing Quality Assessment
by: Lu, Yiting, et al.
Published: (2025)
by: Lu, Yiting, et al.
Published: (2025)
OmniDiagram: Advancing Unified Diagram Code Generation via Visual Interrogation Reward
by: Yang, Haoyue, et al.
Published: (2026)
by: Yang, Haoyue, et al.
Published: (2026)
AEQ-Bench: Measuring Empathy of Omni-Modal Large Models
by: Luo, Xuan, et al.
Published: (2026)
by: Luo, Xuan, et al.
Published: (2026)
OmniRe: Omni Urban Scene Reconstruction
by: Chen, Ziyu, et al.
Published: (2024)
by: Chen, Ziyu, et al.
Published: (2024)
Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models
by: Tao, Dehua, et al.
Published: (2026)
by: Tao, Dehua, et al.
Published: (2026)
Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction
by: He, Chaoqun, et al.
Published: (2026)
by: He, Chaoqun, et al.
Published: (2026)
OmniBench: Towards The Future of Universal Omni-Language Models
by: Li, Yizhi, et al.
Published: (2024)
by: Li, Yizhi, et al.
Published: (2024)
Simulation-Free PSRO: Removing Game Simulation from Policy Space Response Oracles
by: Liu, Yingzhuo, et al.
Published: (2025)
by: Liu, Yingzhuo, et al.
Published: (2025)
CLIP model is an Efficient Online Lifelong Learner
by: Wang, Leyuan, et al.
Published: (2024)
by: Wang, Leyuan, et al.
Published: (2024)
Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding
by: Zhang, Xiaojie, et al.
Published: (2025)
by: Zhang, Xiaojie, et al.
Published: (2025)
OmniGaze: Reward-inspired Generalizable Gaze Estimation In The Wild
by: Qu, Hongyu, et al.
Published: (2025)
by: Qu, Hongyu, et al.
Published: (2025)
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
by: Liu, Runtao, et al.
Published: (2024)
by: Liu, Runtao, et al.
Published: (2024)
Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
by: Hu, Yushi, et al.
Published: (2025)
by: Hu, Yushi, et al.
Published: (2025)
OmniVox: Zero-Shot Emotion Recognition with Omni-LLMs
by: Murzaku, John, et al.
Published: (2025)
by: Murzaku, John, et al.
Published: (2025)
RoboOmni: Proactive Robot Manipulation in Omni-modal Context
by: Wang, Siyin, et al.
Published: (2025)
by: Wang, Siyin, et al.
Published: (2025)
OmniGAIA: Towards Native Omni-Modal AI Agents
by: Li, Xiaoxi, et al.
Published: (2026)
by: Li, Xiaoxi, et al.
Published: (2026)
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
by: Zeng, Weixuan, et al.
Published: (2026)
by: Zeng, Weixuan, et al.
Published: (2026)
OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning
by: Zhu, Boyu, et al.
Published: (2025)
by: Zhu, Boyu, et al.
Published: (2025)
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
by: Xi, Dianbing, et al.
Published: (2025)
by: Xi, Dianbing, et al.
Published: (2025)
Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation
by: Wang, Dianyun, et al.
Published: (2025)
by: Wang, Dianyun, et al.
Published: (2025)
OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset
by: Hu, Wenbin, et al.
Published: (2026)
by: Hu, Wenbin, et al.
Published: (2026)
Auto-Rubric: Learning From Implicit Weights to Explicit Rubrics for Reward Modeling
by: Xie, Lipeng, et al.
Published: (2025)
by: Xie, Lipeng, et al.
Published: (2025)
Similar Items
-
OmniPlay: Benchmarking Omni-Modal Models on Omni-Modal Game Playing
by: Bie, Fuqing, et al.
Published: (2025) -
Rethinking Class-Incremental Learning from a Dynamic Imbalanced Learning Perspective
by: Wang, Leyuan, et al.
Published: (2024) -
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
by: Jin, Zhuoran, et al.
Published: (2025) -
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding
by: Guan, Yiran, et al.
Published: (2026) -
Dynamic Generation of Personalities with Large Language Models
by: Liu, Jianzhi, et al.
Published: (2024)