Saved in:
| Main Authors: | Luo, Yinyi, Wang, Wenwen, Bai, Hayes, Zhu, Hongyu, Chen, Hao, He, Pan, Savvides, Marios, Li, Sharon, Wang, Jindong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.10784 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LatentUMM: Dual Latent Alignment for Unified Multimodal Models
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning
by: Bai, Hayes, et al.
Published: (2026)
by: Bai, Hayes, et al.
Published: (2026)
Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
by: Luo, Yinyi, et al.
Published: (2025)
by: Luo, Yinyi, et al.
Published: (2025)
FedUMM: A General Framework for Federated Learning with Unified Multimodal Models
by: Su, Zhaolong, et al.
Published: (2026)
by: Su, Zhaolong, et al.
Published: (2026)
Image Tokenizer Needs Post-Training
by: Qiu, Kai, et al.
Published: (2025)
by: Qiu, Kai, et al.
Published: (2025)
MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
by: Su, Zhaolong, et al.
Published: (2025)
by: Su, Zhaolong, et al.
Published: (2025)
An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
by: Chen, Hao, et al.
Published: (2022)
by: Chen, Hao, et al.
Published: (2022)
A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation
by: He, Haonan, et al.
Published: (2026)
by: He, Haonan, et al.
Published: (2026)
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
by: Qiu, Kai, et al.
Published: (2025)
by: Qiu, Kai, et al.
Published: (2025)
Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets
by: Chen, Hao, et al.
Published: (2022)
by: Chen, Hao, et al.
Published: (2022)
SciPost Physics Codebases
Published: (2026)
Published: (2026)
Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
by: Yao, Changwei, et al.
Published: (2025)
by: Yao, Changwei, et al.
Published: (2025)
ChatUMM: Robust Context Tracking for Conversational Interleaved Generation
by: Dai, Wenxun, et al.
Published: (2026)
by: Dai, Wenxun, et al.
Published: (2026)
PromptBench: A Unified Library for Evaluation of Large Language Models
by: Zhu, Kaijie, et al.
Published: (2023)
by: Zhu, Kaijie, et al.
Published: (2023)
Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
by: Sun, Peng, et al.
Published: (2026)
by: Sun, Peng, et al.
Published: (2026)
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
by: DataFlow Team, et al.
Published: (2026)
by: DataFlow Team, et al.
Published: (2026)
UniSD: Towards a Unified Self-Distillation Framework for Large Language Models
by: Jin, Yiqiao, et al.
Published: (2026)
by: Jin, Yiqiao, et al.
Published: (2026)
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
by: Chen, Fangyi, et al.
Published: (2024)
by: Chen, Fangyi, et al.
Published: (2024)
RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
by: Luo, Jane, et al.
Published: (2025)
by: Luo, Jane, et al.
Published: (2025)
On Fairness of Unified Multimodal Large Language Model for Image Generation
by: Liu, Ming, et al.
Published: (2025)
by: Liu, Ming, et al.
Published: (2025)
Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
by: Yang, Zhantao, et al.
Published: (2024)
by: Yang, Zhantao, et al.
Published: (2024)
Self-Ensemble Post Learning for Noisy Domain Generalization
by: Lu, Wang, et al.
Published: (2025)
by: Lu, Wang, et al.
Published: (2025)
Efficient Autoregressive Audio Modeling via Next-Scale Prediction
by: Qiu, Kai, et al.
Published: (2024)
by: Qiu, Kai, et al.
Published: (2024)
MotionVerse: A Unified Multimodal Framework for Motion Comprehension, Generation and Editing
by: Hou, Ruibing, et al.
Published: (2025)
by: Hou, Ruibing, et al.
Published: (2025)
STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
by: Liang, Wanchao, et al.
Published: (2024)
by: Liang, Wanchao, et al.
Published: (2024)
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
by: Ye, Rui, et al.
Published: (2025)
by: Ye, Rui, et al.
Published: (2025)
When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning
by: Li, Chen, et al.
Published: (2026)
by: Li, Chen, et al.
Published: (2026)
A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving
by: Li, Jialin, et al.
Published: (2025)
by: Li, Jialin, et al.
Published: (2025)
ExecuTorch -- A Unified PyTorch Solution to Run AI Models On-Device
by: Nachin, Mergen, et al.
Published: (2026)
by: Nachin, Mergen, et al.
Published: (2026)
Neural Radiance Fields with Torch Units
by: Ni, Bingnan, et al.
Published: (2024)
by: Ni, Bingnan, et al.
Published: (2024)
Can Vision Replace Text in Working Memory? Evidence from Spatial n-Back in Vision-Language Models
by: Liang, Sichu, et al.
Published: (2026)
by: Liang, Sichu, et al.
Published: (2026)
FormulaCode: Evaluating Agentic Optimization on Large Codebases
by: Sehgal, Atharva, et al.
Published: (2026)
by: Sehgal, Atharva, et al.
Published: (2026)
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization
by: Du, Bo, et al.
Published: (2025)
by: Du, Bo, et al.
Published: (2025)
torchtune: PyTorch native post-training library
by: Obozov, Mark, et al.
Published: (2026)
by: Obozov, Mark, et al.
Published: (2026)
SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution
by: He, Kang, et al.
Published: (2026)
by: He, Kang, et al.
Published: (2026)
Similar Items
-
LatentUMM: Dual Latent Alignment for Unified Multimodal Models
by: Luo, Yinyi, et al.
Published: (2026) -
UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning
by: Bai, Hayes, et al.
Published: (2026) -
Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026) -
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
by: Luo, Yinyi, et al.
Published: (2025) -
FedUMM: A General Framework for Federated Learning with Unified Multimodal Models
by: Su, Zhaolong, et al.
Published: (2026)