:: Library Catalog

Bibliographic Details
Main Authors:	Luo, Yinyi; Wang, Wenwen; Bai, Hayes; Zhu, Hongyu; Chen, Hao; He, Pan; Savvides, Marios; Li, Sharon; Wang, Jindong
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.10784

Similar Items

LatentUMM: Dual Latent Alignment for Unified Multimodal Models
by: Luo, Yinyi, et al.
Published: (2026)

UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning
by: Bai, Hayes, et al.
Published: (2026)

Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026)

KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning
by: Luo, Yinyi, et al.
Published: (2025)

FedUMM: A General Framework for Federated Learning with Unified Multimodal Models
by: Su, Zhaolong, et al.
Published: (2026)

Image Tokenizer Needs Post-Training
by: Qiu, Kai, et al.
Published: (2025)

MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
by: Li, Chen, et al.
Published: (2025)

SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning
by: Li, Chen, et al.
Published: (2025)

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
by: Su, Zhaolong, et al.
Published: (2025)

An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
by: Chen, Hao, et al.
Published: (2022)

A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation
by: He, Haonan, et al.
Published: (2026)

Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
by: Qiu, Kai, et al.
Published: (2025)

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets
by: Chen, Hao, et al.
Published: (2022)

SciPost Physics Codebases
Published: (2026)

Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
by: Yao, Changwei, et al.
Published: (2025)

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation
by: Dai, Wenxun, et al.
Published: (2026)

PromptBench: A Unified Library for Evaluation of Large Language Models
by: Zhu, Kaijie, et al.
Published: (2023)

Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
by: Sun, Peng, et al.
Published: (2026)

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
by: DataFlow Team, et al.
Published: (2026)

UniSD: Towards a Unified Self-Distillation Framework for Large Language Models
by: Jin, Yiqiao, et al.
Published: (2026)

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
by: Chen, Fangyi, et al.
Published: (2024)

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation
by: Luo, Jane, et al.
Published: (2025)

On Fairness of Unified Multimodal Large Language Model for Image Generation
by: Liu, Ming, et al.
Published: (2025)

Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
by: Yang, Zhantao, et al.
Published: (2024)

Self-Ensemble Post Learning for Noisy Domain Generalization
by: Lu, Wang, et al.
Published: (2025)

Efficient Autoregressive Audio Modeling via Next-Scale Prediction
by: Qiu, Kai, et al.
Published: (2024)

MotionVerse: A Unified Multimodal Framework for Motion Comprehension, Generation and Editing
by: Hou, Ruibing, et al.
Published: (2025)

STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision
by: Li, Chen, et al.
Published: (2025)

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
by: Liang, Wanchao, et al.
Published: (2024)

MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
by: Ye, Rui, et al.
Published: (2025)

When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning
by: Li, Chen, et al.
Published: (2026)

A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving
by: Li, Jialin, et al.
Published: (2025)

ExecuTorch -- A Unified PyTorch Solution to Run AI Models On-Device
by: Nachin, Mergen, et al.
Published: (2026)

Neural Radiance Fields with Torch Units
by: Ni, Bingnan, et al.
Published: (2024)

Can Vision Replace Text in Working Memory? Evidence from Spatial n-Back in Vision-Language Models
by: Liang, Sichu, et al.
Published: (2026)

FormulaCode: Evaluating Agentic Optimization on Large Codebases
by: Sehgal, Atharva, et al.
Published: (2026)

AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
by: Luo, Yinyi, et al.
Published: (2026)

ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization
by: Du, Bo, et al.
Published: (2025)

torchtune: PyTorch native post-training library
by: Obozov, Mark, et al.
Published: (2026)

SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution
by: He, Kang, et al.
Published: (2026)