Saved in:
| Main Authors: | Mu, Mingxuan, Yang, Guo, Chen, Lei, Wu, Ping, Cui, Jianxun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04868 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation
by: Wang, Wenhao, et al.
Published: (2025)
by: Wang, Wenhao, et al.
Published: (2025)
SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling
by: Cui, Jinlong, et al.
Published: (2026)
by: Cui, Jinlong, et al.
Published: (2026)
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation
by: Li, Mingxuan, et al.
Published: (2025)
by: Li, Mingxuan, et al.
Published: (2025)
Accurate and Interpretable Postmenstrual Age Prediction via Multimodal Large Language Model
by: Chen, Qifan, et al.
Published: (2025)
by: Chen, Qifan, et al.
Published: (2025)
PrefGen: Multimodal Preference Learning for Preference-Conditioned Image Generation
by: Mo, Wenyi, et al.
Published: (2025)
by: Mo, Wenyi, et al.
Published: (2025)
Visuospatial Perspective Taking in Multimodal Language Models
by: Prunty, Jonathan, et al.
Published: (2026)
by: Prunty, Jonathan, et al.
Published: (2026)
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents
by: Zhu, Kaijie, et al.
Published: (2026)
by: Zhu, Kaijie, et al.
Published: (2026)
FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction
by: You, Junwei, et al.
Published: (2024)
by: You, Junwei, et al.
Published: (2024)
AromaGen: Interactive Generation of Rich Olfactory Experiences with Multimodal Language Models
by: Wen, Yunge, et al.
Published: (2026)
by: Wen, Yunge, et al.
Published: (2026)
Language-Conditioned Safe Trajectory Generation for Spacecraft Rendezvous
by: Takubo, Yuji, et al.
Published: (2025)
by: Takubo, Yuji, et al.
Published: (2025)
SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation
by: Li, Xin, et al.
Published: (2024)
by: Li, Xin, et al.
Published: (2024)
General-purpose Clothes Manipulation with Semantic Keypoints
by: Deng, Yuhong, et al.
Published: (2024)
by: Deng, Yuhong, et al.
Published: (2024)
SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation
by: Zhou, Xinyi, et al.
Published: (2024)
by: Zhou, Xinyi, et al.
Published: (2024)
Generating Survival Interpretable Trajectories and Data
by: Konstantinov, Andrei V., et al.
Published: (2024)
by: Konstantinov, Andrei V., et al.
Published: (2024)
AttnGen: Attention-Guided Saliency Learning for Interpretable Genomic Sequence Classification
by: Nia, Rayhaneh Shabani, et al.
Published: (2026)
by: Nia, Rayhaneh Shabani, et al.
Published: (2026)
Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates
by: Dang, Hy, et al.
Published: (2025)
by: Dang, Hy, et al.
Published: (2025)
RecAI: Leveraging Large Language Models for Next-Generation Recommender Systems
by: Lian, Jianxun, et al.
Published: (2024)
by: Lian, Jianxun, et al.
Published: (2024)
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
by: Wang, Jiayu, et al.
Published: (2025)
by: Wang, Jiayu, et al.
Published: (2025)
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models
by: Liu, Shuqi, et al.
Published: (2025)
by: Liu, Shuqi, et al.
Published: (2025)
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
by: Zhang, Jiajun, et al.
Published: (2025)
by: Zhang, Jiajun, et al.
Published: (2025)
SlideGen: Collaborative Multimodal Agents for Scientific Slide Generation
by: Liang, Xin, et al.
Published: (2025)
by: Liang, Xin, et al.
Published: (2025)
CLASP: General-Purpose Clothes Manipulation with Semantic Keypoints
by: Deng, Yuhong, et al.
Published: (2025)
by: Deng, Yuhong, et al.
Published: (2025)
Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach
by: Jahan, Rafid Ishrak, et al.
Published: (2024)
by: Jahan, Rafid Ishrak, et al.
Published: (2024)
FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories
by: Ke, Lei, et al.
Published: (2025)
by: Ke, Lei, et al.
Published: (2025)
Interactive AI NPCs Powered by LLMs: Technical Report for the CPDC Challenge 2025
by: Huang, Yitian, et al.
Published: (2025)
by: Huang, Yitian, et al.
Published: (2025)
Why not Collaborative Filtering in Dual View? Bridging Sparse and Dense Models
by: Guo, Hanze, et al.
Published: (2026)
by: Guo, Hanze, et al.
Published: (2026)
From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models
by: Zhou, Chenyue, et al.
Published: (2025)
by: Zhou, Chenyue, et al.
Published: (2025)
Confounding Robust Deep Reinforcement Learning: A Causal Approach
by: Li, Mingxuan, et al.
Published: (2025)
by: Li, Mingxuan, et al.
Published: (2025)
Dynamic Summary Generation for Interpretable Multimodal Depression Detection
by: Teng, Shiyu, et al.
Published: (2026)
by: Teng, Shiyu, et al.
Published: (2026)
Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning
by: Gong, Siyu, et al.
Published: (2026)
by: Gong, Siyu, et al.
Published: (2026)
Physics-Guided Multimodal Transformers are the Necessary Foundation for the Next Generation of Meteorological Science
by: Han, Jing, et al.
Published: (2025)
by: Han, Jing, et al.
Published: (2025)
R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression
by: Zhao, Kun, et al.
Published: (2025)
by: Zhao, Kun, et al.
Published: (2025)
Personality-Guided Code Generation Using Large Language Models
by: Guo, Yaoqi, et al.
Published: (2024)
by: Guo, Yaoqi, et al.
Published: (2024)
GenOM: Ontology Matching with Description Generation and Large Language Model
by: Song, Yiping, et al.
Published: (2025)
by: Song, Yiping, et al.
Published: (2025)
Multimodal Trajectory Prediction for Autonomous Driving on Unstructured Roads using Deep Convolutional Network
by: Li, Lei, et al.
Published: (2024)
by: Li, Lei, et al.
Published: (2024)
Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach
by: Liu, Shuqi, et al.
Published: (2025)
by: Liu, Shuqi, et al.
Published: (2025)
Language-Driven Interactive Traffic Trajectory Generation
by: Xia, Junkai, et al.
Published: (2024)
by: Xia, Junkai, et al.
Published: (2024)
A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards
by: Patel, Shivansh, et al.
Published: (2025)
by: Patel, Shivansh, et al.
Published: (2025)
CauTraj: A Causal-Knowledge-Guided Framework for Lane-Changing Trajectory Planning of Autonomous Vehicles
by: Lei, Cailin, et al.
Published: (2025)
by: Lei, Cailin, et al.
Published: (2025)
ProductWebGen: Benchmarking Multimodal Product Webpage Generation
by: Liu, Zhihong, et al.
Published: (2026)
by: Liu, Zhihong, et al.
Published: (2026)
Similar Items
-
FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation
by: Wang, Wenhao, et al.
Published: (2025) -
SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling
by: Cui, Jinlong, et al.
Published: (2026) -
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation
by: Li, Mingxuan, et al.
Published: (2025) -
Accurate and Interpretable Postmenstrual Age Prediction via Multimodal Large Language Model
by: Chen, Qifan, et al.
Published: (2025) -
PrefGen: Multimodal Preference Learning for Preference-Conditioned Image Generation
by: Mo, Wenyi, et al.
Published: (2025)