Saved in:
| Main Authors: | Yu, Jinghan, Xiao, Junhao, Ma, Zhiyuan, Ma, Yue, Liu, Kaiqi, Wang, Yuhan, Liu, Daizong, Meng, Xianghao, Li, Jianjun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.06543 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MILD: Mediator Agent System with Bidirectional Perception and Multi-Layered Alignment for Human-Vehicle Collaboration
by: Wang, Jiyao, et al.
Published: (2026)
by: Wang, Jiyao, et al.
Published: (2026)
Dual-Stream Decoupled Learning for Temporal Consistency and Speaker Interaction in AVSD
by: Xiao, Junhao, et al.
Published: (2025)
by: Xiao, Junhao, et al.
Published: (2025)
Context-Aware Autoregressive Models for Multi-Conditional Image Generation
by: Chen, Yixiao, et al.
Published: (2025)
by: Chen, Yixiao, et al.
Published: (2025)
Low-Complexity Near-Field Localization with XL-MIMO Sectored Uniform Circular Arrays
by: Liu, Shicong, et al.
Published: (2024)
by: Liu, Shicong, et al.
Published: (2024)
TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment
by: Li, Jiaming, et al.
Published: (2026)
by: Li, Jiaming, et al.
Published: (2026)
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model
by: Ma, Fulong, et al.
Published: (2024)
by: Ma, Fulong, et al.
Published: (2024)
Layered 3D Human Generation via Semantic-Aware Diffusion Model
by: Wang, Yi, et al.
Published: (2023)
by: Wang, Yi, et al.
Published: (2023)
Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding
by: Huang, Wencan, et al.
Published: (2025)
by: Huang, Wencan, et al.
Published: (2025)
MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking
by: Hossain, Md. Kamrul, et al.
Published: (2026)
by: Hossain, Md. Kamrul, et al.
Published: (2026)
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
by: Fang, Xiang, et al.
Published: (2022)
by: Fang, Xiang, et al.
Published: (2022)
HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution
by: Zou, Yang, et al.
Published: (2026)
by: Zou, Yang, et al.
Published: (2026)
I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing
by: Yu, Jinghan, et al.
Published: (2026)
by: Yu, Jinghan, et al.
Published: (2026)
Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts
by: Zhang, Tong, et al.
Published: (2025)
by: Zhang, Tong, et al.
Published: (2025)
HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model
by: Wang, Yi, et al.
Published: (2024)
by: Wang, Yi, et al.
Published: (2024)
ActErase: A Training-Free Paradigm for Precise Concept Erasure via Activation Redirection
by: Sun, Yi, et al.
Published: (2026)
by: Sun, Yi, et al.
Published: (2026)
Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation
by: Ma, Xiao, et al.
Published: (2024)
by: Ma, Xiao, et al.
Published: (2024)
OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
by: Liu, Yu, et al.
Published: (2025)
by: Liu, Yu, et al.
Published: (2025)
GO-MLVTON: Garment Occlusion-Aware Multi-Layer Virtual Try-On with Diffusion Models
by: Yu, Yang, et al.
Published: (2026)
by: Yu, Yang, et al.
Published: (2026)
MILD: Multispectral Image dataset with Lighting Diversity
by: Oh, Hyejin, et al.
Published: (2026)
by: Oh, Hyejin, et al.
Published: (2026)
Reconfiguring Ionomers in Proton Exchange Membrane Fuel Cell Catalyst Layer to Promote Multi‐Species Transport and Durability
by: Feiyu Yue, et al.
Published: (2025)
by: Feiyu Yue, et al.
Published: (2025)
Reconfiguring Ionomers in Proton Exchange Membrane Fuel Cell Catalyst Layer to Promote Multi‐Species Transport and Durability
by: Feiyu Yue, et al.
Published: (2025)
by: Feiyu Yue, et al.
Published: (2025)
Auto-ICL: In-Context Learning without Human Supervision
by: Yang, Jinghan, et al.
Published: (2023)
by: Yang, Jinghan, et al.
Published: (2023)
KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning
by: Zhang, Kaiqi, et al.
Published: (2024)
by: Zhang, Kaiqi, et al.
Published: (2024)
Audio Does Matter: Importance-Aware Multi-Granularity Fusion for Video Moment Retrieval
by: Lin, Junan, et al.
Published: (2025)
by: Lin, Junan, et al.
Published: (2025)
Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling
by: Ma, Zhiyuan, et al.
Published: (2025)
by: Ma, Zhiyuan, et al.
Published: (2025)
FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting
by: He, Zefeng, et al.
Published: (2025)
by: He, Zefeng, et al.
Published: (2025)
UniErase: Towards Balanced and Precise Unlearning in Language Models
by: Yu, Miao, et al.
Published: (2025)
by: Yu, Miao, et al.
Published: (2025)
Joint Transmit and Reflective Beamforming for Multi-Active-IRS-Assisted Cooperative Sensing
by: Fang, Yuan, et al.
Published: (2024)
by: Fang, Yuan, et al.
Published: (2024)
MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
by: Liu, Penghui, et al.
Published: (2025)
by: Liu, Penghui, et al.
Published: (2025)
RESISTANCE EXERCISE AND BRAIN CONNECTOME IN MILD COGNITIVE IMPAIRMENT
by: Isadora Cristina Ribeiro, et al.
Published: (2024)
by: Isadora Cristina Ribeiro, et al.
Published: (2024)
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
by: Ma, Zhiyuan, et al.
Published: (2024)
by: Ma, Zhiyuan, et al.
Published: (2024)
Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation
by: Ma, Haijian, et al.
Published: (2025)
by: Ma, Haijian, et al.
Published: (2025)
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
by: Kogashi, Kaen, et al.
Published: (2025)
by: Kogashi, Kaen, et al.
Published: (2025)
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways
by: Liu, Yi, et al.
Published: (2025)
by: Liu, Yi, et al.
Published: (2025)
Cognitive Disentanglement for Referring Multi-Object Tracking
by: Liang, Shaofeng, et al.
Published: (2025)
by: Liang, Shaofeng, et al.
Published: (2025)
Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding
by: Liu, Yang, et al.
Published: (2024)
by: Liu, Yang, et al.
Published: (2024)
Broadcast Graph Is NP-complete
by: Xu, Jinghan, et al.
Published: (2024)
by: Xu, Jinghan, et al.
Published: (2024)
Do VLMs Truly "Read" Candlesticks? A Multi-Scale Benchmark for Visual Stock Price Forecasting
by: Hu, Kaiqi, et al.
Published: (2026)
by: Hu, Kaiqi, et al.
Published: (2026)
MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels
by: Hu, Chuanyang, et al.
Published: (2023)
by: Hu, Chuanyang, et al.
Published: (2023)
Erasing Undesirable Influence in Diffusion Models
by: Wu, Jing, et al.
Published: (2024)
by: Wu, Jing, et al.
Published: (2024)
Similar Items
-
MILD: Mediator Agent System with Bidirectional Perception and Multi-Layered Alignment for Human-Vehicle Collaboration
by: Wang, Jiyao, et al.
Published: (2026) -
Dual-Stream Decoupled Learning for Temporal Consistency and Speaker Interaction in AVSD
by: Xiao, Junhao, et al.
Published: (2025) -
Context-Aware Autoregressive Models for Multi-Conditional Image Generation
by: Chen, Yixiao, et al.
Published: (2025) -
Low-Complexity Near-Field Localization with XL-MIMO Sectored Uniform Circular Arrays
by: Liu, Shicong, et al.
Published: (2024) -
TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment
by: Li, Jiaming, et al.
Published: (2026)