:: Library Catalog

Bibliographic Details
Main Authors:	Wu, Mingyang; Mishra, Ashirbad; Dey, Soumik; Xing, Shuo; Ravipati, Naveen; Wu, Hansi; Li, Binbin; Tu, Zhengzhong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.10113

Similar Items

Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
by: Xing, Shuo, et al.
Published: (2025)

LLMDistill4Ads: Using Cross-Encoders to Distill from LLM Signals for Advertiser Keyphrase Recommendations at eBay
by: Dey, Soumik, et al.
Published: (2025)

BroadGen: A Framework for Generating Effective and Efficient Advertiser Broad Match Keyphrase Recommendations
by: Mishra, Ashirbad, et al.
Published: (2025)

Batch Speculative Decoding Done Right
by: Zhang, Ranran Haoran, et al.
Published: (2025)

To Judge or not to Judge: Using LLM Judgements for Advertiser Keyphrase Relevance at eBay
by: Dey, Soumik, et al.
Published: (2025)

Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation
by: Mishra, Ashirbad, et al.
Published: (2024)

Middleman Bias in Advertising: Aligning Relevance of Keyphrase Recommendations with Search
by: Dey, Soumik, et al.
Published: (2025)

GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation
by: Mishra, Ashirbad, et al.
Published: (2024)

From Lazy to Prolific: Tackling Missing Labels in Open Vocabulary Extreme Classification by Positive-Unlabeled Sequence Learning
by: Zhang, Ranran Haoran, et al.
Published: (2024)

VISTA: Generative Visual Imagination for Vision-and-Language Navigation
by: Huang, Yanjia, et al.
Published: (2025)

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2024)

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
by: Huang, Jiehui, et al.
Published: (2024)

FORGE-Tree: Diffusion-Forcing Tree Search for Long-Horizon Robot Manipulation
by: Huang, Yanjia, et al.
Published: (2025)

VISTAv2: World Imagination for Indoor Vision-and-Language Navigation
by: Huang, Yanjia, et al.
Published: (2025)

MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
by: Li, Hengjia, et al.
Published: (2025)

StableAnimator: High-Quality Identity-Preserving Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2024)

Neuromorphic Mimicry Attacks Exploiting Brain-Inspired Computing for Covert Cyber Intrusions
by: Ravipati, Hemanth
Published: (2025)

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
by: He, Xuanhua, et al.
Published: (2024)

4KLSDB: A Large-Scale Dataset for 4K Image Restoration and Generation
by: Zhu, Zihao, et al.
Published: (2026)

GenFusion: Closing the Loop between Reconstruction and Generation via Videos
by: Wu, Sibo, et al.
Published: (2025)

Interaction driven topological phase transitions of hardcore bosons on a two-leg ladder
by: Parida, Rajashri, et al.
Published: (2024)

Correlated hopping induced topological order in an atomic mixture
by: Padhan, Ashirbad, et al.
Published: (2025)

DecAlign: Hierarchical Cross-Modal Alignment for Decoupled Multimodal Representation Learning
by: Qian, Chengxuan, et al.
Published: (2025)

Slot-ID: Identity-Preserving Video Generation from Reference Videos via Slot-Based Temporal Identity Encoding
by: Lai, Yixuan, et al.
Published: (2026)

FlowSteer: Conditioning Flow Field for Consistent Image Restoration
by: Wickremasinghe, Tharindu, et al.
Published: (2025)

Concat-ID: Towards Universal Identity-Preserving Video Synthesis
by: Zhong, Yong, et al.
Published: (2025)

Communication-Efficient and Privacy-Preserving Decentralized Meta-Learning
by: Yang, Hansi, et al.
Published: (2024)

WithAnyone: Towards Controllable and ID Consistent Image Generation
by: Xu, Hengyuan, et al.
Published: (2025)

GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
by: Seo, Junyoung, et al.
Published: (2024)

Delta Forcing: Trust Region Steering for Interactive Autoregressive Video Generation
by: Wu, Yuheng, et al.
Published: (2026)

The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
by: Gao, Xiangbo, et al.
Published: (2026)

InstantID: Zero-shot Identity-Preserving Generation in Seconds
by: Wang, Qixun, et al.
Published: (2024)

Does RLVR Extend Reasoning Boundaries? Investigating Capability Expansion in Vision-Language Models
by: Shen, Minghe, et al.
Published: (2025)

Auto-Regressively Generating Multi-View Consistent Images
by: Hu, JiaKui, et al.
Published: (2025)

Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
by: Shen, Liao, et al.
Published: (2025)

AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
by: Wang, Jiahao, et al.
Published: (2026)

GenRec: Unifying Video Generation and Recognition with Diffusion Models
by: Weng, Zejia, et al.
Published: (2024)

Consistency-Preserving Diverse Video Generation
by: Liu, Xinshuang, et al.
Published: (2026)

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
by: Li, Renjie, et al.
Published: (2025)

mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation
by: Hu, Chan-Wei, et al.
Published: (2025)