Saved in:
| Main Authors: | Wu, Mingyang, Mishra, Ashirbad, Dey, Soumik, Xing, Shuo, Ravipati, Naveen, Wu, Hansi, Li, Binbin, Tu, Zhengzhong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
by: Xing, Shuo, et al.
Published: (2025)
by: Xing, Shuo, et al.
Published: (2025)
LLMDistill4Ads: Using Cross-Encoders to Distill from LLM Signals for Advertiser Keyphrase Recommendations at eBay
by: Dey, Soumik, et al.
Published: (2025)
by: Dey, Soumik, et al.
Published: (2025)
BroadGen: A Framework for Generating Effective and Efficient Advertiser Broad Match Keyphrase Recommendations
by: Mishra, Ashirbad, et al.
Published: (2025)
by: Mishra, Ashirbad, et al.
Published: (2025)
Batch Speculative Decoding Done Right
by: Zhang, Ranran Haoran, et al.
Published: (2025)
by: Zhang, Ranran Haoran, et al.
Published: (2025)
To Judge or not to Judge: Using LLM Judgements for Advertiser Keyphrase Relevance at eBay
by: Dey, Soumik, et al.
Published: (2025)
by: Dey, Soumik, et al.
Published: (2025)
Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation
by: Mishra, Ashirbad, et al.
Published: (2024)
by: Mishra, Ashirbad, et al.
Published: (2024)
Middleman Bias in Advertising: Aligning Relevance of Keyphrase Recommendations with Search
by: Dey, Soumik, et al.
Published: (2025)
by: Dey, Soumik, et al.
Published: (2025)
GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation
by: Mishra, Ashirbad, et al.
Published: (2024)
by: Mishra, Ashirbad, et al.
Published: (2024)
From Lazy to Prolific: Tackling Missing Labels in Open Vocabulary Extreme Classification by Positive-Unlabeled Sequence Learning
by: Zhang, Ranran Haoran, et al.
Published: (2024)
by: Zhang, Ranran Haoran, et al.
Published: (2024)
VISTA: Generative Visual Imagination for Vision-and-Language Navigation
by: Huang, Yanjia, et al.
Published: (2025)
by: Huang, Yanjia, et al.
Published: (2025)
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2024)
by: Chen, Weifeng, et al.
Published: (2024)
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
by: Huang, Jiehui, et al.
Published: (2024)
by: Huang, Jiehui, et al.
Published: (2024)
FORGE-Tree: Diffusion-Forcing Tree Search for Long-Horizon Robot Manipulation
by: Huang, Yanjia, et al.
Published: (2025)
by: Huang, Yanjia, et al.
Published: (2025)
VISTAv2: World Imagination for Indoor Vision-and-Language Navigation
by: Huang, Yanjia, et al.
Published: (2025)
by: Huang, Yanjia, et al.
Published: (2025)
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
by: Li, Hengjia, et al.
Published: (2025)
by: Li, Hengjia, et al.
Published: (2025)
StableAnimator: High-Quality Identity-Preserving Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2024)
by: Tu, Shuyuan, et al.
Published: (2024)
Neuromorphic Mimicry Attacks Exploiting Brain-Inspired Computing for Covert Cyber Intrusions
by: Ravipati, Hemanth
Published: (2025)
by: Ravipati, Hemanth
Published: (2025)
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
by: He, Xuanhua, et al.
Published: (2024)
by: He, Xuanhua, et al.
Published: (2024)
4KLSDB: A Large-Scale Dataset for 4K Image Restoration and Generation
by: Zhu, Zihao, et al.
Published: (2026)
by: Zhu, Zihao, et al.
Published: (2026)
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
by: Wu, Sibo, et al.
Published: (2025)
by: Wu, Sibo, et al.
Published: (2025)
Interaction driven topological phase transitions of hardcore bosons on a two-leg ladder
by: Parida, Rajashri, et al.
Published: (2024)
by: Parida, Rajashri, et al.
Published: (2024)
Correlated hopping induced topological order in an atomic mixture
by: Padhan, Ashirbad, et al.
Published: (2025)
by: Padhan, Ashirbad, et al.
Published: (2025)
DecAlign: Hierarchical Cross-Modal Alignment for Decoupled Multimodal Representation Learning
by: Qian, Chengxuan, et al.
Published: (2025)
by: Qian, Chengxuan, et al.
Published: (2025)
Slot-ID: Identity-Preserving Video Generation from Reference Videos via Slot-Based Temporal Identity Encoding
by: Lai, Yixuan, et al.
Published: (2026)
by: Lai, Yixuan, et al.
Published: (2026)
FlowSteer: Conditioning Flow Field for Consistent Image Restoration
by: Wickremasinghe, Tharindu, et al.
Published: (2025)
by: Wickremasinghe, Tharindu, et al.
Published: (2025)
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
by: Zhong, Yong, et al.
Published: (2025)
by: Zhong, Yong, et al.
Published: (2025)
Communication-Efficient and Privacy-Preserving Decentralized Meta-Learning
by: Yang, Hansi, et al.
Published: (2024)
by: Yang, Hansi, et al.
Published: (2024)
WithAnyone: Towards Controllable and ID Consistent Image Generation
by: Xu, Hengyuan, et al.
Published: (2025)
by: Xu, Hengyuan, et al.
Published: (2025)
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
by: Seo, Junyoung, et al.
Published: (2024)
by: Seo, Junyoung, et al.
Published: (2024)
Delta Forcing: Trust Region Steering for Interactive Autoregressive Video Generation
by: Wu, Yuheng, et al.
Published: (2026)
by: Wu, Yuheng, et al.
Published: (2026)
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
by: Gao, Xiangbo, et al.
Published: (2026)
by: Gao, Xiangbo, et al.
Published: (2026)
InstantID: Zero-shot Identity-Preserving Generation in Seconds
by: Wang, Qixun, et al.
Published: (2024)
by: Wang, Qixun, et al.
Published: (2024)
Does RLVR Extend Reasoning Boundaries? Investigating Capability Expansion in Vision-Language Models
by: Shen, Minghe, et al.
Published: (2025)
by: Shen, Minghe, et al.
Published: (2025)
Auto-Regressively Generating Multi-View Consistent Images
by: Hu, JiaKui, et al.
Published: (2025)
by: Hu, JiaKui, et al.
Published: (2025)
Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
by: Shen, Liao, et al.
Published: (2025)
by: Shen, Liao, et al.
Published: (2025)
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
by: Wang, Jiahao, et al.
Published: (2026)
by: Wang, Jiahao, et al.
Published: (2026)
GenRec: Unifying Video Generation and Recognition with Diffusion Models
by: Weng, Zejia, et al.
Published: (2024)
by: Weng, Zejia, et al.
Published: (2024)
Consistency-Preserving Diverse Video Generation
by: Liu, Xinshuang, et al.
Published: (2026)
by: Liu, Xinshuang, et al.
Published: (2026)
MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
by: Li, Renjie, et al.
Published: (2025)
by: Li, Renjie, et al.
Published: (2025)
mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation
by: Hu, Chan-Wei, et al.
Published: (2025)
by: Hu, Chan-Wei, et al.
Published: (2025)
Similar Items
-
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
by: Xing, Shuo, et al.
Published: (2025) -
LLMDistill4Ads: Using Cross-Encoders to Distill from LLM Signals for Advertiser Keyphrase Recommendations at eBay
by: Dey, Soumik, et al.
Published: (2025) -
BroadGen: A Framework for Generating Effective and Efficient Advertiser Broad Match Keyphrase Recommendations
by: Mishra, Ashirbad, et al.
Published: (2025) -
Batch Speculative Decoding Done Right
by: Zhang, Ranran Haoran, et al.
Published: (2025) -
To Judge or not to Judge: Using LLM Judgements for Advertiser Keyphrase Relevance at eBay
by: Dey, Soumik, et al.
Published: (2025)