:: Library Catalog

Bibliographic Details
Main Authors:	Yin, Jiaxi; Wang, Pengcheng; Ding, Han; Wang, Fei
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2511.05292

Similar Items

XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses
by: Lan, Bo, et al.
Published: (2025)

Khana: A Comprehensive Indian Cuisine Dataset
by: Prabhu, Omkar
Published: (2025)

SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions
by: Xu, Vasco, et al.
Published: (2026)

Graph Your Own Prompt
by: Ding, Xi, et al.
Published: (2025)

Seeing Without Eyes: 4D Human-Scene Understanding from Wearable IMUs
by: Hsu, Hao-Yu, et al.
Published: (2026)

Diverse via bounded Agreement: Geometric Regularization for Multimodal Fusion
by: Xia, Zixuan, et al.
Published: (2026)

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)

Dietary Intake Estimation via Continuous 3D Reconstruction of Food
by: Lee, Wallace, et al.
Published: (2025)

Textualize Visual Prompt for Image Editing via Diffusion Bridge
by: Xu, Pengcheng, et al.
Published: (2025)

Inferring Dynamic Physical Properties from Video Foundation Models
by: Zhan, Guanqi, et al.
Published: (2025)

Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
by: Wang, Fei, et al.
Published: (2025)

What's Inside Your Diffusion Model? A Score-Based Riemannian Metric to Explore the Data Manifold
by: Azeglio, Simone, et al.
Published: (2025)

PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits
by: Li, Loka, et al.
Published: (2025)

ChronoSelect: Robust Learning with Noisy Labels via Dynamics Temporal Memory
by: Wang, Jianchao, et al.
Published: (2025)

SDS-Net: Shallow-Deep Synergism-detection Network for infrared small target detection
by: Yue, Taoran, et al.
Published: (2025)

WS-IMUBench: Can Weakly Supervised Methods from Audio, Image, and Video Be Adapted for IMU-based Temporal Action Localization?
by: Li, Pei, et al.
Published: (2026)

Self-Bootstrapping for Versatile Test-Time Adaptation
by: Niu, Shuaicheng, et al.
Published: (2025)

Multimodal Structure Learning: Disentangling Shared and Specific Topology via Cross-Modal Graphical Lasso
by: Wang, Fei, et al.
Published: (2026)

MobiDiary: Autoregressive Action Captioning with Wearable Devices and Wireless Signals
by: Deng, Fei, et al.
Published: (2026)

Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents
by: Veerabadran, Vijay, et al.
Published: (2025)

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation
by: Liu, Fangfu, et al.
Published: (2024)

LPTR-AFLNet: Lightweight Integrated Chinese License Plate Rectification and Recognition Network
by: Xu, Guangzhu, et al.
Published: (2025)

Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis
by: Ribeiro, Victor Nascimento, et al.
Published: (2025)

TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting
by: Kim, Hyeseong, et al.
Published: (2026)

Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma
by: AlDahoul, Nouar, et al.
Published: (2024)

BYOM: Building Your Own Multi-Task Model For Free
by: Jiang, Weisen, et al.
Published: (2023)

GMValuator: Similarity-based Data Valuation for Generative Models
by: Yang, Jiaxi, et al.
Published: (2023)

EXACT: How to Train Your Accuracy
by: Karpukhin, Ivan, et al.
Published: (2022)

Lite-SAM Is Actually What You Need for Segment Everything
by: Fu, Jianhai, et al.
Published: (2024)

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
by: Wang, Xiyao, et al.
Published: (2025)

Wearable-Derived Behavioral and Physiological Biomarkers for Classifying Unipolar and Bipolar Depression Severity
by: Ouzar, Yassine, et al.
Published: (2025)

Decoding Human Activities: Analyzing Wearable Accelerometer and Gyroscope Data for Activity Recognition
by: Saha, Utsab, et al.
Published: (2023)

How to Squeeze An Explanation Out of Your Model
by: Roxo, Tiago, et al.
Published: (2024)

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations
by: Zhang, Yiyuan, et al.
Published: (2024)

DeepFRC: An End-to-End Deep Learning Model for Functional Registration and Classification
by: Jiang, Siyuan, et al.
Published: (2025)

Learn from Foundation Model: Fruit Detection Model without Manual Annotation
by: Wang, Yanan, et al.
Published: (2024)

License Plate Images Generation with Diffusion Models
by: Shpir, Mariia, et al.
Published: (2025)

Enhancing Diffusion-Based Quantitatively Controllable Image Generation via Matrix-Form EDM and Adaptive Vicinal Training
by: Ding, Xin, et al.
Published: (2026)

Reimagining Anomalies: What If Anomalies Were Normal?
by: Liznerski, Philipp, et al.
Published: (2024)

DocVLM: Make Your VLM an Efficient Reader
by: Nacson, Mor Shpigel, et al.
Published: (2024)