| Main Authors: | Yin, Jiaxi; Wang, Pengcheng; Ding, Han; Wang, Fei |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.05292 |
Similar Items
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses
by: Lan, Bo, et al.
Published: (2025)
Khana: A Comprehensive Indian Cuisine Dataset
by: Prabhu, Omkar
Published: (2025)
SurfaceXR: Fusing Smartwatch IMUs and Egocentric Hand Pose for Seamless Surface Interactions
by: Xu, Vasco, et al.
Published: (2026)
Graph Your Own Prompt
by: Ding, Xi, et al.
Published: (2025)
Seeing Without Eyes: 4D Human-Scene Understanding from Wearable IMUs
by: Hsu, Hao-Yu, et al.
Published: (2026)
Diverse via bounded Agreement: Geometric Regularization for Multimodal Fusion
by: Xia, Zixuan, et al.
Published: (2026)
What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models
by: Humayun, Ahmed Imtiaz, et al.
Published: (2024)
Dietary Intake Estimation via Continuous 3D Reconstruction of Food
by: Lee, Wallace, et al.
Published: (2025)
Textualize Visual Prompt for Image Editing via Diffusion Bridge
by: Xu, Pengcheng, et al.
Published: (2025)
Inferring Dynamic Physical Properties from Video Foundation Models
by: Zhan, Guanqi, et al.
Published: (2025)
Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
by: Wang, Fei, et al.
Published: (2025)
What's Inside Your Diffusion Model? A Score-Based Riemannian Metric to Explore the Data Manifold
by: Azeglio, Simone, et al.
Published: (2025)
PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits
by: Li, Loka, et al.
Published: (2025)
ChronoSelect: Robust Learning with Noisy Labels via Dynamics Temporal Memory
by: Wang, Jianchao, et al.
Published: (2025)
SDS-Net: Shallow-Deep Synergism-detection Network for infrared small target detection
by: Yue, Taoran, et al.
Published: (2025)
WS-IMUBench: Can Weakly Supervised Methods from Audio, Image, and Video Be Adapted for IMU-based Temporal Action Localization?
by: Li, Pei, et al.
Published: (2026)
Self-Bootstrapping for Versatile Test-Time Adaptation
by: Niu, Shuaicheng, et al.
Published: (2025)
Multimodal Structure Learning: Disentangling Shared and Specific Topology via Cross-Modal Graphical Lasso
by: Wang, Fei, et al.
Published: (2026)
MobiDiary: Autoregressive Action Captioning with Wearable Devices and Wireless Signals
by: Deng, Fei, et al.
Published: (2026)
Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents
by: Veerabadran, Vijay, et al.
Published: (2025)
Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation
by: Liu, Fangfu, et al.
Published: (2024)
LPTR-AFLNet: Lightweight Integrated Chinese License Plate Rectification and Recognition Network
by: Xu, Guangzhu, et al.
Published: (2025)
Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis
by: Ribeiro, Victor Nascimento, et al.
Published: (2025)
TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting
by: Kim, Hyeseong, et al.
Published: (2026)
Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma
by: AlDahoul, Nouar, et al.
Published: (2024)
BYOM: Building Your Own Multi-Task Model For Free
by: Jiang, Weisen, et al.
Published: (2023)
GMValuator: Similarity-based Data Valuation for Generative Models
by: Yang, Jiaxi, et al.
Published: (2023)
EXACT: How to Train Your Accuracy
by: Karpukhin, Ivan, et al.
Published: (2022)
Lite-SAM Is Actually What You Need for Segment Everything
by: Fu, Jianhai, et al.
Published: (2024)
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
by: Wang, Xiyao, et al.
Published: (2025)
Wearable-Derived Behavioral and Physiological Biomarkers for Classifying Unipolar and Bipolar Depression Severity
by: Ouzar, Yassine, et al.
Published: (2025)
Decoding Human Activities: Analyzing Wearable Accelerometer and Gyroscope Data for Activity Recognition
by: Saha, Utsab, et al.
Published: (2023)
How to Squeeze An Explanation Out of Your Model
by: Roxo, Tiago, et al.
Published: (2024)
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations
by: Zhang, Yiyuan, et al.
Published: (2024)
DeepFRC: An End-to-End Deep Learning Model for Functional Registration and Classification
by: Jiang, Siyuan, et al.
Published: (2025)
Learn from Foundation Model: Fruit Detection Model without Manual Annotation
by: Wang, Yanan, et al.
Published: (2024)
License Plate Images Generation with Diffusion Models
by: Shpir, Mariia, et al.
Published: (2025)
Enhancing Diffusion-Based Quantitatively Controllable Image Generation via Matrix-Form EDM and Adaptive Vicinal Training
by: Ding, Xin, et al.
Published: (2026)
Reimagining Anomalies: What If Anomalies Were Normal?
by: Liznerski, Philipp, et al.
Published: (2024)
DocVLM: Make Your VLM an Efficient Reader
by: Nacson, Mor Shpigel, et al.
Published: (2024)