Saved in:
| Main Authors: | Liu, Mingyu, Mao, Zian, Liu, Zhu, Zhang, Haoran, Guo, Jintao, He, Xiaoya, Huang, Xi, Chu, Shufen, Cheng, Chun, Ding, Jun, Xie, Yujun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.03775 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
by: Liang, Shuqiao, et al.
Published: (2025)
by: Liang, Shuqiao, et al.
Published: (2025)
Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
by: Gomes, Manuel, et al.
Published: (2025)
by: Gomes, Manuel, et al.
Published: (2025)
Correspondence of high-dimensional emotion structures elicited by video clips between humans and Multimodal LLMs
by: Asanuma, Haruka, et al.
Published: (2025)
by: Asanuma, Haruka, et al.
Published: (2025)
Task Singular Vectors: Reducing Task Interference in Model Merging
by: Gargiulo, Antonio Andrea, et al.
Published: (2024)
by: Gargiulo, Antonio Andrea, et al.
Published: (2024)
Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)
by: Adžemović, Momir
Published: (2025)
PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025)
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025)
U-Net-Like Spiking Neural Networks for Single Image Dehazing
by: Li, Huibin, et al.
Published: (2025)
by: Li, Huibin, et al.
Published: (2025)
Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
by: Estepa, Imanol G., et al.
Published: (2026)
by: Estepa, Imanol G., et al.
Published: (2026)
All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction
by: Estepa, Imanol G., et al.
Published: (2023)
by: Estepa, Imanol G., et al.
Published: (2023)
Decoupling Vision and Language: Codebook Anchored Visual Adaptation
by: Wu, Jason, et al.
Published: (2026)
by: Wu, Jason, et al.
Published: (2026)
CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)
by: Diller, Christian, et al.
Published: (2023)
3D Adaptive Structural Convolution Network for Domain-Invariant Point Cloud Recognition
by: Kim, Younggun, et al.
Published: (2024)
by: Kim, Younggun, et al.
Published: (2024)
A Persistent Homology Design Space for 3D Point Cloud Deep Learning
by: Kudeshia, Prachi, et al.
Published: (2026)
by: Kudeshia, Prachi, et al.
Published: (2026)
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
by: Semenov, Andrei, et al.
Published: (2024)
by: Semenov, Andrei, et al.
Published: (2024)
CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution
by: Schiffer, Christian, et al.
Published: (2025)
by: Schiffer, Christian, et al.
Published: (2025)
Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
by: Wu, Kunlin, et al.
Published: (2026)
by: Wu, Kunlin, et al.
Published: (2026)
Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition
by: Nakamura, Ikuo
Published: (2024)
by: Nakamura, Ikuo
Published: (2024)
Robust Confidence Intervals in Stereo Matching using Possibility Theory
by: Malinowski, Roman, et al.
Published: (2024)
by: Malinowski, Roman, et al.
Published: (2024)
Few-Shot Learning of a Graph-Based Neural Network Model Without Backpropagation
by: Lapin, Mykyta, et al.
Published: (2025)
by: Lapin, Mykyta, et al.
Published: (2025)
FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
by: Diller, Christian, et al.
Published: (2022)
by: Diller, Christian, et al.
Published: (2022)
FedWCM: Unleashing the Potential of Momentum-based Federated Learning in Long-Tailed Scenarios
by: Li, Tianle, et al.
Published: (2025)
by: Li, Tianle, et al.
Published: (2025)
Tricks and Plug-ins for Gradient Boosting in Image Classification
by: Fang, Biyi, et al.
Published: (2025)
by: Fang, Biyi, et al.
Published: (2025)
TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
by: Nauen, Tobias Christian, et al.
Published: (2024)
by: Nauen, Tobias Christian, et al.
Published: (2024)
PhysVid: Physics Aware Local Conditioning for Generative Video Models
by: Pathak, Saurabh, et al.
Published: (2026)
by: Pathak, Saurabh, et al.
Published: (2026)
Adapting SAM with Dynamic Similarity Graphs for Few-Shot Parameter-Efficient Small Dense Object Detection: A Case Study of Chickpea Pods in Field Conditions
by: Jiang, Xintong, et al.
Published: (2025)
by: Jiang, Xintong, et al.
Published: (2025)
TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition
by: Hassan, Imtiaz Ul, et al.
Published: (2026)
by: Hassan, Imtiaz Ul, et al.
Published: (2026)
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification
by: Ho, Darryl, et al.
Published: (2025)
by: Ho, Darryl, et al.
Published: (2025)
High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery
by: Peng, Hongxing, et al.
Published: (2025)
by: Peng, Hongxing, et al.
Published: (2025)
Cross-Domain Adversarial Augmentation: Stabilizing GANs for Medical and Handwriting Data Scarcity
by: Soad, Md. Sohanuzzaman, et al.
Published: (2026)
by: Soad, Md. Sohanuzzaman, et al.
Published: (2026)
CARScenes: Semantic VLM Dataset for Safe Autonomous Driving
by: He, Yuankai, et al.
Published: (2025)
by: He, Yuankai, et al.
Published: (2025)
Predicting When to Trust Vision-Language Models for Spatial Reasoning
by: Imran, Muhammad, et al.
Published: (2026)
by: Imran, Muhammad, et al.
Published: (2026)
NV3D: Leveraging Spatial Shape Through Normal Vector-based 3D Object Detection
by: Chaowakarn, Krittin, et al.
Published: (2025)
by: Chaowakarn, Krittin, et al.
Published: (2025)
Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey
by: Rajapaksha, Uchitha, et al.
Published: (2024)
by: Rajapaksha, Uchitha, et al.
Published: (2024)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
by: Raoufi, Behnam, et al.
Published: (2025)
LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation
by: Wei, Hualiang, et al.
Published: (2026)
by: Wei, Hualiang, et al.
Published: (2026)
Can Robots "Taste" Grapes? Estimating SSC with Simple RGB Sensors
by: Ciarfuglia, Thomas Alessandro, et al.
Published: (2024)
by: Ciarfuglia, Thomas Alessandro, et al.
Published: (2024)
Butter: Frequency Consistency and Hierarchical Fusion for Autonomous Driving Object Detection
by: Lin, Xiaojian, et al.
Published: (2025)
by: Lin, Xiaojian, et al.
Published: (2025)
Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix
by: Gurbindo, Unai, et al.
Published: (2025)
by: Gurbindo, Unai, et al.
Published: (2025)
Implementing Adaptations for Vision AutoRegressive Model
by: Shaikh, Kaif, et al.
Published: (2025)
by: Shaikh, Kaif, et al.
Published: (2025)
CASE: Contrastive Activation for Saliency Estimation
by: Williamson, Dane, et al.
Published: (2025)
by: Williamson, Dane, et al.
Published: (2025)
Similar Items
-
FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
by: Liang, Shuqiao, et al.
Published: (2025) -
Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
by: Gomes, Manuel, et al.
Published: (2025) -
Correspondence of high-dimensional emotion structures elicited by video clips between humans and Multimodal LLMs
by: Asanuma, Haruka, et al.
Published: (2025) -
Task Singular Vectors: Reducing Task Interference in Model Merging
by: Gargiulo, Antonio Andrea, et al.
Published: (2024) -
Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)