:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Mingyu, Mao, Zian, Liu, Zhu, Zhang, Haoran, Guo, Jintao, He, Xiaoya, Huang, Xi, Chu, Shufen, Cheng, Chun, Ding, Jun, Xie, Yujun
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Materials Science Artificial Intelligence I.2.10; I.5.1; J.2
Online Access:	https://arxiv.org/abs/2508.03775
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
by: Liang, Shuqiao, et al.
Published: (2025)

Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
by: Gomes, Manuel, et al.
Published: (2025)

Correspondence of high-dimensional emotion structures elicited by video clips between humans and Multimodal LLMs
by: Asanuma, Haruka, et al.
Published: (2025)

Task Singular Vectors: Reducing Task Interference in Model Merging
by: Gargiulo, Antonio Andrea, et al.
Published: (2024)

Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)

PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025)

U-Net-Like Spiking Neural Networks for Single Image Dehazing
by: Li, Huibin, et al.
Published: (2025)

Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
by: Estepa, Imanol G., et al.
Published: (2026)

All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction
by: Estepa, Imanol G., et al.
Published: (2023)

Decoupling Vision and Language: Codebook Anchored Visual Adaptation
by: Wu, Jason, et al.
Published: (2026)

CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)

3D Adaptive Structural Convolution Network for Domain-Invariant Point Cloud Recognition
by: Kim, Younggun, et al.
Published: (2024)

A Persistent Homology Design Space for 3D Point Cloud Deep Learning
by: Kudeshia, Prachi, et al.
Published: (2026)

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
by: Semenov, Andrei, et al.
Published: (2024)

CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution
by: Schiffer, Christian, et al.
Published: (2025)

Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
by: Wu, Kunlin, et al.
Published: (2026)

Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition
by: Nakamura, Ikuo
Published: (2024)

Robust Confidence Intervals in Stereo Matching using Possibility Theory
by: Malinowski, Roman, et al.
Published: (2024)

Few-Shot Learning of a Graph-Based Neural Network Model Without Backpropagation
by: Lapin, Mykyta, et al.
Published: (2025)

FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
by: Diller, Christian, et al.
Published: (2022)

FedWCM: Unleashing the Potential of Momentum-based Federated Learning in Long-Tailed Scenarios
by: Li, Tianle, et al.
Published: (2025)

Tricks and Plug-ins for Gradient Boosting in Image Classification
by: Fang, Biyi, et al.
Published: (2025)

TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
by: Nauen, Tobias Christian, et al.
Published: (2024)

PhysVid: Physics Aware Local Conditioning for Generative Video Models
by: Pathak, Saurabh, et al.
Published: (2026)

Adapting SAM with Dynamic Similarity Graphs for Few-Shot Parameter-Efficient Small Dense Object Detection: A Case Study of Chickpea Pods in Field Conditions
by: Jiang, Xintong, et al.
Published: (2025)

TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition
by: Hassan, Imtiaz Ul, et al.
Published: (2026)

DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification
by: Ho, Darryl, et al.
Published: (2025)

High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery
by: Peng, Hongxing, et al.
Published: (2025)

Cross-Domain Adversarial Augmentation: Stabilizing GANs for Medical and Handwriting Data Scarcity
by: Soad, Md. Sohanuzzaman, et al.
Published: (2026)

CARScenes: Semantic VLM Dataset for Safe Autonomous Driving
by: He, Yuankai, et al.
Published: (2025)

Predicting When to Trust Vision-Language Models for Spatial Reasoning
by: Imran, Muhammad, et al.
Published: (2026)

NV3D: Leveraging Spatial Shape Through Normal Vector-based 3D Object Detection
by: Chaowakarn, Krittin, et al.
Published: (2025)

Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey
by: Rajapaksha, Uchitha, et al.
Published: (2024)

CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)

LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation
by: Wei, Hualiang, et al.
Published: (2026)

Can Robots "Taste" Grapes? Estimating SSC with Simple RGB Sensors
by: Ciarfuglia, Thomas Alessandro, et al.
Published: (2024)

Butter: Frequency Consistency and Hierarchical Fusion for Autonomous Driving Object Detection
by: Lin, Xiaojian, et al.
Published: (2025)

Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix
by: Gurbindo, Unai, et al.
Published: (2025)

Implementing Adaptations for Vision AutoRegressive Model
by: Shaikh, Kaif, et al.
Published: (2025)

CASE: Contrastive Activation for Saliency Estimation
by: Williamson, Dane, et al.
Published: (2025)