:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kawano, Rinka, Kawamura, Masaki
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2601.18385
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PowerCLIP: Powerset Alignment for Contrastive Pre-Training
by: Kawamura, Masaki, et al.
Published: (2025)

MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024)

TAG: Guidance-free Open-Vocabulary Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024)

Beyond flattening: a geometrically principled positional encoding for vision transformers with Weierstrass elliptic functions
by: Xin, Zhihang, et al.
Published: (2025)

BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
by: Zhang, Francis Xiatian, et al.
Published: (2025)

Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras
by: Kawano, Takumi, et al.
Published: (2026)

LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes
by: Suk, Julian, et al.
Published: (2024)

HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024)

Learning Fourier shapes to probe the geometric world of deep neural networks
by: Wang, Jian, et al.
Published: (2025)

A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
by: Inadumi, Shun, et al.
Published: (2024)

MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition
by: Kawamura, Ryosuke, et al.
Published: (2025)

ManzaiSet: A Multimodal Dataset of Viewer Responses to Japanese Manzai Comedy
by: Kawamura, Kazuki, et al.
Published: (2025)

ACCURATE: Arbitrary-shaped Continuum Reconstruction Under Robust Adaptive Two-view Estimation
by: Zhang, Yaozhi, et al.
Published: (2026)

Multiple weather images restoration using the task transformer and adaptive mixup strategy
by: Wen, Yang, et al.
Published: (2024)

Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation
by: Fujimori, Izumi, et al.
Published: (2024)

Unlocking Noise-Resistant Vision: Key Architectural Secrets for Robust Models
by: Kim, Bum Jun, et al.
Published: (2025)

Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop
by: Qian, Zhaofang, et al.
Published: (2024)

Minimal Sufficient Views: A DNN model making predictions with more evidence has higher accuracy
by: Kawano, Keisuke, et al.
Published: (2024)

Deepfake detection in videos with multiple faces using geometric-fakeness features
by: Vyshegorodtsev, Kirill, et al.
Published: (2024)

NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections
by: Korkeala, Juho, et al.
Published: (2025)

Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation
by: Kawamura, Ryosuke, et al.
Published: (2025)

Oriented-grid Encoder for 3D Implicit Representations
by: Gaur, Arihant, et al.
Published: (2024)

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression
by: Chen, Yihang, et al.
Published: (2024)

X-ray illicit object detection using hybrid CNN-transformer neural network architectures
by: Cani, Jorgen, et al.
Published: (2025)

PyCellMech: A shape-based feature extraction pipeline for use in medical and biological studies
by: Arslan, Janan, et al.
Published: (2024)

FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models
by: Galanakis, Stathis, et al.
Published: (2023)

Scale-interaction transformer: a hybrid cnn-transformer model for facial beauty prediction
by: Boukhari, Djamel Eddine
Published: (2025)

FastPerson: Enhancing Video Learning through Effective Video Summarization that Preserves Linguistic and Visual Contexts
by: Kawamura, Kazuki, et al.
Published: (2024)

Generating grid maps via the snake model
by: Wei, Zhiwei, et al.
Published: (2024)

Multi-instance robust fitting for non-classical geometric models
by: Zhang, Zongliang, et al.
Published: (2026)

Research on geometric figure classification algorithm based on Deep Learning
by: Wang, Ruiyang, et al.
Published: (2024)

Few-Part-Shot Font Generation
by: Akiba, Masaki, et al.
Published: (2025)

Rotation center identification based on geometric relationships for rotary motion deblurring
by: Qin, Jinhui, et al.
Published: (2024)

Non rigid geometric distortions correction -- Application to atmospheric turbulence stabilization
by: Mao, Yu, et al.
Published: (2024)

Posterior shape models revisited: Improving 3D reconstructions from partial data using target specific models
by: Aellen, Jonathan, et al.
Published: (2025)

Unified theory for joint covariance properties under geometric image transformations for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields
by: Lindeberg, Tony
Published: (2023)

Unsupervised-learning-based method for chest MRI-CT transformation using structure constrained unsupervised generative attention networks
by: Matsuo, Hidetoshi, et al.
Published: (2021)

LoGDesc: Local geometric features aggregation for robust point cloud registration
by: Slimani, Karim, et al.
Published: (2024)

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
by: Zhang, Xiaohan, et al.
Published: (2023)

The MCC approaches the geometric mean of precision and recall as true negatives approach infinity
by: Crall, Jon
Published: (2023)