Saved in:
| Main Authors: | Kawano, Rinka, Kawamura, Masaki |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.18385 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PowerCLIP: Powerset Alignment for Contrastive Pre-Training
by: Kawamura, Masaki, et al.
Published: (2025)
by: Kawamura, Masaki, et al.
Published: (2025)
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024)
by: Kawano, Yasufumi, et al.
Published: (2024)
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024)
by: Kawano, Yasufumi, et al.
Published: (2024)
Beyond flattening: a geometrically principled positional encoding for vision transformers with Weierstrass elliptic functions
by: Xin, Zhihang, et al.
Published: (2025)
by: Xin, Zhihang, et al.
Published: (2025)
BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
by: Zhang, Francis Xiatian, et al.
Published: (2025)
by: Zhang, Francis Xiatian, et al.
Published: (2025)
Breaking the Scalability Limit of Multi-Projector Calibration with Embedded Cameras
by: Kawano, Takumi, et al.
Published: (2026)
by: Kawano, Takumi, et al.
Published: (2026)
LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes
by: Suk, Julian, et al.
Published: (2024)
by: Suk, Julian, et al.
Published: (2024)
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024)
by: Jiang, Chengjie, et al.
Published: (2024)
Learning Fourier shapes to probe the geometric world of deep neural networks
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
by: Inadumi, Shun, et al.
Published: (2024)
by: Inadumi, Shun, et al.
Published: (2024)
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition
by: Kawamura, Ryosuke, et al.
Published: (2025)
by: Kawamura, Ryosuke, et al.
Published: (2025)
ManzaiSet: A Multimodal Dataset of Viewer Responses to Japanese Manzai Comedy
by: Kawamura, Kazuki, et al.
Published: (2025)
by: Kawamura, Kazuki, et al.
Published: (2025)
ACCURATE: Arbitrary-shaped Continuum Reconstruction Under Robust Adaptive Two-view Estimation
by: Zhang, Yaozhi, et al.
Published: (2026)
by: Zhang, Yaozhi, et al.
Published: (2026)
Multiple weather images restoration using the task transformer and adaptive mixup strategy
by: Wen, Yang, et al.
Published: (2024)
by: Wen, Yang, et al.
Published: (2024)
Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation
by: Fujimori, Izumi, et al.
Published: (2024)
by: Fujimori, Izumi, et al.
Published: (2024)
Unlocking Noise-Resistant Vision: Key Architectural Secrets for Robust Models
by: Kim, Bum Jun, et al.
Published: (2025)
by: Kim, Bum Jun, et al.
Published: (2025)
Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop
by: Qian, Zhaofang, et al.
Published: (2024)
by: Qian, Zhaofang, et al.
Published: (2024)
Minimal Sufficient Views: A DNN model making predictions with more evidence has higher accuracy
by: Kawano, Keisuke, et al.
Published: (2024)
by: Kawano, Keisuke, et al.
Published: (2024)
Deepfake detection in videos with multiple faces using geometric-fakeness features
by: Vyshegorodtsev, Kirill, et al.
Published: (2024)
by: Vyshegorodtsev, Kirill, et al.
Published: (2024)
NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections
by: Korkeala, Juho, et al.
Published: (2025)
by: Korkeala, Juho, et al.
Published: (2025)
Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation
by: Kawamura, Ryosuke, et al.
Published: (2025)
by: Kawamura, Ryosuke, et al.
Published: (2025)
Oriented-grid Encoder for 3D Implicit Representations
by: Gaur, Arihant, et al.
Published: (2024)
by: Gaur, Arihant, et al.
Published: (2024)
HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression
by: Chen, Yihang, et al.
Published: (2024)
by: Chen, Yihang, et al.
Published: (2024)
X-ray illicit object detection using hybrid CNN-transformer neural network architectures
by: Cani, Jorgen, et al.
Published: (2025)
by: Cani, Jorgen, et al.
Published: (2025)
PyCellMech: A shape-based feature extraction pipeline for use in medical and biological studies
by: Arslan, Janan, et al.
Published: (2024)
by: Arslan, Janan, et al.
Published: (2024)
FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models
by: Galanakis, Stathis, et al.
Published: (2023)
by: Galanakis, Stathis, et al.
Published: (2023)
Scale-interaction transformer: a hybrid cnn-transformer model for facial beauty prediction
by: Boukhari, Djamel Eddine
Published: (2025)
by: Boukhari, Djamel Eddine
Published: (2025)
FastPerson: Enhancing Video Learning through Effective Video Summarization that Preserves Linguistic and Visual Contexts
by: Kawamura, Kazuki, et al.
Published: (2024)
by: Kawamura, Kazuki, et al.
Published: (2024)
Generating grid maps via the snake model
by: Wei, Zhiwei, et al.
Published: (2024)
by: Wei, Zhiwei, et al.
Published: (2024)
Multi-instance robust fitting for non-classical geometric models
by: Zhang, Zongliang, et al.
Published: (2026)
by: Zhang, Zongliang, et al.
Published: (2026)
Research on geometric figure classification algorithm based on Deep Learning
by: Wang, Ruiyang, et al.
Published: (2024)
by: Wang, Ruiyang, et al.
Published: (2024)
Few-Part-Shot Font Generation
by: Akiba, Masaki, et al.
Published: (2025)
by: Akiba, Masaki, et al.
Published: (2025)
Rotation center identification based on geometric relationships for rotary motion deblurring
by: Qin, Jinhui, et al.
Published: (2024)
by: Qin, Jinhui, et al.
Published: (2024)
Non rigid geometric distortions correction -- Application to atmospheric turbulence stabilization
by: Mao, Yu, et al.
Published: (2024)
by: Mao, Yu, et al.
Published: (2024)
Posterior shape models revisited: Improving 3D reconstructions from partial data using target specific models
by: Aellen, Jonathan, et al.
Published: (2025)
by: Aellen, Jonathan, et al.
Published: (2025)
Unified theory for joint covariance properties under geometric image transformations for spatio-temporal receptive fields according to the generalized Gaussian derivative model for visual receptive fields
by: Lindeberg, Tony
Published: (2023)
by: Lindeberg, Tony
Published: (2023)
Unsupervised-learning-based method for chest MRI-CT transformation using structure constrained unsupervised generative attention networks
by: Matsuo, Hidetoshi, et al.
Published: (2021)
by: Matsuo, Hidetoshi, et al.
Published: (2021)
LoGDesc: Local geometric features aggregation for robust point cloud registration
by: Slimani, Karim, et al.
Published: (2024)
by: Slimani, Karim, et al.
Published: (2024)
GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
by: Zhang, Xiaohan, et al.
Published: (2023)
by: Zhang, Xiaohan, et al.
Published: (2023)
The MCC approaches the geometric mean of precision and recall as true negatives approach infinity
by: Crall, Jon
Published: (2023)
by: Crall, Jon
Published: (2023)
Similar Items
-
PowerCLIP: Powerset Alignment for Contrastive Pre-Training
by: Kawamura, Masaki, et al.
Published: (2025) -
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024) -
TAG: Guidance-free Open-Vocabulary Semantic Segmentation
by: Kawano, Yasufumi, et al.
Published: (2024) -
Beyond flattening: a geometrically principled positional encoding for vision transformers with Weierstrass elliptic functions
by: Xin, Zhihang, et al.
Published: (2025) -
BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
by: Zhang, Francis Xiatian, et al.
Published: (2025)