:: Library Catalog

Bibliographic Details
Main Authors:	Okada, Shuntaro, Doi, Kenji, Yoshihashi, Ryota, Kataoka, Hirokatsu, Tanaka, Tomohiro
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2411.12188

Similar Items

Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation
by: Yoshihashi, Ryota, et al.
Published: (2023)

Spectrally-Guided Diffusion Noise Schedules
by: Esteves, Carlos, et al.
Published: (2026)

Simple Visual Artifact Detection in Sora-Generated Videos
by: Sugiyama, Misora, et al.
Published: (2025)

3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach
by: Tanaka, Ryota, et al.
Published: (2024)

Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models
by: Zhu, Peifei, et al.
Published: (2024)

Variational Trajectory Optimization of Anisotropic Diffusion Schedules
by: Liu, Pengxi, et al.
Published: (2026)

VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
by: Zhang, Jiahao, et al.
Published: (2024)

MoireDB: Formula-generated Interference-fringe Image Dataset
by: Matsuo, Yuto, et al.
Published: (2025)

Noise Scheduling as Information-Guided Allocation in Diffusion Training
by: Raya, Gabriel, et al.
Published: (2026)

Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
by: Sabour, Amirmojtaba, et al.
Published: (2024)

Industrial Synthetic Segment Pre-training
by: Mae, Shinichi, et al.
Published: (2025)

S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
by: Kupyn, Orest, et al.
Published: (2025)

High Noise Scheduling is a Must
by: Gokmen, Mahmut S., et al.
Published: (2024)

On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process
by: Iwase, Shun, et al.
Published: (2025)

Human Action Recognition without Human
by: Kataoka, Hirokatsu, et al.
Published: (2016)

Improved Noise Schedule for Diffusion Training
by: Hang, Tiankai, et al.
Published: (2024)

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
by: Lin, Haonan, et al.
Published: (2024)

Common Diffusion Noise Schedules and Sample Steps are Flawed
by: Lin, Shanchuan, et al.
Published: (2023)

MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
by: Guerra-Manzanares, Alejandro, et al.
Published: (2025)

Can masking background and object reduce static bias for zero-shot action recognition?
by: Fukuzawa, Takumi, et al.
Published: (2025)

Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes
by: Otsuka, Daichi, et al.
Published: (2025)

Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering
by: Mori, Erika, et al.
Published: (2025)

What-Where Transformer: A Slot-Centric Visual Backbone for Concurrent Representation and Localization
by: Yoshihashi, Ryota, et al.
Published: (2026)

Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
by: Kada, Masahiro, et al.
Published: (2026)

VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation
by: Tanaka, Ryota, et al.
Published: (2025)

Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling
by: Cho, Hansam, et al.
Published: (2025)

Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training
by: Mitkiy, Alan, et al.
Published: (2025)

Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
by: Choi, JunYong, et al.
Published: (2025)

$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models
by: Park, Yong-Hyun, et al.
Published: (2024)

Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation
by: Tadokoro, Ryu, et al.
Published: (2024)

HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
by: Tateno, Masatoshi, et al.
Published: (2025)

AnimalClue: Recognizing Animals by their Traces
by: Shinoda, Risa, et al.
Published: (2025)

AgroBench: Vision-Language Model Benchmark in Agriculture
by: Shinoda, Risa, et al.
Published: (2025)

PowerCLIP: Powerset Alignment for Contrastive Pre-Training
by: Kawamura, Masaki, et al.
Published: (2025)

AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability
by: Suzuki, Tomohiro, et al.
Published: (2025)

Analysis of Classifier-Free Guidance Weight Schedulers
by: Wang, Xi, et al.
Published: (2024)

Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
by: Yu, Peiyu, et al.
Published: (2025)

SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
by: Trinh, Luan Thanh, et al.
Published: (2025)

NODI: Out-Of-Distribution Detection with Noise from Diffusion
by: Zhou, Jingqiu, et al.
Published: (2024)

FDIF: Formula-Driven supervised Learning with Implicit Functions for 3D Medical Image Segmentation
by: Yamamoto, Yukinori, et al.
Published: (2026)