Saved in:
| Main Authors: | Okada, Shuntaro, Doi, Kenji, Yoshihashi, Ryota, Kataoka, Hirokatsu, Tanaka, Tomohiro |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.12188 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation
by: Yoshihashi, Ryota, et al.
Published: (2023)
by: Yoshihashi, Ryota, et al.
Published: (2023)
Spectrally-Guided Diffusion Noise Schedules
by: Esteves, Carlos, et al.
Published: (2026)
by: Esteves, Carlos, et al.
Published: (2026)
Simple Visual Artifact Detection in Sora-Generated Videos
by: Sugiyama, Misora, et al.
Published: (2025)
by: Sugiyama, Misora, et al.
Published: (2025)
3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach
by: Tanaka, Ryota, et al.
Published: (2024)
by: Tanaka, Ryota, et al.
Published: (2024)
Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models
by: Zhu, Peifei, et al.
Published: (2024)
by: Zhu, Peifei, et al.
Published: (2024)
Variational Trajectory Optimization of Anisotropic Diffusion Schedules
by: Liu, Pengxi, et al.
Published: (2026)
by: Liu, Pengxi, et al.
Published: (2026)
VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
by: Zhang, Jiahao, et al.
Published: (2024)
by: Zhang, Jiahao, et al.
Published: (2024)
MoireDB: Formula-generated Interference-fringe Image Dataset
by: Matsuo, Yuto, et al.
Published: (2025)
by: Matsuo, Yuto, et al.
Published: (2025)
Noise Scheduling as Information-Guided Allocation in Diffusion Training
by: Raya, Gabriel, et al.
Published: (2026)
by: Raya, Gabriel, et al.
Published: (2026)
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
by: Sabour, Amirmojtaba, et al.
Published: (2024)
by: Sabour, Amirmojtaba, et al.
Published: (2024)
Industrial Synthetic Segment Pre-training
by: Mae, Shinichi, et al.
Published: (2025)
by: Mae, Shinichi, et al.
Published: (2025)
S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
by: Kupyn, Orest, et al.
Published: (2025)
by: Kupyn, Orest, et al.
Published: (2025)
High Noise Scheduling is a Must
by: Gokmen, Mahmut S., et al.
Published: (2024)
by: Gokmen, Mahmut S., et al.
Published: (2024)
On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process
by: Iwase, Shun, et al.
Published: (2025)
by: Iwase, Shun, et al.
Published: (2025)
Human Action Recognition without Human
by: Kataoka, Hirokatsu, et al.
Published: (2016)
by: Kataoka, Hirokatsu, et al.
Published: (2016)
Improved Noise Schedule for Diffusion Training
by: Hang, Tiankai, et al.
Published: (2024)
by: Hang, Tiankai, et al.
Published: (2024)
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
by: Lin, Haonan, et al.
Published: (2024)
by: Lin, Haonan, et al.
Published: (2024)
Common Diffusion Noise Schedules and Sample Steps are Flawed
by: Lin, Shanchuan, et al.
Published: (2023)
by: Lin, Shanchuan, et al.
Published: (2023)
MILES: Modality-Informed Learning Rate Scheduler for Balancing Multimodal Learning
by: Guerra-Manzanares, Alejandro, et al.
Published: (2025)
by: Guerra-Manzanares, Alejandro, et al.
Published: (2025)
Can masking background and object reduce static bias for zero-shot action recognition?
by: Fukuzawa, Takumi, et al.
Published: (2025)
by: Fukuzawa, Takumi, et al.
Published: (2025)
Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes
by: Otsuka, Daichi, et al.
Published: (2025)
by: Otsuka, Daichi, et al.
Published: (2025)
Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering
by: Mori, Erika, et al.
Published: (2025)
by: Mori, Erika, et al.
Published: (2025)
What-Where Transformer: A Slot-Centric Visual Backbone for Concurrent Representation and Localization
by: Yoshihashi, Ryota, et al.
Published: (2026)
by: Yoshihashi, Ryota, et al.
Published: (2026)
Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
by: Kada, Masahiro, et al.
Published: (2026)
by: Kada, Masahiro, et al.
Published: (2026)
VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation
by: Tanaka, Ryota, et al.
Published: (2025)
by: Tanaka, Ryota, et al.
Published: (2025)
Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling
by: Cho, Hansam, et al.
Published: (2025)
by: Cho, Hansam, et al.
Published: (2025)
Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training
by: Mitkiy, Alan, et al.
Published: (2025)
by: Mitkiy, Alan, et al.
Published: (2025)
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
by: Choi, JunYong, et al.
Published: (2025)
by: Choi, JunYong, et al.
Published: (2025)
$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models
by: Park, Yong-Hyun, et al.
Published: (2024)
by: Park, Yong-Hyun, et al.
Published: (2024)
Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation
by: Tadokoro, Ryu, et al.
Published: (2024)
by: Tadokoro, Ryu, et al.
Published: (2024)
HanDyVQA: A Video QA Benchmark for Fine-Grained Hand-Object Interaction Dynamics
by: Tateno, Masatoshi, et al.
Published: (2025)
by: Tateno, Masatoshi, et al.
Published: (2025)
AnimalClue: Recognizing Animals by their Traces
by: Shinoda, Risa, et al.
Published: (2025)
by: Shinoda, Risa, et al.
Published: (2025)
AgroBench: Vision-Language Model Benchmark in Agriculture
by: Shinoda, Risa, et al.
Published: (2025)
by: Shinoda, Risa, et al.
Published: (2025)
PowerCLIP: Powerset Alignment for Contrastive Pre-Training
by: Kawamura, Masaki, et al.
Published: (2025)
by: Kawamura, Masaki, et al.
Published: (2025)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability
by: Suzuki, Tomohiro, et al.
Published: (2025)
by: Suzuki, Tomohiro, et al.
Published: (2025)
Analysis of Classifier-Free Guidance Weight Schedulers
by: Wang, Xi, et al.
Published: (2024)
by: Wang, Xi, et al.
Published: (2024)
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
by: Yu, Peiyu, et al.
Published: (2025)
by: Yu, Peiyu, et al.
Published: (2025)
SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
by: Trinh, Luan Thanh, et al.
Published: (2025)
by: Trinh, Luan Thanh, et al.
Published: (2025)
NODI: Out-Of-Distribution Detection with Noise from Diffusion
by: Zhou, Jingqiu, et al.
Published: (2024)
by: Zhou, Jingqiu, et al.
Published: (2024)
FDIF: Formula-Driven supervised Learning with Implicit Functions for 3D Medical Image Segmentation
by: Yamamoto, Yukinori, et al.
Published: (2026)
by: Yamamoto, Yukinori, et al.
Published: (2026)
Similar Items
-
Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation
by: Yoshihashi, Ryota, et al.
Published: (2023) -
Spectrally-Guided Diffusion Noise Schedules
by: Esteves, Carlos, et al.
Published: (2026) -
Simple Visual Artifact Detection in Sora-Generated Videos
by: Sugiyama, Misora, et al.
Published: (2025) -
3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach
by: Tanaka, Ryota, et al.
Published: (2024) -
Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models
by: Zhu, Peifei, et al.
Published: (2024)