:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Merizzi, Fabio
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2403.08782
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A spatiotemporal style transfer algorithm for dynamic visual stimulus generation
by: Greco, Antonino, et al.
Published: (2024)

Mitigating analytical variability in fMRI results with style transfer
by: Germani, Elodie, et al.
Published: (2024)

Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks
by: Playout, Clément, et al.
Published: (2024)

Estimating forest carbon stocks from high-resolution remote sensing imagery by reducing domain shift with style transfer
by: Yu, Zhenyu, et al.
Published: (2025)

Mitigating attribute amplification in counterfactual image generation
by: Xia, Tian, et al.
Published: (2024)

WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
by: Le, Huy, et al.
Published: (2023)

SceneX: Procedural Controllable Large-scale Scene Generation
by: Zhou, Mengqi, et al.
Published: (2024)

PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs
by: Assouel, Rim, et al.
Published: (2026)

FlowExtract: Procedural Knowledge Extraction from Maintenance Flowcharts
by: de Avalle, Guillermo Gil, et al.
Published: (2026)

Less is More: Label-Guided Summarization of Procedural and Instructional Videos
by: Rajpal, Shreya, et al.
Published: (2026)

Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
by: Yuan, Kun, et al.
Published: (2024)

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
by: Zhang, Shougao, et al.
Published: (2024)

MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence
by: Chen, Yifan, et al.
Published: (2026)

Deep transfer learning for image classification: a survey
by: Plested, Jo, et al.
Published: (2022)

IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly
by: Wen, Di, et al.
Published: (2026)

ReXSonoVQA: A Video QA Benchmark for Procedure-Centric Ultrasound Understanding
by: Wang, Xucheng, et al.
Published: (2026)

Procedural Knowledge Extraction from Industrial Troubleshooting Guides Using Vision Language Models
by: de Avalle, Guillermo Gil, et al.
Published: (2026)

Enhancing targeted transferability via feature space fine-tuning
by: Zeng, Hui, et al.
Published: (2024)

An efficient plant disease detection using transfer learning approach
by: Sambana, Bosubabu, et al.
Published: (2025)

A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett-Luce Ranking
by: Che, Chengan, et al.
Published: (2025)

ZigMa: A DiT-style Zigzag Mamba Diffusion Model
by: Hu, Vincent Tao, et al.
Published: (2024)

Can video generation replace cinematographers? Research on the cinematic language of generated video
by: Li, Xiaozhe, et al.
Published: (2024)

Imagine How To Change: Explicit Procedure Modeling for Change Captioning
by: Sun, Jiayang, et al.
Published: (2026)

EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control
by: Weng, Yuzhe, et al.
Published: (2026)

Facial beauty prediction fusing transfer learning and broad learning system
by: Gan, Junying, et al.
Published: (2026)

WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation
by: Qian, Chenghao, et al.
Published: (2024)

CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition
by: Stilz, Florian, et al.
Published: (2026)

SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection
by: Kühn, Paul Julius, et al.
Published: (2026)

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos
by: Zare, Ali, et al.
Published: (2024)

Guiding Video Prediction with Explicit Procedural Knowledge
by: Takenaka, Patrick, et al.
Published: (2024)

Flow caching for autoregressive video generation
by: Ma, Yuexiao, et al.
Published: (2026)

Left-Right Symmetry Breaking in CLIP-style Vision-Language Models Trained on Synthetic Spatial-Relation Data
by: Yamamoto, Takaki, et al.
Published: (2026)

FloraForge: LLM-Assisted Procedural Generation of Editable and Analysis-Ready 3D Plant Geometric Models For Agricultural Applications
by: Hadadi, Mozhgan, et al.
Published: (2025)

PreMind: Multi-Agent Video Understanding for Advanced Indexing of Presentation-style Videos
by: Wei, Kangda, et al.
Published: (2025)

Attention mechanisms and transfer learning for robust peach leaf damage classification under domain shift
by: Cánovas-Rodriguez, Adrián, et al.
Published: (2026)

Learning to Evaluate the Artness of AI-generated Images
by: Chen, Junyu, et al.
Published: (2023)

VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
by: Si, Shengyu, et al.
Published: (2026)

AI-generated Image Quality Assessment in Visual Communication
by: Tian, Yu, et al.
Published: (2024)

Controlling the image generation process with parametric activation functions
by: Pavlov, Ilia
Published: (2025)

Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers
by: Dax, Maximilian, et al.
Published: (2025)