| Main Author: | Merizzi, Fabio |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2403.08782 |
Similar Items
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation
by: Greco, Antonino, et al.
Published: (2024)
Mitigating analytical variability in fMRI results with style transfer
by: Germani, Elodie, et al.
Published: (2024)
Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks
by: Playout, Clément, et al.
Published: (2024)
Estimating forest carbon stocks from high-resolution remote sensing imagery by reducing domain shift with style transfer
by: Yu, Zhenyu, et al.
Published: (2025)
Mitigating attribute amplification in counterfactual image generation
by: Xia, Tian, et al.
Published: (2024)
WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
by: Le, Huy, et al.
Published: (2023)
SceneX: Procedural Controllable Large-scale Scene Generation
by: Zhou, Mengqi, et al.
Published: (2024)
PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs
by: Assouel, Rim, et al.
Published: (2026)
FlowExtract: Procedural Knowledge Extraction from Maintenance Flowcharts
by: de Avalle, Guillermo Gil, et al.
Published: (2026)
Less is More: Label-Guided Summarization of Procedural and Instructional Videos
by: Rajpal, Shreya, et al.
Published: (2026)
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
by: Yuan, Kun, et al.
Published: (2024)
CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
by: Zhang, Shougao, et al.
Published: (2024)
MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence
by: Chen, Yifan, et al.
Published: (2026)
Deep transfer learning for image classification: a survey
by: Plested, Jo, et al.
Published: (2022)
IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly
by: Wen, Di, et al.
Published: (2026)
ReXSonoVQA: A Video QA Benchmark for Procedure-Centric Ultrasound Understanding
by: Wang, Xucheng, et al.
Published: (2026)
Procedural Knowledge Extraction from Industrial Troubleshooting Guides Using Vision Language Models
by: de Avalle, Guillermo Gil, et al.
Published: (2026)
Enhancing targeted transferability via feature space fine-tuning
by: Zeng, Hui, et al.
Published: (2024)
An efficient plant disease detection using transfer learning approach
by: Sambana, Bosubabu, et al.
Published: (2025)
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett-Luce Ranking
by: Che, Chengan, et al.
Published: (2025)
ZigMa: A DiT-style Zigzag Mamba Diffusion Model
by: Hu, Vincent Tao, et al.
Published: (2024)
Can video generation replace cinematographers? Research on the cinematic language of generated video
by: Li, Xiaozhe, et al.
Published: (2024)
Imagine How To Change: Explicit Procedure Modeling for Change Captioning
by: Sun, Jiayang, et al.
Published: (2026)
EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control
by: Weng, Yuzhe, et al.
Published: (2026)
Facial beauty prediction fusing transfer learning and broad learning system
by: Gan, Junying, et al.
Published: (2026)
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation
by: Qian, Chenghao, et al.
Published: (2024)
CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition
by: Stilz, Florian, et al.
Published: (2026)
SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection
by: Kühn, Paul Julius, et al.
Published: (2026)
RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos
by: Zare, Ali, et al.
Published: (2024)
Guiding Video Prediction with Explicit Procedural Knowledge
by: Takenaka, Patrick, et al.
Published: (2024)
Flow caching for autoregressive video generation
by: Ma, Yuexiao, et al.
Published: (2026)
Left-Right Symmetry Breaking in CLIP-style Vision-Language Models Trained on Synthetic Spatial-Relation Data
by: Yamamoto, Takaki, et al.
Published: (2026)
FloraForge: LLM-Assisted Procedural Generation of Editable and Analysis-Ready 3D Plant Geometric Models For Agricultural Applications
by: Hadadi, Mozhgan, et al.
Published: (2025)
PreMind: Multi-Agent Video Understanding for Advanced Indexing of Presentation-style Videos
by: Wei, Kangda, et al.
Published: (2025)
Attention mechanisms and transfer learning for robust peach leaf damage classification under domain shift
by: Cánovas-Rodriguez, Adrián, et al.
Published: (2026)
Learning to Evaluate the Artness of AI-generated Images
by: Chen, Junyu, et al.
Published: (2023)
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
by: Si, Shengyu, et al.
Published: (2026)
AI-generated Image Quality Assessment in Visual Communication
by: Tian, Yu, et al.
Published: (2024)
Controlling the image generation process with parametric activation functions
by: Pavlov, Ilia
Published: (2025)
Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers
by: Dax, Maximilian, et al.
Published: (2025)