:: Library Catalog

Bibliographic Details
Main Authors:	Hu, Jia Cheng; Cavicchioli, Roberto; Capotondi, Alessandro
Format:	Preprint
Published:	2022
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2208.06551

Similar Items

Shifted Window Fourier Transform And Retention For Image Captioning
by: Hu, Jia Cheng, et al.
Published: (2024)

Diffusion Is Your Friend in Show, Suggest and Tell
by: Hu, Jia Cheng, et al.
Published: (2025)

Enhancing Traffic Safety with Parallel Dense Video Captioning for End-to-End Event Analysis
by: Shoman, Maged, et al.
Published: (2024)

RankE: End-to-End Post-Training for Discrete Text-to-Image Generation with Decoder Co-Evolution
by: Jian, Siyong, et al.
Published: (2026)

The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025)

Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning
by: Shi, Yudi, et al.
Published: (2026)

Exploiting Auxiliary Caption for Video Grounding
by: Li, Hongxiang, et al.
Published: (2023)

LPSNet: End-to-End Human Pose and Shape Estimation with Lensless Imaging
by: Ge, Haoyang, et al.
Published: (2024)

OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
by: Li, Jinyang, et al.
Published: (2025)

Tracking by Detection and Query: An Efficient End-to-End Framework for Multi-Object Tracking
by: Jia, Shukun, et al.
Published: (2024)

StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation
by: Ragusa, Francesco, et al.
Published: (2023)

End-to-End Semantic Preservation in Text-Aware Image Compression Systems
by: Della Fiore, Stefano, et al.
Published: (2025)

SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows
by: Zhao, Qinyu, et al.
Published: (2025)

End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
by: Guo, Yuwei, et al.
Published: (2025)

Training Multi-Image Vision Agents via End2End Reinforcement Learning
by: Dong, Chengqi, et al.
Published: (2025)

DLAFormer: An End-to-End Transformer For Document Layout Analysis
by: Wang, Jiawei, et al.
Published: (2024)

An Effective End-to-End Solution for Multimodal Action Recognition
by: Wang, Songping, et al.
Published: (2025)

An End-to-End Real-World Camera Imaging Pipeline
by: Xu, Kepeng, et al.
Published: (2024)

REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching
by: Nie, Han, et al.
Published: (2024)

ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay
by: Lu, Fanbin, et al.
Published: (2025)

Fully Unified Motion Planning for End-to-End Autonomous Driving
by: Liu, Lin, et al.
Published: (2025)

Efficient End-to-End Visual Document Understanding with Rationale Distillation
by: Zhu, Wang, et al.
Published: (2023)

GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving
by: Liu, Lin, et al.
Published: (2025)

End to End AI System for Surgical Gesture Sequence Recognition and Clinical Outcome Prediction
by: Li, Xi, et al.
Published: (2025)

Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression
by: Khatib, Fadi, et al.
Published: (2022)

Bidirectional Awareness Induction in Autoregressive Seq2Seq Models
by: Hu, Jia Cheng, et al.
Published: (2024)

End-to-End Chess Recognition
by: Masouris, Athanasios, et al.
Published: (2023)

OPERA: An Agent for Image Restoration with End-to-End Joint Planning-Execution Optimization
by: Zhu, Feng, et al.
Published: (2026)

SMTrack: End-to-End Trained Spiking Neural Networks for Multi-Object Tracking in RGB Videos
by: Zhong, Pengzhi, et al.
Published: (2025)

Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving
by: Liu, Lin, et al.
Published: (2025)

Generative Scenario Rollouts for End-to-End Autonomous Driving
by: Yasarla, Rajeev, et al.
Published: (2026)

Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
by: Cheng, Sheng, et al.
Published: (2024)

SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking
by: Lin, Shubo, et al.
Published: (2024)

End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
by: Ahmed, Yeruru Asrar, et al.
Published: (2025)

Fast-SmartWay: Panoramic-Free End-to-End Zero-Shot Vision-and-Language Navigation
by: Shi, Xiangyu, et al.
Published: (2025)

Active Learning from Scene Embeddings for End-to-End Autonomous Driving
by: Jiang, Wenhao, et al.
Published: (2025)

SceneLCM: End-to-End Layout-Guided Interactive Indoor Scene Generation with Latent Consistency Model
by: Lin, Yangkai, et al.
Published: (2025)

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology
by: Tang, Wenhao, et al.
Published: (2025)

FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning
by: Cao, Jiajun, et al.
Published: (2025)

Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage
by: Sun, Zhengwentai, et al.
Published: (2025)