:: Library Catalog

Bibliographic Details
Main Authors:	Hamdi, Laziz; Tamasna, Amine; Boisson, Pascal; Paquet, Thierry
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.22422

Similar Items

PILOT: A Promptable Interleaved Layout-aware OCR Transformer
by: Hamdi, Laziz, et al.
Published: (2025)

TableSeq: Unified Generation of Structure, Content, and Layout
by: Hamdi, Laziz, et al.
Published: (2026)

DenTab: A Dataset for Table Recognition and Visual QA on Real-World Dental Estimates
by: Hamdi, Laziz, et al.
Published: (2026)

FastTrackTr: Towards Fast Multi-Object Tracking with Transformers
by: Liao, Pan, et al.
Published: (2024)

TinyDrop: Tiny Model Guided Token Dropping for Vision Transformers
by: Wang, Guoxin, et al.
Published: (2025)

Few-shot Writer Adaptation via Multimodal In-Context Learning
by: Simon, Tom, et al.
Published: (2026)

Deterministic Continuous Replacement: Fast and Stable Module Replacement in Pretrained Transformers
by: Bradbury, Rowan, et al.
Published: (2025)

GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers
by: Prospero, Lorenza, et al.
Published: (2024)

MVTN: Learning Multi-View Transformations for 3D Understanding
by: Hamdi, Abdullah, et al.
Published: (2022)

Patch Rebirth: Toward Fast and Transferable Model Inversion of Vision Transformers
by: Heo, Seongsoo, et al.
Published: (2025)

Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos
by: Xu, Qingyu, et al.
Published: (2024)

Recognize Any Regions
by: Yang, Haosen, et al.
Published: (2023)

FastInit: Fast Noise Initialization for Temporally Consistent Video Generation
by: Bai, Chengyu, et al.
Published: (2025)

DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models
by: Yu, Hu, et al.
Published: (2025)

Fast Training of Diffusion Models with Masked Transformers
by: Zheng, Hongkai, et al.
Published: (2023)

Fast Occupancy Network
by: Lu, Mingjie, et al.
Published: (2024)

CF3: Compact and Fast 3D Feature Fields
by: Lee, Hyunjoon, et al.
Published: (2025)

Tiny-Engram: Trigger-Indexed Concept Tables for Generative Vision
by: Cai, Runyuan, et al.
Published: (2026)

TinyFusion: Diffusion Transformers Learned Shallow
by: Fang, Gongfan, et al.
Published: (2024)

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
by: Tian, Yunjie, et al.
Published: (2022)

StreamTinyNet: video streaming analysis with spatial-temporal TinyML
by: Shalby, Hazem Hesham Yousef, et al.
Published: (2024)

FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge
by: Shen, Xuan, et al.
Published: (2025)

FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework
by: Hu, Haotian, et al.
Published: (2025)

FasterViT: Fast Vision Transformers with Hierarchical Attention
by: Hatamizadeh, Ali, et al.
Published: (2023)

FastV-RAG: Towards Fast and Fine-Grained Video QA with Retrieval-Augmented Generation
by: Li, Gen, et al.
Published: (2026)

TinyFormer: Preserving Tiny Objects in YOLO-DETR Hybrid Real-time Detectors
by: Hsieh, Jun-Wei, et al.
Published: (2026)

Tiny-ViT: A Compact Vision Transformer for Efficient and Explainable Potato Leaf Disease Classification
by: Mia, Shakil, et al.
Published: (2026)

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
by: Liu, Dong, et al.
Published: (2025)

Accurate and Fast Compressed Video Captioning
by: Shen, Yaojie, et al.
Published: (2023)

Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly 4D Reconstruction
by: Liu, Zhening, et al.
Published: (2024)

ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model
by: Xu, Hongbin, et al.
Published: (2024)

Fast Autoregressive Video Generation with Diagonal Decoding
by: Ye, Yang, et al.
Published: (2025)

DEIM: DETR with Improved Matching for Fast Convergence
by: Huang, Shihua, et al.
Published: (2024)

Turbo4DGen: Ultra-Fast Acceleration for 4D Generation
by: Man, Yuanbin, et al.
Published: (2026)

FastBO: Fast HPO and NAS with Adaptive Fidelity Identification
by: Jiang, Jiantong, et al.
Published: (2024)

Noise-Robust Tiny Object Localization with Flows
by: Sun, Huixin, et al.
Published: (2026)

TinyViT-Batten: Few-Shot Vision Transformer with Explainable Attention for Early Batten-Disease Detection on Pediatric MRI
by: Uppalapati, Khartik, et al.
Published: (2025)

Integrating Features for Recognizing Human Activities through Optimized Parameters in Graph Convolutional Networks and Transformer Architectures
by: Belal, Mohammad, et al.
Published: (2024)

Fast Registration of Photorealistic Avatars for VR Facial Animation
by: Patel, Chaitanya, et al.
Published: (2024)

SFMViT: SlowFast Meet ViT in Chaotic World
by: Lin, Jiaying, et al.
Published: (2024)