Guardado en:
| Autores principales: | Hamdi, Laziz, Tamasna, Amine, Boisson, Pascal, Paquet, Thierry |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2605.22422 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
PILOT: A Promptable Interleaved Layout-aware OCR Transformer
por: Hamdi, Laziz, et al.
Publicado: (2025)
por: Hamdi, Laziz, et al.
Publicado: (2025)
TableSeq: Unified Generation of Structure, Content, and Layout
por: Hamdi, Laziz, et al.
Publicado: (2026)
por: Hamdi, Laziz, et al.
Publicado: (2026)
DenTab: A Dataset for Table Recognition and Visual QA on Real-World Dental Estimates
por: Hamdi, Laziz, et al.
Publicado: (2026)
por: Hamdi, Laziz, et al.
Publicado: (2026)
FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
por: Liao, Pan, et al.
Publicado: (2024)
por: Liao, Pan, et al.
Publicado: (2024)
TinyDrop: Tiny Model Guided Token Dropping for Vision Transformers
por: Wang, Guoxin, et al.
Publicado: (2025)
por: Wang, Guoxin, et al.
Publicado: (2025)
Few-shot Writer Adaptation via Multimodal In-Context Learning
por: Simon, Tom, et al.
Publicado: (2026)
por: Simon, Tom, et al.
Publicado: (2026)
Deterministic Continuous Replacement: Fast and Stable Module Replacement in Pretrained Transformers
por: Bradbury, Rowan, et al.
Publicado: (2025)
por: Bradbury, Rowan, et al.
Publicado: (2025)
GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers
por: Prospero, Lorenza, et al.
Publicado: (2024)
por: Prospero, Lorenza, et al.
Publicado: (2024)
MVTN: Learning Multi-View Transformations for 3D Understanding
por: Hamdi, Abdullah, et al.
Publicado: (2022)
por: Hamdi, Abdullah, et al.
Publicado: (2022)
Patch Rebirth: Toward Fast and Transferable Model Inversion of Vision Transformers
por: Heo, Seongsoo, et al.
Publicado: (2025)
por: Heo, Seongsoo, et al.
Publicado: (2025)
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos
por: Xu, Qingyu, et al.
Publicado: (2024)
por: Xu, Qingyu, et al.
Publicado: (2024)
Recognize Any Regions
por: Yang, Haosen, et al.
Publicado: (2023)
por: Yang, Haosen, et al.
Publicado: (2023)
FastInit: Fast Noise Initialization for Temporally Consistent Video Generation
por: Bai, Chengyu, et al.
Publicado: (2025)
por: Bai, Chengyu, et al.
Publicado: (2025)
DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models
por: Yu, Hu, et al.
Publicado: (2025)
por: Yu, Hu, et al.
Publicado: (2025)
Fast Training of Diffusion Models with Masked Transformers
por: Zheng, Hongkai, et al.
Publicado: (2023)
por: Zheng, Hongkai, et al.
Publicado: (2023)
Fast Occupancy Network
por: Lu, Mingjie, et al.
Publicado: (2024)
por: Lu, Mingjie, et al.
Publicado: (2024)
CF3: Compact and Fast 3D Feature Fields
por: Lee, Hyunjoon, et al.
Publicado: (2025)
por: Lee, Hyunjoon, et al.
Publicado: (2025)
Tiny-Engram: Trigger-Indexed Concept Tables for Generative Vision
por: Cai, Runyuan, et al.
Publicado: (2026)
por: Cai, Runyuan, et al.
Publicado: (2026)
TinyFusion: Diffusion Transformers Learned Shallow
por: Fang, Gongfan, et al.
Publicado: (2024)
por: Fang, Gongfan, et al.
Publicado: (2024)
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
por: Tian, Yunjie, et al.
Publicado: (2022)
por: Tian, Yunjie, et al.
Publicado: (2022)
StreamTinyNet: video streaming analysis with spatial-temporal TinyML
por: Shalby, Hazem Hesham Yousef, et al.
Publicado: (2024)
por: Shalby, Hazem Hesham Yousef, et al.
Publicado: (2024)
FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge
por: Shen, Xuan, et al.
Publicado: (2025)
por: Shen, Xuan, et al.
Publicado: (2025)
FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework
por: Hu, Haotian, et al.
Publicado: (2025)
por: Hu, Haotian, et al.
Publicado: (2025)
FasterViT: Fast Vision Transformers with Hierarchical Attention
por: Hatamizadeh, Ali, et al.
Publicado: (2023)
por: Hatamizadeh, Ali, et al.
Publicado: (2023)
FastV-RAG: Towards Fast and Fine-Grained Video QA with Retrieval-Augmented Generation
por: Li, Gen, et al.
Publicado: (2026)
por: Li, Gen, et al.
Publicado: (2026)
TinyFormer: Preserving Tiny Objects in YOLO-DETR Hybrid Real-time Detectors
por: Hsieh, Jun-Wei, et al.
Publicado: (2026)
por: Hsieh, Jun-Wei, et al.
Publicado: (2026)
Tiny-ViT: A Compact Vision Transformer for Efficient and Explainable Potato Leaf Disease Classification
por: Mia, Shakil, et al.
Publicado: (2026)
por: Mia, Shakil, et al.
Publicado: (2026)
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
por: Liu, Dong, et al.
Publicado: (2025)
por: Liu, Dong, et al.
Publicado: (2025)
Accurate and Fast Compressed Video Captioning
por: Shen, Yaojie, et al.
Publicado: (2023)
por: Shen, Yaojie, et al.
Publicado: (2023)
Dynamics-Aware Gaussian Splatting Streaming Towards Fast On-the-Fly 4D Reconstruction
por: Liu, Zhening, et al.
Publicado: (2024)
por: Liu, Zhening, et al.
Publicado: (2024)
ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model
por: Xu, Hongbin, et al.
Publicado: (2024)
por: Xu, Hongbin, et al.
Publicado: (2024)
Fast Autoregressive Video Generation with Diagonal Decoding
por: Ye, Yang, et al.
Publicado: (2025)
por: Ye, Yang, et al.
Publicado: (2025)
DEIM: DETR with Improved Matching for Fast Convergence
por: Huang, Shihua, et al.
Publicado: (2024)
por: Huang, Shihua, et al.
Publicado: (2024)
Turbo4DGen: Ultra-Fast Acceleration for 4D Generation
por: Man, Yuanbin, et al.
Publicado: (2026)
por: Man, Yuanbin, et al.
Publicado: (2026)
FastBO: Fast HPO and NAS with Adaptive Fidelity Identification
por: Jiang, Jiantong, et al.
Publicado: (2024)
por: Jiang, Jiantong, et al.
Publicado: (2024)
Noise-Robust Tiny Object Localization with Flows
por: Sun, Huixin, et al.
Publicado: (2026)
por: Sun, Huixin, et al.
Publicado: (2026)
TinyViT-Batten: Few-Shot Vision Transformer with Explainable Attention for Early Batten-Disease Detection on Pediatric MRI
por: Uppalapati, Khartik, et al.
Publicado: (2025)
por: Uppalapati, Khartik, et al.
Publicado: (2025)
Integrating Features for Recognizing Human Activities through Optimized Parameters in Graph Convolutional Networks and Transformer Architectures
por: Belal, Mohammad, et al.
Publicado: (2024)
por: Belal, Mohammad, et al.
Publicado: (2024)
Fast Registration of Photorealistic Avatars for VR Facial Animation
por: Patel, Chaitanya, et al.
Publicado: (2024)
por: Patel, Chaitanya, et al.
Publicado: (2024)
SFMViT: SlowFast Meet ViT in Chaotic World
por: Lin, Jiaying, et al.
Publicado: (2024)
por: Lin, Jiaying, et al.
Publicado: (2024)
Ejemplares similares
-
PILOT: A Promptable Interleaved Layout-aware OCR Transformer
por: Hamdi, Laziz, et al.
Publicado: (2025) -
TableSeq: Unified Generation of Structure, Content, and Layout
por: Hamdi, Laziz, et al.
Publicado: (2026) -
DenTab: A Dataset for Table Recognition and Visual QA on Real-World Dental Estimates
por: Hamdi, Laziz, et al.
Publicado: (2026) -
FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
por: Liao, Pan, et al.
Publicado: (2024) -
TinyDrop: Tiny Model Guided Token Dropping for Vision Transformers
por: Wang, Guoxin, et al.
Publicado: (2025)