:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Li, Jiaze, Shi, Yaya, Ma, Zongyang, Xu, Haoran, Cheng, Feng, Xiao, Huihui, Kang, Ruiwen, Yang, Fan, Gao, Tingting, Zhang, Di
Formato:	Preprint
Publicado:	2025
Materias:	Computer Vision and Pattern Recognition
Acceso en línea:	https://arxiv.org/abs/2502.11594
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
por: Chen, Jiankang, et al.
Publicado: (2025)

MOVE: Motion-Guided Few-Shot Video Object Segmentation
por: Ying, Kaining, et al.
Publicado: (2025)

Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
por: Li, Jiaze, et al.
Publicado: (2025)

Libraries on the MOVE.
por: Edgar, Jim, et al.
Publicado: (1986)

THE SUDAN: PARALLEL MOVE
Publicado: (1958)

MOVE: A Simple Motion-Based Data Collection Paradigm for Spatial Generalization in Robotic Manipulation
por: Wang, Huanqian, et al.
Publicado: (2025)

Physical oceanography during L'Atalante cruise MOVE2005 (ATA_MOVE_2005)
por: Krahmann, Gerd, et al.
Publicado: (2017)

Physical oceanography during L'Atalante cruise MOVE2002 (ATA_MOVE_2002)
por: Krahmann, Gerd, et al.
Publicado: (2017)

EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and Generation
por: Qiu, Zongyang, et al.
Publicado: (2025)

E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
por: Liu, Ye, et al.
Publicado: (2024)

Do We Need iPhone Moment or Xiaomi Moment for Robots? Design of Affordable Home Robots for Health Monitoring
por: Wei, Bo, et al.
Publicado: (2024)

TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
por: Xu, Boshen, et al.
Publicado: (2025)

Federated Learning with Sample-level Client Drift Mitigation
por: Xu, Haoran, et al.
Publicado: (2025)

Federated Joint Learning for Domain and Class Generalization
por: Xu, Haoran, et al.
Publicado: (2026)

Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow
por: Liu, Ruyang, et al.
Publicado: (2025)

Physics-Aware Video Instance Removal Benchmark
por: Li, Zirui, et al.
Publicado: (2026)

LLMSQRec: A study on a large language model‐based framework for dual‐view semantic‐quantized recommendation
por: Zhixue Zhang, et al.
Publicado: (2026)

EA-VTR: Event-Aware Video-Text Retrieval
por: Ma, Zongyang, et al.
Publicado: (2024)

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding
por: Tan, Wenhui, et al.
Publicado: (2026)

InstanceAnimator: Multi-Instance Sketch Video Colorization
por: Zhang, Yinhan, et al.
Publicado: (2026)

Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding
por: Shi, Shuyao, et al.
Publicado: (2026)

Redshift Evolution of the HII Galaxy $L$-$σ$ Relation: Gaussian Process Analysis and Cosmological Implications
por: Gao, Jiaze, et al.
Publicado: (2024)

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation
por: Li, Jiaze, et al.
Publicado: (2026)

HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
por: Li, Haoran, et al.
Publicado: (2025)

Instance-Aware Robust Consistency Regularization for Semi-Supervised Nuclei Instance Segmentation
por: Lin, Zenan, et al.
Publicado: (2025)

“MANUFACTURED BY THE SUN”: EVE LANGLEY’S THE PEA-PICKERS ON THE MOVE
por: Nicholas Birns
Publicado: (2016)

Immunological Tolerance Induced by Nanoliposome with Autoantigenie Peptide and Artesunate to Inhibit Complement and Remodel Immune Balance for Multiple Sclerosis Treatment
por: Yaya Wei, et al.
Publicado: (2025)

GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation
por: Tian, Jiayi, et al.
Publicado: (2026)

SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
por: Wang, Yuji, et al.
Publicado: (2025)

Subshifts of finite symbolic rank
por: Gao, Su, et al.
Publicado: (2023)

InstanceV: Instance-Level Video Generation
por: Chen, Yuheng, et al.
Publicado: (2025)

MathLearner: A Large Language Model Agent Framework for Learning to Solve Mathematical Problems
por: Xie, Wenbei, et al.
Publicado: (2024)

MoVideo: Motion-Aware Video Generation with Diffusion Models
por: Liang, Jingyun, et al.
Publicado: (2023)

CAVIS: Context-Aware Video Instance Segmentation
por: Lee, Seunghun, et al.
Publicado: (2024)

Geometry-Guided Camera Motion Understanding in VideoLLMs
por: Feng, Haoan, et al.
Publicado: (2026)

COMO O SER SE MOVE A OUTRO SER
por: Leandro Carvalho de Bitencourt
Publicado: (2022)

ReMOVE: A Reference-free Metric for Object Erasure
por: Chandrasekar, Aditya, et al.
Publicado: (2024)

Motion-Aware Caching for Efficient Autoregressive Video Generation
por: Xu, Jing, et al.
Publicado: (2026)

iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection
por: Yi, Huahui, et al.
Publicado: (2025)

The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
por: Reich, Christoph, et al.
Publicado: (2023)