| Main authors: | Li, Jiaze, Shi, Yaya, Ma, Zongyang, Xu, Haoran, Cheng, Feng, Xiao, Huihui, Kang, Ruiwen, Yang, Fan, Gao, Tingting, Zhang, Di |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online access: | https://arxiv.org/abs/2502.11594 |
Similar items
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
by: Chen, Jiankang, et al.
Published: (2025)
MOVE: Motion-Guided Few-Shot Video Object Segmentation
by: Ying, Kaining, et al.
Published: (2025)
Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment
by: Li, Jiaze, et al.
Published: (2025)
Libraries on the MOVE.
by: Edgar, Jim, et al.
Published: (1986)
THE SUDAN: PARALLEL MOVE
Published: (1958)
MOVE: A Simple Motion-Based Data Collection Paradigm for Spatial Generalization in Robotic Manipulation
by: Wang, Huanqian, et al.
Published: (2025)
Physical oceanography during L'Atalante cruise MOVE2005 (ATA_MOVE_2005)
by: Krahmann, Gerd, et al.
Published: (2017)
Physical oceanography during L'Atalante cruise MOVE2002 (ATA_MOVE_2002)
by: Krahmann, Gerd, et al.
Published: (2017)
EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and Generation
by: Qiu, Zongyang, et al.
Published: (2025)
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
by: Liu, Ye, et al.
Published: (2024)
Do We Need iPhone Moment or Xiaomi Moment for Robots? Design of Affordable Home Robots for Health Monitoring
by: Wei, Bo, et al.
Published: (2024)
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
by: Xu, Boshen, et al.
Published: (2025)
Federated Learning with Sample-level Client Drift Mitigation
by: Xu, Haoran, et al.
Published: (2025)
Federated Joint Learning for Domain and Class Generalization
by: Xu, Haoran, et al.
Published: (2026)
Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow
by: Liu, Ruyang, et al.
Published: (2025)
Physics-Aware Video Instance Removal Benchmark
by: Li, Zirui, et al.
Published: (2026)
LLMSQRec: A study on a large language model‐based framework for dual‐view semantic‐quantized recommendation
by: Zhixue Zhang, et al.
Published: (2026)
EA-VTR: Event-Aware Video-Text Retrieval
by: Ma, Zongyang, et al.
Published: (2024)
Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding
by: Tan, Wenhui, et al.
Published: (2026)
InstanceAnimator: Multi-Instance Sketch Video Colorization
by: Zhang, Yinhan, et al.
Published: (2026)
Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding
by: Shi, Shuyao, et al.
Published: (2026)
Redshift Evolution of the HII Galaxy $L$-$σ$ Relation: Gaussian Process Analysis and Cosmological Implications
by: Gao, Jiaze, et al.
Published: (2024)
Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation
by: Li, Jiaze, et al.
Published: (2026)
HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
by: Li, Haoran, et al.
Published: (2025)
Instance-Aware Robust Consistency Regularization for Semi-Supervised Nuclei Instance Segmentation
by: Lin, Zenan, et al.
Published: (2025)
“MANUFACTURED BY THE SUN”: EVE LANGLEY’S THE PEA-PICKERS ON THE MOVE
by: Nicholas Birns
Published: (2016)
Immunological Tolerance Induced by Nanoliposome with Autoantigenie Peptide and Artesunate to Inhibit Complement and Remodel Immune Balance for Multiple Sclerosis Treatment
by: Yaya Wei, et al.
Published: (2025)
GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation
by: Tian, Jiayi, et al.
Published: (2026)
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
by: Wang, Yuji, et al.
Published: (2025)
Subshifts of finite symbolic rank
by: Gao, Su, et al.
Published: (2023)
InstanceV: Instance-Level Video Generation
by: Chen, Yuheng, et al.
Published: (2025)
MathLearner: A Large Language Model Agent Framework for Learning to Solve Mathematical Problems
by: Xie, Wenbei, et al.
Published: (2024)
MoVideo: Motion-Aware Video Generation with Diffusion Models
by: Liang, Jingyun, et al.
Published: (2023)
CAVIS: Context-Aware Video Instance Segmentation
by: Lee, Seunghun, et al.
Published: (2024)
Geometry-Guided Camera Motion Understanding in VideoLLMs
by: Feng, Haoan, et al.
Published: (2026)
COMO O SER SE MOVE A OUTRO SER
by: Leandro Carvalho de Bitencourt
Published: (2022)
ReMOVE: A Reference-free Metric for Object Erasure
by: Chandrasekar, Aditya, et al.
Published: (2024)
Motion-Aware Caching for Efficient Autoregressive Video Generation
by: Xu, Jing, et al.
Published: (2026)
iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection
by: Yi, Huahui, et al.
Published: (2025)
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
by: Reich, Christoph, et al.
Published: (2023)