:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Chao, Lianying, Yin, Linfeng, Ren, Peiyu, Jiang, Yifan, Ren, Qiaoyu, Shan, Dingcheng, Pang, Jing-cheng, Wu, Sijie, Li, Xubin, Zhang, Kai, Chen, Xin
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Computer Vision and Pattern Recognition
Accesso online:	https://arxiv.org/abs/2601.14594
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Multi-Modal LLM based Image Captioning in ICT: Bridging the Gap Between General and Industry Domain
di: Chao, Lianying, et al.
Pubblicazione: (2026)

EasyRec: Simple yet Effective Language Models for Recommendation
di: Ren, Xubin, et al.
Pubblicazione: (2024)

LFS-Aware Surface Reconstruction from Unoriented 3D Point Clouds
di: Fu, Rao, et al.
Pubblicazione: (2024)

Progress-Aware Video Frame Captioning
di: Xue, Zihui, et al.
Pubblicazione: (2024)

VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos
di: Ren, Xubin, et al.
Pubblicazione: (2025)

XRec: Large Language Models for Explainable Recommendation
di: Ma, Qiyao, et al.
Pubblicazione: (2024)

AMOSL: Adaptive Modality-wise Structure Learning in Multi-view Graph Neural Networks For Enhanced Unified Representation
di: Liang, Peiyu, et al.
Pubblicazione: (2024)

Disentangled Contrastive Collaborative Filtering
di: Ren, Xubin, et al.
Pubblicazione: (2023)

A Survey of Large Language Models for Graphs
di: Ren, Xubin, et al.
Pubblicazione: (2024)

A Comprehensive Survey on Self-Supervised Learning for Recommendation
di: Ren, Xubin, et al.
Pubblicazione: (2024)

MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation
di: Fan, Tianyu, et al.
Pubblicazione: (2025)

Enhance Temporal Relations in Audio Captioning with Sound Event Detection
di: Xie, Zeyu, et al.
Pubblicazione: (2023)

Reinforcement Learning with Promising Tokens for Large Language Models
di: Pang, Jing-Cheng, et al.
Pubblicazione: (2026)

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
di: Lu, Yifan, et al.
Pubblicazione: (2023)

Harnessing Corporate Social Responsibility and Innovation Management: A Path Toward Diversity, Equity, and Inclusion Management Using Stakeholder Theory
di: Lianying Yao, et al.
Pubblicazione: (2026)

Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning
di: Xie, Zhuyang, et al.
Pubblicazione: (2024)

Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction
di: Jia, Mingda, et al.
Pubblicazione: (2025)

RAG-Anything: All-in-One RAG Framework
di: Guo, Zirui, et al.
Pubblicazione: (2025)

DeepCode: Open Agentic Coding
di: Li, Zongwei, et al.
Pubblicazione: (2025)

VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
di: Zhu, Jiaying, et al.
Pubblicazione: (2025)

Panoptic Captioning: An Equivalence Bridge for Image and Text
di: Lin, Kun-Yu, et al.
Pubblicazione: (2025)

Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence
di: Yang, Shuai, et al.
Pubblicazione: (2025)

Emodin promotes the recovery of rheumatoid arthritis by regulating the crosstalk between macrophage subsets and synovial fibroblast subsets
di: Lianying Cheng, et al.
Pubblicazione: (2024)

Effectiveness of an Enhanced Nursing Intervention Program Combining Infection Control and Respiratory Function Training in Patients With Leukemia and Respiratory Infections During Chemotherapy
di: Yuzhen Lu, et al.
Pubblicazione: (2025)

Towards Diverse and Efficient Audio Captioning via Diffusion Models
di: Xu, Manjie, et al.
Pubblicazione: (2024)

CARES: Context-Aware Resolution Selector for VLMs
di: Kimhi, Moshe, et al.
Pubblicazione: (2025)

Modular matrix invariants under some transpose actions
di: Chen, Yin, et al.
Pubblicazione: (2025)

Carrier-Phonon Decoupling via Annealing Enhances Thermoelectric Performance of Bi2(Te,Se)
di: cheng, xinxiu, et al.
Pubblicazione: (2025)

Event-Anchored Frame Selection for Effective Long-Video Understanding
di: Chen, Wang, et al.
Pubblicazione: (2026)

Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames
di: Chen, Chao, et al.
Pubblicazione: (2023)

EventVAD: Training-Free Event-Aware Video Anomaly Detection
di: Shao, Yihua, et al.
Pubblicazione: (2025)

Frame-Level Captions for Long Video Generation with Complex Multi Scenes
di: Zheng, Guangcong, et al.
Pubblicazione: (2025)

RecGPT: A Foundation Model for Sequential Recommendation
di: Jiang, Yangqin, et al.
Pubblicazione: (2025)

EVOS: Efficient Implicit Neural Training via EVOlutionary Selector
di: Zhang, Weixiang, et al.
Pubblicazione: (2024)

Representation Learning with Large Language Models for Recommendation
di: Ren, Xubin, et al.
Pubblicazione: (2023)

V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models
di: Lin, Xinying, et al.
Pubblicazione: (2026)

$\ell_1$DecNet+: A new architecture framework by $\ell_1$ decomposition and iteration unfolding for sparse feature segmentation
di: Ren, Yumeng, et al.
Pubblicazione: (2022)

To Unpack or Not to Unpack: Living with Packers to Enable Dynamic Analysis of Android Apps
di: Asghari, Mohammad Hossein, et al.
Pubblicazione: (2025)

Invariant Link Selector for Spatial-Temporal Out-of-Distribution Problem
di: Tieu, Katherine, et al.
Pubblicazione: (2025)

Diversity and Temporality of Chaotic Events
di: Javier Montenegro Joo
Pubblicazione: (2016)