:: Library Catalog

Bibliographic Details
Main Authors:	Chen, Guiming Hardy; Chen, Shunian; Zhang, Ruifei; Chen, Junying; Wu, Xiangbo; Zhang, Zhiyi; Chen, Zhihong; Li, Jianquan; Wan, Xiang; Wang, Benyou
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.11684

Similar Items

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale
by: Chen, Junying, et al.
Published: (2024)

MileBench: Benchmarking MLLMs in Long Context
by: Song, Dingjie, et al.
Published: (2024)

Humans or LLMs as the Judge? A Study on Judgement Biases
by: Chen, Guiming Hardy, et al.
Published: (2024)

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
by: Ge, Wentao, et al.
Published: (2023)

CMB: A Comprehensive Medical Benchmark in Chinese
by: Wang, Xidong, et al.
Published: (2023)

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation
by: Chen, Junying, et al.
Published: (2025)

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
by: Chen, Junying, et al.
Published: (2023)

AceGPT, Localizing Large Language Models in Arabic
by: Huang, Huang, et al.
Published: (2023)

Online Training of Large Language Models: Learn while chatting
by: Liang, Juhao, et al.
Published: (2024)

Large Multimodal Agents: A Survey
by: Xie, Junlin, et al.
Published: (2024)

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
by: Wang, Rongsheng, et al.
Published: (2025)

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture
by: Wang, Xidong, et al.
Published: (2024)

Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM
by: Song, Dingjie, et al.
Published: (2024)

From Lossy to Verified: A Provenance-Aware Tiered Memory for Agents
by: Zhu, Qiming, et al.
Published: (2026)

A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models
by: Jing, Liqiang, et al.
Published: (2025)

Harnessing the Power of Local Representations for Few-Shot Classification
by: Tang, Shi, et al.
Published: (2024)

LLMs Could Autonomously Learn Without External Supervision
by: Ji, Ke, et al.
Published: (2024)

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
by: Li, Lei, et al.
Published: (2024)

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
by: Zhang, Ruifei, et al.
Published: (2025)

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis
by: Chen, Junying, et al.
Published: (2024)

VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
by: Zhang, Ruifei, et al.
Published: (2025)

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs
by: Song, Dingjie, et al.
Published: (2024)

EvA: An Evidence-First Audio Understanding Paradigm for LALMs
by: Xie, Xinyuan, et al.
Published: (2026)

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
by: Chen, Junying, et al.
Published: (2024)

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion
by: Chen, Shunian, et al.
Published: (2025)

ALLaM: Large Language Models for Arabic and English
by: Bari, M Saiful, et al.
Published: (2024)

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
by: Chen, Shunian, et al.
Published: (2025)

ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
by: Chen, Junying, et al.
Published: (2025)

RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions
by: Liu, Wanlong, et al.
Published: (2024)

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement
by: Du, Yuhao, et al.
Published: (2024)

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models
by: Tao, Dehua, et al.
Published: (2026)

AME: Aligned Manifold Entropy for Robust Vision-Language Distillation
by: Cao, Guiming, et al.
Published: (2025)

Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People
by: Wang, Xidong, et al.
Published: (2024)

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry
by: Cai, Zhenyang, et al.
Published: (2025)

From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph
by: Gong, Junfeng, et al.
Published: (2025)

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives
by: Chen, Zuyao, et al.
Published: (2023)

Do LLMs Triage Like Clinicians? A Dynamic Study of Outpatient Referral
by: Liu, Xiaoxiao, et al.
Published: (2025)

Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging
by: Cai, Zhenyang, et al.
Published: (2024)

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
by: Xie, Wenya, et al.
Published: (2024)

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task
by: Le-Duc, Khai, et al.
Published: (2024)