:: Library Catalog

Bibliographic Details
Main Authors:	Zhou, Weixiao; Zhu, Junnan; Li, Gengyao; Cheng, Xianfu; Liang, Xinnian; Zhai, Feifei; Li, Zhoujun
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.12474

Similar Items

XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser
by: Cheng, Xianfu, et al.
Published: (2024)

What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations
by: Liu, Dongqi, et al.
Published: (2025)

TROVE: A Challenge for Fine-Grained Text Provenance via Source Sentence Tracing and Relationship Classification
by: Zhu, Junnan, et al.
Published: (2025)

What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models
by: Zhu, Shengqi, et al.
Published: (2024)

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
by: Wu, Xianjie, et al.
Published: (2024)

SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
by: Wang, Bing, et al.
Published: (2023)

P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark
by: Sun, Tao, et al.
Published: (2025)

Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation
by: Li, Zhuohang, et al.
Published: (2024)

What Are Tools Anyway? A Survey from the Language Model Perspective
by: Wang, Zhiruo, et al.
Published: (2024)

Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems
by: Yan, Bingyu, et al.
Published: (2025)

m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
by: Yang, Jian, et al.
Published: (2024)

What Do AI Agents Talk About? Discourse and Architectural Constraints in the First AI-Only Social Network
by: Dube, Taksch, et al.
Published: (2026)

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL
by: Wang, Bing, et al.
Published: (2023)

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
by: Cheng, Xianfu, et al.
Published: (2025)

Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
by: Chen, Xiuying, et al.
Published: (2024)

Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework
by: Xing, Peng, et al.
Published: (2024)

What Large Language Models Do Not Talk About: An Empirical Study of Moderation and Censorship Practices
by: Noels, Sander, et al.
Published: (2025)

Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA
by: Dong, Xuanzhao, et al.
Published: (2025)

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding
by: Wu, Haoning, et al.
Published: (2024)

Refining Transcripts With TV Subtitles by Prompt-Based Weakly Supervised Training of ASR
by: Zhao, Xinnian, et al.
Published: (2025)

ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
by: Zhang, Wei, et al.
Published: (2024)

Word Matters: What Influences Domain Adaptation in Summarization?
by: Li, Yinghao, et al.
Published: (2024)

TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured Table Question Answering
by: Zhu, Junnan, et al.
Published: (2025)

M3TQA: Massively Multilingual Multitask Table Question Answering
by: Shu, Daixin, et al.
Published: (2025)

SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor
by: Cheng, Xianfu, et al.
Published: (2024)

Garbage In, Reasoning Out? Why Benchmark Scores are Unreliable and What to Do About It
by: Mousavi, Seyed Mahed, et al.
Published: (2025)

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2024)

CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes
by: Marques, Miguel, et al.
Published: (2026)

What Matters to an LLM? Behavioral and Computational Evidences from Summarization
by: Zhou, Yongxin, et al.
Published: (2026)

SNS-Bench-VL: Benchmarking Multimodal Large Language Models in Social Networking Services
by: Guo, Hongcheng, et al.
Published: (2025)

We Need to Talk About Classification Evaluation Metrics in NLP
by: Vickers, Peter, et al.
Published: (2024)

Context-Adaptive Synthesis and Compression for Enhanced Retrieval-Augmented Generation in Complex Domains
by: Zhou, Peiran, et al.
Published: (2025)

SUMIE: A Synthetic Benchmark for Incremental Entity Summarization
by: Hwang, Eunjeong, et al.
Published: (2024)

SituatedThinker: Grounding LLM Reasoning with Real-World through Situated Thinking
by: Liu, Junnan, et al.
Published: (2025)

DependEval: Benchmarking LLMs for Repository Dependency Understanding
by: Du, Junjia, et al.
Published: (2025)

Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
by: Cui, Chenhao, et al.
Published: (2024)

Flexible and Adaptable Summarization via Expertise Separation
by: Chen, Xiuying, et al.
Published: (2024)

ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering
by: Wei, Jingxuan, et al.
Published: (2025)

SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment
by: Zhai, Zhouwei, et al.
Published: (2026)

Still "Talking About Large Language Models": Some Clarifications
by: Shanahan, Murray
Published: (2024)