:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Song, Chuqiao, Chen, Shunzhang, Cai, Xinyi, Chen, Hao
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Computers and Society 68T50 (Natural Language Processing), 68T10 (Pattern Recognition, Speech Recognition), 91F10 (Political Science)
Online Access:	https://arxiv.org/abs/2411.04862
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
by: Madhavi, Hrishit, et al.
Published: (2025)

Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data
by: González-Bustamante, Bastián
Published: (2024)

Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case
by: González-Bustamante, Bastián, et al.
Published: (2025)

Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025)

TextClass Benchmark: A Continuous Elo Rating of LLMs in Social Sciences
by: González-Bustamante, Bastián
Published: (2024)

SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations
by: Dumpala, Sri Harsha, et al.
Published: (2024)

Semi-automated extraction of research topics and trends from NCI funding in radiological sciences from 2000-2020
by: Nguyen, Mark, et al.
Published: (2023)

What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations
by: Manohar, Kavya, et al.
Published: (2024)

Beyond RNNs: Benchmarking Attention-Based Image Captioning Models
by: Yanambakkam, Hemanth Teja, et al.
Published: (2025)

Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model
by: Vosoughi, Ali, et al.
Published: (2025)

A Survey on Vision-Language-Action Models for Embodied AI
by: Ma, Yueen, et al.
Published: (2024)

The Association of Transformer-based Sentiment Analysis with Symptom Distress and Deterioration in Routine Psychotherapy Care
by: Faust, Douglas K., et al.
Published: (2026)

Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics
by: Kellert, Olga, et al.
Published: (2024)

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
by: Koukounas, Andreas, et al.
Published: (2024)

IRONIC: Coherence-Aware Reasoning Chains for Multi-Modal Sarcasm Detection
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)

RONA: Pragmatically Diverse Image Captioning with Coherence Relations
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)

Quantization for OpenAI's Whisper Models: A Comparative Analysis
by: Andreyev, Allison
Published: (2025)

From Black Box to Glass Box: Cross-Model ASR Disagreement to Prioto Review in Ambient AI Scribe Documentation
by: Karbalaie, Abdolamir, et al.
Published: (2026)

Sentiment analysis of texts from social networks based on machine learning methods for monitoring public sentiment
by: Nurlanuly, Arsen Tolebay
Published: (2025)

MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation
by: Qi, Dekang, et al.
Published: (2026)

Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
by: Gkountouras, John, et al.
Published: (2025)

Biomedical Visual Instruction Tuning with Clinician Preference Alignment
by: Cui, Hejie, et al.
Published: (2024)

COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation
by: Hassan, Umair
Published: (2025)

ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation
by: Silva, Fillipe dos Santos, et al.
Published: (2024)

Extracting and Validating Explanatory Word Archipelagoes using Dual Entropy
by: Ohsawa, Yukio
Published: (2020)

Hateful Meme Detection through Context-Sensitive Prompting and Fine-Grained Labeling
by: Ouyang, Rongxin, et al.
Published: (2024)

MarketSenseAI 2.0: Enhancing Stock Analysis through LLM Agents
by: Fatouros, George, et al.
Published: (2025)

Non-Verbal Vocalisations and their Challenges: Emotion, Privacy, Sparseness, and Real Life
by: Batliner, Anton, et al.
Published: (2025)

Unpacking Hateful Memes: Presupposed Context and False Claims
by: Cai, Weibin, et al.
Published: (2025)

MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering
by: Yim, Wen-wai, et al.
Published: (2025)

Does CLIP perceive art the same way we do?
by: Asperti, Andrea, et al.
Published: (2025)

Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation
by: Nasriddinov, Firdavs, et al.
Published: (2025)

Comparison of parameters of vowel sounds of russian and english languages
by: Fedoseev, V. I., et al.
Published: (2024)

MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering
by: Gondhalekar, Chinmay, et al.
Published: (2025)

PCRI: Measuring Context Robustness in Multimodal Models for Enterprise Applications
by: Patel, Hitesh Laxmichand, et al.
Published: (2025)

Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection
by: Fatouros, Georgios, et al.
Published: (2024)

SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
by: Wu, Ren-Di, et al.
Published: (2025)

Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
by: Chopra, Anuradha, et al.
Published: (2025)

NLP-Based Review for Toxic Comment Detection Tailored to the Chinese Cyberspace
by: Ren, Ruixing, et al.
Published: (2026)