Saved in:
| Main Authors: | Song, Chuqiao, Chen, Shunzhang, Cai, Xinyi, Chen, Hao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.04862 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
by: Madhavi, Hrishit, et al.
Published: (2025)
by: Madhavi, Hrishit, et al.
Published: (2025)
Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data
by: González-Bustamante, Bastián
Published: (2024)
by: González-Bustamante, Bastián
Published: (2024)
Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case
by: González-Bustamante, Bastián, et al.
Published: (2025)
by: González-Bustamante, Bastián, et al.
Published: (2025)
Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025)
by: Nguyen, Minh Hoang, et al.
Published: (2025)
TextClass Benchmark: A Continuous Elo Rating of LLMs in Social Sciences
by: González-Bustamante, Bastián
Published: (2024)
by: González-Bustamante, Bastián
Published: (2024)
SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations
by: Dumpala, Sri Harsha, et al.
Published: (2024)
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Semi-automated extraction of research topics and trends from NCI funding in radiological sciences from 2000-2020
by: Nguyen, Mark, et al.
Published: (2023)
by: Nguyen, Mark, et al.
Published: (2023)
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations
by: Manohar, Kavya, et al.
Published: (2024)
by: Manohar, Kavya, et al.
Published: (2024)
Beyond RNNs: Benchmarking Attention-Based Image Captioning Models
by: Yanambakkam, Hemanth Teja, et al.
Published: (2025)
by: Yanambakkam, Hemanth Teja, et al.
Published: (2025)
Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model
by: Vosoughi, Ali, et al.
Published: (2025)
by: Vosoughi, Ali, et al.
Published: (2025)
A Survey on Vision-Language-Action Models for Embodied AI
by: Ma, Yueen, et al.
Published: (2024)
by: Ma, Yueen, et al.
Published: (2024)
The Association of Transformer-based Sentiment Analysis with Symptom Distress and Deterioration in Routine Psychotherapy Care
by: Faust, Douglas K., et al.
Published: (2026)
by: Faust, Douglas K., et al.
Published: (2026)
Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics
by: Kellert, Olga, et al.
Published: (2024)
by: Kellert, Olga, et al.
Published: (2024)
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
by: Koukounas, Andreas, et al.
Published: (2024)
by: Koukounas, Andreas, et al.
Published: (2024)
IRONIC: Coherence-Aware Reasoning Chains for Multi-Modal Sarcasm Detection
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)
RONA: Pragmatically Diverse Image Captioning with Coherence Relations
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2025)
Quantization for OpenAI's Whisper Models: A Comparative Analysis
by: Andreyev, Allison
Published: (2025)
by: Andreyev, Allison
Published: (2025)
From Black Box to Glass Box: Cross-Model ASR Disagreement to Prioto Review in Ambient AI Scribe Documentation
by: Karbalaie, Abdolamir, et al.
Published: (2026)
by: Karbalaie, Abdolamir, et al.
Published: (2026)
Sentiment analysis of texts from social networks based on machine learning methods for monitoring public sentiment
by: Nurlanuly, Arsen Tolebay
Published: (2025)
by: Nurlanuly, Arsen Tolebay
Published: (2025)
MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation
by: Qi, Dekang, et al.
Published: (2026)
by: Qi, Dekang, et al.
Published: (2026)
Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
by: Gkountouras, John, et al.
Published: (2025)
by: Gkountouras, John, et al.
Published: (2025)
Biomedical Visual Instruction Tuning with Clinician Preference Alignment
by: Cui, Hejie, et al.
Published: (2024)
by: Cui, Hejie, et al.
Published: (2024)
COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation
by: Hassan, Umair
Published: (2025)
by: Hassan, Umair
Published: (2025)
ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation
by: Silva, Fillipe dos Santos, et al.
Published: (2024)
by: Silva, Fillipe dos Santos, et al.
Published: (2024)
Extracting and Validating Explanatory Word Archipelagoes using Dual Entropy
by: Ohsawa, Yukio
Published: (2020)
by: Ohsawa, Yukio
Published: (2020)
Hateful Meme Detection through Context-Sensitive Prompting and Fine-Grained Labeling
by: Ouyang, Rongxin, et al.
Published: (2024)
by: Ouyang, Rongxin, et al.
Published: (2024)
MarketSenseAI 2.0: Enhancing Stock Analysis through LLM Agents
by: Fatouros, George, et al.
Published: (2025)
by: Fatouros, George, et al.
Published: (2025)
Non-Verbal Vocalisations and their Challenges: Emotion, Privacy, Sparseness, and Real Life
by: Batliner, Anton, et al.
Published: (2025)
by: Batliner, Anton, et al.
Published: (2025)
Unpacking Hateful Memes: Presupposed Context and False Claims
by: Cai, Weibin, et al.
Published: (2025)
by: Cai, Weibin, et al.
Published: (2025)
MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering
by: Yim, Wen-wai, et al.
Published: (2025)
by: Yim, Wen-wai, et al.
Published: (2025)
Does CLIP perceive art the same way we do?
by: Asperti, Andrea, et al.
Published: (2025)
by: Asperti, Andrea, et al.
Published: (2025)
Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation
by: Nasriddinov, Firdavs, et al.
Published: (2025)
by: Nasriddinov, Firdavs, et al.
Published: (2025)
Comparison of parameters of vowel sounds of russian and english languages
by: Fedoseev, V. I., et al.
Published: (2024)
by: Fedoseev, V. I., et al.
Published: (2024)
MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering
by: Gondhalekar, Chinmay, et al.
Published: (2025)
by: Gondhalekar, Chinmay, et al.
Published: (2025)
PCRI: Measuring Context Robustness in Multimodal Models for Enterprise Applications
by: Patel, Hitesh Laxmichand, et al.
Published: (2025)
by: Patel, Hitesh Laxmichand, et al.
Published: (2025)
Can Large Language Models Beat Wall Street? Unveiling the Potential of AI in Stock Selection
by: Fatouros, Georgios, et al.
Published: (2024)
by: Fatouros, Georgios, et al.
Published: (2024)
SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
by: Wu, Ren-Di, et al.
Published: (2025)
by: Wu, Ren-Di, et al.
Published: (2025)
Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)
by: Chen, Jie, et al.
Published: (2024)
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
by: Chopra, Anuradha, et al.
Published: (2025)
by: Chopra, Anuradha, et al.
Published: (2025)
NLP-Based Review for Toxic Comment Detection Tailored to the Chinese Cyberspace
by: Ren, Ruixing, et al.
Published: (2026)
by: Ren, Ruixing, et al.
Published: (2026)
Similar Items
-
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
by: Madhavi, Hrishit, et al.
Published: (2025) -
Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data
by: González-Bustamante, Bastián
Published: (2024) -
Emulating Public Opinion: A Proof-of-Concept of AI-Generated Synthetic Survey Responses for the Chilean Case
by: González-Bustamante, Bastián, et al.
Published: (2025) -
Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025) -
TextClass Benchmark: A Continuous Elo Rating of LLMs in Social Sciences
by: González-Bustamante, Bastián
Published: (2024)