:: Library Catalog

Bibliographic Details
Main Authors:	Mahbub, Ridwan; Aziz, Syem; Ahmed, Mahir; Rahman, Shadikur; Rahman, Mizanur; Joty, Shafiq; Hoque, Enamul
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.25220

Similar Items

LLM-Based Data Science Agents: A Survey of Capabilities, Challenges, and Future Directions
by: Rahman, Mizanur, et al.
Published: (2025)

DashboardQA: Benchmarking Multimodal Agents for Question Answering on Interactive Dashboards
by: Kartha, Aaryaman, et al.
Published: (2025)

Text2Vis: A Challenging and Diverse Benchmark for Generating Multimodal Visualizations from Text
by: Rahman, Mizanur, et al.
Published: (2025)

From Charts to Fair Narratives: Uncovering and Mitigating Geo-Economic Biases in Chart-to-Text
by: Mahbub, Ridwan, et al.
Published: (2025)

Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization
by: Rahman, Mizanur, et al.
Published: (2026)

Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning?
by: Laskar, Md Tahmid Rahman, et al.
Published: (2025)

DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts
by: Islam, Mohammed Saidul, et al.
Published: (2024)

ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
by: Masry, Ahmed, et al.
Published: (2024)

Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models
by: Islam, Shayekh Bin, et al.
Published: (2024)

Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices
by: Laskar, Md Tahmid Rahman, et al.
Published: (2025)

ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering
by: Masry, Ahmed, et al.
Published: (2025)

The Perils of Chart Deception: How Misleading Visualizations Affect Vision-Language Models
by: Mahbub, Ridwan, et al.
Published: (2025)

A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
by: Laskar, Md Tahmid Rahman, et al.
Published: (2024)

Lost in Translation: Do LVLM Judges Generalize Across Languages?
by: Laskar, Md Tahmid Rahman, et al.
Published: (2026)

Multi-Armed Bandits-Based Optimization of Decision Trees
by: Shanto, Hasibul Karim, et al.
Published: (2025)

Automated User Story Generation with Test Case Specification Using Large Language Model
by: Rahman, Tajmilur, et al.
Published: (2024)

ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
by: Masry, Ahmed, et al.
Published: (2024)

NAACL2025 Tutorial: Adaptation of Large Language Models
by: Ke, Zixuan, et al.
Published: (2025)

Grid2Guide: A* Enabled Small Language Model for Indoor Navigation
by: Haque, Md. Wasiul, et al.
Published: (2025)

BEADs: Bias Evaluation Across Domains
by: Raza, Shaina, et al.
Published: (2024)

Experimental Validation of Sensor Fusion-based GNSS Spoofing Attack Detection Framework for Autonomous Vehicles
by: Dasgupta, Sagar, et al.
Published: (2024)

Comparative Evaluation of Deep Learning Models for Fake Image Detection
by: Pakala, Akhitha, et al.
Published: (2026)

Harnessing LLM Agents with Skill Programs
by: Liu, Hongjun, et al.
Published: (2026)

BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
by: Purushwalkam, Senthil, et al.
Published: (2024)

V2CE: Video to Continuous Events Simulator
by: Zhang, Zhongyang, et al.
Published: (2023)

Vision-Based Localization and LLM-based Navigation for Indoor Environments
by: Rahimi, Keyan, et al.
Published: (2025)

Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic Workflows
by: Ming, Yifei, et al.
Published: (2025)

Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
by: Nguyen, Xuan-Phi, et al.
Published: (2023)

Automated Toll Management System Using RFID and Image Processing
by: Ahmed, Raihan, et al.
Published: (2024)

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
by: Le, Hung, et al.
Published: (2023)

Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs
by: Liu, Ye, et al.
Published: (2023)

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts
by: Nguyen, Xuan-Phi, et al.
Published: (2026)

J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
by: Xu, Austin, et al.
Published: (2025)

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing
by: Jiao, Fangkai, et al.
Published: (2024)

FocalLens: Visualizing Narratives through Focalization
by: Alam, S M Raihanul, et al.
Published: (2026)

Evaluating Psychological Safety of Large Language Models
by: Li, Xingxuan, et al.
Published: (2022)

Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings
by: Xu, Austin, et al.
Published: (2025)

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms
by: Pandit, Shrey, et al.
Published: (2025)

Agile Story-Point Estimation: Is RAG a Better Way to Go?
by: Maha, Lamyea, et al.
Published: (2026)

Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation
by: Peter, Ojonugwa Oluwafemi Ejiga, et al.
Published: (2025)