Saved in:
| Main Authors: | Sakib, Shadman, Akhand, Oishy Fatema, Abrar, Ajwad |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.14949 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From Reviews to Requirements: Can LLMs Generate Human-Like User Stories?
by: Sakib, Shadman, et al.
Published: (2026)
by: Sakib, Shadman, et al.
Published: (2026)
Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization
by: Abrar, Ajwad, et al.
Published: (2025)
by: Abrar, Ajwad, et al.
Published: (2025)
Distinguishing Repetition Disfluency from Morphological Reduplication in Bangla ASR Transcripts: A Novel Corpus and Benchmarking Analysis
by: Arpa, Zaara Zabeen, et al.
Published: (2025)
by: Arpa, Zaara Zabeen, et al.
Published: (2025)
Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies
by: Abrar, Ajwad, et al.
Published: (2025)
by: Abrar, Ajwad, et al.
Published: (2025)
Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs
by: Kabir, Mohsinul, et al.
Published: (2025)
by: Kabir, Mohsinul, et al.
Published: (2025)
Assessing Large Language Models for Medical QA: Zero-Shot and LLM-as-a-Judge Evaluation
by: Adib, Shefayat E Shams, et al.
Published: (2026)
by: Adib, Shefayat E Shams, et al.
Published: (2026)
BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali
by: Adib, Shefayat E Shams, et al.
Published: (2026)
by: Adib, Shefayat E Shams, et al.
Published: (2026)
FirstAidQA: A Synthetic Dataset for First Aid and Emergency Response in Low-Connectivity Settings
by: Muna, Saiyma Sittul, et al.
Published: (2025)
by: Muna, Saiyma Sittul, et al.
Published: (2025)
Social media polarization during conflict: Insights from an ideological stance dataset on Israel-Palestine Reddit comments
by: Ali, Hasin Jawad, et al.
Published: (2025)
by: Ali, Hasin Jawad, et al.
Published: (2025)
BanglaSummEval: Reference-Free Factual Consistency Evaluation for Bangla Summarization
by: Rafid, Ahmed, et al.
Published: (2026)
by: Rafid, Ahmed, et al.
Published: (2026)
CogniAlign: Survivability-Grounded Multi-Agent Moral Reasoning for Safe and Transparent AI
by: Ali, Hasin Jawad, et al.
Published: (2025)
by: Ali, Hasin Jawad, et al.
Published: (2025)
Faithful Summarization of Consumer Health Queries: A Cross-Lingual Framework with LLMs
by: Abrar, Ajwad, et al.
Published: (2025)
by: Abrar, Ajwad, et al.
Published: (2025)
BanglaMedQA and BanglaMMedBench: Evaluating Retrieval-Augmented Generation Strategies for Bangla Biomedical Question Answering
by: Sultana, Sadia, et al.
Published: (2025)
by: Sultana, Sadia, et al.
Published: (2025)
Toward Trustworthy Difficulty Assessments: Large Language Models as Judges in Programming and Synthetic Tasks
by: Tabib, H. M. Shadman, et al.
Published: (2025)
by: Tabib, H. M. Shadman, et al.
Published: (2025)
Motamot: A Dataset for Revealing the Supremacy of Large Language Models over Transformer Models in Bengali Political Sentiment Analysis
by: Faria, Fatema Tuj Johora, et al.
Published: (2024)
by: Faria, Fatema Tuj Johora, et al.
Published: (2024)
MixSarc: A Bangla-English Code-Mixed Corpus for Implicit Meaning Identification
by: Alam, Kazi Samin Yasar, et al.
Published: (2026)
by: Alam, Kazi Samin Yasar, et al.
Published: (2026)
LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification
by: Adib, Shefayat E Shams, et al.
Published: (2026)
by: Adib, Shefayat E Shams, et al.
Published: (2026)
Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models
by: Lopez-Lira, Alejandro, et al.
Published: (2023)
by: Lopez-Lira, Alejandro, et al.
Published: (2023)
LLM-Assisted Question-Answering on Technical Documents Using Structured Data-Aware Retrieval Augmented Generation
by: Sobhan, Shadman, et al.
Published: (2025)
by: Sobhan, Shadman, et al.
Published: (2025)
Addressing Data Scarcity in Bangla Fake News Detection: An LLM-Based Dataset Augmentation Approach
by: Sani, Ahmed Alfey, et al.
Published: (2026)
by: Sani, Ahmed Alfey, et al.
Published: (2026)
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach
by: Tabib, H. M. Shadman, et al.
Published: (2025)
by: Tabib, H. M. Shadman, et al.
Published: (2025)
An Empirical Evaluation of Large Language Models on Consumer Health Questions
by: Abrar, Moaiz, et al.
Published: (2024)
by: Abrar, Moaiz, et al.
Published: (2024)
Large Language Models for Propaganda Span Annotation
by: Hasanain, Maram, et al.
Published: (2023)
by: Hasanain, Maram, et al.
Published: (2023)
Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability
by: Sakib, Md Sadman, et al.
Published: (2024)
by: Sakib, Md Sadman, et al.
Published: (2024)
Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles
by: Hasanain, Maram, et al.
Published: (2024)
by: Hasanain, Maram, et al.
Published: (2024)
Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?
by: Salami, Hossein, et al.
Published: (2024)
by: Salami, Hossein, et al.
Published: (2024)
Can Large Language Models Predict Antimicrobial Resistance Gene?
by: Yoo, Hyunwoo
Published: (2025)
by: Yoo, Hyunwoo
Published: (2025)
Towards Linguistically-Aware and Language-Independent Tokenization for Large Language Models (LLMs)
by: Rahman, Abrar, et al.
Published: (2024)
by: Rahman, Abrar, et al.
Published: (2024)
Battling Misinformation: An Empirical Study on Adversarial Factuality in Open-Source Large Language Models
by: Sakib, Shahnewaz Karim, et al.
Published: (2025)
by: Sakib, Shahnewaz Karim, et al.
Published: (2025)
Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive Study
by: Faria, Fatema Tuj Johora, et al.
Published: (2024)
by: Faria, Fatema Tuj Johora, et al.
Published: (2024)
Can Large Language Models Predict the Outcome of Judicial Decisions?
by: Kmainasi, Mohamed Bayan, et al.
Published: (2025)
by: Kmainasi, Mohamed Bayan, et al.
Published: (2025)
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models
by: Das, Anindya Bijoy, et al.
Published: (2025)
by: Das, Anindya Bijoy, et al.
Published: (2025)
Why Not Transform Chat Large Language Models to Non-English?
by: Geng, Xiang, et al.
Published: (2024)
by: Geng, Xiang, et al.
Published: (2024)
Large Language Models in Cryptocurrency Securities Cases: Can a GPT Model Meaningfully Assist Lawyers?
by: Trozze, Arianna, et al.
Published: (2023)
by: Trozze, Arianna, et al.
Published: (2023)
Can Large Language Models Predict Associations Among Human Attitudes?
by: Ma, Ana, et al.
Published: (2025)
by: Ma, Ana, et al.
Published: (2025)
Risks, Causes, and Mitigations of Widespread Deployments of Large Language Models (LLMs): A Survey
by: Sakib, Md Nazmus, et al.
Published: (2024)
by: Sakib, Md Nazmus, et al.
Published: (2024)
From Word to World: Can Large Language Models be Implicit Text-based World Models?
by: Li, Yixia, et al.
Published: (2025)
by: Li, Yixia, et al.
Published: (2025)
Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models
by: Venkatasubramanian, Venkat, et al.
Published: (2024)
by: Venkatasubramanian, Venkat, et al.
Published: (2024)
Wi-Chat: Large Language Model Powered Wi-Fi Sensing
by: Zhang, Haopeng, et al.
Published: (2025)
by: Zhang, Haopeng, et al.
Published: (2025)
Can We Predict Performance of Large Models across Vision-Language Tasks?
by: Zhao, Qinyu, et al.
Published: (2024)
by: Zhao, Qinyu, et al.
Published: (2024)
Similar Items
-
From Reviews to Requirements: Can LLMs Generate Human-Like User Stories?
by: Sakib, Shadman, et al.
Published: (2026) -
Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization
by: Abrar, Ajwad, et al.
Published: (2025) -
Distinguishing Repetition Disfluency from Morphological Reduplication in Bangla ASR Transcripts: A Novel Corpus and Benchmarking Analysis
by: Arpa, Zaara Zabeen, et al.
Published: (2025) -
Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies
by: Abrar, Ajwad, et al.
Published: (2025) -
Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs
by: Kabir, Mohsinul, et al.
Published: (2025)