:: Library Catalog

Bibliographic Details
Main Author:	D'Souza, Alex
Type:	Preprint
Published:	2026
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.19764

Similar Items

Exploring the Latest LLMs for Leaderboard Extraction
by: Kabongo, Salomon, et al.
Published: (2024)

LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology Learning Challenge
by: Giglou, Hamed Babaei, et al.
Published: (2024)

A FAIR and Free Prompt-based Research Assistant
by: Shamsabadi, Mahsa, et al.
Published: (2024)

LLMs4Synthesis: Leveraging Large Language Models for Scientific Synthesis
by: Giglou, Hamed Babaei, et al.
Published: (2024)

LLMs Can Plan Only If We Tell Them
by: Sel, Bilgehan, et al.
Published: (2025)

Diagnosing Structural Failures in LLM-Based Evidence Extraction for Meta-Analysis
by: Tan, Zhiyin, et al.
Published: (2026)

Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models
by: Tan, Zhiyin, et al.
Published: (2025)

Bridging the Evaluation Gap: Leveraging Large Language Models for Topic Model Evaluation
by: Tan, Zhiyin, et al.
Published: (2025)

Can We Edit LLMs for Long-Tail Biomedical Knowledge?
by: Yi, Xinhao, et al.
Published: (2025)

Are Stereotypes Leading LLMs' Zero-Shot Stance Detection ?
by: Dubreuil, Anthony, et al.
Published: (2025)

Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models
by: D'Souza, Jennifer, et al.
Published: (2025)

YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering
by: D'Souza, Jennifer, et al.
Published: (2025)

Stereotype Detection in LLMs: A Multiclass, Explainable, and Benchmark-Driven Approach
by: Wu, Zekun, et al.
Published: (2024)

From Keywords to Structured Summaries: Streamlining Scholarly Information Access
by: Shamsabadi, Mahsa, et al.
Published: (2024)

Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph
by: Nechakhin, Vladyslav, et al.
Published: (2024)

Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?
by: Evans, Julia, et al.
Published: (2024)

Large Language Models as Evaluators for Scientific Synthesis
by: Evans, Julia, et al.
Published: (2024)

OntoAligner: A Comprehensive Modular and Robust Python Toolkit for Ontology Alignment
by: Giglou, Hamed Babaei, et al.
Published: (2025)

DeepResearch$^{\text{Eco}}$: A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology
by: D'Souza, Jennifer, et al.
Published: (2025)

StereoTales: A Multilingual Framework for Open-Ended Stereotype Discovery in LLMs
by: Jeune, Pierre Le, et al.
Published: (2026)

Can We Trust LLM Detectors?
by: Sandhan, Jivnesh, et al.
Published: (2026)

How Can We Effectively Expand the Vocabulary of LLMs with 0.01GB of Target Language Text?
by: Yamaguchi, Atsuki, et al.
Published: (2024)

LLMs4SchemaDiscovery: A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models
by: Sadruddin, Sameer, et al.
Published: (2025)

SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
by: D'Souza, Jennifer, et al.
Published: (2025)

Large Language Models for Scientific Information Extraction: An Empirical Study for Virology
by: Shamsabadi, Mahsa, et al.
Published: (2024)

Can We Count on LLMs? The Fixed-Effect Fallacy and Claims of GPT-4 Capabilities
by: Ball, Thomas, et al.
Published: (2024)

Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs
by: Sridhar, Srivarshinee, et al.
Published: (2025)

LLMs for Relational Reasoning: How Far are We?
by: Li, Zhiming, et al.
Published: (2024)

LLM-REVal: Can We Trust LLM Reviewers Yet?
by: Li, Rui, et al.
Published: (2025)

Can We Verify Step by Step for Incorrect Answer Detection?
by: Xu, Xin, et al.
Published: (2024)

We Can't Understand AI Using our Existing Vocabulary
by: Hewitt, John, et al.
Published: (2025)

Responsible AI in NLP: GUS-Net Span-Level Bias Detection Dataset and Benchmark for Generalizations, Unfairness, and Stereotypes
by: Powers, Maximus, et al.
Published: (2024)

Automatic Prompt Engineering with No Task Cues and No Tuning
by: Chowdhury, Faisal, et al.
Published: (2026)

Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions
by: Barve, Saharsh, et al.
Published: (2025)

Location Not Found: Exposing Implicit Local and Global Biases in Multilingual LLMs
by: Mor-Lan, Guy, et al.
Published: (2026)

Coordinates from Context: Using LLMs to Ground Complex Location References
by: Masis, Tessa, et al.
Published: (2025)

Evaluation of Large Language Models: STEM education and Gender Stereotypes
by: Due, Smilla, et al.
Published: (2024)

Characterizing Stereotypical Bias from Privacy-preserving Pre-Training
by: Arnold, Stefan, et al.
Published: (2024)

Detecting Linguistic Indicators for Stereotype Assessment with Large Language Models
by: Görge, Rebekka, et al.
Published: (2025)

The production of meaning in the processing of natural language
by: Agostino, Christopher J., et al.
Published: (2026)