Saved in:
| Main Authors: | Rasool, Zafaryab, Kurniawan, Stefanus, Balugo, Sherwin, Barnett, Scott, Vasa, Rajesh, Chesser, Courtney, Hampstead, Benjamin M., Belleville, Sylvie, Mouzakis, Kon, Bahar-Fuchs, Alex |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.07878 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RAGProbe: An Automated Approach for Evaluating RAG Applications
by: Sivasothy, Shangeetha, et al.
Published: (2024)
by: Sivasothy, Shangeetha, et al.
Published: (2024)
LLMs for Test Input Generation for Semantic Caches
by: Rasool, Zafaryab, et al.
Published: (2024)
by: Rasool, Zafaryab, et al.
Published: (2024)
The M-factor: A Novel Metric for Evaluating Neural Architecture Search in Resource-Constrained Environments
by: Thudumu, Srikanth, et al.
Published: (2025)
by: Thudumu, Srikanth, et al.
Published: (2025)
A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions
by: Du, Hung, et al.
Published: (2024)
by: Du, Hung, et al.
Published: (2024)
Contextual Knowledge Sharing in Multi-Agent Reinforcement Learning with Decentralized Communication and Coordination
by: Du, Hung, et al.
Published: (2025)
by: Du, Hung, et al.
Published: (2025)
Goal-Oriented Multi-Agent Reinforcement Learning for Decentralized Agent Teams
by: Du, Hung, et al.
Published: (2025)
by: Du, Hung, et al.
Published: (2025)
Overcoming Semantic Dilution in Transformer-Based Next Frame Prediction
by: Nguyen, Hy, et al.
Published: (2025)
by: Nguyen, Hy, et al.
Published: (2025)
CSAOT: Cooperative Multi-Agent System for Active Object Tracking
by: Nguyen, Hy, et al.
Published: (2025)
by: Nguyen, Hy, et al.
Published: (2025)
Local Control Networks (LCNs): Optimizing Flexibility in Neural Network Data Pattern Capture
by: Nguyen, Hy, et al.
Published: (2025)
by: Nguyen, Hy, et al.
Published: (2025)
Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models
by: Barnett, Scott, et al.
Published: (2024)
by: Barnett, Scott, et al.
Published: (2024)
TaskEval: Synthesised Evaluation for Foundation-Model Tasks
by: Widanapathiranage, Dilani, et al.
Published: (2025)
by: Widanapathiranage, Dilani, et al.
Published: (2025)
Dual-Branch HNSW Approach with Skip Bridges and LID-Driven Optimization
by: Nguyen, Hy, et al.
Published: (2025)
by: Nguyen, Hy, et al.
Published: (2025)
Seven Failure Points When Engineering a Retrieval Augmented Generation System
by: Barnett, Scott, et al.
Published: (2024)
by: Barnett, Scott, et al.
Published: (2024)
Large language models for generating rules, yay or nay?
by: Sivasothy, Shangeetha, et al.
Published: (2024)
by: Sivasothy, Shangeetha, et al.
Published: (2024)
ML-On-Rails: Safeguarding Machine Learning Models in Software Systems A Case Study
by: Abdelkader, Hala, et al.
Published: (2024)
by: Abdelkader, Hala, et al.
Published: (2024)
Gluing characteristics of Papua New Guinea timber species for various non-structural applications
by: Benoit Belleville
Published: (2024)
by: Benoit Belleville
Published: (2024)
When loxodromics are pseudo-Anosovs on witnesses
by: Chesser, Marissa
Published: (2026)
by: Chesser, Marissa
Published: (2026)
WOOD PLANING PROPERTIES OF AUSTRALIAN PLANTATION-GROWN Eucalypts
by: Benoit Belleville
Published: (2016)
by: Benoit Belleville
Published: (2016)
Assessment of physical and mechanical properties of Papua New Guinea timber species
by: Benoit Belleville
Published: (2020)
by: Benoit Belleville
Published: (2020)
WOOD MACHINING PROPERTIES OF AUSTRALIAN PLANTATION-GROWN EUCALYPTS
by: Benoit Belleville
Published: (2016)
by: Benoit Belleville
Published: (2016)
Numerical Analysis of Lensless Imaging with Active Metasurfaces and Single-Pixel Detectors
by: Belleville, Julie, et al.
Published: (2024)
by: Belleville, Julie, et al.
Published: (2024)
Clinical QA 2.0: Multi-Task Learning for Answer Extraction and Categorization
by: Pattnayak, Priyaranjan, et al.
Published: (2025)
by: Pattnayak, Priyaranjan, et al.
Published: (2025)
Two-Stage Quranic QA via Ensemble Retrieval and Instruction-Tuned Answer Extraction
by: Basem, Mohamed, et al.
Published: (2025)
by: Basem, Mohamed, et al.
Published: (2025)
Educational attainment mitigates hippocampal‐related episodic memory decline in individuals at risk of Alzheimer's disease
by: Annalise Aleta LaPlume, et al.
Published: (2026)
by: Annalise Aleta LaPlume, et al.
Published: (2026)
Scientific QA System with Verifiable Answers
by: Ljajić, Adela, et al.
Published: (2024)
by: Ljajić, Adela, et al.
Published: (2024)
NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA
by: Pahilajani, Anish, et al.
Published: (2024)
by: Pahilajani, Anish, et al.
Published: (2024)
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
by: Mozafari, Jamshid, et al.
Published: (2025)
by: Mozafari, Jamshid, et al.
Published: (2025)
Volume Tracking Based Reference Mesh Extraction for Time-Varying Mesh Compression
by: Chen, Guodong, et al.
Published: (2024)
by: Chen, Guodong, et al.
Published: (2024)
Explainablity QA dataset
by: Anonymous
Published: (2026)
by: Anonymous
Published: (2026)
Purely pseudo-Anosov subgroups of the genus two handlebody group
by: Chesser, Marissa, et al.
Published: (2023)
by: Chesser, Marissa, et al.
Published: (2023)
Optimal Differentially Private Sampling of Unbounded Gaussians
by: Iverson, Valentio, et al.
Published: (2025)
by: Iverson, Valentio, et al.
Published: (2025)
RealTime QA: What's the Answer Right Now?
by: Kasai, Jungo, et al.
Published: (2022)
by: Kasai, Jungo, et al.
Published: (2022)
ExpertQA: Expert-Curated Questions and Attributed Answers
by: Malaviya, Chaitanya, et al.
Published: (2023)
by: Malaviya, Chaitanya, et al.
Published: (2023)
Ensuring Robustness in ML-enabled Software Systems: A User Survey
by: Abdelkader, Hala, et al.
Published: (2025)
by: Abdelkader, Hala, et al.
Published: (2025)
A taxonomy of grain boundary migration mechanisms via displacement texture characterization
by: Chesser, Ian, et al.
Published: (2021)
by: Chesser, Ian, et al.
Published: (2021)
Decentralized Multi-product Pricing: Diagonal Dominance, Nash Equilibrium, and Price of Anarchy
by: Chen, Boxiao, et al.
Published: (2026)
by: Chen, Boxiao, et al.
Published: (2026)
EEE-QA: Exploring Effective and Efficient Question-Answer Representations
by: Hu, Zhanghao, et al.
Published: (2024)
by: Hu, Zhanghao, et al.
Published: (2024)
HARP: Measuring Harm Amplification in Multi-Agent LLM Systems
by: Rahman, Md Hafizur, et al.
Published: (2026)
by: Rahman, Md Hafizur, et al.
Published: (2026)
Return of EM: Entity-driven Answer Set Expansion for QA Evaluation
by: Lee, Dongryeol, et al.
Published: (2024)
by: Lee, Dongryeol, et al.
Published: (2024)
MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare
by: Alhuzali, Hassan, et al.
Published: (2024)
by: Alhuzali, Hassan, et al.
Published: (2024)
Similar Items
-
RAGProbe: An Automated Approach for Evaluating RAG Applications
by: Sivasothy, Shangeetha, et al.
Published: (2024) -
LLMs for Test Input Generation for Semantic Caches
by: Rasool, Zafaryab, et al.
Published: (2024) -
The M-factor: A Novel Metric for Evaluating Neural Architecture Search in Resource-Constrained Environments
by: Thudumu, Srikanth, et al.
Published: (2025) -
A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions
by: Du, Hung, et al.
Published: (2024) -
Contextual Knowledge Sharing in Multi-Agent Reinforcement Learning with Decentralized Communication and Coordination
by: Du, Hung, et al.
Published: (2025)