| Main Authors: | Sivasothy, Shangeetha, Barnett, Scott, Logothetis, Rena, Abdelrazek, Mohamed, Rasool, Zafaryab, Thudumu, Srikanth, Brannelly, Zac |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.06835 |
Similar Items
Seven Failure Points When Engineering a Retrieval Augmented Generation System
by: Barnett, Scott, et al.
Published: (2024)
LLMs for Test Input Generation for Semantic Caches
by: Rasool, Zafaryab, et al.
Published: (2024)
RAGProbe: An Automated Approach for Evaluating RAG Applications
by: Sivasothy, Shangeetha, et al.
Published: (2024)
The M-factor: A Novel Metric for Evaluating Neural Architecture Search in Resource-Constrained Environments
by: Thudumu, Srikanth, et al.
Published: (2025)
ML-On-Rails: Safeguarding Machine Learning Models in Software Systems A Case Study
by: Abdelkader, Hala, et al.
Published: (2024)
Ensuring Robustness in ML-enabled Software Systems: A User Survey
by: Abdelkader, Hala, et al.
Published: (2025)
Symbolic Execution Meets Multi-LLM Orchestration: Detecting Memory Vulnerabilities in Incomplete Rust CVE Snippets
by: Abdelrazek, Zeyad, et al.
Published: (2026)
A framework for assessing the capabilities of code generation of constraint domain-specific languages with large language models
by: Delgado, David, et al.
Published: (2026)
Large language models for behavioral modeling: A literature survey
by: Laiq, Muhammad
Published: (2025)
Minimising changes to audit when updating decision trees
by: Simmons, Anj, et al.
Published: (2024)
The importance of visual modelling languages in generative software engineering
by: Rossi, Roberto
Published: (2024)
TaskEval: Synthesised Evaluation for Foundation-Model Tasks
by: Widanapathiranage, Dilani, et al.
Published: (2025)
Designing and Implementing Robust Test Automation Frameworks using Cucumber BDD and Java
by: Srinivas, Srikanth, et al.
Published: (2025)
Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models
by: Barnett, Scott, et al.
Published: (2024)
ALPINE: An adaptive language-agnostic pruning method for language models for code
by: Saad, Mootez, et al.
Published: (2024)
Large language models for automated PRISMA 2020 adherence checking
by: Kataoka, Yuki, et al.
Published: (2025)
Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering
by: Maharaj, Kishan, et al.
Published: (2026)
Agentic Property-Based Testing: Finding Bugs Across the Python Ecosystem
by: Maaz, Muhammad, et al.
Published: (2025)
Using rule engine in self-healing systems and MAPE model
by: Yazdanparast, Zahra
Published: (2024)
An approach for API synthesis using large language models
by: Zhong, Hua, et al.
Published: (2025)
Monitoring Machine Learning Systems: A Multivocal Literature Review
by: Naveed, Hira, et al.
Published: (2025)
An empirical study of LoRA-based fine-tuning of large language models for automated test case generation
by: Moradi, Milad, et al.
Published: (2026)
Extending ResourceLink: Patterns for Large Dataset Processing in MCP Applications
by: Frees, Scott
Published: (2025)
SPVR: syntax-to-prompt vulnerability repair based on large language models
by: Wang, Ruoke, et al.
Published: (2024)
Evaluating LLM-generated code for domain-specific languages: molecular dynamics with LAMMPS
by: Holbrook, Ethan, et al.
Published: (2026)
EnseSmells: Deep ensemble and programming language models for automated code smells detection
by: Ho, Anh, et al.
Published: (2025)
Towards a unified user modeling language for engineering human centered AI systems
by: Conrardy, Aaron, et al.
Published: (2025)
Comparing large language models and human programmers for generating programming code
by: Hou, Wenpin, et al.
Published: (2024)
Automatically generating decision-support chatbots based on DMN models
by: Estrada-Torres, Bedilia, et al.
Published: (2024)
On the synchronization between Hugging Face pre-trained language models and their upstream GitHub repository
by: Ajibode, Adekunle, et al.
Published: (2025)
Codellm-Devkit: A Framework for Contextualizing Code LLMs with Program Analysis Insights
by: Krishna, Rahul, et al.
Published: (2024)
Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation
by: Ouédraogo, Wendkûuni C., et al.
Published: (2024)
Enabling Communication via APIs for Mainframe Applications
by: Kanvar, Vini, et al.
Published: (2024)
Usefulness of data flow diagrams and large language models for security threat validation: a registered report
by: Mbaka, Winnie Bahati, et al.
Published: (2024)
A Code Comprehension Benchmark for Large Language Models for Code
by: Havare, Jayant, et al.
Published: (2025)
Finding Cross-rule Optimization Bugs in Datalog Engines
by: Zhang, Chi, et al.
Published: (2024)
Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding
by: Havare, Jayant, et al.
Published: (2026)
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
by: Kammakomati, Mehant, et al.
Published: (2024)
Octopus: On-device language model for function calling of software APIs
by: Chen, Wei, et al.
Published: (2024)
GeoAnalystBench: A GeoAI benchmark for assessing large language models for spatial analysis workflow and code generation
by: Zhang, Qianheng, et al.
Published: (2025)