:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sivasothy, Shangeetha, Barnett, Scott, Logothetis, Rena, Abdelrazek, Mohamed, Rasool, Zafaryab, Thudumu, Srikanth, Brannelly, Zac
Format:	Preprint
Published:	2024
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2406.06835
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Seven Failure Points When Engineering a Retrieval Augmented Generation System
by: Barnett, Scott, et al.
Published: (2024)

LLMs for Test Input Generation for Semantic Caches
by: Rasool, Zafaryab, et al.
Published: (2024)

RAGProbe: An Automated Approach for Evaluating RAG Applications
by: Sivasothy, Shangeetha, et al.
Published: (2024)

The M-factor: A Novel Metric for Evaluating Neural Architecture Search in Resource-Constrained Environments
by: Thudumu, Srikanth, et al.
Published: (2025)

ML-On-Rails: Safeguarding Machine Learning Models in Software Systems A Case Study
by: Abdelkader, Hala, et al.
Published: (2024)

Ensuring Robustness in ML-enabled Software Systems: A User Survey
by: Abdelkader, Hala, et al.
Published: (2025)

Symbolic Execution Meets Multi-LLM Orchestration: Detecting Memory Vulnerabilities in Incomplete Rust CVE Snippets
by: Abdelrazek, Zeyad, et al.
Published: (2026)

A framework for assessing the capabilities of code generation of constraint domain-specific languages with large language models
by: Delgado, David, et al.
Published: (2026)

Large language models for behavioral modeling: A literature survey
by: Laiq, Muhammad
Published: (2025)

Minimising changes to audit when updating decision trees
by: Simmons, Anj, et al.
Published: (2024)

The importance of visual modelling languages in generative software engineering
by: Rossi, Roberto
Published: (2024)

TaskEval: Synthesised Evaluation for Foundation-Model Tasks
by: Widanapathiranage, Dilani, et al.
Published: (2025)

Designing and Implementing Robust Test Automation Frameworks using Cucumber BDD and Java
by: Srinivas, Srikanth, et al.
Published: (2025)

Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models
by: Barnett, Scott, et al.
Published: (2024)

ALPINE: An adaptive language-agnostic pruning method for language models for code
by: Saad, Mootez, et al.
Published: (2024)

Large language models for automated PRISMA 2020 adherence checking
by: Kataoka, Yuki, et al.
Published: (2025)

Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering
by: Maharaj, Kishan, et al.
Published: (2026)

Agentic Property-Based Testing: Finding Bugs Across the Python Ecosystem
by: Maaz, Muhammad, et al.
Published: (2025)

Using rule engine in self-healing systems and MAPE model
by: Yazdanparast, Zahra
Published: (2024)

An approach for API synthesis using large language models
by: Zhong, Hua, et al.
Published: (2025)

Monitoring Machine Learning Systems: A Multivocal Literature Review
by: Naveed, Hira, et al.
Published: (2025)

An empirical study of LoRA-based fine-tuning of large language models for automated test case generation
by: Moradi, Milad, et al.
Published: (2026)

Extending ResourceLink: Patterns for Large Dataset Processing in MCP Applications
by: Frees, Scott
Published: (2025)

SPVR: syntax-to-prompt vulnerability repair based on large language models
by: Wang, Ruoke, et al.
Published: (2024)

Evaluating LLM-generated code for domain-specific languages: molecular dynamics with LAMMPS
by: Holbrook, Ethan, et al.
Published: (2026)

EnseSmells: Deep ensemble and programming language models for automated code smells detection
by: Ho, Anh, et al.
Published: (2025)

Towards a unified user modeling language for engineering human centered AI systems
by: Conrardy, Aaron, et al.
Published: (2025)

Comparing large language models and human programmers for generating programming code
by: Hou, Wenpin, et al.
Published: (2024)

Automatically generating decision-support chatbots based on DMN models
by: Estrada-Torres, Bedilia, et al.
Published: (2024)

On the synchronization between Hugging Face pre-trained language models and their upstream GitHub repository
by: Ajibode, Adekunle, et al.
Published: (2025)

Codellm-Devkit: A Framework for Contextualizing Code LLMs with Program Analysis Insights
by: Krishna, Rahul, et al.
Published: (2024)

Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation
by: Ouédraogo, Wendkûuni C., et al.
Published: (2024)

Enabling Communication via APIs for Mainframe Applications
by: Kanvar, Vini, et al.
Published: (2024)

Usefulness of data flow diagrams and large language models for security threat validation: a registered report
by: Mbaka, Winnie Bahati, et al.
Published: (2024)

A Code Comprehension Benchmark for Large Language Models for Code
by: Havare, Jayant, et al.
Published: (2025)

Finding Cross-rule Optimization Bugs in Datalog Engines
by: Zhang, Chi, et al.
Published: (2024)

Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding
by: Havare, Jayant, et al.
Published: (2026)

ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
by: Kammakomati, Mehant, et al.
Published: (2024)

Octopus: On-device language model for function calling of software APIs
by: Chen, Wei, et al.
Published: (2024)

GeoAnalystBench: A GeoAI benchmark for assessing large language models for spatial analysis workflow and code generation
by: Zhang, Qianheng, et al.
Published: (2025)