:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Xuan, Weihao, Yang, Rui, Qi, Heli, Zeng, Qingcheng, Xiao, Yunze, Feng, Aosong, Liu, Dairui, Xing, Yun, Wang, Junjue, Gao, Fan, Lu, Jinghui, Jiang, Yuang, Li, Huitao, Li, Xin, Yu, Kunyu, Dong, Ruihai, Gu, Shangding, Li, Yuekang, Xie, Xiaofei, Juefei-Xu, Felix, Khomh, Foutse, Yoshie, Osamu, Chen, Qingyu, Teodoro, Douglas, Liu, Nan, Goebel, Randy, Ma, Lei, Marrese-Taylor, Edison, Lu, Shijian, Iwasawa, Yusuke, Matsuo, Yutaka, Li, Irene
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computation and Language
Online-Zugang:	https://arxiv.org/abs/2503.10497
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents
von: Xuan, Weihao, et al.
Veröffentlicht: (2026)

Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts
von: Gao, Fan, et al.
Veröffentlicht: (2023)

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models
von: Xuan, Weihao, et al.
Veröffentlicht: (2025)

Topic-Centric Explanations for News Recommendation
von: Liu, Dairui, et al.
Veröffentlicht: (2023)

On the Effectiveness of Log Representation for Log-based Anomaly Detection
von: Wu, Xingfang, et al.
Veröffentlicht: (2023)

Protecting Privacy in Software Logs: What Should Be Anonymized?
von: Aghili, Roozbeh, et al.
Veröffentlicht: (2024)

What Information Contributes to Log-based Anomaly Detection? Insights from a Configurable Transformer-Based Approach
von: Wu, Xingfang, et al.
Veröffentlicht: (2024)

Representation Improvement in Latent Space for Search-Based Testing of Autonomous Robotic Systems
von: Humeniuk, Dmytro, et al.
Veröffentlicht: (2025)

An Efficient Model Maintenance Approach for MLOps
von: Majidi, Forough, et al.
Veröffentlicht: (2024)

Understanding Web Application Workloads and Their Applications: Systematic Literature Review and Characterization
von: Aghili, Roozbeh, et al.
Veröffentlicht: (2024)

Adversarial Attack Classification and Robustness Testing for Large Language Models for Code
von: Liu, Yang, et al.
Veröffentlicht: (2025)

Tracing Optimization for Performance Modeling and Regression Detection
von: Shahedi, Kaveh, et al.
Veröffentlicht: (2024)

SDLog: A Deep Learning Framework for Detecting Sensitive Information in Software Logs
von: Aghili, Roozbeh, et al.
Veröffentlicht: (2025)

Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education
von: Yang, Rui, et al.
Veröffentlicht: (2024)

Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education
von: Yang, Rui, et al.
Veröffentlicht: (2024)

From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs
von: Chen, Yingjian, et al.
Veröffentlicht: (2026)

Machine Learning Robustness: A Primer
von: Braiek, Houssem Ben, et al.
Veröffentlicht: (2024)

Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis
von: Song, Da, et al.
Veröffentlicht: (2026)

Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective
von: Yang, Rui, et al.
Veröffentlicht: (2024)

RecPrompt: A Self-tuning Prompting Framework for News Recommendation Using Large Language Models
von: Liu, Dairui, et al.
Veröffentlicht: (2023)

Transformers4NewsRec: A Transformer-based News Recommendation Framework
von: Liu, Dairui, et al.
Veröffentlicht: (2024)

Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code
von: Majdinasab, Vahid, et al.
Veröffentlicht: (2024)

GIST: Generated Inputs Sets Transferability in Deep Learning
von: Tambon, Florian, et al.
Veröffentlicht: (2023)

DeepCodeProbe: Towards Understanding What Models Trained on Code Learn
von: Majdinasab, Vahid, et al.
Veröffentlicht: (2024)

Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing
von: Humeniuk, Dmytro, et al.
Veröffentlicht: (2023)

Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing
von: Mzoughi, Seif, et al.
Veröffentlicht: (2025)

Prism: Dynamic and Flexible Benchmarking of LLMs Code Generation with Monte Carlo Tree Search
von: Majdinasab, Vahid, et al.
Veröffentlicht: (2025)

PathOCL: Path-Based Prompt Augmentation for OCL Generation with GPT-4
von: Abukhalaf, Seif, et al.
Veröffentlicht: (2024)

RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring
von: Oueslati, Khouloud, et al.
Veröffentlicht: (2025)

An Empirical Study on Method-Level Performance Evolution in Open-Source Java Projects
von: Shahedi, Kaveh, et al.
Veröffentlicht: (2025)

From Technical Excellence to Practical Adoption: Lessons Learned Building an ML-Enhanced Trace Analysis Tool
von: Shahedi, Kaveh, et al.
Veröffentlicht: (2025)

Improving the Robustness of Large Language Models for Code Tasks via Fine-tuning with Perturbed Data
von: Liu, Yang, et al.
Veröffentlicht: (2026)

Structural Anchors and Reasoning Fragility:Understanding CoT Robustness in LLM4Code
von: Liu, Yang, et al.
Veröffentlicht: (2026)

QMon: Monitoring the Execution of Quantum Circuits with Mid-Circuit Measurement and Reset
von: Ma, Ning, et al.
Veröffentlicht: (2025)

Refining GPT-3 Embeddings with a Siamese Structure for Technical Post Duplicate Detection
von: Wu, Xingfang, et al.
Veröffentlicht: (2023)

Direction-aware 3D Large Multimodal Models
von: Liu, Quan, et al.
Veröffentlicht: (2026)

Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models
von: Gu, Xiaojie, et al.
Veröffentlicht: (2026)

Investigating the Multilingual Calibration Effects of Language Model Instruction-Tuning
von: Huang, Jerry, et al.
Veröffentlicht: (2026)

An Empirical Study on Logging Evolution On Stack Overflow: Trends, Topics, and Challenges
von: Foalem, Patrick Loic, et al.
Veröffentlicht: (2026)

LLMs and Stack Overflow Discussions: Reliability, Impact, and Challenges
von: Da Silva, Leuson, et al.
Veröffentlicht: (2024)