| Main Authors: | Gohari, Hajar Emami, Kadhe, Swanand Ravindra, Shah, Syed Yousaf, Adam, Constantin, Adebayo, Abdulhamid, Adusumilli, Praneet, Ahmed, Farhan, Angel, Nathalie Baracaldo, Borse, Santosh Subhashrao, Chang, Yuan-Chi, Dang, Xuan-Hong, Desai, Nirmit, Eres, Revital, Iwamoto, Ran, Karve, Alexei, Koyfman, Yan, Lee, Wei-Han, Liu, Changchang, Lublinsky, Boris, Ohko, Takuyo, Pesce, Pablo, Touma, Maroun, Wang, Shiqiang, Witherspoon, Shalisha, Woisetschläger, Herbert, Wood, David, Wu, Kun-Lung, Yoshida, Issei, Zawad, Syed, Zerfos, Petros, Zhou, Yi, Bhattacharjee, Bishwaranjan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.14907 |
Similar Items
SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging
by: Djuhera, Aladin, et al.
Published: (2025)
Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance
by: Djuhera, Aladin, et al.
Published: (2025)
TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents
by: Djuhera, Aladin, et al.
Published: (2026)
When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets
by: Djuhera, Aladin, et al.
Published: (2025)
SafeCOMM: A Study on Safety Degradation in Fine-Tuned Telecom Large Language Models
by: Djuhera, Aladin, et al.
Published: (2025)
Towards a Re-evaluation of Data Forging Attacks in Practice
by: Suliman, Mohamed, et al.
Published: (2024)
Evaluating the Dynamics of Membership Privacy in Deep Learning
by: Chen, Yuetian, et al.
Published: (2025)
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
by: Kadhe, Swanand Ravindra, et al.
Published: (2024)
Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble
by: Wang, Zhiqi, et al.
Published: (2025)
Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
by: Jiang, Shuli, et al.
Published: (2024)
In-Context Probing for Membership Inference in Fine-Tuned Language Models
by: Lu, Zhexi, et al.
Published: (2025)
Data-Prep-Kit: getting your data ready for LLM application development
by: Wood, David, et al.
Published: (2024)
Know When To Fold 'Em: Token-Efficient LLM Synthetic Data Generation via Multi-Stage In-Flight Rejection
by: Chowdhury, Anjir Ahmed, et al.
Published: (2026)
STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMs
by: An, Sungeun, et al.
Published: (2026)
PPFS: Predictive Permutation Feature Selection
by: Hassan, Atif, et al.
Published: (2021)
PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts
by: Chowdhury, Anjir Ahmed, et al.
Published: (2026)
AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows
by: Ngong, Ivoline C., et al.
Published: (2026)
You've Got to be Efficient: Ambiguity, Misspecification and Variational Preferences
by: Adusumilli, Karun
Published: (2026)
Continuous time asymptotic representations for adaptive experiments
by: Adusumilli, Karun
Published: (2026)
How to sample and when to stop sampling: The generalized Wald problem and minimax policies
by: Adusumilli, Karun
Published: (2022)
Risk and optimal policies in bandit experiments
by: Adusumilli, Karun
Published: (2021)
Evolutionary Novelties in Bacteria and the Missing Backdrop of the Environment
by: Shraddha Karve
Published: (2025)
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents
by: Ngong, Ivoline, et al.
Published: (2025)
Against the Monolithic Wireless World Model: Why NextG Needs Composable and Agentic Intelligence
by: Djuhera, Aladin, et al.
Published: (2026)
Tool Forge: A Validation-Carrying Toolchain for Governed Agentic Execution
by: Rao, Swanand
Published: (2026)
The Dirac Delta as a Singular Potential for the 2D Schrodinger Equation
by: Maroun, Michael
Published: (2023)
The Energy Eigenvalue for the Singular Wave Function of the Three Dimensional Dirac Delta Schrodinger Potential via Distributionally Generalized Quantum Mechanics
by: Maroun, Michael
Published: (2021)
Relación entre dopamina e insulina en sujetos sanos y diabeticos tipo 2
by: Charbel Maroun
Published: (2006)
Language and Art in the Navajo Universe
by: Witherspoon, Gary
Published: (2025)
Sakai-Sugimoto Model in an Off-Shell: Chiral Lagrangian to All Orders
by: Lublinsky, Michael, et al.
Published: (2025)
Asplenium X kentuckiense on Granitic Gneiss in Georgia
by: Duncan, Wilbur H. (Wilbur Howard)
Published: (1966)
A HISTÓRIA DE UM DICIONÁRIO BILÍNGÜE
by: Gretel Eres Fernández
Published: (2006)
Roberto Romero Sandoval (ed.), Cuevas y cenotes mayas; una mirada multidisciplinaria. México: Universidad Nacional Autónoma de México, Instituto de Investigaciones Filológicas, Centro de Estudios Mayas, 2016, 195 pp.
by: Ana Somohano Eres
Published: (2017)
Leitura em língua estrangeira: entre o ensino médio e o vestibular
by: Gretel Eres Fernández
Published: (2006)
Efficient Models for the Detection of Hate, Abuse and Profanity
by: Tillmann, Christoph, et al.
Published: (2024)
Agentic Performance at the Edge: Insights from Benchmarking
by: Wang, Shiqiang, et al.
Published: (2026)
SIGN: Schema-Induced Games for Naming
by: Zhang, Ryan, et al.
Published: (2025)
Designing Persuasive Experiments
by: Adusumilli, Karun, et al.
Published: (2026)
A Dense and Efficient Instruction Set Architecture Encoding
by: Maroun, Emad Jacob
Published: (2025)
Exact Community Recovery under Side Information: Optimality of Spectral Algorithms
by: Gaudio, Julia, et al.
Published: (2024)