| Main Authors: | Zhang, Lily H., Ranganath, Rajesh, Tafvizi, Arya |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.13660 |
Similar Items
Preference Learning Algorithms Do Not Learn Preference Rankings
by: Chen, Angelica, et al.
Published: (2024)
Preference learning made easy: Everything should be understood through win rate
by: Zhang, Lily H., et al.
Published: (2025)
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities
by: Saporta, Adriel, et al.
Published: (2024)
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
by: Yamaguchi, Atsuki, et al.
Published: (2025)
Target-driven Attack for Large Language Models
by: Zhang, Chong, et al.
Published: (2024)
Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning
by: Krishnan, Ranganath, et al.
Published: (2024)
Towards Generalized Offensive Language Identification
by: Dmonte, Alphaeus, et al.
Published: (2024)
Target-Aware Language Modeling via Granular Data Sampling
by: Chang, Ernie, et al.
Published: (2024)
Adapting Chat Language Models Using Only Target Unlabeled Language Data
by: Yamaguchi, Atsuki, et al.
Published: (2024)
Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models
by: Bianco, Pedro Dal, et al.
Published: (2026)
Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models
by: Pipatanakul, Kunat, et al.
Published: (2026)
Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models
by: Yuan, Chenchen, et al.
Published: (2025)
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights
by: Valero, Eneko, et al.
Published: (2026)
Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models
by: Vorakitphan, Vorakit, et al.
Published: (2024)
Zero-shot Cross-lingual Transfer Learning with Multiple Source and Target Languages for Information Extraction: Language Selection and Adversarial Training
by: Ngo, Nghia Trung, et al.
Published: (2024)
COFT: Counterfactual-Conformal Decoding for Fair Chain-of-Thought Reasoning in Large Language Models
by: Fayyazi, Arya, et al.
Published: (2026)
Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models
by: Davies, Harry J., et al.
Published: (2024)
Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models
by: Shah, Arya, et al.
Published: (2026)
Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation
by: Su, Zeli, et al.
Published: (2026)
Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
by: Wang, Jian, et al.
Published: (2024)
Multi-Attribute Steering of Language Models via Targeted Intervention
by: Nguyen, Duy, et al.
Published: (2025)
Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
by: Zhang, Rongzhi, et al.
Published: (2025)
Targeted Remasking: Replacing Token Editing with Token-to-Mask Refinement in Discrete Diffusion Language Models
by: Yao, Lin
Published: (2026)
Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning
by: Li, Yajie, et al.
Published: (2025)
Correcting Negative Bias in Large Language Models through Negative Attention Score Alignment
by: Yu, Sangwon, et al.
Published: (2024)
Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs
by: Saha, Anusa, et al.
Published: (2026)
Context Matters: Incorporating Target Awareness in Conversational Abusive Language Detection
by: Alharthi, Raneem, et al.
Published: (2025)
Atoxia: Red-teaming Large Language Models with Target Toxic Answers
by: Du, Yuhao, et al.
Published: (2024)
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
by: Alzahrani, Norah, et al.
Published: (2024)
Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning
by: Lin, Hong-Yun, et al.
Published: (2025)
Multilingual Target-Stance Extraction
by: Mines, Ethan, et al.
Published: (2025)
The Impact of Negated Text on Hallucination with Large Language Models
by: Seo, Jaehyung, et al.
Published: (2025)
Toward Robust In-Context Learning: Leveraging Out-of-distribution Proxies for Target Inaccessible Demonstration Retrieval
by: Xu, Hao, et al.
Published: (2026)
Where Does Toxicity Live? Mechanistic Localization and Targeted Suppression in Language Models
by: Beniwal, Himanshu, et al.
Published: (2026)
One Instruction Does Not Fit All: How Well Do Embeddings Align Personas and Instructions in Low-Resource Indian Languages?
by: Shah, Arya, et al.
Published: (2026)
PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference
by: Zhang, Hao, et al.
Published: (2025)
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training
by: Brandfonbrener, David, et al.
Published: (2024)
Disentangling Hate Across Target Identities
by: Jin, Yiping, et al.
Published: (2024)
Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language
by: Paul, Ronny, et al.
Published: (2024)
Evaluating Large Language Models for Stance Detection on Financial Targets from SEC Filing Reports and Earnings Call Transcripts
by: Gyawali, Nikesh, et al.
Published: (2025)