:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Magal, Nicholas, Tran, Minh, Arakawa, Riku, Nie, Suzanne
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2501.00865
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

IMUCoCo: Enabling Flexible On-Body IMU Placement for Human Pose Estimation and Activity Recognition
by: Zhou, Haozhe, et al.
Published: (2025)

DPad: Efficient Diffusion Language Models with Suffix Dropout
by: Chen, Xinhua, et al.
Published: (2025)

Supporting Experts with a Multimodal Machine-Learning-Based Tool for Human Behavior Analysis of Conversational Videos
by: Arakawa, Riku, et al.
Published: (2024)

Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training
by: Li, Qingyang, et al.
Published: (2024)

Deep Augmentation: Dropout as Augmentation for Self-Supervised Learning
by: Brüel-Gabrielsson, Rickard, et al.
Published: (2023)

Pretraining Language Models with Subword Regularization: An Empirical Study of BPE Dropout in Low-Resource NLP
by: Visser, Ruan, et al.
Published: (2026)

Negation Neglect: When models fail to learn negations in training
by: Mayne, Harry, et al.
Published: (2026)

Re-examining learning linear functions in context
by: Naim, Omar, et al.
Published: (2024)

Sensitivity-Positional Co-Localization in GQA Transformers
by: Rao, Manoj Chandrashekar
Published: (2026)

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
by: Ma, Avery, et al.
Published: (2025)

Predicting First Year Dropout from Pre Enrolment Motivation Statements Using Text Mining
by: Soppe, K. F. B., et al.
Published: (2025)

TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)

Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities
by: Geng, Jiahui, et al.
Published: (2025)

Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining
by: Liu, Chenxi, et al.
Published: (2025)

JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
by: Xue, Jieying, et al.
Published: (2025)

Extracting Small Translation Specialists from LLMs by Aggressively Pruning Experts
by: Martin, Liu O., et al.
Published: (2026)

Text-Enhanced Data-free Approach for Federated Class-Incremental Learning
by: Tran, Minh-Tuan, et al.
Published: (2024)

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
by: Zhu, Xinyu, et al.
Published: (2025)

TaCo: Targeted Concept Erasure Prevents Non-Linear Classifiers From Detecting Protected Attributes
by: Jourdan, Fanny, et al.
Published: (2023)

Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
by: Le, Quang-Hung, et al.
Published: (2024)

On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization
by: Deng, Wenlong, et al.
Published: (2025)

Data Augmentation for Classification of Negative Pregnancy Outcomes in Imbalanced Data
by: Biswas, Md Badsha
Published: (2025)

Effective Context Modeling Framework for Emotion Recognition in Conversations
by: Van, Cuong Tran, et al.
Published: (2024)

$μ^2$Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
by: Li, Siyou, et al.
Published: (2025)

Selective Rotary Position Embedding
by: Movahedi, Sajad, et al.
Published: (2025)

On the Geometry of Positional Encodings in Transformers
by: Cirrincione, Giansalvo
Published: (2026)

Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings
by: Zuo, Chunsheng, et al.
Published: (2024)

CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending
by: Zhu, Shiyi, et al.
Published: (2023)

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
by: Solatorio, Aivin V.
Published: (2024)

AF-KAN: Activation Function-Based Kolmogorov-Arnold Networks for Efficient Representation Learning
by: Ta, Hoang-Thang, et al.
Published: (2025)

Reward-Based Online LLM Routing via NeuralUCB
by: Tsai, Ming-Hua, et al.
Published: (2026)

Detecting and Rectifying Noisy Labels: A Similarity-based Approach
by: Huu-Tien, Dang, et al.
Published: (2025)

LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples
by: Liu, Yezi, et al.
Published: (2025)

ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
by: She, Jingyuan Selena, et al.
Published: (2023)

Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding
by: Feng, Xincan, et al.
Published: (2024)

More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)

Beyond Forgetting: Machine Unlearning Elicits Controllable Side Behaviors and Capabilities
by: Dang, Tien, et al.
Published: (2026)

Multi-Modal Framing Analysis of News
by: Arora, Arnav, et al.
Published: (2025)

One STEP at a time: Language Agents are Stepwise Planners
by: Nguyen, Minh, et al.
Published: (2024)

Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training
by: Tran, Toan, et al.
Published: (2025)