Saved in:
| Main Authors: | Magal, Nicholas, Tran, Minh, Arakawa, Riku, Nie, Suzanne |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.00865 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
IMUCoCo: Enabling Flexible On-Body IMU Placement for Human Pose Estimation and Activity Recognition
by: Zhou, Haozhe, et al.
Published: (2025)
by: Zhou, Haozhe, et al.
Published: (2025)
DPad: Efficient Diffusion Language Models with Suffix Dropout
by: Chen, Xinhua, et al.
Published: (2025)
by: Chen, Xinhua, et al.
Published: (2025)
Supporting Experts with a Multimodal Machine-Learning-Based Tool for Human Behavior Analysis of Conversational Videos
by: Arakawa, Riku, et al.
Published: (2024)
by: Arakawa, Riku, et al.
Published: (2024)
Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training
by: Li, Qingyang, et al.
Published: (2024)
by: Li, Qingyang, et al.
Published: (2024)
Deep Augmentation: Dropout as Augmentation for Self-Supervised Learning
by: Brüel-Gabrielsson, Rickard, et al.
Published: (2023)
by: Brüel-Gabrielsson, Rickard, et al.
Published: (2023)
Pretraining Language Models with Subword Regularization: An Empirical Study of BPE Dropout in Low-Resource NLP
by: Visser, Ruan, et al.
Published: (2026)
by: Visser, Ruan, et al.
Published: (2026)
Negation Neglect: When models fail to learn negations in training
by: Mayne, Harry, et al.
Published: (2026)
by: Mayne, Harry, et al.
Published: (2026)
Re-examining learning linear functions in context
by: Naim, Omar, et al.
Published: (2024)
by: Naim, Omar, et al.
Published: (2024)
Sensitivity-Positional Co-Localization in GQA Transformers
by: Rao, Manoj Chandrashekar
Published: (2026)
by: Rao, Manoj Chandrashekar
Published: (2026)
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
by: Ma, Avery, et al.
Published: (2025)
by: Ma, Avery, et al.
Published: (2025)
Predicting First Year Dropout from Pre Enrolment Motivation Statements Using Text Mining
by: Soppe, K. F. B., et al.
Published: (2025)
by: Soppe, K. F. B., et al.
Published: (2025)
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)
by: Dinh, Quang Minh, et al.
Published: (2024)
Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities
by: Geng, Jiahui, et al.
Published: (2025)
by: Geng, Jiahui, et al.
Published: (2025)
Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining
by: Liu, Chenxi, et al.
Published: (2025)
by: Liu, Chenxi, et al.
Published: (2025)
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
by: Xue, Jieying, et al.
Published: (2025)
by: Xue, Jieying, et al.
Published: (2025)
Extracting Small Translation Specialists from LLMs by Aggressively Pruning Experts
by: Martin, Liu O., et al.
Published: (2026)
by: Martin, Liu O., et al.
Published: (2026)
Text-Enhanced Data-free Approach for Federated Class-Incremental Learning
by: Tran, Minh-Tuan, et al.
Published: (2024)
by: Tran, Minh-Tuan, et al.
Published: (2024)
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
by: Zhu, Xinyu, et al.
Published: (2025)
by: Zhu, Xinyu, et al.
Published: (2025)
TaCo: Targeted Concept Erasure Prevents Non-Linear Classifiers From Detecting Protected Attributes
by: Jourdan, Fanny, et al.
Published: (2023)
by: Jourdan, Fanny, et al.
Published: (2023)
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
by: Le, Quang-Hung, et al.
Published: (2024)
by: Le, Quang-Hung, et al.
Published: (2024)
On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization
by: Deng, Wenlong, et al.
Published: (2025)
by: Deng, Wenlong, et al.
Published: (2025)
Data Augmentation for Classification of Negative Pregnancy Outcomes in Imbalanced Data
by: Biswas, Md Badsha
Published: (2025)
by: Biswas, Md Badsha
Published: (2025)
Effective Context Modeling Framework for Emotion Recognition in Conversations
by: Van, Cuong Tran, et al.
Published: (2024)
by: Van, Cuong Tran, et al.
Published: (2024)
$μ^2$Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
by: Li, Siyou, et al.
Published: (2025)
by: Li, Siyou, et al.
Published: (2025)
Selective Rotary Position Embedding
by: Movahedi, Sajad, et al.
Published: (2025)
by: Movahedi, Sajad, et al.
Published: (2025)
On the Geometry of Positional Encodings in Transformers
by: Cirrincione, Giansalvo
Published: (2026)
by: Cirrincione, Giansalvo
Published: (2026)
Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings
by: Zuo, Chunsheng, et al.
Published: (2024)
by: Zuo, Chunsheng, et al.
Published: (2024)
CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending
by: Zhu, Shiyi, et al.
Published: (2023)
by: Zhu, Shiyi, et al.
Published: (2023)
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
by: Solatorio, Aivin V.
Published: (2024)
by: Solatorio, Aivin V.
Published: (2024)
AF-KAN: Activation Function-Based Kolmogorov-Arnold Networks for Efficient Representation Learning
by: Ta, Hoang-Thang, et al.
Published: (2025)
by: Ta, Hoang-Thang, et al.
Published: (2025)
Reward-Based Online LLM Routing via NeuralUCB
by: Tsai, Ming-Hua, et al.
Published: (2026)
by: Tsai, Ming-Hua, et al.
Published: (2026)
Detecting and Rectifying Noisy Labels: A Similarity-based Approach
by: Huu-Tien, Dang, et al.
Published: (2025)
by: Huu-Tien, Dang, et al.
Published: (2025)
LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples
by: Liu, Yezi, et al.
Published: (2025)
by: Liu, Yezi, et al.
Published: (2025)
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
by: She, Jingyuan Selena, et al.
Published: (2023)
by: She, Jingyuan Selena, et al.
Published: (2023)
Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding
by: Feng, Xincan, et al.
Published: (2024)
by: Feng, Xincan, et al.
Published: (2024)
More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)
by: Lv, Ang, et al.
Published: (2024)
Beyond Forgetting: Machine Unlearning Elicits Controllable Side Behaviors and Capabilities
by: Dang, Tien, et al.
Published: (2026)
by: Dang, Tien, et al.
Published: (2026)
Multi-Modal Framing Analysis of News
by: Arora, Arnav, et al.
Published: (2025)
by: Arora, Arnav, et al.
Published: (2025)
One STEP at a time: Language Agents are Stepwise Planners
by: Nguyen, Minh, et al.
Published: (2024)
by: Nguyen, Minh, et al.
Published: (2024)
Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training
by: Tran, Toan, et al.
Published: (2025)
by: Tran, Toan, et al.
Published: (2025)
Similar Items
-
IMUCoCo: Enabling Flexible On-Body IMU Placement for Human Pose Estimation and Activity Recognition
by: Zhou, Haozhe, et al.
Published: (2025) -
DPad: Efficient Diffusion Language Models with Suffix Dropout
by: Chen, Xinhua, et al.
Published: (2025) -
Supporting Experts with a Multimodal Machine-Learning-Based Tool for Human Behavior Analysis of Conversational Videos
by: Arakawa, Riku, et al.
Published: (2024) -
Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training
by: Li, Qingyang, et al.
Published: (2024) -
Deep Augmentation: Dropout as Augmentation for Self-Supervised Learning
by: Brüel-Gabrielsson, Rickard, et al.
Published: (2023)