Saved in:
| Main Authors: | Remfry, Elizabeth, Henkin, Rafael, Barnes, Michael R, Naik, Aakanksha |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.01331 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating Collaborative Data Practices: a Case Study on Artificial Intelligence for Healthcare Research
by: Henkin, Rafael, et al.
Published: (2023)
by: Henkin, Rafael, et al.
Published: (2023)
Intent-Aware Schema Generation And Refinement For Literature Review Tables
by: Padmakumar, Vishakh, et al.
Published: (2025)
by: Padmakumar, Vishakh, et al.
Published: (2025)
Distilling Multi-Scale Knowledge for Event Temporal Relation Extraction
by: Yao, Hao-Ren, et al.
Published: (2022)
by: Yao, Hao-Ren, et al.
Published: (2022)
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
by: Aakanksha, et al.
Published: (2024)
by: Aakanksha, et al.
Published: (2024)
Towards Compositionality in Concept Learning
by: Stein, Adam, et al.
Published: (2024)
by: Stein, Adam, et al.
Published: (2024)
Probabilistic Consensus through Ensemble Validation: A Framework for LLM Reliability
by: Naik, Ninad
Published: (2024)
by: Naik, Ninad
Published: (2024)
Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential
by: Samragh, Mohammad, et al.
Published: (2025)
by: Samragh, Mohammad, et al.
Published: (2025)
Do We Need Frontier Models to Verify Mathematical Proofs?
by: Naik, Aaditya, et al.
Published: (2026)
by: Naik, Aaditya, et al.
Published: (2026)
ForesightKV: Optimizing KV Cache Eviction for Reasoning Models by Learning Long-Term Contribution
by: Dong, Zican, et al.
Published: (2026)
by: Dong, Zican, et al.
Published: (2026)
Understanding How CodeLLMs (Mis)Predict Types with Activation Steering
by: Lucchetti, Francesca, et al.
Published: (2024)
by: Lucchetti, Francesca, et al.
Published: (2024)
Evaluating Very Long-Term Conversational Memory of LLM Agents
by: Maharana, Adyasha, et al.
Published: (2024)
by: Maharana, Adyasha, et al.
Published: (2024)
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference
by: Bhendawade, Nikhil, et al.
Published: (2025)
by: Bhendawade, Nikhil, et al.
Published: (2025)
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm
by: Aakanksha, et al.
Published: (2024)
by: Aakanksha, et al.
Published: (2024)
Towards a Realistic Long-Term Benchmark for Open-Web Research Agents
by: Mühlbacher, Peter, et al.
Published: (2024)
by: Mühlbacher, Peter, et al.
Published: (2024)
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
by: Chen, Guanzheng, et al.
Published: (2025)
by: Chen, Guanzheng, et al.
Published: (2025)
Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference
by: Wu, Zimeng, et al.
Published: (2026)
by: Wu, Zimeng, et al.
Published: (2026)
When Routine Chats Turn Toxic: Unintended Long-Term State Poisoning in Personalized Agents
by: Xu, Xiaoyu, et al.
Published: (2026)
by: Xu, Xiaoyu, et al.
Published: (2026)
MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models
by: Ha, Hyeonjeong, et al.
Published: (2026)
by: Ha, Hyeonjeong, et al.
Published: (2026)
Large Language Models are Learnable Planners for Long-Term Recommendation
by: Shi, Wentao, et al.
Published: (2024)
by: Shi, Wentao, et al.
Published: (2024)
Exploring Contrastive Learning for Long-Tailed Multi-Label Text Classification
by: Audibert, Alexandre, et al.
Published: (2024)
by: Audibert, Alexandre, et al.
Published: (2024)
DeferMem: Query-Time Evidence Distillation via Reinforcement Learning for Long-Term Memory QA
by: Yin, Jianing, et al.
Published: (2026)
by: Yin, Jianing, et al.
Published: (2026)
Latent Traits and Cross-Task Transfer: Deconstructing Dataset Interactions in LLM Fine-tuning
by: Krishna, Shambhavi, et al.
Published: (2025)
by: Krishna, Shambhavi, et al.
Published: (2025)
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
by: Chen, Yuhan, et al.
Published: (2024)
by: Chen, Yuhan, et al.
Published: (2024)
Long Context RAG Performance of Large Language Models
by: Leng, Quinn, et al.
Published: (2024)
by: Leng, Quinn, et al.
Published: (2024)
Once Upon an Input: Reasoning via Per-Instance Program Synthesis
by: Stein, Adam, et al.
Published: (2025)
by: Stein, Adam, et al.
Published: (2025)
Data Augmentation for Code Translation with Comparable Corpora and Multiple References
by: Xie, Yiqing, et al.
Published: (2023)
by: Xie, Yiqing, et al.
Published: (2023)
Statistical NLP for Optimization of Clinical Trial Success Prediction in Pharmaceutical R&D
by: Doane, Michael R.
Published: (2025)
by: Doane, Michael R.
Published: (2025)
Predicting Evoked Emotions in Conversations
by: Altarawneh, Enas, et al.
Published: (2023)
by: Altarawneh, Enas, et al.
Published: (2023)
Multipole Attention for Efficient Long Context Reasoning
by: Hooper, Coleman, et al.
Published: (2025)
by: Hooper, Coleman, et al.
Published: (2025)
Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering
by: Müller, Philip, et al.
Published: (2026)
by: Müller, Philip, et al.
Published: (2026)
Idea2Plan: Exploring AI-Powered Research Planning
by: Huang, Jin, et al.
Published: (2025)
by: Huang, Jin, et al.
Published: (2025)
Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web
by: Lin, Kate, et al.
Published: (2024)
by: Lin, Kate, et al.
Published: (2024)
Exploring Bias and Prediction Metrics to Characterise the Fairness of Machine Learning for Equity-Centered Public Health Decision-Making: A Narrative Review
by: Raza, Shaina, et al.
Published: (2024)
by: Raza, Shaina, et al.
Published: (2024)
LongEmbed: Extending Embedding Models for Long Context Retrieval
by: Zhu, Dawei, et al.
Published: (2024)
by: Zhu, Dawei, et al.
Published: (2024)
Long-Short Alignment for Effective Long-Context Modeling in LLMs
by: Du, Tianqi, et al.
Published: (2025)
by: Du, Tianqi, et al.
Published: (2025)
BioCoref: Benchmarking Biomedical Coreference Resolution with LLMs
by: Salem, Nourah M, et al.
Published: (2025)
by: Salem, Nourah M, et al.
Published: (2025)
Mnemosyne: An Unsupervised, Human-Inspired Long-Term Memory Architecture for Edge-Based LLMs
by: Jonelagadda, Aneesh, et al.
Published: (2025)
by: Jonelagadda, Aneesh, et al.
Published: (2025)
Double Equivariance for Inductive Link Prediction for Both New Nodes and New Relation Types
by: Zhou, Jincheng, et al.
Published: (2023)
by: Zhou, Jincheng, et al.
Published: (2023)
ComplicaCode: Enhancing Disease Complication Detection in Electronic Health Records through ICD Path Generation
by: Zhou, Xiaofan
Published: (2023)
by: Zhou, Xiaofan
Published: (2023)
LongReward: Improving Long-context Large Language Models with AI Feedback
by: Zhang, Jiajie, et al.
Published: (2024)
by: Zhang, Jiajie, et al.
Published: (2024)
Similar Items
-
Investigating Collaborative Data Practices: a Case Study on Artificial Intelligence for Healthcare Research
by: Henkin, Rafael, et al.
Published: (2023) -
Intent-Aware Schema Generation And Refinement For Literature Review Tables
by: Padmakumar, Vishakh, et al.
Published: (2025) -
Distilling Multi-Scale Knowledge for Event Temporal Relation Extraction
by: Yao, Hao-Ren, et al.
Published: (2022) -
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
by: Aakanksha, et al.
Published: (2024) -
Towards Compositionality in Concept Learning
by: Stein, Adam, et al.
Published: (2024)