:: Library Catalog

Bibliographic Details
Main authors:	Dai, Hui; Pechi, Dan; Yang, Xinyi; Banga, Garvit; Mantri, Raghav
Type:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning
Online access:	https://arxiv.org/abs/2411.19360

Similar Items

RoboSignature: Robust Signature and Watermarking on Network Attacks
by: Shaan, Aryaman, et al.
Published: (2024)

Needle in the Haystack for Memory Based Large Language Models
by: Nelson, Elliot, et al.
Published: (2024)

When Every Token Counts: Optimal Segmentation for Low-Resource Language Models
by: Raj, Bharath, et al.
Published: (2024)

Hidden in the Haystack: Smaller Needles are More Difficult for LLMs to Find
by: Bianchi, Owen, et al.
Published: (2025)

From Haystack to Needle: Label Space Reduction for Zero-shot Classification
by: Vandemoortele, Nathan, et al.
Published: (2025)

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
by: Wang, Hengyi, et al.
Published: (2024)

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
by: Kuratov, Yuri, et al.
Published: (2024)

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
by: Xiong, Zheyang, et al.
Published: (2024)

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring
by: Aksoy, Sinan G., et al.
Published: (2026)

Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack
by: Xu, Xiaoyue, et al.
Published: (2024)

Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark
by: Huybrechts, Goeric, et al.
Published: (2025)

Jailbreaking in the Haystack
by: Shah, Rishi Rajesh, et al.
Published: (2025)

Reasoning on Multiple Needles In A Haystack
by: Wang, Yidong
Published: (2025)

Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models
by: Hengle, Amey, et al.
Published: (2024)

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)

Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
by: Wang, Xinyi, et al.
Published: (2024)

100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
by: Yang, Wang, et al.
Published: (2025)

An Empirical Study on Context Length for Open-Domain Dialog Generation
by: Shen, Xinyi, et al.
Published: (2024)

Aligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecasting
by: Dai, Hui, et al.
Published: (2026)

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning
by: Kim, Bosung, et al.
Published: (2025)

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning
by: Wang, Xinyi, et al.
Published: (2023)

Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
by: Sharma, Aditya, et al.
Published: (2024)

RLHF: A comprehensive Survey for Cultural, Multimodal and Low Latency Alignment Methods
by: Sharma, Raghav, et al.
Published: (2025)

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization
by: Ji, Kaixuan, et al.
Published: (2024)

$π^2$: Structure-Originated Reasoning Data Improves Long-Context Reasoning Ability of Large Language Models
by: Do, Quyet V., et al.
Published: (2026)

Safety Subspaces are Not Linearly Distinct: A Fine-Tuning Case Study
by: Ponkshe, Kaustubh, et al.
Published: (2025)

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
by: Jin, Hongye, et al.
Published: (2024)

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?
by: Tang, Zhenheng, et al.
Published: (2025)

PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents
by: Gu, Zhuohan, et al.
Published: (2026)

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
by: Yu, Hongli, et al.
Published: (2025)

On the Ability of Transformers to Verify Plans
by: Sarrof, Yash, et al.
Published: (2026)

AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
by: Yang, Siwei, et al.
Published: (2024)

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning
by: Wang, Yaxuan, et al.
Published: (2025)

TTKV: Temporal-Tiered KV Cache for Long-Context LLM Inference
by: Dzikanyanga, Gradwell, et al.
Published: (2026)

ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models
by: Singhal, Raghav, et al.
Published: (2025)

Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward
by: Chavan, Arnav, et al.
Published: (2024)

How Well Can a Long Sequence Model Model Long Sequences? Comparing Architechtural Inductive Biases on Long-Context Abilities
by: Huang, Jerry
Published: (2024)

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
by: Xiao, Chaojun, et al.
Published: (2024)

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
by: Fu, Qichen, et al.
Published: (2024)

Emergent Abilities in Large Language Models: A Survey
by: Berti, Leonardo, et al.
Published: (2025)