Saved in:
| Main Authors: | Ramesh, Mahesh, Jayakumar, Kaousheik, Ramkumar, Aswinkumar, Thodima, Pavan, Rege, Aniket, Vlatakis-Gkaragkounis, Emmanouil-Vasileios |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.18077 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
No Coin Left Behind: Maximizing Strategic Surplus Against No-Regret Dynamics
by: Su, Yiheng, et al.
Published: (2026)
by: Su, Yiheng, et al.
Published: (2026)
MABViT -- Modified Attention Block Enhances Vision Transformers
by: Ramesh, Mahesh, et al.
Published: (2023)
by: Ramesh, Mahesh, et al.
Published: (2023)
Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics
by: Patel, Deep, et al.
Published: (2025)
by: Patel, Deep, et al.
Published: (2025)
Prudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays
by: Hu, Ting, et al.
Published: (2026)
by: Hu, Ting, et al.
Published: (2026)
Learning Safely Without Knowing the World:COMPASS-Hedge
by: Hu, Ting, et al.
Published: (2026)
by: Hu, Ting, et al.
Published: (2026)
Breaking $1/ε$ Barrier in Quantum Zero-Sum Games: Generalizing Metric Subregularity for Spectraplexes
by: Su, Yiheng, et al.
Published: (2025)
by: Su, Yiheng, et al.
Published: (2025)
Building Robust and Scalable Multilingual ASR for Indian Languages
by: Gangwar, Arjun, et al.
Published: (2025)
by: Gangwar, Arjun, et al.
Published: (2025)
Solving Zero-Sum Convex Markov Games
by: Kalogiannis, Fivos, et al.
Published: (2025)
by: Kalogiannis, Fivos, et al.
Published: (2025)
Shuffling the Data, Stretching the Step-size: Sharper Bias in constant step-size SGD
by: Emmanouilidis, Konstantinos, et al.
Published: (2026)
by: Emmanouilidis, Konstantinos, et al.
Published: (2026)
Algorithms and Complexity for Computing Nash Equilibria in Adversarial Team Games
by: Anagnostides, Ioannis, et al.
Published: (2023)
by: Anagnostides, Ioannis, et al.
Published: (2023)
Last-Iterate Convergence of Adaptive Riemannian Gradient Descent for Equilibrium Computation
by: Cai, Yang, et al.
Published: (2023)
by: Cai, Yang, et al.
Published: (2023)
Armada: Memory-Efficient Distributed Training of Large-Scale Graph Neural Networks
by: Waleffe, Roger, et al.
Published: (2025)
by: Waleffe, Roger, et al.
Published: (2025)
A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games
by: Vasconcelos, Francisca, et al.
Published: (2023)
by: Vasconcelos, Francisca, et al.
Published: (2023)
Contracting with a Learning Agent
by: Guruganesh, Guru, et al.
Published: (2024)
by: Guruganesh, Guru, et al.
Published: (2024)
LLM-Hanabi: Evaluating Multi-Agent Gameplays with Theory-of-Mind and Rationale Inference in Imperfect Information Collaboration Game
by: Liang, Fangzhou, et al.
Published: (2025)
by: Liang, Fangzhou, et al.
Published: (2025)
RiddleBench: A New Generative Reasoning Benchmark for LLMs
by: Halder, Deepon, et al.
Published: (2025)
by: Halder, Deepon, et al.
Published: (2025)
Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest
by: O'Neill, Abigail, et al.
Published: (2026)
by: O'Neill, Abigail, et al.
Published: (2026)
Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
by: He, Zicong, et al.
Published: (2025)
by: He, Zicong, et al.
Published: (2025)
Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
by: Wang, Danqing, et al.
Published: (2024)
by: Wang, Danqing, et al.
Published: (2024)
From Building Blocks to Planning: Multi-Step Spatial Reasoning in LLMs with Reinforcement Learning
by: Tahmasbi, Amir, et al.
Published: (2025)
by: Tahmasbi, Amir, et al.
Published: (2025)
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
by: Halder, Deepon, et al.
Published: (2025)
by: Halder, Deepon, et al.
Published: (2025)
Multi-Turn Puzzles: Evaluating Interactive Reasoning and Strategic Dialogue in LLMs
by: Badola, Kartikeya, et al.
Published: (2025)
by: Badola, Kartikeya, et al.
Published: (2025)
Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning
by: Wu, Jinyang, et al.
Published: (2026)
by: Wu, Jinyang, et al.
Published: (2026)
CryptoLLM: Unleashing the Power of Prompted LLMs for SmartQnA and Classification of Crypto Posts
by: Deroy, Aniket, et al.
Published: (2024)
by: Deroy, Aniket, et al.
Published: (2024)
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos
by: Goel, Arushi, et al.
Published: (2026)
by: Goel, Arushi, et al.
Published: (2026)
Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning
by: You, Zhiwen, et al.
Published: (2026)
by: You, Zhiwen, et al.
Published: (2026)
Reinforcement Learning for Hanabi
by: Cohen, Nina, et al.
Published: (2025)
by: Cohen, Nina, et al.
Published: (2025)
EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning
by: Liu, Xiaoqian, et al.
Published: (2025)
by: Liu, Xiaoqian, et al.
Published: (2025)
Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning
by: Stoisser, Josefa Lia, et al.
Published: (2025)
by: Stoisser, Josefa Lia, et al.
Published: (2025)
YouTube Comments Decoded: Leveraging LLMs for Low Resource Language Classification
by: Deroy, Aniket, et al.
Published: (2024)
by: Deroy, Aniket, et al.
Published: (2024)
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents
by: Costarelli, Anthony, et al.
Published: (2024)
by: Costarelli, Anthony, et al.
Published: (2024)
ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs
by: Tian, Yuan, et al.
Published: (2025)
by: Tian, Yuan, et al.
Published: (2025)
Enhancing Language Agent Strategic Reasoning through Self-Play in Adversarial Games
by: Zhang, Yikai, et al.
Published: (2025)
by: Zhang, Yikai, et al.
Published: (2025)
Losses that Cook: Topological Optimal Transport for Structured Recipe Generation
by: Ottoborgo, Mattia, et al.
Published: (2026)
by: Ottoborgo, Mattia, et al.
Published: (2026)
EvoSpark: Endogenous Interactive Agent Societies for Unified Long-Horizon Narrative Evolution
by: He, Shiyu, et al.
Published: (2026)
by: He, Shiyu, et al.
Published: (2026)
SparkRA: A Retrieval-Augmented Knowledge Service System Based on Spark Large Language Model
by: Wu, Dayong, et al.
Published: (2024)
by: Wu, Dayong, et al.
Published: (2024)
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems
by: Rege, Aniket, et al.
Published: (2025)
by: Rege, Aniket, et al.
Published: (2025)
CHBench: A Cognitive Hierarchy Benchmark for Evaluating Strategic Reasoning Capability of LLMs
by: Liu, Hongtao, et al.
Published: (2025)
by: Liu, Hongtao, et al.
Published: (2025)
Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation
by: Wang, Yu, et al.
Published: (2024)
by: Wang, Yu, et al.
Published: (2024)
ProvRain: Rain-Adaptive Denoising and Vehicle Detection via MobileNet-UNet and Faster R-CNN
by: Varathakumaran, Aswinkumar, et al.
Published: (2025)
by: Varathakumaran, Aswinkumar, et al.
Published: (2025)
Similar Items
-
No Coin Left Behind: Maximizing Strategic Surplus Against No-Regret Dynamics
by: Su, Yiheng, et al.
Published: (2026) -
MABViT -- Modified Attention Block Enhances Vision Transformers
by: Ramesh, Mahesh, et al.
Published: (2023) -
Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics
by: Patel, Deep, et al.
Published: (2025) -
Prudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays
by: Hu, Ting, et al.
Published: (2026) -
Learning Safely Without Knowing the World:COMPASS-Hedge
by: Hu, Ting, et al.
Published: (2026)