| Main authors: | Sheshanarayana, Disha; Pal, Rajat Subhra; Sinha, Manjira; Dasgupta, Tirthankar |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2605.11603 |
Similar items
Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs
by: Sheshanarayana, Disha, et al.
Published: (2026)
Decoding Plastic Toxicity: An Intelligent Framework for Conflict-Aware Relational Metapath Extraction from Scientific Abstracts
by: Jana, Sudeshna, et al.
Published: (2025)
StayLTC: A Cost-Effective Multimodal Framework for Hospital Length of Stay Forecasting
by: Jana, Sudeshna, et al.
Published: (2025)
ProofSketch: Efficient Verified Reasoning for Large Language Models
by: Sheshanarayana, Disha, et al.
Published: (2025)
CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues
by: Sheshanarayana, Disha, et al.
Published: (2025)
HeCiX: Integrating Knowledge Graphs and Large Language Models for Biomedical Research
by: Kulkarni, Prerana Sanjay, et al.
Published: (2024)
Mapping Patient Trajectories: Understanding and Visualizing Sepsis Prognostic Pathways from Patients Clinical Narratives
by: Jana, Sudeshna, et al.
Published: (2024)
Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking
by: Singha, Disha
Published: (2026)
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
by: Hazra, Somnath, et al.
Published: (2025)
ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference
by: Ma, Siyuan, et al.
Published: (2026)
Token-Budget-Aware Pool Routing for Cost-Efficient LLM Inference
by: Chen, Huamin, et al.
Published: (2026)
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
by: Wang, Ruida, et al.
Published: (2025)
Utilizing Lyapunov Exponents in designing deep neural networks
by: Mittra, Tirthankar
Published: (2024)
SeqRoute: Global Budget-Aware Sequential LLM Routing via Offline Reinforcement Learning
by: Xu, Zhongling, et al.
Published: (2026)
CR^2: Cost-Aware Risk-Controlled Routing for Wireless Device-Edge LLM Inference
by: Xue, Nan, et al.
Published: (2026)
Dynamic Quality-Latency Aware Routing for LLM Inference in Wireless Edge-Device Networks
by: Bao, Rui, et al.
Published: (2025)
PROTEUS: SLA-Aware Routing via Lagrangian RL for Multi-LLM Serving Systems
by: Bhatti, Amit Singh, et al.
Published: (2026)
Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents
by: Zhe, Tao, et al.
Published: (2025)
Outcome-Aware Tool Selection for Semantic Routers: Latency-Constrained Learning Without LLM Inference
by: Chen, Huamin, et al.
Published: (2026)
Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization
by: Wang, Xudong, et al.
Published: (2026)
Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learning
by: Mittra, Tirthankar
Published: (2024)
Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization
by: Wang, Mingyi, et al.
Published: (2026)
Efficient Environmental Claim Detection with Hyperbolic Graph Neural Networks
by: Aswal, Darpan, et al.
Published: (2025)
CR-Bench: Evaluating the Real-World Utility of AI Code Review Agents
by: Pereira, Kristen, et al.
Published: (2026)
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
by: Lu, Yuxiao, et al.
Published: (2023)
Neural Breadcrumbs: Membership Inference Attacks on LLMs Through Hidden State and Attention Pattern Analysis
by: Makhija, Disha, et al.
Published: (2025)
Resilient Routing: Risk-Aware Dynamic Routing in Smart Logistics via Spatiotemporal Graph Learning
by: Xue, Zhiming, et al.
Published: (2026)
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization
by: Kancheti, Sai Srinivas, et al.
Published: (2026)
Carbon Intensity-Aware Adaptive Inference of DNNs
by: Jung, Jiwan
Published: (2024)
CoGS: Model Agnostic Causality Constrained Counterfactual Explanations using goal-directed ASP
by: Dasgupta, Sopam, et al.
Published: (2024)
Network-Constrained Policy Optimization for Adaptive Multi-agent Vehicle Routing
by: Arasteh, Fazel, et al.
Published: (2025)
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
by: Lu, Guanxi, et al.
Published: (2025)
NeSy-Route: A Neuro-Symbolic Benchmark for Constrained Route Planning in Remote Sensing
by: Yang, Ming, et al.
Published: (2026)
Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm
by: Khanda, Rajat
Published: (2024)
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
by: Ding, Dujian, et al.
Published: (2024)
Energy-Efficient Wireless LLM Inference via Uncertainty and Importance-Aware Speculative Decoding
by: Park, Jihoon, et al.
Published: (2025)
Trust-Aware Routing for Distributed Generative AI Inference at the Edge
by: Nguyen, Chanh, et al.
Published: (2026)
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
by: Sikeridis, Dimitrios, et al.
Published: (2024)
Design and Evaluation of Cost-Aware PoQ for Decentralized LLM Inference
by: Tian, Arther, et al.
Published: (2025)
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
by: Yu, Zishun, et al.
Published: (2025)