Saved in:
| Main Authors: | Coda-Forno, Julian, Zhao, Zhuokai, Zhang, Qiang, Tamboli, Dipesh, Li, Weiwei, Fan, Xiangjun, Zhang, Lizhu, Schulz, Eric, Tseng, Hsiao-Ping |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.00494 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TARo: Token-level Adaptive Routing for LLM Test-time Alignment
by: Rai, Arushi, et al.
Published: (2026)
by: Rai, Arushi, et al.
Published: (2026)
CogBench: a large language model walks into a psychology lab
by: Coda-Forno, Julian, et al.
Published: (2024)
by: Coda-Forno, Julian, et al.
Published: (2024)
Synthetic Sandbox for Training Machine Learning Engineering Agents
by: Zhou, Yuhang, et al.
Published: (2026)
by: Zhou, Yuhang, et al.
Published: (2026)
Meta-learning ecological priors from large language models explains human learning and decision making
by: Jagadish, Akshay K., et al.
Published: (2025)
by: Jagadish, Akshay K., et al.
Published: (2025)
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
by: Jagadish, Akshay K., et al.
Published: (2024)
by: Jagadish, Akshay K., et al.
Published: (2024)
Playing repeated games with Large Language Models
by: Akata, Elif, et al.
Published: (2023)
by: Akata, Elif, et al.
Published: (2023)
S'MoRE: Structural Mixture of Residual Experts for Parameter-Efficient LLM Fine-tuning
by: Zeng, Hanqing, et al.
Published: (2025)
by: Zeng, Hanqing, et al.
Published: (2025)
OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification
by: Zhou, Yuhang, et al.
Published: (2026)
by: Zhou, Yuhang, et al.
Published: (2026)
The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse
by: Coda-Forno, Julian, et al.
Published: (2026)
by: Coda-Forno, Julian, et al.
Published: (2026)
Inducing anxiety in large language models can induce bias
by: Coda-Forno, Julian, et al.
Published: (2023)
by: Coda-Forno, Julian, et al.
Published: (2023)
RecoWorld: Building Simulated Environments for Agentic Recommender Systems
by: Liu, Fei, et al.
Published: (2025)
by: Liu, Fei, et al.
Published: (2025)
Agentic Recommender System with Hierarchical Belief-State Memory
by: Shen, Xiang, et al.
Published: (2026)
by: Shen, Xiang, et al.
Published: (2026)
PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing
by: Tamboli, Dipesh, et al.
Published: (2026)
by: Tamboli, Dipesh, et al.
Published: (2026)
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning
by: Yu, Hao, et al.
Published: (2025)
by: Yu, Hao, et al.
Published: (2025)
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
by: Wang, Chaoqi, et al.
Published: (2025)
by: Wang, Chaoqi, et al.
Published: (2025)
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning
by: Yang, Chenghao, et al.
Published: (2025)
by: Yang, Chenghao, et al.
Published: (2025)
Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision
by: Malusare, Aditya, et al.
Published: (2023)
by: Malusare, Aditya, et al.
Published: (2023)
Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer
by: Tamboli, Dipesh, et al.
Published: (2024)
by: Tamboli, Dipesh, et al.
Published: (2024)
Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding
by: Zhou, Yuhang, et al.
Published: (2025)
by: Zhou, Yuhang, et al.
Published: (2025)
Thought Communication in Multiagent Collaboration
by: Zheng, Yujia, et al.
Published: (2025)
by: Zheng, Yujia, et al.
Published: (2025)
LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems
by: Zhou, Yuhang, et al.
Published: (2026)
by: Zhou, Yuhang, et al.
Published: (2026)
BalancedDPO: Adaptive Multi-Metric Alignment
by: Tamboli, Dipesh, et al.
Published: (2025)
by: Tamboli, Dipesh, et al.
Published: (2025)
EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization
by: Han, Kevin, et al.
Published: (2026)
by: Han, Kevin, et al.
Published: (2026)
GEM: Empowering LLM for both Embedding Generation and Language Understanding
by: Zhang, Caojin, et al.
Published: (2025)
by: Zhang, Caojin, et al.
Published: (2025)
Are small business owners entrepreneurs? Exploring small business manager behavior profiles in the São Paulo Metropolitan region
by: Roberto Coda
Published: (2018)
by: Roberto Coda
Published: (2018)
StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding
by: Yang, Yanlai, et al.
Published: (2025)
by: Yang, Yanlai, et al.
Published: (2025)
Token-Level LLM Collaboration via FusionRoute
by: Xiong, Nuoya, et al.
Published: (2026)
by: Xiong, Nuoya, et al.
Published: (2026)
Quantifying Generalization Complexity for Large Language Models
by: Qi, Zhenting, et al.
Published: (2024)
by: Qi, Zhenting, et al.
Published: (2024)
Self-Evolving Multi-Agent Systems via Decentralized Memory
by: Hao, Guangya, et al.
Published: (2026)
by: Hao, Guangya, et al.
Published: (2026)
RELAX: Reinforcement Learning Enabled 2D-LiDAR Autonomous System for Parsimonious UAVs
by: Wu, Guanlin, et al.
Published: (2023)
by: Wu, Guanlin, et al.
Published: (2023)
Contaminación ambiental y actores sociales en Bolivia: Un balance de la situación
by: Eduardo Forno
Published: (2010)
by: Eduardo Forno
Published: (2010)
ENTRE EL EDIFICIO Y EL CURRÍCULUM DE LA INTERCULTURALIDAD: UNA MIRADA ANTROPOLÓGICA A LA EDUCACIÓN ACTUAL EN TERRITORIO MAPUCHE-HUILLICHE
by: Amilcar Forno
Published: (2009)
by: Amilcar Forno
Published: (2009)
Bolivia: Cambio climático, pobreza y adaptación. La Paz: Oxfam Internacional, 67 pp.
by: Eduardo Forno
Published: (2010)
by: Eduardo Forno
Published: (2010)
Editorial
by: Roberto Coda
Published: (2008)
by: Roberto Coda
Published: (2008)
Desempenho Estratégico do Departamento de Gestão de Recursos Humanos: uma Pesquisa Exploratória Acerca das Implicações dos Estilos Comportamentais de seus Profissionais
by: Roberto Coda
Published: (2014)
by: Roberto Coda
Published: (2014)
Editorial
by: Roberto Coda
Published: (2006)
by: Roberto Coda
Published: (2006)
Similar Items
-
TARo: Token-level Adaptive Routing for LLM Test-time Alignment
by: Rai, Arushi, et al.
Published: (2026) -
CogBench: a large language model walks into a psychology lab
by: Coda-Forno, Julian, et al.
Published: (2024) -
Synthetic Sandbox for Training Machine Learning Engineering Agents
by: Zhou, Yuhang, et al.
Published: (2026) -
Meta-learning ecological priors from large language models explains human learning and decision making
by: Jagadish, Akshay K., et al.
Published: (2025) -
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
by: Jagadish, Akshay K., et al.
Published: (2024)