Saved in:
| Main Authors: | Chitsaz, Kamran, Balaji, Roshan, Fournier, Quentin, Bhatt, Nirav Pravinbhai, Chandar, Sarath |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.13408 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Exploring Quantization for Efficient Pre-Training of Transformer Language Models
by: Chitsaz, Kamran, et al.
Published: (2024)
by: Chitsaz, Kamran, et al.
Published: (2024)
Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction
by: Balaji, Roshan, et al.
Published: (2025)
by: Balaji, Roshan, et al.
Published: (2025)
CoPeP: Benchmarking Continual Pretraining for Protein Language Models
by: Patil, Darshan, et al.
Published: (2026)
by: Patil, Darshan, et al.
Published: (2026)
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2025)
by: Burnwal, Returaj, et al.
Published: (2025)
OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)
by: Burnwal, Returaj, et al.
Published: (2026)
BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives
by: Sinha, Aarush, et al.
Published: (2025)
by: Sinha, Aarush, et al.
Published: (2025)
Manifold Metric: A Loss Landscape Approach for Predicting Model Performance
by: Malviya, Pranshu, et al.
Published: (2024)
by: Malviya, Pranshu, et al.
Published: (2024)
Learning from Observation: A Survey of Recent Advances
by: Burnwal, Returaj, et al.
Published: (2025)
by: Burnwal, Returaj, et al.
Published: (2025)
The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
by: Aghajohari, Milad, et al.
Published: (2025)
by: Aghajohari, Milad, et al.
Published: (2025)
Faithfulness Measurable Masked Language Models
by: Madsen, Andreas, et al.
Published: (2023)
by: Madsen, Andreas, et al.
Published: (2023)
Are self-explanations from Large Language Models faithful?
by: Madsen, Andreas, et al.
Published: (2024)
by: Madsen, Andreas, et al.
Published: (2024)
MetaMolGen: A Neural Graph Motif Generation Model for De Novo Molecular Design
by: Yan, Zimo, et al.
Published: (2025)
by: Yan, Zimo, et al.
Published: (2025)
Small Encoders Can Rival Large Decoders in Detecting Groundedness
by: Abbes, Istabrak, et al.
Published: (2025)
by: Abbes, Istabrak, et al.
Published: (2025)
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
by: Huang, Jerry, et al.
Published: (2024)
by: Huang, Jerry, et al.
Published: (2024)
Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
by: Prato, Gabriele, et al.
Published: (2025)
by: Prato, Gabriele, et al.
Published: (2025)
BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
by: Zholus, Artem, et al.
Published: (2024)
by: Zholus, Artem, et al.
Published: (2024)
Uni-Mol2: Exploring Molecular Pretraining Model at Scale
by: Ji, Xiaohong, et al.
Published: (2024)
by: Ji, Xiaohong, et al.
Published: (2024)
Mastering Memory Tasks with World Models
by: Samsami, Mohammad Reza, et al.
Published: (2024)
by: Samsami, Mohammad Reza, et al.
Published: (2024)
Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)
by: Bayat, Reza, et al.
Published: (2025)
Do Large Language Models Know How Much They Know?
by: Prato, Gabriele, et al.
Published: (2025)
by: Prato, Gabriele, et al.
Published: (2025)
Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models
by: Nilaksh, et al.
Published: (2026)
by: Nilaksh, et al.
Published: (2026)
Sub-goal Distillation: A Method to Improve Small Language Agents
by: Hashemzadeh, Maryam, et al.
Published: (2024)
by: Hashemzadeh, Maryam, et al.
Published: (2024)
The Expressive Limits of Diagonal SSMs for State-Tracking
by: Shakerinava, Mehran, et al.
Published: (2026)
by: Shakerinava, Mehran, et al.
Published: (2026)
CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design
by: Govindarajan, Prashant, et al.
Published: (2025)
by: Govindarajan, Prashant, et al.
Published: (2025)
MolRGen: A Training and Evaluation Setting for De Novo Molecular Generation with Reasonning Models
by: Formont, Philippe, et al.
Published: (2026)
by: Formont, Philippe, et al.
Published: (2026)
Neural Coherence : Find higher performance to out-of-distribution tasks from few samples
by: Guiroy, Simon, et al.
Published: (2025)
by: Guiroy, Simon, et al.
Published: (2025)
Intelligent Switching for Reset-Free RL
by: Patil, Darshan, et al.
Published: (2024)
by: Patil, Darshan, et al.
Published: (2024)
Lookbehind-SAM: k steps back, 1 step forward
by: Mordido, Gonçalo, et al.
Published: (2023)
by: Mordido, Gonçalo, et al.
Published: (2023)
Towards Practical Tool Usage for Continually Learning LLMs
by: Huang, Jerry, et al.
Published: (2024)
by: Huang, Jerry, et al.
Published: (2024)
Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning
by: Nilaksh, et al.
Published: (2026)
by: Nilaksh, et al.
Published: (2026)
Too Big to Fool: Resisting Deception in Language Models
by: Samsami, Mohammad Reza, et al.
Published: (2024)
by: Samsami, Mohammad Reza, et al.
Published: (2024)
MolPLA: A Molecular Pretraining Framework for Learning Cores, R-Groups and their Linker Joints
by: Gim, Mogan, et al.
Published: (2024)
by: Gim, Mogan, et al.
Published: (2024)
Interpretability Needs a New Paradigm
by: Madsen, Andreas, et al.
Published: (2024)
by: Madsen, Andreas, et al.
Published: (2024)
Why Don't Prompt-Based Fairness Metrics Correlate?
by: Zayed, Abdelrahman, et al.
Published: (2024)
by: Zayed, Abdelrahman, et al.
Published: (2024)
Should We Attend More or Less? Modulating Attention for Fairness
by: Zayed, Abdelrahman, et al.
Published: (2023)
by: Zayed, Abdelrahman, et al.
Published: (2023)
Dialectics of Alignment: Harnessing Unsafe Knowledge for Dynamic Safety Routing
by: Hashemzadeh, Maryam, et al.
Published: (2026)
by: Hashemzadeh, Maryam, et al.
Published: (2026)
GRPO-$λ$: Credit Assignment improves LLM Reasoning
by: Parthasarathi, Prasanna, et al.
Published: (2025)
by: Parthasarathi, Prasanna, et al.
Published: (2025)
MolTC: Towards Molecular Relational Modeling In Language Models
by: Fang, Junfeng, et al.
Published: (2024)
by: Fang, Junfeng, et al.
Published: (2024)
Investigating the Multilingual Calibration Effects of Language Model Instruction-Tuning
by: Huang, Jerry, et al.
Published: (2026)
by: Huang, Jerry, et al.
Published: (2026)
CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning
by: Govindarajan, Prashant, et al.
Published: (2025)
by: Govindarajan, Prashant, et al.
Published: (2025)
Similar Items
-
Exploring Quantization for Efficient Pre-Training of Transformer Language Models
by: Chitsaz, Kamran, et al.
Published: (2024) -
Functional Groups are All you Need for Chemically Interpretable Molecular Property Prediction
by: Balaji, Roshan, et al.
Published: (2025) -
CoPeP: Benchmarking Continual Pretraining for Protein Language Models
by: Patil, Darshan, et al.
Published: (2026) -
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2025) -
OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)