Saved in:
| Main Authors: | Kaiser, Lukasz, Babaeizadeh, Mohammad, Milos, Piotr, Osinski, Blazej, Campbell, Roy H, Czechowski, Konrad, Erhan, Dumitru, Finn, Chelsea, Kozakowski, Piotr, Levine, Sergey, Mohiuddin, Afroz, Sepassi, Ryan, Tucker, George, Michalewski, Henryk |
|---|---|
| Format: | Preprint |
| Published: |
2019
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1903.00374 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Off-Policy Correction For Multi-Agent Reinforcement Learning
by: Zawalski, Michał, et al.
Published: (2021)
by: Zawalski, Michał, et al.
Published: (2021)
Simulation-based reinforcement learning for real-world autonomous driving
by: Osiński, Błażej, et al.
Published: (2019)
by: Osiński, Błażej, et al.
Published: (2019)
Structured Packing in LLM Training Improves Long Context Utilization
by: Staniszewski, Konrad, et al.
Published: (2023)
by: Staniszewski, Konrad, et al.
Published: (2023)
tsGT: Stochastic Time Series Modeling With Transformer
by: Kuciński, Łukasz, et al.
Published: (2024)
by: Kuciński, Łukasz, et al.
Published: (2024)
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
by: Zawalski, Michał, et al.
Published: (2022)
by: Zawalski, Michał, et al.
Published: (2022)
Subgoal Search For Complex Reasoning Tasks
by: Czechowski, Konrad, et al.
Published: (2021)
by: Czechowski, Konrad, et al.
Published: (2021)
Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models
by: Mouselinos, Spyridon, et al.
Published: (2024)
by: Mouselinos, Spyridon, et al.
Published: (2024)
What factors affect the ‘flocking’ of birdwatchers during bird rarity observations?
by: Piotr Tryjanowski, et al.
Published: (2024)
by: Piotr Tryjanowski, et al.
Published: (2024)
Connections between certain numbers related to derangements and $r$-permutations
by: Miska, Piotr, et al.
Published: (2024)
by: Miska, Piotr, et al.
Published: (2024)
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication
by: Kuciński, Łukasz, et al.
Published: (2021)
by: Kuciński, Łukasz, et al.
Published: (2021)
Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
by: Piotrowski, Bartosz, et al.
Published: (2025)
by: Piotrowski, Bartosz, et al.
Published: (2025)
Polygenic Discordance in Sibling Pairs: The Paradoxical Cost of Being the "Healthy" Child in a High-ADHD-Risk Family (Study Protocol)
by: Lewandowski, Łukasz Piotr
Published: (2026)
by: Lewandowski, Łukasz Piotr
Published: (2026)
Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
by: Wagenmaker, Andrew, et al.
Published: (2025)
by: Wagenmaker, Andrew, et al.
Published: (2025)
The Bonebridge Active Bone Conduction Hearing Implant: Safety, Effectiveness and Outcomes Based on 355 Patients
by: Piotr Henryk Skarzynski, et al.
Published: (2025)
by: Piotr Henryk Skarzynski, et al.
Published: (2025)
Renormalization in Lorenz maps -- completely invariant sets and periodic orbits
by: Cholewa, Łukasz, et al.
Published: (2021)
by: Cholewa, Łukasz, et al.
Published: (2021)
Objectivity of classical quantum stochastic processes
by: Szańkowski, Piotr, et al.
Published: (2023)
by: Szańkowski, Piotr, et al.
Published: (2023)
The influx of refugees from Ukraine and changes in Polish educational policy: A case study of the National External Examinations System
by: Piotr Załęski, et al.
Published: (2025)
by: Piotr Załęski, et al.
Published: (2025)
Grounding Data Science Code Generation with Input-Output Specifications
by: Wen, Yeming, et al.
Published: (2024)
by: Wen, Yeming, et al.
Published: (2024)
Tinnitus: The Phantom Sound (Part V) Psychological Aspects – volume 1
by: Holdefer, Lisiane, et al.
Published: (2025)
by: Holdefer, Lisiane, et al.
Published: (2025)
RoboReward: General-Purpose Vision-Language Reward Models for Robotics
by: Lee, Tony, et al.
Published: (2026)
by: Lee, Tony, et al.
Published: (2026)
Robotic Control via Embodied Chain-of-Thought Reasoning
by: Zawalski, Michał, et al.
Published: (2024)
by: Zawalski, Michał, et al.
Published: (2024)
Stability of solutions of the porous medium equation with growth with respect to the diffusion exponent
by: Dębiec, Tomasz, et al.
Published: (2024)
by: Dębiec, Tomasz, et al.
Published: (2024)
Seguimento ambulatorial de um grupo de prematuros e a prevalência do aleitamento na alta hospitalar e ao sexto mês de vida: contribuições da Fonoaudiologia
by: Aliana Eduarda Czechowski
Published: (2010)
by: Aliana Eduarda Czechowski
Published: (2010)
Trustworthy Retrosynthesis: Eliminating Hallucinations with a Diverse Ensemble of Reaction Scorers
by: Sadowski, Michal, et al.
Published: (2025)
by: Sadowski, Michal, et al.
Published: (2025)
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
by: Chen, Annie S., et al.
Published: (2023)
by: Chen, Annie S., et al.
Published: (2023)
Benchmarking of Different YOLO Models for CAPTCHAs Detection and Classification
by: Wysocki, Mikołaj, et al.
Published: (2025)
by: Wysocki, Mikołaj, et al.
Published: (2025)
A framework for distributed discrete evacuation strategies
by: Borowiecki, Piotr, et al.
Published: (2025)
by: Borowiecki, Piotr, et al.
Published: (2025)
Data integration of non-probability and probability samples with predictive mean matching
by: Chlebicki, Piotr, et al.
Published: (2024)
by: Chlebicki, Piotr, et al.
Published: (2024)
nonprobsvy -- An R package for modern methods for non-probability surveys
by: Chrostowski, Łukasz, et al.
Published: (2025)
by: Chrostowski, Łukasz, et al.
Published: (2025)
Subject Access Points in the MARC Record and Archival Finding Aid: Enough or Too Many?
by: Cox, Elizabeth, et al.
Published: (2007)
by: Cox, Elizabeth, et al.
Published: (2007)
HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
by: Delfosse, Quentin, et al.
Published: (2024)
by: Delfosse, Quentin, et al.
Published: (2024)
The Influence of MoS2 Thickness on the Efficiency of Solar Energy Conversion in TiO2/MoS2/P3HT Cells
by: Kamila Kollbek, et al.
Published: (2024)
by: Kamila Kollbek, et al.
Published: (2024)
Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models
by: Chen, Annie S., et al.
Published: (2024)
by: Chen, Annie S., et al.
Published: (2024)
Colorization of Optically Transparent Surfactants to Track Their Movement in Biphasic Systems Used for Differentiation of Nanomaterials
by: Podlesny, Blazej, et al.
Published: (2025)
by: Podlesny, Blazej, et al.
Published: (2025)
Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery
by: Olko, Mateusz, et al.
Published: (2025)
by: Olko, Mateusz, et al.
Published: (2025)
Strategic Cost Selection in Participatory Budgeting
by: Faliszewski, Piotr, et al.
Published: (2024)
by: Faliszewski, Piotr, et al.
Published: (2024)
Micro‐ and nanoplastics in the human genitourinary system: oncological impact – a systematic review
by: Nicole Akpang, et al.
Published: (2026)
by: Nicole Akpang, et al.
Published: (2026)
Operator splitting algorithm for structured population models on metric spaces
by: Lindow, Carolin, et al.
Published: (2025)
by: Lindow, Carolin, et al.
Published: (2025)
Diastatotropis petulae Tryzna, Blazej & Rakotonirina 2024, sp. nov.
by: Trýzna, Miloš, et al.
Published: (2024)
by: Trýzna, Miloš, et al.
Published: (2024)
Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation
by: Yang, Jonathan, et al.
Published: (2024)
by: Yang, Jonathan, et al.
Published: (2024)
Similar Items
-
Off-Policy Correction For Multi-Agent Reinforcement Learning
by: Zawalski, Michał, et al.
Published: (2021) -
Simulation-based reinforcement learning for real-world autonomous driving
by: Osiński, Błażej, et al.
Published: (2019) -
Structured Packing in LLM Training Improves Long Context Utilization
by: Staniszewski, Konrad, et al.
Published: (2023) -
tsGT: Stochastic Time Series Modeling With Transformer
by: Kuciński, Łukasz, et al.
Published: (2024) -
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
by: Zawalski, Michał, et al.
Published: (2022)