:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kaiser, Lukasz, Babaeizadeh, Mohammad, Milos, Piotr, Osinski, Blazej, Campbell, Roy H, Czechowski, Konrad, Erhan, Dumitru, Finn, Chelsea, Kozakowski, Piotr, Levine, Sergey, Mohiuddin, Afroz, Sepassi, Ryan, Tucker, George, Michalewski, Henryk
Format:	Preprint
Published:	2019
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/1903.00374
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Off-Policy Correction For Multi-Agent Reinforcement Learning
by: Zawalski, Michał, et al.
Published: (2021)

Simulation-based reinforcement learning for real-world autonomous driving
by: Osiński, Błażej, et al.
Published: (2019)

Structured Packing in LLM Training Improves Long Context Utilization
by: Staniszewski, Konrad, et al.
Published: (2023)

tsGT: Stochastic Time Series Modeling With Transformer
by: Kuciński, Łukasz, et al.
Published: (2024)

Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
by: Zawalski, Michał, et al.
Published: (2022)

Subgoal Search For Complex Reasoning Tasks
by: Czechowski, Konrad, et al.
Published: (2021)

Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models
by: Mouselinos, Spyridon, et al.
Published: (2024)

What factors affect the ‘flocking’ of birdwatchers during bird rarity observations?
by: Piotr Tryjanowski, et al.
Published: (2024)

Connections between certain numbers related to derangements and $r$-permutations
by: Miska, Piotr, et al.
Published: (2024)

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication
by: Kuciński, Łukasz, et al.
Published: (2021)

Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
by: Piotrowski, Bartosz, et al.
Published: (2025)

Polygenic Discordance in Sibling Pairs: The Paradoxical Cost of Being the "Healthy" Child in a High-ADHD-Risk Family (Study Protocol)
by: Lewandowski, Łukasz Piotr
Published: (2026)

Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
by: Wagenmaker, Andrew, et al.
Published: (2025)

The Bonebridge Active Bone Conduction Hearing Implant: Safety, Effectiveness and Outcomes Based on 355 Patients
by: Piotr Henryk Skarzynski, et al.
Published: (2025)

Renormalization in Lorenz maps -- completely invariant sets and periodic orbits
by: Cholewa, Łukasz, et al.
Published: (2021)

Objectivity of classical quantum stochastic processes
by: Szańkowski, Piotr, et al.
Published: (2023)

The influx of refugees from Ukraine and changes in Polish educational policy: A case study of the National External Examinations System
by: Piotr Załęski, et al.
Published: (2025)

Grounding Data Science Code Generation with Input-Output Specifications
by: Wen, Yeming, et al.
Published: (2024)

Tinnitus: The Phantom Sound (Part V) Psychological Aspects – volume 1
by: Holdefer, Lisiane, et al.
Published: (2025)

RoboReward: General-Purpose Vision-Language Reward Models for Robotics
by: Lee, Tony, et al.
Published: (2026)

Robotic Control via Embodied Chain-of-Thought Reasoning
by: Zawalski, Michał, et al.
Published: (2024)

Stability of solutions of the porous medium equation with growth with respect to the diffusion exponent
by: Dębiec, Tomasz, et al.
Published: (2024)

Seguimento ambulatorial de um grupo de prematuros e a prevalência do aleitamento na alta hospitalar e ao sexto mês de vida: contribuições da Fonoaudiologia
by: Aliana Eduarda Czechowski
Published: (2010)

Trustworthy Retrosynthesis: Eliminating Hallucinations with a Diverse Ensemble of Reaction Scorers
by: Sadowski, Michal, et al.
Published: (2025)

Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
by: Chen, Annie S., et al.
Published: (2023)

Benchmarking of Different YOLO Models for CAPTCHAs Detection and Classification
by: Wysocki, Mikołaj, et al.
Published: (2025)

A framework for distributed discrete evacuation strategies
by: Borowiecki, Piotr, et al.
Published: (2025)

Data integration of non-probability and probability samples with predictive mean matching
by: Chlebicki, Piotr, et al.
Published: (2024)

nonprobsvy -- An R package for modern methods for non-probability surveys
by: Chrostowski, Łukasz, et al.
Published: (2025)

Subject Access Points in the MARC Record and Archival Finding Aid: Enough or Too Many?
by: Cox, Elizabeth, et al.
Published: (2007)

HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
by: Delfosse, Quentin, et al.
Published: (2024)

The Influence of MoS2 Thickness on the Efficiency of Solar Energy Conversion in TiO2/MoS2/P3HT Cells
by: Kamila Kollbek, et al.
Published: (2024)

Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models
by: Chen, Annie S., et al.
Published: (2024)

Colorization of Optically Transparent Surfactants to Track Their Movement in Biphasic Systems Used for Differentiation of Nanomaterials
by: Podlesny, Blazej, et al.
Published: (2025)

Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery
by: Olko, Mateusz, et al.
Published: (2025)

Strategic Cost Selection in Participatory Budgeting
by: Faliszewski, Piotr, et al.
Published: (2024)

Micro‐ and nanoplastics in the human genitourinary system: oncological impact – a systematic review
by: Nicole Akpang, et al.
Published: (2026)

Operator splitting algorithm for structured population models on metric spaces
by: Lindow, Carolin, et al.
Published: (2025)

Diastatotropis petulae Tryzna, Blazej & Rakotonirina 2024, sp. nov.
by: Trýzna, Miloš, et al.
Published: (2024)

Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation
by: Yang, Jonathan, et al.
Published: (2024)