| Main Authors: | Franzmeyer, Tim, Shtedritski, Aleksandar, Albanie, Samuel, Torr, Philip, Henriques, João F., Foerster, Jakob N. |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.03428 |
Similar Items
Select to Perfect: Imitating desired behavior from large multi-agent data
by: Franzmeyer, Tim, et al.
Published: (2024)
Dataset de recetas de HelloFresh España
by: Arenas Villanueva, Javier, et al.
Published: (2026)
TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs
by: Nuti, Felipe, et al.
Published: (2025)
Illusory Attacks: Information-Theoretic Detectability Matters in Adversarial Attacks
by: Franzmeyer, Tim, et al.
Published: (2022)
What an Elegant Bridge: Multilingual LLMs are Biased Similarly in Different Languages
by: Mihaylov, Viktor, et al.
Published: (2024)
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection
by: Nasvytis, Linas, et al.
Published: (2024)
SHIC: Shape-Image Correspondences with no Keypoint Supervision
by: Shtedritski, Aleksandar, et al.
Published: (2024)
SynCity: Training-Free Generation of 3D Worlds
by: Engstler, Paul, et al.
Published: (2025)
Select2Plan: Training-Free ICL-Based Planning through VQA and Memory Retrieval
by: Buoso, Davide, et al.
Published: (2024)
A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval
by: Oncescu, Andreea-Maria, et al.
Published: (2024)
High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
by: Franzmeyer, Tim, et al.
Published: (2025)
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
by: Prabhu, Ameya, et al.
Published: (2024)
AI & Human Co-Improvement for Safer Co-Superintelligence
by: Weston, Jason, et al.
Published: (2025)
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
by: Cook, Jonathan, et al.
Published: (2024)
JaxUED: A simple and useable UED library in Jax
by: Coward, Samuel, et al.
Published: (2024)
'Hello, World!': Making GNNs Talk with LLMs
by: Kim, Sunwoo, et al.
Published: (2025)
Reinforcement Learning for Quantum Control under Physical Constraints
by: Ernst, Jan Ole, et al.
Published: (2025)
Hello, world!
by: Chiang, Tai-Wei
Published: (2025)
Hello world
by: Qu, Chenyang
Published: (2026)
Hello, Dolly
Published: (1997)
Hello 2
Hello 4
How Long Is a Piece of String? A Brief Empirical Analysis of Tokenizers
by: Roberts, Jonathan, et al.
Published: (2026)
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?
by: Roberts, Jonathan, et al.
Published: (2024)
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
by: Roberts, Jonathan, et al.
Published: (2024)
DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems
by: Röpke, Willem, et al.
Published: (2026)
Hello Friends, Cantemos
by: Carlos Poveda, Juan
Published: (2024)
India. Hello, word
Published: (1995)
GAMEBoT: Transparent Assessment of LLM Reasoning in Games
by: Lin, Wenye, et al.
Published: (2024)
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
by: Udandarao, Vishaal, et al.
Published: (2024)
AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement
by: Rosser, J, et al.
Published: (2025)
Dynamic Classifier-Free Diffusion Guidance via Online Feedback
by: Papalampidi, Pinelopi, et al.
Published: (2025)
Princ-wiki-a Mathematica: Wikipedia editing and mathematics
by: Eppstein, David, et al.
Published: (2024)
Evolving Many Worlds: Towards Open-Ended Discovery in Petri Dish NCA via Population-Based Training
by: Berdica, Uljad, et al.
Published: (2026)
Prompting a Pretrained Transformer Can Be a Universal Approximator
by: Petrov, Aleksandar, et al.
Published: (2024)
When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations
by: Petrov, Aleksandar, et al.
Published: (2023)
Learning Camera Movement Control from Real-World Drone Videos
by: Hou, Yunzhong, et al.
Published: (2024)
Hello IM, Goodbye TTY
by: Bell, Lori, et al.
Published: (2006)
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
by: Roberts, Jonathan, et al.
Published: (2024)
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue
by: Li, Hao, et al.
Published: (2024)