Saved in:
| Main Authors: | Pietruszka, Michał, Borchmann, Łukasz, Jędrosz, Aleksander, Morawiecki, Paweł |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.23331 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Language Models Model Language
by: Borchmann, Łukasz
Published: (2025)
by: Borchmann, Łukasz
Published: (2025)
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
by: Turski, Michał, et al.
Published: (2025)
by: Turski, Michał, et al.
Published: (2025)
Notes on Applicability of GPT-4 to Document Understanding
by: Borchmann, Łukasz
Published: (2024)
by: Borchmann, Łukasz
Published: (2024)
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
by: Borchmann, Łukasz, et al.
Published: (2024)
by: Borchmann, Łukasz, et al.
Published: (2024)
In Case You Missed It: ARC 'Challenge' Is Not That Challenging
by: Borchmann, Łukasz
Published: (2024)
by: Borchmann, Łukasz
Published: (2024)
Tackling prediction tasks in relational databases with LLMs
by: Wydmuch, Marek, et al.
Published: (2024)
by: Wydmuch, Marek, et al.
Published: (2024)
Query and Conquer: Execution-Guided SQL Generation
by: Borchmann, Łukasz, et al.
Published: (2025)
by: Borchmann, Łukasz, et al.
Published: (2025)
Can LLMs Help Create Grammar?: Automating Grammar Creation for Endangered Languages with In-Context Learning
by: Spencer, Piyapath T, et al.
Published: (2024)
by: Spencer, Piyapath T, et al.
Published: (2024)
Dynamic Boundary Time Warping for Sub-sequence Matching with Few Examples
by: Borchmann, Łukasz, et al.
Published: (2020)
by: Borchmann, Łukasz, et al.
Published: (2020)
How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO
by: Ng, Man Tik, et al.
Published: (2024)
by: Ng, Man Tik, et al.
Published: (2024)
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
by: Lee, Jooyoung, et al.
Published: (2024)
by: Lee, Jooyoung, et al.
Published: (2024)
Can Large Language Models Replace Data Scientists in Biomedical Research?
by: Wang, Zifeng, et al.
Published: (2024)
by: Wang, Zifeng, et al.
Published: (2024)
Can Many-Shot In-Context Learning Help LLMs as Evaluators? A Preliminary Empirical Study
by: Song, Mingyang, et al.
Published: (2024)
by: Song, Mingyang, et al.
Published: (2024)
Mixed Distillation Helps Smaller Language Model Better Reasoning
by: Li, Chenglin, et al.
Published: (2023)
by: Li, Chenglin, et al.
Published: (2023)
Can Hallucinations Help? Boosting LLMs for Drug Discovery
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
by: GX-Chen, Anthony, et al.
Published: (2025)
by: GX-Chen, Anthony, et al.
Published: (2025)
Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge
by: Koutcheme, Charles, et al.
Published: (2024)
by: Koutcheme, Charles, et al.
Published: (2024)
Can LLMs Solve longer Math Word Problems Better?
by: Xu, Xin, et al.
Published: (2024)
by: Xu, Xin, et al.
Published: (2024)
LLMs Can Teach Themselves to Better Predict the Future
by: Turtel, Benjamin, et al.
Published: (2025)
by: Turtel, Benjamin, et al.
Published: (2025)
Can GRPO Help LLMs Transcend Their Pretraining Origin?
by: Ni, Kangqi, et al.
Published: (2025)
by: Ni, Kangqi, et al.
Published: (2025)
Better Call Claude: Can LLMs Detect Changes of Writing Style?
by: Römisch, Johannes, et al.
Published: (2025)
by: Römisch, Johannes, et al.
Published: (2025)
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
by: Li, Zichong, et al.
Published: (2025)
by: Li, Zichong, et al.
Published: (2025)
CauScientist: Teaching LLMs to Respect Data for Causal Discovery
by: Peng, Bo, et al.
Published: (2026)
by: Peng, Bo, et al.
Published: (2026)
Multi-Prompting Decoder Helps Better Language Understanding
by: Cheng, Zifeng, et al.
Published: (2024)
by: Cheng, Zifeng, et al.
Published: (2024)
Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
by: Ning, Xuefei, et al.
Published: (2024)
by: Ning, Xuefei, et al.
Published: (2024)
The AI Data Scientist
by: Akimov, Farkhad, et al.
Published: (2025)
by: Akimov, Farkhad, et al.
Published: (2025)
Between Help and Harm: An Evaluation of Mental Health Crisis Handling by LLMs
by: Arnaiz-Rodriguez, Adrian, et al.
Published: (2025)
by: Arnaiz-Rodriguez, Adrian, et al.
Published: (2025)
Importance Weighting Can Help Large Language Models Self-Improve
by: Jiang, Chunyang, et al.
Published: (2024)
by: Jiang, Chunyang, et al.
Published: (2024)
Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams
by: Lee, Yukyung, et al.
Published: (2026)
by: Lee, Yukyung, et al.
Published: (2026)
What Can String Probability Tell Us About Grammaticality?
by: Hu, Jennifer, et al.
Published: (2025)
by: Hu, Jennifer, et al.
Published: (2025)
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
by: Jin, Yiping, et al.
Published: (2024)
by: Jin, Yiping, et al.
Published: (2024)
Decoder-Only LLMs are Better Controllers for Diffusion Models
by: Dong, Ziyi, et al.
Published: (2025)
by: Dong, Ziyi, et al.
Published: (2025)
Can Generic LLMs Help Analyze Child-adult Interactions Involving Children with Autism in Clinical Observation?
by: Feng, Tiantian, et al.
Published: (2024)
by: Feng, Tiantian, et al.
Published: (2024)
Model-Aware Tokenizer Transfer
by: Haltiuk, Mykola, et al.
Published: (2025)
by: Haltiuk, Mykola, et al.
Published: (2025)
Can Reasoning Help Large Language Models Capture Human Annotator Disagreement?
by: Ni, Jingwei, et al.
Published: (2025)
by: Ni, Jingwei, et al.
Published: (2025)
Can Stories Help LLMs Reason? Curating Information Space Through Narrative
by: Javadi, Vahid Sadiri, et al.
Published: (2024)
by: Javadi, Vahid Sadiri, et al.
Published: (2024)
Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks?
by: Greatrix, Thomas, et al.
Published: (2024)
by: Greatrix, Thomas, et al.
Published: (2024)
Rethinking LLM Evaluation: Can We Evaluate LLMs with 200x Less Data?
by: Wang, Shaobo, et al.
Published: (2025)
by: Wang, Shaobo, et al.
Published: (2025)
Evaluating Large Language Models in Theory of Mind Tasks
by: Kosinski, Michal
Published: (2023)
by: Kosinski, Michal
Published: (2023)
Can LLMs Help Uncover Insights about LLMs? A Large-Scale, Evolving Literature Analysis of Frontier LLMs
by: Park, Jungsoo, et al.
Published: (2025)
by: Park, Jungsoo, et al.
Published: (2025)
Similar Items
-
Language Models Model Language
by: Borchmann, Łukasz
Published: (2025) -
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
by: Turski, Michał, et al.
Published: (2025) -
Notes on Applicability of GPT-4 to Document Understanding
by: Borchmann, Łukasz
Published: (2024) -
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
by: Borchmann, Łukasz, et al.
Published: (2024) -
In Case You Missed It: ARC 'Challenge' Is Not That Challenging
by: Borchmann, Łukasz
Published: (2024)