| Main Author: | LIN, XINYI |
|---|---|
| Format: | Digital resource |
| Language: | |
| Published: | Zenodo, 2026 |
| Online Access: | https://doi.org/10.5281/zenodo.18857202 |
| _version_ | 1866901892208001024 |
|---|---|
| author | LIN, XINYI |
| author_facet | LIN, XINYI |
| contents | Visualization code and data for four biomedical tasks: clinical QA (MedMCQA), evidence extraction (PubMedQA), gene set functional annotation, and cell type identification. These materials accompany the manuscript "EvaluateBM: a multi-agent framework for evaluating reasoning-capable language models in biomedical tasks". |
| format | Digital resource |
| id | zenodo_https___doi_org_10_5281_zenodo_18857202 |
| institution | Zenodo |
| language | |
| publishDate | 2026 |
| publisher | Zenodo |
| record_format | zenodo |
| title | EvaluateBM: a multi-agent framework for evaluating reasoning-capable language models in biomedical tasks |
| url | https://doi.org/10.5281/zenodo.18857202 |