The TRIPOD-LLM reporting guideline for studies using large language models

Jack Gallifant1,2,3, Majid Afshar4,20, Saleem Ameen1,5,6,29, Yindalon Aphinyanaphongs7,29, Shan Chen8,9,28, Giovanni Cacciamani8,10,29, Dina Demner-Fushman11,29, Dmitriy Dligach12,29

Roxana Daneshjou13,14,29, Chrystinne Fernandes1,29, Lasse Hyldig Hansen15,29, Adam Landman16,20, Lisa Lehmann16,29, Liam G. McCoy17,29, Timothy Miller18,29, Amy Moreno19,29

Nikolaj Munch15,29, David Restrepo1,20,29, Guergana Savova18,29, Renato Umeton21,29, Judy Wawira Gichoya22,29, Gary S. Collins23,24, Karel G. M. Moons25,26, Leo A. Celi1,27,28, Danielle S. Bitterman1,8

Standardized Reporting for LLMs in Healthcare

TRIPOD-LLM provides standardized reporting guidelines for large language models in healthcare applications. This extension of the TRIPOD framework addresses the unique challenges of LLMs through a comprehensive checklist of 19 main items, covering key aspects from research design to clinical applicability. Developed through expert consensus, TRIPOD-LLM emphasizes transparency, human oversight, and task-specific performance reporting, supported by an interactive website for guideline completion.

Why Standardized Reporting Matters

Large language models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. Current LLM research often lacks consistency in how methods and results are reported, making it difficult to assess and compare different approaches.

TRIPOD-LLM Funnel Diagram

TRIPOD-LLM extends the TRIPOD + artificial intelligence statement, addressing the unique challenges of LLMs in biomedical applications.

TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce a modular format accommodating various LLM research designs and tasks, with 14 main items and 32 subitems applicable across all categories.

Key Features of TRIPOD-LLM

Modular Framework

Accommodates various LLM research designs and tasks with a flexible structure that can be adapted to different use cases while maintaining reporting consistency.

Comprehensive Checklist

19 main items and 50 subitems covering all aspects of LLM research from title and abstract to methods, results, and discussion.

Task-Specific Performance

Emphasizes the importance of reporting performance metrics that are relevant to the specific healthcare task being addressed.

Interactive Website

Interactive tool for easy guideline completion and PDF generation for submission with research papers.

Interactive Website

To facilitate the adoption of TRIPOD-LLM, we developed an interactive website that guides researchers through the process of completing the checklist, providing context and examples for each item.

How To Cite

When using TRIPOD-LLM for your research, please cite the following paper:

Bibliography

Gallifant, J., Afshar, M., Ameen, S. et al. The TRIPOD-LLM reporting guideline for studies using large language models. Nat Med (2025). https://doi.org/10.1038/s41591-024-03425-5

BibTeX

@article{gallifant2025tripod,
  title={The TRIPOD-LLM reporting guideline for studies using large language models},
	author = {Gallifant, Jack and Afshar, Majid and Ameen, Saleem and Aphinyanaphongs, Yindalon and Chen, Shan and Cacciamani, Giovanni and Demner-Fushman, Dina and Dligach, Dmitriy and Daneshjou, Roxana and Fernandes, Chrystinne and Hansen, Lasse Hyldig and Landman, Adam and Lehmann, Lisa and McCoy, Liam G. and Miller, Timothy and Moreno, Amy and Munch, Nikolaj and Restrepo, David and Savova, Guergana and Umeton, Renato and Gichoya, Judy Wawira and Collins, Gary S. and Moons, Karel G. M. and Celi, Leo A. and Bitterman, Danielle S.},
  journal={Nature Medicine},
  year={2025},
  publisher={Nature Publishing Group},
  doi={10.1038/s41591-024-03425-5}
}