The TRIPOD-LLM reporting guideline for studies using large language models
Jack Gallifant1,2,3Harvard Medical School, Department of Biomedical Informatics, Majid Afshar4,20Healthcare Analytics and Clinical Informatics, Saleem Ameen1,5,6,29Harvard Medical School, Yindalon Aphinyanaphongs7,29NYU Langone Health, Shan Chen8,9,28University of Oxford, Giovanni Cacciamani8,10,29Keck School of Medicine USC, Dina Demner-Fushman11,29National Library of Medicine, Dmitriy Dligach12,29Loyola University Chicago
Roxana Daneshjou13,14,29Stanford University School of Medicine, Chrystinne Fernandes1,29Harvard Medical School, Lasse Hyldig Hansen15,29Aarhus University, Adam Landman16,20Brigham and Women's Hospital, Lisa Lehmann16,29Brigham and Women's Hospital, Liam G. McCoy17,29University of Toronto, Timothy Miller18,29Boston Children's Hospital, Amy Moreno19,29Duke University School of Medicine
Nikolaj Munch15,29Aarhus University, David Restrepo1,20,29Harvard Medical School, Guergana Savova18,29Boston Children's Hospital, Renato Umeton21,29Dana-Farber Cancer Institute, Judy Wawira Gichoya22,29Emory University School of Medicine, Gary S. Collins23,24University of Oxford, Karel G. M. Moons25,26University Medical Center Utrecht, Leo A. Celi1,27,28Harvard Medical School, MIT, Danielle S. Bitterman1,8Harvard Medical School, Dana-Farber Cancer Institute
TRIPOD-LLM provides standardized reporting guidelines for large language models in healthcare applications. This extension of the TRIPOD framework addresses the unique challenges of LLMs through a comprehensive checklist of 19 main items, covering key aspects from research design to clinical applicability. Developed through expert consensus, TRIPOD-LLM emphasizes transparency, human oversight, and task-specific performance reporting, supported by an interactive website for guideline completion.
Why Standardized Reporting Matters
Large language models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. Current LLM research often lacks consistency in how methods and results are reported, making it difficult to assess and compare different approaches.
TRIPOD-LLM extends the TRIPOD + artificial intelligence statement, addressing the unique challenges of LLMs in biomedical applications.
TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce a modular format accommodating various LLM research designs and tasks, with 14 main items and 32 subitems applicable across all categories.
Key Features of TRIPOD-LLM
Accommodates various LLM research designs and tasks with a flexible structure that can be adapted to different use cases while maintaining reporting consistency.
19 main items and 50 subitems covering all aspects of LLM research from title and abstract to methods, results, and discussion.
Emphasizes the importance of reporting performance metrics that are relevant to the specific healthcare task being addressed.
Interactive tool for easy guideline completion and PDF generation for submission with research papers.
Interactive Website
To facilitate the adoption of TRIPOD-LLM, we developed an interactive website that guides researchers through the process of completing the checklist, providing context and examples for each item.
When using TRIPOD-LLM for your research, please cite the following paper:
Gallifant, J., Afshar, M., Ameen, S. et al. The TRIPOD-LLM reporting guideline for studies using large language models. Nat Med (2025). https://doi.org/10.1038/s41591-024-03425-5
@article{gallifant2025tripod, title={The TRIPOD-LLM reporting guideline for studies using large language models}, author = {Gallifant, Jack and Afshar, Majid and Ameen, Saleem and Aphinyanaphongs, Yindalon and Chen, Shan and Cacciamani, Giovanni and Demner-Fushman, Dina and Dligach, Dmitriy and Daneshjou, Roxana and Fernandes, Chrystinne and Hansen, Lasse Hyldig and Landman, Adam and Lehmann, Lisa and McCoy, Liam G. and Miller, Timothy and Moreno, Amy and Munch, Nikolaj and Restrepo, David and Savova, Guergana and Umeton, Renato and Gichoya, Judy Wawira and Collins, Gary S. and Moons, Karel G. M. and Celi, Leo A. and Bitterman, Danielle S.}, journal={Nature Medicine}, year={2025}, publisher={Nature Publishing Group}, doi={10.1038/s41591-024-03425-5} }