Lung cancer screening and real-world data
Natural-language processing pipelines that extract granular smoking history from clinical narratives to power lung cancer screening eligibility determination at scale.
Named programs
- SHAPES (Smoking History and Pack-Year Extraction System)
- SmokeBERT - clinical narrative to structured smoking data (JCO CCI, 2025)
- Lung cancer screening implementation research at Vanderbilt-Ingram
Peer-reviewed publications (12)
- Heng Tan, Travis J. Osterman. SmokeBERT and Beyond: Bridging Clinical Narratives and Structured Smoking Data To Improve Lung Cancer Screening. JCO Clinical Cancer Informatics Dec 22, 2025
- Shelby A. Crants et al. Clonal Hematopoiesis of Indeterminate Potential After Radiation Therapy. International Journal of Radiation Oncology*Biology*Physics Oct 24, 2025
- Kai Zhang, Tongtong Huang, Bradley A. Malin, Travis Osterman, Qi Long, Xiaoqian Jiang. Introducing mCODEGPT as a zero-shot information extraction from clinical free text data tool for cancer research. Communications Medicine Oct 15, 2025
- Irbaz Bin Riaz, Muhammad Ali Khan, Travis J. Osterman. Artificial intelligence across the cancer care continuum. Cancer Aug 15, 2025
- Yanwei Li et al. Minimal Common Oncology Data Elements Genomics Pilot Project: Enhancing Oncology Research Through Electronic Health Record Interoperability at Vanderbilt University Medical Center. JCO Clinical Cancer Informatics Jun 28, 2024
- Levente Lippenszky et al. Prediction of Effectiveness and Toxicities of Immune Checkpoint Inhibitors Using Real-World Patient Data. JCO Clinical Cancer Informatics Mar 21, 2024
- Rachel S. Goodman et al. Accuracy and Reliability of Chatbot Responses to Physician Questions. JAMA Network Open Oct 2, 2023
- Eric M. Lander et al. Identification and Characterization of Avoidable Hospital Admissions in Patients With Lung Cancer. Journal of the National Comprehensive Cancer Network Oct 1, 2023
- Protiva Rahman et al. Accelerated curation of checkpoint inhibitor-induced colitis cases from electronic health records. JAMIA Open Apr 1, 2023
- Rachel S. Goodman, J. Randall Patrinely, Travis Osterman, Lee Wheless, Douglas B. Johnson. On the cusp: Considering the impact of artificial intelligence language models in healthcare. Med (New York, N.Y.) Mar 10, 2023
- Douglas Johnson et al. Assessing the Accuracy and Reliability of AI-Generated Medical Responses: An Evaluation of the Chat-GPT Model (under review). Feb 28, 2023
- Chloe Weidenbaum, Christopher G. Cann, Sarah Osmundson, Wade T. Iams, Travis Osterman. Two Uncomplicated Pregnancies on Alectinib in a Woman With Metastatic ALK-Rearranged NSCLC: A Case Report. JTO Clinical and Research Reports Jun 18, 2022
Selected talks (7)
- Tennessee Oncology Data Analysts Association (Nashville, TN): "Advancing Lung Cancer Treatment in the Era of Precision Oncology". Oct 7, 2022
- DBMI Research Forum (Nashville, TN): "EHR-Wide GxE Study Using Smoking Information Extracted From Clinical Notes". May 16, 2016
- University of California San Diego, Division of Biomedical Informatics (San Diego, CA): "Extracting and Studying Granular Smoking History from the Electronic Health Record". Mar 8, 2016
- AMIA Joint Summit: "Extracting Tobacco Exposure with the Smoking History and Pack-Year Extraction System (SHAPES)". Mar 13, 2018
- Conquer Cancer Foundation Scientific and Career Development Retreat (Washington, DC): "Smoking History and Pack Year Extraction System (SHAPES): Supporting Lung Cancer Screening and Tobacco-related Research". Oct 11, 2017
- NLM Informatics Training Conference (Columbus, OH): "EHR-Wide GxE Study using Smoking Information Extracted from Clinical Notes". Jun 29, 2016
- AMIA Annual Symposium (San Francisco, CA): "Quantifying Tobacco Exposure Using Clinical Notes and Natural Language Processing to Enable Lung Cancer Screening". Nov 18, 2015
Abstracts (8)
- Joseph Vento, Lisa Bastarache, Qingxia M. Chen, Travis Osterman. Real-world side effects of targeted therapies: High-throughput association studies leveraging the CancerLinq Discovery lung cancer database. Journal of Clinical Oncology May 28, 2025
- David Smith et al. 1246 Prediction of pneumonitis in immunotherapy patients from prior thorax CT. Journal for ImmunoTherapy of Cancer Nov 1, 2024
- Zoltan Kiss et al. 1294 External validation of machine learning models to predict efficacy and toxicity of immune checkpoint inhibitors using real-world pan cancer cohorts. Journal for ImmunoTherapy of Cancer Nov 1, 2023
- Levente Lippenszky et al. 1300 Prediction of efficacy and toxicities of immune checkpoint inhibitors using real-world patient data. Journal for ImmunoTherapy of Cancer Nov 1, 2023
- Eszter Csernai et al. Rolling window-based hepatitis toxicity prediction from routine bloodwork in patients undergoing immune checkpoint inhibitor therapy. Journal of Clinical Oncology Jun 2022
- Gergely Horváth et al. Predicting immune checkpoint inhibitor-related hepatitis using electronic health records of patients. Journal of Clinical Oncology Jun 2022
- Levente Lippenszky et al. Predicting immune checkpoint inhibitor-related pneumonitis using patient medical information. Journal of Clinical Oncology Jun 2022
- Eric Michael Lander et al. Characterization of avoidable hospital admissions in patients with lung cancer in the immunotherapy and targeted therapy era. Journal of Clinical Oncology Jun 2022
In the news (2)
- Targeted cancer drug during pregnancy · Vanderbilt University. Aug 2, 2022
- Microsoft Investigator Fellow Dr. Travis Osterman uses Azure to support lung cancer treatment protocols · Microsoft Customer Stories. Jul 23, 2021