國家衛生研究院 NHRI:Item 3990099045/15767
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 12145/12927 (94%)
Visitors : 907819      Online Users : 916
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: http://ir.nhri.org.tw/handle/3990099045/15767


    Title: Early detection of nasopharyngeal carcinoma through machine-learning-driven prediction model in a population-based healthcare record database
    Authors: Chen, JW;Lin, ST;Lin, YC;Wang, BS;Chien, YN;Chiou, HY
    Contributors: Institute of Population Health Sciences
    Abstract: Objective: Early diagnosis and treatment of nasopharyngeal carcinoma (NPC) are vital for a better prognosis. Still, because of obscure anatomical sites and insidious symptoms, nearly 80% of patients with NPC are diagnosed at a late stage. This study aimed to validate a machine learning (ML) model utilizing symptom-related diagnoses and procedures in medical records to predict nasopharyngeal carcinoma (NPC) occurrence and reduce the prediagnostic period. Materials and Methods: Data from a population-based health insurance database (2001-2008) were analyzed, comparing adults with and without newly diagnosed NPC. Medical records from 90 to 360 days before diagnosis were examined. Five ML algorithms (Light Gradient Boosting Machine [LGB], eXtreme Gradient Boosting [XGB], Multivariate Adaptive Regression Splines [MARS], Random Forest [RF], and Logistics Regression [LG]) were evaluated for optimal early NPC detection. We further use a real-world data of 1 million individuals randomly selected for testing the final model. Model performance was assessed using AUROC. Shapley values identified significant contributing variables. Results: LGB showed maximum predictive power using 14 features and 90 days before diagnosis. The LGB models achieved AUROC, specificity, and sensitivity were 0.83, 0.81, and 0.64 for the test dataset, respectively. The LGB-driven NPC predictive tool effectively differentiated patients into high-risk and low-risk groups (hazard ratio: 5.85; 95% CI: 4.75-7.21). The model-layering effect is valid. Conclusions: ML approaches using electronic medical records accurately predicted NPC occurrence. The risk prediction model serves as a low-cost digital screening tool, offering rapid medical decision support to shorten prediagnostic periods. Timely referral is crucial for high-risk patients identified by the model.
    Date: 2024-03-28
    Relation: Cancer Medicine. 2024 Mar 28;13(7):Article number e7144.
    Link to: http://dx.doi.org/10.1002/cam4.7144
    JIF/Ranking 2023: http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=NHRI&SrcApp=NHRI_IR&KeyISSN=2045-7634&DestApp=IC2JCR
    Cited Times(WOS): https://www.webofscience.com/wos/woscc/full-record/WOS:001192185600001
    Cited Times(Scopus): https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85189259863
    Appears in Collections:[Hung-Yi Chiou] Periodical Articles

    Files in This Item:

    File Description SizeFormat
    ISI001192185600001.pdf850KbAdobe PDF161View/Open


    All items in NHRI are protected by copyright, with all rights reserved.

    Related Items in TAIR

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback