國家衛生研究院 NHRI:Item 3990099045/14074

國家衛生研究院 NHRI:Item 3990099045/14074

English | 正體中文 | 简体中文 | 全文筆數/總筆數 : 12145/12927 (94%)
造訪人次 : 913992 線上人數 : 1266

RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.

搜尋範圍

查詢小技巧：

您可在西文檢索詞彙前後加上"雙引號"，以獲取較精準的檢索結果

若欲以作者姓名搜尋，建議至進階搜尋限定作者欄位，可獲得較完整資料

進階搜尋

主頁 ‧ 登入 ‧ 上傳 ‧ 說明 ‧ 關於NHRI ‧ 管理

到手機版

國家衛生研究院 NHRI > 高齡醫學暨健康福祉研究中心 > 吳其炘 > 期刊論文 > Item 3990099045/14074

請使用永久網址來引用或連結此文件: http://ir.nhri.org.tw/handle/3990099045/14074

題名:	Principle-based approach for the de-identification of code-mixed electronic health records
作者:	Wang, C;Wang, F;Lee, Y;Chen, P;Wang, B;Su, C;Kuo, C;Wu, C;Chien, Y;Dai, H;Tseng, VS;Hsu, W
貢獻者:	National Center for Geriatrics and Welfare Research;National Institute of Cancer Research
摘要:	Code-mixing is a phenomenon when at least two languages combined in a hybrid way in the context of a single conversation. The use of mixed language is widespread in multilingual and multicultural countries and poses significant challenges for the development of automated language processing tools. In Taiwan’s electronic health record (EHR) systems, the unstructured EHR texts are usually represented in the mixing of English and Chinese languages resulting in the difficulty for de-identification and synthetization of protected health information (PHI). We explored this problem by applied several state-of-the-art pre-trained mono- and multilingual language models and proposed to apply the principle-based approach (PBA) for the tasks of PHI recognition and resynthesis on a code-mixed EHR corpus, which was annotated with 6 main categories and 25 subcategories of PHIs. In PBA, a hierarchical principle slot schema is defined to encode knowledge of code-mixed PHIs and the defined slots were learned from the training set to assemble into principles for recognizing PHI mentions and synthesizing surrogates at the same time. A semantic disambiguation process is developed used to disambiguate ambiguous PHI categories in the de-identification process and to dynamically extend the knowledge encoded in PBA during the knowledge augmentation process. The experimental results demonstrate that the proposed method can achieve the best micro- and macro-F-scores performance in comparison with the other mono- and multilingual language models fine-tuned on our code-mixed corpus.
日期:	2022-02-01
關聯:	IEEE Access. 2022 Feb;10:22875-22885.
Link to:	http://dx.doi.org/10.1109/ACCESS.2022.3148396
JIF/Ranking 2023:	http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=NHRI&SrcApp=NHRI_IR&KeyISSN=2169-3536&DestApp=IC2JCR
Cited Times(WOS):	https://www.webofscience.com/wos/woscc/full-record/WOS:000766560600001
Cited Times(Scopus):	https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85124199713
顯示於類別:	[吳其炘] 期刊論文 [其他] 期刊論文

文件中的檔案:

檔案	描述	大小	格式	瀏覽次數
SCP85124199713.pdf		816Kb	Adobe PDF	180	檢視/開啟

在NHRI中所有的資料項目都受到原著作權保護.

TAIR相關文章

DSpace Software Copyright © 2002-2004 MIT & Hewlett-Packard / Enhanced by NTU Library IR team Copyright © - 回饋