Prevalence of Thalassemia in the Vietnamese Population and Building a Clinical Decision Support System for Prenatal Screening for Thalassemia 

Danh Cuong Tran1, Anh Linh Dang1, Thi Ngoc Lan Hoang2, Chi Thanh Nguyen3, Thi Minh Phuong Le4, Thi Ngoc Mai Dinh5, Van Anh Tran5, Thi Kim Phuong Doan2 and Thi Trang Nguyen2.


1 Center for Prenatal Diagnosis, National Hospital of Obstetrics and Gynecology, Hanoi, Vietnam.
2 Department of Biomedical and genetics, Hanoi Medical University, Hanoi, Vietnam.
3 Department of Specialized Software, Academy of Military Science and Technology, Hanoi, Vietnam.
4 Department of Basic Sciences in Medicine and Pharmacy, University of Medicine and Pharmacy - Vietnam National University, Hanoi, Vietnam.
5 Department of Pediatrics, Hanoi Medical University, Hanoi, Vietnam. 

Correspondence to:  Thi Trang Nguyen. E-mail: thitrangnguyen42@yahoo.com

Published: May 1, 2023
Received: January 11, 2023
Accepted: April 16, 2023
Mediterr J Hematol Infect Dis 2023, 15(1): e2023026 DOI 10.4084/MJHID.2023.026

This is an Open Access article distributed under the terms of the Creative Commons Attribution License
(
https://creativecommons.org/licenses/by-nc/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Introduction: The prevalence of thalassemia among the Vietnamese population was studied, and clinical decision support systems for prenatal screening of thalassemia were created. The aim of this report was to investigate the prevalence of thalassemia in the Vietnamese population, building a clinical decision support system for prenatal screening for thalassemia. 

Methods: A cross-sectional study was conducted on pregnant women and their husbands visiting the Vietnam National Hospital of Obstetrics and Gynecology from October 2020 to December 2021. A total of 10112 medical records of first-time pregnant women and their husbands were collected. 

Results: A clinical decision support system was built, including 2 different types of systems for prenatal screening for thalassemia (an expert system and 4 AI-based CDSS). One thousand nine hundred ninety-two cases were used to train and test machine learning models, while 1555 cases were used for specialized expert system evaluation. There were ten key variables for AI-based CDSS for machine learning. The four most important features in thalassemia screening were identified. The accuracy of the expert system and AI-based CDSS was compared. The rate of patients with Alpha thalassemia is 10.73% (1085 patients), the rate of patients with betathalassemia is 2.24% (227 patients), and 0.29% (29 patients) of patients carry both alphathalassemia and beta-thalassemia gene mutations. The expert system showed an accuracy of 98.45%. Among the AI-based CDSS developed, the multilayer perceptron (MLP) model was the most stable regardless of the training database (accuracy of 98,5% using all features and 97% using only the four most important features). 

Conclusions: When comparing the expert system with the AI-based CDSS, the accuracy of the expert system and AI-based models was comparable. The developed expert system for prenatal thalassemia screening showed high accuracy. AI-based CDSS showed satisfactory results. Further development of such systems is promising with a view to their introduction into clinical practice.



Introduction

Thalassemia is an inherited autosomal recessive disease characterized by impaired synthesis of hemoglobin protein chains.[1] A normal mature hemoglobin molecule (HbA) consists of two pairs of alpha and beta chains. Normal adult blood also contains ≤ 2.5% hemoglobin A2 (HbA2) (consisting of alpha and delta chains) and <2% hemoglobin F (HbF, fetal hemoglobin), which has gamma chains instead of beta chains.[2] Thalassemia is caused by a gene mutation of the gene responsible for globin chain synthesis, based on which alpha-thalassemia and, beta-thalassemia, delta-beta-thalassemia are distinguished.[3] Such a gene can be inherited from one parent or two. The child's body produces fewer or no hemoglobin chains. The production of the other chains that make up globin does not end. As a result, unstable protein components are produced that destroy the blood cells.[4] Thalassemia is thus the result of reduced synthesis of at least one globin polypeptide chain (beta, alpha, gamma, delta), resulting in abnormal erythrocytes that are microcytic, often irregularly shaped, and prone to hemolysis. This results in anemia and often in iron overload.[5]
The severity of the disease depends on the number of mutated alleles. In humans, the alpha chain of hemoglobin is encoded by two pairs of genes, while the beta chain has only one pair. Patients with one alpha + allele are clinically normal and are called asymptomatic carriers. Heterozygotes with defects in 2 of the 4 genes (small alpha thalassemia) tend to develop microcytic anemia of mild to moderate severity but with a subclinical course. Defects in 3 of the four genes significantly impair alpha-chain synthesis, in which hemolytic anemia and splenomegaly are common. A defect in all 4 is a fatal condition that causes intrauterine fetal death.[6] Minor beta-thalassemia occurs in asymptomatic heterozygotes with a mild to moderate clinical picture of microcytic anemia. Intermediate beta-thalassemia presents a variable clinical picture due to the inheritance of 2 beta-thalassemia alleles. Large beta-thalassemia (Cooley's anemia) occurs in homozygous patients or complex heterozygotes due to a severe beta-globin defect. These patients develop severe anemia and bone marrow hyperactivity.[7,8] In addition, it is rare to find simultaneous disorders in both alpha- and beta-chain at once, but in this case, the disease may be milder because there is little imbalance between the two types of chains.[3] Thus, severe forms of the disease seriously affect physical development, causing patients to need continuous blood transfusions for life with many complications in the liver, heart, endocrine glands, and bones. The disease is life-threatening, worsening the patient's quality of life, and involves expensive treatments that burden the family and society.[9]
Worldwide, an estimated 7% of the population carries the thalassemia gene, and each year between 300,000 and 500,000 babies are born with severe homozygosity for the disease.[10,11] In Vietnam, more than 5 million people carry the gene; every year, more than 100,000 children carry the gene, and 1,700 children have the severe disease due to mutations in both genes.[12] Thalassemia is distributed in all provinces and ethnic groups throughout the country, especially ethnic minorities in mountainous provinces.[12]
Thalassemia is a preventable disease screening pregnant women and their husbands at risk of carrying the disease gene to prevent having children with the disease.[13,14]
Nowadays, modern technology has been researched and applied in medicine to support doctors in patient care and practice specialization.[15] A Clinical Decision Support System (CDSS) is "any electronic or non-electronic system designed to aid directly in clinical decision making, in which characteristics of individual patients are used to generate patient-specific assessments or recommendations that are then presented to clinicians for consideration".[16] CDSSs are classified as expert knowledge-based systems and artificial intelligence (AI).[17] CDSSs have many advantages, such as reducing the rate of misdiagnosis, improving efficiency and patient care, and reducing the risk of medication errors.[18] For thalassemia, there have been studies around the world applying AI in screening carriers with high efficiency. In 2002, Amendolia and colleagues[19] studied and built a real-time classification system based on Artificial Neural Networks (ANNs) to distinguish thalassemia gene carriers and normal people with an accuracy of 94%, a sensitivity of 92% and a specificity of 95%. In 2013, Masala et al.[20] compared the performance of The Radial Basis Function (RBF) network, The Probabilistic Neural Network (PNN), and The K-Nearest Neural Network (KNN) algorithms in thalassemia screening with 304 data samples. The results show that the RBF algorithm had a sensitivity of 93% and a specificity of 91%, similar to the results of KNN of 80% and 91% and PNN of 89% and 73%.[20]
The screening for thalassemia in Vietnam is still mainly based on the two indexes (mean corpuscular volume (MCV) and mean corpuscular hemoglobin (MCH)) and performed manually, and no CDSS has been built yet. This causes many difficulties in disease prevention at primary health care facilities and in ethnic minority areas because of the limited understanding of thalassemia not only by the people but also by the grassroots medical staff. The requirement is to build AI software and an expert knowledge-based system for thalassemia screening that can be applied to even primary healthcare facilities.
Thus, developing and introducing into clinical practice modern CDSSs for screening thalassemia in Vietnam is an urgent task. These systems can significantly assist doctors in making optimal decisions, even in primary healthcare institutions. Special attention should be paid to CDSSs based on machine learning algorithms, which remain poorly studied.
Therefore, our research was carried out with the following objectives:
1. Investigating the prevalence of thalassemia in the Vietnamese population
2. Building a clinical decision support system for prenatal screening for thalassemia.

Material and Methods

Study design. This cross-sectional study was conducted from October 2020 to December 2021. Data were collected using a convenient sampling method. We collected data on pregnant women and their husbands when they came to the Vietnam National Hospital of Obstetrics and Gynecology for annual screening of congenital disabilities through medical records. Data were collected from the medical records of patients who came to the hospital before the study.
Ethical approval of the study. The research was approved by the Vietnam National Hospital of Obstetrics and Gynecology Ethics committee, Minutes 54/5 of 2020-09-12. The research has been performed in accordance with the Declaration of Helsinki.
Study subjects. A total of 10112 medical records of first-time pregnant women and their husbands were collected, of which 1992 cases were used to train and test machine learning models while 1555 cases were used for specialized knowledge system evaluation. All patients underwent routine screening of thalassemia: peripheral blood smear, complete blood count (CBC), hemoglobin quantification by high-performance liquid chromatography (HPLC) and capillary electrophoresis (CE), and iron status tests. Detection of hemoglobin gene mutations by polymerase chain reaction (PCR) was performed for 1,364 patients and 658 newborns to assess the prevalence of different forms of thalassemia in the Vietnamese population. The multiplex ligation-dependent probe amplification (MLPA) technique has been used for the molecular detection of alpha-thalassemia. The Reverse Dot-Blot PCR technique has been used for the molecular detection of beta-thalassemia.
Data analysis. Two CDSS models for thalassemia pre-screening have been created: 4 AI-based CDSS for machine learning and an expert system. The basic difference between these two types of systems is that one is based on the knowledge base gathered from the knowledge of experts, and the other is based on computer mining knowledge from medical data. The specialized expert system was built based on the guideline for prenatal screening for thalassemia of the Vietnamese Ministry of Health.
The following independent variables were used in expert system CDSS: 4 most important indicators from CBC result (according to MID and MDA algorithms), including hemoglobin (HGB), mean corpuscular hemoglobin (MCH), mean corpuscular volume (MCV), red blood cell distribution width (RDW), serum ferritin concentration from iron status tests result, HbA2 and HbF levels from HPLC result. In addition, one should take into account the history of hydrops fetalis and having children or family members diagnosed with thalassemia.
There were 10 key variables for AI-based CDSSs for machine learning: the dependent variable was whether a patient had a thalassemia gene; 9 independent variables, including 7 from the CBC result were hematocrit (HCT), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), mean corpuscular volume (MCV), hemoglobin (HGB), red blood cell distribution width (RDW), red blood cell (RBC) and the other 2 are iron status: serum iron and serum ferritin. In addition, AI-based CDSSs for machine learning were evaluated using the four most important indicators above-mentioned (according to MID and MDA algorithms).
Data from 1992 pregnant women and their husbands were used to train and test 4 machine learning models, which were K-nearest neighbors (KNN), Support vector machine (SVM), Random Forest (RF), and Multilayer perceptron (MLP). The purpose of these models is the screening for thalassemia gene in pregnant women, husbands, and both pregnant women and husbands. Thus, we used data from all participants and divided them into 2 subsets, 1 with data from pregnant women only and the other from the husbands. After analyzing the dependent variable, whether the participant had a thalassemia gene or not, we realized that there were more participants without a thalassemia gene than those who did, which caused an unbalance in the dataset and resulted in the inaccuracy of all models. To solve this problem, we performed Synthetic Minority Over-sampling Technique (SMOTE). This method was introduced in 2002 by Nitesh Chawla et al.;[21] the idea is based on the K-Nearest Neighbors algorithm. We get one sample randomly from the minority layer a and one of its k nearest neighbors in the feature space, then we randomly choose a k nearest neighbors b and draw a line between these samples in the feature space. New samples are created on this line as the combination of a and b. These new samples helped balance the datasets, meaning the number of participants with the thalassemia gene is now equal to those without this gene. Datasets were then standardized using the z-score method; in particular, we used a StandardScaler command from the sci-kit-learn library. Hyperparameters were found using grid search. All 4 models were tested using 10-fold cross-validation using these datasets and evaluated by 4 indices: accuracy, precision, recall, and F1-score to find the best one.

Results

Prevalence of types of thalassemia. Ten thousand one hundred twelve pregnant women and their husbands performed the CBC, HPLC, and iron status during the study period. Based on the CBC, HPLC, and iron status results, 1,364 pregnant women and their husbands were prescribed a genetic test due to the suspected presence of thalassemia genes. The genetic test resulted in 1,085 (10.73% of the total number of pregnant women and their husbands) alpha-thalassemia cases and 227 (2.24%) beta-thalassemia cases. A small ratio of 0.29% inherited both α– and β–thalassemia genes, and 0.23% were others, including HbE disease, alpha thalassemia/HbE, beta-thalassemia/HbE and hemoglobin Constant Spring disease (Table 1). Among 658 fetuses of parents with identified thalassemia genes who performed the genetic testing, the frequency of α-thalassemia (61.85%) was also higher than β-thalassemia (13.07%) and others (including HbE disease, α-thalassemia/HbE, and β-thalassemia/HbE) (0.61%) (Table 2).

Table 1    Table 1. The frequency of different types of thalassemia in patients.

Table 2    Table 2. Prevalence of thalassemia types in fetuses who underwent genetic testing.

Determining the features of the CBC, HPLC, and iron status test needed to screen for thalassemia. After using MID and MDA algorithms, we found that 4 indices included, HGB, MCV, MCH, and RDW were the most important ones in the screening of thalassemia from the database containing 10112 cases having CBC, HPLC, and iron status results (Figures 1, 2).

Figure 1   Figure 1. From left to right, feature importance using the MDI algorithm.

Figure 2   Figure 2. From left to right, feature importance using the MDA algorithm.

Selection and evaluation of the effectiveness of AI models in screening for thalassemia.
- The experimental result with general CBC and iron status database. According to Table 3, when using the 4 most important indicators in thalassemia screening instead of using them all in CBC and iron status database, the Accuracy and Precision indices decreased but not much. With several models, the result was even more improved. For example, with the SVM model, Accuracy and Precision indices were increased (95% and 91.25% versus 94.5% and 90.12%).
KNN model using only 4 important features and training with the general CBC and iron status database (pregnant women and their husbands) had an accuracy of 97%, a precision of 93.75%, a recall of 98.68%, and an F1-score of 98.15% for original data and an accuracy of 92.5%, a precision of 85.9%, a recall of 98.68%, and an F1-score of 90.91% for SMOTE; MLP model had an accuracy of 97%, a precision of 92.68%, a recall of 99.23%, and an F1-score of 96.20% for original data and an accuracy of 94.5%, a precision of 88.24%, a recall of 98.68%, and an F1-score of 93.17% for SMOTE. SVM and RF models showed similar but slightly lower results for some indices (Table 3).


Table 3
Table 3. Comparing the results of models using the 4 most important features with models using all features in general CBC and iron status database (pregnant women and their husbands). Precision and Recall indices were calculated for thalassemia carrier.

- The experimental result with CBC and iron status database of pregnant women. With the data filtered for pregnant women, the training and testing are similar to the full one, including pregnant women and their husbands. The results are shown in Table 4.
The results showed that among the 4 training models, the KNN and MLP models were the best with 96.93% and 96.91% accuracy, 93.22% and 93.25% precision, 98.21% and 100% recall when using all CBC and iron status features. Wherein, with only the 4 most important features, the accuracy of the KNN and MLP models impressively increased from 96.93% to 97.53% and 96.91% to 97.81%. The accuracy of the SVM model also increased from 95.68% to 96.91% (Table 4). One special issue with this database was that after balancing the dataset with SMOTE technique, the result was improved for some indexes.


Table 4
Table 4. Comparing the results of models using the 4 most important features with models using all features in CBC, and iron status database of pregnant women. Precision and Recall indices were calculated for the thalassemia carrier.

- Experimental result with CBC and iron status database of husband. Similar to data on complete blood counts and iron status from pregnant women, the packaged models were not influenced by gender. With full CBC and iron status, the RF model showed the highest accuracy at 97.55%, precision index at 95.46%, and recall index at 100% (Table 5). Using only 4 important features did not change the results, except for the multilayer perceptron (MLP) model, whose accuracy was increased significantly to 97.44%. One remarkable issue with this database was that after balancing the dataset with SMOTE technique, the result of the models was decreased for some indexes (Table 5).

Table 5
Table 5. Comparing the results of models using the 4 most important features with models using all features in CBC and iron status database of husbands. Precision and Recall indices were calculated for thalassemia carrier. Precision and Recall indices were calculated for the thalassemia carrier.

Evaluation of the effectiveness of the expert system (ES) in screening for thalassemia and comparison with AI models. To build an expert system (ES) in screening for thalassemia, we applied the rules in Table 6.
The effectiveness of the expert system was evaluated in the same CBC, HPLC, and iron status database used to test the AI models, including 396 pregnant women and husbands who met the inclusion criteria (presence of thalassemia gene by PCR). However, the result showed that 323 cases were determined the risk of having thalassemia by the software, and 73 cases that were not. The reason for 73 cases was that patients had performed only CBC tests, neither serum ferritin nor hemoglobin variant analysis, and no related history, which caused a lack of indicators to predict. 323 cases left were evaluated and calculated, shown in Tables 7 and 8. There were 5 incorrect cases in risk assessment (false positive) and no false negative cases, which meant no thalassemia patients were left out. Comparing the expert system with the AI model, the accuracy of the expert system was 98.45%, while that of the KNN and MLP models was 97%, the RF model – 96%, and the SVM model – 94,5% when using only the 4 most important features (Table 8)
 

Table 6    Table 6. Summary of rules in ES.

Table 7    Table 7. Evaluation of the effectiveness of the expert system (ES).

Table 8    Table 8. Comparison between the expert system and AI model when using only the 4 most important features in general CBC, HPLC and iron status database (pregnant women and their husbands).


Discussion

The present study analyzed the prevalence of different thalassemia types among pregnant women and their husbands who came to the Vietnam National Hospital of Obstetrics and Gynecology. The prevalence of different thalassemia types in fetuses of parents with established, according to PCR tests, thalassemia genes was also evaluated. The prevalence of alpha-thalassemia in pregnant women, husbands, and fetuses was higher than in beta-thalassemia carriers. (Tables 1, 2).
The prevalence may have been because women whose babies had hydrop details in the previous pregnancy or the fetuses with alpha thalassemia would show clinical manifestations, so they would go for a check-up and perform prenatal screening tests to do the treatment or prevent thalassemia for the next pregnancy. The same reason could explain why the prevalence of alpha-thalassemia carriers of pregnant women who were screened for thalassemia was also higher than beta-thalassemia at the prenatal diagnostic center in Central Obstetrics Hospital in the study conducted by Dang.[22]
In our study, 8841 pregnant women were screened by CBC, HPLC, and iron status test; the prevalence of alpha-thalassemia carriers was 6.4%, the prevalence of beta-thalassemia carriers was 1.32%, the prevalence of co-inheritance of alpha and beta-thalassemia was 0.08% (Table 1), which were higher than those studied by Nguyen Khac Han Hoan. In this study, the percentages of pregnant women carrying the alpha-thalassemia carriers, beta-thalassemia carriers, and co-inheritance of alpha and beta-thalassemia were 1.11%, 0.52%, and 0.07%, respectively. However, in both studies, the percentage of pregnant women and pregnant women with alpha-thalassemia carriers was the highest (6.4%, 61.85%, and 1.11%, 71.3%).[23] However, our results were lower compared to the research of Sheng He et al. (8.1% and 3.4%)[24] and Naili Husna's study (alpha gene carrier thalassemia is 9.8%).[25] The difference in size, region, and country can explain the reason for these differences.
Nowadays, one of the most important components in family planning and pregnancy is detecting and preventing hereditary pathologies in future offspring. In this regard, genetic tests and consultations are conducted for future parents, based on which final decisions are made. The clinical decision support system (CDSS), which is currently being widely implemented in various fields of medicine, can undoubtedly provide significant assistance to physicians in the diagnosis and prognosis of hereditary pathologies.[26]
Thalassemia is a complicated disease without a cure, excluding allogeneic hematopoietic stem cell transplantation. Currently, the best strategies for thalassemia management are screening for thalassemia based on family history, cell blood count, serum iron, and ferritin. However, this is a challenge for primary health care facilities and ethnic minority areas due to the limited understanding of thalassemia not only by the population but also by the grassroots medical staff. To solve this problem, the authors conducted a clinical decision support system to support physicians in screening and diagnosing thalassemia. The clinical decision support system (CDSS) is a system comprising 2 different systems: The expert system (ES) and the AI-based clinical decision support system (AI-based CDSS).
The Expert system is a knowledge-based clinical decision support system encoding the experts' knowledge into an automated system.[27] The operating principle of the ES is the simulation of the procedure of diagnosis and screening for thalassemia done by medical physicians. It aims to support doctors dealing with complicated cases, especially at the commune health centers facing a shortage of qualified health workers. Like other expert systems, it comprises three main parts: The knowledge base, the inference engine, and the EHR front-end interface. The knowledge base includes a set of if-then rules built based on red blood cell indices, ferritin, and hemoglobin electrophoresis results using the guideline for prenatal screening for thalassemia of WHO and the Vietnam Ministry of Health (Table 8).
The guideline comprises recommendations proposed by experts and used by medical physicians to screen and diagnose thalassemia in clinical practice; thus, it is reliable and suitable for the racial characteristics of Vietnamese people. The system also includes a knowledge update interface, with which the experts can enrich the knowledge base by directly updating their clinical experience and new medical knowledge. This part is important to ensure the accuracy and the update of the expert system because the knowledge of thalassemia can be updated and changed over time. The inference engine is an essential part of the expert system, which applies the if-then rules of the knowledge base to the patient's clinical data to create an inference. Development of the inference engine is an important step when building an expert system to increase the system's accuracy in the research; the three-layer model was applied.
The EHR front-end interface of the model has 2 sections: the input section and the output section (Figure 3). The input section includes 9 boxes to enter patients' clinical information, including cell blood count, plasma iron, and ferritin. The output section, which includes 3 boxes, shows the conclusion drawn by the inference engine. The conclusions drawn by the expert system predict the risk of thalassemia, whether it is high risk or low risk, and the physicians can identify if this case requires diagnostic genetic testing.


Figure 3
Figure 3. Screenshots of the web-based ES after submission of laboratory data.

The advantage of the expert system is that it is possible to explain how the system makes the recommendation, and due to this, it has high reliability and accuracy. Another advantage of our ES is that it is deployed on the Internet; thus, users can access the ES on any computer and at any time with an internet connection. The evaluation of the effectiveness of the developed expert system (ES) in thalassemia screening in the present study showed a high accuracy of 98.45%.
Unlike the ES, The AI-based clinical decision support system does not use the knowledge base. Instead, it uses a form of AI called machine learning (ML), which allows computers to learn from past experiences and/or find patterns in clinical data to make decisions; thus, it does not require writing rules. Machine learning describes the computer algorithms to determine patterns in very large datasets. Over the past years, ML has been applied in a wide range of medical fields and demonstrated impressive results, especially in clinical decision support, patient monitoring, and management.[28] Many ML techniques have been applied in building CDSS, in which K-nearest neighbors, Support vector machine, Random Forest, and Multilayer perceptron (MLP) are the most common techniques.[29]
To optimize the clinical decision support system (CDSS), before constructing the CDSS, the authors conducted to determine the features of the CBC, HPLC, and iron status needed to screen for thalassemia. After using MID and MDA algorithms found that 4 parameters, including HGB, MCV, MCH, and RDW, were the most important ones in screening for thalassemia from the database containing over 10000 cases having CBC, HPLC, and iron status results. This report is consistent with the recommendation of WHO in screening for thalassemia; HGB, MCV, and MCH are the parameters currently used in screening for thalassemia in clinical practice.
In the research, the authors built an AI-based CDSS, including the AI algorithm and the EHR front-end interface.
To determine the most appropriate AI algorithm for constructing AI-based CDSS, the authors conducted to train the dataset on 4 models, SVM, KNN, MLP, and RF, then evaluated them on 4 indices: accuracy, precision, recall, and F1-score to choose the most appropriate one.
The obtained result showed that the MLP model was the most stable regardless of the training database. Particularly, when training with the general CBC and iron status database for 4 most important features, it had an accuracy of 97%, a precision of 92.68%, a recall of 99.23%, an F1 score of 96.20% for original data, and an accuracy of 94.5%, a precision of 88.24%, a recall of 98.68% and an F1-score of 93.17% for SMOTE. When training with the database of pregnant women, it had an accuracy of 97.81%, a precision of 93.21%, a recall of 99.84%, an F1-score of 95.72% for original data, and an accuracy of 97.65%, a precision of 93.28%, recall of 99.73%, F1-score of 96.43% for SMOTE. Especially with only 4 important features selected above, the model's results were extremely impressive since the accuracy of MLP increased from 96.91% to 97.81%. Another remarkable thing about this database is that after balancing data with SMOTE, the results had a significant difference between the models. Thus, it is possible to temporarily conclude that with only the data of the pregnant woman, the model is no longer confounded by the cell blood count data of the husband. When training with the database of husbands, it had an accuracy of 97.44%, a precision of 96.42%, a recall of 100%, and an F1 score of 97.62% for original data and an accuracy of 94.18%, a precision of 95.44%, a recall of 95.95% and an F1-score of 95.18% for SMOTE.
With such high accuracy, it is possible to apply MLP to construct an AI-based CDSS to predict the risk of thalassemia. In fact, this has been proven by previous research. Al Agha et al.[30] proposed an AI model for thalassemia prenatal screening built by training and evaluating 3 models, including KNN, NB (Navies Bayes), and MLP, among which the MLP model got the highest accuracy with 99.73%. Egejuru et al.[31] proposed to apply MLP to build an AI model for thalassemia classification. In the research, the accuracy of the MLP was 98.11%. Like the ES, The EHR fronted interference of the AI-based CDSS has two sections: An input section and an output section (Figure 4). The difference between the two systems is that the ES concludes the type of anemia and whether patients are at high risk or low risk, while the AI-based predicts the risk of thalassemia in the percentage form.


Figure 4
Figure 4. Screenshots of the web-based AI model after submission of laboratory data.

Despite its advantage, the adoption of AI in medicine is rife with challenges, including the impossibility of explaining the logic that ML uses to make an inference (black box). In AI-based CDSS, the users and researchers can only know the inputs and outputs, but it is challenging to understand how it works inside. Therefore, the accuracy of the AI-based CDSS is questionable.[17] To deal with this problem, the authors compared it to the ES by testing both ES and AI-based CDSS with 396 cases of thalassemia. The result showed that when testing on the same dataset, the AI-based CDSS got an accuracy of 94.5%, 96%, and 97% (depending on the algorithm used) when using four important features slightly lower than ES with 98.45%. Thus, it is possible to apply the AI-based CDSS in screening for thalassemia. However, it should be noted that the proposed AI-based CDSS for thalassemia screening is experimental. The advantage of algorithms built on deep machine learning over physician-based assessments requires more in-depth and comprehensive research. According to a meta-analysis of publications on the use of AI in CDSS models for various diseases, no advantage of algorithms built on deep machine learning over physician estimates was noted.[32] It is noteworthy that the effectiveness and safety of AI-based CDSS vary and are ambiguous: there are both successes and failures.[33] Regarding the proposed AI-based CDSSs, it is important to note that future advances in genetic diagnosis of thalassemia may require a significant revision of these CDSS and new studies to confirm the effectiveness and safety of such systems. AI-based CDSSs are an emerging but understudied field, requiring much effort before showing real progress.

Conclusions

Based on PCR tests, it was found that among pregnant women and their husbands who came to Vietnam National Hospital of Obstetrics and Gynecology, the rate of patients with Alpha thalassemia was 10.73% (1085 patients), the rate of patients with beta-thalassemia is 2.24% (227 patients), and 0.29% (29 patients) of patients carry both alpha-thalassemia and beta-thalassemia gene mutations. The authors successfully built expert and 4 AI-based CDSS for prenatal screening for thalassemia. The expert system developed based on WHO, and Vietnamese Ministry of Health rules and guidelines for prenatal thalassemia screening showed an accuracy of 98.45%. Among the AI-based CDSS developed, the MLP model was the most stable regardless of the training database (accuracy of 98,5% using all features and 97% using only the 4 most important features). When comparing the expert system with the AI-based CDSS, comparable accuracy of the expert system and AI-based models was established. Thus, AI-based CDSS showed satisfactory results. Further development of such systems is promising with a view to their introduction into clinical practice.

References   

  1. Sheth S, Thein SL. Thalassemia: a disorder of globin synthesis. In: Kaushansky K, Prchal JT, Burns LJ, Lichtman MA, Levi M, Linch DC, eds. Williams Hematology 10, New York, McGraw-Hill Education. 2021; 12-13.
  2. Gell DA. Structure and function of haemoglobins. Blood Cells Mol Dis. 2018;70:13-42. https://doi.org/10.1016/j.bcmd.2017.10.006 PMid:29126700
  3. Viprakasit V, Ekwattanakit S. Clinical classification, screening and diagnosis for thalassemia. Hematol Oncol Clin North Am. 2018;32:193-211. https://doi.org/10.1016/j.hoc.2017.11.006 PMid:29458726
  4. Petrakos G, Andriopoulos P, Tsironi M. Pregnancy in women with thalassemia: challenges and solutions. Int J Womens Health. 2016;8:441-51. https://doi.org/10.2147/IJWH.S89308 PMid:27660493 PMCid:PMC5019437
  5. Shang X, Xu X. Update in the genetics of thalassemia: what clinicians need to know. Best Pract Res Clin Obstet Gynaecol. 2017;39:3-15. https://doi.org/10.1016/j.bpobgyn.2016.10.012 PMid:27876354
  6. Farashi S, Harteveld CL. Molecular basis of α-thalassemia. Blood Cells Mol Dis. 2018;70:43-53. https://doi.org/10.1016/j.bcmd.2017.09.004 PMid:29032940
  7. Shah FT, Sayani F, Trompeter S, Drasar E, Piga A. Challenges of blood transfusions in β-thalassemia. Blood Rev. 2019;37:100588.  https://doi.org/10.1016/j.blre.2019.100588 PMid:31324412
  8. Bain BJ. Delta beta thalassemia. Am J Hematol. 2020;95:116. https://doi.org/10.1002/ajh.25623 PMid:31444804  
  9. Borgna-Pignatti C, Gamberini MR. Complications of thalassemia major and their treatment. Expert Rev Hematol. 2011;4:353-66. https://doi.org/10.1586/ehm.11.29 PMid:21668399
  10. Thalassaemia International Federation. [Internet]. Response to the proposal for the inclusion of whole blood and red blood cells on The WHO Essential Medicines Lists (EML). 2013. https://issuu.com/internationalthalassaemiafederation/docs/tif_magazine_issue_62 (accessed 15 December 2022)
  11. Fucharoen S, Winichagoon P. Haemoglobinopathies in southeast Asia. Indian J Med Res. 2011;134:498-506.  
  12. Nguyen KHH. Thalassemia and a model of prevention in Vietnam. Sydney: The University of Sydney; 2005.
  13. Dossanova A, Lozovoy V, Wood D, Lozovaya Y. Reducing the risk of postoperative genital complications in male adolescents. Int J Environ Sci Educ. 2016;11:5797-807.  
  14. Thalassaemia International Federation. Guidelines for the Management of Transfusion Dependent Thalassaemia; 2021. https://thalassaemia.org.cy/publications/tif-publications/guidelines-for-the-management-of-transfusion-dependent-thalassaemia-4th-edition-2021/ (accessed 15 December 2022)  
  15. Dossanov B, Trofimchuk V, Lozovoy V, Khmyzov S, Dossanova A, Zhukenov O, Tuktiyeva N, Angelov A. Evaluating the results of long tubular bone distraction with an advanced rod monolateral external fixator for achondroplasia. Sci Rep. 2021;11:14727.  https://doi.org/10.1038/s41598-021-94146-z PMid:34282216 PMCid:PMC8290032
  16. Kawamoto K, Houlihan CA, Balas EA, Lobach DF. Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success. BMJ. 2005;330:765. https://doi.org/10.1136/bmj.38398.500764.8F PMid:15767266 PMCid:PMC555881  
  17. Sutton RT, Pincock D, Baumgart DC, Sadowski DC, Fedorak RN, Kroeker KI. An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ Digit Med. 2020;3:1-10. https://doi.org/10.1038/s41746-020-0221-y PMid:32047862 PMCid:PMC7005290  
  18. Muhiyaddin R, Abd-Alrazaq AA, Househ M, Alam T, Shah Z. The impact of clinical decision support systems (CDSS) on physicians: a scoping review. Stud Health Technol Inform. 2020;470-3.  
  19. Amendolia SR, Brunetti A, Carta P, Cossu G, Ganadu ML, Golosio B, Mura GM, Pirastru MG. A real-time classification system of thalassemic pathologies based on artificial neural networks. Med Decis Making. 2002;22:18-26. https://doi.org/10.1177/0272989X0202200102 PMid:11833662
  20. Masala GL, Golosio B, Cutzu R, Pola R. A two-layered classifier based on the radial basis function for the screening of thalassaemia. Comput Biol Med. 2013;43:1724-31. https://doi.org/10.1016/j.compbiomed.2013.08.020 PMid:24209918
  21. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Int Res. 2020;16:321-57. https://doi.org/10.1613/jair.953 
  22. Dang HTT. Screening for Thalassemia in Pregnant Women Who Come for Examination and Treatment at the National Hospital of Obstetrics and Gynecology. Vietnam, National Hospital of Obstetrics and Gynecology. 2019.
  23. Nguyen HHK. Research for Alpha and Beta Thalassemia Screening and Diagnosis. Vietnam, Ho Chi Minh City University of Medicine and Pharmacy. 2013.
  24. Sheng H, Qiang Z, Dongming L, Chen S, Tang Y, Chen Q, Zheng C. Prevention and control of Hb Barts disease in Guangxi Zhuang Autonomous Region, China. Eur J Obstet Gynecol. 2020:178:138-41. https://doi.org/10.1016/j.ejogrb.2014.03.034 PMid:24792538
  25. Husna N, Handayani NSN. Molecular and haematological characteristics of alpha-thalassemia deletions in Yogyakarta special region, Indonesia. Rep Biochem Mol Biol. 2021;10:346-53. https://doi.org/10.52547/rbmb.10.3.346 PMid:34981010 PMCid:PMC8718782
  26. Sarafidis M, Manta O, Kouris I, Schlee W, Kikidis D, Vellidou E, Koutsouris D. Why a clinical decision support system is needed for tinnitus? Annu Int Conf IEEE Eng Med Biol Soc. 2021;2021:2075-8. https://doi.org/10.1109/EMBC46164.2021.9630137 PMid:34891697
  27. Klar R, Zaiß A. Medical expert systems: design and applications in pulmonary medicine. Lung. 1990;168:1201-9. https://doi.org/10.1007/BF02718262 PMid:2117122
  28. Rowe M. An introduction to machine learning for clinicians. Acad Med. 2019;94:1433-6. https://doi.org/10.1097/ACM.0000000000002792 PMid:31094727
  29. Shaikh F, Dehmeshki J, Bisdas S, Roettger-Dupont D, Kubassova O, Aziz M, Awan O. Artificial intelligence-based clinical decision support systems using advanced medical imaging and radiomics. Curr Probl Diagn Radiol. 2021;50:262-7. https://doi.org/10.1067/j.cpradiol.2020.05.006 PMid:32591104
  30. AlAgha AS, Faris H, Hammo BH, Ala'M AZ. Identifying β-thalassemia carriers using a data mining approach: the case of the Gaza Strip, Palestine. Artif Intell Med. 2018;88:70-83. https://doi.org/10.1016/j.artmed.2018.04.009 PMid:29730048
  31. Egejuru NC, Olusanya SO, Asinobi AO, Adeyemi OJ, Adebayo VO, Idowu PA. Using data mining algorithms for thalassemia risk prediction. Int J Biomed Sci Eng. 2019;7:33. https://doi.org/10.11648/j.ijbse.20190702.12
  32. Liu X, Faes L, Kale AU, Wagner SW, Fu DJ, Bruynseels A, Mahendiran T, Moraes G, Shamdas M, Kern C, Ledsam JR, Schmid MK, Balaskas K, Topol EJ, Bachmann LM, Keane PA, Denniston AK. A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis. The Lancet Digital Health.  2019;1:e271-97. https://doi.org/10.1016/S2589-7500(19)30123-2 PMid:33323251
  33. Roshanov PS, Fernandes N, Wilczynski JM, Hemens BJ, You JJ, Handler SM, Nieuwlaat R, Souza NM, Beyene J, Van Spall HGC, Garg AX. Features of effective computerised clinical decision support systems: metaregression of 162 randomised trials. BMJ. 2013;346:f657. https://doi.org/10.1136/bmj.f657 PMid:23412440

[TOP]