Warning: mkdir(): Permission denied in /home/virtual/lib/view_data.php on line 93 Warning: chmod() expects exactly 2 parameters, 3 given in /home/virtual/lib/view_data.php on line 94 Warning: fopen(/home/virtual/pfmjournal/journal/upload/ip_log/ip_log_2024-10.txt): failed to open stream: No such file or directory in /home/virtual/lib/view_data.php on line 100 Warning: fwrite() expects parameter 1 to be resource, boolean given in /home/virtual/lib/view_data.php on line 101
Precis Future Med > Volume 8(3); 2024 > Article |
|
Won Chul Cha has been editorial board of Precision and Future Medicine since January 2023. He was not involved in the review process of this original article. Jinsung Yoon has no conflict of interest with Google.
AUTHOR CONTRIBUTIONS
Conception or design: JA, JYY, KHJ, JY, WCC.
Acquisition, analysis, or interpretation of data: JA, JYY, KHJ, JY, WCC.
Drafting the work or revising: JA, JYY, KHJ, JY, WCC.
Final approval of the manuscript: JA, JYY, KHJ, JY, WCC.
Values are presented as mean±standard deviation or number (%).
TabDDPM, tabular denoising diffusion probabilistic model; KS, Komolgorov-Smirnov; HR, heart rate; DBP, diastolic blood pressure; RR, respiratory rate; SBP, systolic blood pressure; SpO2, oxygen saturation; count ER, count of emergency room visit; count ICU, count of intensive care visit; count surgery, count of surgery visit; MI, myocardial infarction; CHF, congestive heart failure; PVD, peripheral vascular disease; STR, stroke; DEM, dementia; CPD, chronic pulmonary disease; RD, rheumatoid disease; PUD, peptic ulcer disease; DM WOC, diabetes without chronic complication; DM C, diabetes with complication; HEMI, hemiplegia/paraplegia; RENAL, kidney disease; TU LE, local tumor/leukemia/lymphoma; MST, metastatic solid tumor; MLD, mild liver disease; SLD, severe liver disease.
Average WD (Cont.) | Average JSD (Cat.) | Diff. Pair-Wise distance | |
---|---|---|---|
TabDDPM | 0.001472 | 0.011608 | 0.803866 |
CTABGAN+ | 0.002614 | 0.015443 | 0.452392 |
CTGAN | 0.025507 | 0.08734 | 1.880659 |
In WD and JSD metrics TabDDPM outperforms the rest while CTABGAN+ outperforms the rest in pair-wise correlation distance.
WD, Wasserstein distance; JSD, Jensen-Shannon divergence; Cont., continuous; Cat., categorical; Diff., difference; TabDDPM, tabular denoising diffusion probabilistic model; CTABGAN+, contidional tabular generative adversarial network +; CTGAN, contidional tabular generative adversarial network.
Values are presented as mean±standard deviation. The table shows the AUC of logistic regression, random forest, and XGBoost as well as the standard deviation of running the test five-fold. TabDDPM had a better utility performance in most cases except from logistic regression which may have been more sensitive to subtle changes in data. On the right side of the table time comparison shows that TabDDPM is on average 36 times faster than CTABGAN+. CTGAN results were excluded since we could not train the model with graphics processing units (GPU) and results would have been biased.
AUC, area under the curve; XGBoost, Extreme Gradient Boosting; NA, not applicable; TabDDPM, tabular denoising diffusion probabilistic model; CTABGAN+, contidional tabular generative adversarial network +; CTGAN, contidional tabular generative adversarial network.
Values are presented as mean±standard deviation. Excluding CTGAN, TabDDPM has the best performance. The higher Real vs. Synthetic the higher privacy value. Also MIA values are also included, and all models pass the test by not being vulnerable to the attack since their values are below the 0.5 threshold.
DCR, distance to closest record; NNDR, nearest neighbor distance ratio; MIA, membership inference attack; TabDDPM, tabular denoising diffusion probabilistic model; CTABGAN+, contidional tabular generative adversarial network +; CTGAN, contidional tabular generative adversarial network.