Skip to main content
Download PDF
- Main
Trees vs Neurons: Comparison between Denoising Autoencoders and Random Forest for Imputation of Mixed Data from Electronic Medical Records
- Peng, Qin
- Advisor(s): Wu, Ying Nian
Abstract
Missing data is a significant challenge impacting almost all studies; however, this is especially true for analyses of electronic health record (EHR). We propose a multiple imputation model based on multi-layer denoising autoencoders. This nonparametric model can deal with mixed-typed data types, and not making assumptions of missing mechanism. Evaluation on simulated datasets based on real life EHR datasets showed that our proposed model outperforms current Random Forest method and median/mode Imputation.
Main Content
For improved accessibility of PDF content, download the file to your device.
Enter the password to open this PDF file:
File name:
-
File size:
-
Title:
-
Author:
-
Subject:
-
Keywords:
-
Creation Date:
-
Modification Date:
-
Creator:
-
PDF Producer:
-
PDF Version:
-
Page Count:
-
Page Size:
-
Fast Web View:
-
Preparing document for printing…
0%