-
Notifications
You must be signed in to change notification settings - Fork 11
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Implemented TruncatedOnlineStatistics to collect truncated stats in a…
… distributed way (#57) * added patient_id to the pretraining data for debugging * added RunningStatistics to remove the extreme outliers when calculating the running mean and std * added the lab value lower/upper bounds to the tokenizer so we can bound the extreme values during tokenization * we only add a new value to the running stat if the value is between the lower and upper bounds * renamed RunningStatistics to TruncatedOnlineStatistics, and ExcludingOnlineStatistics to TruncatedOfflineStatistics, added unittests for these new stats utilities * added a tqdm progress bar for aggregating the lab statistics * handle the cases in TruncatedOnlineStatistics, where the filtered_data contains zero elements * added lower_bound and upper_bound to the lab stats to cehrbert tokenizer * fixed the masked mse loss for cehrbert when value_prediction is enabled * updated the cehr-bert architecture for predicting concepts with values * fixed a bug where the normalized value should be bounded by a multiple of the standard deviation because the normalized value is assumed to follow the standard normal
- Loading branch information
Showing
10 changed files
with
519 additions
and
16 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.