Commit
added num_of_concepts and num_of_visits to the saved pretraining dataset so we can filter examples by the num of tokens
ChaoPang committed Oct 11, 2024
1 parent 8c40a27 commit 3375240
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions src/cehrbert/data_generators/hf_data_generator/hf_dataset.py
@@ -21,6 +21,8 @@
     "concept_values",
     "concept_value_masks",
     "mlm_skip_values",
+    "num_of_concepts",
+    "num_of_visits",
 ]

 TRANSFORMER_COLUMNS = ["input_ids", "labels"]
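
The commit message says the two new columns are saved so that examples can be filtered by token count. The following is a minimal sketch of that kind of filter in plain Python, assuming each saved record is a dict that carries `num_of_concepts` and `num_of_visits`. The function name `filter_by_length`, the thresholds, and the sample records are hypothetical illustrations and are not taken from the repository.

```python
from typing import Any, Dict, Iterable, List


def filter_by_length(
    records: Iterable[Dict[str, Any]],
    max_concepts: int,
    min_visits: int = 1,
) -> List[Dict[str, Any]]:
    """Keep records whose precomputed counts fall within the given bounds.

    Hypothetical helper: it reads the stored counts instead of
    recomputing len(record["input_ids"]) for every example.
    """
    return [
        r
        for r in records
        if r["num_of_concepts"] <= max_concepts and r["num_of_visits"] >= min_visits
    ]


# Hypothetical sample records; real rows would also hold the other saved columns.
examples = [
    {"input_ids": [1, 2, 3], "num_of_concepts": 3, "num_of_visits": 1},
    {"input_ids": list(range(600)), "num_of_concepts": 600, "num_of_visits": 12},
    {"input_ids": [4, 5], "num_of_concepts": 2, "num_of_visits": 2},
]

# Drop any example longer than an assumed 512-token context window.
short_examples = filter_by_length(examples, max_concepts=512)
print(len(short_examples))
```

With the Hugging Face `datasets` library, the same predicate could presumably be passed to `Dataset.filter`, for example `dataset.filter(lambda r: r["num_of_concepts"] <= 512)`. Whether the project filters this way is not shown in this commit.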