CPU Memory exhausted when loading data with random_grid_cell_assignment() #1578
Unanswered
trettelbach
asked this question in
Q&A
Replies: 1 comment 5 replies
-
A couple comments:
|
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I'm currently loading data with the
random_grid_cell_assignment(self.dataset_full, [0.7, 0.15, 0.15], grid_size=6)
. I hadnum_workers=4
.The dataset I was loading consisted of 3 files, each 15-16 GB in size; ca. 47 GB total.
When I ran the code, and monitored my resources with
top
, I saw that each individual process had loaded 47+ GB into its memory; cumulatively crashing my workstation as it ran out of CPU memory.Is there a way that not all datasets are loaded entirely with all processes?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions