TCGA Dataset Training and Testing Distributions #88

bryanwong17 · 2024-01-22T01:42:26Z

Hi, could you please share with me the distribution of slides used for training and testing in the TCGA dataset, along with their respective labels?

I noticed that it's mentioned here "We randomly split the WSIs into 840 training slides and 210 testing slides (4 low-quality corrupted slides are discarded)". However, upon examining the TEST_ID.csv file from this link, I observed that there are 214 testing slides. Could you provide clarification which slides were discarded? And also which slides are used for training? Thank you!

The text was updated successfully, but these errors were encountered:

GeorgeBatch · 2024-02-15T10:13:12Z

@bryanwong17, I went through this. See the results of my investigation in my README file for downloading TCGA.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TCGA Dataset Training and Testing Distributions #88

TCGA Dataset Training and Testing Distributions #88

bryanwong17 commented Jan 22, 2024 •

edited

Loading

GeorgeBatch commented Feb 15, 2024

TCGA Dataset Training and Testing Distributions #88

TCGA Dataset Training and Testing Distributions #88

Comments

bryanwong17 commented Jan 22, 2024 • edited Loading

GeorgeBatch commented Feb 15, 2024

bryanwong17 commented Jan 22, 2024 •

edited

Loading