Skip to content
This repository has been archived by the owner on Nov 16, 2023. It is now read-only.

add unit test to check that each input slice has uniform distribution - facilitates uniform sampling of training data for better model accuracy #319

Open
maxkazmsft opened this issue May 26, 2020 · 1 comment
Labels
Prior: High Type: Accuracy related to increasing performance accuracy

Comments

@maxkazmsft
Copy link
Contributor

Related to #259 - this is essentially a test for it

can test that for checkerboard dataset train and val patches have a uniform 50/50 class distribution across all patches

@maxkazmsft maxkazmsft added this to the V0.1.3 [BYOD] milestone May 26, 2020
@maxkazmsft maxkazmsft added Prior: High Type: Correctness anything to do with repo being technically correct labels May 26, 2020
@maxkazmsft maxkazmsft added Type: Accuracy related to increasing performance accuracy and removed Type: Correctness anything to do with repo being technically correct labels May 28, 2020
@maxkazmsft
Copy link
Contributor Author

We should re-scope this issue - there should be a tolerance for sampling synthetic checkerboard training data where with a large enough number of samples the distribution of two classes is roughly 50/50, but it's not clear just how 50/50 the distribution would be

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Prior: High Type: Accuracy related to increasing performance accuracy
Projects
None yet
Development

No branches or pull requests

1 participant