diff --git a/notebooks/cnn_data_sampler.ipynb b/notebooks/cnn_data_sampler.ipynb
index 5d1b4f71..9f296c4a 100644
--- a/notebooks/cnn_data_sampler.ipynb
+++ b/notebooks/cnn_data_sampler.ipynb
@@ -24,7 +24,7 @@
 " - Thus there shouldn't be too much misclassifications in the data that would cause problem\n",
 "- User has the option to set buffer size for the known occurences, because one pixel locations are harder to locate for model than .\n",
 "### TODO\n",
- "- Current implementation creates images in order, known occurences first and non-occurences last. This makes suffling necessary at the model script, which complicates things. Suffling should be moved to this script\n",
+ "- Current implementation creates images in order, known occurences first and non-occurences last. This makes shuffling necessary at the model script, which complicates things. Shuffling should be moved to this script\n",
 "- This sampling script should be integrated to the cnn scripts so that leave one out cross validation would be easy to do\n",
 " - The leave one out method should be implented in a way that one feature is completely removed from the occurence data before any sampling is done, and then test data is creaed by sampling the left out feature alone\n",
 " - without this it is hard to assess level of overfitting\n",