Fixed Overfitting with Dropout Rate, Weight Decay, and on the fly data augmentation.#18
Merged
Fixed Overfitting with Dropout Rate, Weight Decay, and on the fly data augmentation.#18
Conversation
…he fly data augmentation. Added seed and require gpu flags as well.
cwsmith
requested changes
Nov 11, 2025
Contributor
cwsmith
left a comment
There was a problem hiding this comment.
There is one small comment below. In addition to that, we discussed offline about changing the default checkpoint rate to every 100 epochs. Each checkpoint was about 100MB which could quickly fill up disk space for someone who doesn't expect it.
Contributor
|
@sridhs21 would you also please change the default checkpoint interval to 100 epochs? |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Add Regularization and Data Augmentation
This PR adds regularization techniques and on-the-fly data augmentation to improve model training.
Changes
New Command-Line Arguments
--dropoutRate(default: 0.3) - Control dropout strength--weightDecay(default: 5e-4) - Control L2 regularization--seed(default: None) - Set random seed for reproducibility--require-gpu- Exit if GPU not availableDropout Improvements
UNetnow acceptsdropout_rateparameterdec1_dropout,dec2_dropout,dec3_dropout,dec4_dropout)bottleneck_dropout)Weight Decay
weight_decay=1e-5to configurableweight_decay=args.weightDecayOn-the-Fly Data Augmentation
augmentparameter toXPointPatchDatasetseedparameter toXPointPatchDatasetfor reproducible augmentation_apply_augmentation()method with:augment=True, validation usesaugment=FalseXPointDataset(..., rotateAndReflect=False)Reproducibility
set_seed()function sets seeds for Python, NumPy, and PyTorchGPU Checking
--require-gpuflag exits with error if CUDA unavailable