-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Training] Better image interpolation in training scripts #11206
base: main
Are you sure you want to change the base?
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
if args.image_interpolation_mode == "bilinear": | ||
train_resize = transforms.Resize(size, interpolation=transforms.InterpolationMode.BILINEAR) | ||
else: | ||
train_resize = transforms.Resize(size, interpolation=transforms.InterpolationMode.LANCZOS) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if args.image_interpolation_mode == "bilinear": | |
train_resize = transforms.Resize(size, interpolation=transforms.InterpolationMode.BILINEAR) | |
else: | |
train_resize = transforms.Resize(size, interpolation=transforms.InterpolationMode.LANCZOS) | |
interpolation = getattr(transforms.InterpolationMode, args.image_interpolation_mode.upper(), None) | |
if interpolation is None: | |
raise ValueError(f"Unsupported interpolation mode.") | |
train_resize = transforms.Resize(size, interpolation=interpolation) |
Could we do something like this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah, not sure about the other interpolation modes but I guess if the user uses another one it's because they know what they're doing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah since we have the default as lanczos I also think it's ok to assume the user knows what their doing if they use a mode other than lanczos/bilinear
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just cleaner code and preemptively solves the inevitable issue report: "Setting image_interpolation_mode
other than bilinear
uses lanczos
"
thank you |
What does this PR do?
As discussed here, this PR will add LANCZOS as a default interpolation mode for the image resizing in the training scripts and if the users prefers can choose BILINEAR.
I'll add this to the most popular recent ones and leave the rest to the community if they want to add them to other training scripts.
I'll do some training runs first to test if I can see the difference, but still, I already know that LANCZOS is better and that the models can pick subtle details that the human eye can't.
Fixes #6397
Who can review?
@bghira @linoytsaban @sayakpaul
Additional
Here's a little script if you want to test and try to see the difference: