
Add compile_fn parameter for Trainer #20269

Open
mieshkiwrk wants to merge 6 commits into master

Conversation

@mieshkiwrk commented Sep 10, 2024

Add support for a compile_fn parameter for Trainer, for example to compile the model after the strategy has been applied.

Example use case: compiling after the DDP strategy is applied is needed so that DDP's pre/post-forward hooks are compiled as well.
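
A minimal sketch of the intended usage (compile_fn is this PR's proposal, not an existing Trainer argument; MyLightningModule stands in for any LightningModule subclass):

```python
import torch
import lightning as L

model = MyLightningModule()  # placeholder for your LightningModule subclass

# compile_fn is this PR's proposal: the Trainer applies it to the module
# only after the strategy (here DDP) has wrapped it, so DDP's
# pre/post-forward hooks end up inside the compiled graph as well.
trainer = L.Trainer(strategy="ddp", devices=2, compile_fn=torch.compile)
trainer.fit(model)
```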

Fixes #20242


📚 Documentation preview 📚: https://pytorch-lightning--20269.org.readthedocs.build/en/20269/

github-actions bot added the pl label (Generic label for PyTorch Lightning package) on Sep 10, 2024
@mieshkiwrk (Author)

Both benchmark checks failed due to a timeout.

@mieshkiwrk (Author)

bump

codecov bot commented Sep 30, 2024

Codecov Report

Attention: Patch coverage is 75.00000% with 1 line in your changes missing coverage. Please review.

Project coverage is 81%. Comparing base (5be58f6) to head (4648ea2).
Report is 1 commit behind head on master.

❗ The number of reports uploaded differs between BASE (5be58f6) and HEAD (4648ea2).

HEAD has 553 uploads less than BASE
| Flag | BASE (5be58f6) | HEAD (4648ea2) |
| --- | --- | --- |
| cpu | 147 | 21 |
| lightning | 106 | 16 |
| pytest | 87 | 2 |
| python3.9 | 43 | 6 |
| python3.10 | 42 | 6 |
| lightning_fabric | 25 | 0 |
| gpu | 4 | 2 |
| python3.11 | 42 | 6 |
| python3.12 | 20 | 3 |
| pytorch2.1 | 38 | 12 |
| pytest-full | 64 | 21 |
| pytorch2.3 | 9 | 3 |
| pytorch_lightning | 20 | 7 |
| pytorch2.2 | 9 | 3 |
| pytorch2.4 | 8 | 3 |
Additional details and impacted files
```
@@            Coverage Diff            @@
##           master   #20269     +/-   ##
=========================================
- Coverage      89%      81%     -8%
=========================================
  Files         267      264      -3
  Lines       23084    23032     -52
=========================================
- Hits        20585    18620   -1965
- Misses       2499     4412   +1913
```

@lantiga (Collaborator) commented Nov 12, 2024

Thank you @mieshkiwrk.

The way we recommend using torch.compile with Lightning is to call torch.compile on the model and then pass it to the Trainer:

```python
import torch
import lightning as L

model = MyLightningModule()
model = torch.compile(model)

trainer = L.Trainer()
trainer.fit(model)
```

This PR would add an additional entrypoint and there's probably a simpler way to go about it (for users).

We should replicate what Fabric does here: https://github.com/Lightning-AI/pytorch-lightning/blob/master/src/lightning/fabric/wrappers.py#L421

where we capture the arguments passed to torch.compile so that we can re-apply it after a strategy wraps the module, just like we do in Fabric, but in the Trainer:

#19280
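
Roughly, the capture-and-reapply idea looks like the following sketch (the helper names mirror Fabric's private _unwrap_compiled / _to_compiled, but the _compile_kwargs attribute and the exact signatures here are illustrative, not the verbatim Fabric internals):

```python
import torch
from torch._dynamo import OptimizedModule

def unwrap_compiled(module):
    # If the user passed a torch.compile-d model, recover the original
    # module and the kwargs it was compiled with (assumed to be stashed
    # on the OptimizedModule; the attribute name is illustrative).
    if isinstance(module, OptimizedModule):
        return module._orig_mod, getattr(module, "_compile_kwargs", {})
    return module, None

def reapply_compile(strategy_wrapped, compile_kwargs):
    # Re-apply torch.compile on the strategy-wrapped module (e.g. the DDP
    # wrapper) so the wrapper's pre/post-forward logic is compiled too.
    if compile_kwargs is None:
        return strategy_wrapped
    return torch.compile(strategy_wrapped, **compile_kwargs)
```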

Would you like to take a stab at it?

@lantiga added the waiting on author label (Waiting on user action, correction, or update) on Nov 12, 2024
@mieshkiwrk (Author)

Let me try; it looks like I can see what needs to be done.

@mieshkiwrk (Author)

@lantiga, would something like this be fine? I wanted to make sure about re-using _unwrap_compiled and _to_compiled from Fabric, or should these be copied into the lightning.pytorch wrappers?
If this approach is okay, I'll add comments, verify the sanity checks, and add some unit tests.

@lantiga (Collaborator) commented Nov 27, 2024

Hey, thanks for updating the PR.
The approach looks good! I don't think we need reapply_compile; we will want this whenever we get an optimized model as input (the call is largely free until we run the first forward).
I'd reuse Fabric's wrappers, unless we need to tweak them, in which case I'd duplicate them.
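
To illustrate the "largely free" point: torch.compile itself returns quickly because tracing and code generation are deferred until the first forward pass. A minimal sketch:

```python
import torch
import torch.nn as nn

model = torch.compile(nn.Linear(4, 4))  # cheap: only wraps the module
x = torch.randn(2, 4)
y = model(x)  # tracing and code generation happen here, on the first forward
```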

Labels: pl, torch.compile, waiting on author
Linked issue: Add something like use_compile parameter for Trainer (#20242)