Disable donated_buffer for all ops' backward benchmarking #104
Conversation
tritonbench/utils/triton_op.py (Outdated)

```diff
@@ -39,6 +39,8 @@
     tqdm = None

 logger = logging.getLogger(__name__)
+# TODO: remove this once we have a better way to handle backward benchmarking
+torch._functorch.config.donated_buffer = False
```
This seems like overkill to me. Can we disable it only in `backward` and `forward_backward`?
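One way to scope the flag as suggested above is a small context manager that flips it only around the backward benchmark and restores it afterwards. This is a hypothetical sketch, not the change that landed in the PR; it assumes `torch._functorch.config` exposes a mutable `donated_buffer` attribute, and uses a stand-in config object so the snippet runs without torch.

```python
import contextlib


@contextlib.contextmanager
def donated_buffer_disabled(config):
    """Temporarily set config.donated_buffer = False, restoring the old value on exit.

    `config` is any object with a boolean `donated_buffer` attribute
    (e.g. torch._functorch.config); a dummy object is used below for illustration.
    """
    old = config.donated_buffer
    config.donated_buffer = False
    try:
        yield
    finally:
        config.donated_buffer = old


# Illustration with a stand-in config object (hypothetical, not torch):
class _Cfg:
    donated_buffer = True


cfg = _Cfg()
with donated_buffer_disabled(cfg):
    assert cfg.donated_buffer is False  # disabled only inside the block
assert cfg.donated_buffer is True       # restored after the block exits
```

Wrapping only the `backward` and `forward_backward` paths in such a block would leave the forward-only and attention benchmarks unaffected.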
Fixed in 20b138d.
@FindHao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Force-pushed from 5217014 to 5aa17b6
It looks like the CI errors are real, and the config is not compatible with flash_attention. I'm going to set this config only for
…_entropy, geglu, swiglu, and layernorm
@FindHao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Summary: The previous PR #104 causes the following issue:

```
% python run.py --op geglu --mode fwd --precision fp32 --metrics latency,speedup --csv --cudagraph
  0%|          | 0/4 [00:03<?, ?it/s]
Caught exception, terminating early with partial results
Traceback (most recent call last):
  File "/scratch/yhao/pta/tritonbench/tritonbench/utils/triton_op.py", line 782, in run
    y_vals: Dict[str, BenchmarkOperatorMetrics] = functools.reduce(
  File "/scratch/yhao/pta/tritonbench/tritonbench/utils/triton_op.py", line 770, in _reduce_benchmarks
    acc[bm_name] = self._do_bench(
  File "/scratch/yhao/pta/tritonbench/tritonbench/utils/triton_op.py", line 981, in _do_bench
    fn = self._get_bm_func(fn_name)
  File "/scratch/yhao/pta/tritonbench/tritonbench/utils/triton_op.py", line 667, in _get_bm_func
    fwd_fn = fwd_fn_lambda(*self.example_inputs)
  File "/scratch/yhao/pta/tritonbench/tritonbench/utils/triton_op.py", line 481, in _inner
    return function(self, *args, **kwargs)
  File "/scratch/yhao/pta/tritonbench/tritonbench/operators/geglu/operator.py", line 69, in inductor_geglu
    compiled = torch.compile(self.baseline_model)
UnboundLocalError: local variable 'torch' referenced before assignment (B, T, H)
```

We should use `from torch._functorch import config` rather than `import torch._functorch.config`.

Pull Request resolved: #113
Reviewed By: adamomainz
Differential Revision: D67110110
Pulled By: FindHao
fbshipit-source-id: e5143b06d0e62fb2a7b83464e23126e73a52ee10
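The `UnboundLocalError` above is the classic Python scoping pitfall: `import torch._functorch.config` anywhere inside a function binds the name `torch` as a local for the *entire* function body, so any use of `torch` earlier in that function reads an unassigned local. `from torch._functorch import config` binds only `config`, leaving the module-level `torch` untouched. A minimal, torch-free reproduction using `os` as a stand-in:

```python
import os  # module-level import, analogous to `import torch`


def broken():
    # Fails: the `import os.path` below makes `os` a local name for the
    # whole function body, so this read happens before the local is bound.
    cwd = os.getcwd()
    import os.path  # binds the local name `os`
    return os.path.join(cwd, "x")


def fixed():
    cwd = os.getcwd()  # fine: `os` still refers to the module-level import
    from os import path  # binds only `path`, leaving `os` untouched
    return path.join(cwd, "x")


try:
    broken()
except UnboundLocalError as e:
    print("broken():", e)

print("fixed():", fixed())
```

This is exactly why swapping the import form, rather than moving code around, resolves the error in `inductor_geglu`.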
This is still a temporary fix for backward benchmarking. Related discussion: #40