
Conversation

@gagika (Collaborator) commented Jul 26, 2025

Description

Fix the Llama4 attention FLOPs calculation so that chunked-attention layers only count FLOPs over the chunk attention window, rather than over the full sequence length.

Tests

Comparing total_tflops, learnable_weight_tflops, and attention_tflops before and after the fix:

https://diff.googleplex.com/#key=wo0bVmy9wbUc

  • learnable_weight_tflops match before and after the fix.
  • attention_tflops match for context lengths <= 8192.
  • For context length 2 * 8192, the old attention FLOPs increased 4x relative to 8192; with the fix they increase 10/4x, because the 3 chunked-attention layers each scale 2x while the 1 global-attention layer scales 4x, giving (3 * 2 + 1 * 4) / 4 = 10/4. Concretely, 1187 * 10/4 ≈ 2968 (see the sketch after this list).
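
As a sanity check on the 10/4 factor, here is a small standalone sketch. The layer pattern (3 chunked-attention layers per 1 global-attention layer) and the 8192 chunk size are taken from the numbers above; the helper names and the other shape values are illustrative assumptions, not MaxText code.

def global_attention_flops(batch, seq_len, num_heads, head_dim):
  # Full non-causal attention: QK^T and attention-times-V each cost
  # 2 * batch * seq_len**2 * num_heads * head_dim FLOPs.
  return 4 * batch * seq_len**2 * num_heads * head_dim

def chunked_attention_flops(batch, seq_len, chunk_size, num_heads, head_dim):
  # Each chunk attends only within itself, so the quadratic term is
  # per-chunk (num_chunks * chunk_size**2) instead of seq_len**2.
  # Assumes seq_len is a multiple of chunk_size for simplicity.
  num_chunks = seq_len // chunk_size
  return 4 * batch * num_chunks * chunk_size**2 * num_heads * head_dim

batch, heads, head_dim, chunk = 1, 8, 128, 8192

def attention_flops(seq_len):
  # Assumed pattern: 3 chunked-attention layers for every 1 global-attention layer.
  chunked = chunked_attention_flops(batch, seq_len, chunk, heads, head_dim)
  global_ = global_attention_flops(batch, seq_len, heads, head_dim)
  return 3 * chunked + 1 * global_

print(attention_flops(2 * 8192) / attention_flops(8192))  # -> 2.5, i.e. 10/4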

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

@RissyRan (Collaborator) left a comment:

LGTM

# FLOPs for a single global attention layer (full attention, non-causal)
global_attention_flops_per_layer = 4 * config.per_device_batch_size * seq_len**2 * config.num_query_heads * config.head_dim

# FLOPs for a single chunked attention layer (non-causal)
Review comment (Collaborator):

Can we separate out chunked attention flops into its own method?

@gagika (Author) replied:
done
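
For readers following the thread, a minimal sketch of what a separate chunked-attention FLOPs helper could look like, mirroring the global-attention formula quoted above. The function name, signature, and the handling of a partial final chunk are assumptions for illustration, not the code actually merged in this PR.

def get_chunked_attention_flops_per_layer(config, seq_len, chunk_size):
  # FLOPs for a single chunked attention layer (non-causal): the quadratic
  # cost applies per window of chunk_size tokens rather than over the
  # full sequence, with a smaller quadratic term for any partial final chunk.
  full_chunks, remainder = divmod(seq_len, chunk_size)
  window_sq = full_chunks * chunk_size**2 + remainder**2
  return 4 * config.per_device_batch_size * window_sq * config.num_query_heads * config.head_dim

When seq_len <= chunk_size this reduces to the global-attention formula, which matches the observation above that attention_tflops agree for context lengths <= 8192.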

@gobbleturk (Collaborator) left a comment:

Thanks Gagik!
