-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix: a calculation error in the example of dividing out in the 02_layout_algebra doc
#2635
opened Sep 13, 2025 by
Asuka0630
Loading…
is_unsigned_v is only available if the code is compiled with C++17 or later:
#2621
opened Sep 5, 2025 by
FurkanGozukara
Loading…
fix: missing argparse argument static_shape for cutedsl ampere sgemm example
#2568
opened Aug 14, 2025 by
a-r-r-o-w
Loading…
Fix typo in tv layout stride in elementwise_add
inactive-30d
#2564
opened Aug 12, 2025 by
henrylhtsang
Loading…
Add missing CUDA_ARCH guard for
__nanosleep
in example
inactive-30d
#2558
opened Aug 11, 2025 by
Flamefire
Loading…
Liberate runtime check in example 13
inactive-30d
#2554
opened Aug 7, 2025 by
FabianSchuetze
Loading…
Fix typo in cute.nvgpu.warpgroup.mma doc
inactive-30d
#2548
opened Aug 7, 2025 by
lucifer1004
Loading…
fix: too early producer commit in examples/77_blackwell_fmha/kernel/sm100_fmha_mla_tma_warpspecialized.hpp
inactive-30d
#2502
opened Jul 24, 2025 by
soundOfDestiny
Loading…
Mixed Precision Grouped Gemm with zero points and GPT-Q semantics closes #2261
inactive-30d
#2457
opened Jul 11, 2025 by
ankutalev
Loading…
Add SM80/89 blockwise scaling kernel, support FP8 block/groupwise on Ada, INT8 on Ampere
#2328
opened May 24, 2025 by
solrex
Loading…
Fix CMake build error and -Wdeprecated-literal-operator warnings
#2305
opened May 15, 2025 by
BwL1289
Loading…
Support N={48, 80, 96, 112, ...} for SM100 EpilogueTileAuto
inactive-30d
inactive-90d
#2269
opened Apr 29, 2025 by
Algy
Loading…
Limit the number of SMs (sm_count) to user-provided value during profiling.
inactive-30d
inactive-90d
#2257
opened Apr 22, 2025 by
manishucsd
Loading…
fix: the bug of example_simt_canonical
inactive-30d
inactive-90d
#2160
opened Mar 10, 2025 by
fengxiaoshuai
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.