@lzhangzz (Collaborator) commented Sep 11, 2025

  • Add FP8*(B)F16 GEMM for sm_70 ... sm_90
  • Optimize grouped GEMM performance for all mixed GEMMs
  • Reorganize code structure

@lzhangzz changed the title from "Add FP8x(B)F16 GEMM" to "Add FP8*(B)F16 GEMM" on Sep 11, 2025
@lvhan028 added the "enhancement" (New feature or request) label on Sep 12, 2025