-
Notifications
You must be signed in to change notification settings - Fork 5
Implement AVX-512 SIMD Optimizations #5
Copy link
Copy link
Open
Labels
optimizationCode optimization and algorithmic improvementsCode optimization and algorithmic improvementsperformanceSpeed improvements and optimization workSpeed improvements and optimization workpriority: mediumImportant improvements that enhance functionality or performanceImportant improvements that enhance functionality or performancesimdSIMD optimizations (AVX2, AVX-512, NEON)SIMD optimizations (AVX2, AVX-512, NEON)
Metadata
Metadata
Assignees
Labels
optimizationCode optimization and algorithmic improvementsCode optimization and algorithmic improvementsperformanceSpeed improvements and optimization workSpeed improvements and optimization workpriority: mediumImportant improvements that enhance functionality or performanceImportant improvements that enhance functionality or performancesimdSIMD optimizations (AVX2, AVX-512, NEON)SIMD optimizations (AVX2, AVX-512, NEON)
AVX2 is working great (42% speedup on ML-DSA), but newer CPUs have AVX-512 which should give more boost.
Targets for AVX-512:
Requirements: