Stars
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
STREAM, for lots of devices written in many programming models
Code and Lexicons for WebSci2019 paper: Exploring Misogyny across the Manosphere in Reddit
a Parallel Unstructured Mesh Library for reading large unstructured meshes
A scientific software for the numerical simulation of seismic wave phenomena and earthquake dynamics
Linux system exploration and troubleshooting tool with first class support for containers