Sparse Matrix Tutorials

Sparse Matrix Computations in CUDA

This project implements various sparse matrix computations in CUDA and C++. It includes conversion routines between sparse matrix formats and efficient CUDA kernels for Sparse Matrix-Vector ...

GitHub

sparse-attention.md

in which sparse_self_attention is an instance of SparseSelfAttention. This module computes attention context through sparse attention replacing underlying matrix multiplications and softmax with their ...

IEEE

A Context-Awareness and Hardware-Friendly Sparse Matrix Multiplication Kernel for CNN Inference Acceleration

Abstract: Sparsification technology is crucial for deploying convolutional neural networks in resource-constrained environments. However, the efficiency of sparse models is hampered by irregular ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Sparse Matrix Computations in CUDA

sparse-attention.md

A Context-Awareness and Hardware-Friendly Sparse Matrix Multiplication Kernel for CNN Inference Acceleration

Trending now