Skip to content

atlarge-research/AIE-BLAS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

aieblas

This repository contains a BLAS implementation for the AMD AI Engine.

Note that the library is mostly a proof of concept and does not support all routines.

Supported routines

Level 1:

  • ASUM
  • AXPY
  • DOT
  • IAMAX
  • NRM2
  • ROT
  • SCAL

Level 2:

  • GEMV

Supported devices

  • AMD Versal VCK5000

Compiling the library

To compile the code generator, run ./configure.sh && cmake --build build in the folder aieblas/.

Running the benchmarks

To run the benchmarks, first compile the code generator, and then build the benchmarks by running ./build-all.sh in the folder benchmark/util.

About

BLAS Implementation for AMD/Xilinx Versal Devices

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published