Demo Repository for the Intel Gaudi

Introduction

This repository contains code to reproduce performance numbers for the Intel Gaudi.

Specifically, code is available to measure the throughput for matrix multiplication (BF16 and FP8) and the prefill stage of Llama models.

In addition, we also provide code for users to reproduce throughput numbers for NVIDIA GPUs such as the A100 and the H100. However, setting up the necessary development environments is left to the user.

Setup

Visit https://github.com/NAVER-INTEL-Co-Lab/gaudi-cresset for detailed setup instructions.

Run make env to create a .env file. This need only be done once per directory.
Run make build to build the Docker image and start the container. Run this command when you wish to rebuild the Docker image.
Run make exec to enter an existing Docker container.

Getting started

For instructions on matrix multiplication throughput measurements, visit the matmul directory. Commands are described in their respective files.

To measure prefill throughputs for Llama models, visit the prefill directory.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
matmul		matmul
prefill		prefill
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
apt.requirements.txt		apt.requirements.txt
docker-compose.yaml		docker-compose.yaml
environment.yaml		environment.yaml
pip.uninstalls.txt		pip.uninstalls.txt
pyproject.toml		pyproject.toml
upload.sh		upload.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Demo Repository for the Intel Gaudi

Introduction

Setup

Getting started

About

Languages

License

NAVER-INTEL-Co-Lab/gaudi-perf

Folders and files

Latest commit

History

Repository files navigation

Demo Repository for the Intel Gaudi

Introduction

Setup

Getting started

About

Resources

License

Stars

Watchers

Forks

Languages