This folder contains scripts to get you started with inference on Meta Llama models.
- Code Llama contains scripts for tasks related to code generation using Code Llama.
- Local Inference contains scripts for memory-efficient inference on servers and local machines.
- Mobile Inference contains scripts that use MLC to serve Llama on Android (h/t to OctoAI for the contribution!).
- Model Update Example shows an example of replacing a Llama 3 model with a Llama 3.1 model.