Distributing the LLaMA LLM with Wrapyfi

Wrapyfi enables distributing LLaMA (inference only) across multiple GPUs or machines, each with less than 16 GB of VRAM. The model is split into stages, and intermediate activations are forwarded between workers over Wrapyfi's middleware layer.
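
Below is a minimal sketch of how such a split might look with Wrapyfi's `MiddlewareCommunicator`: one worker publishes the hidden states produced by its share of transformer blocks, and the next worker listens for them before running its own blocks. The `LlamaStage` class, the `/llama/hidden_state` topic, and the half-and-half partitioning are illustrative assumptions, not the exact pipeline used here.

```python
# Hypothetical sketch of hidden-state forwarding between two LLaMA workers
# via Wrapyfi. Class name, topic, and partitioning are assumptions.
import torch
from wrapyfi.connect.wrapper import MiddlewareCommunicator


class LlamaStage(MiddlewareCommunicator):
    """Runs a contiguous slice of the model's transformer blocks."""

    def __init__(self, blocks):
        super().__init__()
        self.blocks = blocks  # e.g., blocks 0..15 on worker A, 16..31 on worker B

    @MiddlewareCommunicator.register(
        "NativeObject", "zeromq", "LlamaStage", "/llama/hidden_state",
        carrier="tcp", should_wait=True)
    def exchange_hidden_state(self, hidden_state=None):
        # In "publish" mode this returns the tensor the worker computed;
        # in "listen" mode the argument is ignored and the call blocks until
        # a tensor arrives. Torch tensors are serialized by Wrapyfi's
        # NativeObject plugin system.
        return hidden_state,

    def forward(self, hidden_state):
        # Run this worker's share of the transformer blocks.
        for block in self.blocks:
            hidden_state = block(hidden_state)
        return hidden_state


# Worker A (first half of the blocks) publishes its output:
#   stage = LlamaStage(first_half_blocks)
#   stage.activate_communication(stage.exchange_hidden_state, mode="publish")
#   stage.exchange_hidden_state(stage.forward(embeddings))
#
# Worker B (second half) listens, then continues the forward pass:
#   stage = LlamaStage(second_half_blocks)
#   stage.activate_communication(stage.exchange_hidden_state, mode="listen")
#   hidden_state, = stage.exchange_hidden_state()
#   logits = output_head(stage.forward(hidden_state))
```

Since the topic is carried over ZeroMQ with TCP, the two workers can sit on the same machine or on separate hosts; each only needs enough VRAM for its own slice of the blocks.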