This repository has been archived by the owner on Oct 19, 2024. It is now read-only.
Release v0.1.0
Major feature updates:
- Support serving Meta's open source OPT model 175B
- Distributed weight init
- Add a set of strategy-specific interfaces such as DataParallel, Zero2Parallel, Zero3Parallel
Minors:
- enhancement on the PyTorch frontend
- many bugfixes
- Doc update: Performance tuning guide, and a slide deck for Alpa