FlashInfer: Kernel Library for LLM Serving
Python bindings to the Zstandard (zstd) compression library
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
Last updated: 6 days ago
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Last updated: 6 days ago
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Last updated: 6 days ago