A high-throughput and memory-efficient inference and serving engine for LLMs. (Fork for contributing; all changes are intended for upstream PRs.)
MNN is a blazing-fast, lightweight deep learning framework, battle-tested by business-critical use cases at Alibaba.
An easy-to-use dynamic service discovery, configuration, and service management platform for building cloud-native applications.
A universal sandbox platform for AI application scenarios, providing multi-language SDKs, unified sandbox protocols, and sandbox runtimes for LLM-related capabilities.
A Go web framework for quickly building online recommendation services based on JSON configuration.
Official repository for the paper "LaTo: Landmark-tokenized Diffusion Transformer for Fine-grained Human Face Editing".