6 results
- Ray Serve FeatureddeploymentScalable model serving built on Ray with dynamic batching, GPU sharing, and multi-model composition for AI pipelines.Medium setupCPU OnlyCommercial useApache 2.0★ 31kUpdated 26d ago
- BentoMLdeploymentUnified model serving framework supporting any model with optimized inference, batching, and auto-scaling for production.Medium setupCPU OnlyCommercial useApache 2.0★ 16kUpdated 27d ago
- KServedeploymentKubernetes-native model serving platform with canary rollouts, auto-scaling, and multi-framework support for production AI.High setupNVIDIA GPU (CUDA)Commercial useApache 2.0★ 5.2kUpdated 27d ago
- OpenVINO (Intel)deploymentIntel toolkit for optimizing and deploying AI inference on CPU, GPU, and NPU hardware with low-latency execution.Low setupCPU OnlyCommercial useApache 2.0★ 7.2kUpdated 26d ago
- Seldon CoredeploymentEnterprise-grade Kubernetes platform for deploying, monitoring, and managing ML models at scale with built-in explainability.High setupNVIDIA GPU (CUDA)Commercial useApache 2.0★ 5.8kUpdated 1mo ago
- Triton Inference Server (NVIDIA)deploymentNVIDIA production inference server supporting any framework with concurrent model execution and GPU optimization.Medium setupNVIDIA GPU (CUDA)Commercial useBSD★ 12kUpdated 28d ago
Offgrid AI tools · Updated daily
Enclavetools
Stop paying for AI APIs. Everything here runs on your hardware.
Sponsor
Reach 50,000+ enterprise buyers looking for private AI solutions.
Newsletter
5 new tools, every Friday
No fluff. No spam. Join 12,000+ builders.
Get featured
Put your tool at the top
Featured listings get 10× more clicks and are shown prominently across the directory.
Page 1 of 1