25 results
- DeepSeek V3 (DeepSeek AI) · Featured · llm-models · Open-weight Mixture-of-Experts model with MIT license, strong reasoning, and efficient inference for offline use. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · MIT · ★ 104k · Updated 25d ago
- Llama 3 (Meta) · Featured · llm-models · The official Meta Llama 3 GitHub site. · Medium setup · CPU Only · Commercial use · Llama 3 Community License · ★ 29k · Updated 25d ago
- Mistral (Mistral AI) · Featured · llm-models · Official inference library for Mistral models. · Medium setup · CPU Only · Commercial use · Apache 2.0 · ★ 11k · Updated 25d ago
- Qwen3 (Alibaba) · Featured · llm-models · Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. · Medium setup · CPU Only · Commercial use · Apache 2.0 · ★ 27k · Updated 25d ago
- llama.cpp · Featured · llm-inference · LLM inference in C/C++. · Low setup · CPU Only · Commercial use · MIT · ★ 111k · Updated 25d ago
- Ollama · Featured · llm-inference · Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. · Medium setup · CPU Only · Commercial use · MIT · ★ 172k · Updated 25d ago
- vLLM · Featured · llm-inference · A high-throughput and memory-efficient inference and serving engine for LLMs. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · Apache 2.0 · ★ 80k · Updated 25d ago
- FAISS (Meta) · Featured · vector-databases · A library for efficient similarity search and clustering of dense vectors. · Medium setup · CPU Only · Commercial use · MIT · ★ 40k · Updated 25d ago
- Milvus · Featured · vector-databases · Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search. · Medium setup · CPU Only · Commercial use · Apache 2.0 · ★ 44k · Updated 25d ago
- supertonic · llm-models · Lightning-Fast, On-Device, Multilingual TTS, running natively via ONNX. · Low setup · CPU Only · Commercial use · MIT · ★ 9.2k · Updated 20d ago
- Gemma 3 (Google) · llm-models · Gemma open-weight LLM library, from Google DeepMind. · Low setup · CPU Only · Commercial use · Gemma Terms of Use · ★ 5.2k · Updated 25d ago
- OLMo (Allen AI) · llm-models · Modeling, training, eval, and inference code for OLMo. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · Apache 2.0 · ★ 6.5k · Updated 25d ago
- Phi-3 (Microsoft) · llm-models · Microsoft compact open-weight models delivering strong performance at small sizes, ideal for low-resource air-gapped systems. · Low setup · CPU Only · Commercial use · MIT · ★ 1.7k · Updated 6mo ago
- ExLlamaV2 · llm-inference · A fast inference library for running LLMs locally on modern consumer-class GPUs. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · MIT · ★ 4.5k · Updated 26d ago
- KoboldCpp · llm-inference · Run GGUF models easily with a KoboldAI UI. One File. Zero Install. · Low setup · CPU Only · AGPL 3.0 · ★ 11k · Updated 25d ago
- LocalAI · llm-inference · LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. · Low setup · CPU Only · Commercial use · MIT · ★ 46k · Updated 25d ago
- MLC LLM · llm-inference · Universal LLM Deployment Engine with ML Compilation. · Low setup · NVIDIA GPU (CUDA) · Commercial use · Apache 2.0 · ★ 23k · Updated 26d ago
- MLX (Apple) · llm-inference · MLX: An array framework for Apple silicon. · Medium setup · Apple Silicon (Metal) · Commercial use · MIT · ★ 26k · Updated 25d ago
- SGLang · llm-inference · SGLang is a high-performance serving framework for large language models and multimodal models. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · Apache 2.0 · ★ 28k · Updated 25d ago
- TabbyAPI · llm-inference · The official API server for Exllama. OAI compatible, lightweight, and fast. · Medium setup · NVIDIA GPU (CUDA) · AGPL 3.0 · ★ 1.2k · Updated 25d ago
- TensorRT-LLM (NVIDIA) · llm-inference · TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · Other · ★ 14k · Updated 25d ago
- Text Generation Inference (Hugging Face) · llm-inference · Large Language Model Text Generation Inference. · Medium setup · NVIDIA GPU (CUDA) · Commercial use · Apache 2.0 · ★ 11k · Updated 25d ago
- Chroma · vector-databases · Search infrastructure for AI. · Low setup · CPU Only · Commercial use · Apache 2.0 · ★ 28k · Updated 25d ago
- LanceDB · vector-databases · Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less. · Low setup · CPU Only · Commercial use · Apache 2.0 · ★ 10k · Updated 25d ago
- pgvector · vector-databases · Open-source vector similarity search for Postgres. · Medium setup · CPU Only · Commercial use · Other · ★ 21k · Updated 25d ago
Offgrid AI tools · Updated daily
Enclavetools
Stop paying for AI APIs. Everything here runs on your hardware.