Llm Inference AI Tools | Enclavetools

Skip to main content

⌘K

All Tools LLM Inference Engines LLM Models Vector Databases Agent Frameworks Chat Interfaces RAG & Document Processing Speech to Text Text to Speech Image Generation Fine-tuning & Training Monitoring & Observability Privacy & Security Embedding Models Deployment Agent & Workflow Automation Video Generation Vision & Multimodal Code Assistants Data Utilities

Compare Stack Saved tools Submit ↗Submit

Compare Stack Saved tools Submit ↗

⌘K

All Tools LLM Inference Engines LLM Models Vector Databases Agent Frameworks Chat Interfaces RAG & Document Processing Speech to Text Text to Speech Image Generation Fine-tuning & Training Monitoring & Observability Privacy & Security Embedding Models Deployment Agent & Workflow Automation Video Generation Vision & Multimodal Code Assistants Data Utilities

All Categories

All Tools LLM Inference Engines LLM Models Vector Databases Agent Frameworks Chat Interfaces RAG & Document Processing Speech to Text Text to Speech Image Generation Fine-tuning & Training Monitoring & Observability Privacy & Security Embedding Models Deployment Agent & Workflow Automation Video Generation Vision & Multimodal Code Assistants Data Utilities

12 results

Offgrid AI tools · Updated daily

Enclavetools

Stop paying for AI APIs. Everything here runs on your hardware.

Publish yours now →

Compare
llama.cpp Featured
llm-inference
LLM inference in C/C++.
Low setupCPU OnlyCommercial use
MIT★ 111kUpdated 25d ago
Compare
Ollama Featured
llm-inference
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Medium setupCPU OnlyCommercial use
MIT★ 172kUpdated 25d ago
Compare
vLLM Featured
llm-inference
A high-throughput and memory-efficient inference and serving engine for LLMs.
Medium setupNVIDIA GPU (CUDA)Commercial use
Apache 2.0★ 80kUpdated 25d ago

Sponsor

Reach 50,000+ enterprise buyers looking for private AI solutions.

Compare
ExLlamaV2
llm-inference
A fast inference library for running LLMs locally on modern consumer-class GPUs.
Medium setupNVIDIA GPU (CUDA)Commercial use
MIT★ 4.5kUpdated 26d ago
Compare
KoboldCpp
llm-inference
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Low setupCPU Only
AGPL 3.0★ 11kUpdated 25d ago

Newsletter

5 new tools, every Friday

No fluff. No spam. Join 12,000+ builders.

Compare
LocalAI
llm-inference
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Low setupCPU OnlyCommercial use
MIT★ 46kUpdated 25d ago
Compare
MLC LLM
llm-inference
Universal LLM Deployment Engine with ML Compilation.
Low setupNVIDIA GPU (CUDA)Commercial use
Apache 2.0★ 23kUpdated 26d ago
Compare
MLX (Apple)
llm-inference
MLX: An array framework for Apple silicon.
Medium setupApple Silicon (Metal)Commercial use
MIT★ 26kUpdated 25d ago
Compare
SGLang
llm-inference
SGLang is a high-performance serving framework for large language models and multimodal models.
Medium setupNVIDIA GPU (CUDA)Commercial use
Apache 2.0★ 28kUpdated 25d ago
Compare
TabbyAPI
llm-inference
The official API server for Exllama. OAI compatible, lightweight, and fast.
Medium setupNVIDIA GPU (CUDA)
AGPL 3.0★ 1.2kUpdated 25d ago

Get featured

Put your tool at the top

Featured listings get 10× more clicks and are shown prominently across the directory.

Compare
TensorRT-LLM (NVIDIA)
llm-inference
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
Medium setupNVIDIA GPU (CUDA)Commercial use
Other★ 14kUpdated 25d ago
Compare
Text Generation Inference (Hugging Face)
llm-inference
Large Language Model Text Generation Inference.
Medium setupNVIDIA GPU (CUDA)Commercial use
Apache 2.0★ 11kUpdated 25d ago

Newsletter

5 new tools, every Friday

No fluff. No spam. Join 12,000+ builders.

Featured

Put your tool at the top

Featured listings get 10× more clicks and are shown prominently across the directory.

Get featured →

Page 1 of 1

Enclavetools

Your models. Your hardware. Zero subscriptions.

Browse

Browse Saved tools

Company

More

© 2026 Enclavetools

Privacy Policy · Terms