AI Explorer
Infrastructure
LLM inference, model serving, and AI infrastructure projects.
Data source
Category lists combine GitHub search queries, repository topics, descriptions, and sync snapshots.
Ranking logic
Projects are filtered for category relevance, then ordered by stars and quality signals.
Best for
Use category pages when you already know the AI workflow or tool type you want to evaluate.
23 projects
LLM inference in C/C++
- Stars
- 111,457
- Growth
- -
- Language
- C++
- Created
- 2023-03-10
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
- Stars
- 77,365
- Growth
- -
- Language
- C++
- Created
- 2023-03-27
Port of OpenAI's Whisper model in C/C++
- Stars
- 49,893
- Growth
- -
- Language
- C++
- Created
- 2022-09-25
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
- Stars
- 42,595
- Growth
- -
- Language
- Python
- Created
- 2016-10-25
A generative speech model for daily dialogue.
- Stars
- 39,289
- Growth
- -
- Language
- Python
- Created
- 2024-05-27
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
- Stars
- 31,423
- Growth
- -
- Language
- Rust
- Created
- 2020-05-30
Find secrets with Gitleaks 🔑
- Stars
- 27,127
- Growth
- -
- Language
- Go
- Created
- 2018-01-27
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
- Stars
- 24,329
- Growth
- -
- Language
- HTML
- Created
- 2023-05-23
Faster Whisper transcription with CTranslate2
- Stars
- 23,001
- Growth
- -
- Language
- Python
- Created
- 2023-02-11
A list of free LLM inference resources accessible via API.
- Stars
- 21,880
- Growth
- -
- Language
- Python
- Created
- 2024-07-04
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
- Stars
- 20,274
- Growth
- -
- Language
- Jupyter Notebook
- Created
- 2025-06-16
High-performance In-browser LLM Inference Engine
- Stars
- 18,018
- Growth
- -
- Language
- TypeScript
- Created
- 2023-04-13
The absolute trainer to light up AI agents.
- Stars
- 17,199
- Growth
- -
- Language
- Python
- Created
- 2025-06-18
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
- Stars
- 17,179
- Growth
- -
- Language
- Python
- Created
- 2024-07-26
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
- Stars
- 16,217
- Growth
- -
- Language
- Go
- Created
- 2016-03-30
An orchestration platform for the development, production, and observation of data assets.
- Stars
- 15,541
- Growth
- -
- Language
- Python
- Created
- 2018-04-30
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
- Stars
- 14,615
- Growth
- -
- Language
- Python
- Created
- 2026-02-13
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
- Stars
- 13,372
- Growth
- -
- Language
- Python
- Created
- 2023-05-04
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
- Stars
- 12,327
- Growth
- -
- Language
- Python
- Created
- 2023-04-19
Official inference library for Mistral models
- Stars
- 10,806
- Growth
- -
- Language
- Jupyter Notebook
- Created
- 2023-09-27
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
- Stars
- 10,266
- Growth
- -
- Language
- C++
- Created
- 2018-10-15
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
- Stars
- 1,262
- Growth
- -
- Language
- Python
- Created
- 2026-02-09
A curated collection of datasets for Large Language Models (LLMs), covering medical AI, NLP, multimodal learning, instruction tuning, reasoning, code generation, and evaluation benchmarks.
- Stars
- 111
- Growth
- -
- Language
- Unknown
- Created
- 2026-05-15