Hereβs a comprehensive and categorized list of open source AI stack components that you can mix and match when building GenAI applications β especially when focusing on modularity, scalability, and performance. This includes components for data processing, model serving, retrieval-augmented generation (RAG), vector search, and orchestration.
π§ Foundational Model Alternatives
Models you can self-host or fine-tune:
-
LLMs
-
Multimodal
-
Fine-Tuning
-
QLoRA,LoRA,PEFT(via π€ Transformers + PEFT) -
Axolotlβ Full stack fine-tuning.
-
π RAG (Retrieval-Augmented Generation) Stack
Tools to power knowledge-based Q&A systems:
-
Embeddings
sentence-transformers-
Instructor-XLβ Instruction-based embeddings.
-
Vector Databases
-
Document Loaders & Chunking
-
LangChainorLlamaIndex -
Haystackβ Full RAG pipelines.
-
π§ Serving & Orchestration
Serving models with APIs, managing prompts, memory, and chaining tools:
-
Model Servers
-
vLLMβ Fast LLM serving with paged attention. -
TGIβ HuggingFaceβs scalable inference server. Triton Inference Server-
LMDeployβ Model optimization & serving.
-
-
Agent / Workflow Frameworks
LangChainLlamaIndexHaystack-
CrewAIβ Multi-agent framework. AutoGen
-
Prompt Management
PromptLayerLangfuse-
Helicone(for logging OpenAI usage)
πΌοΈ Frontend / Chat UI
For chatbots or multimodal interfaces:
-
Next.jsβ UI + SSR/ISR. -
ShadCN/uiβ Design system for building clean UIs. -
Chatbot UIβ Open-source ChatGPT-style interface. -
Open WebUIβ Web UI for LM Studio / Ollama.
π Inference & Runtime Optimization
-
llm.rsβ LLM inference in Rust. -
ggmlβ Quantized models, runs on CPU. -
exllamaβ High-perf quantized inference.
π Security & DevOps (for production)
-
AuthN/AuthZ: [
Auth.js(NextAuth)], [Clerk], [Ory], [ZITADEL] -
Logging/Tracing: [
Langfuse], [OpenTelemetry], [Sentry] - DevOps: Docker, Kubernetes, GitHub Actions, Terraform
π§± Full Stack Boilerplates
If you're looking to start fast:
-
AI Engineer OSβ Full-stack open source GenAI stack. -
LangChainHubβ Reusable chains and prompts. -
OpenChatKitβ Chatbot framework. -
Flowiseβ Visual LangChain builder.
π§ͺ Experimental Tools
-
Ollamaβ Run and manage LLMs locally. -
Modalβ Serverless infra for AI. -
LiteLLMβ Drop-in proxy for OpenAI-compatible APIs.
Top comments (1)
Could you write a new topic about how to use that