Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md

Awesome Free Models Awesome

A curated list of free AI models, APIs, and tools you can use without paying a cent.

Last Updated Models Tools Sections License

✅ All links verified live on June 23, 2026. ~300 URLs checked. 0 broken links, 2 archived repos, 1 GitHub org rename (OpenCode → anomalyco), 3 inaccurate descriptions fixed, 5 unmaintained projects flagged, 2 pricing changes (Fireworks, Tensorlake).

Running AI shouldn't require a credit card. This list curates genuinely free models — open-weight models you can self-host, free API tiers from major providers, and tools to run everything locally.


Contents


🧠 Open-Weight Models

📅 Last checked: June 23, 2026

Notable open-weight models you can download and run on your own hardware.

NameReleasedDescription
Llama 4 Scout / MaverickMeta's latest MoE generation. Scout: 109B, 10M context. Maverick: 402B, 1M context. Native multimodal. [License]
DeepSeek V4Latest generation with extreme cost-efficiency. MIT license.
DeepSeek-V4-FlashApr 2026Efficiency-focused variant of DeepSeek V4. 1M token context, optimized for fast inference. MIT license.
Gemma 4 31B / 26B MoE / E4B / E2BFully permissive Apache 2.0. 256K context, native multimodal. New standard for open-weight.
GLM-5.1 (Zhipu AI)744B MoE model, competitive with top proprietary models. MIT license.
MiniMax M3Frontier-tier 1M context, native multimodal + computer use. MSA architecture.
Trinity (Arcee AI)400B parameter enterprise model. Apache 2.0.
Step 3.7 Flash (StepFun)May 2026Apache 2.0. Native multimodal (image+video), strong agentic performance. Efficient enough for high-end local hardware.
Kimi K2.6 (Moonshot AI)Apr 20261T-parameter MoE model. Modified MIT license. Exceptional coding (SWE-Bench ~54%) and multi-agent swarm orchestration.
Qwen 3.6-35B-A3BApr 2026MoE variant with only 3B active parameters. Extremely efficient for consumer hardware. Apache 2.0.
InternLM 3 (Shanghai AI Lab)Early 2026Strong long-context reasoning and agentic performance. Competitive in open-weight benchmarks.
MiMo-V2.5-Pro (Xiaomi)Apr 20261.02T-parameter MoE (42B active). Optimized for complex agentic tasks, coding, and long-context.
Bonsai 8B (PrismML)Apr 2026Groundbreaking 1-bit quantized model. Extremely efficient for edge and consumer hardware (Apple Silicon).
Mistral Small 3.1 (Mistral)Mar 2025Versatile 24B multimodal model. Strong text performance with native image understanding and 128K context. Apache 2.0.
Mistral Small 4 (Mistral)Mar 2026Hybrid MoE (6.5B active params) unifying instruction, reasoning, and multimodal capabilities. Efficient frontier-class model. Apache 2.0.
Command A+ (Cohere)May 2026Enterprise multimodal MoE optimized for sovereignty and multilingual RAG across 48 languages. Apache 2.0.
Hermes 4 (NousResearch)Feb 2026Self-improving agentic model with closed-loop learning. Curates own memory and builds skills from experience. Apache 2.0.
Snowflake ArcticApr 2024Enterprise MoE model balancing high-quality performance with efficient training costs. Optimized for complex data operations. Apache 2.0.
Falcon 3 (TII)Dec 2024Compact high-performance model with strong reasoning. Designed for efficient deployment on resource-constrained hardware. TII Falcon-LLM License 2.0.
Apple OpenELMApr 2024Family of efficient on-device SLMs using layer-wise attention scaling. Runs locally on Apple Silicon with full privacy. Apple Sample Code License.
Nemotron 3 Ultra (NVIDIA)Jun 2026550B MoE (55B active). Hybrid Mamba-Transformer, NVFP4 quantization. Optimized for agentic workflows. Fully open (weights, data, recipes). OpenMDW-1.1 license.

🔌 Free API Providers

📅 Last checked: June 25, 2026

Providers offering free tiers to access models via API — no local hardware required.

NameDescription
Google AI StudioMost generous free tier. Access Gemini 2.5 Flash, Gemini 2.0 Flash, and other models. Generous rate limits for prototyping.
OpenRouterAggregates 400+ models from 70+ providers. Filter by "Free" to see models available at no cost. Includes experimental and subsidized open-weight models.
GroqExtremely fast inference for text, speech, and vision-based OCR. Free tier supports GPT-OSS, Llama 4, and Whisper, but is bottlenecked by low per-minute caps (30 RPM / 8K TPM), requiring a paid upgrade for production scale.
Hugging Face Inference APIFree tier for thousands of community models. Rate-limited but excellent for testing.
NVIDIA NIMFree API access to accelerated versions of Llama, Mistral, Gemma, Nemotron 3 Ultra, and more on NVIDIA infrastructure.
Together AIAccess 200+ open-source models including MiniMax M3, DeepSeek, Qwen. Free credits may be available for new accounts — check current promotions.
Fireworks AI$1 free starter credits for new users. Optimized for low latency across 50+ models including GLM 5.2, Kimi K2.7, MiniMax M3. ⚠️ Moved to prepaid billing July 1, 2026.
SiliconFlowRising platform with free access to many open-source models.
Cloudflare Workers AIFree tier for running 50+ open-source models at the edge on Cloudflare's global network. Pay-for-what-you-use pricing.
ReplicateFree tier with limited credits for running open-source models. Pay-per-second for GPU usage.
Poe (Quora)Free tier with daily credits for GPT-4 mini, Claude instant, and community bots.
Cerebras1M tokens/day free, no credit card. Ultra-fast inference on WSE chips. Access Llama 3.3 70B, GPT-OSS 120B, Qwen 3, and more via OpenAI-compatible API.
Qwen Chat (Alibaba)Free access to Qwen 3.7-Plus, Qwen 3.6-Max, and other Qwen models via web chat and API.
Ollama CloudFree tier for running open-source models on Ollama's cloud infrastructure. Light usage included, 1 concurrent model. Pro ($20/mo) and Max ($100/mo) tiers available. Zero data retention.
OpenAI API~$5 trial credits for new API accounts. Access GPT-5, GPT-4o, o4-mini, and more. Rate-limited free tier available after credits expire.
Mistral AI (Vibe)⚠️ Pivoted to Vibe (consumer AI agent). Free tier for Vibe chat agent with limited messages. API access via Studio — enterprise pricing, contact sales. New models: Mistral OCR 4, Mistral Medium 3.5.
Model RouterFree API with intent-based routing across Groq and Cerebras models (Llama 4 Scout, DeepSeek, Qwen, Nemotron) — no credit card, no trial credits. Set `prefer=cheap
CohereFree trial API key for Command A+, North Mini Code, Transcribe, Embed 4, Rerank 4, and Aya models. Rate-limited, not for production.
DeepSeek PlatformFree API credits for new users (5M tokens). Access to DeepSeek V4, DeepSeek-R1, and other models. Generous free allocation.
GitHub ModelsFree tier for GitHub users. Access GPT-5, GPT-4o, o4-mini, Phi-4, Llama 4, Mistral, and more with rate-limited playground and API.
Hyperbolic⚠️ No standalone free tier — pay-per-use GPU cloud. Free credits via referral program only ($5 for referrer, $6 for referee when referee deposits $5+).
Novita AIFree starter credits for testing 200+ models including DeepSeek V4 Pro, MiniMax M3, GLM-5.1, Kimi K2.6. Also offers Agent Sandbox and GPU Cloud. OpenAI-compatible API.
Anakin.ai30 daily free credits for accessing multiple AI models. Web chat interface and API access. Supports GPT-4, Claude, DeepSeek, and open-weight models.
Anthropic (Claude API)~$5 trial credits for new API accounts. Access Claude Opus 4.8, Sonnet, and Haiku models. Phone verification required.
Nebius AI$100 free credits for new users. AI Studio with access to Llama, Qwen, DeepSeek, Nemotron 3 Ultra, and other open-weight models. Fast inference on NVIDIA H100/B200 infrastructure.
Fal.aiFree starter credits for generative media inference. 1,000+ models for image, video, audio, and 3D. SOC 2 compliant. Pay-as-you-go beyond free tier.
Vercel AI Gateway$5/month free credits for the AI Gateway. Proxy and cache requests across multiple LLM providers. SDK is open-source and free.
AI21 Labs⚠️ Enterprise-focused. Maestro framework and Jamba models. No clear free tier visible — contact sales.
Amazon Bedrock$200 AWS credits for new customers. Access to Llama, Mistral, Claude, Titan, and other foundation models via API.
Azure AI Foundry$200 free trial credits (30 days). Access GPT-4o, Llama, Mistral, Phi, and other models via Azure's unified AI platform.
xAI (Grok)$25 sign-up credits + $150/month with data-sharing program. Access Grok-3, Grok-3 Mini via API. No credit card required. ⚠️ Console may require alternate access.
ZeroLimitAIFree API with auto model routing to the best available free model (Gemini 2.5 Flash, Llama 4, DeepSeek R1). No credit card, drop-in OpenAI replacement. Paid plans from $49 one-time.
Stability AIFree API credits for image generation with Stable Diffusion and Stable Video models. Rate-limited access without credit card.
Eden AIFree tier aggregating 500+ models from multiple providers via a single API key. Unified interface for text, image, and code generation. GDPR-compliant EU endpoint available.
SambaNovaFree tier with fast inference on custom RDU chips. Access DeepSeek, Llama 4, MiniMax M2.7, GPT-OSS 120B. Fastest inference speeds available.
Inference.netFree tier for LLM observability and monitoring. Deploy, trace, evaluate, and train custom models. SOC 2 Type II compliant.
RunPod⚠️ No free tier — pay-per-use for pods, serverless, and clusters.
FreeTheAiFree OpenAI-compatible AI API gateway with 50+ active models. Discord-based key signup with daily check-in to keep access active. Streaming, tool calling, and multiple model support. No credit card.

💻 Local Inference Tools

📅 Last checked: June 23, 2026

Run models on your own machine — no API keys needed, full privacy.

NameDescription
OllamaThe easiest way to run local LLMs. One command to download and run any model. macOS, Linux, Windows. GitHub
LM StudioPolished desktop GUI. Browse, download, and chat with models. Built-in model browser and local API server.
llama.cppHigh-performance C++ inference engine. Runs on CPU and GPU. Supports GGUF quantization. Powers most other local tools.
JanOpen-source ChatGPT alternative for desktop. Built-in model downloader, local API server. GitHub
GPT4AllPrivacy-focused local chatbot. Runs on consumer hardware. Built-in model browser. GitHub
text-generation-webui (Oobabooga)Feature-rich web UI. Supports multiple backends (Transformers, llama.cpp, ExLlama, AutoGPTQ).
LocalAIDrop-in OpenAI API replacement. Run models locally with an OpenAI-compatible API. GitHub
KoboldCPPSingle-file executable for running GGUF models. Focused on story generation but general-purpose.
llamafile (Mozilla)Distributable single-file executables that run LLMs. No installation needed.
vLLMHigh-throughput production inference engine. Uses PagedAttention for efficient serving.
SGLangFast inference framework with structured generation and RadixAttention.
TensorRT-LLM (NVIDIA)NVIDIA's optimized inference engine. Best performance on NVIDIA GPUs.
ExLlamaV2Fast GPTQ/EXL2 inference library. Less active; newer development on ExLlamaV3.
Aphrodite EngineHigh-performance LLM serving engine with advanced quantization support.
TabbyAPILightweight, fast OpenAI-compatible API server for ExLlamaV2.
LlamaEdgeLightweight inference framework for edge devices. OpenAI-compatible API for open-source models. Runs on WasmEdge for portability. GitHub
MLC LLMUniversal deployment engine by UW/SJTU. Runs LLMs on any hardware — laptops, phones, browsers. OpenAI-compatible API.
WebLLMIn-browser LLM inference via WebGPU. Runs models directly in your browser with zero setup. No server needed.
FastChat (LMSYS)Open platform for training, serving, and evaluating LLMs. Provides OpenAI-compatible API and web UI for local models.
Hugging Face TGI⚠️ Archived. Use vLLM or SGLang instead.
DeepSpeed (Microsoft)Deep learning optimization library with inference acceleration. Enables running larger models on limited hardware through ZeRO optimization.
AirLLMRun large models (70B+) on consumer hardware with limited memory. Loads models layer-by-layer for extreme memory efficiency.
AI Toolkit for VS Code (Microsoft)VS Code extension to browse, test, fine-tune, and deploy models locally. Integrates ONNX and llama.cpp.
Ollama Grid SearchDesktop utility for systematic model evaluation. Test multiple models, prompts, and inference parameters side-by-side via a Rust/React GUI.

💬 AI Chatbot UIs

📅 Last checked: June 23, 2026

Free, open-source web interfaces for chatting with AI models — self-host or use hosted versions.

NameDescription
Open WebUIFeature-rich ChatGPT-like interface for Ollama and OpenAI-compatible backends. RAG, image generation, multi-user. GitHub
LibreChatOpen-source ChatGPT clone supporting 40+ providers, multi-user, plugins, and RAG. GitHub
AnythingLLMAll-in-one desktop app for chatting with documents and models. Built-in RAG pipeline. GitHub
Big-AGIFeature-rich AI chat with personas, multi-model support, voice, and code execution. GitHub
LobeChatModern, extensible chat framework with plugin system and multi-provider support. GitHub
Chatbot UISimple, clean ChatGPT interface. Easy to self-host with any OpenAI-compatible API. ⚠️ Open-source repo unmaintained. GitHub
NextChat (ChatGPT-Next-Web)Lightweight cross-platform chat app. Self-host on Vercel or download official desktop/mobile clients.

🖥 AI CLI Tools

📅 Last checked: June 25, 2026

General-purpose terminal-based AI tools — chat, summarization, file operations, and more.

NameReleasedDescription
Gemini CLIFeb 2025Google's open-source terminal AI agent. 1,000 requests/day free on personal Google account. General-purpose agent for code, chat, and shell tasks. Gemini 3 models, 1M context. Apache 2.0. 106k stars.
CodexMay 2025OpenAI's lightweight coding agent. Rust-based with OS-level sandboxing (macOS Seatbelt, Linux Landlock). AGENTS.md support, image input, subagents, MCP. Apache 2.0. 93k stars.
OpenCodeJan 2025Go-based terminal AI agent. Model-neutral, supports 75+ LLM providers, LSP integration, and MCP tools. Desktop app in beta. MIT. 178k stars.
Pi2024Open-source terminal AI agent with unified multi-provider API. Model-agnostic, extensible plugin architecture. 65k stars. MIT.
Hermes AgentFeb 2026Nous Research's self-improving terminal AI agent. Full TUI with slash commands, 40+ tools, persistent memory. Multi-platform gateway (Telegram, Discord, Slack, WhatsApp). Closed learning loop with autonomous skill creation. Apache 2.0. 202k stars.
Vibe CLI2025Mistral's open-source CLI coding agent. Free tier with Mistral Experiment tier (no credit card). Conversational iterative workflow. AGENTS.md support, skills system, voice mode, MCP. Apache 2.0. 4.6k stars.
Goose2024Open-source CLI agent for complex software engineering tasks. Extensible plugin system. Originally by Block, now under the Agentic AI Foundation (AAIF) at the Linux Foundation. Desktop app + CLI + API. Rust-based. Apache 2.0. 50k stars.
MiMo Code2026Xiaomi's terminal AI tool with persistent memory, multi-agent orchestration, and 1M-token context. Free tier available. Supports mimo-v2.5-pro, mimo-v2.5, mimo-v2-omni models. Web UI in alpha.
Tuillem20253-pane terminal AI chat client written in Rust. Switch providers and models mid-conversation. Full markdown rendering, SQLite history with FTS5 search. 10 built-in themes. Plugin system. MIT.
Hai2025Lightweight terminal AI agent. Run commands or ask questions. Supports OpenAI, Claude, Gemini, DeepSeek. Agent mode with auto shell execution, pipe support, predefined prompts. GPL-3.0.
FreebuffAn AI-powered CLI, supported by ads, with multi-agent orchestration.

🤖 AI Coding Assistants

📅 Last checked: June 23, 2026

Free tools that integrate AI into your development workflow.

NameDescription
Continue.devOpen-source AI code assistant. Chat, autocomplete, and edit with any model. GitHub
AiderAI pair programming in the terminal. Edits code in your local git repo. Supports GPT, Claude, and local models. GitHub
Devin Desktop (formerly Windsurf/Codeium)AI code editor with autocomplete, chat, and search. Now by Cognition. Free tier available. Pro $20/mo.
TabbySelf-hosted AI coding assistant with no dependency on external services. GitHub
Cody (Sourcegraph)Free tier for individuals. Chat, autocomplete, and commands with codebase context.
Llama Coder (Nutlope)Free AI code generation tool. Generate entire apps from prompts.
Bolt.new (StackBlitz)Free tier for AI-powered full-stack web app development in browser.
Claude Code (Anthropic)Requires Claude subscription or API account. Terminal-based AI coding assistant.
CursorAI-native code editor with deep model integration and agentic features. Free tier available. Pro $20/mo.
CodeBuff⚠️ Paid only. CLI-based AI coding assistant that understands entire codebases.
ClinePopular autonomous VS Code agent. Creates/edits files, runs terminal commands, browses web. Open-source, BYOK (bring your own API key). GitHub
OpenHandsAutonomous AI software engineer. Navigates file systems, runs shell commands, tests code in browser. Self-hostable. GitHub
Kodu (Claude Coder)VS Code autonomous coding agent. Builds projects from scratch, handles complex tasks with natural language.
GooseOpen-source CLI agent for complex software engineering tasks. Extensible plugin system. Originally by Block, now under the Agentic AI Foundation (AAIF) at the Linux Foundation. GitHub

📝 Code Models

📅 Last checked: June 23, 2026

Specialized for code generation, completion, and analysis.

NameDescription
DeepSeek CoderState-of-the-art open-weight code generation. DeepSeek's coder series leads SWE-bench. MIT license.
Qwen2.5-Coder (Alibaba)Highly capable code model series (1.5B–32B). Excellent balance of speed and quality. Apache 2.0.
Codestral (Mistral)Mistral's dedicated code generation model — fill-in-the-middle, completion, and instruction.
CodeGemma (Google)Google's Gemma architecture fine-tuned for code completion and instruction. Apache 2.0.
StarCoder2 (BigCode)Transparently trained code model covering 619 languages. OpenRAIL-M license.
Yi-Coder (01.AI)Efficient coding model with strong long-context understanding. Yi License (Apache 2.0 compatible).
Phi-4-mini (Microsoft)Lightweight model optimized for reasoning and code. Punches above its weight class. MIT license.
Qwen3-Coder-Next (Alibaba)Early 2026. Latest generation of Qwen's code series. Strong reasoning and long-context coding capabilities. Apache 2.0.
CodeLlama (Meta)Aug 2023. Llama 2-based code generation pioneer. Supports infilling, completion, and instruction. Llama 2 Community License.
WizardCoder (WizardLM)2023. Evol-Instruct fine-tuned for complex coding tasks. Strong general code generation performance. Apache 2.0.
OpenCodeInterpreter2024. Integrates execution feedback to iteratively improve generated code. Bridges generation and execution. Apache 2.0.
Stable Code 3B (Stability AI)Aug 2023. Lightweight 3B code model optimized for fill-in-the-middle. Efficient for local autocompletion. StabilityAI license.
CodeGeeX2 (THUDM)2023. Multilingual code model supporting 20+ languages. Strong in both Chinese and English code tasks. Apache 2.0.
CodeT5+ (Salesforce)2023. Encoder-decoder architecture unifying code generation, completion, and understanding. BSD-3 license.
SantaCoder (BigCode)2023. Light 1.1B model specialized for Python, Java, and JavaScript. Fast and efficient for IDE integration.

🔍 RAG & Vector Databases

📅 Last checked: June 23, 2026

Free tools for building retrieval-augmented generation pipelines — vector storage, embedding search, and document retrieval.

NameDescription
ChromaAI-native open-source embedding database. Runs in-process, no GPU needed. GitHub
QdrantHigh-performance vector search engine. Free tier on Qdrant Cloud or self-host via Docker. GitHub
pgvectorVector similarity search inside PostgreSQL. Free if you already run Postgres.
LanceDBDeveloper-friendly vector database built on Lance columnar format. Runs locally, no server needed. GitHub
WeaviateOpen-source vector database. Free sandbox tier on Weaviate Cloud. GitHub
Milvus (Zilliz)Cloud-native vector database. Free tier on Zilliz Cloud or self-host. GitHub
txtaiAI-powered semantic search and RAG in a single Python package. GitHub
R2R (SciPhi)Production-ready RAG engine with API, user management, and observability.
Docling (IBM)Document understanding and conversion for RAG pipelines. Extracts PDFs, images, and more. GitHub
Unstructured.ioPreprocessing toolkit for documents (PDF, HTML, Word) for RAG pipelines. Free tier available.
RAGFlowOpen-source RAG engine with deep document parsing, OCR, and knowledge base management. Supports多种 document formats.
RAGatouillePython package bringing ColBERT-style late interaction retrieval to RAG pipelines. Works as retriever and reranker. Free and open-source.
RagasOpen-source evaluation framework for RAG pipelines. Measures retrieval accuracy, answer relevance, and faithfulness.

🧩 Agentic Frameworks

📅 Last checked: June 23, 2026

Free, open-source frameworks for building AI agents and multi-agent systems.

NameDescription
LangGraph (LangChain)Low-level framework for building stateful, multi-agent applications. GitHub
CrewAIMulti-agent framework for orchestrating specialized AI agents to work together. GitHub
AutoGen (Microsoft)Extensible framework for building multi-agent conversations. ⚠️ Maintenance mode — use Microsoft Agent Framework instead. GitHub
Agno (formerly Phidata)Full-stack AI framework for building multimodal agents with memory, knowledge, and tools. GitHub
PydanticAIAgent framework by Pydantic with type-safe outputs and dependency injection. GitHub
MastraTypeScript framework for building AI applications and agent workflows. GitHub
OpenAI Agents SDKLightweight SDK for building single and multi-agent systems. GitHub
Semantic Kernel (Microsoft)SDK for orchestrating AI agents with planners, memory, and connectors. GitHub
DifyLLM app development platform with visual workflow builder and agent capabilities. GitHub
FlowiseLow-code visual LLM flow builder with drag-and-drop interface. GitHub
TaskWeaver (Microsoft)⚠️ Archived. Code-first agent framework for planning and executing complex tasks. GitHub
FazmApr 2026. Open-source local computer-use agent for macOS. Drives apps via accessibility APIs, model-agnostic, faster than screenshot-based agents.
Smolagents (Hugging Face)Minimalist agent library where agents "think in code." Lightweight, zero boilerplate. Supports code agents and tool-calling agents.
SwarmsEnterprise-grade multi-agent orchestration framework. Scalable infrastructure for autonomous agent swarms. Highly modular.
Letta (MemGPT)Framework for long-term agent memory. Virtual memory management that pages data in/out of context like an OS. Persistent agents.
GriptapeEnterprise agent framework with strictly typed Pipelines, Workflows, and Agents. Structure-first, production-ready.
OpenAI SwarmExperimental lightweight multi-agent orchestration. Uses Agents and Handoffs abstractions. Educational and minimalist.
Atomic AgentsFramework inspired by Atomic Design. Compose agents from small, reusable, modular components. Testable and scalable.
PraisonAILow-code multi-agent framework. Define agent roles, tasks, and flows via YAML configuration. Wraps underlying agent frameworks.
CogneeGraphRAG framework for agent knowledge management. Builds interconnected knowledge graphs from unstructured data.
AgentZeroSelf-healing autonomous agent with web UI. Manages own workflows, tool use, and environment. Self-evolving capabilities.
MetaGPTMulti-agent framework simulating a full software team. Assigns Agent, Product Manager, Engineer roles. Implements SOPs for end-to-end code generation.
ChatDev (OpenBMB)Virtual software company driven by multi-agent collaboration. Follows waterfall model through design, coding, testing, and documentation.
AutoGPTThe original autonomous agent experiment. Sets its own goals, iterates on tasks, and executes without continuous human input. Web browsing and file management.
Bee Agent Framework (IBM)Production-ready framework for building reliable AI agents in Python and TypeScript. Modular, with built-in observability and IBM research optimizations.
Eliza (elizaOS)Multi-platform agent framework for creating character-driven AI agents. Handles social media interaction, complex decision-making, and autonomous behavior across platforms.
SuperAGI⚠️ Unmaintained. Developer-focused autonomous agent platform with GUI. Built-in resource management, file handling, and multi-tasking for running agents at scale.
AgentVerse (OpenBMB)⚠️ Unmaintained. Framework for building and evaluating multi-agent environments. Easily configure agent teams and measure collaborative performance.
Qwen-Agent (Alibaba)Agent framework tightly integrated with the Qwen model family. Optimized for function calling, code execution, RAG, and tool use with Qwen models.
AGiXTExtensible modular AI agent automation platform. Plugin system for swapping LLMs, memory backends, and tools. Highly customizable agent workflows.
DeusSelf-hosted personal AI assistant framework built around long-term memory, a self-improving evolution loop, and multi-agent orchestration. Backend-neutral (Claude Code, OpenAI/Codex, or fully local Ollama models), container-isolated agents, multi-channel (WhatsApp, Telegram, Slack, Discord, Gmail). MIT.

🎛 Fine-tuning Tools

📅 Last checked: June 23, 2026

Tools to fine-tune free models on your own data — all free and open-source.

NameDescription
UnslothFast memory-efficient fine-tuning. 2x faster, 50% less memory. Supports QLoRA, LoRA, full fine-tune.
AxolotlStreamlined fine-tuning framework supporting multiple model architectures and quantization methods.
LLaMA-FactoryEasy-to-use fine-tuning with web UI. Supports 100+ models, multiple training methods.
Hugging Face TRLTransformer Reinforcement Learning library. SFT, PPO, DPOTrainer, GRPOTrainer for aligning models.
TorchTune (Meta)Native PyTorch library for fine-tuning LLMs. Simple, extensible, efficient. ⚠️ Development wound down in 2025.
XTuner (InternLM)Efficient fine-tuning toolkit supporting QLoRA, LoRA, and full fine-tune with multiple model architectures.
Ludwig (Predibase)Declarative ML framework. Fine-tune models with a simple config file. GitHub
PyTorch LightningFree deep learning framework for training and fine-tuning. Simplifies distributed training, checkpointing, and logging. GitHub
Hugging Face AccelerateZero-config distributed training for PyTorch. Enables easy multi-GPU and TPU training with minimal code changes.
ColossalAIOpen-source distributed training system with parallelism strategies. Supports large model training on limited hardware.
JAX (Google)High-performance ML framework with automatic differentiation and JIT compilation. Powers many modern training pipelines.
Ray TrainDistributed training framework built on Ray. Supports PyTorch, TensorFlow, and JAX with automatic scaling.
Determined AIOpen-source ML training platform with hyperparameter search, GPU scheduling, and experiment tracking.

✨ Prompt Engineering Tools

📅 Last checked: June 23, 2026

Free tools for testing, managing, and optimizing prompts.

NameDescription
PromptfooOpen-source tool for prompt testing and evaluation. Systematic A/B testing of prompts. GitHub
Fabric (Daniel Miessler)Open-source framework for augmenting humans with AI. Library of curated prompts (patterns) for common tasks.
LangFuseOpen-source LLM engineering platform with prompt management, versioning, and evaluation. GitHub
OpenPrompt (THUNLP)⚠️ Unmaintained. Framework for prompt-learning research. Supports template and verbalizer design.
DSPy (Stanford)Framework for algorithmically optimizing LM prompts and weights. GitHub
AgentaOpen-source LLM platform for prompt management, evaluation, and deployment. GitHub
ChainForgeOpen-source visual programming environment for prompt engineering. Test prompts across multiple LLMs, compare responses, and evaluate robustness. GitHub
LatitudeOpen-source prompt engineering platform with versioning, playground, evaluation, and deployment as API endpoints. GitHub
DeepEvalOpen-source evaluation framework for LLM outputs. 50+ metrics, pytest integration, and CI/CD support for prompt regression testing.
PromptLayerPrompt versioning and monitoring platform. Tracks prompt versions, cost, latency, and model behavior. Free tier with 10K calls/month.
OpenPromptHubCommunity-driven prompt engineering platform. Discover, share, and contribute prompt patterns. Free and open-source.

📊 Datasets

📅 Last checked: June 23, 2026

Free, open datasets for training, fine-tuning, and evaluating models.

NameDescription
Hugging Face DatasetsThe standard hub for open datasets. 150,000+ datasets across all tasks.
Common CorpusMassive open-source dataset for training large language models. Gated dataset — requires Hugging Face login.
The Stack v2 (BigCode)Large-scale code dataset covering 619 programming languages. Permissive license.
FineWeb (Hugging Face)High-quality web dataset for LLM pre-training. 15T tokens.
Dolly (Databricks)15k instruction-response pairs for fine-tuning. CC-BY-SA.
OpenAssistant Conversations160k human-generated assistant conversations. Apache 2.0.
ShareGPT (RyokoAI)Real user-ChatGPT conversations for fine-tuning.
UltraChat (Sean C.)200k multi-turn conversations synthesized by ChatGPT.
No Robots (Hugging Face)10k high-quality human-written instructions. Apache 2.0.
RLAIF-V (OpenBMB)AI-generated preference data for RLHF. Apache 2.0.
MMLU / GSM8KStandard benchmarks for evaluation.

☁ Model Hosting Platforms

📅 Last checked: June 23, 2026

Free platforms that host models — run inference without downloading anything.

NameDescription
Hugging Face SpacesFree hosting for ML apps (Gradio, Streamlit). Thousands of community demos.
Hugging Face Inference Endpoints⚠️ Paid service — pay-as-you-go starting $0.06/hr.
Google Colab (Free Tier)Free GPU (T4, sometimes A100). Perfect for running models and fine-tuning.
Kaggle NotebooksFree GPU (T4 x2). 30 hours/week. Good for heavier workloads.
Lightning AI StudioFree tier with GPU access for development and prototyping.
ModalFree monthly credits for serverless GPU compute.
Replicate (Free Tier)Free credits for running community models.
DeepnoteFree tier with GPU for data science and ML notebooks.
Beam$30/mo free credits for serverless GPU compute. Fast cold starts (<1s), auto-scaling, Python SDK. Open-source runtime.
Cerebrium⚠️ Paid service — compute-based billing with sub-second cold starts.
Baseten⚠️ Paid service — serverless GPU inference. Truss open-source framework, auto-scaling.

🏖️ Core AI Execution Sandboxes

📅 Last checked: June 23, 2026

Free, isolated sandbox environments for executing AI agent code, running untrusted scripts, and building agent workflows — no infrastructure to manage.

NameDescription
E2BThe most popular sandbox for AI agents. $100 free credit (one-time), no credit card. Firecracker microVMs, 150ms cold starts, 20 concurrent sandboxes, 1-hour sessions. Python/JS SDKs. Docker MCP Catalog (200+ tools).
Novita AI Sandbox$100 free credits (90-day validity). 5 concurrent sandboxes, 1-hour max session, 2 vCPU / 4 GB RAM. Sub-200ms startup, per-second billing. Code execution, browser automation, computer use.
Hopx$200 free credits, no credit card. Firecracker microVMs, ~100ms cold start. Full Linux with file/exec/PTY access. Persistent state, unlimited runtime. Python, JavaScript, Bash, Go. Per-second billing.
InstaVM$50 free credits, no credit card. MicroVMs for AI agents with persistent state, networking, and secrets injection. Sub-200ms boot, SSH access, full Linux Desktop for browser automation.
OmniRun25 sandbox-hours/month free, no credit card. MicroVM isolation (own kernel per sandbox). ~250ms boot. 6 languages. Network blocked by default. Claude Managed Agents compatible.
SimpleSandbox1M credits/month free (~17 hours compute). 3 concurrent sandboxes, Firecracker microVMs, ~1s cold start. Per-second billing. 50% cheaper than E2B, no enterprise minimums.
SandboxAPI500 executions/month free. 12 languages (Python, Node, Go, Rust, Java, etc.). gVisor isolation, streaming output, persistent sessions. MCP-native — works with Claude Desktop, Cursor, VS Code.
Tensorlake2 concurrent sandboxes free forever. 1 core / 1 GB RAM / 10 GB disk, up to 2-hour sessions. Firecracker microVMs, SOC 2 Type 2. Unmetered sessions.
SmolVM (CelestoAI)Free & open-source (Apache 2.0). Self-hostable microVM sandboxes for AI agents. Sub-second boot, hardware isolation, browser sandbox, file sharing, snapshots. Python SDK. Run Claude Code, Codex, or Pi pre-installed.

📚 Learning Resources

📅 Last checked: June 23, 2026

Free courses, books, and tutorials for learning AI and LLMs.

NameDescription
Fast.aiCode-first deep learning education. Practical, free courses from fundamentals to advanced.
Hugging Face LLM CourseComprehensive free course on transformers, tokenizers, datasets, and deployment.
DeepLearning.AI Short CoursesFree short courses on LLMs, RAG, LangChain, and AI agents.
Full Stack Deep LearningFree course on ML engineering: training, deploying, and maintaining models.
Andrej Karpathy's CourseFrom-scratch neural network implementation videos.
Neural Networks: Zero to HeroYouTube series building neural networks from scratch.
LLM University (Cohere)Free course on LLMs, embeddings, and RAG.
Prompt Engineering Guide (DAIR.AI)Comprehensive free guide on prompt engineering techniques.
Anthropic CookbookFree recipes and patterns for working with Claude.
OpenAI CookbookFree examples and guides for the OpenAI API.

🏆 Resources & Leaderboards

📅 Last checked: June 23, 2026

NameDescription
PerplexityFree AI search and research assistant with real-time answers and source citations.
Hugging Face Open LLM LeaderboardThe primary benchmark for open-weight models. Updated regularly.
LMSYS Chatbot ArenaHuman preference rankings of models. Best source for real-world quality comparisons.
Artificial AnalysisIndependent benchmarks for speed, pricing, and quality across providers.
Hugging Face ModelsSearch 1M+ models. Filter by license, task, framework.
OpenRouter ModelsBrowse models available via API with pricing and free tiers.
Ollama LibraryBrowse models available for one-command local setup.
cheahjs/free-llm-api-resourcesCommunity-maintained list of free LLM API resources.
SweetTea⚠️ Site appears down. Community voting on model quality and preference. May be defunct.

👥 Communities

📅 Last checked: June 23, 2026

NameDescription
Hugging Face DiscordModel releases, discussions, and community support.
r/LocalLLaMAThe largest Reddit community for running local LLMs.
Ollama DiscordOllama community for local model enthusiasts.
LM Studio DiscordLM Studio community.
Hugging Face ForumsDiscussions on models, datasets, and Spaces.
r/MachineLearningGeneral ML/AI research and news.
Discord: AI AgentsCommunity for AI agent development and agentic frameworks.
r/OpenAIOfficial Reddit community for OpenAI models, API discussions, and releases.
r/artificialGeneral AI discussion covering research, news, and ethics.
OpenAI Developer ForumOfficial forum for OpenAI API developers. Share prompts, troubleshoot, and discuss best practices.
Nous Research DiscordCommunity for open-source AI development, Hermes models, and decentralized training (DisTrO).
Learn AI Together DiscordActive learning community with 10K+ members. Ask questions, find teammates, and share projects.

License

CC0

To the extent possible under law, the author has waived all copyright and related or neighboring rights to this work.

关于 About

A curated list of free AI models, APIs, and tools you can use without paying a cent.

语言 Languages

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
84
Total Commits
峰值: 47次/周
Less
More

核心贡献者 Contributors