The most comprehensive, structured guide to AI agent frameworks, tools, and resources.
Updated weekly. Compared side-by-side. Built for developers who ship.
- Orchestration Frameworks
- Coding Agents
- Memory and Context
- Multi-Agent Systems
- Agent Communication Protocols
- Browser and Computer Use Agents
- Agent Tooling and Infrastructure
- Low and No-Code Builders
- Voice and Multimodal Agents
- Safety Guardrails and Observability
- Agent Interfaces and UIs
- Agent Deployment and Hosting
- Agent Evaluation and Benchmarks
- Learning Resources
- Agent Communication
- Data Pipeline and Workflow
- Secure Execution Environments
- Creative AI Agents
- Customer Support and CRM Agents
- Voice Agent Platforms
- Deep Research Agents
- Prompt-to-App Builders
- Multi-Agent Consumer Platforms
- Open-Source Models for Agents
- Market Stats and Benchmarks 2026
- Local and Self-Hosted AI
- AI Governance and Compliance
- Cybersecurity Agents
- Healthcare and Therapy Agents
- Newsletters and Communities
- Changelog
- Star History
- Agency Swarm - Orchestrates multi-agent systems built on the OpenAI Assistants API with role-based collaboration (🏷️
PythonOpenAISDK). - AgentScope - Alibaba multi-agent framework with distributed deployment and fault tolerance for production use (🏷️
PythonAlibabaSDK). - Agno - Multi-agent framework with a runtime and control plane for managing agent deployments at scale (
PythonFastAPICloud) (🏷️PythonFastAPICloud). - Anthropic SDK - Official Claude SDK with tool use, computer control, and streaming for building Claude-native agents (🏷️
PythonAnthropicSDK). - AutoGen - Event-driven multi-agent framework merged with Semantic Kernel for production workflows (🏷️
PythonSemantic KernelSDK). - AXME - Durable multi-agent coordination with crash recovery, human approval gates, and kill switch across languages (🏷️
PythonMulti-AgentSDK). - Bernstein - Deterministic orchestrator for parallel coding agents with test-driven verification using zero LLM tokens on coordination (🏷️
PythonMulti-AgentSDK). - CAMEL - Role-based multi-agent simulation framework for collaborative reasoning and task completion (🏷️
PythonResearchSDK). - CopilotKit - The open-source AI copilot platform for React, allowing developers to embed custom AI agents directly into their applications (🏷️
TypeScriptReactSDK). - CrewAI - Role-playing agent orchestration for collaborative agent teams (🏷️
PythonLangChainSDK). - DSPy - Programmatically optimizes LLM prompts and chains using machine learning instead of manual prompt engineering (🏷️
PythonStanfordSDK). - Google ADK - Modular agent dev kit integrating Gemini and Vertex AI natively (🏷️
PythonVertex AISDK). - Haystack - Production-ready AI orchestration framework focused on building customizable LLM applications and RAG pipelines (🏷️
PythonRAGSDK). - LangGraph - Enterprise framework for stateful, graph-based agent workflows (🏷️
PythonLangChainSDK). - Letta - Formerly MemGPT. Stateful agents with built-in long-term memory and a REST API server (🏷️
PythonRESTSDK). - LlamaIndex - The leading framework for connecting LLMs to your data, with powerful indexing and retrieval capabilities (🏷️
PythonRAGSDK). - MagiC - Kubernetes for AI agents that manages any agent from any framework with routing, cost control, and circuit breaker (🏷️
GoKubernetesPlatform). - Mastra - Opinionated TypeScript framework with RAG, observability, and MCP support built in (🏷️
TypeScriptNext.jsSDK). - MicroAgent - Lightweight agent framework with self-editing prompts and code for minimal-overhead agent loops (🏷️
TypeScriptLightweightSDK). - Miyabi - Issue-Driven Development framework with 7 coding and 14 business agents plus MCP 172+ tools using GitHub as OS (🏷️
TypeScriptGitHubSDK). - Modus - Serverless framework for high-throughput agent workloads with minimal cold starts (🏷️
WebAssemblyHypermodeServerless). - Open-AutoGLM - Open-source phone agent model and framework for building mobile device automation agents (🏷️
PythonMobileSDK). - OpenAI Agents SDK - Lightweight multi-agent SDK with tracing and guardrails from OpenAI (🏷️
PythonOpenAISDK). - Portia AI - Reliable agent framework focused on production stability with structured output guarantees (🏷️
PythonProductionSDK). - PraisonAI - Production multi-agent framework with self-reflection, MCP integration, and workflow automation (🏷️
PythonMulti-AgentCLI). - PydanticAI - Type-safe agent framework from the Pydantic team with a FastAPI-style developer experience (🏷️
PythonPydanticSDK). - Semantic Kernel - Microsoft enterprise SDK for Python, C#, and Java with modular plugins, memory, and goal planning (🏷️
C#MicrosoftSDK). - Smolagents - Hugging Face code-first framework where agents write and execute Python instead of JSON tool calls (🏷️
PythonHugging FaceSDK). - Strands Agents SDK - AWS model-driven agent SDK with native Bedrock integration (🏷️
PythonAWS BedrockSDK). - Upsonic - Minimalist agent framework with MCP support and rapid setup for quick prototyping (🏷️
PythonMCPSDK). - Vercel AI SDK - Streaming-first primitives for AI UIs with React Server Components and edge runtime support (🏷️
TypeScriptNext.jsSDK). - VoltAgent - TypeScript agent framework with built-in observability and a self-improving context engine (🏷️
TypeScriptReactSDK).
- Aider - Terminal-first pair programmer that edits code in local repos, preserves Git history, and supports multi-file changes (🏷️
PythonCLILocal). - Amazon Q Developer - AWS-native AI coding assistant with Lambda, CloudWatch, infrastructure support, and security scanning (🏷️
PythonAWSIDE). - AutoGPT - Mature autonomous agent platform with Forge framework and public benchmarks for evaluating agent capabilities (🏷️
PythonForgeCLI). - Caliber - CLI that fingerprints projects and generates AI agent configs like CLAUDE.md and Cursor rules with quality scoring (🏷️
PythonCLILocal). - Claude Code - Terminal-first agentic coding tool with multi-file edits, test running, and Git operations baked in (🏷️
TypeScriptCLILocal). - Cline - Autonomous coding agent in your IDE that creates/edits files, runs commands, and uses the browser with permission-gated steps (🏷️
TypeScriptVS CodeIDE). - CodeRabbit - AI-powered PR reviewer with inline suggestions, security scanning, and automatic code quality checks (🏷️
CloudGitHubWeb). - Codex CLI - OpenAI's lightweight, open-source terminal coding agent with fast execution and strong benchmark scores (🏷️
PythonCLILocal). - Codex-CLI - CLI tool that turns natural language commands into Bash, ZShell, and PowerShell equivalents (🏷️
C#CLILocal). - Continue - Source-controlled AI checks enforceable in CI, powered by the open-source Continue CLI (🏷️
TypeScriptVS CodeIDE). - Cursor - AI-native IDE (VS Code fork) with deep codebase awareness, multi-file refactoring, and agentic workflows (🏷️
TypeScriptElectronDesktop). - Devika - Open-source AI software engineer that understands high-level instructions and writes full-featured code autonomously (🏷️
PythonDockerWeb). - Devin - Fully autonomous AI software engineer that plans, codes, tests, and deploys in a cloud sandbox (🏷️
CloudWebSaaS). - gemini-cli - Open-source AI agent that brings the power of Gemini directly into your terminal (🏷️
GoCLILocal). - GitHub Copilot - Agent Mode in VS Code with Copilot Workspace for issue-to-PR workflows across Claude, GPT, and Gemini (🏷️
TypeScriptVS CodeIDE). - Goose - Open-source extensible AI agent that goes beyond code suggestions, installs, executes, edits, and tests with any LLM (🏷️
RustCLILocal). - JetBrains AI - Deep AI integration across all JetBrains IDEs with context-aware completions and refactoring (🏷️
KotlinJetBrainsIDE). - Kilo Code - Structured coding agent with tighter context management and reduced hallucinations for VS Code (🏷️
TypeScriptVS CodeIDE). - Kiro - Spec-driven development agent that writes specs, auto-generates tasks, implements code, and automates DevOps workflows (🏷️
CloudAWSIDE). - Open Interpreter - Execute code locally via natural-language model instructions with a ChatGPT-like interface (🏷️
PythonCLILocal). - opencode - Open-source coding agent available as a desktop application with a visual interface (🏷️
TypeScriptElectronDesktop). - OpenCode - BYOK terminal coding agent for developers seeking an open-source Cursor alternative (🏷️
TypeScriptCLILocal). - OpenHands - AI-driven development platform that writes, tests, and deploys code autonomously (🏷️
PythonDockerWeb). - PR-Agent - Open-source AI PR reviewer that auto-describes, reviews, and improves pull requests (🏷️
PythonGitHubCLI). - Qodo - AI code review platform with context-aware PR validation and security analysis (🏷️
CloudEnterpriseWeb). - RooCode - Cline fork with structured modes and reduced hallucinations for more reliable code generation (🏷️
TypeScriptVS CodeIDE). - Snyk Code - AI-powered security scanner with real-time vulnerability detection in agent-generated code (🏷️
CloudSecurityPlatform). - Sourcegraph Cody - AI coding assistant that excels at large codebases with an enterprise context engine (🏷️
TypeScriptEnterpriseIDE). - SWE-agent - Takes a GitHub issue and tries to automatically fix it. Also used for cybersecurity and competitive coding (🏷️
PythonDockerCLI). - Tabnine - Privacy-first AI code completion with on-premise deployment and codebase fine-tuning options (🏷️
TypeScriptEnterpriseIDE). - TaskWeaver - A code-first agent framework from Microsoft for seamlessly planning and executing data analytics tasks (🏷️
PythonJupyterCLI). - Windsurf - AI-native IDE with Cascade agent for multi-step autonomous tasks and team workflows (🏷️
TypeScriptElectronDesktop).
- Acontext - Manages agent skills and long-term memory as a layered data structure for persistent context (🏷️
PythonSDKLocal). - Chroma - Lightweight, embeddable vector store for building memory-augmented AI agents with fast semantic retrieval (🏷️
PythonTypeScriptSDK). - cognee - Knowledge engine for AI agent memory, set up in 6 lines of code with graph-based knowledge extraction (🏷️
PythonNeo4jSDK). - Cortex Memory - Full-stack solution for agent memory covering extraction, vector search, and optimization (🏷️
PythonVector DBSDK). - graphiti - Build real-time knowledge graphs for AI agents with automatic entity extraction and linking (🏷️
PythonKnowledge GraphSDK). - iGPT - Email Intelligence API that converts email threads into reasoning-ready JSON for agent consumption (🏷️
CloudEmailAPI). - LanceDB - Serverless vector search database embedded directly in the agent process with no infrastructure needed (🏷️
RustPythonSDK). - Langmem - Helps agents learn and adapt from their interactions over time with persistent memory (🏷️
PythonLangChainSDK). - Lorg - Permanent intelligence archive for AI agents with structured contributions and cryptographically backed trust scores (🏷️
PythonArchiveSDK). - Mem0 - Memory layer for AI applications with long-term, short-term, and semantic memory extraction (🏷️
PythonVectorCloud). - Memvid - Replace complex RAG pipelines with a serverless, single-file memory layer for instant retrieval (🏷️
PythonMultimodalSDK). - Milvus - Scales vector search to billions of embeddings for large-scale agent knowledge bases (🏷️
GoPythonPlatform). - Motorhead - Manages conversation context windows for agents with automatic background summarization (🏷️
RustPythonSDK). - Nex - Organizational context and memory for AI agents with 60-tool MCP server and 100+ integrations (🏷️
TypeScriptMCPPlatform). - Pathway - Live data RAG engine with real-time streaming for agents that need up-to-the-second knowledge (🏷️
PythonStreamingSDK). - Pinecone - Managed serverless vector database with low-latency retrieval for production agent knowledge bases (🏷️
CloudVector DBAPI). - Qdrant - High-performance vector similarity search engine with rich payload filtering for agent memory (🏷️
RustPythonSDK). - RAGFlow - Open-source RAG engine with agent capabilities and deep document understanding for knowledge bases (🏷️
PythonRAGPlatform). - SimpleMem - Efficient lifelong memory for LLM agents supporting both text and multimodal inputs (🏷️
PythonSDKLocal). - Supermemory - Extremely fast and scalable memory engine and API designed for the AI era (🏷️
TypeScriptVector DBWeb). - Weaviate - Stores and searches vector embeddings with hybrid keyword and semantic retrieval for agent knowledge (🏷️
GoPythonPlatform). - Zep - Enriches agent long-term memory with automatic summarization, entity extraction, and search (🏷️
PythonTypeScriptSDK).
- AgentVerse - Framework for building custom multi-agent environments to accomplish collaborative tasks (🏷️
PythonBenchmarkSDK). - EvoAgentX - Evaluates and evolves agentic workflows over time using automatic optimization (🏷️
PythonOptimizationSDK). - Hivemoot - Autonomous agent teams that collaboratively build software on GitHub (🏷️
PythonGitHubSDK). - MetaGPT - Simulates a full software company workflow from requirements to PRs using role-playing agents (🏷️
PythonRole-PlayingSDK). - Swarm - Lightweight framework for agent handoffs, context variables, and function calling patterns from OpenAI (🏷️
PythonOpenAISDK). - Swarms Framework - Multi-agent orchestration for production use cases with scalability and reliability at its core (🏷️
PythonProductionSDK).
The protocol layer that enables agents to discover tools, communicate with each other, and interoperate across ecosystems.
| Protocol | Purpose | Creator | Status |
|---|---|---|---|
| MCP | Agent-to-tool | Anthropic | Standard |
| A2A | Agent-to-agent | Growing | |
| ACP | Agent communication | IBM/BeeAI | Early |
- A2A Protocol - Google's open protocol enabling AI agents to communicate, collaborate, and delegate tasks across frameworks (🏷️
ProtobufStandardRPC). - Agentify - CLI tool that transforms OpenAPI specs into 9 agent formats including MCP, AGENTS.md, and Claude tools (🏷️
TypeScriptCLIMCP). - Arcade AI - Tool-use platform with authentication, authorization, and logging for agent-tool interactions (🏷️
PythonCloudAPI). - Composio - Integration platform with 250+ pre-built tool connectors for AI agents and LLMs (🏷️
TypeScriptCloudAPI). - Docker MCP - Docker's MCP gateway CLI plugin for running MCP servers in isolated containers (🏷️
GoDockerCLI). - Entroly - Context engineering engine achieving 100% codebase visibility with 78% fewer tokens using knapsack-optimal selection (🏷️
RustMCPSDK). - GNAP - Git-Native Agent Protocol that coordinates AI agent teams with 4 JSON files in any Git repo with no server required (🏷️
JSONGitProtocol). - HCS Agent Protocol - Hedera open standards for agent identity with trustless P2P communication and 187K+ verified agents (🏷️
TypeScriptHederaProtocol). - MCP Registry - Official Model Context Protocol specification and server implementations for standardized tool access (🏷️
JSONStandardRegistry). - mcp-nest - Unified Model Context Protocol (MCP) server for executing code and managing files (🏷️
PythonMCPCLI). - Toolhouse - Cloud-hosted tool infrastructure for agents with optimized execution and low-latency access (🏷️
PythonCloudAPI). - Zapier MCP Server - Connect agents to 7,000+ app integrations via MCP, powered by Zapier's automation platform (🏷️
CloudZapierAPI).
- Agent S2 - Open-source GUI automation framework for building desktop and browser agents with vision (🏷️
PythonVisionSDK). - AgentQL - AI-powered web scraping and automation with a semantic query language for page elements (🏷️
PythonPlaywrightSDK). - Airtop - Enterprise-grade cloud browser infrastructure for AI agent automation at scale (🏷️
CloudEnterprisePlatform). - Amazon Nova Act - AWS-native browser and desktop automation agent with enterprise reliability guarantees (🏷️
CloudAWSPlatform). - Browser Use - Open-source framework to let LLMs navigate and interact with any website programmatically (🏷️
PythonPlaywrightSDK). - Browserbase - Scalable headless browser infrastructure purpose-built for running AI agents in production (🏷️
CloudInfrastructurePlatform). - Dia Browser - AI-native browser from Atlassian and the Browser Company with built-in agent workflows (🏷️
CloudDesktopBrowser). - Fellou - Transparent agentic browser with visual workflow editing and persistent agentic memory (🏷️
CloudDesktopBrowser). - Genspark - On-device AI browser with 169+ local models that runs without internet connection (🏷️
CloudDesktopBrowser). - Grok Computer - Upcoming xAI desktop agent with mouse control and application automation capabilities (🏷️
CloudDesktopPlatform). - LaVague - Large Action Model framework to turn natural language instructions into browser automation (🏷️
PythonSeleniumSDK). - Manus - Autonomous digital employee with Browser Operator extension acquired by Meta for web-based task execution (🏷️
CloudMetaPlatform). - MultiOn - Reliable web automation API with CAPTCHA handling for building production browser agents (🏷️
CloudAPIPlatform). - onUI - Open-source browser extension and MCP server for annotation-first UI pair programming with AI agents (🏷️
TypeScriptMCPExtension). - OpenAI Atlas - AI-native browser with Agent Mode for autonomous web task completion (🏷️
CloudOpenAIBrowser). - OpenAI Operator - ChatGPT autonomous web agent with human checkpoints using Computer Use Agent technology (🏷️
CloudOpenAIPlatform). - Plasmate - Headless browser compiling HTML to structured JSON with 17.5x compression and 13 MCP tools (🏷️
RustMCPSDK). - Playwright MCP - MCP server wrapping Playwright for seamless browser automation integration with LLM agents (🏷️
TypeScriptMCPSDK). - Skyvern - Automate browser-based workflows with computer vision and LLMs, no brittle selectors needed (🏷️
PythonPlaywrightCloud). - Stagehand - AI web browsing framework built on Playwright with natural-language selectors and actions (🏷️
TypeScriptPlaywrightSDK).
Sandboxes, web scrapers, browser automation, and networking layers that agents depend on.
- AgentDock - Framework for building and deploying production-ready AI agents with composable node architecture (🏷️
PythonDockerPlatform). - Crawl4AI - Extracts structured data from web pages using LLM-friendly output formats optimized for agent ingestion (🏷️
PythonPlaywrightSDK). - Docling - Parses PDFs, DOCX, and slides into structured text with deep layout understanding for document agents (🏷️
PythonPDFSDK). - E2B - Cloud sandboxes for AI agents to run code securely in isolated environments (🏷️
TypeScriptCloudSandbox). - Engram - Universal bridge for multi-protocol AI agent systems with automated semantic mapping (🏷️
PythonNetworkingSDK). - Firecrawl - Web scraping API built for LLMs that converts websites to clean, structured markdown (🏷️
TypeScriptCloudAPI). - Jina Reader - Converts any URL to LLM-ready clean text via a simple API prefix for agent ingestion (🏷️
TypeScriptCloudAPI). - LlamaParse - GenAI-native document parser designed to extract complex tables and layouts for RAG pipelines (🏷️
PythonCloudAPI). - Marker - Converts PDF documents to markdown with high accuracy for tables, equations, and figures (🏷️
PythonPDFCLI). - Notte - Browser automation engine optimized for production AI pipelines (🏷️
PythonBrowserSDK). - Pilot Protocol - Networking stack for distributed agent systems with encrypted tunnels (🏷️
GoNetworkingSDK). - Playwright - Automates Chromium, Firefox, and WebKit browsers with a single cross-language API for agent-driven testing (🏷️
TypeScriptPythonSDK). - ScrapeGraphAI - Python web-scraping library that uses LLMs to build intelligent scraping pipelines from natural-language instructions (🏷️
PythonLangChainSDK). - Surya - Runs OCR and layout detection on documents in 90+ languages for multilingual document agents (🏷️
PythonPDFCLI). - Tavily - Search API purpose-built for LLM agents providing real-time, accurate web data with source citations (🏷️
PythonCloudAPI). - Unstructured - Ingests and preprocesses documents across 25+ file types for downstream LLM and agent pipelines (🏷️
PythonPipelineSDK).
- Activepieces - Open-source Zapier alternative with AI-powered automation nodes and visual workflow builder (🏷️
TypeScriptDockerWeb). - AgentGPT - Deploy AI agents in the browser with zero local setup required (🏷️
TypeScriptNext.jsWeb). - cstack - Architecture pattern for autonomous agents using Claude Cowork, Notion, and MCP with no custom infrastructure (🏷️
CloudAnthropicPattern). - Dify - Open-source LLM app development platform with visual workflow builder and RAG orchestration (🏷️
PythonFastAPIWeb). - FastAgency - Deploys multi-agent workflows as production APIs with minimal configuration and setup (🏷️
PythonFastAPISDK). - Flowise - Open-source drag-and-drop UI to build customized LLM flows, RAG pipelines, and agent systems (🏷️
TypeScriptReactWeb). - Langflow - Visual drag-and-drop builder for LLM workflows, RAG agents, and multi-step pipelines (🏷️
PythonFastAPIWeb). - Lindy - No-code AI agent platform with 3000+ app integrations for business workflow automation (🏷️
CloudNo-CodePlatform). - Make - Visual workflow automation platform with AI capabilities and drag-and-drop scenario builder (🏷️
CloudNo-CodePlatform). - Mission Control - Cockpit for managing AI agent swarms with autonomous daemon, Field Ops, and approval workflows (🏷️
PythonMulti-AgentPlatform). - n8n - Open-source workflow automation with AI agent nodes combining visual and code-based orchestration (🏷️
TypeScriptDockerPlatform). - Relevance AI - No-code AI agent builder for sales, support, and research use cases with team collaboration (🏷️
CloudNo-CodePlatform). - Rivet - Visual AI workflow builder with drag-and-drop interface for designing agent pipelines (🏷️
TypeScriptReactDesktop). - Temporal - Durable execution platform for long-running agent workflows with automatic retry and state persistence (🏷️
GoProductionPlatform). - Wordware - Web-hosted IDE where domain experts collaborate with AI engineers to build agent workflows (🏷️
TypeScriptNext.jsWeb). - Zapier AI - Connects 7000+ apps with natural language workflow creation for no-code agent automation (🏷️
CloudNo-CodePlatform).
- Agentset - Production RAG platform with reasoning, hybrid search, and full multimodal support (🏷️
PythonMultimodalPlatform). - LiveKit Agents - Framework for building real-time, multimodal AI agents with voice, video, and data channels (🏷️
PythonLiveKitSDK). - Pipecat - Open-source framework for voice and multimodal conversational AI with streaming pipelines (🏷️
PythonWebRTCSDK). - Rasa - Open-source conversational AI framework with self-hosted NLU training and dialogue management (🏷️
PythonNLUSDK). - Vapi - Platform for building voice AI agents with low-latency speech-to-speech capabilities (🏷️
PythonCloudAPI). - Vocode - Open-source framework for building voice-based LLM agent applications with streaming support (🏷️
PythonVoiceSDK).
- Agent OS - Kernel architecture for governing autonomous AI agents with policy enforcement (🏷️
PythonPolicySDK). - AgentDoG - Diagnostic guardrails that analyze full agent execution trajectories to detect instruction hijacking and tool misuse (🏷️
PythonDiagnosticsSDK). - AgentGuard - Runtime observability and guardrails for AI agents with loop detection and anomaly alerts (🏷️
PythonCyberArkSDK). - agenttrace - Local-first TUI for AI coding agent session observability with tokens, cost, latency, tool failures, anomalies, reports, diffs, and CI health gates (🏷️
GoObservabilityCLI). - APort Agent Guardrails - Pre-action authorization plugin for agent frameworks with policy-based access control (🏷️
PythonAuthorizationSDK). - Arize Phoenix - Open-source observability platform built on OpenTelemetry for tracing, evaluating, and debugging AI agents (🏷️
PythonOpenTelemetryPlatform). - Braintrust - Eval-driven development platform with experiment tracking and prompt optimization for agent quality (🏷️
TypeScriptEvaluationPlatform). - DriftGuard - Semantic memory guardrails using causal graphs to prevent agents from repeating past failures (🏷️
PythonMemorySDK). - Guardrails AI - Adds structural, type, and quality guarantees to LLM outputs for reliable agent responses (🏷️
PythonValidationSDK). - Helicone - Open-source LLM observability with one-line integration for cost tracking and prompt analytics (🏷️
TypeScriptObservabilityPlatform). - Lakera Guard - Real-time protection against prompt injection, data leakage, and toxicity in agent interactions (🏷️
CloudSecurityAPI). - Laminar - Open-source observability and analytics platform purpose-built for the full lifecycle of AI agents (🏷️
RustObservabilityPlatform). - Langfuse - Open-source LLM observability platform for tracing, prompt versioning, and LLM-as-a-judge evaluations (🏷️
TypeScriptNext.jsPlatform). - LangSmith - LangChain platform for tracing, testing, and evaluating agent performance with production monitoring (🏷️
CloudLangChainPlatform). - LLM Guard - Security toolkit for scanning LLM inputs and outputs to prevent prompt injection and data leaks (🏷️
PythonSecuritySDK). - Logfire - Python-native observability from the Pydantic team with deep integration for high-performance agent monitoring (🏷️
PythonPydanticPlatform). - model-watchdog - Auto-rollback system for AI agent config changes that monitors health and reverts on failure (🏷️
PythonMonitoringCLI). - NeMo Guardrails - NVIDIA programmable guardrails toolkit for controlling and securing LLM-powered agent conversations (🏷️
PythonNVIDIASDK). - Orchard Kit - Modules for agent runtime security, self-audit trails, and collective cognition patterns (🏷️
PythonSecuritySDK). - OWASP Top 10 for Agentic Apps - Security framework covering goal hijacking, tool misuse, and cascading failure mitigations for agents (🏷️
PolicySecurityFramework). - Prism Scanner - Open-source security scanner for AI agent skills, plugins, and MCP servers with taint tracking (🏷️
PythonSecurityCLI). - Rebuff - Self-hardening prompt injection detection system for securing agent inputs against adversarial attacks (🏷️
PythonSecuritySDK).
Frontend workspaces and chat interfaces with built-in agent plugins and tool-use capabilities.
- AnythingLLM - All-in-one AI application with RAG, agents, and multi-model support for desktop and Docker (🏷️
TypeScriptDockerDesktop). - DB-GPT - Data interaction platform with local LLM support for 100% private database and analytics agents (🏷️
PythonDatabaseWeb). - KinBot - Self-hosted AI agent platform with persistent memory, 23+ providers, plugin store, and cron scheduling on SQLite (🏷️
PythonSelf-HostedPlatform). - LibreChat - Self-hosted multi-model chat interface supporting all major AI providers with access control (🏷️
TypeScriptDockerWeb). - LobeChat - Modern, open-source AI chat framework with a massive plugin ecosystem for autonomous agent capabilities (🏷️
TypeScriptNext.jsWeb). - OpenClaw - Self-hosted agent across WhatsApp, Telegram, Slack, Discord, and Signal with 5700+ community skills (🏷️
TypeScriptMulti-ChannelPlatform). - openclaw-starter - Fork-and-run template for 24/7 autonomous AI agents with pre-configured SOUL.md and memory system (🏷️
TypeScriptTemplatePlatform). - OpenWebUI - Extensible local AI interface with built-in RAG, tool use, and support for multi-agent workflows (🏷️
TypeScriptSvelteWeb).
- AWS Bedrock AgentCore - Managed AWS infrastructure for Bedrock-based agents with compliance, scaling, and monitoring built in (🏷️
TypeScriptAWSPlatform). - Cerebras Inference - Fastest LLM inference delivering 1000+ tokens per second on Llama 3.3 70B with a free tier (🏷️
CloudInferenceAPI). - Fireworks AI - Serverless LLM inference with fine-tuning, RAG support, and free credits for rapid prototyping (🏷️
CloudInferenceAPI). - Groq Cloud - Ultra-fast LPU-based LLM inference for Mixtral, Llama, and Gemma with a free API tier (🏷️
CloudInferenceAPI). - Modal - Serverless GPU compute purpose-built for AI workloads with fast cold starts and Python-native deployment (🏷️
PythonServerlessCloud). - Northflank - Full-stack platform with GPU orchestration, Git-based CI/CD, and bring-your-own-cloud support (🏷️
KubernetesCloudPlatform). - Railway - One-click deploy from GitHub with persistent volumes and databases for stateful agent deployments (🏷️
NixCloudPlatform). - Together AI - Inference API hosting 200+ open models with fast generation and a free tier for developers (🏷️
CloudInferenceAPI). - Trigger.dev - Background job platform with cron, webhook, and event triggers purpose-built for long-running agent tasks (🏷️
TypeScriptServerlessCloud).
- AgentBench - Comprehensive benchmark for evaluating LLMs as agents across 8 distinct environments (🏷️
PythonBenchmarkSDK). - ARC-AGI-2 - Frontier benchmark for measuring general intelligence capabilities in AI agents beyond pattern matching (🏷️
PythonBenchmarkResearch). - GAIA Benchmark - Benchmark for General AI Assistants measuring real-world reasoning and tool use (🏷️
JSONBenchmarkDataset). - Inspect AI - Framework for evaluating large language models with composable tasks and scoring (🏷️
PythonEvaluationSDK). - SWE-bench - Benchmark for evaluating LLMs on real-world software engineering tasks from GitHub issues (🏷️
PythonDockerBenchmark). - Terminal-Bench - Benchmark for evaluating terminal-based coding agent performance across real-world tasks (🏷️
PythonBenchmarkCLI). - WebArena - Benchmark for web agent evaluation using real websites with realistic task completion metrics (🏷️
PythonBenchmarkWeb).
- AgentBench: Evaluating LLMs as Agents - The benchmark paper for evaluating LLMs as agents across diverse environments (🏷️
PaperArXivPDF). - AI Agents in LangGraph - Short course on building production agents with LangGraph by Andrew Ng's platform (🏷️
CourseDeepLearning.aiWeb). - AI Engineering by Chip Huyen - Comprehensive guide on AI systems design and deployment covering agent architecture patterns (🏷️
BookO'ReillyPrint). - Anthropic Cookbook - Collection of Claude agent recipes and integration patterns from Anthropic (🏷️
GuideAnthropicGitHub). - Building Effective Agents - Anthropic's guide on agent design patterns, evaluation strategies, and production best practices (🏷️
GuideAnthropicWeb). - Google ADK Documentation - Official documentation for Google Agent Development Kit with tutorials and best practices (🏷️
GuideGoogleWeb). - Hugging Face Agents Course - Open-source course on building AI agents using Hugging Face tools and models (🏷️
CourseHuggingFaceWeb). - LATS: Language Agent Tree Search - Combines Monte Carlo tree search with LLM reasoning for complex multi-step planning tasks (🏷️
PaperPythonResearch). - LLM Powered Autonomous Agents - Deep breakdown of LLM-powered agent components: planning, memory, and tool use (🏷️
BlogWengWeb). - Microsoft GenAI for Beginners - 21-lesson course on generative AI concepts and agent development from Microsoft (🏷️
CourseMicrosoftGitHub). - OpenAI Cookbook - Practical guides and recipes for building with OpenAI APIs including agent patterns (🏷️
GuideOpenAIGitHub). - Prompt Engineering Guide - Community-maintained guide covering prompt engineering techniques and agent strategies (🏷️
GuideDAIRWeb). - ReAct: Synergizing Reasoning and Acting in Language Models - The foundational paper behind the ReAct prompting pattern used in most agent frameworks (🏷️
PaperArXivPDF). - Reflexion - Research framework letting agents learn from past mistakes via iterative verbal self-reflection loops (🏷️
PaperPythonResearch). - Tree of Thoughts - Explores multiple parallel reasoning paths before committing to a final answer for complex problems (🏷️
PaperPythonResearch).
Deep dives: patterns/react-pattern.md • patterns/plan-and-execute.md • patterns/reflection-loop.md • patterns/multi-agent.md • workflows/multi-step-reasoning.md
Capabilities that let agents send messages, notifications, and reports across channels.
- Apprise - Sends agent notifications to 100+ services including Slack, Discord, and email from a single Python interface (🏷️
PythonMulti-ChannelSDK). - Discord.py - Lets agents interact with Discord channels and servers for team-facing real-time communication (🏷️
PythonDiscordSDK). - FastAPI-Mail - Adds async email sending to FastAPI-based agent services with SMTP and OAuth2 support (🏷️
PythonFastAPISDK). - Novu - Routes agent notifications across email, SMS, push, and chat from a unified API and workflow engine (🏷️
TypeScriptPythonPlatform). - Ntfy - Pushes real-time agent notifications to phones and desktops via a dead-simple HTTP API (🏷️
GoSelf-hostedAPI). - Resend - Sends transactional emails from agent workflows with a clean, developer-first API (🏷️
TypeScriptCloudAPI). - Slack Bolt - Enables agents to send, receive, and react to Slack messages with event-driven listeners (🏷️
PythonSlackSDK). - Twilio - Sends SMS and voice calls from agent workflows to any phone number worldwide (🏷️
PythonCloudAPI).
Frameworks for orchestrating data transformations and long-running agent-driven workflows.
- AI for Database - Connects to any database for plain English queries with self-refreshing dashboards and automated workflows (🏷️
CloudDatabasePlatform). - Burr - Tracks and manages multi-step agent state machines with built-in observability and persistence (🏷️
PythonStateGraphSDK). - Dagster - Manages data assets and pipelines with built-in lineage tracking for data-intensive agent systems (🏷️
PythonPipelinePlatform). - GPT Researcher - Conducts multi-source autonomous web research and produces cited, structured long-form reports (🏷️
PythonLangChainWeb). - Hamilton - Defines modular data transformations as Python functions wired automatically into a DAG for agent pipelines (🏷️
PythonDAGSDK). - Hex AI - Collaborative data platform with AI-powered analysis and notebook-based data exploration for teams (🏷️
CloudDataPlatform). - Julius AI - Upload CSV or Excel files and analyze data using natural language questions for instant insights (🏷️
CloudDataWeb). - PandasAI - Chat with your data using natural language queries that convert to Pandas and SQL operations (🏷️
PythonDataSDK). - Prefect - Orchestrates agent workflows and data pipelines with retries, caching, and built-in observability (🏷️
PythonPipelinePlatform). - Signals CLI - Intent signal CLI detecting LinkedIn engagers, keyword posters, and funding events with JSON output for agent pipelines (🏷️
CloudCLIPlatform). - STORM - Generates Wikipedia-style long-form articles by autonomously researching and synthesizing multiple sources (🏷️
PythonStanfordResearch).
Deep dives: workflows/data-pipeline.md • workflows/research-agent.md
Infrastructure for running agent-generated code safely in isolated, sandboxed, or serverless environments.
💡 Quick Decision Guide:
- Fastest hosted sandbox: E2B, Daytona
- Self-hosted VM isolation: Firecracker, Kata Containers, gVisor
- Container-based: Docker, Podman, Sysbox
- Serverless: AWS Lambda, Google Cloud Run, Cloudflare Workers
| Tool | Isolation | Cold Start | Languages |
|---|---|---|---|
| E2B | VM (Firecracker) | ~300ms | Python, TS, Go |
| Daytona | Container/VM | ~2s | Python, TS, Go |
| Firecracker | microVM | ~125ms | REST API |
| Docker | Namespace+cgroup | ~500ms | All |
| AWS Lambda | Firecracker VM | ~200ms | Python, TS, Go |
| Cloudflare Workers | V8 isolate | ~0ms | TS, Rust (WebAssembly) |
- agentnotary - Notarize, govern, and audit AI agents — cryptographic seal, runtime guard, EU AI Act docs, and adversarial fuzzer (🏷️
PythonComplianceCLI). - AWS Lambda - Serverless function runtime with Firecracker isolation for executing short-lived agent tasks at scale (🏷️
PythonAWSServerless). - Cloudflare Workers - Ultra-low-latency V8-isolate serverless runtime at the edge for lightweight agent task execution (🏷️
TypeScriptCloudflareServerless). - CodeSandbox SDK - Provides forked, snapshotted microVM environments for real-time collaborative agent code execution (🏷️
TypeScriptPythonSDK). - Daytona - Self-hostable git-based developer environment manager for sandboxed agent code execution with fast provisioning (🏷️
PythonTypeScriptPlatform). - Docker - Standard container runtime providing namespace and cgroup isolation for packaging and running agent workloads (🏷️
GoDockerPlatform). - Firecracker - Lightweight microVM technology with 125ms boot time used by AWS Lambda and E2B under the hood (🏷️
RustAWSInfrastructure). - Google Cloud Run - Fully managed serverless platform with gVisor isolation for running stateless agent containers at scale (🏷️
PythonGCPServerless). - gVisor - User-space kernel that intercepts syscalls to provide VM-level security with container-level overhead (🏷️
GoGoogleInfrastructure). - Kata Containers - Combines VM-level security with the developer experience of containers using lightweight VMs (🏷️
GoOCIInfrastructure). - Podman - Daemonless, rootless container engine ideal for running untrusted agent-generated code securely (🏷️
GoRootlessCLI). - Sysbox - Enhanced container runtime that enables Docker-in-Docker securely for nested agent execution environments (🏷️
GoDockerInfrastructure).
Deep dives: environments/sandboxes.md • environments/vms.md • environments/containers.md • environments/serverless.md
Tools for generating images, video, music, audio, and 3D assets using AI models.
| Generator | Strength | Open Source | Pricing |
|---|---|---|---|
| Midjourney v7 | Artistic quality | No | $10-120/mo |
| FLUX 2 | Photorealism | Yes | Free / API |
| Stable Diffusion | Full control | Yes | Free (OSS) |
| Ideogram v3 | Text rendering | No | Free / $7+/mo |
| Google Imagen 4 | Highest fidelity | No | API |
- Adobe Firefly 3 - Generates commercially safe images from text prompts using a model trained exclusively on licensed data (🏷️
CloudAdobe CCWeb). - DALL-E 3.5 - Generates detailed images from text prompts with 95% text accuracy integrated directly into ChatGPT (🏷️
CloudOpenAIAPI). - FLUX 2 - Open-weight photorealism model producing 4K output at 6x faster generation speed than its predecessor (🏷️
PythonOpen-WeightAPI). - Google Imagen 4 - Produces state-of-the-art photorealistic images from text via Google DeepMind accessible through AI Studio (🏷️
CloudGoogleAPI). - Ideogram v3 - Renders readable text inside generated images with near-zero spelling errors for logos and marketing (🏷️
CloudFreemiumWeb). - InkOS - Autonomous novel-writing CLI agent where agents collaborate to produce long-form fiction with continuity auditing (🏷️
PythonOpen-SourceCLI). - Leonardo AI - Multi-model image platform with Realtime Canvas and 3D gaming asset generation now owned by Canva (🏷️
CloudCanvaWeb). - Midjourney v7 - Produces the highest artistic quality images with unmatched aesthetics via Discord and web interface (🏷️
CloudDiscordWeb). - Recraft - Design-first image generator with native vector art output and brand consistency tools for designers (🏷️
CloudFreemiumWeb). - Stable Diffusion 3.5 - Open-source foundation model with full ControlNet, LoRA, and ComfyUI ecosystem for custom pipelines (🏷️
PythonOpen-SourceLocal).
| Generator | Max Length | Resolution | Open Source |
|---|---|---|---|
| Sora 2 | 25 sec | 1080p | No |
| Google Veo 3.1 | Variable | 4K | No |
| Runway Gen-4.5 | Variable | 1080p+ | No |
| Kling 3.0 | 2 min | 4K | No |
| Wan 2.1 | Unlimited | Variable | Yes |
| HunyuanVideo | Unlimited | Variable | Yes |
- Google Veo 3.1 - Generates the highest cinematic quality video with native audio from text prompts via Google DeepMind (🏷️
CloudGoogleAPI). - HaiLuo AI - Budget video generation platform offering 10 free videos per day with HD output quality (🏷️
CloudFreemiumWeb). - HunyuanVideo - Open-source video generation model from Tencent that runs on consumer-grade GPUs (🏷️
PythonOpen-SourceLocal). - Kling 3.0 - Generates 4K video up to 2 minutes long with native audio at the most competitive pricing available (🏷️
CloudFreemiumWeb). - LTX Video - Commercially safe open-source video generation model trained on fully licensed data (🏷️
PythonOpen-SourceLocal). - Luma Dream Machine - Creates physics-accurate 4K HDR video scenes with 3D-style output for cinematic quality (🏷️
CloudFreemiumWeb). - Pika 2.5 - Beginner-friendly video generation tool with fast turnaround and intuitive editing controls (🏷️
CloudFreemiumWeb). - Runway Gen-4.5 - Professional video generation with Motion Brush and the highest benchmark scores for visual quality (🏷️
CloudProWeb). - Seedance 2.0 - Quad-modal input video generator with built-in lip-sync and 2K resolution output (🏷️
CloudFreemiumWeb). - Sora 2 - Generates 25-second cinematic video clips with strong narrative coherence from text prompts by OpenAI (🏷️
CloudOpenAIWeb). - Wan 2.1 - Open-source self-hosted video generation model with unlimited length and no per-video costs (🏷️
PythonOpen-SourceLocal).
- ElevenLabs Music - Generates vocals and instrumentals with stem separation and per-section editing for audio production (🏷️
CloudElevenLabsWeb). - Meta AudioCraft - Open-source audio generation suite including MusicGen and AudioGen with fine-grained prompt control (🏷️
PythonOpen-SourceLocal). - Stable Audio - Generates instrumental audio tracks with a commercial license included by default for all outputs (🏷️
CloudStability AIWeb). - Suno - Generates complete songs with vocals, instruments, and lyrics from a single text prompt (🏷️
CloudFreemiumWeb). - Udio - Produces high-fidelity music with fine-grained sectional control over generation and arrangement (🏷️
CloudFreemiumWeb).
- Meshy - Generates 3D models from text or images for game assets and product visualization with multi-format export (🏷️
CloudFreemiumWeb). - Tripo AI - Produces fast 3D model generation from text or images with multi-format export for rapid prototyping (🏷️
CloudFreemiumWeb). - Vizcom - Renders industrial design sketches into photorealistic 3D visuals in real-time for product designers (🏷️
CloudProWeb).
AI agents that automate customer support, CRM workflows, sales outreach, and ticket resolution at scale.
| Platform | Auto-Resolution | Best For |
|---|---|---|
| Intercom Fin | 50%+ tickets | SaaS teams |
| Ada | 60%+ tickets | Complex resolution |
| Zendesk AI | 30%+ tickets | Enterprise |
| Freshdesk Freddy | 25%+ tickets | SMBs |
- Ada - Resolves 60% of customer support tickets automatically with complex multi-turn query understanding (🏷️
CloudEnterprisePlatform). - Assembled - Routes support tickets with workforce-aware scheduling and intelligent handoff to human agents (🏷️
CloudEnterprisePlatform). - Dixa - CRM-first conversational support platform with AI-powered routing and customer context enrichment (🏷️
CloudEnterprisePlatform). - Freshdesk Freddy AI - Auto-triages and routes support tickets with smart AI suggestions for budget-conscious SMB teams (🏷️
CloudFreshworksPlatform). - Intercom Fin - Resolves 50% of SaaS support tickets by learning directly from your help center and knowledge base (🏷️
CloudIntercomPlatform). - Zendesk AI - Automates 30% of enterprise support tickets with deep integration into the existing Zendesk ecosystem (🏷️
CloudZendeskPlatform).
- Creatio - No-code CRM platform with pre-built AI agents for sales, marketing, and service automation (🏷️
CloudNo-CodePlatform). - Dynamics 365 Copilot - Drafts, summarizes, and translates across Office 365 with deep Microsoft ecosystem integration (🏷️
CloudMicrosoftPlatform). - HubSpot Breeze - AI copilot with agent workflows and intelligence layer across marketing, sales, and service hubs (🏷️
CloudHubSpotPlatform). - Monday CRM Lexi - AI sales agent with automated lead sourcing, qualification, and pipeline management for SMBs (🏷️
CloudMondayPlatform). - Pipedrive AI - AI-powered email drafting, deal prioritization, and smart reporting for small sales teams (🏷️
CloudPipedrivePlatform). - Salesforce Einstein - Enterprise AI with predictions, autonomous agents, and CRM automation across the Salesforce platform (🏷️
CloudSalesforcePlatform). - Salesmate - CRM with AI-powered call summarization, lead qualification, and pipeline automation for small teams (🏷️
CloudCRMPlatform). - ServiceNow AI Agents - Orchestrates AI agents across IT, HR, and CRM workflows for enterprise service management (🏷️
CloudServiceNowPlatform). - Zoho CRM Zia - Predictive scoring, sentiment analysis, and voice commands for sales automation across Zoho products (🏷️
CloudZohoPlatform).
- Apollo.io - AI prospecting platform with 275M+ contacts, lead scoring, and automated email sequencing (🏷️
CloudFreemiumPlatform). - Clay - Enriches leads from 70+ data providers and generates hyper-personalized outreach at scale (🏷️
CloudProPlatform). - Instantly - Generates AI cold emails with smart sender rotation and built-in domain warmup for deliverability (🏷️
CloudProPlatform). - Lavender - Coaches email writing in real-time with AI response scoring and recipient intelligence (🏷️
CloudFreemiumPlatform). - Overloop CLI - AI outbound CLI agent that sources 450M+ contacts and runs email plus LinkedIn campaigns with JSON output (🏷️
CloudCLIPlatform).
Platforms for building, deploying, and scaling voice-based AI agents across calls, support, and live conversations.
| Platform | Latency | HIPAA | Best For |
|---|---|---|---|
| ElevenLabs | 75ms | Yes | Industry benchmark |
| Vapi | Low | No | Developer-first |
| Bland AI | Medium | Yes | Outbound call automation |
| PolyAI | Low | Yes | Enterprise scale |
- AssemblyAI - Speech-to-text API with speaker diarization, sentiment analysis, and summarization for voice agent pipelines (🏷️
CloudSTTAPI). - Bland AI - Automates outbound phone calls at scale with SOC2 and HIPAA compliance and CRM integration (🏷️
CloudTelephonyAPI). - Deepgram - Sub-300ms speech-to-text and text-to-speech APIs purpose-built for real-time voice agent pipelines (🏷️
CloudSTT/TTSAPI). - ElevenLabs - Industry-leading voice AI with 75ms latency, Conversational AI 2.0, RAG, and HIPAA compliance (🏷️
CloudVoicePlatform). - HeyGen - Creates AI talking avatars with voice cloning and lip-sync for video-based agent interactions (🏷️
CloudAvatarWeb). - PolyAI - Enterprise voice AI platform for natural multi-turn conversations with high-volume call handling (🏷️
CloudEnterprisePlatform). - Retell AI - Builds human-like voice agents with multi-language telephony support and low-latency responses (🏷️
CloudTelephonyAPI). - Synthesia - Generates AI video avatars that speak in 120+ languages for training and communication agents (🏷️
CloudAvatarWeb). - Synthflow - No-code voice agent builder with pre-built templates for SMBs to deploy phone agents quickly (🏷️
CloudNo-CodePlatform). - Voiceflow - No-code builder for voice and chat agents with visual conversation design and team collaboration (🏷️
CloudNo-CodePlatform).
AI platforms that conduct autonomous multi-step research, synthesize findings from multiple sources, and generate structured reports with citations.
| Agent | Context Window | Approach |
|---|---|---|
| Claude Research | 200K tokens | Multi-step with citations |
| ChatGPT Research | 128K tokens | Extended reasoning + browse |
| Gemini Research | 1M tokens | Google Search + KG |
| Perplexity Pro | Variable | Real-time cited search |
- ChatGPT Deep Research - Conducts extended reasoning with web browsing to produce structured research reports with Canvas output (🏷️
CloudOpenAIWeb). - Claude Deep Research - Performs multi-step investigation with verified source citations and 200K token context window (🏷️
CloudAnthropicWeb). - DeerFlow - Multi-agent research system from ByteDance with planning and execution loops for autonomous investigation (🏷️
PythonOpen-SourceResearch). - Gemini Deep Research - Leverages Google Search and Knowledge Graph integration with 1M token context for breadth-first research (🏷️
CloudGoogleWeb). - Perplexity Pro - Provides real-time search answers with inline citations and follow-up threads for iterative research (🏷️
CloudFreemiumWeb).
Platforms that generate full-stack applications, components, or web apps from natural language prompts.
| Builder | Output | Hosting |
|---|---|---|
| Bolt.new | Full-stack app | Browser/deploy |
| Lovable | Full web app | Auto-deploy |
| v0 by Vercel | React components | Vercel |
| Replit Agent | Full-stack | Replit |
- Bolt.new - Generates full-stack web applications in the browser from natural language prompts with instant deploy (🏷️
CloudFreemiumWeb). - Dyad - Privacy-first open-source prompt-to-app builder that generates full-stack applications locally (🏷️
TypeScriptOpen-SourceLocal). - Google Project IDX - AI-powered cloud IDE with free Claude Opus access for building full projects across all frameworks (🏷️
CloudGoogleWeb). - Lovable - Iterates from chat to deployed web application with auto-deploy and real-time preview (🏷️
CloudFreemiumWeb). - PlayCode Agent - Browser-only lightweight web app creation tool with AI-assisted code generation (🏷️
CloudProWeb). - Replit Agent - Builds and deploys full-stack projects from prompts with integrated hosting and collaboration (🏷️
CloudFreemiumWeb). - v0 by Vercel - Generates React and Tailwind UI components from text descriptions with one-click Vercel deployment (🏷️
CloudFreemiumWeb).
All-in-one AI platforms providing access to agents, tools, and models through consumer-facing interfaces without writing code.
| Platform | Context | Flagship Features |
|---|---|---|
| ChatGPT | 128K | GPTs, Deep Research, Canvas, Agent Mode |
| Claude | 200K | Computer Use, MCP, Cowork |
| Gemini | 1M | Deep Think, Gems, NotebookLM |
| Grok | 131K | Real-time X data, Grok Build |
- ChatGPT - General-purpose AI platform with GPTs, Deep Research, Canvas, and Agent Mode across the widest plugin ecosystem (🏷️
CloudOpenAIWeb). - Claude - Reasoning-focused AI with Computer Use, MCP integration, and Chrome browsing for coding and long document analysis (🏷️
CloudAnthropicWeb). - Coze - No-code agent builder with a marketplace of 100+ custom agents and plugin integrations (🏷️
CloudByteDanceWeb). - Cursor AI Automated Team - 4-role AI team with PM, DEV, OPS, and QA agents in Cursor IDE with file-based task routing (🏷️
TypeScriptCursorIDE). - Gemini - Google's multimodal AI with 1M token context, Deep Think, Gems, and NotebookLM for workspace integration (🏷️
CloudGoogleWeb). - Grok - Real-time AI with live X data access, Grok Build for 8-agent parallel code generation, and image generation (🏷️
CloudxAIWeb). - Meta AI - Llama-powered AI integrated across WhatsApp, Messenger, and Instagram for conversational assistance (🏷️
CloudMetaWeb). - Microsoft Copilot - AI assistant integrated into Office 365, Teams, and Power Platform for enterprise productivity workflows (🏷️
CloudMicrosoftWeb). - TeamHero - Open-source multi-agent orchestration with web dashboard, task lifecycle, knowledge base, and autopilot mode built on Claude Code (🏷️
PythonClaudeWeb).
Open-weight LLMs optimized for agentic tasks including tool use, planning, multi-step reasoning, and code generation.
| Model | Organization | Parameters | Agent Strength |
|---|---|---|---|
| Llama 4 | Meta | 8B-405B+ | Strong tool use |
| Qwen 3 | Alibaba | 0.6B-235B | MCP-native, multilingual |
| DeepSeek V3/R1 | DeepSeek | 671B MoE | 68x cheaper than GPT-4 |
| Gemma 3 | 1B-27B | Efficient edge agents |
- Baichuan 3 - Chinese-optimized open model family with strong domain performance for multilingual agent deployments (🏷️
PythonOpen-SourceLocal). - Command R+ - Cohere's 104B parameter model optimized specifically for RAG and enterprise tool use in agent workflows (🏷️
CloudCohereAPI). - DeepSeek V3 - 671B MoE model that delivers GPT-4 level reasoning at 68x lower cost for budget-conscious agent deployments (🏷️
PythonOpen-SourceLocal). - Falcon 3 - Efficient open model from TII UAE with strong multilingual support across 10B to 180B parameter variants (🏷️
PythonOpen-SourceLocal). - Gemma 3 - Efficient multimodal model from Google in 1B to 27B sizes ideal for edge and on-device agents (🏷️
PythonGoogleLocal). - Gemma 4 - Consumer and IoT-optimized model from Google in 2B to 31B sizes with E2B and E4B variants (🏷️
PythonGoogleLocal). - GLM-4 - Zhipu AI's 744B MoE model achieving the lowest hallucination rate and 77.8% on SWE-bench (🏷️
PythonOpen-SourceLocal). - InternLM 3 - Shanghai AI Lab model with strong long-context reasoning capabilities for complex agent tasks (🏷️
PythonOpen-SourceLocal). - Llama 4 - Meta's flagship open model family with Scout and Maverick variants and strong tool use capabilities (🏷️
PythonMetaLocal). - Mistral Large 2 - Leading European open model at 123B parameters with strong function calling for agent workflows (🏷️
PythonMistralAPI). - Phi-4 - Microsoft's compact 14B parameter model delivering strong reasoning for on-device and edge agent deployments (🏷️
PythonMicrosoftLocal). - Qwen 3 - MCP-native multilingual model family from Alibaba with thinking mode across 0.6B to 235B sizes (🏷️
PythonAlibabaLocal). - Qwen 3.5-Omni - Native multimodal model from Alibaba with text, image, and audio processing in a 397B MoE architecture (🏷️
PythonAlibabaLocal). - Qwen 3.6-Plus - Agentic-focused model with 1M context window, repo-level coding, and MCP-native tool use (🏷️
PythonAlibabaLocal). - StarCoder 2 - Code-focused open model trained on 600+ programming languages for specialized coding agents (🏷️
PythonBigCodeLocal). - Yi-Lightning - 01.AI's multilingual model with long context and fast inference across 6B to 200B parameter sizes (🏷️
PythonOpen-SourceLocal).
Key data points on the AI agent market covering adoption, growth, valuations, and fastest-growing open-source projects.
| Year | Market Size | Growth |
|---|---|---|
| 2024 | $5.1B | Baseline |
| 2025 | $7.6B | +49% YoY |
| 2026 | $10.91B | +43% YoY |
| 2027 | $16.8B | +54% YoY |
| 2030 | $52.63B | 46.3% CAGR |
| Metric | Value |
|---|---|
| Orgs with agents in production | 57% |
| Devs using AI coding tools regularly | 85% |
| Companies planning agent deployment | 72% |
| Enterprises with dedicated agent budget | 41% |
| Use Case | Share |
|---|---|
| Customer Service | 26.5% |
| Research and Analysis | 24.4% |
| Workflow Automation | 18.0% |
| Code Generation | 15.0% |
| Data Processing | 10.0% |
| Project | Stars (Jan 2026) | Stars (May 2026) | Growth |
|---|---|---|---|
| Browser Use | 10K | 50K | 5x |
| Playwright MCP | 1K | 10K | 10x |
| DeerFlow | 5K | 25K | 5x |
| Company | Valuation |
|---|---|
| OpenAI | $90B+ |
| Cursor | $29.3B |
| Anthropic | $20B |
| ElevenLabs | $11B |
| Mistral | $6B |
| Hugging Face | $4.5B |
- AI Agent Market Report 2026 - Comprehensive market analysis projecting the AI agent market to reach $52.63B by 2030 at 46.3% CAGR (🏷️
ReportMarketResearch). - State of AI Agents Survey - Enterprise survey showing 57% of organizations now have AI agents in production workflows (🏷️
ReportSurveyResearch).
Tools for running LLMs locally and self-hosting AI agent platforms with full privacy and control.
- GPT4All - Open-source local chat application that runs LLMs on consumer hardware without cloud dependencies (🏷️
C++DesktopLocal). - Jan - Open-source ChatGPT alternative that runs 100% offline with local model management and privacy guarantees (🏷️
TypeScriptElectronDesktop). - Llamafile - Distributes LLMs as single executable files with zero setup required from Mozilla (🏷️
C++PortableLocal). - llama.cpp - C/C++ LLM inference engine supporting CPU, GPU, and Apple Silicon as the foundation of local AI (🏷️
C++Cross-PlatformLocal). - LM Studio - Desktop application for running local LLMs with a polished UI across all major platforms (🏷️
TypeScriptElectronDesktop). - LocalAI - Drop-in OpenAI API replacement that runs models locally without requiring a GPU (🏷️
GoDockerLocal). - Ollama - Run LLMs locally with a dead-simple CLI interface and 162K+ GitHub stars (🏷️
GoCLILocal). - vLLM - High-throughput LLM serving engine with PagedAttention for production-grade local inference (🏷️
PythonCUDALocal).
Frameworks and tools for AI risk management, regulatory compliance, and governance as EU AI Act obligations take effect August 2026.
- AuditOne - Automated risk assessments and audit-ready documentation for AI agent compliance workflows (🏷️
CloudCompliancePlatform). - Bifrost - Real-time security enforcement engine for agent pipelines with policy-based access controls (🏷️
PythonSecuritySDK). - Credo AI - End-to-end AI governance platform with EU AI Act policy packs and model inventory management (🏷️
CloudGovernancePlatform). - EU AI Act - Official EU regulatory framework classifying AI systems by risk tier from unacceptable to minimal risk (🏷️
PolicyRegulationEU). - IBM watsonx.governance - Enterprise AI risk, compliance, and model monitoring platform for regulated industries (🏷️
CloudIBMPlatform). - Microsoft Agent Governance Toolkit - Runtime policy enforcement and guardrails for Azure-hosted AI agents (🏷️
CloudMicrosoftPlatform). - NIST AI RMF - US framework for AI risk management covering Govern, Map, Measure, and Manage functions (🏷️
PolicyFrameworkUS). - OneTrust AI Governance - Risk classification, consent management, and compliance workflows for AI agent deployments (🏷️
CloudCompliancePlatform).
AI agents specialized in penetration testing, vulnerability discovery, threat detection, and security analysis.
- CAI - AI-powered penetration testing and red teaming agent with human-in-the-loop vulnerability discovery (🏷️
PythonSecurityCLI). - CrowdStrike Charlotte AI - Enterprise AI security analyst for automated threat hunting and incident response (🏷️
CloudEnterprisePlatform). - Microsoft Security Copilot - Enterprise threat detection and incident response AI integrated across Microsoft security products (🏷️
CloudMicrosoftPlatform). - PentestGPT - GPT-powered penetration testing tool with automated reasoning for vulnerability assessment (🏷️
PythonSecurityCLI). - YAWNING TITAN - Graph-based cybersecurity simulation environment for training and testing defensive AI agents (🏷️
PythonSimulationResearch).
AI agents for mental health support, cognitive training, and therapy-adjacent applications with clinical validation.
- Akili Interactive - FDA-cleared video game-based digital medicine for ADHD using neuroplasticity-focused cognitive training (🏷️
CloudFDA-ClearedMedical). - CittaVerse - AI-assisted reminiscence therapy for elderly cognitive training with narrative quality scoring and life story book generation (🏷️
PythonResearchTherapy). - Elomia - Clinician-designed AI therapy chatbot with natural conversation flow and anonymous interaction by design (🏷️
CloudMental HealthWeb). - Ginger - On-demand mental healthcare platform combining AI coaching with live therapists using CBT and DBT approaches (🏷️
CloudMental HealthMobile). - Headspace Health - Meditation and mental health platform with CBT-based courses, sleep programs, and clinical partnerships (🏷️
CloudWellnessMobile). - Replika - AI companion with Rogerian-style support, persistent memory, and mood tracking for social wellbeing (🏷️
CloudCompanionMobile). - Sanvello - CBT tools with mood tracking and coaching covered by insurance plans for accessible mental healthcare (🏷️
CloudMental HealthMobile). - Talkspace AI - AI-assisted therapy matching platform with human therapist backup for personalized mental health care (🏷️
CloudTherapyMobile). - Tess by X2AI - SMS-based therapy coach using CBT with clinical validation showing 28% depression reduction (🏷️
CloudMental HealthSMS). - Woebot - CBT-based mental health chatbot that is FDA-cleared and Stanford-validated for therapeutic conversations (🏷️
CloudFDA-ClearedMobile). - Wysa - NHS-approved AI mental health companion using CBT and DBT with anonymous design principles (🏷️
CloudMental HealthMobile). - Youper - Emotional health assistant using CBT and ACT with Stanford-tested clinical effectiveness for mood tracking (🏷️
CloudMental HealthMobile).
Curated newsletters, podcasts, and communities for staying current with AI agent development.
- AI Engineering Newsletter - AI engineering podcast and newsletter by Swyx and Alessio covering agent architectures and tooling (🏷️
NewsletterPodcastWeb). - aibtc.news - Bitcoin-focused agent news platform with bounties and classifieds for the agent economy (🏷️
NewsletterBitcoinWeb). - Awesome Agents Newsletter - Weekly curated tools and reviews covering the latest in AI agent development (🏷️
NewsletterWeeklyWeb). - Ben's Bites - Daily AI newsletter with a builder focus covering tools, launches, and trends (🏷️
NewsletterDailyWeb). - r/ClaudeAI - Reddit community for Claude users sharing agent workflows, prompts, and integration patterns (🏷️
CommunityRedditForum). - r/LangChain - Reddit community for agent developers using LangChain, LangGraph, and related frameworks (🏷️
CommunityRedditForum). - r/LocalLLaMA - Reddit community for self-hosted LLM users sharing local deployment and agent setup guides (🏷️
CommunityRedditForum). - State of Agent Engineering - Annual industry report surveying 1300+ practitioners on agent development trends (🏷️
ReportAnnualWeb). - The Rundown AI - Daily AI digest reaching 600K+ subscribers with concise coverage of agent news and launches (🏷️
NewsletterDailyWeb).
See CHANGELOG.md for the full update history.
Your contributions are what keep this list useful. Read Contributing.md for the entry format, inclusion criteria, and style guide.
