# Product Roadmap Aden Agent Framework aims to help developers build outcome-oriented, self-adaptive agents. Please find our roadmap here ```mermaid flowchart TB %% Main Entity User([User]) %% ========================================= %% EXTERNAL EVENT SOURCES %% ========================================= subgraph ExtEventSource [External Event Source] E_Sch["Schedulers"] E_WH["Webhook"] E_SSE["SSE"] end %% ========================================= %% SYSTEM NODES %% ========================================= subgraph WorkerBees [Worker Bees] WB_C["Conversation"] WB_SP["System prompt"] subgraph Graph [Graph] direction TB N1["Node"] --> N2["Node"] --> N3["Node"] N1 -.-> AN["Active Node"] N2 -.-> AN N3 -.-> AN %% Nested Event Loop Node subgraph EventLoopNode [Event Loop Node] ELN_L["listener"] ELN_SP["System Prompt
(Task)"] ELN_EL["Event loop"] ELN_C["Conversation"] end end end subgraph JudgeNode [Judge] J_C["Criteria"] J_P["Principles"] J_EL["Event loop"] <--> J_S["Scheduler"] end subgraph QueenBee [Queen Bee] QB_SP["System prompt"] QB_EL["Event loop"] QB_C["Conversation"] end subgraph Infra [Infra] SA["Sub Agent"] TR["Tool Registry"] WTM["Write through Conversation Memory
(Logs/RAM/Harddrive)"] SM["Shared Memory
(State/Harddrive)"] EB["Event Bus
(RAM)"] CS["Credential Store
(Harddrive/Cloud)"] end subgraph PC [PC] B["Browser"] CB["Codebase
v 0.0.x ... v n.n.n"] end %% ========================================= %% CONNECTIONS & DATA FLOW %% ========================================= %% External Event Routing E_Sch --> ELN_L E_WH --> ELN_L E_SSE --> ELN_L ELN_L -->|"triggers"| ELN_EL %% User Interactions User -->|"Talk"| WB_C User -->|"Talk"| QB_C User -->|"Read/Write Access"| CS %% Inter-System Logic ELN_C <-->|"Mirror"| WB_C WB_C -->|"Focus"| AN WorkerBees -->|"Inquire"| JudgeNode JudgeNode -->|"Approve"| WorkerBees %% Judge Alignments J_C <-.->|"aligns"| WB_SP J_P <-.->|"aligns"| QB_SP %% Escalate path J_EL -->|"Report (Escalate)"| QB_EL %% Pub/Sub Logic AN -->|"publish"| EB EB -->|"subscribe"| QB_C %% Infra and Process Spawning ELN_EL -->|"Spawn"| SA SA -->|"Inform"| ELN_EL SA -->|"Starts"| B B -->|"Report"| ELN_EL TR -->|"Assigned"| EventLoopNode CB -->|"Modify Worker Bee"| WorkerBees %% ========================================= %% SHARED MEMORY & LOGS ACCESS %% ========================================= %% Worker Bees Access Graph <-->|"Read/Write"| WTM Graph <-->|"Read/Write"| SM %% Queen Bee Access QB_C <-->|"Read/Write"| WTM QB_EL <-->|"Read/Write"| SM %% Credentials Access CS -->|"Read Access"| QB_C ``` --- ## Core Architecture & Swarm Primitives ### Node-Based Architecture Implement the core execution engine where every Agent operates as an isolated, asynchronous graph of nodes. - [x] **Core Node Implementation** - [x] NodeProtocol with JSON parsing utilities (graph/node.py) - [x] EventLoopNode with LLM conversation management (graph/event_loop_node.py) - [x] Flexible input/output keys with nullable output handling - [x] Node wrapper SDK for agent creation - [x] Tool access layer with MCP integration - [x] **Graph Executor** - [x] Graph traversal execution (graph/executor.py) - [x] Node transition management - [x] Error handling and output mapping - [x] ExecutionResult with success/error status - [x] **Shared Memory Access** - [x] SharedState manager (runtime/shared_state.py) - [x] Session-based storage (storage/session_store.py) - [x] Isolation levels: ISOLATED, SHARED, SYNCHRONIZED - [x] **Default Monitoring Hooks** - [ ] Performance metrics collection - [ ] Resource usage tracking - [ ] Health check endpoints ### Node Protocol Build the standard communication protocol for inter-node messaging and data passing. - [x] **Edge Specifications** - [x] ALWAYS: Always traverse (graph/edge.py) - [x] ON_SUCCESS: Success-based routing - [x] ON_FAILURE: Failure-based routing - [x] CONDITIONAL: Expression-based routing with safe_eval - [x] LLM_DECIDE: Goal-aware LLM-powered routing - [x] **Event Bus System** - [x] Full event bus implementation (runtime/event_bus.py) - [x] LLM text deltas, tool calls, node transitions - [x] Graph-scoped event routing for multi-agent scenarios - [x] **Conversation Management** - [x] NodeConversation tracks message history (graph/conversation.py) - [x] Tool results, streaming content, metadata support ### Judge in Event Loop A separate LLM-powered judge to determine if the workers finish their job. - [x] **Conversation Judge (Level 2)** - [x] Evaluates node completion against success criteria (graph/conversation_judge.py) - [x] Reads recent conversation and assesses quality - [x] Returns verdict: ACCEPT or RETRY with confidence scores - [x] **Test Evaluation Judge** - [x] Provider-agnostic (OpenAI, Anthropic, Google Gemini) (testing/llm_judge.py) - [x] JSON response parsing for structured evaluation - [ ] **Multi-Level Judgment Integration** - [ ] Judge node integration with event loop - [ ] Automatic retry logic based on judge verdict - [ ] Judge performance monitoring ### Swarm Hierarchy Develop the distinct behavioral logic for the Queen Bee (Orchestrator), Judge Bee (Evaluator), and Worker Bee (Executor). - [x] **Judge Bee (Evaluator)** - [x] Evaluation criteria framework (graph/goal.py) - [x] Success/failure determination - [x] Quality assessment with confidence scores - [x] **Hive Coder Agent (Builder)** - [x] Coder node: forever-alive event loop (agents/hive_coder/nodes/) - [x] Guardian node: event-driven watchdog for supervised agents - [x] Tool discovery (discover_mcp_tools) - [x] Agent aware (list_agents, inspect sessions) - [x] Post-build testing (run_agent_tests) - [x] Debugging capabilities (inspect checkpoints, memory) - [ ] **Queen Bee (Orchestrator)** - [ ] Multi-agent coordination layer - [ ] Task distribution logic - [ ] Dynamic worker agent creation - [ ] Swarm-level goal management - [ ] **Worker Bee (Executor)** - [ ] Worker taxonomy definition - [ ] Worker agent templates - [ ] Task execution patterns ### Coding Agent Workflows Implement the Goal Creation Session via the Queen Bee and the dynamic Worker Agent Creation flow. - [x] **Goal Creation Session** - [x] Goal object schema definition (graph/goal.py) - [x] SuccessCriterion: Measurable success (5+ criteria per goal) - [x] Constraint: Hard/soft boundaries (time, cost, safety, scope, quality) - [x] GoalStatus: DRAFT → READY → ACTIVE → COMPLETED/FAILED - [x] Instruction back and forth in Hive Coder - [x] Test case generation - [x] Test case validation for worker agent - [x] **Agent Creation Flow** - [x] Hive Coder reads templates and discovers tools (builder/package_generator.py) - [x] Generates agent.py, nodes/__init__.py, config.py - [x] MCP server configuration discovery - [x] Dynamic tool binding - [ ] **Worker Agent Dynamic Creation** - [ ] Template agent initialization from Queen Bee - [ ] Runtime worker instantiation - [ ] Worker lifecycle management ### Security Layer Build robust, local Credential Management interfaces for secure API key handling. - [x] **Unified Credential Store** - [x] Multi-backend storage (credentials/store.py) - [x] EncryptedFileStorage: Encrypted local storage (~/.hive/credentials) - [x] EnvVarStorage: Environment variable mapping - [x] InMemoryStorage: Testing - [x] HashiCorp Vault: Enterprise secrets (credentials/storage.py) - [x] Template resolution: `{{cred.key}}` patterns - [x] Caching with TTL (default 5 min, configurable) - [x] Thread-safe operations with RLock - [x] **OAuth2 Providers** - [x] Base provider pattern (credentials/oauth2/) - [x] HubSpot provider integration - [x] Lifecycle management (refresh tokens) - [x] Browser opening for auth flows (tools/credentials/browser.py) - [x] **Aden Sync Provider** - [x] Syncs OAuth2 tokens from Aden authentication server (credentials/aden/) - [x] Falls back to local storage if Aden unavailable - [x] Auto-refresh on sync - [ ] **Enterprise Secret Managers** - [ ] AWS Secrets Manager integration - [ ] Azure Key Vault integration - [ ] Audit logging for compliance/tracking - [ ] Per-environment configuration support --- ## Tooling Ecosystem & General Compute ### Sub-agents Parallel Execution Develop the Sub-agent execution environment for parallel tasks execution. The subagents are designed with isolation for repeatability. - [x] **Multi-Graph Sessions** - [x] Load multiple agent graphs in single session (runtime/agent_runtime.py) - [x] Shared state between graphs - [x] Independent execution streams - [x] Graph lifecycle management (load/unload/start/restart) - [x] **Concurrent Execution Management** - [x] Max concurrent executions configuration - [x] Isolation levels: isolated, shared, synchronized - [ ] **Sub-agent Execution Environment** - [ ] Isolated sub-agent runtime environment - [ ] Task isolation mechanisms - [ ] Result aggregation - [ ] Error handling for parallel tasks - [ ] Repeatability guarantees ### Browser Use Node Implement native browser-integrated automation so agents can take over a browser for auth and agents perform the automation jobs. This node comes with a specific set of tools and system prompts. - [x] **Web Scraping with Playwright** - [x] Headless Chromium launch (tools/web_scrape_tool/) - [x] Stealth mode via playwright_stealth - [x] JavaScript rendering with wait-for-domcontentloaded - [x] CSS selector support - [x] User-agent spoofing - [x] Sandbox/automation detection evasion - [x] **Browser Launch Utilities** - [x] Platform-specific browser opening (macOS/Linux/Windows) (tools/credentials/browser.py) - [x] OAuth2 flow integration - [ ] **Full Browser Use Node** - [ ] Multi-page automation workflows - [ ] Form filling with vision-guided interactions - [ ] Interactive screenshot capabilities - [ ] Session management across navigations - [ ] Browser-specific tool set - [ ] System prompts for browser tasks ### Core Graph Framework Infra Ship essential framework utilities: Node validation, HITL (Human-in-the-loop pause/approve), and node lifecycle management. - [x] **Node Validation** - [x] Pydantic-based validation - [x] Schema enforcement - [x] Output key validation (Level 0) - [x] **Human-in-the-Loop (HITL)** - [x] HITLRequest and HITLResponse protocol (graph/hitl.py) - [x] Question types: FREE_TEXT, STRUCTURED, SELECTION, APPROVAL, MULTI_FIELD - [x] Haiku-powered response parsing - [x] User-friendly display formatting - [x] Pause/approve workflow - [x] State saved to checkpoint - [x] Resume with HITLResponse merged into context - [x] ~~**TUI Integration**~~ *(deprecated — see AGENTS.md; use `hive open` browser UI instead)* - [x] ~~Chat REPL with streaming support (tui/app.py)~~ - [x] ~~Multi-graph session management~~ - [x] ~~User presence detection~~ - [x] ~~Real-time log viewing~~ - [x] **Node Lifecycle Management** - [x] Start/stop/pause/resume in execution stream - [x] State persistence via checkpoint store - [x] Recovery mechanisms with checkpoint restore - [ ] **Advanced HITL Features** - [ ] Callback handlers for custom intervention logic - [ ] Streaming interface for real-time monitoring - [ ] Approval workflows at scale ### Infrastructure Tools Port popular tools, and build out the Runtime Log, Audit Trail, Excel, and Email integrations. - [x] **File Operations (36+ tools)** - [x] read_file, write_file, edit_file (builder/package_generator.py) - [x] list_directory, search_files - [x] apply_diff / apply_patch for code modification (tools/file_system_toolkits/) - [x] data_tools (CSV/Excel parsing) - [x] **Web Tools** - [x] Web Search (tools/web_search_tool/) - [x] Web Scraper (tools/web_scrape_tool/) - [x] Exa Search (tools/exa_search_tool/) - [x] News Tool (tools/news_tool/) - [x] SerpAPI (tools/serpapi_tool/) - [x] **Data Tools** - [x] CSV tools (tools/csv_tool/) - [x] Excel tools (tools/excel_tool/) - [x] PDF tools (tools/pdf_read_tool/) - [x] Vision tool for image analysis (tools/vision_tool/) - [x] Time tool (tools/time_tool/) - [x] **Communication Tools (8 tools)** - [x] Email tool (tools/email_tool/) - [x] Gmail tool (tools/gmail_tool/) - [x] Slack tool (tools/slack_tool/) - [x] Discord tool (tools/discord_tool/) - [x] Telegram tool (tools/telegram_tool/) - [x] Google Docs (tools/google_docs_tool/) - [x] Google Maps (tools/google_maps_tool/) - [x] Cal.com (tools/calcom_tool/) - [x] **CRM/API Integrations (5+ tools)** - [x] HubSpot (tools/hubspot_tool/) - [x] GitHub (tools/github_tool/) - [x] Apollo (tools/apollo_tool/) - [x] BigQuery (tools/bigquery_tool/) - [x] Razorpay (tools/razorpay_tool/) - [x] Calendar (tools/calendar_tool/) - [x] **Security/Scanning Tools (5 tools)** - [x] DNS Security Scanner (tools/dns_security_scanner/) - [x] SSL/TLS Scanner (tools/ssl_tls_scanner/) - [x] Port Scanner (tools/port_scanner/) - [x] Subdomain Enumerator (tools/subdomain_enumerator/) - [x] Tech Stack Detector (tools/tech_stack_detector/) - [x] **Runtime & Logging** - [x] Runtime Logger with L1/L2/L3 levels (runtime/runtime_logger.py) - [ ] **Audit Trail System** - [ ] Decision tracing beyond logs - [ ] Compliance reporting - [ ] Historical query capabilities --- ## Memory, Storage & File System Capabilities ### Memory Tools Simple pure file-based memory management - [x] **Short-Term Memory (STM)** - [x] SharedState manager for in-memory state (runtime/shared_state.py) - [x] Session-based storage (storage/session_store.py) - [x] State-based short-term memory layer - [x] **Conversation Memory** - [x] NodeConversation tracks message history (graph/conversation.py) - [x] Tool results, streaming content, metadata - [x] Context building for LLM prompts - [ ] **Long-Term Memory (LTM)** - [ ] Semantic indexing for memory retrieval - [ ] RLM (Retrieval-augmented Long-term Memory) implementation - [ ] Memory persistence beyond session - [ ] Content-based memory search ### Durable Scratchpad Integrate a lightweight, persistent DB for long-term memory using the filesystem-as-scratchpad pattern. - [x] **Filesystem as Scratchpad** - [x] File-based persistence layer (storage/) - [x] Session store implementation - [x] Data durability guarantees - [x] **Checkpoint System** - [x] Save/restore execution state (storage/checkpoint_store.py) - [x] TTL-based cleanup - [x] Async checkpoint support - [x] Max age configuration - [ ] **Message Model & Session Management** - [ ] Message class with structured content types - [ ] Session classes for conversation state - [ ] Per-message file persistence - [ ] Migration from monolithic run storage ### Memory Isolation Enforce session-local memory isolation to prevent data bleed between concurrent agent runs. - [x] **Session Isolation** - [x] Session-local memory implementation (storage/session_store.py) - [x] Data bleed prevention - [x] Concurrent run safety - [x] Isolation levels: ISOLATED, SHARED, SYNCHRONIZED - [x] **State Management** - [x] SharedState with thread-safe operations (runtime/shared_state.py) - [x] Session-scoped state access - [ ] **Context Management** - [ ] Message.stream(sessionID) implementation - [ ] Full context building optimization - [ ] Message to model conversion improvements ### Agent Capabilities Implement File I/O support, streaming mode, and allow users to supply custom functions as libraries/nodes. - [x] **File I/O** - [x] File read/write operations (builder/package_generator.py) - [x] File system navigation - [x] Directory listing and search - [x] **Execution Streaming** - [x] Real-time event streaming (runtime/execution_stream.py) - [x] Token-by-token output via event bus - [x] Tool call streaming - [x] **Custom Tool Integration** - [x] MCP server discovery (builder/package_generator.py) - [x] Dynamic tool binding - [x] Custom tool registration - [ ] **Streaming Mode Enhancements** - [ ] Progressive result delivery optimization - [ ] Backpressure handling - [ ] **Custom Function Libraries** - [ ] User-supplied function libraries as nodes - [ ] Library versioning and management - [ ] **Proactive Memory Compaction** - [ ] Overflow detection - [ ] Backward-scanning pruning strategy - [ ] Token tracking integration for compaction decisions ### File System Enhancements Add semantic search capabilities and an interactive file system for frontend product integration. - [x] **File Search** - [x] search_files tool (builder/package_generator.py) - [x] Directory traversal - [ ] **Semantic Search** - [ ] Semantic indexing of files - [ ] Natural language file search - [ ] Content-based retrieval with embeddings - [ ] **Interactive File System** - [ ] Frontend file browser integration - [ ] Real-time file system updates - [ ] Visual file navigation in GUI --- ## Eval System, DX, & Open Source Guardrails ### Eval System Build the failure recording mechanism and an SDK for defining custom failure conditions. - [x] **Multi-Level Evaluation** - [x] Level 0: Output key validation (all required keys set) - [x] Level 1: Literal checks (output_contains, output_equals) - [x] Level 2: Conversation-aware judgment (graph/conversation_judge.py) - [x] **Goal-Based Constraints** - [x] Hard constraints (violation = failure) (graph/goal.py) - [x] Soft constraints (prefer not to violate) - [x] Categories: time, cost, safety, scope, quality - [x] Constraint checking infrastructure - [x] **Success Criteria Definition** - [x] Weighted criteria (0.0-1.0) - [x] Metrics: output_contains, output_equals, llm_judge, custom - [x] 90% threshold for goal success - [x] **Test Framework** - [x] TestCase, TestResult, TestStorage classes (testing/) - [x] LLM-based judgment for semantic evaluation (testing/llm_judge.py) - [x] Approval CLI for manual approval workflows - [x] Categorization and test result reporting - [ ] **Failure Recording** - [ ] Failure capture mechanism - [ ] Failure analysis tools - [ ] Historical failure tracking - [ ] Continuous improvement loop - [ ] **Custom Failure Conditions SDK** - [ ] SDK for defining custom failure conditions - [ ] Custom evaluator framework extension - [ ] Condition validation DSL ### Guardrails SDK Implement deterministic condition guardrails directly in the node, complete with mitigation tracking and audit logs. - [x] **Goal Constraints (Basic Guardrails)** - [x] Hard/soft constraint definitions (graph/goal.py) - [x] Constraint checking in goals - [ ] **Deterministic Guardrails SDK** - [ ] In-node guardrail implementation - [ ] Condition-based guardrails - [ ] Guardrail SDK for custom rules - [ ] **Monitoring & Tracking** - [ ] Mitigation tracking for violations - [ ] Audit log system for guardrails - [ ] Compliance reporting - [ ] **Basic Monitoring Hooks** - [ ] Agent node SDK monitoring hooks - [ ] Event hook system for guardrails - [ ] Default monitoring hooks in nodes ### DevTools CLI Release CLI tools specifically for rapid memory management and credential store editing. - [x] **Main CLI** - [x] Run, info, validate, list commands (cli.py) - [x] Dispatch mode for batch execution - [x] Shell mode for interactive use - [x] Model selection configuration - [x] **Testing CLI** - [x] test-run, test-debug, test-list, test-stats (testing/cli.py) - [x] Pytest integration - [x] Test categorization - [x] ~~**TUI (Terminal UI)**~~ *(deprecated — see AGENTS.md; use `hive open` browser UI instead)* - [x] ~~Interactive chat with streaming (tui/app.py)~~ - [x] ~~Multi-graph management UI~~ - [x] ~~Log pane for real-time output~~ - [x] ~~Keyboard shortcuts (Ctrl+C, Ctrl+D, etc.)~~ - [ ] **Memory Management CLI** - [ ] Memory inspection commands - [ ] Memory cleanup utilities - [ ] Session management commands - [ ] **Credential Store CLI** - [ ] Interactive credential editing - [ ] Secure credential viewer - [ ] Credential validation tools - [ ] **Debugging Tools** - [ ] Interactive debugging mode beyond TUI - [ ] Breakpoint support in execution - [ ] Step-through execution ### Observability Support user-driven log analysis, basic monitoring hooks from the SDK, and an interactive debugging mode. - [x] **Runtime Logging** - [x] L1 (summary), L2 (detailed), L3 (tool) logging levels (runtime/runtime_logger.py) - [x] Session logs directory storage - [x] Audit trail for decision tracing in logs - [x] **Event Bus Monitoring** - [x] Real-time event streaming (runtime/event_bus.py) - [x] LLM text deltas, tool calls, node transitions - [x] Graph-scoped event routing - [ ] **Log Analysis Tools** - [ ] User-driven log analysis (OSS approach) - [ ] Log aggregation utilities - [ ] Log visualization tools - [ ] **Monitoring Hooks** - [ ] Basic observability hooks from SDK - [ ] Performance metrics collection - [ ] Health checks system - [ ] **Token Tracking** - [ ] Reasoning token tracking - [ ] Cache token tracking - [ ] Token metrics in compaction logic ### Developer Success Write the Quick Start guide, detailed tool usage documentation, and set up the MVP README examples. - [x] **Documentation** - [x] Quick start guide - [x] Goal creation guide - [x] Agent creation guide - [x] README with examples - [x] Contributing guidelines - [x] GitHub Page setup - [x] **Tool Usage Documentation** - [ ] Comprehensive tool documentation - [ ] Tool integration examples - [ ] Best practices guide - [ ] **Video Content** - [ ] Introduction video - [ ] Tutorial videos - [ ] **Example Agents** - [ ] Knowledge agent template - [ ] Blog writer agent template - [ ] SDR agent template --- ## Deployment, CI/CD & Community Templates ### Self-Deployment Standardize the Docker container builds and establish headless backend execution APIs. - [x] **Docker Support** - [x] Python 3.11-slim base image (tools/Dockerfile) - [x] Playwright Chromium installation - [x] Non-root user for security - [x] Health check endpoint - [x] Volume mount for workspace persistence - [x] Exposes port 4001 for MCP server - [x] **Agent Runtime** - [x] AgentRuntime: Top-level orchestrator (runtime/agent_runtime.py) - [x] Multiple entry points (manual, webhook, timer, event, api) - [x] Concurrent execution management - [x] State persistence via session store - [x] Outcome aggregation - [x] **Async Entry Points** - [x] AsyncEntryPointSpec: Webhook, timer, event triggers (graph/edge.py) - [x] Timer config: cron expressions or interval_minutes - [x] Event triggers for custom events - [x] Isolation levels: isolated, shared, synchronized - [ ] **Headless Backend Enhancements** - [ ] Standardized backend execution APIs - [ ] Frontend attachment interface - [ ] Self-hosted setup guide with examples ### Lifecycle APIs Expose basic REST/WebSocket endpoints for external control (Start, Stop, Pause, Resume). - [x] **Webhook Server** - [x] FastAPI-based webhook server (runtime/webhook_server.py) - [x] Route configuration per entry point - [x] Optional secret validation - [x] **Graph Lifecycle Management** - [x] Load/unload/start/restart in AgentRuntime - [x] State persistence - [x] Recovery mechanisms - [x] **REST API Endpoints** - [ ] Start endpoint for agent execution - [ ] Stop endpoint for graceful shutdown - [ ] Pause endpoint for execution suspension - [ ] Resume endpoint for continuation - [ ] Status query endpoint for monitoring - [ ] **WebSocket API** - [ ] Real-time event streaming to clients - [ ] Bidirectional communication - [ ] Connection management with reconnection ### CI/CD Pipelines Implement automated test execution, agent version control, and mandatory test-passing for deployment. - [x] **Test Execution** - [x] Test framework with pytest integration (testing/) - [x] Test result reporting - [x] Test CLI commands (test-run, test-debug, etc.) - [x] **Automated Testing Pipeline** - [ ] CI integration (GitHub Actions, etc.) - [ ] Mandatory test-passing gates - [ ] Coverage reporting - [ ] **Version Control** - [ ] Agent versioning system - [ ] Semantic versioning for agents - [ ] Version compatibility checks - [ ] **Deployment Automation** - [ ] Continuous deployment pipeline - [ ] Rollback mechanisms - [ ] Blue-green deployment support ### Distribution Launch the official PyPI package, Docker Hub image, and the community Discord channel. - [ ] **Package Distribution** - [ ] Official PyPI package - [ ] Docker Hub image publication - [ ] Version release automation - [ ] Installation documentation - [ ] **Community Channels** - [ ] Discord channel setup - [ ] Community support structure - [ ] Contribution guidelines enforcement - [ ] **Cloud Deployment** - [ ] AWS Lambda integration - [ ] GCP Cloud Functions support - [ ] Azure Functions support - [ ] 3rd-party platform integrations - [ ] Self-deploy with orchestrator connection ### Example Agents Ship ~20 ready-to-use templates including GTM Sales, Marketing, Analytics, Training, and Smart Entry agents. - [x] **Hive Coder Agent** - [x] Agent builder template (agents/hive_coder/) - [x] Guardian node for supervision - [ ] **Sales & Marketing Agents** - [ ] GTM Sales Agent (workflow automation) - [ ] GTM Marketing Agent (campaign management) - [ ] Lead generation agent - [ ] Email campaign agent - [ ] Social media agent - [ ] **Analytics & Insights Agents** - [ ] Analytics Agent (data analysis) - [ ] Data processing agent - [ ] Report generation agent - [ ] Dashboard agent - [ ] **Training & Education Agents** - [ ] Training Agent (onboarding) - [ ] Content creation agent - [ ] Knowledge base agent - [ ] Documentation agent - [ ] **Automation & Forms Agents** - [ ] Smart Entry / Form Agent (self-evolution emphasis) - [ ] Data validation agent - [ ] Workflow automation agent - [ ] Integration agent - [ ] **Additional Templates** - [ ] Customer support agent - [ ] Document processing agent - [ ] Scheduling agent - [ ] Research agent - [ ] Code review agent --- ## Open Hive ### Local API Gateway Build a lightweight local server (e.g., FastAPI or Node) that securely exposes the Hive framework's core Event Bus and Memory Layer to the local browser environment. - [x] **MCP Server Foundation** - [x] FastMCP server implementation (builder/package_generator.py) - [x] Agent builder tools exposed - [x] Port 4001 exposed in Docker - [x] **Event Bus Architecture** - [x] Event Bus implementation (runtime/event_bus.py) - [x] Real-time event streaming - [x] Graph-scoped event routing - [ ] **Local API Gateway** - [ ] Lightweight local server (FastAPI or Node) - [ ] Secure authentication layer for browser - [ ] CORS and security configuration - [ ] Event Bus API endpoints for browser access - [ ] Event subscription management for frontend - [ ] **Memory Layer API** - [ ] Memory read/write endpoints - [ ] Session management API for frontend - [ ] Memory visualization data endpoints ### Visual Graph Explorer Implement an interactive, drag-and-drop canvas (using libraries like React Flow) to visualize the Worker Graph, Queen Bee, and active execution paths in real-time. - [ ] **Graph Visualization** - [ ] React Flow integration - [ ] Worker Graph rendering from agent definitions - [ ] Node type visualization (EventLoop, Function, etc.) - [ ] Edge visualization with condition types - [ ] Active execution path highlighting - [ ] **Interactive Features** - [ ] Drag-and-drop canvas for graph editing - [ ] Node editing capabilities - [ ] Real-time graph updates during execution - [ ] Zoom and pan controls - [ ] Node inspection on click - [ ] **Integration with Runtime** - [ ] Live execution visualization - [ ] Node state indicators - [ ] Edge traversal animation ### TUI to GUI Upgrade Port the existing Terminal User Interface (TUI) into a rich web application, allowing users to interact directly with the Queen Bee / Coding Agent via a browser chat interface. > **Note:** The legacy TUI (`tui/app.py`) is deprecated and no longer maintained (see AGENTS.md). The items below reflect legacy work completed before deprecation. New development should target the browser-based GUI (`hive open`). - [x] ~~**TUI Foundation**~~ *(deprecated)* - [x] ~~Terminal chat interface (tui/app.py)~~ - [x] ~~Streaming support~~ - [x] ~~Multi-graph management~~ - [x] ~~Log pane display~~ - [x] ~~Keyboard shortcuts~~ - [ ] **Web Application** - [ ] Modern web UI framework setup (React/Vue/Svelte) - [ ] Responsive design implementation - [ ] Cross-browser compatibility - [ ] **Chat Interface** - [ ] Browser-based chat UI - [ ] Hive Coder interaction (Queen Bee proxy) - [ ] Coding Agent interface - [ ] Message history and search - [ ] Rich message formatting (markdown, code blocks) - [ ] **TUI Feature Parity** - [ ] All TUI commands in GUI - [ ] Keyboard shortcuts in browser - [ ] Command palette (Cmd+K style) ### Memory & State Inspector Create a UI component to inspect the Shared Memory and Write-Through Conversation Memory, allowing developers to click on any node and see exactly what it is thinking. - [x] **Session State** - [x] Session state retrieval (builder/package_generator.py) - [ ] **Memory Inspector UI** - [ ] Shared Memory visualization - [ ] Conversation memory view (NodeConversation display) - [ ] Memory search and filter - [ ] Memory timeline view - [ ] **Node State Inspection** - [ ] Click-to-inspect functionality - [ ] Node thought process display (LLM reasoning) - [ ] State history timeline per node - [ ] Input/output inspection - [ ] **Debug Tools** - [ ] Memory diff viewer (state changes between nodes) - [ ] State snapshot comparison - [ ] Memory leak detection ### Local Control Panel Build a dashboard for localized Credential Management (editing the ~/.hive/credentials store safely) and swarm lifecycle management (Start, Pause, Kill, and HITL approvals). - [x] **Credential Management Backend** - [x] CredentialStore with file/env/vault backends (credentials/store.py) - [x] OAuth2 provider support (credentials/oauth2/) - [x] Template resolution and caching - [ ] **Credential Management Dashboard** - [ ] Safe credential editing interface (web UI) - [ ] ~/.hive/credentials store management UI - [ ] Credential validation and testing UI - [ ] Encryption status display - [ ] OAuth2 flow initiation from browser - [ ] **Swarm Lifecycle Management** - [ ] Start/Stop controls for agents - [ ] Pause/Resume functionality - [ ] Kill process management - [ ] HITL approval interface in browser - [ ] Multi-agent orchestration view - [ ] **Monitoring Dashboard** - [ ] Active agents display - [ ] Resource usage monitoring (CPU, memory, tokens) - [ ] Performance metrics visualization - [ ] Execution history ### Local Model Integration Build native frontend configurations to easily connect Open Hive's backend to local open-source inference engines like Ollama, keeping the entire stack offline and private. - [x] **LLM Integration Layer** - [x] Provider-agnostic LLM support via LiteLLM (graph/event_loop_node.py) - [x] Model configuration in agent definitions - [ ] **Local Model Support** - [ ] Ollama integration and configuration - [ ] Local LLM configuration UI - [ ] Model selection and management dashboard - [ ] Model performance monitoring - [ ] **Offline Mode** - [ ] Full offline functionality (no cloud API calls) - [ ] Local-only execution mode flag - [ ] Privacy-first architecture enforcement - [ ] Local model fallback mechanisms - [ ] **Model Configuration** - [ ] Easy model switching in UI - [ ] Model parameter tuning (temperature, top_p, etc.) - [ ] Performance optimization settings - [ ] Multi-model support (different models per node) - [ ] Model cost tracking for local models ### Cross-Platform Support - [ ] **JavaScript/TypeScript SDK** - [ ] TypeScript SDK development - [ ] npm package distribution - [ ] Node.js runtime support - [ ] Browser runtime support - [ ] **Platform Compatibility** - [x] Windows support improvements - [ ] macOS optimization - [ ] Linux distribution support ### Coding Agent Integration - [ ] **IDE Integrations** - [ ] Claude Code integration - [ ] Cursor integration - [ ] Opencode integration - [ ] Antigravity integration - [ ] Codex CLI integration (in progress)