# Product Roadmap
Aden Agent Framework aims to help developers build outcome-oriented, self-adaptive agents. Please find our roadmap here
```mermaid
flowchart TB
%% Main Entity
User([User])
%% =========================================
%% EXTERNAL EVENT SOURCES
%% =========================================
subgraph ExtEventSource [External Event Source]
E_Sch["Schedulers"]
E_WH["Webhook"]
E_SSE["SSE"]
end
%% =========================================
%% SYSTEM NODES
%% =========================================
subgraph WorkerBees [Worker Bees]
WB_C["Conversation"]
WB_SP["System prompt"]
subgraph Graph [Graph]
direction TB
N1["Node"] --> N2["Node"] --> N3["Node"]
N1 -.-> AN["Active Node"]
N2 -.-> AN
N3 -.-> AN
%% Nested Event Loop Node
subgraph EventLoopNode [Event Loop Node]
ELN_L["listener"]
ELN_SP["System Prompt
(Task)"]
ELN_EL["Event loop"]
ELN_C["Conversation"]
end
end
end
subgraph JudgeNode [Judge]
J_C["Criteria"]
J_P["Principles"]
J_EL["Event loop"] <--> J_S["Scheduler"]
end
subgraph QueenBee [Queen Bee]
QB_SP["System prompt"]
QB_EL["Event loop"]
QB_C["Conversation"]
end
subgraph Infra [Infra]
SA["Sub Agent"]
TR["Tool Registry"]
WTM["Write through Conversation Memory
(Logs/RAM/Harddrive)"]
SM["Shared Memory
(State/Harddrive)"]
EB["Event Bus
(RAM)"]
CS["Credential Store
(Harddrive/Cloud)"]
end
subgraph PC [PC]
B["Browser"]
CB["Codebase
v 0.0.x ... v n.n.n"]
end
%% =========================================
%% CONNECTIONS & DATA FLOW
%% =========================================
%% External Event Routing
E_Sch --> ELN_L
E_WH --> ELN_L
E_SSE --> ELN_L
ELN_L -->|"triggers"| ELN_EL
%% User Interactions
User -->|"Talk"| WB_C
User -->|"Talk"| QB_C
User -->|"Read/Write Access"| CS
%% Inter-System Logic
ELN_C <-->|"Mirror"| WB_C
WB_C -->|"Focus"| AN
WorkerBees -->|"Inquire"| JudgeNode
JudgeNode -->|"Approve"| WorkerBees
%% Judge Alignments
J_C <-.->|"aligns"| WB_SP
J_P <-.->|"aligns"| QB_SP
%% Escalate path
J_EL -->|"Report (Escalate)"| QB_EL
%% Pub/Sub Logic
AN -->|"publish"| EB
EB -->|"subscribe"| QB_C
%% Infra and Process Spawning
ELN_EL -->|"Spawn"| SA
SA -->|"Inform"| ELN_EL
SA -->|"Starts"| B
B -->|"Report"| ELN_EL
TR -->|"Assigned"| EventLoopNode
CB -->|"Modify Worker Bee"| WorkerBees
%% =========================================
%% SHARED MEMORY & LOGS ACCESS
%% =========================================
%% Worker Bees Access
Graph <-->|"Read/Write"| WTM
Graph <-->|"Read/Write"| SM
%% Queen Bee Access
QB_C <-->|"Read/Write"| WTM
QB_EL <-->|"Read/Write"| SM
%% Credentials Access
CS -->|"Read Access"| QB_C
```
---
## Core Architecture & Swarm Primitives
### Node-Based Architecture
Implement the core execution engine where every Agent operates as an isolated, asynchronous graph of nodes.
- [x] **Core Node Implementation**
- [x] NodeProtocol with JSON parsing utilities (graph/node.py)
- [x] EventLoopNode with LLM conversation management (graph/event_loop_node.py)
- [x] Flexible input/output keys with nullable output handling
- [x] Node wrapper SDK for agent creation
- [x] Tool access layer with MCP integration
- [x] **Graph Executor**
- [x] Graph traversal execution (graph/executor.py)
- [x] Node transition management
- [x] Error handling and output mapping
- [x] ExecutionResult with success/error status
- [x] **Shared Memory Access**
- [x] SharedState manager (runtime/shared_state.py)
- [x] Session-based storage (storage/session_store.py)
- [x] Isolation levels: ISOLATED, SHARED, SYNCHRONIZED
- [x] **Default Monitoring Hooks**
- [ ] Performance metrics collection
- [ ] Resource usage tracking
- [ ] Health check endpoints
### Node Protocol
Build the standard communication protocol for inter-node messaging and data passing.
- [x] **Edge Specifications**
- [x] ALWAYS: Always traverse (graph/edge.py)
- [x] ON_SUCCESS: Success-based routing
- [x] ON_FAILURE: Failure-based routing
- [x] CONDITIONAL: Expression-based routing with safe_eval
- [x] LLM_DECIDE: Goal-aware LLM-powered routing
- [x] **Event Bus System**
- [x] Full event bus implementation (runtime/event_bus.py)
- [x] LLM text deltas, tool calls, node transitions
- [x] Graph-scoped event routing for multi-agent scenarios
- [x] **Conversation Management**
- [x] NodeConversation tracks message history (graph/conversation.py)
- [x] Tool results, streaming content, metadata support
### Judge in Event Loop
A separate LLM-powered judge to determine if the workers finish their job.
- [x] **Conversation Judge (Level 2)**
- [x] Evaluates node completion against success criteria (graph/conversation_judge.py)
- [x] Reads recent conversation and assesses quality
- [x] Returns verdict: ACCEPT or RETRY with confidence scores
- [x] **Test Evaluation Judge**
- [x] Provider-agnostic (OpenAI, Anthropic, Google Gemini) (testing/llm_judge.py)
- [x] JSON response parsing for structured evaluation
- [ ] **Multi-Level Judgment Integration**
- [ ] Judge node integration with event loop
- [ ] Automatic retry logic based on judge verdict
- [ ] Judge performance monitoring
### Swarm Hierarchy
Develop the distinct behavioral logic for the Queen Bee (Orchestrator), Judge Bee (Evaluator), and Worker Bee (Executor).
- [x] **Judge Bee (Evaluator)**
- [x] Evaluation criteria framework (graph/goal.py)
- [x] Success/failure determination
- [x] Quality assessment with confidence scores
- [x] **Hive Coder Agent (Builder)**
- [x] Coder node: forever-alive event loop (agents/hive_coder/nodes/)
- [x] Guardian node: event-driven watchdog for supervised agents
- [x] Tool discovery (discover_mcp_tools)
- [x] Agent aware (list_agents, inspect sessions)
- [x] Post-build testing (run_agent_tests)
- [x] Debugging capabilities (inspect checkpoints, memory)
- [ ] **Queen Bee (Orchestrator)**
- [ ] Multi-agent coordination layer
- [ ] Task distribution logic
- [ ] Dynamic worker agent creation
- [ ] Swarm-level goal management
- [ ] **Worker Bee (Executor)**
- [ ] Worker taxonomy definition
- [ ] Worker agent templates
- [ ] Task execution patterns
### Coding Agent Workflows
Implement the Goal Creation Session via the Queen Bee and the dynamic Worker Agent Creation flow.
- [x] **Goal Creation Session**
- [x] Goal object schema definition (graph/goal.py)
- [x] SuccessCriterion: Measurable success (5+ criteria per goal)
- [x] Constraint: Hard/soft boundaries (time, cost, safety, scope, quality)
- [x] GoalStatus: DRAFT → READY → ACTIVE → COMPLETED/FAILED
- [x] Instruction back and forth in Hive Coder
- [x] Test case generation
- [x] Test case validation for worker agent
- [x] **Agent Creation Flow**
- [x] Hive Coder reads templates and discovers tools (builder/package_generator.py)
- [x] Generates agent.py, nodes/__init__.py, config.py
- [x] MCP server configuration discovery
- [x] Dynamic tool binding
- [ ] **Worker Agent Dynamic Creation**
- [ ] Template agent initialization from Queen Bee
- [ ] Runtime worker instantiation
- [ ] Worker lifecycle management
### Security Layer
Build robust, local Credential Management interfaces for secure API key handling.
- [x] **Unified Credential Store**
- [x] Multi-backend storage (credentials/store.py)
- [x] EncryptedFileStorage: Encrypted local storage (~/.hive/credentials)
- [x] EnvVarStorage: Environment variable mapping
- [x] InMemoryStorage: Testing
- [x] HashiCorp Vault: Enterprise secrets (credentials/storage.py)
- [x] Template resolution: `{{cred.key}}` patterns
- [x] Caching with TTL (default 5 min, configurable)
- [x] Thread-safe operations with RLock
- [x] **OAuth2 Providers**
- [x] Base provider pattern (credentials/oauth2/)
- [x] HubSpot provider integration
- [x] Lifecycle management (refresh tokens)
- [x] Browser opening for auth flows (tools/credentials/browser.py)
- [x] **Aden Sync Provider**
- [x] Syncs OAuth2 tokens from Aden authentication server (credentials/aden/)
- [x] Falls back to local storage if Aden unavailable
- [x] Auto-refresh on sync
- [ ] **Enterprise Secret Managers**
- [ ] AWS Secrets Manager integration
- [ ] Azure Key Vault integration
- [ ] Audit logging for compliance/tracking
- [ ] Per-environment configuration support
---
## Tooling Ecosystem & General Compute
### Sub-agents Parallel Execution
Develop the Sub-agent execution environment for parallel tasks execution. The subagents are designed with isolation for repeatability.
- [x] **Multi-Graph Sessions**
- [x] Load multiple agent graphs in single session (runtime/agent_runtime.py)
- [x] Shared state between graphs
- [x] Independent execution streams
- [x] Graph lifecycle management (load/unload/start/restart)
- [x] **Concurrent Execution Management**
- [x] Max concurrent executions configuration
- [x] Isolation levels: isolated, shared, synchronized
- [ ] **Sub-agent Execution Environment**
- [ ] Isolated sub-agent runtime environment
- [ ] Task isolation mechanisms
- [ ] Result aggregation
- [ ] Error handling for parallel tasks
- [ ] Repeatability guarantees
### Browser Use Node
Implement native browser-integrated automation so agents can take over a browser for auth and agents perform the automation jobs. This node comes with a specific set of tools and system prompts.
- [x] **Web Scraping with Playwright**
- [x] Headless Chromium launch (tools/web_scrape_tool/)
- [x] Stealth mode via playwright_stealth
- [x] JavaScript rendering with wait-for-domcontentloaded
- [x] CSS selector support
- [x] User-agent spoofing
- [x] Sandbox/automation detection evasion
- [x] **Browser Launch Utilities**
- [x] Platform-specific browser opening (macOS/Linux/Windows) (tools/credentials/browser.py)
- [x] OAuth2 flow integration
- [ ] **Full Browser Use Node**
- [ ] Multi-page automation workflows
- [ ] Form filling with vision-guided interactions
- [ ] Interactive screenshot capabilities
- [ ] Session management across navigations
- [ ] Browser-specific tool set
- [ ] System prompts for browser tasks
### Core Graph Framework Infra
Ship essential framework utilities: Node validation, HITL (Human-in-the-loop pause/approve), and node lifecycle management.
- [x] **Node Validation**
- [x] Pydantic-based validation
- [x] Schema enforcement
- [x] Output key validation (Level 0)
- [x] **Human-in-the-Loop (HITL)**
- [x] HITLRequest and HITLResponse protocol (graph/hitl.py)
- [x] Question types: FREE_TEXT, STRUCTURED, SELECTION, APPROVAL, MULTI_FIELD
- [x] Haiku-powered response parsing
- [x] User-friendly display formatting
- [x] Pause/approve workflow
- [x] State saved to checkpoint
- [x] Resume with HITLResponse merged into context
- [x] ~~**TUI Integration**~~ *(deprecated — see AGENTS.md; use `hive open` browser UI instead)*
- [x] ~~Chat REPL with streaming support (tui/app.py)~~
- [x] ~~Multi-graph session management~~
- [x] ~~User presence detection~~
- [x] ~~Real-time log viewing~~
- [x] **Node Lifecycle Management**
- [x] Start/stop/pause/resume in execution stream
- [x] State persistence via checkpoint store
- [x] Recovery mechanisms with checkpoint restore
- [ ] **Advanced HITL Features**
- [ ] Callback handlers for custom intervention logic
- [ ] Streaming interface for real-time monitoring
- [ ] Approval workflows at scale
### Infrastructure Tools
Port popular tools, and build out the Runtime Log, Audit Trail, Excel, and Email integrations.
- [x] **File Operations (36+ tools)**
- [x] read_file, write_file, edit_file (builder/package_generator.py)
- [x] list_directory, search_files
- [x] apply_diff / apply_patch for code modification (tools/file_system_toolkits/)
- [x] data_tools (CSV/Excel parsing)
- [x] **Web Tools**
- [x] Web Search (tools/web_search_tool/)
- [x] Web Scraper (tools/web_scrape_tool/)
- [x] Exa Search (tools/exa_search_tool/)
- [x] News Tool (tools/news_tool/)
- [x] SerpAPI (tools/serpapi_tool/)
- [x] **Data Tools**
- [x] CSV tools (tools/csv_tool/)
- [x] Excel tools (tools/excel_tool/)
- [x] PDF tools (tools/pdf_read_tool/)
- [x] Vision tool for image analysis (tools/vision_tool/)
- [x] Time tool (tools/time_tool/)
- [x] **Communication Tools (8 tools)**
- [x] Email tool (tools/email_tool/)
- [x] Gmail tool (tools/gmail_tool/)
- [x] Slack tool (tools/slack_tool/)
- [x] Discord tool (tools/discord_tool/)
- [x] Telegram tool (tools/telegram_tool/)
- [x] Google Docs (tools/google_docs_tool/)
- [x] Google Maps (tools/google_maps_tool/)
- [x] Cal.com (tools/calcom_tool/)
- [x] **CRM/API Integrations (5+ tools)**
- [x] HubSpot (tools/hubspot_tool/)
- [x] GitHub (tools/github_tool/)
- [x] Apollo (tools/apollo_tool/)
- [x] BigQuery (tools/bigquery_tool/)
- [x] Razorpay (tools/razorpay_tool/)
- [x] Calendar (tools/calendar_tool/)
- [x] **Security/Scanning Tools (5 tools)**
- [x] DNS Security Scanner (tools/dns_security_scanner/)
- [x] SSL/TLS Scanner (tools/ssl_tls_scanner/)
- [x] Port Scanner (tools/port_scanner/)
- [x] Subdomain Enumerator (tools/subdomain_enumerator/)
- [x] Tech Stack Detector (tools/tech_stack_detector/)
- [x] **Runtime & Logging**
- [x] Runtime Log Tool (tools/runtime_logs_tool/)
- [x] Runtime Logger with L1/L2/L3 levels (runtime/runtime_logger.py)
- [ ] **Audit Trail System**
- [ ] Decision tracing beyond logs
- [ ] Compliance reporting
- [ ] Historical query capabilities
---
## Memory, Storage & File System Capabilities
### Memory Tools
Simple pure file-based memory management
- [x] **Short-Term Memory (STM)**
- [x] SharedState manager for in-memory state (runtime/shared_state.py)
- [x] Session-based storage (storage/session_store.py)
- [x] State-based short-term memory layer
- [x] **Conversation Memory**
- [x] NodeConversation tracks message history (graph/conversation.py)
- [x] Tool results, streaming content, metadata
- [x] Context building for LLM prompts
- [ ] **Long-Term Memory (LTM)**
- [ ] Semantic indexing for memory retrieval
- [ ] RLM (Retrieval-augmented Long-term Memory) implementation
- [ ] Memory persistence beyond session
- [ ] Content-based memory search
### Durable Scratchpad
Integrate a lightweight, persistent DB for long-term memory using the filesystem-as-scratchpad pattern.
- [x] **Filesystem as Scratchpad**
- [x] File-based persistence layer (storage/)
- [x] Session store implementation
- [x] Data durability guarantees
- [x] **Checkpoint System**
- [x] Save/restore execution state (storage/checkpoint_store.py)
- [x] TTL-based cleanup
- [x] Async checkpoint support
- [x] Max age configuration
- [ ] **Message Model & Session Management**
- [ ] Message class with structured content types
- [ ] Session classes for conversation state
- [ ] Per-message file persistence
- [ ] Migration from monolithic run storage
### Memory Isolation
Enforce session-local memory isolation to prevent data bleed between concurrent agent runs.
- [x] **Session Isolation**
- [x] Session-local memory implementation (storage/session_store.py)
- [x] Data bleed prevention
- [x] Concurrent run safety
- [x] Isolation levels: ISOLATED, SHARED, SYNCHRONIZED
- [x] **State Management**
- [x] SharedState with thread-safe operations (runtime/shared_state.py)
- [x] Session-scoped state access
- [ ] **Context Management**
- [ ] Message.stream(sessionID) implementation
- [ ] Full context building optimization
- [ ] Message to model conversion improvements
### Agent Capabilities
Implement File I/O support, streaming mode, and allow users to supply custom functions as libraries/nodes.
- [x] **File I/O**
- [x] File read/write operations (builder/package_generator.py)
- [x] File system navigation
- [x] Directory listing and search
- [x] **Execution Streaming**
- [x] Real-time event streaming (runtime/execution_stream.py)
- [x] Token-by-token output via event bus
- [x] Tool call streaming
- [x] **Custom Tool Integration**
- [x] MCP server discovery (builder/package_generator.py)
- [x] Dynamic tool binding
- [x] Custom tool registration
- [ ] **Streaming Mode Enhancements**
- [ ] Progressive result delivery optimization
- [ ] Backpressure handling
- [ ] **Custom Function Libraries**
- [ ] User-supplied function libraries as nodes
- [ ] Library versioning and management
- [ ] **Proactive Memory Compaction**
- [ ] Overflow detection
- [ ] Backward-scanning pruning strategy
- [ ] Token tracking integration for compaction decisions
### File System Enhancements
Add semantic search capabilities and an interactive file system for frontend product integration.
- [x] **File Search**
- [x] search_files tool (builder/package_generator.py)
- [x] Directory traversal
- [ ] **Semantic Search**
- [ ] Semantic indexing of files
- [ ] Natural language file search
- [ ] Content-based retrieval with embeddings
- [ ] **Interactive File System**
- [ ] Frontend file browser integration
- [ ] Real-time file system updates
- [ ] Visual file navigation in GUI
---
## Eval System, DX, & Open Source Guardrails
### Eval System
Build the failure recording mechanism and an SDK for defining custom failure conditions.
- [x] **Multi-Level Evaluation**
- [x] Level 0: Output key validation (all required keys set)
- [x] Level 1: Literal checks (output_contains, output_equals)
- [x] Level 2: Conversation-aware judgment (graph/conversation_judge.py)
- [x] **Goal-Based Constraints**
- [x] Hard constraints (violation = failure) (graph/goal.py)
- [x] Soft constraints (prefer not to violate)
- [x] Categories: time, cost, safety, scope, quality
- [x] Constraint checking infrastructure
- [x] **Success Criteria Definition**
- [x] Weighted criteria (0.0-1.0)
- [x] Metrics: output_contains, output_equals, llm_judge, custom
- [x] 90% threshold for goal success
- [x] **Test Framework**
- [x] TestCase, TestResult, TestStorage classes (testing/)
- [x] LLM-based judgment for semantic evaluation (testing/llm_judge.py)
- [x] Approval CLI for manual approval workflows
- [x] Categorization and test result reporting
- [ ] **Failure Recording**
- [ ] Failure capture mechanism
- [ ] Failure analysis tools
- [ ] Historical failure tracking
- [ ] Continuous improvement loop
- [ ] **Custom Failure Conditions SDK**
- [ ] SDK for defining custom failure conditions
- [ ] Custom evaluator framework extension
- [ ] Condition validation DSL
### Guardrails SDK
Implement deterministic condition guardrails directly in the node, complete with mitigation tracking and audit logs.
- [x] **Goal Constraints (Basic Guardrails)**
- [x] Hard/soft constraint definitions (graph/goal.py)
- [x] Constraint checking in goals
- [ ] **Deterministic Guardrails SDK**
- [ ] In-node guardrail implementation
- [ ] Condition-based guardrails
- [ ] Guardrail SDK for custom rules
- [ ] **Monitoring & Tracking**
- [ ] Mitigation tracking for violations
- [ ] Audit log system for guardrails
- [ ] Compliance reporting
- [ ] **Basic Monitoring Hooks**
- [ ] Agent node SDK monitoring hooks
- [ ] Event hook system for guardrails
- [ ] Default monitoring hooks in nodes
### DevTools CLI
Release CLI tools specifically for rapid memory management and credential store editing.
- [x] **Main CLI**
- [x] Run, info, validate, list commands (cli.py)
- [x] Dispatch mode for batch execution
- [x] Shell mode for interactive use
- [x] Model selection configuration
- [x] **Testing CLI**
- [x] test-run, test-debug, test-list, test-stats (testing/cli.py)
- [x] Pytest integration
- [x] Test categorization
- [x] ~~**TUI (Terminal UI)**~~ *(deprecated — see AGENTS.md; use `hive open` browser UI instead)*
- [x] ~~Interactive chat with streaming (tui/app.py)~~
- [x] ~~Multi-graph management UI~~
- [x] ~~Log pane for real-time output~~
- [x] ~~Keyboard shortcuts (Ctrl+C, Ctrl+D, etc.)~~
- [ ] **Memory Management CLI**
- [ ] Memory inspection commands
- [ ] Memory cleanup utilities
- [ ] Session management commands
- [ ] **Credential Store CLI**
- [ ] Interactive credential editing
- [ ] Secure credential viewer
- [ ] Credential validation tools
- [ ] **Debugging Tools**
- [ ] Interactive debugging mode beyond TUI
- [ ] Breakpoint support in execution
- [ ] Step-through execution
### Observability
Support user-driven log analysis, basic monitoring hooks from the SDK, and an interactive debugging mode.
- [x] **Runtime Logging**
- [x] L1 (summary), L2 (detailed), L3 (tool) logging levels (runtime/runtime_logger.py)
- [x] Session logs directory storage
- [x] Audit trail for decision tracing in logs
- [x] **Event Bus Monitoring**
- [x] Real-time event streaming (runtime/event_bus.py)
- [x] LLM text deltas, tool calls, node transitions
- [x] Graph-scoped event routing
- [ ] **Log Analysis Tools**
- [ ] User-driven log analysis (OSS approach)
- [ ] Log aggregation utilities
- [ ] Log visualization tools
- [ ] **Monitoring Hooks**
- [ ] Basic observability hooks from SDK
- [ ] Performance metrics collection
- [ ] Health checks system
- [ ] **Token Tracking**
- [ ] Reasoning token tracking
- [ ] Cache token tracking
- [ ] Token metrics in compaction logic
### Developer Success
Write the Quick Start guide, detailed tool usage documentation, and set up the MVP README examples.
- [x] **Documentation**
- [x] Quick start guide
- [x] Goal creation guide
- [x] Agent creation guide
- [x] README with examples
- [x] Contributing guidelines
- [x] GitHub Page setup
- [x] **Tool Usage Documentation**
- [ ] Comprehensive tool documentation
- [ ] Tool integration examples
- [ ] Best practices guide
- [ ] **Video Content**
- [ ] Introduction video
- [ ] Tutorial videos
- [ ] **Example Agents**
- [ ] Knowledge agent template
- [ ] Blog writer agent template
- [ ] SDR agent template
---
## Deployment, CI/CD & Community Templates
### Self-Deployment
Standardize the Docker container builds and establish headless backend execution APIs.
- [x] **Docker Support**
- [x] Python 3.11-slim base image (tools/Dockerfile)
- [x] Playwright Chromium installation
- [x] Non-root user for security
- [x] Health check endpoint
- [x] Volume mount for workspace persistence
- [x] Exposes port 4001 for MCP server
- [x] **Agent Runtime**
- [x] AgentRuntime: Top-level orchestrator (runtime/agent_runtime.py)
- [x] Multiple entry points (manual, webhook, timer, event, api)
- [x] Concurrent execution management
- [x] State persistence via session store
- [x] Outcome aggregation
- [x] **Async Entry Points**
- [x] AsyncEntryPointSpec: Webhook, timer, event triggers (graph/edge.py)
- [x] Timer config: cron expressions or interval_minutes
- [x] Event triggers for custom events
- [x] Isolation levels: isolated, shared, synchronized
- [ ] **Headless Backend Enhancements**
- [ ] Standardized backend execution APIs
- [ ] Frontend attachment interface
- [ ] Self-hosted setup guide with examples
### Lifecycle APIs
Expose basic REST/WebSocket endpoints for external control (Start, Stop, Pause, Resume).
- [x] **Webhook Server**
- [x] FastAPI-based webhook server (runtime/webhook_server.py)
- [x] Route configuration per entry point
- [x] Optional secret validation
- [x] **Graph Lifecycle Management**
- [x] Load/unload/start/restart in AgentRuntime
- [x] State persistence
- [x] Recovery mechanisms
- [x] **REST API Endpoints**
- [ ] Start endpoint for agent execution
- [ ] Stop endpoint for graceful shutdown
- [ ] Pause endpoint for execution suspension
- [ ] Resume endpoint for continuation
- [ ] Status query endpoint for monitoring
- [ ] **WebSocket API**
- [ ] Real-time event streaming to clients
- [ ] Bidirectional communication
- [ ] Connection management with reconnection
### CI/CD Pipelines
Implement automated test execution, agent version control, and mandatory test-passing for deployment.
- [x] **Test Execution**
- [x] Test framework with pytest integration (testing/)
- [x] Test result reporting
- [x] Test CLI commands (test-run, test-debug, etc.)
- [x] **Automated Testing Pipeline**
- [ ] CI integration (GitHub Actions, etc.)
- [ ] Mandatory test-passing gates
- [ ] Coverage reporting
- [ ] **Version Control**
- [ ] Agent versioning system
- [ ] Semantic versioning for agents
- [ ] Version compatibility checks
- [ ] **Deployment Automation**
- [ ] Continuous deployment pipeline
- [ ] Rollback mechanisms
- [ ] Blue-green deployment support
### Distribution
Launch the official PyPI package, Docker Hub image, and the community Discord channel.
- [ ] **Package Distribution**
- [ ] Official PyPI package
- [ ] Docker Hub image publication
- [ ] Version release automation
- [ ] Installation documentation
- [ ] **Community Channels**
- [ ] Discord channel setup
- [ ] Community support structure
- [ ] Contribution guidelines enforcement
- [ ] **Cloud Deployment**
- [ ] AWS Lambda integration
- [ ] GCP Cloud Functions support
- [ ] Azure Functions support
- [ ] 3rd-party platform integrations
- [ ] Self-deploy with orchestrator connection
### Example Agents
Ship ~20 ready-to-use templates including GTM Sales, Marketing, Analytics, Training, and Smart Entry agents.
- [x] **Hive Coder Agent**
- [x] Agent builder template (agents/hive_coder/)
- [x] Guardian node for supervision
- [ ] **Sales & Marketing Agents**
- [ ] GTM Sales Agent (workflow automation)
- [ ] GTM Marketing Agent (campaign management)
- [ ] Lead generation agent
- [ ] Email campaign agent
- [ ] Social media agent
- [ ] **Analytics & Insights Agents**
- [ ] Analytics Agent (data analysis)
- [ ] Data processing agent
- [ ] Report generation agent
- [ ] Dashboard agent
- [ ] **Training & Education Agents**
- [ ] Training Agent (onboarding)
- [ ] Content creation agent
- [ ] Knowledge base agent
- [ ] Documentation agent
- [ ] **Automation & Forms Agents**
- [ ] Smart Entry / Form Agent (self-evolution emphasis)
- [ ] Data validation agent
- [ ] Workflow automation agent
- [ ] Integration agent
- [ ] **Additional Templates**
- [ ] Customer support agent
- [ ] Document processing agent
- [ ] Scheduling agent
- [ ] Research agent
- [ ] Code review agent
---
## Open Hive
### Local API Gateway
Build a lightweight local server (e.g., FastAPI or Node) that securely exposes the Hive framework's core Event Bus and Memory Layer to the local browser environment.
- [x] **MCP Server Foundation**
- [x] FastMCP server implementation (builder/package_generator.py)
- [x] Agent builder tools exposed
- [x] Port 4001 exposed in Docker
- [x] **Event Bus Architecture**
- [x] Event Bus implementation (runtime/event_bus.py)
- [x] Real-time event streaming
- [x] Graph-scoped event routing
- [ ] **Local API Gateway**
- [ ] Lightweight local server (FastAPI or Node)
- [ ] Secure authentication layer for browser
- [ ] CORS and security configuration
- [ ] Event Bus API endpoints for browser access
- [ ] Event subscription management for frontend
- [ ] **Memory Layer API**
- [ ] Memory read/write endpoints
- [ ] Session management API for frontend
- [ ] Memory visualization data endpoints
### Visual Graph Explorer
Implement an interactive, drag-and-drop canvas (using libraries like React Flow) to visualize the Worker Graph, Queen Bee, and active execution paths in real-time.
- [ ] **Graph Visualization**
- [ ] React Flow integration
- [ ] Worker Graph rendering from agent definitions
- [ ] Node type visualization (EventLoop, Function, etc.)
- [ ] Edge visualization with condition types
- [ ] Active execution path highlighting
- [ ] **Interactive Features**
- [ ] Drag-and-drop canvas for graph editing
- [ ] Node editing capabilities
- [ ] Real-time graph updates during execution
- [ ] Zoom and pan controls
- [ ] Node inspection on click
- [ ] **Integration with Runtime**
- [ ] Live execution visualization
- [ ] Node state indicators
- [ ] Edge traversal animation
### TUI to GUI Upgrade
Port the existing Terminal User Interface (TUI) into a rich web application, allowing users to interact directly with the Queen Bee / Coding Agent via a browser chat interface.
> **Note:** The legacy TUI (`tui/app.py`) is deprecated and no longer maintained (see AGENTS.md). The items below reflect legacy work completed before deprecation. New development should target the browser-based GUI (`hive open`).
- [x] ~~**TUI Foundation**~~ *(deprecated)*
- [x] ~~Terminal chat interface (tui/app.py)~~
- [x] ~~Streaming support~~
- [x] ~~Multi-graph management~~
- [x] ~~Log pane display~~
- [x] ~~Keyboard shortcuts~~
- [ ] **Web Application**
- [ ] Modern web UI framework setup (React/Vue/Svelte)
- [ ] Responsive design implementation
- [ ] Cross-browser compatibility
- [ ] **Chat Interface**
- [ ] Browser-based chat UI
- [ ] Hive Coder interaction (Queen Bee proxy)
- [ ] Coding Agent interface
- [ ] Message history and search
- [ ] Rich message formatting (markdown, code blocks)
- [ ] **TUI Feature Parity**
- [ ] All TUI commands in GUI
- [ ] Keyboard shortcuts in browser
- [ ] Command palette (Cmd+K style)
### Memory & State Inspector
Create a UI component to inspect the Shared Memory and Write-Through Conversation Memory, allowing developers to click on any node and see exactly what it is thinking.
- [x] **Runtime Logs Tool**
- [x] Inspect agent session logs (tools/runtime_logs_tool/)
- [x] Session state retrieval (builder/package_generator.py)
- [ ] **Memory Inspector UI**
- [ ] Shared Memory visualization
- [ ] Conversation memory view (NodeConversation display)
- [ ] Memory search and filter
- [ ] Memory timeline view
- [ ] **Node State Inspection**
- [ ] Click-to-inspect functionality
- [ ] Node thought process display (LLM reasoning)
- [ ] State history timeline per node
- [ ] Input/output inspection
- [ ] **Debug Tools**
- [ ] Memory diff viewer (state changes between nodes)
- [ ] State snapshot comparison
- [ ] Memory leak detection
### Local Control Panel
Build a dashboard for localized Credential Management (editing the ~/.hive/credentials store safely) and swarm lifecycle management (Start, Pause, Kill, and HITL approvals).
- [x] **Credential Management Backend**
- [x] CredentialStore with file/env/vault backends (credentials/store.py)
- [x] OAuth2 provider support (credentials/oauth2/)
- [x] Template resolution and caching
- [ ] **Credential Management Dashboard**
- [ ] Safe credential editing interface (web UI)
- [ ] ~/.hive/credentials store management UI
- [ ] Credential validation and testing UI
- [ ] Encryption status display
- [ ] OAuth2 flow initiation from browser
- [ ] **Swarm Lifecycle Management**
- [ ] Start/Stop controls for agents
- [ ] Pause/Resume functionality
- [ ] Kill process management
- [ ] HITL approval interface in browser
- [ ] Multi-agent orchestration view
- [ ] **Monitoring Dashboard**
- [ ] Active agents display
- [ ] Resource usage monitoring (CPU, memory, tokens)
- [ ] Performance metrics visualization
- [ ] Execution history
### Local Model Integration
Build native frontend configurations to easily connect Open Hive's backend to local open-source inference engines like Ollama, keeping the entire stack offline and private.
- [x] **LLM Integration Layer**
- [x] Provider-agnostic LLM support via LiteLLM (graph/event_loop_node.py)
- [x] Model configuration in agent definitions
- [ ] **Local Model Support**
- [ ] Ollama integration and configuration
- [ ] Local LLM configuration UI
- [ ] Model selection and management dashboard
- [ ] Model performance monitoring
- [ ] **Offline Mode**
- [ ] Full offline functionality (no cloud API calls)
- [ ] Local-only execution mode flag
- [ ] Privacy-first architecture enforcement
- [ ] Local model fallback mechanisms
- [ ] **Model Configuration**
- [ ] Easy model switching in UI
- [ ] Model parameter tuning (temperature, top_p, etc.)
- [ ] Performance optimization settings
- [ ] Multi-model support (different models per node)
- [ ] Model cost tracking for local models
### Cross-Platform Support
- [ ] **JavaScript/TypeScript SDK**
- [ ] TypeScript SDK development
- [ ] npm package distribution
- [ ] Node.js runtime support
- [ ] Browser runtime support
- [ ] **Platform Compatibility**
- [x] Windows support improvements
- [ ] macOS optimization
- [ ] Linux distribution support
### Coding Agent Integration
- [ ] **IDE Integrations**
- [ ] Claude Code integration
- [ ] Cursor integration
- [ ] Opencode integration
- [ ] Antigravity integration
- [ ] Codex CLI integration (in progress)