chnjinlei/hive

Author	SHA1	Message	Date
Richard Tang	9ad95fde59	chore: ruff lint	2026-04-09 18:22:16 -07:00
Richard Tang	6eaa609f63	feat: queen scope memory	2026-04-09 17:33:14 -07:00
Hundao	df29c49bd0	fix(test): update queen memory reflection test mocks for litellm format (#6991 ) * fix(test): update queen memory test mocks to match litellm ModelResponse format PR #6976 refactored reflection_agent to extract tool calls from litellm ModelResponse objects (choices[0].message.tool_calls) instead of plain dicts. The test mocks were not updated, causing tool calls to silently fail and two tests to break. Fixes #6990 * style: ruff format	2026-04-08 14:44:11 +08:00
Richard Tang	19469ff404	chore: lint format	2026-04-07 13:57:05 -07:00
Richard Tang	6637bc8d96	feat: simplify memory implementation	2026-04-07 12:08:35 -07:00
Hundao	aaa5d661c3	fix(ci): unbreak main - playwright deps + framework test suite (#6955 ) * fix(tools): move playwright back to main dependencies playwright was moved to the browser extra in `c7e85aa9` as part of the GCU refactor to use a browser extension. But web_scrape_tool still imports playwright at module level and requires it unconditionally, so CI's Test Tools job breaks with ModuleNotFoundError. web_scrape_tool has no fallback without playwright — it's a hard dependency, not optional. Put it back in main deps. Fixes CI failure on Test Tools (ubuntu-latest). * chore: remove dead test_highlights.py script tools/test_highlights.py is orphaned from the GCU refactor in `c7e85aa9`: - imports highlight_coordinate and highlight_element from gcu.browser.highlight, but highlight.py was deleted in that refactor - calls BrowserSession.start(), open_tab(), get_active_page(), stop() — none of these methods exist on the current BrowserSession class The script can't run at all, and it's tripping ruff's I001 import-order check (fail on Lint CI after cache invalidation). * test: fix browser/refs tests broken by GCU refactor Tests were still testing the old Playwright-based API after `c7e85aa9` moved GCU to an extension-bridge architecture. test_refs.py (6 tests): Refs system now produces CSS selectors like [role="button"][aria-label="Submit"]:nth-of-type(1) for the bridge's DOM matcher, instead of Playwright's role=button[name="Submit"] >> nth=0. Updated expected values to match. Renamed test_escapes_quotes_in_name to test_quoted_name_passes_through and added a comment noting that inner quotes aren't currently escaped (follow-up concern). test_browser_tools_comprehensive.py (4 tests): - test_screenshot_full_page: browser_screenshot passes selector=None when no selector is provided; update assertion. - test_file_upload: browser_upload validates file paths exist on disk. Create real tmp files and mock the CDP calls it makes. - test_evaluate_with_bare_return: renamed to test_evaluate_passes_script_through_to_bridge. IIFE wrapping lives in bridge.evaluate, not in the browser_evaluate tool — mocking the bridge bypasses the wrapping logic, so the tool just passes the script through. - test_evaluate_complex_script: browser_evaluate returns bridge's raw result (no 'ok' wrapper); check for 'result' key instead. test_browser_advanced_tools.py (deleted): The whole file patched get_session and page.wait_for_function (the old Playwright-based API). The bug it guarded against (user text interpolated into a JS source string) is architecturally impossible in the new bridge-based tools, which send text via structured RPC. Coverage for browser_wait exists in test_browser_tools_comprehensive.py. * test(core): fix event_loop tests broken by hive-v1 refactor Several framework tests were left failing or hanging after the hive-v1 refactor landed. This un-breaks CI without touching production code. - Worker auto-escalation: 8 tests were hanging because EventLoopNode with event_bus treats non-queen/non-subagent nodes as workers and auto-escalates to queen, then blocks on _await_user_input forever (no queen in standalone tests). Opt out via is_subagent_mode=True. - MockConversationStore: added clear() to match the production store (storage/conversation_store.py), which event_loop_node.py:425 calls. - Executor output semantics: result.output now only contains terminal- node outputs; two handoff tests now read intermediate outputs from result.session_state["data_buffer"]. - Restore filter: test_restore_from_checkpoint needs set_current_phase so restore()'s phase_id filter matches. - Removed two _build_context tests whose target method no longer exists (replaced by standalone build_node_context()). Remaining execution_id coverage is adequate in TestExecutionId + integration tests. * style: ruff format + drop em dash in comment * test(core): fix remaining framework tests broken by hive-v1 refactor Rounds out the fix started in the previous commit. Full framework suite now passes (1589 passed, 0 failed). - conftest.py: force-bind framework.runner submodules (mcp_registry, mcp_client, mcp_connection_manager) as attributes on the parent package. Without this, pytest monkeypatch.setattr with dotted-string paths fails because the attribute walker can't resolve the submodule even though __init__.py imports from it. Affects ~25 MCP tests. - test_queen_memory: _execute_tool() grew a required caller kwarg for worker type-restrictions. Pass caller="queen" so path-traversal checks run without caller restrictions interfering. - test_session_manager_worker_handoff: _subscribe_worker_digest was removed in the refactor, dropped the dead monkeypatches. - test_skill_context_protection: NodeConversation now reads _run_id in add_tool_result(), so the __new__-based test helper has to initialise it. - test_node_conversation: restore() now filters parts by run_id for crash recovery. Renamed the stale test and flipped the assertion to match the new filtering semantics. - test_tool_registry: CONTEXT_PARAMS was updated (workspace_id out, profile in). Switched the test's example stripped params. * docs: drop circular PR reference in test_refs comment Addresses CodeRabbit nitpick. The comment referenced the PR that was adding the comment, which becomes a self-reference after merge.	2026-04-05 14:21:32 +08:00
Richard Tang	ed8d417bef	chore: ruff lint	2026-04-03 20:31:14 -07:00
Richard Tang	95655a4c85	feat: better reflection tracking	2026-04-03 16:09:46 -07:00
Richard Tang	771efd5ce4	feat: simplify worker reflection	2026-04-03 13:03:47 -07:00
Richard Tang	4f588b3010	fix: remove outdated memory cursor design	2026-04-03 12:38:05 -07:00
Richard Tang	ec08ae7438	feat: worker agent memory	2026-04-02 17:05:32 -07:00
Richard Tang	1a37fb2f36	tests: add tests to memory functions	2026-04-01 17:41:25 -07:00
Richard Tang	1765e1cb6c	feat: debugger and simplication	2026-04-01 17:28:54 -07:00
Richard Tang	b25de61363	feat(wip): new queen memory	2026-04-01 15:03:21 -07:00
Sundaram Kumar Jha	890d303d26	test: cover queen memory date formatting on Windows	2026-03-27 09:46:42 +05:30

15 Commits