chnjinlei/hive

Author	SHA1	Message	Date
Timothy	e2bfb9d3af	fix: frame resize	2026-04-19 13:02:12 -07:00
Timothy	e55cea97ef	fix: diagnostics	2026-04-19 12:52:04 -07:00
Timothy	ddaafe0307	Merge remote-tracking branch 'origin/main' into fix/image-coordinate-precision	2026-04-18 23:32:28 -07:00
Richard Tang	8e4468851c	chore: ruff format	2026-04-18 21:45:34 -07:00
Richard Tang	82ffcb17ac	Merge remote-tracking branch 'origin/main' into fix/colony-skill-leak	2026-04-18 21:36:23 -07:00
Richard Tang	656401e199	feat: real snapshot after interaction	2026-04-18 19:51:52 -07:00
Hundao	90aadf247a	fix(ci): unbreak main — ruff format, test_refs, test_model_catalog (#7084 ) * fix(ci): apply ruff format to browser tool files Refs #7083 * fix(ci): unbreak test_refs (img regression) and test_model_catalog test_refs: - Add `img` back to CONTENT_ROLES so named images get refs again. The recent `cc6ec97a feat: multiple modes browser snapshot tool` refactor renamed NAMED_CONTENT_ROLES → CONTENT_ROLES and accidentally dropped `img`, breaking `test_named_content_roles_get_refs`. - Drop the `navigation` assertion from `test_skips_structural_roles`. That same refactor intentionally added landmark roles (navigation, main, listitem) to CONTENT_ROLES so AI agents can ref them by name, and the test was not updated to reflect that. test_model_catalog: - Add 5 openrouter models that were added to model_catalog.json by #7081 (UI/UX improvements) but not reflected in the test. Refs #7083 * fix(ci): wait for event propagation in subagent report test on Windows `test_worker_report_emits_subagent_report_event` waited only for `worker.is_active` to flip to False, then immediately asserted on the collected events. On Windows the event loop scheduling differs enough that the SUBAGENT_REPORT subscriber callback can run a few ticks after the worker is marked inactive, so the assertion fires against an empty list. Wait for both conditions. Refs #7083	2026-04-18 19:09:15 +08:00
Timothy	2fd7e9172a	fix: y-offset inspection	2026-04-17 19:24:41 -07:00
Timothy	dde4dfaec9	Merge branch 'feature/colony-sqlite' into feature/clean-context	2026-04-17 04:12:35 -07:00
Richard Tang	d788e5b2f7	chore: ruff lint	2026-04-16 23:33:48 -07:00
Richard Tang	583a5b41b4	fix: ununsed reference	2026-04-16 23:23:38 -07:00
Richard Tang	83cc44bdef	Merge branch 'feature/full-image-size'	2026-04-16 23:15:59 -07:00
Timothy	558813e7fa	feat: fraction-based visual clicks	2026-04-16 22:36:41 -07:00
Timothy	aba0ff07ba	fix: model invariant screenshot	2026-04-16 20:29:05 -07:00
Timothy	4303a36df0	fix: namespaced browser tab groups	2026-04-16 20:07:05 -07:00
Richard Tang	c6b6a5a2f7	feat: GCP skills and prompts improvements	2026-04-16 17:43:52 -07:00
Richard Tang	18f5f078fc	feat: dashed highlighter for browser type focus	2026-04-16 17:26:09 -07:00
Richard Tang	cc6ec97a75	feat: multiple modes browser snapshot tool	2026-04-16 17:22:44 -07:00
Richard Tang	44d114f0d0	feat: default 1ms delay and prompt improvements	2026-04-16 16:19:38 -07:00
Richard Tang	9e71f16d15	Merge remote-tracking branch 'origin/fix/browser-behaviour-improvements' into fix/browser-behaviour-improvements	2026-04-16 16:14:43 -07:00
Richard Tang	28cad2376c	feat: separate type focus tool	2026-04-16 16:08:43 -07:00
Timothy	8222cd306e	fix: simplify canonical workflow	2026-04-16 16:02:37 -07:00
Timothy	b50f237506	fix: screenshot skill diction	2026-04-16 15:16:22 -07:00
Richard Tang	916803889f	feat: browswer control tools improvement and debugger	2026-04-16 15:14:08 -07:00
Richard Tang	810cf5a6d3	Merge remote-tracking branch 'origin/main' into feature/colony-sqlite	2026-04-16 11:10:34 -07:00
Hundao	9051c443fb	fix(tests): resolve Windows CI failures (#7061 ) - test_background_job: use sys.executable and double quotes instead of single-quoted 'python -c' which Windows cmd.exe doesn't understand - test_cli_entry_point: guard against None stdout on Windows with (result.stdout or "").lower() - test_safe_eval: bump DEFAULT_TIMEOUT_MS from 100 to 500 to accommodate slow Windows CI runners where SIGALRM is unavailable	2026-04-16 21:05:09 +08:00
Hundao	e5a93b059f	fix(tests): resolve test failures across framework and tools (#7059 ) * fix(tests): resolve test failures across framework and tools Framework tests (52 -> 1 failure): - Add missing `model` attribute to mock LLM classes (MockStreamingLLM, CrashingLLM, ErrorThenSuccessLLM, etc.) to match new agent_loop.py requirement at line 624 - Update skill count assertions from 6 to 7 (new writing-hive-skills) - Fix phase compaction test to match new message format (no brackets) - Update model catalog test for current gemini model names - Fix queen memory test: set phase="building" to match prompt_building, adjust reflection trigger count to match cooldown behavior Tools tests (52 -> 0 failures): - Update csv_tool tests: remove agent_id parameter, use absolute paths, patch _ALLOWED_ROOTS instead of AGENT_SANDBOXES_DIR - Fix browser_evaluate test to allow toast wrapper around script Remaining: 1 pre-existing failure in test_worker_report where mock LLM gets stuck when scenarios are exhausted (separate bug). * fix(tests): resolve remaining test failures - Add text stop scenario to test_worker_report so worker terminates cleanly after tool_calls finish instead of replaying the last scenario forever - Remove duplicated hive home isolation fixture from test_colony_fork_live; reuse conftest autouse fixture and only add config copy on top * fix(tests): prevent mock LLM infinite loops on exhausted scenarios fix(core): accept both pruned tool result sentinel formats MockStreamingLLM and _ByTaskMockLLM replay the last scenario forever when call_index exceeds the scenario list, causing worker timeouts in CI. Fix by emitting a text stop when scenarios are exhausted (scenarios mode) or already consumed (by_task mode). Also fix pruned tool result sentinel mismatch: conversation.py produces "Pruned tool result ..." but compaction.py and conversation.py only checked for "[Pruned tool result". Now both formats are accepted. Also remove duplicated hive home isolation fixture from test_colony_fork_live; reuse conftest autouse fixture instead.	2026-04-16 20:13:43 +08:00
Hundao	589c5b06fe	fix: resolve all ruff lint and format errors across codebase (#7058 ) - Auto-fixed 70 lint errors (import sorting, aliased errors, datetime.UTC) - Fixed 85 remaining errors manually: - E501: wrapped long lines in queen_profiles, catalog, routes_credentials - F821: added missing TYPE_CHECKING imports for AgentHost, ToolRegistry, HookContext, HookResult; added runtime imports where needed - F811: removed duplicate method definitions in queen_lifecycle_tools - F841/B007: removed unused variables in discovery.py - W291: removed trailing whitespace in queen nodes - E402: moved import to top of queen_memory_v2.py - Fixed AgentRuntime -> AgentHost in example template type annotations - Reformatted 343 files with ruff format	2026-04-16 19:30:01 +08:00
Timothy	45df68c146	feat: ensure sqlite3 installation	2026-04-15 18:34:33 -07:00
Timothy	252710fb41	fix: context health and eviction	2026-04-15 11:40:45 -07:00
Richard Tang	edc3135797	Merge branch 'feature/new-colony'	2026-04-14 19:56:08 -07:00
Hundao	2f58cce781	fix(tools): web_scrape truncation no longer exceeds max_length (#7044 ) The previous code did `text[:max_length] + "..."`, which made the returned content always 3 chars longer than the requested max_length. Reserve room for the ellipsis inside the limit so the contract holds. Fixes #2098	2026-04-14 14:24:42 +08:00
Timothy	fd3ef36a15	fix: side panel	2026-04-13 21:08:11 -07:00
Timothy	846f3f2470	feat: improve tool call reliability	2026-04-13 19:34:47 -07:00
Timothy	eeb46a2b3e	fix: tool credential filter	2026-04-11 12:54:26 -07:00
Timothy	b5e05fefae	fix: screenshot	2026-04-11 09:53:53 -07:00
Timothy	bdfbb7698a	fix: browser click	2026-04-10 23:34:39 -07:00
Timothy	70d90fda19	fix: screenshot	2026-04-10 21:11:49 -07:00
Richard Tang	8ea3fb8cfe	chore: align the hive tool names	2026-04-10 16:38:21 -07:00
Richard Tang	e0f1e9d494	feat: efficient mcp loading in initialization	2026-04-10 16:23:36 -07:00
Richard Tang	7fb0da26fc	feat: register available MCP tools	2026-04-10 16:01:42 -07:00
Timothy	0964758b12	Merge branch 'feature/colony-orchestrate' into feature/hive-experimental-comp-pipeline	2026-04-10 15:48:02 -07:00
Richard Tang	d96875932a	fix: correct aden support tag	2026-04-10 12:03:39 -07:00
Richard Tang	238d90871a	feat: stable credential states	2026-04-10 11:33:34 -07:00
Timothy	da0aa65c31	refactor: big test cleanup	2026-04-09 22:04:23 -07:00
Timothy	cbf7cc0a37	feat(agent): simple fork	2026-04-09 20:42:28 -07:00
Richard Tang	9ad95fde59	chore: ruff lint	2026-04-09 18:22:16 -07:00
Bryan	c058029ac0	feat: add aden credentials storage adapter	2026-04-09 16:59:16 -07:00
Timothy	df43f36385	fix: issues	2026-04-09 12:59:42 -07:00
Timothy	dee3980dbe	fix: browser, csv tools	2026-04-08 16:32:26 -07:00

1 2 3 4 5 ...

935 Commits