agent-runtime

Archived

This repository has been archived on 2026-04-03. You can view files and clone it, but cannot push or open issues or pull requests.

History

Nico 4e679a3ad9 Add model matrix test suite: 3 tests × 3 variants = 9 combos

New 'matrix' suite runs same API tests with different LLM model configs:
- Variants: gemini-flash (baseline), haiku, gpt-4o-mini
- Tests: eras_query (SQL correctness), eras_artifact (data output), social_reflex (fast path)
- Posts results as test_name[variant] to /tests dashboard
- All 9 combos passing (6/9 verified locally, ~35s for ERAS tests)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-03 18:12:24 +02:00

run_tests.py

Add model matrix test suite: 3 tests × 3 variants = 9 combos

2026-04-03 18:12:24 +02:00

test_engine.py

Wire model overrides end-to-end: API → runtime → frame engine

2026-04-03 18:07:42 +02:00

test_matrix.py

Add model matrix test suite: 3 tests × 3 variants = 9 combos

2026-04-03 18:12:24 +02:00