Overview
goalseek is a local-first Python package for running disciplined, git-backed research loops with coding-agent providers. It gives you a repeatable way to let an agent propose a change, apply it, verify it, and keep or revert it based on mechanical metrics.

What it does
- Scaffolds isolated research projects with manifests, logs, and run artifacts.
- Drives a loop of planning, implementation, verification, and decision-making.
- Stores every baseline and iteration under
runs/andlogs/. - Exposes both a CLI and a Python API for automation and local workflows.
- Supports
codex,claude_code,opencode,gemini, and afakeprovider for tests.
Core philosophy
The package is built around a simple rule set:
- read project context before write operations
- make one focused change per iteration
- judge outcomes with objective verification
- preserve history with real git commits and explicit reverts
That combination is what makes the loop inspectable. The system is not trying to hide its reasoning behind a service boundary. It leaves plans, prompts, metrics, diffs, and result records on disk where you can inspect them.
Typical workflow
- Initialize a project scaffold.
- Add research assets and validate the manifest.
- Run setup once to prepare the workspace.
- Run a baseline to capture the retained metric.
- Run one or more iterations and inspect the resulting artifacts.
Start here
- Use Quickstart to get a project running.
- Use System Architecture to understand the loop internals.
- Use Kaggle Demo for the end-to-end example included in this repo.