
carmelosantana/coqui-harbor-external
====================================

Harbor benchmarking toolkit for Coqui — task management, eval execution, and result analysis via the Harbor CLI

v0.1.0 (released 1mo ago) · MIT · PHP ^8.4 · CI passing · Since Apr 9

[Source](https://github.com/carmelosantana/coqui-harbor-external) · [Packagist](https://packagist.org/packages/carmelosantana/coqui-harbor-external)

Coqui Harbor External
=====================

Harbor benchmarking toolkit for [Coqui](https://github.com/carmelosantana/coqui). Manage tasks, run evaluations, and analyze benchmark results via the [Harbor](https://harborframework.com) CLI.

Requirements
------------

- PHP 8.4+
- [Harbor CLI](https://harborframework.com/docs/getting-started) (`uv tool install harbor`)
- Docker (for local evaluations)
- [Coqui](https://github.com/carmelosantana/coqui)
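
As a quick sanity check for the requirements above, the following minimal sketch reports which prerequisite executables cannot be found on `PATH`. The executable names (`php`, `uv`, `harbor`, `docker`, `coqui`) are assumptions inferred from the commands shown in this README, and the helper `missing_binaries` is hypothetical. It is not part of the package and does not verify versions such as PHP 8.4+.

```python
import shutil
from typing import Callable, Optional, Sequence

# Assumed executable names, inferred from commands shown in this README.
REQUIRED_BINARIES = ("php", "uv", "harbor", "docker", "coqui")


def missing_binaries(
    names: Sequence[str] = REQUIRED_BINARIES,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the names that cannot be resolved on PATH, in input order."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    missing = missing_binaries()
    if missing:
        print("Missing from PATH: " + ", ".join(missing))
    else:
        print("All prerequisites found on PATH.")
```

The `which` parameter exists only so the lookup can be swapped out in tests. Once the package is installed, the `harbor_check` tool listed under Tools Provided covers a similar check.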

Installation
------------

```
composer require carmelosantana/coqui-harbor-external
```

The toolkit is auto-discovered by Coqui — no code changes needed.

Tools Provided
--------------

### Discovery & Validation

| Tool | Description |
| --- | --- |
| `harbor_check` | Verify Harbor CLI, Python, Docker, and uv are installed |
| `harbor_task_validate` | Validate a task directory has the required structure |
| `harbor_dataset_list` | List registered datasets from the Harbor registry |

### Task Authoring

| Tool | Description |
| --- | --- |
| `harbor_task_init` | Scaffold a new task directory (instruction.md, task.toml, environment/, tests/) |
| `harbor_task_list` | List all tasks in a local dataset directory |
| `harbor_task_delete` | Delete a task directory *(gated; requires confirmation)* |

### Execution

| Tool | Description |
| --- | --- |
| `harbor_run` | Run a Harbor evaluation against a dataset or task path *(gated)* |
| `harbor_run_status` | Check job progress (trial completion, overall status) |
| `harbor_view` | Launch Harbor's web-based results viewer |

### Analysis

| Tool | Description |
| --- | --- |
| `harbor_results` | Parse job results: pass/fail, reward distribution, durations |
| `harbor_trial_inspect` | Inspect a trial's trajectory, verifier logs, and reward |
| `harbor_compare` | Compare two or more jobs for regression detection |
| `harbor_failures` | Extract failed trials with root cause details |
| `harbor_cleanup` | Delete old job directories *(gated)* |

Python Agent Wrapper
--------------------

The package includes a Python external agent that bridges Harbor's evaluation framework with Coqui's CLI. This allows Harbor to drive Coqui as the agent under test.

### Setup

```
cd agent
uv pip install -e .
```

### Usage

```
harbor run \
  -p ./my-tasks \
  --agent-import-path coqui_harbor_agent.agent:CoquiExternalAgent \
  -m anthropic/claude-sonnet-4-20250514
```
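
For scripted runs, the command above could be assembled from Python. The sketch below is illustrative only: it reuses the exact flags from the shell example and layers `COQUI_*` overrides (documented under Configuration) onto the process environment. The helper names `build_harbor_run`, `build_env`, and `launch` are hypothetical and not part of this package, and rejecting non-`COQUI_` keys is a design choice of the sketch rather than documented behavior.

```python
import os
import subprocess
from typing import Mapping, Optional


def build_harbor_run(tasks_path: str, model: str) -> list[str]:
    """Assemble the same argv as the shell example in this README."""
    return [
        "harbor", "run",
        "-p", tasks_path,
        "--agent-import-path", "coqui_harbor_agent.agent:CoquiExternalAgent",
        "-m", model,
    ]


def build_env(
    overrides: Mapping[str, str],
    base: Optional[Mapping[str, str]] = None,
) -> dict[str, str]:
    """Copy the base environment and layer COQUI_* overrides on top."""
    env = dict(os.environ if base is None else base)
    for key, value in overrides.items():
        # Guard against typos: this sketch only accepts COQUI_* settings.
        if not key.startswith("COQUI_"):
            raise ValueError(f"unexpected variable: {key}")
        env[key] = value
    return env


def launch(cmd: list[str], env: Mapping[str, str]) -> subprocess.CompletedProcess:
    """Run the command; raises CalledProcessError on a non-zero exit."""
    return subprocess.run(cmd, env=dict(env), check=True)


if __name__ == "__main__":
    cmd = build_harbor_run("./my-tasks", "anthropic/claude-sonnet-4-20250514")
    env = build_env({"COQUI_TIMEOUT": "900"})
    print(" ".join(cmd))
    # launch(cmd, env)  # requires Harbor, Docker, and Coqui to be installed
```

Passing the arguments as a list avoids shell quoting issues when the task path contains spaces.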

### Configuration

| Environment Variable | Default | Description |
| --- | --- | --- |
| `COQUI_BIN` | `coqui` | Path to the Coqui binary |
| `COQUI_TIMEOUT` | `600` | Max seconds per task |
| `COQUI_MAX_ITERATIONS` | `100` | Agent iteration limit |
| `COQUI_MODEL` | *(from Harbor -m)* | Model override |
| `COQUI_ROLE` | `coder` | Agent role |
| `COQUI_AUTO_APPROVE` | `true` | Auto-approve tool calls |
| `COQUI_EXTRA_ARGS` | | Additional CLI arguments |

Bundled Skill
-------------

The `harbor-benchmarking` skill provides an operational SOP for running benchmark campaigns — including task creation, evaluation execution, failure triage, regression detection, and reporting. It is auto-discovered when the package is installed.

Bundled Loop
------------

The `benchmark` loop definition automates a full benchmark cycle:

1. **Plan** — validate tasks, define success criteria, create plan artifact
2. **Coder** — execute benchmark runs, analyze results, create report artifact
3. **Reviewer** — verify completeness, check for regressions, approve or request changes

The loop terminates when the reviewer responds with `APPROVED`.
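
The control flow of the three roles can be pictured with a small sketch. This is not the package's loop definition: `run_benchmark_loop` is a hypothetical function, and the roles are plain callables standing in for the agents. Several details are assumptions that the README does not specify: planning happens once, reviewer feedback is passed back to the coder, approval is an exact match on `APPROVED`, and a `max_rounds` cap guards against an endless loop.

```python
from typing import Callable


def run_benchmark_loop(
    plan: Callable[[], str],
    coder: Callable[[str, str], str],
    reviewer: Callable[[str], str],
    max_rounds: int = 5,  # assumed safety cap, not from the README
) -> tuple[int, str]:
    """Plan once, then alternate coder and reviewer until APPROVED."""
    plan_artifact = plan()
    feedback = ""
    for round_number in range(1, max_rounds + 1):
        # Coder executes the runs and produces a report artifact.
        report = coder(plan_artifact, feedback)
        # Reviewer either approves or returns change requests.
        verdict = reviewer(report)
        if verdict.strip() == "APPROVED":
            return round_number, report
        feedback = verdict
    raise RuntimeError(f"not approved after {max_rounds} rounds")


if __name__ == "__main__":
    verdicts = iter(["add a regression comparison", "APPROVED"])
    rounds, final_report = run_benchmark_loop(
        plan=lambda: "plan",
        coder=lambda p, f: f"report for {p} (feedback: {f or 'none'})",
        reviewer=lambda r: next(verdicts),
    )
    print(rounds, final_report)
```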

Development
-----------

```
composer install
composer test      # Run Pest tests
composer analyse   # Run PHPStan (level 8)
```

License
-------

MIT


Maintainer: [Carmelo Santana](https://github.com/carmelosantana)

