End-to-end workflow

This guide wires Dr.Gero into a production LLM loop.

1. Environment

bash

source examples/env.sh

Edit examples/env.sh with your token and IDs.

bash

bash examples/create-leaderboard-push.sh

Capture the returned leaderboard ID and set:

bash

export PUSH_LEADERBOARD_ID="..."

bash

bash examples/push-dataset.sh

Send rows from your backend whenever you have an input/output pair, feedback signal, or trace event worth evaluating later.

Use the UI or API to add at least two candidate models. For OpenRouter:

bash

bash examples/add-openrouter-model.sh

bash

bash examples/run-leaderboard.sh

Wait for completion, then inspect the Ranking and Run Logs tabs.

Once a run completes and a winner is selected:

bash

bash examples/inference.sh

Store response headers such as X-Dr.Gero-Trace-Id in your application logs.

bash

bash examples/traces.sh

Use traces for debugging, observability, dataset improvement, or future fine-tuning.