Prompter – Compare and benchmark Ollama models side-by-side in your terminal
Prompter is a terminal-based tool designed for evaluating and comparing multiple Ollama models simultaneously. It allows users to run the same prompt through different models and provides structured evaluation modes for in-depth analysis. Results are saved in markdown files, offering detailed statistics and insights into model performance.
- ▪Prompter enables side-by-side streaming of responses from multiple models in real-time.
- ▪It features four evaluation modes: Default, Ralph, Council, and Tribunal, each tailored for specific types of analysis.
- ▪Users can conduct a 20-test benchmark across models to assess their capabilities in various tasks.
Opening excerpt (first ~120 words) tap to expand
Multi-model Ollama comparison, benchmarking, and evaluation — in your terminal. Zero dependencies. One file. Standard library only. Quick Start wget https://raw.githubusercontent.com/whonixnetworks/prompter/main/prompter.py chmod +x prompter.py python3 prompter.py Python 3.7+. Ollama running locally. That is it. What is Prompter Prompter is a terminal-based multi-model evaluation tool for Ollama. Run the same prompt through multiple models simultaneously and watch responses stream in side by side. Go beyond simple comparison with structured evaluation modes: self-review loops, multi-model debate panels, and adversarial interrogation. Results are saved as collapsible markdown files with full stats — timing, token counts, tool call traces, and response text.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.