Skills vs. MCP vs. prompts: which agent setup works best?
The article compares different agent setups for converting PDF pages into HTML. The step-by-step approach achieved the highest accuracy and pass rate among the tested methods. The findings highlight the importance of structured prompts and self-checking in improving performance.
- ▪The step-by-step setup achieved 95% accuracy and cleared all 10 pages with a structural fidelity score of 95%.
- ▪The baseline setup had an accuracy of 82% and cleared 9 out of 10 pages.
- ▪The study utilized the Agent Voyager Project to standardize and record the performance of various agent configurations.
Opening excerpt (first ~120 words) tap to expand
Skills vs. MCP vs. prompts: which agent setup works best?Welcome to the Captain's Log, where we break down the voyages our agents undertook each week. For this inaugural run, we set out to test how different agent setups (skills vs MCPs vs prompts) compare.The task: read a PDF page and rebuild it as a webpage (from ParseBench, a public LlamaIndex benchmark). Every setup uses the same model (claude-haiku-4-5), so any differences come from how the agent is set up, not the model itself.How we score it: each page gets a structural-fidelity score from 0 to 100% (column headers, row count, cell content, merged-cell topology, compared against the reference HTML).
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at AVP.