Show HN: I built a way to see if your SDK is AI-friendly
A new open-source CLI tool has been developed to assess the compatibility of SDKs with Agentic AI systems. This tool allows users to create test suites and evaluate their SDKs using both test-taker and judge agents in a sandboxed environment. The results can help improve SDKs either manually or through automation.
- ▪The CLI is open-source and licensed under Apache 2.0.
- ▪It allows for the creation of test suites based on source code and documentation.
- ▪Test-taker agents only access public information, while judge agents can access both public and private data.
Opening excerpt (first ~120 words) tap to expand
Have you ever wonder if your SDKs is friendly for Agentic AI like Claude Code or Codex? I built an opensource (Apache 2.0) CLI that answer that question for you.With it you can create a test suite either manually or with an Agent based on the source code and documentation. The CLI will dispatch Agents with their own sandboxed microVMs to solve each test. Results then get graded by another Judge Agent.Test-taker agents only have access to public information (guides, blogs, package metadata), while Judge agents have access to both public and private information (source code, internal documents)After the test result are generated you can make improvement to your SDK manually, or use an Agent to automate the process.Agents are sandboxed, this means: - Host machine secrets (API keys) are not…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Ycombinator.