Whet
Blog

Blog

Notes on Whet, the benchmark, and whatever comes up. No fixed cadence.

  • Apr 17, 2026·8 min read
    new

    8 AIs, 50 prompts, 19 runs: the first leaderboard snapshot

    Jamba leads, reasoning doesn't help, Sonnet edges Opus, and everyone is better at Portuguese. What five days of benchmarking told us about the current cohort.

    read post
RSSPrivacy