Can we make Claude Opus 4.5 even better? Anthropic has said that with Skills, this is possible. So I went and built a Skills library for @vibeshipco Spawner, but in a much more sophisticated format. It now beats regular Claude Opus 4.5 by a wide margin on certain skills.

Here is the backend error-handling skill, for instance, benchmarked and juried by 5 LLMs. The verdict:

- Vibeship Skills win by +34.9 points
- Regular Claude Opus 4.5 = 59.5/100 avg
- Vibeship Skill Claude Opus 4.5 = 94.4/100 avg

I'm working on improving them all step by step. People are already amazed by Opus 4.5, but I think we can take it even further. I'll share more benchmark findings in this thread.

From Twitter



