Gemma4 just dropped. How does it handle tool calls?
I ran ToolCall-15 across the full Gemma4 families.
Gemma4 31b = Qwen3.5 27b. Both perfect 15/15.
But here's what's wild:
Qwen3.5 9b already clears 13/15, Gemma4 needs 26b to match that.
Results and Comparison (Gemma4 & Qwen3.5)



From Twitter
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments
Share
Relevant content




