PANews reports on April 10th that, according to TechCrunch, Google has released a new AI model Gemini 2.5 Flash, specifically designed for efficient processing, high throughput, and low-cost scenarios. The model will soon be launched on Google's Vertex AI platform, supporting users in adjusting the balance between speed, accuracy, and cost, suitable for real-time tasks such as customer service and document parsing. As an "inference-type" model, Gemini 2.5 Flash can perform self-verification before answering. Google also plans to deploy the model to local environments in Q3 this year, achieving compliance through Google Distributed Cloud and Nvidia Blackwell systems.
Google launches Gemini 2.5 Flash model, focusing on high-efficiency and low-latency AI application scenarios
This article is machine translated
Show original
Source
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments
Share
Relevant content





