The post Anthropic unveils new Claude AI models and ‘computer control’ appeared first on AI News.

<a href="https://www.anthropic.com/" rel="nofollow">Anthropic</a> has announced upgrades to its AI portfolio, including an enhanced Claude 3.5 Sonnet model and the introduction of Claude 3.5 Haiku, alongside a “computer control” feature in public beta.
The upgraded Claude 3.5 Sonnet demonstrates substantial improvements across all metrics, with particularly notable advances in coding capabilities. The model achieved an impressive 49.0% on the SWE-bench Verified benchmark, surpassing all publicly available models, including OpenAI’s offerings and specialist coding systems.
In a pioneering development, Anthropic has introduced computer use functionality that enables Claude to interact with computers similarly to humans: viewing screens, controlling cursors, clicking, and typing. This capability, currently in public beta, marks Claude 3.5 Sonnet as the first frontier AI model to offer such functionality.
Several major technology firms have already begun implementing these new capabilities.
“The upgraded Claude 3.5 Sonnet represents a significant leap for AI-powered coding,” reports GitLab, which noted up to 10% stronger reasoning across use cases without additional latency.
The new Claude 3.5 Haiku model, set for release later this month, matches the performance of the previous Claude 3 Opus whilst maintaining cost-effectiveness and speed. It notably achieved 40.6% on SWE-bench Verified, outperforming many competitive models including the original Claude 3.5 Sonnet and GPT-4o.
<figure><img src="https://static.fwimg.io/img/feed/29baaab8ae930c8c3868bd9da81a65c5.jpg" alt="Model benchmarks comparing new Claude AI models from Anthropic."><figcaption>(Credit: Anthropic)</figcaption></figure>
Regarding computer control capabilities, Anthropic has taken a measured approach, acknowledging current limitations whilst highlighting potential.
On the OSWorld benchmark, which evaluates computer interface navigation, Claude 3.5 Sonnet achieved 14.9% in screenshot-only tests, significantly outperforming the next-best system’s 7.8%.
The developments have undergone rigorous <a href="https://www.artificialintelligence-news.com/news/uk-and-us-sign-pact-develop-ai-safety-tests/" rel="nofollow">safety evaluations</a>, with pre-deployment testing conducted in partnership with both the US and UK AI Safety Institutes. Anthropic maintains that the ASL-2 Standard, as detailed in their Responsible Scaling Policy, remains appropriate for these models.
(Image Credit: Anthropic)
See also: <a href="https://www.artificialintelligence-news.com/news/ibm-granite-3-ai-models-open-source-commitment/" rel="nofollow">IBM unveils Granite 3.0 AI models with open-source commitment</a>
The post <a href="https://www.artificialintelligence-news.com/news/anthropic-new-claude-ai-models-and-computer-control/" rel="nofollow">Anthropic unveils new Claude AI models and ‘computer control’</a> appeared first on <a href="https://www.artificialintelligence-news.com/" rel="nofollow">AI News</a>.

Anthropic unveils new Claude AI models and ‘computer control’


One Polymarket trader made 61,793 trades in a single year and earned $106,000.

Tiny margins, rolled up into $100,000 in profit

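BlackRock, the world's largest asset manager, has disclosed that it will purchase UNI, the native token of the decentralized exchange Uniswap. The move not only shows traditional finance […]
The article "BlackRock announces purchase of Uniswap platform token UNI! $UNI jumps 23%" was first published on BlockTempo (动区动趋, "the most influential blockchain news media").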

BlackRock announces purchase of Uniswap platform token UNI! $UNI jumps 23%

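The US Bureau of Labor Statistics (BLS) is about to release its January nonfarm payrolls report, and markets expect the accompanying revision to erase about 1 million jobs.
Author: Zhao Ying (赵颖)
Source: Wallstreetcn (华尔街见闻)
The US Bureau of Labor Statistics (BLS) will release the delayed January nonfarm payrolls report tonight, together with its annual benchmark revision and methodology updates. Markets expect the revision to erase about 1 million jobs, which would make it one of the largest downward revisions in the history of US employment statistics.
According to the BLS's preliminary estimate, employment for the period from April 2024 to March 2025...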