Microsoft's new BitNet b1.58 2B4T, a 2-billion-parameter model, outperforms several rivals of similar size on key benchmarks while using considerably less memory.
Researchers from Microsoft have announced the largest 1-bit AI model, or "bitnet", developed to date.
The model, named BitNet b1.58 2B4T, was released under the MIT license and can run on standard CPUs, including Apple's M2 chip.
An Efficient Revolution for Lightweight AI
Bitnets are compressed AI models that can run on lightweight hardware. Whereas standard models typically represent each weight with 16 or 32 bits, bitnets quantize weights down to just three values: -1, 0, and 1 (about 1.58 bits per weight, hence the model's name). This lets bitnets use far less memory and compute than most current models.
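As a concrete illustration, here is a minimal sketch of the "absmean" ternary quantization scheme described in the BitNet b1.58 paper; the function name and the toy weight matrix are our own, and real implementations differ in layout and grouping details.

```python
import numpy as np

def absmean_quantize(W: np.ndarray, eps: float = 1e-8):
    """Quantize a float weight matrix to the ternary values {-1, 0, 1}.

    Absmean scheme: scale by the mean absolute weight, then round and
    clip to [-1, 1]. Returns the ternary matrix plus the scale needed
    to approximate the original weights.
    """
    scale = np.abs(W).mean() + eps  # gamma: mean absolute value of W
    W_ternary = np.clip(np.round(W / scale), -1, 1).astype(np.int8)
    return W_ternary, scale

# Toy example with a random 4x4 weight matrix (illustrative values only)
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 4)).astype(np.float32)

W_q, scale = absmean_quantize(W)
print(W_q)     # every entry is -1, 0, or 1
print(scale)   # a single float scale for the whole matrix

# Dequantized approximation of the original weights
W_approx = W_q.astype(np.float32) * scale
```

The memory arithmetic follows directly: at roughly 1.58 bits per weight instead of 16, the weights of a 2-billion-parameter model shrink from about 4 GB to well under 0.5 GB, which is why bitnets can fit comfortably on ordinary CPUs.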
BitNet b1.58 2B4T is Microsoft's first bitnet with 2 billion parameters. It was trained on a massive dataset of 4 trillion tokens – equivalent to approximately 33 million books. According to Microsoft's announcement, it outperforms traditional models of similar size.
In performance tests, BitNet b1.58 2B4T surpassed Meta's Llama 3.2 1B, Google's Gemma 3 1B, and Alibaba's Qwen 2.5 1.5B on several important benchmarks, including GSM8K (a set of grade-school math word problems) and PIQA (a test of physical commonsense reasoning).
BitNet b1.58 2B4T's speed is particularly impressive: in some cases it runs twice as fast as models of the same size while using only a fraction of the memory its competitors need.
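Part of that speed advantage comes from the arithmetic itself: with weights restricted to -1, 0, and 1, a matrix-vector product needs no multiplications on the weight side, only additions and subtractions. The sketch below is our own illustration of that idea, not Microsoft's kernel code, which relies on packed low-bit weight layouts and vectorized CPU instructions.

```python
import numpy as np

def ternary_matvec(W_q: np.ndarray, x: np.ndarray, scale: float) -> np.ndarray:
    """Matrix-vector product with ternary weights and no weight-side
    multiplications: +1 adds the input element, -1 subtracts it, 0 skips it."""
    out = np.zeros(W_q.shape[0], dtype=x.dtype)
    for i, row in enumerate(W_q):
        out[i] = x[row == 1].sum() - x[row == -1].sum()
    return out * scale  # one multiply per output for the shared scale

# Sanity check against the equivalent float computation
rng = np.random.default_rng(1)
W_q = rng.integers(-1, 2, size=(8, 16)).astype(np.int8)  # entries in {-1, 0, 1}
x = rng.normal(size=16).astype(np.float32)
scale = 0.07  # illustrative per-matrix scale

reference = (W_q.astype(np.float32) * scale) @ x
assert np.allclose(ternary_matvec(W_q, x, scale), reference, atol=1e-5)
```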
However, achieving that performance requires Microsoft's bitnet.cpp inference framework, which currently supports only certain hardware. Notably, that list does not include GPUs, the chip type that dominates today's AI infrastructure.
Bitnets look very promising for devices with limited resources, but hardware compatibility remains a significant barrier and is likely to stay one in the near term.