<a href="https://allenai.org/" rel="nofollow">Ai2</a> is releasing OLMo 2, a family of open-source language models that advances the democratisation of AI and narrows the gap between open and proprietary solutions.
The new models, available in 7B and 13B parameter versions, are trained on up to 5 trillion tokens and demonstrate performance levels that match or exceed comparable fully open models whilst remaining competitive with open-weight models such as Llama 3.1 on English academic benchmarks.
“Since the release of the first OLMo in February 2024, we’ve seen rapid growth in the open language model ecosystem, and a narrowing of the performance gap between open and proprietary models,” explained Ai2.
The development team achieved these improvements through several innovations, including enhanced training stability measures, staged training approaches, and state-of-the-art post-training methodologies derived from their <a href="https://arxiv.org/abs/2411.15124" rel="nofollow">Tülu 3</a> framework. Notable technical improvements include the switch from nonparametric layer norm to RMSNorm and the implementation of rotary positional embedding.
<h3>OLMo 2 model training breakthrough</h3>
The training process employed a sophisticated two-stage approach. The initial stage utilised the OLMo-Mix-1124 dataset of approximately 3.9 trillion tokens, sourced from DCLM, Dolma, Starcoder, and Proof Pile II. The second stage incorporated a carefully curated mixture of high-quality web data and domain-specific content through the Dolmino-Mix-1124 dataset.
Particularly noteworthy is the OLMo 2-Instruct-13B variant, which is the most capable model in the series.
The model demonstrates superior performance compared to Qwen 2.5 14B instruct, Tülu 3 8B, and Llama 3.1 8B instruct models across various benchmarks.
<figure><img src="https://static.fwimg.io/img/feed/5b1fe92b7d2f1b8b21ecc8ffb9ebfb47.jpg" alt="Benchmarks comparing the OLMo 2 open large language model to other models such as Mistral, Qwen, Llama, Gemma, and more."><figcaption>(Credit: Ai2)</figcaption></figure>
<h3>Committing to open science</h3>
Reinforcing its commitment to open science, Ai2 has released comprehensive documentation including weights, data, code, recipes, intermediate checkpoints, and instruction-tuned models. This transparency allows for full inspection and reproduction of results by the wider AI community.
The release also introduces an evaluation framework called OLMES (Open Language Modeling Evaluation System), comprising 20 benchmarks designed to assess core capabilities such as knowledge recall, commonsense reasoning, and mathematical reasoning.
OLMo 2 raises the bar in open-source AI development, potentially accelerating the pace of innovation in the field whilst maintaining transparency and accessibility.
(Photo by <a href="https://unsplash.com/@weareambitious?utm_content=creditCopyText&amp;utm_medium=referral&amp;utm_source=unsplash" rel="nofollow">Rick Barrett</a>)
The post <a href="https://www.artificialintelligence-news.com/news/ai2-olmo-2-raising-bar-open-language-models/" rel="nofollow">Ai2 OLMo 2: Raising the bar for open language models</a> appeared first on <a href="https://www.artificialintelligence-news.com/" rel="nofollow">AI News</a>.

Ai2 OLMo 2: Raising the bar for open language models

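The final month of 2024 has arrived, and with it altcoin season. Bitcoin has still not broken through the $100,000 milestone. Although attention has shifted toward altcoins, the meme market has not fizzled out, and many new projects still take off every day.
Followin will keep updating this article with each day's hottest meme coins, so you can stay on top of the latest meme developments.
December 3
- MOODENG (+90%)
- SHIRO (+56%)
- CHAO (+147000%)
MOODENG (+90%...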

Crazy meme season: which meme coins are rising today? (Continuously updated)

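This is the result of a fundraising campaign that Pantera Capital launched through two separate funds.

Pantera Capital wants to invest a further $20 million in the TON blockchain.

According to a...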

Pantera Capital "injects" another $20 million into the TON blockchain

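The greed index is close to its all-time peak, and technical indicators such as MVRV and ahr999 are approaching their November 2021 and March 2024 levels, while the remaining dimensions are still at mid-cycle.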