avatar
JG
03-29
本文為機器翻譯
展示原文

1. 如果大規模訓練能夠帶來顯著的性能提升,那麼伊利亞之前關於“我們正在從規模化配方(預訓練)轉向規模化思維過程(推理)”的預測可能就錯了。 2. 其他實驗室複製該模型的成本意味著,目前可能是大多數公眾能夠接觸到最先進模型的最佳時機。從現在開始,成本將呈K曲線式增長,屆時普通消費者將使用開源級別的模型。

Andrew Curran
@AndrewCurran_
03-29
Three weeks ago there were rumors that one of the labs had completed its largest ever successful training run, and that the model that emerged from it performed far above both internal expectations and what people assumed the scaling laws would predict. At the time these were
相关赛道:
來自推特
免責聲明:以上內容僅為作者觀點,不代表Followin的任何立場,不構成與Followin相關的任何投資建議。
喜歡
收藏
評論