Shrinking the frontier: model size at constant capability
Much of the conversation around AI progress focuses on the top of the leaderboard — which model is best today. […]
Much of the conversation around AI progress focuses on the top of the leaderboard — which model is best today. […]
Efficiency, not scale, is now the main battleground in AI. In the last few weeks Google has advanced on two
Moonshot AI, a Chinese AI Lab backed by Ali Baba and Tencent, has just released a breakthrough open-source model: Kimi
When AI’s interest conflicts with ours Building on our previous discussions of robot constitutions and the International AI Safety Report,
Isaac Asimov, the science fiction writer, devised decades ago the Three Laws of Robotics to ensure robots would align with
We might be on the verge of having even more efficient state of the art large language models. Autoregressive transformer
Alibaba announced today the release of QwQ-32B, a compact reasoning model which delivers remarkable performance for its size. The Alibaba
Large Language Models (LLMs) can be improved by using more time and compute before coming up with a better answer
Google has provided broad access to their latest generation TPU servers, Trillium, their sixth generation servers, which is another competitive