Microsoft builds a supercomputer for OpenAI for training massive AI models

published 19.05.2020 17:00

Microsoft trained a single language understanding model, called Turing NLR, and is reusing that same model, adapted for various scenarios, across multiple products, including Microsoft Bing, Word, SharePoint, and Outlook.

Microsoft also makes various accelerators and services available under its "Azure AI" banner to customers who don't need a dedicated supercomputer.

While the AI supercomputer Microsoft has built is exclusively for OpenAI, the company says it plans to make its large AI models and training optimization tools available through Azure AI services and GitHub.

Officials said at Build that they are going to begin open-sourcing the Microsoft Turing models "soon," as well as recipes for training them using Azure Machine Learning.

While Microsoft customers can't directly use the OpenAI supercomputer, they can use the company's upgraded Azure compute infrastructure, its open-sourced DeepSpeed software to train large-scale models, and its ONNX Runtime to deploy and run the models faster and more cheaply, officials said.