Deep Learning Compiler and Optimizer

Project Overview

This project aims to build a deep learning compiler and optimizer infrastructure that automatically optimizes scalability and efficiency for both distributed and local execution. The stack covers two general types of optimization: fast distributed training across large-scale servers and efficient local execution on various hardware devices. Our current optimizations span many parts of the system stack, including fast distributed training over RDMA, automatic computation placement across devices, automatic operator batching and kernel fusion, a tensor algebra compiler, and sparsity and quantization optimizations.

Open-source Release

Some of our projects have been open-sourced. You are welcome to try them, contribute, and collaborate with us.

Job Opportunity

People

Lingxiao Ma

Senior Researcher

Youshan Miao

Senior Researcher

Wenxiang Hu

Senior RSDE

Wei Cui

Principal Researcher

Fan Yang

Sr. Principal Research Manager

Lidong Zhou

Corporate Vice President, Chief Scientist of Microsoft Asia Pacific R&D Group, Managing Director of Microsoft Research Asia