Return to Microsoft Research Lab – Redmond

Deep Learning Group

News & features

Articles

ECCV Workshop on “Computer Vision in the Wild”

September 8, 2022

Website: https://computer-vision-in-the-wild.github.io/eccv-2022/ (opens in new tab) Workshop: The research community has recently witnessed a trend in building transferable visual models that can effortlessly adapt to a wide range of downstream computer vision (CV) and multimodal (MM) tasks. We are organizing…

Diagram showing GODEL’s architecture. The environment of the dialog system consists of both structured and unstructured content, which it uses to retrieve information. This source content, which we term “grounding,” is updated and repeatedly used by GODEL to produce a new response after each user input.

Microsoft Research Blog

GODEL: Combining goal-oriented dialog with real-world conversations

June 23, 2022 | Baolin Peng, Michel Galley, Lars Liden, Chris Brockett, Zhou Yu, and Jianfeng Gao

They make restaurant recommendations, help us pay bills, and remind us of appointments. Many people have come to rely on virtual assistants and chatbots to perform a wide range of routine tasks. But what if a single dialog agent, the…

In the news | Analytics India

Interview with the team behind Microsoft’s µTransfer

March 23, 2022

Recently, researchers – Edward Hu, Greg Yang, Jianfeng Gao from Microsoft, introduced µ-Parametrization, which offers maximal feature learning even in infinite-width limit.

In the news | The Register

Microsoft, OpenAI method could make training large neural networks cheaper

March 14, 2022

Cost of tuning hyperparameters using μTransfer was 7% of what it would be to pre-train GPT-3. Companies scaling up their neural network models could cut expensive training costs by employing a technique developed by researchers at Microsoft and OpenAI.

In the news | TechRadar

Microsoft, OpenAI may have solved a fundamental AI bottleneck

March 9, 2022

Microsoft and Open AI have developed a new method for optimizing massive AI models that are too expensive to train multiple times, such as GPT-3. A blog post published by Microsoft Research describes a technique called µ-Parametrization (or µP), which…

Microsoft Research Blog

µTransfer: A technique for hyperparameter tuning of enormous neural networks

March 8, 2022 | Edward Hu, Greg Yang, and Jianfeng Gao

Great scientific achievements cannot be made by trial and error alone. Every launch in the space program is underpinned by centuries of fundamental research in aerodynamics, propulsion, and celestial bodies. In the same way, when it comes to building large-scale…

Microsoft Research Blog

SOLOIST: Pairing transfer learning and machine teaching to advance task bots at scale

June 16, 2021 | Baolin Peng, Chunyuan Li, Jinchao Li, Lars Liden, and Jianfeng Gao

The increasing use of personal assistants and messaging applications has spurred interest in building task-oriented dialog systems (or task bots) that can communicate with users through natural language to accomplish a wide range of tasks, such as restaurant booking, weather…

Microsoft Research Blog

HEXA: Self-supervised pretraining with hard examples improves visual representations

February 25, 2021 | Chunyuan Li, Lei Zhang, and Jianfeng Gao

Humans perceive the world through observing a large number of visual scenes around us and then effectively generalizing—in other words, interpreting and identifying scenes they haven’t encountered before—without heavily relying on labeled annotations for every single scene. One of the…

Microsoft Research Blog

VinVL: Advancing the state of the art for vision-language models

January 14, 2021 | Pengchuan Zhang, Lei Zhang, and Jianfeng Gao

Humans understand the world by perceiving and fusing information from multiple channels, such as images viewed by the eyes, voices heard by the ears, and other forms of sensory input. One of the core aspirations in AI is to develop…