
ORCAS is a click-based dataset associated with the TREC Deep Learning Track. It covers 1.4 million of the TREC DL documents, providing 18 million connections to 10 million distinct queries.
Tip-of-the-tongue (ToT) known-item retrieval is defined as “an item identification task in which the searcher has previously experienced an item but cannot recall a reliable identifier” (i.e., “It’s on the tip of my tongue…”). The…
The TREC Deep Learning Track studies information retrieval in a large training data regime. This is the case where the number of training queries with at least one positive label is at least in the…
GitHub Publication Publication Publication Publication Publication Publication Publication Publication Publication
LLM-Derived Knowledge Graphs GraphRAG (Graphs + Retrieval Augmented Generation) is a technique for richly understanding text datasets by combining text extraction, network analysis, and LLM prompting and summarization into a single end-to-end system. Open Source…
In this issue: New research helps COMET embrace African languages; FeatUp improves deep features, a computer vision research cornerstone; LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error; Benchmarking LLMs across languages and…
Microsoft researchers are taking a comprehensive and dynamic approach to help Copilot (web) continuously learn from interaction and feedback, improving the AI system and making it increasingly useful for consumers. Learn more.