Microsoft open sources algorithm that gives Bing some of its smarts
以下为 快照 页面，建议前往来源网站查看，会有更好的阅读体验。
Search engines today are more than just the dumb keyword matchers they used to be. You can ask a question—say, "How tall is the tower in Paris?"—and they'll tell you that the Eiffel Tower is 324 metres (1,063 feet) tall, about the same as an 81-story building. They can do this even though the question never actually names the tower.
The way this is done is, as with everything else these days, machine learning. Machine-learning algorithms are used to build vectors—essentially, long lists of numbers—that in some sense represent their input data, whether it be text on a Web page, images, sound, or videos. Bing captures billions of these vectors, for all the different kinds of media that it indexes. To search the vectors, Microsoft uses an algorithm it calls SPTAG ("Space Partition Tree and Graph"). An input query is converted into a vector, and SPTAG is used to quickly find "approximate nearest neighbors" (ANN), which is to say, vectors that are similar to the input.
This (with some amount of hand-waving) is how the Eiffel Tower question can be answered: a search for "How tall is the tower in Paris?" will be "near" to pages talking about towers, Paris, and how tall things are. Such pages are almost surely going to be about the Eiffel Tower.
Microsoft has released today the SPTAG algorithm as MIT-licensed open source on GitHub. This code is proven and production-grade, used to answer questions in Bing. Developers can use this algorithm to search their own sets of vectors and do so quickly: a single machine can handle 250 million vectors and answer 1,000 queries per second. There's some samples and explanations in Microsoft's AI Lab , and Azure will have a service using the same algorithms.
Microsoft CEO Satya Nadella has spoken of a number of occasions of his desire to "Democratize AI" and make it available to everyone; not just a centralized, specialized tool that demands considerable expertise, but something that a wide range of developers, solving a wide range of problems, can use as part of their toolkit. The release of SPTAG is an example of how Microsoft is putting those words into practice; the combination of an Azure service and open source means that developers can start with the more constrained, easy-to-use service, and as their expertise or requirements grow more complex, they can use SPTAG to build their own services.
Yes, WPF has been around for a while As part of today’s Connect(); virtual event for developers, Microsoft announced t...
Project Mu aims to make it easier for developers to quickly create and frequently update the firmware of their devices, enabling Project Mu devices to adopt a Firmware as...
At its Build 2019 conference, Microsoft has announced it will open-source parts of its Quantu...
Microsoft Research open-sourced TensorWatch, their debugging tool for AI and deep-learning. TensorWatch supports PyTorch as wel...
SandDance, the beloved data visualization tool from Microsoft Research, has been re-released as an open source project on GitHub . This new version of SandDance has be...
cups - Official CUPS Sources
OnePlus is releasing the kernel sources and the device trees for the OnePlus 5T today. This paves the way for custom AOSP-based ROMs such as LineageOS.
Despite releasing the Mi A1 as the company's first Android One device two months ago, Xiaomi has yet to release kernel sources for it, breaking the GPLv2.