

A Machine Learning Toolkit using DryadLINQ

Overview

[Figure: software stack]

We aim to build a library of machine learning algorithms, together with a toolkit for writing new ones. The toolkit is implemented on the DryadLINQ distributed computation framework, which lets the algorithms process very large amounts of data on computer clusters.

The Large Vector library provides a very simple API, similar to map-reduce, for processing large distributed collections of numeric data.
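To illustrate what a map-reduce style interface over partitioned numeric data can look like, here is a minimal single-machine sketch in Python. It is not the Large Vector API: the class name PartitionedVector and its map and reduce methods are hypothetical, and each inner list only stands in for a partition that a real cluster would hold on a separate node.

```python
from functools import reduce


class PartitionedVector:
    """Hypothetical stand-in for a large distributed collection of numbers.

    Each inner list plays the role of one partition. This is an
    illustration only, not the Large Vector library's actual interface.
    """

    def __init__(self, partitions):
        self.partitions = [list(p) for p in partitions]

    def map(self, f):
        # Apply f to every element; partitions are processed independently,
        # which is what makes this step easy to run in parallel.
        return PartitionedVector([[f(x) for x in p] for p in self.partitions])

    def reduce(self, combine, zero):
        # Reduce each partition locally, then combine the partial results.
        # combine is assumed to be associative, with zero as its identity.
        partials = [reduce(combine, p, zero) for p in self.partitions]
        return reduce(combine, partials, zero)


def add(a, b):
    return a + b


# Three partitions holding five numbers in total.
data = PartitionedVector([[1.0, 2.0], [3.0], [4.0, 5.0]])

count = data.map(lambda x: 1).reduce(add, 0)
total = data.reduce(add, 0.0)
sum_of_squares = data.map(lambda x: x * x).reduce(add, 0.0)
mean = total / count
```

Many learning algorithms reduce to sums of this kind (counts, means, gradients), which is why a small map and reduce vocabulary can cover a wide range of them.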

Project Members

References

Slides from a recent talk (January 2008)

Cheng-Tao Chu, Sang Kyun Kim, Yi-An Lin, YuanYuan Yu, Gary Bradski, Andrew Y. Ng, and Kunle Olukotun. Map-Reduce for Machine Learning on Multicore. Neural Information Processing Systems (NIPS), December 3-6, 2007, Vancouver.

