Speaker James McCaffrey
Affiliation MSR Redmond
Host Carlos Garcia Jurado Suarez
Date recorded 16 October 2013
Data clustering is process of grouping data items together so that similar items belong to the same group. Although data clustering has been studied by researchers for decades, there is relatively little practical information available that describes how to actually implement clustering algorithms. This talk explains data clustering from a developer's point of view with an emphasis on how to code clustering methods using the C# programming language. Topics covered include: clustering numeric data, clustering categorical data, key data structures, and determining the optimal number of clusters. Complete C# clustering source code will be presented that can be used as-is, or modified to meet special clustering scenarios.
©2013 Microsoft Corporation. All rights reserved.