*
Quick Links|Home|Worldwide
Microsoft*
Search for


Steven Greenberg

 

Steven Greenberg

Contact Information
Principal Strategist for the Web Audience group at AOL

Biography
Steve Greenberg works on web strategy for America Online, where he has also managed instant messaging and community products. Previously he managed the engineering and services groups for a startup in the wine industry, and was senior engineer at Epinions.com. He was a Principal Consultant for Netscape Communications where he founded and ran the Knowledge Management practice. After spending an unspecified number years doing unspecified things for unspecified government agencies, he returned to the world with an MS in Computer Information Systems from Boston University. Your bank probably still uses his code to process deposits.

Position Paper
I spent about half of my 20s working in Heidelberg, Germany, and fell completely in love with the place. One of my favorite uses for Flickr, in fact, is to leave a slideshow running of photos tagged with “Heidelberg”. Flickr shows me photos of smiling tourists, mountain vistas, a moss covered castle… and industrial printing presses. It turns out that “Heidelberg” is also the brand name of a company that makes the sort of presses used to produce newspapers and glossy magazines. Figuring out how to separate these contexts is one of the central hurdles facing systems that depend upon user classification.

Flickr has created value by gathering these pictures and helping me find them, but they are limited by the fact that uploaders have little incentive to provide more than the bare minimum context they require for their own use. An enterprise can hire librarians to build a structured taxonomy and force everyone to fully qualify their tags, but it is too much to expect that mass market consumers will learn and follow any set of rules.

This is not to say that I think user tagging is useless or unworkable in the mass market. Rather, I suggest that a change of approach can get us to a “good enough” state without asking an unrealistic amount of effort from either taggers or searchers. I propose that we turn the standard clustering approach on its head. Rather than grouping things that are similar, we should focus on separating what is dissimilar.

When the user assigns a tag, the system could ask one or two follow up questions based upon other tags related to the one just provided, but which are infrequently used together. It could also suggest common synonyms already in use. These actions can help the community converge upon a standard vocabulary while still allowing individuals to expand it when they see fit. Thus, the Flickr user who posts a photo simply tagged “Heidelberg” would be asked whether “castle” or “printing press” is more likely to describe the content. A learning system could identify sets of tags frequently and infrequently used together, relying upon a base taxonomy to provide suggestions in cases where there are not enough existing tag sets to make statistical predictions. By using word frequency, this approach can also help classify text documents.

The ambiguity of human language is what makes it such a flexible tool. That same ambiguity, however, demands that real communication be based upon either voluminous detail from the speaker or a feedback loop. Simulating a feedback loop with follow up questions gets us good enough results with the minimum of effort.

 

Back to Social Computing Symposium 2005

 


©2008 Microsoft Corporation. All rights reserved. Terms of Use |Trademarks |Privacy Statement