Share on Facebook Tweet on Twitter Share on LinkedIn Share by email

eScience in the Cloud 2014
April 29–30, 2014 | Redmond, WA, United States

Gabriel Antoniu

Dr. Gabriel Antoniu is a Senior Research Scientist at Inria. He leads the KerData research team at Inria’s Rennes - Bretagne Atlantique Research Center, which focuses on storage and I/O management for Big Data processing on scalable large-scale infrastructures, including clouds and HPC systems. He received his Ph.D. degree in Computer Science in 2001 from ENS Lyon. His research interests include: Big Data, scalable cloud storage, large-scale distributed data management and sharing, data consistency models and protocols, P2P systems. He has co-advised 13 PhD students and published over 100 papers in the aforementioned areas. He leads the MapReduce project funded by the French National Research Agency, in partnership with Argonne National Nab (USA), the University of Illinois at Urbana Champaign (USA) and IBM France. Within the Inria-Microsoft Joint Research Center, he led the A-Brain Inria-Microsoft Research project on optimized cloud storage for joint genetics and neuroimaging analysis and currently leads the follow-up Z-CloudFlow project. He serves as a Program Chair for the IEEE Cluster 2014 conference and regularly serves as a PC member of major conferences in the area of HPC, cloud computing, and Big Data (SC, HPDC, Big Data, CCGRID, etc.).

Victor Bahl

Victor Bahl is a Principal Researcher and founding Director of the Mobility & Networking Research (MNR) Group in Microsoft Research. MNR's mission is "to invent & research technologies that make Microsoft's networks, services and devices indispensable to the world". In addition to pursuing untethered research and shepherding brilliant researchers, Victor helps shape Microsoft's long-term vision related to networking technologies through associated policy engagement with governments and industries around the world. He and his group have had far-reaching impact on the research community, government policy, and Microsoft products through many significant technology transfers. His personal research spans a variety of topics in mobile computing, wireless systems, cloud services, and datacenter networking and management. Over his career, he has built several highly cited seminal systems, published prolifically in top conferences and journals, authored over 100 patents, given over 35 keynote talks, won numerous awards and honors including ACM SIGMOBILE’s Lifetime Achievement Award and IEEE Outstanding Leadership and Professional Service Award, and has engaged in significant professional and company-wide leadership activities. Victor received his PhD from the University of Massachusetts Amherst in 1997. He is a Fellow of the ACM, IEEE, and AAAS.

Roger Barga

Roger Barga is a Group Program Manager in the Cloud Machine Learning group of Microsoft and a lecturer in Data Science at the University of Washington. Roger has held a number of positions in his career at Microsoft, from Researcher in the Database Research Group of Microsoft Research, to Architect in Microsoft Research Connections, to Engineering Manager in the eXtreme Computing Group of Microsoft Research.

Chaitan Baru

Chaitan Baru is Associate Director, Data Initiatives, San Diego Supercomputer Center (SDSC), UC San Diego, where he is also Director, Center for Large-Scale Data Systems Research. He has played a leadership role in a number of national-scale cyberinfrastructure R&D initiatives across a wide range of science disciplines from earth science to ecology and biomedical informatics. In 2012, he initiated an industry-academia effort to define big data benchmarks via the Workshops for Big Data Benchmarking. He currently leads the Institute for Data Science and Engineering (IDSE) initiative at SDSC, for training, education, and consulting in data science and engineering.

Baru has been a participant in the NSF EarthCube initiative whose goal is to develop a national data infrastructure for Earth System science. He co-chaired the EarthCube Community Group on Data Discovery, Mining, and Access. He is one of the leaders of the recently formed EarthCube Cloud Commons group (also known as GeoCloud) whose objectives is to evaluate, educate, encourage and facilitate adoption of cloud computing by the EarthCube Geosciences community. The group is developing a prototype cloud portal and a sample set of virtual machines. Baru also co-chairs the National Institute for Standards and Technology’s (NIST) Working Group on Big Data.

Tanya Berger-Wolf

Dr. Tanya Berger-Wolf is an Associate Professor in the Department of Computer Science at the University of Illinois at Chicago, where she heads the Computational Population Biology Lab. Her research interests are in applications of computational techniques to problems in ecology, from genetics to social interactions. She has published over 60 papers and given numerous invited talks on the subject. As a legitimate part of her research she gets to fly in a super-light airplane over a nature preserve in Kenya, taking a hyper-stereo video of zebra populations.

Dr. Berger-Wolf has received her Ph.D. in Computer Science from University of Illinois at Urbana-Champaign in 2002. After spending some time as a postdoctoral fellow working in computational phylogenetics and doing research in computational epidemiology, she returned to Illinois. She has received numerous awards for her research and mentoring, including the US National Science Foundation CAREER Award in 2008 and the UIC Mentor of the Year (2009) and Graduate Mentor (2012) awards.

Mercè Crosas

Mercè Crosas, Ph.D., is the Director of Data Science at the Institute for Quantitative Social Science (IQSS) at Harvard University. Her group includes the Dataverse project, data acquisition and curation, the Murray Research Archive, statistical programming (Zelig and related R statistical packages), the Consilience project on text analysis, and DataTags for sharing sensitive data. She is affiliated with the Seamless Astronomy group, and currently collaborating with a Sloan funded project to persistently link journals to data, and NSF funded projects to develop data privacy tools and data sociometric tools, and visualize geospatial data by connecting Dataverse and WorldMap.

Before joining IQSS, Crosas led software development teams in the educational software and biotech industries. Prior to that, she was at the Harvard-Smithsonian Center for Astrophysics, first completing work for her Ph.D. in Astrophysics from Rice University, and later as a researcher and software engineer. Crosas also earned a B.S. in Physics from the Universitat de Barcelona, Spain.

Wuchun Feng

Wu Feng is the Elizabeth & James E. Turner Fellow and Professor of Computer Science, of Electrical & Computer Engineering, and of the Health Sciences at Virginia Tech, where he also directs the Synergy Laboratory and serves as the VT site co-director of the NSF Center for High-Performance Reconfigurable Computing (CHREC). His expertise in in parallel and distributed computing has led to the creation of innovative hardware systems (e.g., Green Destiny and HokieSpeed), systems software (e.g., CoreTSAR, EnergyFit, EcoDaemon), middleware (e.g., MOON), tools (e.g., CU2CL, OpenDwarfs), and ecosystems (e.g., ParaMEDIC) to enable high-performance computing from the embedded space to the datacenter and cloud. To demonstrate the efficacy of his interdisciplinary research, he has applied the above to many application domains, including the life sciences (e.g., mpiBLAST, SeqInCloud, Compute the Cure for Cancer), seismology (e.g., WaveProp), and neuroscience (e.g., TDM).

Dr. Feng received a B.S. in Electrical & Computer Engineering and Music (Honors) and an M.S. in Computer Engineering from Penn State University in 1988 and 1990, respectively. He earned a Ph.D. in Computer Science from the University of Illinois at Urbana-Champaign in 1996. His previous professional stints include IBM T.J. Watson Research Center, NASA Ames Research Center, Vosaic, University of Illinois at Urbana-Champaign, Purdue University, The Ohio State University, Orion Multisystems, and Los Alamos National Laboratory. He is a Distinguished Member of the ACM, Senior Member of the IEEE, and two-time designee of HPCwire's Top People to Watch List. 

Geoffrey Fox

Geoffrey Fox received a Ph.D. in Theoretical Physics from Cambridge University and is now distinguished professor of Informatics and Computing, and Physics at Indiana University where he is director of the Community Grids Laboratory and Associate Dean for Research and Director of the Data Science program at the School of Informatics and Computing. He previously held positions at Caltech, Syracuse University, and Florida State University. He has supervised the PhD of 66 students and published approximately 1,000 papers in physics and computer science with an h-index of 68 and over 24,500 citations.

He currently works in applying computer science to Bioinformatics, Sensor Clouds, Earthquake and Ice-sheet Science, and Particle Physics. He is principal investigator of FutureGrid—a facility to enable development of new approaches to computing including clouds, grids and distributed systems. He is involved in several projects to enhance the capabilities of Minority Serving Institutions including the eHumanity portal. He has experience in online education and its use in MOOC’s for areas like Data and Computational Science. He is a Fellow of APS and ACM.

Dennis Gannon

Dr. Dennis Gannon is Director of Cloud Research Engagements in Microsoft Research. As part of that effort, he has provided cloud resource research projects in 13 countries in collaboration with the US National Science Foundation; the European Commission; the Japanese National Informatics Institute; the national laboratories CSIRO, NICTA, and ANU in Australia; the Taiwan National Science Council; and the National Academy of Science in China. Prior to coming to Microsoft, Dr. Gannon was a professor and chairman of Computer Science at Indiana University and the Science Director for the Indiana Pervasive Technology Labs. Dr. Gannon's research interests include cloud computing, large-scale cyberinfrastructure, distributed computing, computer networks, parallel programming, and computational science. He led several software projects for Defense Advanced Research Projects Agency (DARPA) and the US Department of Energy related to programming massively parallel systems. He has worked extensively with National Science Foundation on interdisciplinary science projects.

David Gauthier

As director of Data Center Architecture and Design Strategy for Microsoft Global Foundation Services (GFS), David Gauthier is responsible for the technical strategy and direction of Microsoft’s global data center footprint. Through a balanced approach to reliability, scalability, and performance, GFS delivers data center strategies enabling cost-optimized solutions for Microsoft’s online, live, and cloud services for consumers and businesses worldwide.

Prior to his current role, David was a founding member of the GFS Data Center Research and Engineering organization, where he was responsible for the design of Microsoft’s Generation 4 Modular Data Center Architecture and led a team of electrical and mechanical engineers to develop innovative and industry-leading approaches for extremely efficient, scalable and flexible data center facilities. In 15 years at Microsoft, David has had technical, program, and project management responsibility for the design and construction of more than 12 international and domestic enterprise-class data centers including Microsoft’s Quincy, San Antonio, Chicago, Dublin, and Boydton, Virginia facilities. David is a Seattle native and attended Washington State University.

Jonathan Goldstein

Jonathan Goldstein began his research career at the University of Wisconsin, where he received his Ph.D. in 1999. In the 15 years since, he has worked at Microsoft: 12 years as a researcher, and 3 years as a product architect and developer. As a researcher, Dr. Goldstein has contributed in a variety of areas related to data processing, including big data processing, stream processing, similarity search, and relational database query processing. He has published more than 20 papers in top database conferences, and is an inventor of more than 20 patents. His work has received various awards, including the SIGMOD test of time award.

One of Dr. Goldstein’s projects, the CEDR streaming system, became the basis of Microsoft StreamInsight, a streaming data processing product released in 2009. Dr. Goldstein’s three years as an architect were spent helping to develop this product, and during this time he contributed to five releases. Since coming back to Microsoft Research, Dr. Goldstein has expanded his interest to the confluence of big data processing, the cloud, and streaming data processing.

Daron Green

Dr. Green is the Senior Regional Manager of Microsoft Research Connections responsible for MSR’s global research investments. Previously he was General Manager of Microsoft’s Technology Policy Group; responsible for identifying business opportunities and innovations likely from potential disruptive technologies. In that role he provided oversight for key mechanisms for Microsoft’s internal processes of innovation and ideation such as ThinkWeek and external efforts such as Microsoft’s Cloud Research Engagements and Microsoft’s Environmental Sustainability program. Prior to this, he was General Manager for Microsoft Research’s external engagement and investment strategy. With a global portfolio which included diverse topics such as Health and Wellbeing, Education and Scholarly Communications, Computer Science and the Environment. Dr. Green’s initial research background was in molecular modeling and equations of state for fluid mixtures—his BSc is in Chemical Physics (1989, Sheffield) and PhD in molecular simulation of fluid mixtures (1992, Sheffield). He went on to do post-doctoral research in simulation of polymer and protein folding (1993–4, UCD). This naturally led to application porting and optimization for large-scale parallel and distributed computing in a range of application domains including computational chemistry (molecular dynamics and quantum mechanical codes), radiography, Computational Fluid Dynamics and Finite Element analysis. Dr. Green then moved more fully into HPC and was responsible for some of Europe’s largest HPC Framework V programs for the European Commission, major HPC procurements in the UK for the UK Research Councils and UK Defense clients; he also led detailed investigations into the maturity and adoption for European HPC Software tools (published). From there Dr. Green went to work for the SGI/Cray—helping to set up the European Professional Services organization from which he span out a small team out to establish the European Professional Services for Selectica Inc—Selectica specialized in on-line configuration/logic-engine technologies offered via web services. Given an HPC/distributed computing background and familiarity with the then embryonic area of Web Services, IBM invited Dr. Green to help establish its Grid Computing Strategy and emerging business opportunity (Grid EBO) team. He subsequently moved to British Telecom to head-up its Global Services business incubation and, as part of this, in 2007 he established and launched BT’s Sustainability practice—responsible for BT’s business offerings to commercial customers which help reduce their carbon footprints and establish business practices which are sustainable in terms of their social and economic impact.

Albert Greenberg


Bill Howe

Bill Howe is the Associate Director of the University of Washington eScience Institute and holds an Affiliate Assistant Professor appointment in Computer Science & Engineering, where he studies big data systems for science applications. Howe developed an online course, “Introduction to Data Science,” that attracted over 100,000 students and has received two Jim Gray Seed Grant awards from Microsoft Research for work on in science databases. He co-authored what are currently the most-cited papers from both VLDB 2010 and SIGMOD 2012 and has had multiple “Best of Conference” papers selected by the VLDB Journal.

Howe serves on the program and organizing committees for a number of conferences in the area of databases and scientific data management, and serves on the Science Advisory Board of the SciDB project. He has a Ph.D. in Computer Science from Portland State University and a bachelor's degree in Industrial & Systems Engineering from Georgia Tech.

Marty Humphrey


John Krumm

John Krumm graduated from Carnegie Mellon University in 1993 with a PhD in robotics and a thesis on texture analysis in images. He worked at the Robotics Center of Sandia National Laboratories in Albuquerque, New Mexico, for the next four years. His main projects there were computer vision for object recognition for use in robots and vehicles. He has been at Microsoft Research in Redmond, Washington, since 1997, and is currently a principal researcher. He concentrates on location tracking of people and devices as well as methods to use location data to benefit the user. He holds 48 US patents. He was a PC chair for UbiComp 2007, ACM SIGSPATIAL 2013, and is a PC chair for the ACM SIGSPATIAL conference in 2014. He served on the editorial board of IEEE Pervasive Computing Magazine, and is currently a co-editor in chief for the Journal of Location Based Services.

Jie Liu

Dr. Jie Liu is a Principal Researcher at Microsoft Research, Redmond, Washington, and the manager of its Sensing and Energy Research Group. His research interests root in understanding and managing the physical properties of computing. Examples include timing, location, energy, and the awareness of and impact on the physical world. He has been publishing in and servicing areas like sensor networks, embedded systems, ubiquitous computing, and energy efficient cloud computing. Dr. Liu is an ACM Distinguished Scientist. He received his Ph.D. degree from Electrical Engineering and Computer Sciences, UC Berkeley in 2001. From 2001 to 2004, he was a research scientist at Palo Alto Research Center (formerly Xerox PARC).

Yung-Hsiang Lu

Many cameras have been deployed for various purposes, such as monitoring traffic, watching natural scenes, and observing weather. In many cases, the video streams are not analyzed by computers and simply discarded. Analyzing videos requires significant amounts of computing resources and cloud computing may meet this need. This project aims to develop a cloud-based software infrastructure for analyzing video streams. This infrastructure manages the cloud resources and meets the performance requirements of the analysis programs. The requirements are affected by many factors, including the videos' frame rates, resolutions, scene contents, as well as the types and the numbers of cloud virtual machines. Through this software infrastructure, analysis programs run on Microsoft Azure and process many videos simultaneously.

Biography: Yung-Hsiang Lu is an associate professor in the School of Electrical and Computer Engineering and (by courtesy) the Department of Computer Science of Purdue University. He is an ACM distinguished scientist (2013) and ACM distinguished speaker (2013–2016). His research areas include computer systems, mobile computing, and image processing. He obtained the Ph.D. from the Department of Electrical Engineering at Stanford University.

Shahrokh Mortazavi

Shahrokh Mortazavi currently works in the Developer Division of Microsoft on Python and Node.js tool chain. Previously, he was in the High Performance Computing group at Microsoft. He worked on the Phoenix Compiler tool chain (code gen, analysis, JIT) at Microsoft Research and for 10 years led Sun Microsystems’ Code Generation & Optimization compiler backend teams.

Steven Roberts

Steven Roberts is an Associate Professor in the School of Aquatic and Fishery Sciences where his research centers around characterizing the response of aquatic organisms to environmental change. Prior to coming to the University of Washington in 2007, he was at the Marine Biological Laboratory in Woods Hole, Massachusetts, and received his PhD from the University of Notre Dame. In graduate school, he spent most of his time transferring agarose gels, and now he spends most of his time transferring files.

Kristin Tolle

Kristin M. Tolle, Ph.D., is a Director of Environmental Science Infrastructure in the Microsoft Research Connections team and co-editor with Tony Hey of the book The Fourth Paradigm: Data Intensive Scientific Discovery. She has been working at Microsoft for 14 years, mainly in Microsoft Research. Prior to joining Microsoft, Dr. Tolle did data-driven biomedical research at the University of Arizona, Artificial Intelligence Lab. Her present research interests include: big data management and curation, the development of freely available tools designed to facilitate time to discovery in environmental science, and data driven models to glean inferences about climate change impacts.

Evelyne Viegas

Evelyne Viegas is the Director of Semantic Computing at Microsoft Research, based in Redmond, Washington. Semantic computing is about interacting with data in rich, safe, and semantically meaningful ways, to create the path from data to information, knowledge, and intelligence. In her current role, Evelyne is building initiatives that focus on information seen as an enabler of innovation, working in partnership with universities and government agencies worldwide. In particular, she is creating programs related to computational intelligence research to drive open innovation and agile experimentation via cloud-based services, as well as projects to advance the state-of-the-art in machine learning, knowledge representation and reasoning under uncertainty at web scale.

Prior to her present role, Evelyne worked as a technical lead at Microsoft delivering natural language processing components to projects for MSN, Office, and Windows. Before Microsoft, and after completing her Ph.D. in France, she worked as a Principal Investigator at the Computing Research Laboratory in New Mexico on an ontology-based machine translation project. Evelyne serves on international editorial, program, and award committees.

Alex Wade

Alex Wade is Director for Scholarly Communication at Microsoft Research, where he oversees a portfolio of research-focused products and services. Alex holds a bachelor's degree in Philosophy from the University of California at Berkeley, and a master's of Librarianship degree from the University of Washington. During his career at Microsoft, Alex has managed the corporate search and taxonomy management services; created solution-accelerators for Office, and shipped Windows Search in Windows Vista and Windows 7. Prior to joining Microsoft, Alex was Systems Librarian at the University of Washington, and held multiple library positions at the University of Michigan and the University of California at Berkeley.

Yi Zhang

Yi Zhang is a Program Manager from Microsoft Excel team, in charge of design and implementation of Excel data analysis features and tools on Apple platform. Recently his team just shipped Office for iPad. Prior to Excel, Yi worked on Microsoft Exchange Server project as a program manager from 2006. Over the years, Yi’s area of expertise spans from enterprise messaging to data visualization, and to data analysis and Big Data. Recently, Jeff has taken on a role of driving the Microsoft BI solutions on cross-platforms.