Agenda

Microsoft eScience Workshop 2012

Monday, October 8, 2012

Time

Room: Crystal A  Room: Crystal C

7:30–8:30

Continental Breakfast/Check-in – Room Crystal B 

8:30–9:30

Welcome

Speaker: Tony Hey, Microsoft Research | slides

Keynote

Defensible Modeling of the Biosphere

Chair: Kristin Tolle, Microsoft Research

Speaker: Drew Purves, Microsoft Research | video | slides

9:30–10:00

Beverage Break – Room Crystal B 

10:00–12:00

Open Data for Open Science—Data Interoperability

Chair: Yan Xu, Microsoft Research

 

General Informatics

Chair: Kristin Tolle, Microsoft Research

Panel: Open Data for Open Science—Data Interoperability | video
Speakers: 

Robert Gurney, University of Reading

Philip Murphy, University of Redlands | slides

Karen Stocks, University of California, San Diego | slides

Yan Xu, Microsoft Research | slides

Ilya Zaslavsky, University of California, San Diego | slides

Panel: Enabling Multi-Scale Science | video

Speakers:

Roberto Cesar, University of Sao Paulo (USP) | slides

James Hunt, University of California, Berkeley | slides

Claudia Bauzer Medeiros, University of Campinas (UNICAMP) | slides

 

The Internet of Databases—Generalizing the Archaeo Informatics Approach | video | slides
Speaker: Chris van der Meijden, Ludwig Maximilians University of Munich, Germany

 

Combining Semantic Tagging and Support Vector Machines to Streamline the Analysis of Animal Accelerometry Data | video | slides
Speaker: Nigel Ward, The University of Queensland

12:00–1:15

Lunch – Room Crystal B 

1:15–2:45

Handling Big Data for the Environmental Informatics

Chair: Yan Xu, Microsoft Research | slides

 

Active Publications

Chair: Dennis Gannon, Microsoft Research

Panel: Handling Big Data for the Environmental Informatics / Real-Time Environmental Observation, Modeling, and Decision Support | video

Speakers:

Jeff Dozier, University of California, Santa Barbara | slides

David Maidment, University of Texas, Austin | slides

Barbara Minsker, University of Illinois, Urbana-Champaign | slides

Chaowei Yang, George Mason University | slides

Active Publications | video
Speakers:

Ian Foster, University of Chicago and Argonne National Laboratory | slides

Tanu Malik, University of Chicago and Argonne National Laboratory | slides

2:45–3:15

Beverage Break – Room Crystal B 

3:15–4:30

The Cloud and Big Data

Chair: Kenji Takeda, Microsoft Research | slides

 

Machine-Assisted Thought

Chair: Harold Javid, Microsoft Research

Panel: Cloud Computing - What Do Researchers Want? | video

Speakers:

Fabrizio Gagliardi, Microsoft Research | slides

Dennis Gannon, Microsoft Research | slides

Marty Humphrey, University of Virginia | slides

Paul Watson, Newcastle University | slides

Machine-Assisted Thought | video | slides
Speaker: Michael J. Kurtz, Harvard-Smithsonian Center for Astrophysics

4:30–6:00  

DemoFest | video | slides

Chair: Jim Pinkelman, Microsoft Research

 

Layerscape: Tools for Collaborative Analysis of Complex Data
Presenter: Rob Fatland, Microsoft Research

 

Globus Online: Research Data Management as a Service

Presenters: Ian Foster, University of Chicago and Argonne National Laboratory; Steve Tuecke, University of Chicago; Vas Vasiliadis, University of Chicago

 

The Open-Source ISA Metadata Tracking Framework: from Data Curation and Management at the Source, to the Linked Data Universe

Presenter: Eamonn Maguire, University of Oxford

 

SOLE: Connecting Publications to Large Online Data Repositories

Presenter: Tanu Malik, University of Chicago and Argonne National Laboratory

 

DataUp: A Tool for Documenting and Sharing Scientific Tabular Data
Presenter: Carly Strasser, California Digital Library

 

Databib: An Online Catalog of Research Data Repositories
Presenter: Michael Witt, Purdue University

 

12,000 Human Genomes from Raw Sequence to Result, on Windows and Windows Azure
Presenter: Dong Xie, Oxford University

 

OData and Environmental Informatics 
Presenter: Yan Xu, Microsoft Research

6:00–6:30

Bus to Adler Planetarium

6:30–8:30

Adler Planetarium Reception

8:30–9:00

Bus to Hyatt and Swissôtel

 

Tuesday, October 9, 2012

Time

Room: Crystal A  Room: Crystal C

7:30–8:30

Continental Breakfast/Check-in – Room Crystal B 

8:30–9:30

Keynote 

Biology: A Move to Dry Labs | video

Chair: Dan Fay, Microsoft Research | slides

Speaker: David Heckerman, Microsoft Research | slides

9:30–10:00

Beverage Break – Room Crystal B 

10:00–12:00

Data Scientists: Part I

Chair: Gail Steinhart, Cornell University

 

Citizen Science and Big Data

Chair: Chris Mentzel, Gordon and Betty Moore Foundation

Panel: Educating Data Scientists for Scientific Data | video

Moderator: Gail Steinhart, Cornell University

 

Teaching Scientific Data Management in Data Science Education and Workforce Development Programs for Science Communities | video | slides
Speaker: Robert R. Downs, Columbia University

 

Educating Scientists About the Data Life Cycle | slides 
Speaker: William Michener, University of New Mexico

 

Priorities for Data Curation Education: Data Center Partnerships and Long-Tail Science | video | slides
Speaker: Carole Palmer, University of Illinois at Urbana-Champaign

 

Educating a New Breed of Data Scientists for Scientific Data Management | video | slides
Speaker: Jian Qin, Syracuse University

The Utility of a Human/Computer Learning Network For Improving Biodiversity Conservation and Research in eBird | video | slides

Speaker: Carl Lagoze, University of Michigan

 

Tools and Techniques for Outreach and Popular Engagement in eScience | video | slides

Speaker: Rafael Santos, Instituto Nacional de Pesquisas Espaciais

 

Big Data Processing on the Cheap | video | slides
Speaker: Joe Hummel, University of California, Irvine  

12:00–1:30

Lunch – Room Crystal B 

1:30–3:00

Data Scientists: Part II
Chair: Kenji Takeda, Microsoft Research

 

Publishing and eScience | video
Co-Chairs: Mark Abbott, Oregon State University; Jeff Dozier, University of California, Santa Barbara

What Is a Data Scientist? | video

Speakers:

Liz Lyon, UKOLN-DCC, University of Bath, UK | slides

Kenji Takeda, Microsoft Research

 

Informatics, Information Science, Computer Science, and Data Science Curricula | video | slides
Speaker: Geoffrey Fox, Indiana University

 

Data Science Curricula at the University of Washington eScience Institute | video | slides
Speaker: Bill Howe, University of Washington

Scientific Publishing in a Connected, Mobile World | slides 
Speaker: Mark Abbott, Oregon State University 

 

How to Collaborate with the Crowd: a Method for “Publishing” Ongoing Work | slides

Speaker: Jeff Dozier, University of California, Santa Barbara; Visiting Researcher, Microsoft Research

 

When Provenance Gets Real: Implications of Ubiquitous Provenance for Scientific Collaboration and Publishing | slides

Speaker: James Frew, University of California, Santa Barbara

 

Data Journal Challenge for the Fourth Paradigm-Trust through Data on Environmental Studies and Projects | slides

Speaker: Shuichi Iwata, The Graduate School of Project Design

3:00–3:30

Beverage Break – Room Crystal B 

3:30–5:00

Data Curation

Chair: Kristin Tolle, Microsoft Research

Novel Approaches to Data Visualization | video

Chair: George Djorgovski, California Institute of Technology

Panel: Scientific Data: the Current Landscape, Challenges, and Solutions | video | slides
Moderator: Carly Strasser, California Digital Library

Speakers:

Jeff Dozier, University of California, Santa Barbara

Chris Mentzel, Gordon and Betty Moore Foundation

William Michener, University of New Mexico

Dave Vieglais, The University of Kansas

Stephanie Wright, University of Washington

Data Visualization in Virtual Spaces and High Dimensions | slides
Speaker: George Djorgovski, California Institute of Technology

 

CT and Imaging Tools for Windows HPC Clusters and Azure Cloud | slides

Speaker: Darren Thompson, CSIRO (Advanced Scientific Computing)

 

Work in Progress Toward Enhancing Multidimensional Visualization with Analytical Workflows | slides

Speaker: Dawn Wright, Environmental Systems Research Institute

5:15–6:00

Reception – Crystal Foyer

6:00–9:00

Microsoft Jim Gray eScience Award Banquet – Room Crystal B 

6:00–7:00

Dinner 

7:00–7:30

Announcement of Jim Gray eScience Award Recipient

Host: Tony Hey, Microsoft Research | video (subsequent keynote address also on this video) | slides

7:30–9:00

Keynote 

The Possibilities and Pitfalls of Internet-Based Chemical Data

Chair: Tony Hey, Microsoft Research

Speaker: Antony John Williams, Royal Society of Chemistry | video (Jim Gray Award precedes keynote on this video) | slides