Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Our research
Content type
+
Downloads (453)
+
Events (445)
 
Groups (152)
+
News (2726)
 
People (747)
 
Projects (1101)
+
Publications (12525)
+
Videos (5663)
Labs
Research areas
Algorithms and theory47205 (335)
Communication and collaboration47188 (214)
Computational linguistics47189 (234)
Computational sciences47190 (220)
Computer systems and networking47191 (757)
Computer vision208594 (906)
Data mining and data management208595 (105)
Economics and computation47192 (104)
Education47193 (82)
Gaming47194 (76)
Graphics and multimedia47195 (233)
Hardware and devices47196 (215)
Health and well-being47197 (91)
Human-computer interaction47198 (886)
Machine learning and intelligence47200 (878)
Mobile computing208596 (54)
Quantum computing208597 (32)
Search, information retrieval, and knowledge management47199 (674)
Security and privacy47202 (309)
Social media208598 (42)
Social sciences47203 (261)
Software development, programming principles, tools, and languages47204 (620)
Speech recognition, synthesis, and dialog systems208599 (129)
Technology for emerging markets208600 (32)
1–25 of 886
Sort
Show 25 | 50 | 100
1234567Next 
Dongwook Yoon, Nicholas Chen, François Guimbretière, and Abigail Sellen

This paper introduces a novel document annotation system that aims to enable the kinds of rich communication that usually only occur in face-to-face meetings. Our system, RichReview, lets users create annotations on top of digital documents using three main modalities: freeform inking, voice for narration, and deictic gestures in support of voice. RichReview uses novel visual representations and timesynchronization between modalities to simplify annotation access and navigation. Moreover, RichReview’s...

Publication details
Date: 1 October 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Suman Ravuri and Andreas Stolcke

Utterance classification is a critical pre-processing step for many speech understanding and dialog systems. In multi-user settings, one needs to first identify if an utterance is even directed at the system, followed by another level of classification to determine the intent of the user’s input. In this work, we propose RNN and LSTM models for both these tasks. We show how both models outperform baselines based on ngram-based language models (LMs), feedforward neural network LMs, and boosting...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: ISCA - International Speech Communication Association
Ahmed Kharrufa, James Nicholson, Paul Dunphy, Steve Hodges, Pam Briggs, and Patrick Olivier

In addition to their popularity as personal devices, tablets, are becoming increasingly prevalent in work and public settings. In many of these newly-established application domains a supervisor user – such as the teacher in a classroom – oversees the function of one or more devices. Access to supervisory functions is typically controlled through the use of a passcode, but experience shows that keeping this passcode secret can be problematic. In this paper we introduce SwipeID, a method of identifying...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: IFIP
Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey Zweig, and Gokhan Tur

Spoken language understanding (SLU) in today’s conversational systems focuses on recognizing a set of domains, intents, and associated arguments, that are determined by application developers. User requests that are not covered by these are usually directed to search engines, and may remain unhandled. We propose a method that aims to find common user intents amongst these uncovered, out-of-domain utterances, with the goal of supporting future phases of dialog system design. Our approach relies on...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: Interspeech 2015 Conference
Gerard Pons-Moll, Jonathan Taylor, Jamie Shotton, Aaron Hertzmann, and Andrew Fitzgibbon

We present a new method for inferring dense data to model correspondences, focusing on the application of human pose estimation from depth images. Recent work proposed the use of regression forests to quickly predict correspondences between depth pixels and points on a 3D human mesh model. That work, however, used a proxy forest training objective based on the classification of depth pixels to body parts. In contrast, we introduce Metric Space Information Gain (MSIG), a new decision forest training...

Publication details
Date: 1 August 2015
Type: Article
Publisher: Springer
Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: 12th IEEE International Conference On Advanced Video and Signal-based Survillance
Stuart Schechter and Joseph Bonneau

Nearly all smartphones and tablets support unlocking with a short user-chosen secret: e.g., a numeric PIN or a pattern. To address users’ tendency to choose guessable PINs and patterns, we compare two approaches for helping users learn assigned random secrets. In one approach, built on our prior work [16], we assign users a second numeric PIN and, during each login, we require them to enter it after their chosen PIN. In a new approach, we re-arrange the digits on the keypad so that the user’s...

Publication details
Date: 22 July 2015
Type: Inproceeding
Publisher: USENIX – Advanced Computing Systems Association
Sarah K. Tyler, Jaime Teevan, Peter Bailey, Sebastian de la Chica, and Nikhil Dandekar

Information on almost any given topic can be found on the Web, often accessible via many different websites. But even when the topical content is similar across websites, the websites can have different characteristics that appeal to different people. As a result, individuals can develop preferred websites to visit for certain topics. While it has long been speculated that such preferences exist, little is understood about how prevalent, clear, and stable these preferences actually are. We characterize...

Publication details
Date: 1 June 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-48
Sameh Khamis, Jonathan Taylor, Jamie Shotton, Cem Keskin, Shahram Izadi, and Andrew Fitzgibbon

We describe how to learn a compact and efficient model of the surface deformation of human hands. The model is built from a set of noisy depth images of a diverse set of subjects performing different poses with their hands. We represent the observed surface using Loop subdivision of a control mesh that is deformed by our learned parametric shape and pose model. The model simultaneously accounts for variation in subject-specific shape and subject-agnostic pose. Specifically, hand shape is...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Kuansan Wang

Human is the only species on earth that has mastered the technologies in writing and printing to capture ephemeral thoughts and scientific discoveries. The capabilities to pass along knowledge, not only geographically but also generationally, have formed the bedrock of our civilizations. We are in the midst of a silent revolution driven by the technological advancements: no longer are computers just a fixture of our physical world but have they been so deeply woven into our daily routines that they are...

Publication details
Date: 18 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
John Vilk, David Molnar, Benjamin Livshits, Eyal Ofek, Chris Rossbach, Alexander Moshchuk, Helen J. Wang, and Ran Gal

Immersive experiences that mix digital and real-world objects are becoming reality, but they raise serious privacy concerns as they require real-time sensor input. These experiences are already present on smartphones and game consoles via Kinect, and will eventually emerge on the web platform. However, browsers do not expose the display interfaces needed to render immersive experiences. Previous security research focuses on controlling application access to sensor input alone, and do not deal...

Publication details
Date: 18 May 2015
Type: Proceedings
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Jarrod Knibbe, Hrvoje Benko, and Andrew D. Wilson

Projector-camera (pro-cam) systems afford a wide range of interactive possibilities, combining both natural and mixed-reality 3D interaction. However, the latency inherent within these systems can cause the projection to ‘slip’ from its intended target, detracting from the overall experience. Because of this, pro-cam systems have typically shied away from truly dynamic scenarios. In turn, research has been exploring latency reduction techniques across a range of domains, but these techniques typically...

Publication details
Date: 11 May 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-35
Helen J. Wang, Alexander Moshchuk, Michael Gamon, Mona Haraty, Shamsi Iqbal, Eli T. Brown, Ashish Kapoor, Chris Meek, Eric Chen, Yuan Tian, Jaime Teevan, Mary Czerwinski, and Susan Dumais

In this paper, we advocate “activity” to be a central abstraction between people and computing instead of applications. We outline the vision of the activity platform as the next-generation social platform.

Publication details
Date: 8 May 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-38
Cecily Morrison, Marcus D'Souza, Kit Huckvale, Jonas F Dorn, Jessica Burggraaff, Christian Philipp Kamm, Saskia Marie Steinheimer, Peter Kontschieder, Antonio Criminisi, Bernard Uitdehaag, Frank Dahlke, Ludwig Kappos, and Abigail Sellen

Background: Sensor-based recordings of human movements are becoming increasingly important for the assessment of motor symptoms in neurological disorders beyond rehabilitative purposes. ASSESS MS is a movement recording and analysis system being developed to automate the classification of motor dysfunction in patients with multiple sclerosis using depth-sensing computer vision. It aims to provide a more consistent and finer-grained measurement of motor dysfunction than currently possible....

Publication details
Date: 1 May 2015
Type: Article
Aditya Vashistha, Edward Cutrell, Gaetano Borriello, and William Thies

Interactive voice forums have emerged as a promising platform for people in developing regions to record and share audio messages using low-end mobile phones. However, one of the barriers to the scalability of voice forums is the process of screening and categorizing content, often done by a dedicated team of moderators. We present Sangeet Swara, a voice forum for songs and cultural content that relies on the community of callers to curate high-quality posts that are prioritized for playback to others....

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Madeline E. Smith and John C. Tang

While group messaging has become popular, particularly among adolescents, it has not yet been explored in the HCI literature. We interviewed 48 adolescents, aged 15-24, who use group messaging regularly. We present a framework for understanding the types of groups they communicate with according to three dimensions: focus, membership, and duration. We also present findings about factors influencing their choice of group messaging tools and the problems they have managing multiple group threads using...

Publication details
Date: 1 May 2015
Type: Technical report
Number: MSR-TR-2015-43
M. R. P. Thomas, H. Gamper, and I. J. Tashev

Beamforming and channel equalizers can be formulated as optimal multichannel filter-and-sum operations with different objective criteria. It has been shown in previous studies that the combination of both concepts under a common framework can yield results that combine both the spatial robustness of beamforming and the dereverberation performance of channel equalization. This paper introduces an additional method for leveraging both approaches that exploits channel estimates in a wanted spatial location...

Publication details
Date: 23 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Hannes Gamper, Mark R. P. Thomas, and Ivan J. Tashev

The estimation of acoustic propagation delays from a sound source to a listener’s ear entrances is useful for understanding and visualising the wave propagation along the surface of the head, and necessary for individualised spatial sound rendering. The interaural time difference (ITD) is of particular research interest, as it constitutes one of the main localisation cues exploited by the human auditory system. Here, an approach is proposed that employs ray tracing on a 3-D head scan to estimate and...

Publication details
Date: 22 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Felicia Lim, Mark R. P. Thomas, and Ivan J. Tashev

Reverberation time is an important parameter for characterizing acoustic environments. It is useful in many applications including acoustic scene analysis, robust automatic speech recognition and dereverberation. Given knowledge of the acoustic impulse response, reverberation time can be measured using Schroeder’s backward integration method. Since it is not always practical to obtain impulse responses, blind estimation algorithms are sometimes desirable. In this work, the reverberation problem is...

Publication details
Date: 21 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Masa Ogata and Masaaki Fukumoto

FluxPaper is a new paper-based medium that enables physical movement and dynamic interaction between a high-power magnetized paper and a programmable magnetic field. FluxPaper has a very thin patterned magnetic layer (0.1 mm) pasted behind the paper. A thin but strong neodymium-based magnet realizes fast, powerful, and precise physical actions while retaining the original characteristics of the paper that is widely used in our daily lives. Owing to an effective magnetic pattern and a computer-controlled...

Publication details
Date: 20 April 2015
Type: Proceedings
Publisher: ACM – Association for Computing Machinery
Rachel Jacobs, Steven Benford, and Ewa Luger

Publication details
Date: 18 April 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Ewa Luger, Lachlan Urquhart, Tom Rodden, and Michael Golembewski

The regulatory climate is in a process of change. Design, having been implicated for some time, is now explicitly linked to law. This paper recognises the heightened role of designers in the regulation of ambient interactive technologies. Taking account of incumbent legal requirements is difficult. Legal rules are convoluted, uncertain, and not geared towards operationalisable heuristics or development guidelines for system designers. Privacy and data protection are a particular moral, social and legal...

Publication details
Date: 17 April 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Elena L. Glassman, Juho Kim, Andrés Monroy-Hernández, and Meredith Ringel Morris

Educators have developed an effective technique to get feedback after in-person lectures, called “muddy cards.” Students are given time to reflect and write the “muddiest” (least clear) point on an index card, to hand in as they leave class. This practice of assigning end-of-lecture reflection tasks to generate explicit student feedback is well suited for adaptation to the challenge of supporting feedback in online video lectures. We describe the design and evaluation of Mudslide, a prototype system...

Publication details
Date: 1 April 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Alex S Taylor, Siân Lindley, Tim Regan, David Sweeney, Vasillis Vlachokyriakos, Lillie Grainger, and Jessica Lingel

We present findings from a year-long engagement with a street and its community. The work explores how the production and use of data is bound up with place, both in terms of physical and social geography. We detail three strands of the project. First, we consider how residents have sought to curate existing data about the street in the form of an archive with physical and digital components. Second, we report endeavours to capture data about the street’s environment, especially of vehicle traffic....

Publication details
Date: 1 April 2015
Type: Proceedings
Publisher: ACM – Association for Computing Machinery
Awards: Honorable Mention
Tien T. Nguyen, Duyen T. Nguyen, Shamsi T. Iqbal, and Eyal Ofek

Striking up a good conversation with new acquaintances is often difficult. In this paper we introduce a system that uses a ranking recommendation algorithm to generate real-time personalized topic suggestions during a conversation. The system then delivers the suggestions via Google Glass. We conducted a study with 38 pairs of strangers, who received such suggestions while conversing with a person they met for the first time. Participants found the suggestions to be helpful, but only at the right...

Publication details
Date: 1 April 2015
Type: Inproceeding
Publisher: ACM Conference on Human Factors for Computing Systems (CHI)
1–25 of 886
Sort
Show 25 | 50 | 100
1234567Next 
> Our research