Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Our research
Content type
+
Downloads (455)
+
Events (449)
 
Groups (151)
+
News (2743)
 
People (734)
 
Projects (1106)
+
Publications (12580)
+
Videos (5747)
Labs
Research areas
Algorithms and theory47205 (341)
Communication and collaboration47188 (215)
Computational linguistics47189 (243)
Computational sciences47190 (223)
Computer systems and networking47191 (762)
Computer vision208594 (911)
Data mining and data management208595 (106)
Economics and computation47192 (105)
Education47193 (85)
Gaming47194 (77)
Graphics and multimedia47195 (233)
Hardware and devices47196 (216)
Health and well-being47197 (92)
Human-computer interaction47198 (894)
Machine learning and intelligence47200 (893)
Mobile computing208596 (57)
Quantum computing208597 (32)
Search, information retrieval, and knowledge management47199 (691)
Security and privacy47202 (311)
Social media208598 (46)
Social sciences47203 (264)
Software development, programming principles, tools, and languages47204 (623)
Speech recognition, synthesis, and dialog systems208599 (138)
Technology for emerging markets208600 (32)
1–25 of 894
Sort
Show 25 | 50 | 100
1234567Next 
Dongwook Yoon, Nicholas Chen, François Guimbretière, and Abigail Sellen

This paper introduces a novel document annotation system that aims to enable the kinds of rich communication that usually only occur in face-to-face meetings. Our system, RichReview, lets users create annotations on top of digital documents using three main modalities: freeform inking, voice for narration, and deictic gestures in support of voice. RichReview uses novel visual representations and timesynchronization between modalities to simplify annotation access and navigation. Moreover, RichReview’s...

Publication details
Date: 1 October 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Ahmed Kharrufa, James Nicholson, Paul Dunphy, Steve Hodges, Pam Briggs, and Patrick Olivier

In addition to their popularity as personal devices, tablets, are becoming increasingly prevalent in work and public settings. In many of these newly-established application domains a supervisor user – such as the teacher in a classroom – oversees the function of one or more devices. Access to supervisory functions is typically controlled through the use of a passcode, but experience shows that keeping this passcode secret can be problematic. In this paper we introduce SwipeID, a method of identifying...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: IFIP
Suman Ravuri and Andreas Stolcke

Utterance classification is a critical pre-processing step for many speech understanding and dialog systems. In multi-user settings, one needs to first identify if an utterance is even directed at the system, followed by another level of classification to determine the intent of the user’s input. In this work, we propose RNN and LSTM models for both these tasks. We show how both models outperform baselines based on ngram-based language models (LMs), feedforward neural network LMs, and boosting...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: ISCA - International Speech Communication Association
Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey Zweig, and Gokhan Tur

Spoken language understanding (SLU) in today’s conversational systems focuses on recognizing a set of domains, intents, and associated arguments, that are determined by application developers. User requests that are not covered by these are usually directed to search engines, and may remain unhandled. We propose a method that aims to find common user intents amongst these uncovered, out-of-domain utterances, with the goal of supporting future phases of dialog system design. Our approach relies on...

Publication details
Date: 1 September 2015
Type: Inproceeding
Publisher: Interspeech 2015 Conference
Barrett Ens, Eyal Ofek, Neil Bruce, and Pourang Irani

We introduce a layout manager that exploits the robust sensing capabilities of next-generation head-worn displays by embedding virtual application windows in the user’s surroundings. With the aim of allowing users to find applications quickly, our approach leverages spatial memory of a known body-centric configuration. The layout manager balances multiple constraints to keep layouts consistent across environments while observing geometric and visual features specific to each locale. We compare various...

Publication details
Date: 9 August 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Wangjiang Zhu, Baoyuan Wang, and Steve Lin
Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: 12th IEEE International Conference On Advanced Video and Signal-based Survillance
Gerard Pons-Moll, Jonathan Taylor, Jamie Shotton, Aaron Hertzmann, and Andrew Fitzgibbon

We present a new method for inferring dense data to model correspondences, focusing on the application of human pose estimation from depth images. Recent work proposed the use of regression forests to quickly predict correspondences between depth pixels and points on a 3D human mesh model. That work, however, used a proxy forest training objective based on the classification of depth pixels to body parts. In contrast, we introduce Metric Space Information Gain (MSIG), a new decision forest training...

Publication details
Date: 1 August 2015
Type: Article
Publisher: Springer
Stuart Schechter and Joseph Bonneau

Nearly all smartphones and tablets support unlocking with a short user-chosen secret: e.g., a numeric PIN or a pattern. To address users’ tendency to choose guessable PINs and patterns, we compare two approaches for helping users learn assigned random secrets. In one approach, built on our prior work [16], we assign users a second numeric PIN and, during each login, we require them to enter it after their chosen PIN. In a new approach, we re-arrange the digits on the keypad so that the user’s...

Publication details
Date: 22 July 2015
Type: Inproceeding
Publisher: USENIX – Advanced Computing Systems Association
Hanif Baharin, Stephen Viller, and Sean Rintel

SonicAIR is an ambient awareness technology probe designed to explore how connecting the soundscapes of friends or family members might reduce the isolation of seniors living independently. At its core, SonicAIR instruments kitchen activity sites to produce an always-on real-time aural representation of remote domestic rhythms. This paper reports how users in two pilot SonicAIR deployments used the sounds as resources for recognizing comfortable narratives of sociability. Used alongside telecare...

Publication details
Date: 1 July 2015
Type: Article
Publisher: ACM – Association for Computing Machinery
T. J. Tsai, Andreas Stolcke, and Malcolm Slaney

The goal of addressee detection is to answer the question, “Are you talking to me?” When a dialogue system interacts with multiple users, it is crucial to detect when a user is speaking to the system as opposed to another person. We study this problem in a multimodal scenario, using lexical, acoustic, visual, dialogue state, and beamforming information. Using data from a multiparty dialogu system, we quantify the benefits of using multiple modalities over using a single modality. We also assess the...

Publication details
Date: 1 July 2015
Type: Article
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Sarah K. Tyler, Jaime Teevan, Peter Bailey, Sebastian de la Chica, and Nikhil Dandekar

Information on almost any given topic can be found on the Web, often accessible via many different websites. But even when the topical content is similar across websites, the websites can have different characteristics that appeal to different people. As a result, individuals can develop preferred websites to visit for certain topics. While it has long been speculated that such preferences exist, little is understood about how prevalent, clear, and stable these preferences actually are. We characterize...

Publication details
Date: 1 June 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-48
Sameh Khamis, Jonathan Taylor, Jamie Shotton, Cem Keskin, Shahram Izadi, and Andrew Fitzgibbon

We describe how to learn a compact and efficient model of the surface deformation of human hands. The model is built from a set of noisy depth images of a diverse set of subjects performing different poses with their hands. We represent the observed surface using Loop subdivision of a control mesh that is deformed by our learned parametric shape and pose model. The model simultaneously accounts for variation in subject-specific shape and subject-agnostic pose. Specifically, hand shape is...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Zhaoyuan Ma, Darren Edge, Leah Findlater, and Hong Tan

The present study used a flat keyboard without moving keys and enabled with haptic keyclick feedback to examine the effect of haptic keyclick feedback on touch typing performance. We investigated, with well-controlled stimuli and a within-participant design, how haptic keyclick feedback might improve typing performance in terms of typing speed, typing efficiency and typing errors. Of the three kinds of haptic feedback we tested, all increased typing speed and decreased typing errors compared to a...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
John Vilk, David Molnar, Benjamin Livshits, Eyal Ofek, Chris Rossbach, Alexander Moshchuk, Helen J. Wang, and Ran Gal

Immersive experiences that mix digital and real-world objects are becoming reality, but they raise serious privacy concerns as they require real-time sensor input. These experiences are already present on smartphones and game consoles via Kinect, and will eventually emerge on the web platform. However, browsers do not expose the display interfaces needed to render immersive experiences. Previous security research focuses on controlling application access to sensor input alone, and do not deal...

Publication details
Date: 18 May 2015
Type: Proceedings
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Kuansan Wang

Human is the only species on earth that has mastered the technologies in writing and printing to capture ephemeral thoughts and scientific discoveries. The capabilities to pass along knowledge, not only geographically but also generationally, have formed the bedrock of our civilizations. We are in the midst of a silent revolution driven by the technological advancements: no longer are computers just a fixture of our physical world but have they been so deeply woven into our daily routines that they are...

Publication details
Date: 18 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Jarrod Knibbe, Hrvoje Benko, and Andrew D. Wilson

Projector-camera (pro-cam) systems afford a wide range of interactive possibilities, combining both natural and mixed-reality 3D interaction. However, the latency inherent within these systems can cause the projection to ‘slip’ from its intended target, detracting from the overall experience. Because of this, pro-cam systems have typically shied away from truly dynamic scenarios. In turn, research has been exploring latency reduction techniques across a range of domains, but these techniques typically...

Publication details
Date: 11 May 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-35
Helen J. Wang, Alexander Moshchuk, Michael Gamon, Mona Haraty, Shamsi Iqbal, Eli T. Brown, Ashish Kapoor, Chris Meek, Eric Chen, Yuan Tian, Jaime Teevan, Mary Czerwinski, and Susan Dumais

In this paper, we advocate “activity” to be a central abstraction between people and computing instead of applications. We outline the vision of the activity platform as the next-generation social platform.

Publication details
Date: 8 May 2015
Type: Technical report
Publisher: Microsoft Research
Number: MSR-TR-2015-38
Aditya Vashistha, Edward Cutrell, Gaetano Borriello, and William Thies

Interactive voice forums have emerged as a promising platform for people in developing regions to record and share audio messages using low-end mobile phones. However, one of the barriers to the scalability of voice forums is the process of screening and categorizing content, often done by a dedicated team of moderators. We present Sangeet Swara, a voice forum for songs and cultural content that relies on the community of callers to curate high-quality posts that are prioritized for playback to others....

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Madeline E. Smith and John C. Tang

While group messaging has become popular, particularly among adolescents, it has not yet been explored in the HCI literature. We interviewed 48 adolescents, aged 15-24, who use group messaging regularly. We present a framework for understanding the types of groups they communicate with according to three dimensions: focus, membership, and duration. We also present findings about factors influencing their choice of group messaging tools and the problems they have managing multiple group threads using...

Publication details
Date: 1 May 2015
Type: Technical report
Number: MSR-TR-2015-43
Cecily Morrison, Marcus D'Souza, Kit Huckvale, Jonas F Dorn, Jessica Burggraaff, Christian Philipp Kamm, Saskia Marie Steinheimer, Peter Kontschieder, Antonio Criminisi, Bernard Uitdehaag, Frank Dahlke, Ludwig Kappos, and Abigail Sellen

Background: Sensor-based recordings of human movements are becoming increasingly important for the assessment of motor symptoms in neurological disorders beyond rehabilitative purposes. ASSESS MS is a movement recording and analysis system being developed to automate the classification of motor dysfunction in patients with multiple sclerosis using depth-sensing computer vision. It aims to provide a more consistent and finer-grained measurement of motor dysfunction than currently possible....

Publication details
Date: 1 May 2015
Type: Article
M. R. P. Thomas, H. Gamper, and I. J. Tashev

Beamforming and channel equalizers can be formulated as optimal multichannel filter-and-sum operations with different objective criteria. It has been shown in previous studies that the combination of both concepts under a common framework can yield results that combine both the spatial robustness of beamforming and the dereverberation performance of channel equalization. This paper introduces an additional method for leveraging both approaches that exploits channel estimates in a wanted spatial location...

Publication details
Date: 23 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Hannes Gamper, Mark R. P. Thomas, and Ivan J. Tashev

The estimation of acoustic propagation delays from a sound source to a listener’s ear entrances is useful for understanding and visualising the wave propagation along the surface of the head, and necessary for individualised spatial sound rendering. The interaural time difference (ITD) is of particular research interest, as it constitutes one of the main localisation cues exploited by the human auditory system. Here, an approach is proposed that employs ray tracing on a 3-D head scan to estimate and...

Publication details
Date: 22 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Felicia Lim, Mark R. P. Thomas, and Ivan J. Tashev

Reverberation time is an important parameter for characterizing acoustic environments. It is useful in many applications including acoustic scene analysis, robust automatic speech recognition and dereverberation. Given knowledge of the acoustic impulse response, reverberation time can be measured using Schroeder’s backward integration method. Since it is not always practical to obtain impulse responses, blind estimation algorithms are sometimes desirable. In this work, the reverberation problem is...

Publication details
Date: 21 April 2015
Type: Inproceeding
Publisher: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
Masa Ogata and Masaaki Fukumoto

FluxPaper is a new paper-based medium that enables physical movement and dynamic interaction between a high-power magnetized paper and a programmable magnetic field. FluxPaper has a very thin patterned magnetic layer (0.1 mm) pasted behind the paper. A thin but strong neodymium-based magnet realizes fast, powerful, and precise physical actions while retaining the original characteristics of the paper that is widely used in our daily lives. Owing to an effective magnetic pattern and a computer-controlled...

Publication details
Date: 20 April 2015
Type: Proceedings
Publisher: ACM – Association for Computing Machinery
Rachel Jacobs, Steven Benford, and Ewa Luger

Publication details
Date: 18 April 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
1–25 of 894
Sort
Show 25 | 50 | 100
1234567Next 
> Our research