In this paper we present the architecture and the main building blocks of the sound capturing and speech enhancement system of Kinect. Two blocks contribute most to the final results: the surround sound echo cancellation and the microphone array processing. The Kinect device is designed to add gesture and speech to the human-machine interface of Microsoft’s Xbox gaming console, but the technologies behind Kinect go well beyond gaming and entertainment.
|Published in||International Journal on Information Technology and Security|
|Publisher||Union of Scientists in Bulgaria|