ATLC is always keen to establish research collaborations with local research entities and universities, with the goal of advancing the state of the art in computer science fields and promoting world-class research within the local research community. One such effort is undertaken with Nile University in the field of computer vision.
The ubiquity of image and video capturing devices such as digital cameras and mobile phones has resulted in an unprecedented volume of image and video data being produced and shared every day. This has spurred the development of more effective techniques for accessing this data, whether for retrieval, browsing, filtering or summarization. To date, most commercially deployed solutions are based on metadata associated with the image and video content. Unfortunately, only a small percentage of multimedia data comes with user annotations, and even where metadata is present, it is far from covering the diverse intentions of users, especially given that an image is worth a thousand words and video is even richer in semantics. Besides, textual annotations are usually associated with the video content as a whole, making it difficult to randomly access a particular time point in the video. The goal of this collaboration with Nile University is to enable such in-content access to video footage. We are investigating a set of research directions that would advance the current state-of-the-art approaches for detecting and recognizing activities in real-world videos. For example, one intended outcome of the project is to label a scene in a video sequence with a textual tag such as “person running”, “walking” or “shoplifting”.
Project outcomes so far:
The project has resulted in a number of conference papers and paper submissions:
1. Mohamed A. Naiel, Mohamed M. Abdel Nasser, Moataz M. Abdelwahab and Motaz El-Saban, “Human classification in images employing two dimensional principal component analysis on histogram of oriented gradients”, submitted to ICIP 2011.
2. Mohammad Nael, Mohammad Mostafa, Moataz Abd EL Wahab and Motaz El-Saban, “Human Action Recognition System Employing 2DPCA in the Spatial or Transform domain”, submitted to ICCV 2011.
3. Mohammad Nael, Moataz Abd El Wahab and Motaz El-Saban, “Multi-view Human Action Recognition System Employing 2DPCA”, WACV 2011.
Project members:
- Moataz Abd El Wahab, assistant professor, Nile University
- Motaz EL Saban, researcher, ATLC
- Mohammad Nael, Moataz Abd El Wahab, and Motaz El Saban, Multi-view Human Action Recognition System Employing 2DPCA, in WACV, IEEE, 2011.
- Mohammad Nael, Moataz Abd El Wahab, Motaz El Saban, and Mikhail Wasfy, Highly Efficient Human Action Recognition using compact 2DPCA-based descriptors in the Spatial and Transform domains, in Midwest Symposium on Circuits and Systems (MWSCAS), IEEE, 2011.