Visual Understanding with RGB-D Sensors: An Introduction to the Special Issue
Richang Hong, Shuicheng Yan, Zhengyou Zhang
Article No.: 11
Considering that the existing depth recovery approaches have different limitations when applied to Kinect depth data, in this article, we propose to integrate their effective features including adaptive support region selection, reliable depth...
The rapid spreading of RGB-D cameras has led to wide applications of 3D videos in both academia and industry, such as 3D entertainment and 3D visual understanding. Under these circumstances, extensive research efforts have been dedicated to RGB-D...
A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion
Nadia Figueroa, Haiwei Dong, Abdulmotaleb El Saddik
Article No.: 14
We propose a 6D RGB-D odometry approach that finds the relative camera pose between consecutive RGB-D frames by keypoint extraction and feature matching both on the RGB and depth image planes. Furthermore, we feed the estimated pose to the highly...
The availability of massive RGB-depth (RGB-D) images poses a compelling need for effective RGB-D content understanding techniques. RGB-D images provide synchronized information from multiple views (e.g., color and depth) of real-world objects and...
Mining object-level knowledge, that is, building a comprehensive category model base, from a large set of cluttered scenes presents a considerable challenge to the field of artificial intelligence. How to initiate model learning with the least...
Accurate and Robust Moving-Object Segmentation for Telepresence Systems
Meiyu Huang, Yiqiang Chen, Wen Ji, Chunyan Miao
Article No.: 17
Moving-object segmentation is the key issue of Telepresence systems. With monocular camera-based segmentation methods, desirable segmentation results are hard to obtain in challenging scenes with ambiguous color, illumination changes, and...
Fusing Multiple Features for Depth-Based Action Recognition
Yu Zhu, Wenbin Chen, Guodong Guo
Article No.: 18
Human action recognition is a very active research topic in computer vision and pattern recognition. Recently, it has shown a great potential for human action recognition using the three-dimensional (3D) depth data captured by the emerging RGB-D...
An Evaluation of Gamesourced Data for Human Pose Estimation
Scott Spurlock, Richard Souvenir
Article No.: 19
Gamesourcing has emerged as an approach for rapidly acquiring labeled data for learning-based, computer vision recognition algorithms. In this article, we present an approach for using RGB-D sensors to acquire annotated training data for human...
Latent Support Vector Machine Modeling for Sign Language Recognition with Kinect
Chao Sun, Tianzhu Zhang, Changsheng Xu
Article No.: 20
Vision-based sign language recognition has attracted more and more interest from researchers in the computer vision field. In this article, we propose a novel algorithm to model and recognize sign language performed in front of a Microsoft Kinect...
Hand posture recognition (HPR) is quite a challenging task, due to both the difficulty in detecting and tracking hands with normal cameras and the limitations of traditional manually selected features. In this article, we propose a two-stage HPR...
Real-Time System for Driver Fatigue Detection by RGB-D Camera
Liyan Zhang, Fan Liu, Jinhui Tang
Article No.: 22
Drowsy driving is one of the major causes of fatal traffic accidents. In this article, we propose a real-time system that utilizes RGB-D cameras to automatically detect driver fatigue and generate alerts to drivers. By introducing RGB-D cameras,...
An Approach to Ballet Dance Training through MS Kinect and Visualization in a CAVE Virtual Reality Environment
Matthew Kyan, Guoyu Sun, Haiyan Li, Ling Zhong, Paisarn Muneesawang, Nan Dong, Bruce Elder, Ling Guan
Article No.: 23
This article proposes a novel framework for the real-time capture, assessment, and visualization of ballet dance movements as performed by a student in an instructional, virtual reality (VR) setting. The acquisition of human movement data is...
Bag-of-visual-words (BOVW)-based image representation has received intense attention in recent years and has improved content-based image retrieval (CBIR) significantly. BOVW does not consider the spatial correlation between visual words in...
Pattern Matching Techniques for Replacing Missing Sections of Audio Streamed across Wireless Networks
Jonathan Doherty, Kevin Curran, Paul McKevitt
Article No.: 25
Streaming media on the Internet can be unreliable. Services such as audio-on-demand drastically increase the loads on networks; therefore, new, robust, and highly efficient coding algorithms are necessary. One method overlooked to date, which can...
An Association-Based Unified Framework for Mining Features and Opinion Words
Zhen Hai, Kuiyu Chang, Gao Cong, Christopher C. Yang
Article No.: 26
Mining features and opinion words is essential for fine-grained opinion analysis of customer reviews. It is observed that semantic dependencies naturally exist between features and opinion words, even among features or opinion words themselves. In...
A Hybrid Multigroup Coclustering Recommendation Framework Based on Information Fusion
Shanshan Huang, Jun Ma, Peizhe Cheng, Shuaiqiang Wang
Article No.: 27
Collaborative Filtering (CF) is one of the most successful algorithms in recommender systems. However, it suffers from data sparsity and scalability problems. Although many clustering techniques have been incorporated to alleviate these two...
Data Mining of Online Genealogy Datasets for Revealing Lifespan Patterns in Human Population
Michael Fire, Yuval Elovici
Article No.: 28
Online genealogy datasets contain extensive information about millions of people and their past and present family connections. This vast amount of data can help identify various patterns in the human population. In this study, we present methods...