Deep Learning in Object Recognition, Detection, and Segmentation

Xiaogang Wang

doi:10.1561/2000000071

Foundations and Trends® in Signal Processing > Vol 8 > Issue 4

Deep Learning in Object Recognition, Detection, and Segmentation

By Xiaogang Wang, The Chinese University of Hong Kong, Hong Kong, xgwang@ee.cuhk.edu.hk

Suggested Citation

Xiaogang Wang (2016), "Deep Learning in Object Recognition, Detection, and Segmentation", Foundations and Trends® in Signal Processing: Vol. 8: No. 4, pp 217-382. http://dx.doi.org/10.1561/2000000071

Publication Date: 14 Jul 2016

Subjects

Book details

ISBN: 978-1-68083-116-0

184 pp. $99.00

Buy book (pb)

ISBN: 978-1-68083-117-7

184 pp. $130.00

Buy E-book (.pdf)

Table of contents:

1. Historical overview of deep learning

2. Introduction to classical deep models

3. What makes deep learning work?

4. Deep learning in object recognition on ImageNet

5. Deep learning in face recognition

6. Deep learning in video classification

7. Deep learning in general object detection

8. Deep learning in pedestrian detection

9. Deep learning in face and human landmark detection

10. Deep learning in image segmentation

11. Face parsing and human parsing

12. Deep CNN for saliency detection

13. Discussions and future works

References

Deep Learning in Object Recognition, Detection, and Segmentation

As a major breakthrough in artificial intelligence, deep learning has achieved impressive success on solving grand challenges in many fields including speech recognition, natural language processing, computer vision, image and video processing, and multimedia. This monograph provides a historical overview of deep learning and focuses on its applications in object recognition, detection, and segmentation, which are key challenges of computer vision and have numerous applications to images and videos.

Specifically the topics covered under object recognition include image classification on ImageNet, face recognition, and video classification. In detection, the monograph covers general object detection on ImageNet, pedestrian detection, face landmark detection (face alignment), and human landmark detection (pose estimation). Finally, within segmentation, it covers the most recent progress on scene labeling, semantic segmentation, face parsing, human parsing, and saliency detection. Concrete examples of these applications explain the key points that make deep learning outperform conventional computer vision systems.

Deep Learning in Object Recognition, Detection, and Segmentation provides a comprehensive introductory overview of a topic that is having major impact on many areas of research in signal processing, computer vision, and machine learning. This is a must-read for students and researchers new to these fields.

Deep Learning in Object Recognition, Detection, and Segmentation

Free Preview:

Share

Journal details

Abstract

Book details

Deep Learning in Object Recognition, Detection, and Segmentation