- Princeton NJ, US Quoc-Huy Tran - Santa Clara CA, US Xiang Yu - Mountain View CA, US Manmohan Chandraker - Santa Clara CA, US Chi Li - San Jose CA, US
International Classification:
G06K 9/00 G06T 15/10 G06N 3/02
Abstract:
An action recognition system and method are provided. The action recognition system includes an image capture device configured to capture an actual image depicting an object. The action recognition system includes a processor configured to render, based on a set of 3D CAD models, synthetic images with corresponding intermediate shape concept labels. The processor is configured to form a multi-layer CNN which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN thereto to generate an image pair including a 2D and 3D geometric structure of the object. The processor is configured to control a device to perform a response action in response to an identification of an action performed by the object, wherein the identification of the action is based on the image pair.
Landmark Localization On Objects In Images Using Convolutional Neural Networks
- Princeton NJ, US Quoc-Huy Tran - Santa Clara CA, US Xiang Yu - Mountain View CA, US Manmohan Chandraker - Santa Clara CA, US Chi Li - San Jose CA, US
A system and method are provided. The system includes an image capture device configured to capture an actual image depicting an object. The system also includes a processor. The processor is configured to render, based on a set of 3D Computer Aided Design (CAD) models, a set of synthetic images with corresponding intermediate shape concept labels. The processor is also configured to form a multi-layer Convolutional Neural Network (CNN) which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is further configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN to the actual image to output an image pair including a 2D geometric structure and a 3D geometric structure of the object depicted in the actual image.
Surveillance System With Landmark Localization On Objects In Images Using Convolutional Neural Networks
- Princeton NJ, US Quoc-Huy Tran - Santa Clara CA, US Xiang Yu - Mountain View CA, US Manmohan Chandraker - Santa Clara CA, US Chi Li - San Jose CA, US
International Classification:
G06T 7/73 G06N 3/08
Abstract:
A surveillance system and method are provided. The surveillance system includes an image capture device configured to capture an actual image of a target area depicting an object. The surveillance system further includes a processor. The processor is configured to render, based on a set of 3D Computer Aided Design (CAD) models, synthetic images with intermediate shape corresponding concept labels. The processor is further configured to form a multi-layer Convolutional Neural Network (CNN) which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is also configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN to the actual image to generate an image pair including a 2D and 3D geometric structure of the object depicted in the actual image. The surveillance system further includes a display device configured to display the image pair.
Advanced Driver-Assistance System With Landmark Localization On Objects In Images Using Convolutional Neural Networks
- Princeton NJ, US Quoc-Huy Tran - Santa Clara CA, US Xiang Yu - Mountain View CA, US Manmohan Chandraker - Santa Clara CA, US Chi Li - San Jose CA, US
International Classification:
G08G 1/16 G06K 9/00 G05D 1/02
Abstract:
A system and method are provided for driving assistance. The system includes an image capture device configured to capture an actual image relative to an outward view from a motor vehicle and depicting an object. The system further includes a processor configured to render, based on a set of 3D CAD models, synthetic images with corresponding intermediate shape concept labels. The processor is further configured to form a multi-layer CNN which jointly models multiple intermediate shape concepts, based on the rendered synthetic images. The processor is also configured to perform an intra-class appearance variation-aware and occlusion-aware 3D object parsing on the actual image by applying the CNN to the actual image to output an image pair including a 2D and 3D geometric structure of the object. The processor is additionally configured to perform an action to mitigate a likelihood of harm involving the motor vehicle, based on the image pair.
350 W Julian St, San Jose, CA 95110 1735 Technology Dr St, San Jose, CA 95110 4084371400
Chi C. Li M
Glocal LLC
6775 Casa Linda Dr, Las Vegas, NV 89103 7128 Atherton Cv, Memphis, TN 38119 3613 Meadowlark St, El Monte, CA 91732 6516 Sterling Spg Pkwy, Las Vegas, NV 89108
Apple Apr 2018 - Sep 2019
Senior Machine Learning and Computer Vision Engineer
Johns Hopkins Whiting School of Engineering Sep 2012 - Apr 2018
Research Fellow
Johns Hopkins Whiting School of Engineering Jun 2014 - Apr 2018
Senior Research Fellow
Nec Laboratories America, Inc. May 2016 - Sep 2016
Research Intern
Microsoft May 2013 - Sep 2013
Research Intern
Education:
The Johns Hopkins University 2010 - 2018
Doctorates, Doctor of Philosophy, Computer Science, Philosophy
The Johns Hopkins University 2012 - 2014
Master of Science, Masters, Computer Science
Xiamen University 2008 - 2012
Bachelor of Engineering, Bachelors, Cognitive Science, Engineering
Xiamen University 2008
Xiamen Foreign Language School
Skills:
Matlab Machine Learning Python C++ C Statistics Research Artificial Intelligence Mathematical Modeling Deep Learning Latex Algorithms Image Processing Java Pattern Recognition Ros Point Cloud Library Opencv Signal Processing Caffe Tensorflow Torch Blender