Recent Advances in Electrical & Electronic Engineering

ISSN (Print): 2352-0965
ISSN (Online): 2352-0973

General Research Article

Multi-collaborative Regression Convolutional Neural Network for Vehicle Angle Detection

Author(s): Guoqiang Chen*, Mengchao Liu, Hongpeng Zhou and Bingxin Bai

Volume 13, Issue 8, 2020

Page: [1227 - 1241] Pages: 15

DOI: 10.2174/2352096513999200628100357

Abstract

Background: Vehicle pose detection plays an important role in monitoring vehicle behavior and parking situations. Real-time, high-accuracy detection of vehicle pose is therefore of great importance.

Objective: The goal of this work is to construct a new network that detects the vehicle angle based on the regression Convolutional Neural Network (CNN). The main contribution is the combination of several traditional regression CNNs into the Multi-Collaborative Regression CNN (MCR-CNN), which greatly enhances vehicle angle detection precision and eliminates abnormal detection errors.

Methods: Two challenges are identified in using the traditional regression CNN to detect the vehicle pose angle. The first is detection failure resulting from the conversion of the periodic angle to a linear angle; the second is a large detection error when the training sample value is very small. An MCR-CNN is proposed to solve the first challenge, and a two-stage method is proposed to solve the second. The architecture of the MCR-CNN is designed in detail. After the training and testing data sets are constructed, the MCR-CNN is trained and tested for vehicle angle detection.
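
The first challenge can be illustrated with a short Python sketch. This is not the MCR-CNN or any part of the authors' implementation; the function names `linear_error` and `circular_error` and the sample angles are assumptions chosen purely for illustration. The sketch shows why treating a periodic angle as a linear regression target yields a very large apparent error near the 0°/360° boundary, even when the two directions are almost identical.

```python
def linear_error(pred_deg, true_deg):
    """Absolute difference when the angle is treated as a plain linear value."""
    return abs(pred_deg - true_deg)


def circular_error(pred_deg, true_deg):
    """Smallest absolute angular difference on a 360-degree circle."""
    diff = (pred_deg - true_deg) % 360.0
    return min(diff, 360.0 - diff)


# Hypothetical example: the true angle is 359 degrees, the prediction is 1 degree.
print(linear_error(1.0, 359.0))    # 358.0
print(circular_error(1.0, 359.0))  # 2.0
```

A regression target that penalizes the 358° linear difference pushes the network away from a prediction that is only 2° off, which is one way to picture the failure mode described above.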

Results: Experimental results show that, with the proposed MCR-CNN, testing samples with an error below 4° account for 95% of all testing samples. The MCR-CNN has significant advantages over the traditional vehicle pose detection method.
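
The reported metric, the share of testing samples whose error falls below a threshold, can be computed as in the following minimal sketch. The function name `fraction_within` and the error values are hypothetical and are not data from the paper; only the 4° threshold comes from the abstract.

```python
def fraction_within(errors_deg, threshold_deg=4.0):
    """Share of samples whose absolute angle error is below the threshold."""
    if not errors_deg:
        raise ValueError("errors_deg must not be empty")
    hits = sum(1 for e in errors_deg if abs(e) < threshold_deg)
    return hits / len(errors_deg)


# Made-up errors in degrees, used only to exercise the function.
sample_errors = [0.5, 1.2, 3.9, 4.0, 7.5]
print(fraction_within(sample_errors))  # 0.6
```

Note that the comparison is strict, so an error of exactly 4° does not count as "below 4°".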

Conclusion: The proposed MCR-CNN not only detects the vehicle angle in real time but also achieves very high detection accuracy and robustness. The proposed approach can be used for autonomous vehicles and parking lot monitoring.

Keywords: Regression convolutional neural network, vehicle angle detection, vehicle pose detection, multi-collaborative regression convolutional neural network, parking lot monitoring, autonomous vehicle.
