Open Access   Article Go Back

A Review on Deep Learning in Robotics

himi P S1 , Shajan P X2

Section:Review Paper, Product Type: Journal Paper
Volume-06 , Issue-07 , Page no. 19-25, Sep-2018

Online published on Sep 30, 2018

Copyright © Shimi P S, Shajan P X . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Shimi P S, Shajan P X, “A Review on Deep Learning in Robotics,” International Journal of Computer Sciences and Engineering, Vol.06, Issue.07, pp.19-25, 2018.

MLA Style Citation: Shimi P S, Shajan P X "A Review on Deep Learning in Robotics." International Journal of Computer Sciences and Engineering 06.07 (2018): 19-25.

APA Style Citation: Shimi P S, Shajan P X, (2018). A Review on Deep Learning in Robotics. International Journal of Computer Sciences and Engineering, 06(07), 19-25.

BibTex Style Citation:
@article{S_2018,
author = {Shimi P S, Shajan P X},
title = {A Review on Deep Learning in Robotics},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {9 2018},
volume = {06},
Issue = {07},
month = {9},
year = {2018},
issn = {2347-2693},
pages = {19-25},
url = {https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=460},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_spl_paper_view.php?paper_id=460
TI - A Review on Deep Learning in Robotics
T2 - International Journal of Computer Sciences and Engineering
AU - Shimi P S, Shajan P X
PY - 2018
DA - 2018/09/30
PB - IJCSE, Indore, INDIA
SP - 19-25
IS - 07
VL - 06
SN - 2347-2693
ER -

           

Abstract

During the last few decades, there has been a rush in research in the area of deep learning. In this paper we have made a review on the limitations of deep learning in physical robotic systems, using currently available examples. It is mainly focused on the recent advances made in robotics community and application of deep learning in robotics.

Key-Words / Index Term

Deep neural networks; artificial intelligence; human-robot interaction

References

[1] LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436-444.
[2] Jordan MI, Mitchell TM. Machine learning: arends, perspectives, and prospects. Science. 2015;349(6245):255-260.
[3] Böhmer W, Springenberg JT, Boedecker J, et al. Autonomous learning of state representations for control: an emerging field aims to autonomously learn state representations for reinforcement learning agents from their real-world sensor observations. KI-Künstliche Intelligenz. 2015;29(4):353-362.
[4] Stigler SM. Gauss and the invention of least squares. Ann of Statistics. 1981;9(3):465-474.
[5] Haykin S. Neural networks: a comprehensive foundation. 2nd ed. Upper Saddle River, New Jersey: Prentice Hall; 2004.
[6] Bryson AE, Denham WF, Dreyfus SE. Optimal programming problems with inequality constraints. AIAA Journal. 1963;1(11):2544-2550.
[7] Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back propagating errors. Nature. 1986;323:533-536.
[8] Werbos P. Beyond regression: new tools for prediction and analysis in the behavioral sciences. [Ph.D. dissertation]. Dept. Statistics, Harvard Univ.; 1974.
[9] Cybenko G. Approximation by superpositions of a sigmoidal function. Math of Control, Signals and Sys. 1989;2(4):303-314.
[10] Hochreiter S. Untersuchungen zu dynamischen neuronalen netzen. [Master`s thesis]. Institut Fur Informatik, Technische Universitat; 1991.
[11] Hochreiter S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. of Uncertainty, Fuzziness and Knowledge-Based Syst. 1998;6(2).
[12] Miyamoto H, Kawato M, Setoyama T, & Suzuki R. Feedback-error-learning neural network for trajectory control of a robotic manipulator. Neural Networks 1998;1(3):251-265.
[13] Lewis FW, Jagannathan S, & Yesildirak A. (1998). Neural network control of robot manipulators and non-linear systems. CRC Press.
[14] Miller WT, Werbos PJ, & Sutton RS. (1995). Neural networks for control. MIT Press.
[15] Lin CT., & Lee CSG. Neural-network-based fuzzy logic control and decision system. IEEE Transactions on Computers. 1991;40(12):1320-1336.
[16] Pomerleau DA. (1989). ALVINN, an autonomous land vehicle in a neural network (No. AIP-77). Carnegie Mellon University, Computer Science Department.
[17] Oh K, Jung K. GPU implementation of neural networks. Pattern Recognition. 2004;37(6):1311-1314.
[18] Hinton GE, Osindero S, Teh Y. A fast learning algorithm for deep belief nets. Neural Computation. 2006;18(7):1527-1554.
[19] Dean J, Corrado G, Monga R, et al. Large scale distributed deep networks.Advances in Neural Information Process. Syst. 25; 2012.
[20] Tani J, Ito M, & Sugita Y. Self-organization of distributedly represented multiple behavior schemata in a mirror system: reviews of robot experiments using RNNPB. Neural Networks. 2004;17(8):1273-1289.
[21] Ijspeert AJ. Central pattern generators for locomotion control in animals and robots: a review. Neural Networks. 2008;21(4):642-653.
[22] Gashler M, Martinez T. Temporal nonlinear dimensionality reduction. Neural Networks (IJCNN), 2011 International Joint Conference on; 2011. p. 1959-1966.
[23] Pomerleau DA (2012). Neural network perception for mobile robot guidance (Vol.239). Springer Science & Business Media.
[24] Thrun S. Learning to play the game of chess. Advances in Neural Inform. Process. Syst.: Proc. of the 1994 Conf.
[25] Campbell M, Hoane AJ, Hsu F. Deep blue. Artificial Intelligence. 2002;134(1):57-83.
[26] Pinto N, Cox DD, DiCarlo JJ. Why is real-world visual object recognition hard? PLoS Computational Biology. 2008;4(1).
[27] Schmidhuber J. Deep learning in neural networks: an overview. Neural Networks. 2015;6(1)85-117.
[28] Graves A, Liwicki M, Fernández S, et al. A novel connectionist system for unconstrained handwriting recognition. Pattern Anal and Machine Intell., IEEE trans. on. 2009;31(5):855-868.
[29] Yang M, Ji S, Xu W, et al. Detecting human actions in surveillance videos. TREC Video Retrieval Evaluation Workshop; 2009.
[30] Lin M, Chen Q, Yan S. Network in network. 2013. Available: https://arxiv.org/abs/1312.4400
[31] Ciresan D, Giusti A, Gambardella LM, et al. Deep neural networks segment neuronal membranes in electron microscopy images. Advances in Neural Information Processing Sys 25; 2012.
[32] Roux L, Racoceanu D, Lomenie N, et al. Mitosis detection in breast cancer histological images an ICPR 2012 contest. J Pathol Inform. 2013;4(8).
[33] Cireşan DC, Giusti A, Gambardella LM, et al. Mitosis detection in breast cancer histology images with deep neural networks. In: K. Mori, I. Sakuma, Y. Sato, C. Barillot and N. Navab, editors. Medical Image Computing and Computer- Assisted Intervention–MICCAI 2013. Springer; 2013.
[34] Cireşan D, Meier U, Masci J, et al. A committee of neural networks for traffic sign classification. Neural Networks (IJCNN), 2011 Int. Joint Conf. on; 2011. p. 1918-1921.
[35] Ciresan D, Meier U, Schmidhuber J. Multi-column deep neural networks for image
classification. Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on; 2012. p. 3642-3649.
[36] Dunne RA, & Campbell NA. On the pairing of the softmax activation and crossentropy
penalty functions and the derivation of the softmax activation function.
In Proc. 8th Aust. Conf. on the Neural Networks, Melbourne 1997;181 (Vol.185).
[37] Wilson DR, Martinez TR. The general inefficiency of batch training for gradient descent learning. Neural Networks. 2003;16(10):1429-1451.
[38] Tieleman T, Hinton G. (2012). Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2).
[39] Kingma D, Ba J. (2014). Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980.
[40] Vincent P, Larochelle H, Lajoie I, et al. Stacked denoising dutoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learning Research. 2010;11:3371-3408.
[41] Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems 25; 2012.
[42] LeCun Y, Bengio Y. Convolutional networks for images, speech, and time series. In M. Arbib, editor. The Handbook of Brain Theory and Neural Networks. 2nd edition. Cambridge, MA: MIT Press; 2003.
[43] Werbos PJ. Backpropagation through time: what it does and how to do it. Proc. IEEE. 1990;78(10):1550-1560.
[44] Sjöberg J, Zhang Q, Ljung L, et al. Nonlinear black-box modeling in system identification: a unified overview. Automatica. 1995;31(12):1691-1724.
[45] Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation. 1997;9(8):1735-1780.
[46] Atkeson CG, Santamaria JC. A comparison of direct and model-based reinforcement learning. Robotics and Automation, IEEE Int. Conf. on; Albuquerque, NM. 1997. p. 3557-3564.
[47] Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning. Nature. 2015;518:529-533.
[48] A roadmap for US robotics: from internet to robotics, 2016 edition.
[49] World Technology Evaluation Center, Inc. International Assessment of Research and Development in Robotics. Baltimore, MD, USA; 2006.
[50] FY2009-2034 Unmanned systems integrated roadmap. Washington, DC: Department of Defence (US); 2009.
[51] Material Handling Institute. Material handling and logistics U.S. roadmap 2.0. 2017.
[52] DARPA Robotics Challenge [Internet]. [cited 2017 May 20]. Available from: http://www.darpa.mil/program/darpa-robotics-challenge
[53] Punjani AP, Abbeel P. Deep learning helicopter dynamics models. Robotics and Automation (ICRA), 2015 IEEE International Conference on; 2015. p. 3223- 3230.
[54] Neverova N, Wolf C, Taylor GW, et al. Multi-scale deep learning for gesture detection and localization. Computer Vision-ECCV 2014 Workshops; 2014. p. 474-490.
[55] Mariolis I, Peleka G, Kargakos A, et al. Pose and category recognition of highly deformable objects using deep learning. Advanced Robotics (ICAR), 2015 International Conference on; Istanbul. 2015. p. 655-662.
[56] Yang Y, Li Y, Fermüller C, et al. Robot learning manipulation action plans by watching unconstrained videos from the world wide web. 29th AAAI Conference on Artificial Intelligence (AAAI-15); Austin, TX. 2015.
[57] Levine S, Pastor P, Krizhevsky A, et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. 2016. Available: http://arxiv.org/abs/1603.02199
[58]Ouyang W, Wang X. Joint deep learning for pedestrian detection. Computer Vision, 2013 IEEE Int. Conf. on; Sidney, VIC. 2013. p. 2056-2063.
[59] Wu J, Yildirim I, Lim JJ, et al. Galileo: Perceiving physical object properties by integrating a physics engine with deep learning. Advances in Neural Information Processing Systems 28; 2015.
[60] Schmitz A, Bansho Y, Noda K, et al. Tactile object recognition using deep learning and dropout. Humanoid Robots, 2014 14th IEEE-RAS Int. Conf. on; 2014. p. 1044-1050.
[61]Polydoros AS, Nalpantidis L, Kruger V. Real-time deep learning of robotic manipulator inverse dynamics. Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on; 2015. p. 3442-3448.
[62]Jain A, Koppula HS, Soh S, et al. Brain4Cars: car that knows before you do via sensory-fusion deep learning architecture. 2016. Available: http://arxiv.org/abs/1601.00740
[63]Lenz I, Knepper R, Saxena A. Deepmpc: learning deep latent features for model predictive control. Robotics: Science and Systems XI; Rome, Italy. 2015.
[64]Zhang T, Kahn G, Levine S, et al. Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search. 2015. Available: http://arxiv.org/abs/1509.06791
[65]Pinto L, Gupta A. Supersizing self-supervision: learning to grasp from 50k tries and 700 robot hours. 2015. Available: http://arxiv.org/abs/1509.06825
[66]Kappler D, Bohg J, Schaal S. Leveraging big data for grasp planning. 2015 IEEE International Conference on Robotics and Automation (ICRA); Seattle, WA. 2015. p. 4304-4311.
[67] Pratt GA. Is a cambrian explosion coming for robotics? Journal of Economic Perspectives. 2015;29(3):51-60.
[68] Szegedy C, Zaremba W, Sutskever I, et al. Intriguing properties of neural networks. 2013. Available: http://arxiv.org/abs/1312.6199