This work presents a novel learning method in the context of embodied artificial intelligence and self-organization that makes as few assumptions as possible about the world and the underlying model. The learning rule is derived from the principle of maximizing the predictive information in the sensorimotor loop. It is evaluated on robot chains of varying length whose segments are individually controlled and do not communicate with one another. A comparison of the results shows that maximizing the predictive information per wheel leads to more strongly coordinated behavior of the physically connected robots than maximizing it per robot. A second focus of this article is the effect of chain length on the overall behavior of the robots. We show that longer chains with less capable controllers outperform shorter chains with more complex controllers. The reason for this is identified and discussed in terms of the information-geometric interpretation of the learning process.
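
To make the quantity being maximized concrete, the following minimal sketch estimates predictive information from a recorded sequence of discretized sensor states. It assumes the common one-step form, the mutual information between consecutive states, I(S_t; S_{t+1}); the abstract does not spell out the exact definition, and the function name `predictive_information` is hypothetical. This is a plug-in estimator for illustration only, not the learning rule derived in the article.

```python
import math
from collections import Counter


def predictive_information(states):
    """Plug-in estimate (in bits) of I(S_t; S_{t+1}) from a symbol sequence.

    Assumption: predictive information is taken as the one-step mutual
    information between consecutive discretized sensor states.
    """
    pairs = list(zip(states[:-1], states[1:]))
    n = len(pairs)
    if n == 0:
        return 0.0

    joint = Counter(pairs)                    # counts of (s_t, s_{t+1})
    past = Counter(s for s, _ in pairs)       # counts of s_t
    future = Counter(s for _, s in pairs)     # counts of s_{t+1}

    info = 0.0
    for (s, s_next), count in joint.items():
        p_joint = count / n
        # p(s, s') / (p(s) p(s')) written with raw counts
        ratio = count * n / (past[s] * future[s_next])
        info += p_joint * math.log2(ratio)
    return info


if __name__ == "__main__":
    constant = [0] * 100          # fully predictable but carries no information
    alternating = [0, 1] * 50     # next state is determined by the current one
    print(predictive_information(constant))
    print(predictive_information(alternating))
```

A constant sequence yields zero, because there is nothing to predict, while a strictly alternating binary sequence yields close to one bit. This illustrates why maximizing the quantity favors behavior that is both varied and predictable.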
