At last the work on neural conditioning with delayed rewards is available online on Frontiers in Neurorobotics
http://www.frontiersin.org/Neurorobotics/10.3389/fnbot.2013.00006/abstract
Source code at : http://andrea.soltoggio.net/data/projects/icub-frontiers2013/
Support video