EP 2939187 A4 20170816 - NEURAL MODEL FOR REINFORCEMENT LEARNING

Title (en)

NEURAL MODEL FOR REINFORCEMENT LEARNING

Title (de)

NEURONALES MODELL FÜR BESTÄRKENDES LERNEN

Title (fr)

MODÈLE NEURAL D'APPRENTISSAGE PAR RENFORCEMENT

Publication

EP 2939187 A4 20170816 (EN)

Application

EP 13860582 A 20130516

Priority

US 201261732590 P 20121203
US 2013041451 W 20130516

Abstract (en)

[origin: CN104823205A] A neural model for reinforcement-learning and for action-selection includes a plurality of channels, a population of input neurons in each of the channels, a population of output neurons in each of the channels, each population of input neurons in each of the channels coupled to each population of output neurons in each of the channels, and a population of reward neurons in each of the channels. Each channel of a population of reward neurons receives input from an environmental input, and is coupled only to output neurons in a channel that the reward neuron is part of. If the environmental input for a channel is positive, the corresponding channel of a population of output neurons are rewarded and have their responses reinforced, otherwise the corresponding channel of a population of output neurons are punished and have their responses attenuated.

IPC 8 full level

G06N 3/04 (2006.01); G06N 3/063 (2006.01)

CPC (source: EP US)

G06N 3/045 (2023.01 - EP); G06N 3/049 (2013.01 - EP US); G06N 3/063 (2013.01 - EP); G06N 3/08 (2013.01 - US); G06N 20/00 (2018.12 - US)

Citation (search report)

[A] US 2009182697 A1 20090716 - MASSAQUOI STEVE G [US]
[ID] D. R. W. BARR ET AL: "Implementation of multi-layer leaky integrator networks on a cellular processor array", PROCEEDINGS OF THE 2007 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN'07), 12 August 2007 (2007-08-12), pages 1560 - 1565, XP031154826, DOI: 10.1109/IJCNN.2007.4371190
[I] T. C. STEWART ET AL: "Learning to select actions with spiking neurons in the basal ganglia", FRONTIERS IN NEUROSCIENCE, vol. 6, 2, 31 January 2012 (2012-01-31), XP055389198, DOI: 10.3389/fnins.2012.00002
[I] E. DAUCÉ: "Hebbian reinforcement learning in a modular dynamic network", PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SIMULATION OF ADAPTIVE BEHAVIOR, 13 July 2004 (2004-07-13), pages 305 - 314, XP007913889, ISBN: 978-0-262-69341-7
[A] J. IGARASHI ET AL: "Real-time simulation of a spiking neural network model of the basal ganglia circuitry using general purpose computing on graphics processing units", NEURAL NETWORKS, vol. 24, no. 9, 30 June 2011 (2011-06-30), pages 950 - 960, XP028298414, DOI: 10.1016/J.NEUNET.2011.06.008
See references of WO 2014088634A1

Designated contracting state (EPC)

AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DOCDB simple family (publication)

CN 104823205 A 20150805; CN 104823205 B 20190528; EP 2939187 A1 20151104; EP 2939187 A4 20170816

DOCDB simple family (application)

CN 201380063033 A 20130516; EP 13860582 A 20130516