--- abstract: "Systems with multiple parallel goals (e.g. autonomous mobile robots) have a problem analogous to that of action selection in ethology. Architectures such as the subsumption architecture (Brooks) involve multiple sensing-and-acting agents within a single robot on its own if allowed. Which to give control at a given moment is normally regarded as a (difficult) problem of design. In a quest for a scheme where the agents decide for themselves in a sensible manner, I introduce a model where the agents are not only autonomous but are in full competition with each other for control of the robot. Interesting robots are ones where no agent achieves total victory, but rather a serires of compromises are reached. Having the agents operate by the reinforcement learning algorithm Q-learning (Watkins) allows the introduction of an algorithm called `W-learning', by which the agents learn to focus their competitive efforts in a manner similar to agents with limited spending power in an economy. In this way, the population of agents organises its own action selection in a coherent way that supports parallelism and opportunism. In the empirical section, I show how the relative influence an agent has on its robot may be controlled by adjusting its rewards. The possibility of automated search of agent-combinations is considered." altloc: - http://www.ed.ac.uk/~humphrys/Publications/e.planning.scheduling.ps.gz - or - ftp://ftp.essex.ac.uk/pub/csc/technical-reports/CSM-255.ps.Z chapter: ~ commentary: ~ commref: ~ confdates: ~ conference: ~ confloc: ~ contact_email: ~ creators_id: [] creators_name: - family: Humphrys given: Mark honourific: '' lineage: '' date: 1995 date_type: published datestamp: 1998-06-09 department: Department of Computer Science dir: disk0/00/00/04/50 edit_lock_since: ~ edit_lock_until: ~ edit_lock_user: ~ editors_id: [] editors_name: [] eprint_status: archive eprintid: 450 fileinfo: /style/images/fileicons/application_postscript.png;/450/2/e.planning.scheduling.ps full_text_status: public importid: ~ institution: University of Essex isbn: ~ ispublished: pub issn: ~ item_issues_comment: [] item_issues_count: 0 item_issues_description: [] item_issues_id: [] item_issues_reported_by: [] item_issues_resolved_by: [] item_issues_status: [] item_issues_timestamp: [] item_issues_type: [] keywords: 'reactive systems, action selection, autonomous mobile robots, reinforcement learning, multi-module learning' lastmod: 2011-03-11 08:53:57 latitude: ~ longitude: ~ metadata_visibility: show note: ~ number: ~ pagerange: ~ pubdom: FALSE publication: ~ publisher: ~ refereed: FALSE referencetext: ~ relation_type: [] relation_uri: [] reportno: technical report no. 255 rev_number: 10 series: ~ source: ~ status_changed: 2007-09-12 16:28:07 subjects: - bio-ani-behav - bio-etho - comp-sci-art-intel - comp-sci-mach-dynam-sys - comp-sci-mach-learn - comp-sci-robot succeeds: ~ suggestions: ~ sword_depositor: ~ sword_slug: ~ thesistype: ~ title: Towards self-organising Action Selection type: techreport userid: 69 volume: ~