Humphrys, Mark (1995) Towards self-organising Action Selection. [Departmental Technical Report]
Full text available as:
Postscript
222Kb |
Abstract
Systems with multiple parallel goals (e.g. autonomous mobile robots) have a problem analogous to that of action selection in ethology. Architectures such as the subsumption architecture (Brooks) involve multiple sensing-and-acting agents within a single robot on its own if allowed. Which to give control at a given moment is normally regarded as a (difficult) problem of design. In a quest for a scheme where the agents decide for themselves in a sensible manner, I introduce a model where the agents are not only autonomous but are in full competition with each other for control of the robot. Interesting robots are ones where no agent achieves total victory, but rather a serires of compromises are reached. Having the agents operate by the reinforcement learning algorithm Q-learning (Watkins) allows the introduction of an algorithm called `W-learning', by which the agents learn to focus their competitive efforts in a manner similar to agents with limited spending power in an economy. In this way, the population of agents organises its own action selection in a coherent way that supports parallelism and opportunism. In the empirical section, I show how the relative influence an agent has on its robot may be controlled by adjusting its rewards. The possibility of automated search of agent-combinations is considered.
Item Type: | Departmental Technical Report |
---|---|
Keywords: | reactive systems, action selection, autonomous mobile robots, reinforcement learning, multi-module learning |
Subjects: | Biology > Animal Behavior Biology > Ethology Computer Science > Artificial Intelligence Computer Science > Dynamical Systems Computer Science > Machine Learning Computer Science > Robotics |
ID Code: | 450 |
Deposited By: | Humphrys, Mark |
Deposited On: | 09 Jun 1998 |
Last Modified: | 11 Mar 2011 08:53 |
Metadata
- ASCII Citation
- Atom
- BibTeX
- Dublin Core
- EP3 XML
- EPrints Application Profile (experimental)
- EndNote
- HTML Citation
- ID Plus Text Citation
- JSON
- METS
- MODS
- MPEG-21 DIDL
- OpenURL ContextObject
- OpenURL ContextObject in Span
- RDF+N-Triples
- RDF+N3
- RDF+XML
- Refer
- Reference Manager
- Search Data Dump
- Simple Metadata
- YAML
Repository Staff Only: item control page