Stefan Bosse - Distributed Agent-Based Computing in Material-Embedded Sensor Network Systems With the Agent-on-Chip Architecture

IEEE SENSORS JOURNAL, VOL. 14, NO. 7, JULY 2014 2159

Distributed Agent-Based Computing in

Material-Embedded Sensor Network Systems

With the Agent-on-Chip Architecture

Stefan Bosse

Abstract—Distributed material-embedded systems like sensor

networks integrated in sensorial materials require new data

processing and communication architectures. Reliability and

robustness of the entire heterogeneous environment in the pres-

ence of node, sensor, link, data processing, and communication

failures must be offered, especially concerning limited service of

material-embedded systems after manufacturing. In this paper,

multiagent systems with state-based mobile agents are used for

computing in unreliable mesh-like networks of nodes, usually

consisting of a single microchip, introducing a novel design

approach for reliable distributed and parallel data processing

on embedded systems with static resources. An advanced high-

level synthesis approach is used to map the agent behavior to

multiagent systems implementable entirely on microchip-level

supporting agent-on-chip (AoC) processing architectures. The

agent behavior, interaction, and mobility are fully integrated

on the microchip using a reconﬁgurable pipelined communicat-

ing process architecture implemented with ﬁnite-state machines

and register-transfer logic. The agent processing architecture is

related to Petri Net token processing. A reconﬁguration mech-

anism of the agent processing system achieves some degree of

agent adaptation and algorithmic selection. The agent behavior,

interaction, and mobility features are modeled and speciﬁed with

an activity-based agent behavior programming language. Agent

interaction and communication is provided by a simple tuple-

space database implemented on node level and signals providing

remote inter-node level communication and interaction.

Index Terms— Mobile agents, intelligent agents, parallel

processing, distributed information systems.

I. INTRODUCTION AND OVERVIEW

EMBEDDED systems required for sensorial perception

and structural monitoring (perceptive networks), used,

for example in Cyber-Physical-Systems (CPS) and Structural

Health Monitoring (SHM) [6], perform the monitoring and

control of complex physical processes using applications

running on dedicated execution platforms in a resource-

constrained manner and with real-time processing constraints.

Trends emerging recently in engineering and micro-system

applications such as the development of sensorial materials

[15] show a growing demand for autonomous networks of

Manuscript received September 2, 2013; revised December 31, 2013;

accepted January 6, 2014. Date of publication January 22, 2014; date of

current version May 22, 2014. The associate editor coordinating the review

of this paper and approving it for publication was Dr. Dirk Lehmhus.

The author is with the Department of Computer Science, Working Group

Robotics, ISIS Sensorial Materials Scientiﬁc Centre, University of Bremen,

Bremen 28359, Germany (e-mail: sbosse@uni-bremen.de).

Color versions of one or more of the ﬁgures in this paper are available

online at http://ieeexplore.ieee.org.

Digital Object Identiﬁer 10.1109/JSEN.2014.2301938

miniaturized smart sensors and actuators embedded in tech-

nical structures [6] (see Fig. 1). To reduce the impact of

such embedded sensorial systems on mechanical structure

properties, single microchip sensor nodes (in the mm range) are

preferred. Real-time constraints require parallel data process-

ing usually not provided by microcontrollers. Hence with

increasing miniaturization and node density, new decentralized

network and data processing architectures are required. Multi-

agent systems (MAS) can be used for a decentralized and

self-organizing approach of data processing in a distributed

system like a sensor network [2], enabling the mapping

of distributed raw sensor data to condensed information,

for example based on pattern recognition [5]. In [2], the

agent-based architecture considers sensors as devices used

by an upper layer of controller agents. Agents are organized

according to roles related to the different aspects to integrate,

mainly sensor management, communication and data process-

ing. This organization isolates largely and decouples the data

management from the changing network, while encouraging

reuse of solutions. Multi-agent system-based structural health

monitoring technologies are used to deal with high-density

and different kinds of sensors in reliable monitoring of large

scale engineering structures [5]. In [18] and [19], agents are

deployed for distributed sensing and power management in

wireless sensor networks, but still using embedded system

nodes not suitable for material integration.

Material-embedded data processing systems usually consist

of single microchip nodes connected either wired in mesh-

like networks [6] or wireless using ad-hoc networks [8] with

limited energy supply and processing resources. But tradi-

tionally, mobile agents are processed on generic program-

controlled computer architectures using virtual machines

[7], [8], [18], [19], which usually cannot easily be reduced

to single microchip level like they are required in sensor-

ial materials. Furthermore, agents are treated with abstract

heavy-weighted knowledge-based models, not entirely match-

ing distributed data processing in sensor networks. In [3], a

multi-agent system is used for advanced image processing

making proﬁt from the inherent parallel execution model of

agents.

Application speciﬁc digital logic hardware design has

advantages compared to program controlled microcontroller

approaches concerning power consumption, performance,

and chip resources by exploiting parallel data processing

(covered by the agent model) with lower clock frequencies

and enhanced performance [10].



Fig. 1. Sensorial materials embedded in robots providing perception information of external applied load forces or internal structure load.

There are actually four major issues related to the scaling of

traditional software-based multi-agents systems to microchip

level and their design:

1) limited static processing, storage, and communication

resources, real-time processing,

2) unreliable communication,

3) suitable simpliﬁed programming models and processing

architectures offering hardware designs with ﬁnite state

machines (FSM) and resource sharing for parallel agent

execution, and

4) a generic high-level synthesis design approach.

Microchip level implementations of multi-agent systems were

originally proposed for low level tasks, for example in

[12] using agents to negotiate network resources and for high-

level tasks using agents to support human beings in ambient-

intelligence environments [14]. The ﬁrst work implements the

agent behavior directly in hardware; the second still uses a

(conﬁgurable) microcontroller approach with optimized paral-

lel computational blocks providing instruction set extension.

A more general and reconﬁgurable implementation of agents

on microchip level is reported in [1], providing a closed-

loop design ﬂow especially focussing on communication and

interaction, though still assuming and applying to program

controlled data processing machines and architectures. Hard-

ware implementations of multi-agent systems are still limited

to single or a few and non-mobile agents ([1], [20]).

In this work, an advanced high-level synthesis approach is

introduced to map the agent behavior of multi-agent systems

on microchip-level with an Agent-On-Chip processing archi-

tecture (AoC). The agent behavior, interaction, and mobility

are fully integrated on the microchip using a reconﬁgurable

pipelined communicating process architecture implemented

with ﬁnite-state machines and register-transfer logic. This

architecture supports parallel agent execution with a resource

shared pipeline approach. In this approach, the agent process-

ing is comparable to Petri Net token processing. A reconﬁgura-

tion mechanism of the agent processing system achieves some

kind of agent adaptation and algorithmic selection based on

environmental changes like partial hardware or inter-connect

failures or based on learning and improved knowledge base.

The agent behavior, interaction, and mobility features

are modelled and speciﬁed with an activity-based agent

behavior programming language (AAPL). The activity-

graph based agent model is attractive due to the

proximity to the ﬁnite-state machine model, which simpliﬁes

the hardware implementation. With this AAPL a high-level

agent compiler is able to synthesize a hardware model on RT

level, a software model (C, ML), or a simulation model (XML) suitable to

simulate a multi-agent system using the SeSAm simulator

framework [9]. Agent interaction and communication are

provided ﬁrstly by a simple tuple-space database implemented

on each node providing access and sharing of local data,

and secondly by signals able to propagate in the network

(like messages) preferred for fast and light-weighted remote

inter-node level communication and interaction. To enable

dynamic adaptation of the agent behavior at run-time, the

agent processing architecture implementing the agent behavior

can be (re)conﬁgured by agents by modifying the transitional

network.

Traditionally agent programs are interpreted, leading to a

signiﬁcant decrease in performance. In the approach presented

here the agent processing platform can directly be imple-

mented in standalone hardware nodes without intermediate

processing levels and without the necessity of an operating

system, but still enabling software implementations that can

be embedded in applications.

There is related work concerning agent programming

languages and processing architectures, like APRIL [13] providing

tuple-space like agent communication, and widely used FIPA,

ACL, and KQML [11] focusing on high-level knowledge

representation and exchange. All those approaches represent

communication and information on complex and abstract

level not fully suited for the synthesis of low-resource data

processing systems in distributed loosely coupled networks,

especially in sensor networks, in contrast to the proposed

AAPL approach, simple enough to enable hardware design

synthesis, but powerful enough to model the agent behavior

of complex distributed systems, which is demonstrated in the

following case study. Though the imperative programming

model is quite simple and closer to a traditional PL it can be

used as a common source and intermediate representation for

different agent processing platform implementations: hardware

(HW), software (SW), and simulation (SIM).

BOSSE: DISTRIBUTED AGENT-BASED COMPUTING 2161

Fig. 2. Prototype of a Sensorial Material using intelligent sensor networks.

What is novel compared to other approaches?

• Reliability and reactivity provided by the autonomy of

mobile state-based agents and reconﬁguration.

• Agent mobility and interaction by using tuple-space

databases and global signal propagation aid solving data

distribution and synchronization issues in distributed sys-

tems design, and tuple spaces represent agent belief.

• One common agent programming language and process-

ing architecture enables the synthesis of standalone

parallel hardware implementations, alternatively stand-

alone software implementations, and behavioral simula-

tion models, enabling the design and test of large-scale

heterogeneous systems.

• AAPL provides powerful statements for computation and

agent control with static resources.

• A token-based pipelined multi-process agent process-

ing architecture suitable for hardware platforms with

putational resources and speed.

• Improved scaling in large network applications compared

with full or semi centralized and pure message based

processing architectures.

II. FIELDS OF APPLICATION: SENSORIAL MATERIALS

Sensorial Materials are equipped with material-embedded

highly miniaturized distributed sensor networks performing load

monitoring or environmental perception [15], shown in prin-

ciple in Fig. 1. These embedded sensor networks consist

of nodes equipped with sensor signal electronics and digital

logic performing computation and communication. Optionally

there are material-embedded energy sources (energy harvester)

supplying the nodes locally.

Fig. 2 shows a prototype of a Sensorial Material using

intelligent sensor networks. Each autonomous network node is

connected with up to four neighbors and strain-gauge sensors

mounted on a rubber sheet, altogether equipped with nine

bi-axial strain-gauge sensors placed at a distance of 70 mm.

The Sensorial Material was used to retrieve load information

about the sheet (applied by external forces) by using advanced

machine learning methods from a small set of uncalibrated

sensors with unknown electro-mechanical model still provid-

ing high spatial resolution compared with the distance of the

sensors to each other.

Fig. 3 shows the usage of such material in an intersection

element of a robot arm manipulator providing external envi-

ronmental perception required for robot control. The proposed

robot manipulator [16] consists of actors (joint drives) and

intersection elements with integrated smart sensor networks.

Distributed data processing is provided by mobile agents.

The agent behavior is implemented with SoC designs on

hardware-level. The intersection element connects two joint

actors with a rigid double-pipe construction, which is sur-

rounded by two opposite placed load sensitive skins (bent

rubber plate), equipped each with four strain-gauge sensor

pairs (bi-axially aligned). Each sensor pair is connected to a

sensor node providing parallel data processing, agent behavior

implementation, and communication/networking. All sensor

nodes are arranged in a mesh-like network connected with

serial point-to-point links. Communication is established by a

smart and robust routing protocol.

III. STATE-BASED AGENTS AND THE AGENT PROGRAMMING LANGUAGE AAPL

Initially, a sensor network is a collection of independent

computing nodes. Interaction between nodes is required to

manage and distribute data and to compute information.

One common interaction model is the state-based mobile

agent.

The behavior of a state-based agent can be easily mod-

elled with a ﬁnite-state machine completely implementable

with register-transfer logic (RTL) on microchip level eas-

ing high-level synthesis and the exploitation of concurrency


Fig. 3. Robot arm manipulator intersection element equipped with smart sensor networks providing perception information of external applied load forces

based on preliminary work with ﬂat rubber plate.

Fig. 4. Agent behavior programming level with activities and transitions (AAPL programming level, left); agent class model organizing activities and

transitions in graphs (middle); agent instantiation, processing, and interaction on the network node level (right).

required for material-embedded real-time data processing. The

implementation of mobile multi-agent systems for resource

constrained embedded systems with a focus on microchip

level is a complex design challenge. High-level agent pro-

gramming languages can aid to solve this design issue.

Though there are already several agent modelling, interaction,

and communication languages, they are not fully suitable

to carry out multi-agent systems on microchip level. For

this purpose, the Agent Programming Language AAPL was

designed to enable the optimized design of state-based agents

and microchip scaled processing dealing with limited static

resources. This language consists of generic imperative and

computational statements and a type system derived from a

subset of the Modula-3 language, allowing subrange types

required for hardware synthesis, and agent speciﬁc statements

to specify the behavior, mobility, and interaction of agents.

This technology-independent programming model can be

directly synthesized to hardware, alternatively to software and

simulation model targets without modiﬁcation.

The agent behavior is partitioned and modelled with an

activity graph, with activities (representing the control state

of the agent) and conditional transitions enabling activities.

Activities provide sequential execution of procedural

data processing statements. An activity is activated either by

a conditional transition, which depends on the evaluation of

agent data, or by an unconditional

transition. An agent belongs to a specific parameterized

agent class AC, specifying local agent data (only visible for


the agent itself), types, signals, activities, signal handlers, and

transitions, shown in principle in Fig. 4.

New agents of a speciﬁc class can be created at runtime by

agents using the new AC(v1,v2,..) statement returning

a node unique agent identiﬁer. An agent can create multiple

living copies of itself with a fork mechanism, creating child

agents of the same class with inherited data and control state

but with different parameter initialization, done by using the

fork(v1,v2,..) statement. Agents can be destroyed by

using the kill(ID) statement.
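The creation, fork, and kill semantics described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the AAPL runtime: the names `Node` and `Agent`, the dictionary holding the private agent data, and the per-node identifier counter are introduced here for illustration only.

```python
import copy
import itertools
from dataclasses import dataclass, field


@dataclass
class Agent:
    ident: int                                  # node-unique agent identifier
    agent_class: str
    params: tuple                               # parameter initialization
    data: dict = field(default_factory=dict)    # private agent data
    activity: str = "init"                      # control state (current activity)


class Node:
    """Hypothetical node-level agent table supporting new, fork, and kill."""

    def __init__(self):
        self._next_id = itertools.count(1)
        self.agents = {}

    def new(self, agent_class, *params):
        agent = Agent(next(self._next_id), agent_class, params)
        self.agents[agent.ident] = agent
        return agent.ident

    def fork(self, parent_id, *params):
        # The child inherits class, data, and control state of the parent,
        # but receives its own parameters and its own identifier.
        parent = self.agents[parent_id]
        child = Agent(next(self._next_id), parent.agent_class, params,
                      copy.deepcopy(parent.data), parent.activity)
        self.agents[child.ident] = child
        return child.ident

    def kill(self, ident):
        self.agents.pop(ident, None)
```

The deep copy reflects that agent data is private: after a fork, parent and child no longer share state.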

Statements inside an activity are processed sequentially and

consist of data assignments (x := expression) operating on the agent’s

private data, control ﬂow statements (conditional branches and

loops), and special agent control and interaction statements,

summarized in Def. 1.

Agent interaction and synchronization is provided by a

tuple-space database server available on each node. An agent

can store an n-dimensional data tuple (v1,v2,..) in the database

by using the out(v1,v2,..) statement (commonly the first

value is treated like a key). A data tuple can be removed or

read from the database by using the in(v1,p2?,v3,..)

or rd(v1,p2?,v3,..) statements with a pattern template

based on a set of formal (variable,?) and actual (constant)

parameters. These operations block the agent processing until a

matching tuple has been found or stored in the database. These simple

operations solve the mutual exclusion problem in concurrent

systems easily. Only agents processed on the same network

node can exchange data this way.

The existence of a tuple can be checked by using the

exist? function or with atomic test-and-read behavior using

the try_in/try_rd functions. A tuple with a limited lifetime

(a marking) can be stored in the database by using the

mark statement. Tuples with exhausted lifetime are removed

automatically by a garbage collector. Tuples matching a

speciﬁc pattern can be removed with the rm statement.
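The tuple-space operations can be illustrated with a small Python sketch. It is a simplified model under stated assumptions: only non-blocking variants are shown (the blocking `in`/`rd` and the timeout argument of `try_in`/`try_rd` are omitted), the `ANY` wildcard stands in for a formal parameter `?`, and the class and method names are chosen here for illustration.

```python
import time

ANY = object()  # wildcard standing in for a formal parameter ("?") in a pattern


class TupleSpace:
    """Node-local tuple store with pattern matching (non-blocking access only)."""

    def __init__(self, clock=time.monotonic):
        self._tuples = []      # list of (tuple, expiry time or None)
        self._clock = clock

    def _collect_garbage(self):
        # Drop markings whose lifetime is exhausted.
        now = self._clock()
        self._tuples = [(t, exp) for t, exp in self._tuples
                        if exp is None or exp > now]

    @staticmethod
    def _matches(pattern, tup):
        return len(pattern) == len(tup) and all(
            p is ANY or p == v for p, v in zip(pattern, tup))

    def _find(self, pattern):
        self._collect_garbage()
        for index, (tup, _) in enumerate(self._tuples):
            if self._matches(pattern, tup):
                return index
        return None

    def out(self, *values):
        self._tuples.append((tuple(values), None))

    def mark(self, lifetime, *values):
        self._tuples.append((tuple(values), self._clock() + lifetime))

    def exists(self, *pattern):
        return self._find(pattern) is not None

    def try_rd(self, *pattern):
        index = self._find(pattern)
        return None if index is None else self._tuples[index][0]

    def try_in(self, *pattern):
        # Test and remove in one step; returns None if nothing matches.
        index = self._find(pattern)
        return None if index is None else self._tuples.pop(index)[0]

    def rm(self, *pattern):
        self._collect_garbage()
        self._tuples = [(t, e) for t, e in self._tuples
                        if not self._matches(pattern, t)]
```

For instance, after `ts.out("SENSOR", 3, 120)`, the call `ts.try_rd("SENSOR", ANY, ANY)` returns the stored tuple and leaves it in place, whereas `ts.try_in` with the same pattern removes it.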

Remote light-weighted interaction between agents is pro-

vided by signals with optional parameters, implementing a

remote-procedure call interface.

A signal can be raised by an agent using the

send(ID,S,V) statement specifying the ID of the tar-

get agent (which must be created by the sending agent),

the signal name S, and an optional argument value V

propagated with the signal. The receiving agent must pro-

vide a signal handler (like an activity) to handle signals

(asynchronously). Alternatively, a signal can be sent to a

group of agents belonging to the same class AC within a

bounded region using the broadcast(AC,DX,DY,S,V)

statement.

Migration of agents to a neighbor node (by preserving the

local data and processing state) is performed by an agent using

the moveto(DIR) statement, assuming the arrangement of

network nodes in a mesh- or cube-like network. To test if

a neighbor node is reachable (testing connection liveliness),

the link?(DIR) statement returning a boolean result can

be used.

Within activities, agents can change the transitional network

(initially specified in the transition section) by changing,

deleting, or adding (conditional) transitions using the

transition♦(S1,S2,cond) statements (with ♦=‘+’:

add, ‘-’: remove, and ‘*’: change transition).

Def. 1: Summary of the AAPL Language (.. x .. means x is part of an
expression, and ; terminates procedural statements)

Agent Class Definition
    agent class (arguments) = definitions end;

Activity Definition
    activity name = statements end;

Data Statements
    var x,y,z:type;
    x := expression(variable,value,constant);

Conditional Statements
    if cond then statements else statements end;
    case expression of | v1 -> statements | .. end;

Loop Statements
    for i := range do statements end;
    while cond do statements end;

Transition Network Definition
    transitions = transitions end;
    a1 -> a2: cond;

Tuple Database Statements
    out(v1,v2,..); .. exist?(v1,?,..) ..
    in(v1,x1?,v2,x2?,...); rd(v1,x1?,v2,x2?,...);
    try_in(timeout,v1,..); try_rd(timeout,v1,..);
    mark(timeout,v1,v2,..); rm(v1,?,..);

Signals
    signal S:datatype;
    handler S(x) = statements end;
    send(ID,S,v); reply(S,v);
    broadcast(AC,DX,DY,S,v);
    timer+(timeout,S); timer-(S); sleep; wakeup;

Exceptions
    exception E; raise E;
    try statements except E -> statements end;

Mobility, Creation, and Reproduction
    moveto(direction);
    .. link?(direction) ..
    id := new class (arguments);
    id := fork(arguments);
    kill(id);

Reconfiguration
    transition+(a1,a2,cond); transition*(a1,a2,cond);
    transition-(a1,a2);

The usage of the programming language is illustrated in

more detail in the following case study.

IV. AGENT-ON-CHIP: THE AGENT PROCESSING ARCHITECTURE AND SYNTHESIS

The agent processing architecture required at each network

node must implement different agent classes and must be

scalable to the microchip level to enable material-integrated

embedded system design, and represents a central design issue

for the new Agent-on-Chip data processing approach, further

focussing on parallel agent processing and optimized resource

sharing.

A. Activity Processing

In this work the agent behavior is implemented with a recon-

ﬁgurable pipelined communicating process model derived


Fig. 5. Mapping of the agent behavior programming level to the agent processing architecture with pipelined communicating sequential processes and the

ﬁnal mapping on RT level. Agent tokens are passed by queues and conditional selectors from an outgoing to an incoming activity process.

from the Communicating Sequential Process model (CSP)

proposed by Hoare (1985). The set of activities $\{A_i\}$ is mapped

on a set of sequential processes $\{P_i\}$ executed concurrently.

The set of transitions $\{T_i\}$ is mapped on a set of synchronous

queues $\{Q_i\}$ and transition selectors $\{S_i\}$ providing

inter-activity-process communication, shown in Fig. 5. Agents are

represented by tokens (natural numbers equal to the agent

identiﬁer, unique on each node), which are transferred by the

queues between activity processes depending on the speciﬁed

transition conditions. This multi-process model is directly

mappable to RTL hardware and software implementations.

Each process $P_i$ is mapped to a finite state machine FSM

controlling process execution and a register-transfer data path.

Local agent data is stored in a region of a memory mod-

ule assigned to each individual agent. There is only one

incoming transition queue for each process consuming tokens,

performing processing, and ﬁnally passing tokens to outgoing

queues, which can depend on conditional expressions. There

are computational and IO/event based activity statements. The

latter ones can block the agent processing until an event occurs

(for example, the availability of a data tuple in the database).

Blocking statements $\{s_{j,i}\}$ of an activity $A_i$ are assigned to

separate intermediate IO processes $\{P_{i,j}\}$ handling only IO

events or additional post computations, as shown on the bottom

of Fig. 5.

Agents in different activity states can be processed con-

currently. Thus, activity processes that are shared by several

agents may not block. To prevent blocking of IO-event based

processes (for example waiting for data), not-ready processes

pass the agent token back to the input queue. An IO process

either processes unprocessed agent tokens or waits for the

happening of events, controlled by the agent manager.

The pipeline architecture offers advanced resource sharing

and concurrent processing of agents in different activity states.

Only one activity process chain implementation for each agent

class is required on each node, in contrast to previous program-

mable architectures [4] providing only limited concurrency and

resource sharing.
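The token-based processing of agents can be sketched as a small Python simulation. It is a behavioral illustration under stated assumptions, not the RTL architecture: each activity process owns one input queue, transition conditions play the role of the selectors, and one call to `step` stands for one round in which every process handles at most one token. Tokens without an enabled transition are treated as finished, which is an assumption of this sketch. All names are chosen here for illustration.

```python
from collections import deque


class Pipeline:
    """Activity processes connected by token queues and conditional transitions."""

    def __init__(self):
        self.actions = {}       # activity name -> function(agent_data)
        self.queues = {}        # activity name -> input queue of agent tokens
        self.transitions = {}   # activity name -> list of (condition, target)
        self.data = {}          # agent token (identifier) -> local agent data
        self.finished = []      # tokens that left the pipeline

    def add_activity(self, name, action):
        self.actions[name] = action
        self.queues[name] = deque()
        self.transitions[name] = []

    def add_transition(self, source, target, condition=lambda d: True):
        self.transitions[source].append((condition, target))

    def inject(self, token, activity, data):
        self.data[token] = data
        self.queues[activity].append(token)

    def step(self):
        """Each process handles at most one token; return the number handled."""
        # Take the queue heads first so a token advances one stage per step.
        ready = [(name, q.popleft()) for name, q in self.queues.items() if q]
        for name, token in ready:
            data = self.data[token]
            self.actions[name](data)
            for condition, target in self.transitions[name]:
                if condition(data):
                    self.queues[target].append(token)
                    break
            else:
                self.finished.append(token)
        return len(ready)


def demo():
    p = Pipeline()
    p.add_activity("init", lambda d: d.update(count=0))
    p.add_activity("work", lambda d: d.update(count=d["count"] + 1))
    p.add_activity("done", lambda d: d.update(finished=True))
    p.add_transition("init", "work")
    p.add_transition("work", "work", lambda d: d["count"] < 3)
    p.add_transition("work", "done", lambda d: d["count"] >= 3)
    p.inject(1, "init", {})
    p.inject(2, "init", {})
    while p.step():
        pass
    return p.data
```

In the demo, two agents of the same class share one chain of activity processes; in a given step one agent can be in `work` while the other is still in `init`, which is the resource sharing described above.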

B. Resources

A rough estimation of the resource requirements R for the

hardware implementation of the agent processing architecture

supporting a set of N different agent classes $\{AC_i\}$ is shown

in Eq. 1, with each class having $M_i$ activities, $T_i$ transitions,

$D_i$ data cells with a resource weight $w_{data}$, and $w_{act,i,j}$ for

each activity, and a maximal number of managed agents for

each class $N_{agents,i}$. The tuple space database requires $w_{ts,i}$

resources for each supported n-dimensional space. The C

values are control parts independent of the above values.

$$
R \approx \Big( w_{data} \sum_{i \in AC} D_i N_{agents,i} \Big)
  + C_{sched} + w_{sched} \Big( \sum_{i \in AC} M_i \Big) + w_{sched} \max(N_{agents})
  + C_{comm} + (w_{queue} + w_{cond}) \Big( \sum_{i \in AC} T_i \Big)
  + \Big( \sum_{i \in AC} \sum_{j \in AT} w_{act,i,j} \Big) + C_{ts}
  + \Big( \sum_{i \in TS} S_i w_{ts,i} \Big) \qquad (1)
$$


Fig. 6. Agent migration and signal propagation using message transfers (LAH: Local Agent Handler, LID: Local Agent Identifier, AC: Agent Class, LIVE: agent life, STATE: agent state, DX and DY: spatial displacement vector, SID: Signal Identifier, SIG: pending signal. ID).

For example, assuming simplified four agent classes with

N=16 agents for each class, each class requires D=512

bit memory, M=10 ($w_{act}$=500), T=16, three tuple spaces

(1,2,3) with S=32 (and $w_{ts,1}$=32, $w_{ts,2}$=64, $w_{ts,3}$=128) entries each,

and $w_{data}$=4, $w_{queue}$=150, $w_{sched}$=60, $w_{cond}$=50, $w_{act}$=500,

$C_{sched}$=5000, $C_{comm}$=10000, $C_{ts}$=1000 (based on experimental

experiences, all w and C values in eq. gates units),

which results in 189400 eq. gates for the HW implementation.
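The estimate can be checked with a few lines of Python. The grouping of terms below is one reading of Eq. 1 and should be taken as an assumption: data memory scales with D and N per class, the scheduler with the total number of activities and the largest agent count, queues and selectors with the number of transitions, and each tuple space with its S entries times its weight. The function and key names are illustrative.

```python
def estimate_gates(classes, tuple_spaces, w, c):
    """Return the resource terms of the estimate in equivalent gates."""
    data = w["data"] * sum(k["D"] * k["N"] for k in classes)
    sched = (c["sched"]
             + w["sched"] * sum(k["M"] for k in classes)
             + w["sched"] * max(k["N"] for k in classes))
    comm = c["comm"] + (w["queue"] + w["cond"]) * sum(k["T"] for k in classes)
    act = sum(k["M"] * w["act"] for k in classes)   # same weight for every activity
    ts = c["ts"] + sum(s["S"] * s["w"] for s in tuple_spaces)
    return {"data": data, "sched": sched, "comm": comm, "act": act, "ts": ts}


# Values taken from the example in the text.
CLASSES = [{"N": 16, "D": 512, "M": 10, "T": 16}] * 4
TUPLE_SPACES = [{"S": 32, "w": 32}, {"S": 32, "w": 64}, {"S": 32, "w": 128}]
W = {"data": 4, "queue": 150, "sched": 60, "cond": 50, "act": 500}
C = {"sched": 5000, "comm": 10000, "ts": 1000}

terms = estimate_gates(CLASSES, TUPLE_SPACES, W, C)
total = sum(terms.values())
print(terms, total)
```

Under this reading the terms sum to 190400 equivalent gates. The 189400 quoted in the text is reproduced when the constant tuple-space control part (1000) is left out; which of the two is intended cannot be determined from the text alone. In either case the data memory term (131072) dominates the estimate.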

C. Power/Efﬁciency

Agents are often heavy-weighted processing entities inter-

preted by software-based virtual machines. In contrast, in the

proposed RTL architecture the agent behavior is mapped on

ﬁnite state machines and a data path with data word length

scaling, offering minimized power- and resource requirements,

both in the control and data path. Most activity statements are

executed by the platform in one or two clock cycles. All

administrative parts that are commonly part of an operating

system, like the agent manager, communication protocols, and

the tuple-space database, are implemented in hardware, offering

advanced computational power enabling low-frequency and

low-power designs, well suited for energy-autonomous sys-

tems. Transition network changes can be performed within a

few clock cycles.

D. Agent Manager

The agent manager provides a node level interface for

agents, and it is responsible for the creation, control (including

signals, events, and transition network conﬁguration), and

migration of agents with network connectivity, implementing a

main part of an operating system. The agent manager controls

the tuple-space database server and signal events required for

IO/event based activity processes.

The agent manager uses agent tables and caches to store

information about created, migrated, and passed through

agents (required, for example, for signal propagation), see Fig. 6.

E. Migration

Migration of agents between nodes incorporates only the

transfer of the agent state consisting of data (the content

of body variables) and the control state (a pointer to the

next activity to be executed after migration and the transition

conﬁguration) of the agent together with a unique global agent

identiﬁer (extending the local ID with the agent class and

the relative displacement of its root node) encapsulated in

messages with low overhead, shown in Fig. 6. This approach

minimizes network load and energy consumption signiﬁcantly.

Migration of simple agents results in a message size between

100 and 1000 bits. The agent start-up time after the data transfer

is low (a few hundred clock cycles).
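How an agent state could be serialized into such a compact message is sketched below in Python. The header field names follow the legend of Fig. 6, but the bit widths, the 16-bit body word size, and the field order are hypothetical choices made for this illustration; the actual message layout is not specified in the text.

```python
# Header fields follow the Fig. 6 legend; all bit widths are hypothetical.
HEADER_FIELDS = [("ac", 4), ("lid", 8), ("dx", 4), ("dy", 4),
                 ("live", 8), ("state", 6)]
SIGNED_FIELDS = {"dx", "dy"}   # displacements may be negative
WORD_BITS = 16                 # assumed width of one agent body variable


def pack_agent(header, body):
    """Return (bits, length): the message as an integer and its size in bits."""
    bits, length = 0, 0
    for name, width in HEADER_FIELDS:
        # Masking yields the two's complement form for negative values.
        bits = (bits << width) | (header[name] & ((1 << width) - 1))
        length += width
    for word in body:
        bits = (bits << WORD_BITS) | (word & ((1 << WORD_BITS) - 1))
        length += WORD_BITS
    return bits, length


def unpack_agent(bits, body_words):
    """Inverse of pack_agent for a message carrying body_words body variables."""
    body = []
    for _ in range(body_words):
        body.append(bits & ((1 << WORD_BITS) - 1))
        bits >>= WORD_BITS
    body.reverse()
    header = {}
    for name, width in reversed(HEADER_FIELDS):
        value = bits & ((1 << width) - 1)
        bits >>= width
        if name in SIGNED_FIELDS and value >= 1 << (width - 1):
            value -= 1 << width   # restore the sign
        header[name] = value
    return header, body
```

With these assumed widths the header takes 34 bits, so an agent with eight 16-bit body variables migrates in a 162-bit message, which lies within the range stated above.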

F. Transition Network

A switched transition network allows the reconﬁguration

of the activity transitions at runtime. Though the possible

reconﬁguration and the conditional expressions must be known

at compile time (static resource constraint), a reconfiguration

can release the use of some activity processes and enhance

the utilization for concurrent processing of other agents of the

same class. The transition network is implemented with tables

in case of the HW implementation, and with dynamic lists

in case of the SW and SIM implementations. Agent activity

transition conﬁgurations can be inherited by child agents.
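A table-based transition network with run-time reconfiguration can be sketched in Python as follows. The sketch assumes that all conditions are declared when the network is built, mirroring the static resource constraint, and that `change` replaces the condition of an existing transition; the exact semantics of `transition*` are not detailed in the text, so this is an assumption. Class and method names are illustrative.

```python
class TransitionNetwork:
    """Transition table whose entries can be added, changed, or removed."""

    def __init__(self, conditions):
        # condition name -> predicate(agent_data); fixed at design time
        self._conditions = dict(conditions)
        # (source activity, target activity) -> condition name
        self._table = {}

    def _check(self, cond):
        if cond not in self._conditions:
            raise KeyError(f"condition {cond!r} was not declared at design time")

    def add(self, source, target, cond):        # corresponds to transition+
        self._check(cond)
        self._table[(source, target)] = cond

    def change(self, source, target, cond):     # corresponds to transition*
        self._check(cond)
        if (source, target) not in self._table:
            raise KeyError(f"no transition {source} -> {target}")
        self._table[(source, target)] = cond

    def remove(self, source, target):           # corresponds to transition-
        self._table.pop((source, target), None)

    def next_activity(self, source, data):
        """Return the first enabled target activity, or None."""
        for (src, target), cond in self._table.items():
            if src == source and self._conditions[cond](data):
                return target
        return None
```

Because only the table entries change, a reconfiguration touches no activity code, which is consistent with transition network changes completing within a few clock cycles in the hardware implementation.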

G. Tuple-Space Database

Each n-dimensional tuple-space $TS_n$ (storing n-ary tuples)

is implemented with ﬁxed size tables in case of the hardware

implementation, and with dynamic lists in the case of the

software and simulation model implementations. The access of

each tuple-space is handled independently. Concurrent access

of agents is mutually exclusive. The HW implementation

implies further type constraints, which must be known at

design time (e.g. limitation of integer ranges) provided by sub-

range-typing in the AAPL speciﬁcation.

H. Signals

Signals must be processed asynchronously. Therefore, agent

signal handlers are implemented with a separate activity

process pipeline, one for each signal handler. For each pending

agent signal, the agent manager injects an agent token in

the respective handler process pipeline independent of the

processing state of the agent. Remote signals are processed

by the agent manager, which encapsulates signals in messages

sent to the appropriate target node and agent, shown in Fig. 6.

I. Synthesis

The database driven synthesis ﬂow is illustrated in Fig. 7.

The AAPL program is parsed and mapped to an abstract

syntax tree (AST). The ﬁrst compiler stage analyzes, checks,

and optimizes the agent speciﬁcation AST. The second stage

is divided into four parts: an activity to process mapper, a

transition to queue mapper, a transition (pipelined process-

ing architecture) network builder, and a message generator


Fig. 7. Simpliﬁed agent-on-chip high-level synthesis ﬂow producing different (independent) output targets.

Fig. 8. Distributed feature extraction in an unreliable and incomplete mesh network (with missing links) by using distributed agents with directed diffusion

migration and self-organization behavior.

supporting agent and signal migration. There are different

supported backends (HW/SW/SIM). The high-level hardware

description enables the SoC synthesis using the ConPro high-

level synthesis framework [10], which maps activity processes

on ﬁnite state machines and the RT datapath level.

The ConPro programming model reﬂects an extended CSP,

which provides atomic guarded actions on shared resources

access. Each process is implemented with an FSM and an RT

datapath. Furthermore, a software description (C), which can

be embedded in application programs, and a simulation model

usable for MAS simulation using the SeSAm simulator [9], can

be derived.

All implementation models (HW/SW/SIM) provide equal

functional behavior, and only differ in their timing, resource

requirements, and execution environments.

J. Simulation

In addition to hardware-implemented agent processing

platforms, the agent behavior, mobility, and interaction can be

simulated on a functional level using the SeSAm simulation

framework [9], which offers a platform for the modelling,

simulation, and visualization of mobile multi-agent systems

in a two-dimensional world. The behavior of agents is modelled

with activity graphs (specifying the agent reasoning machine)

close to the AAPL model. However, some transformations must be

applied to enable the simulation:

1. AAPL activities (IO/event-based) can block the agent processing

until an event occurs. Blocking agent behavior is not provided

directly by SeSAm ⇒ activity decomposition.

2. The transition network can change during run-time ⇒ use of

a transition scheduler.

3. The handling of concurrent asynchronous signals used in AAPL

for inter-agent communication cannot be established with the

generic activity processing in SeSAm ⇒ use of a signal scheduler.

V. CASE STUDY: STRUCTURAL HEALTH MONITORING

A small example implementing distributed feature detection

in an incomplete and unreliable mesh-like sensor network

using mobile agents demonstrates the suitability of

the proposed agent processing approach. The sensor network

consists of nodes, each attached to a sensor (e.g., a

strain gauge). The nodes can be embedded in a mechanical

structure, for example, one used in a robot arm. The goal of the

MAS is to ﬁnd extended correlated regions of increased sensor

intensity (compared to the neighborhood) due to mechani-

cal distortion resulting from externally applied load forces.

A distributed directed diffusion behavior and self-organization

(see Fig. 8) is used, derived from the image feature extraction

approach proposed in [17]. Single sporadic sensor activities

Fig. 9. Simulation results for two different sensor network situations (left: start, middle: exploration, right: ﬁnal result situation). Top row: sensor activity

within clusters, bottom row: sensor activity scattered over the network.

not correlated with the surrounding neighborhood should be

distinguished from an extended correlated region, which is the

feature to be detected.

There are three different agent classes: an exploration agent, a

node agent, and a deliver agent. A node agent is immobile

and is primarily responsible for sensor measurement and

observation.

The feature detection is performed by the mobile exploration

agent, which supports two main behaviors: diffusion

and reproduction. The diffusion behavior is used to move

within a region, mainly limited by the lifetime of the agent, and

to detect the feature, here the region with increased mechanical

distortion (more precisely, the edge of such an area). The

detection of the feature enables the reproduction behavior,

which induces the agent to stay at the current node, setting a

feature marking and sending out more exploration agents into

the neighborhood. The local stimulus H(i, j) for an exploration

agent to stay at a specific node with the coordinates (i, j) is

given by Eq. 2.

\[
H(i, j) = \sum_{s=-R}^{R} \sum_{t=-R}^{R} \bigl\{\, \lvert S(i + s, j + t) - S(i, j) \rvert \le \delta \,\bigr\} \tag{2}
\]

S: sensor signal strength

R: square region around (i, j)

The calculation of H at the current location (i, j) of the agent

requires the sensor values within the square area (the region

of interest, ROI) R around this location. If a sensor value

S(i+s, j+t) with s, t ∈ {−R,..,R} is similar to the value S at

the current position (the absolute difference does not exceed the parameter δ),

H is incremented by one.
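As a centralized reference for Eq. 2, the stimulus can be computed directly on a grid of sensor values. The following Python sketch is an illustration only: the function name `local_stimulus`, the list-of-lists grid, and the rule that positions outside the network are skipped are assumptions not stated in the text. Note that the term for s = t = 0 always contributes one, as Eq. 2 is written.

```python
def local_stimulus(S, i, j, R, delta):
    """Count the nodes in the square region around (i, j) whose sensor
    value differs from S[i][j] by at most delta (Eq. 2)."""
    rows, cols = len(S), len(S[0])
    h = 0
    for s in range(-R, R + 1):
        for t in range(-R, R + 1):
            y, x = i + s, j + t
            # Assumption: positions outside the network are skipped.
            if 0 <= y < rows and 0 <= x < cols:
                if abs(S[y][x] - S[i][j]) <= delta:
                    h += 1
    return h


# Hypothetical 5 x 5 sensor field with a 3 x 3 region of increased intensity.
grid = [
    [0, 0, 0, 0, 0],
    [0, 100, 100, 100, 0],
    [0, 100, 100, 100, 0],
    [0, 100, 100, 100, 0],
    [0, 0, 0, 0, 0],
]
h_center = local_stimulus(grid, 2, 2, R=1, delta=5)  # inside the region
h_corner = local_stimulus(grid, 1, 1, R=1, delta=5)  # at a corner of the region
```

In this example the interior node sees nine similar values, while the corner node of the region sees only four, which is the kind of difference the edge detection relies on.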

If the H value is within a parameterized interval

[ETAMIN, ETAMAX] (the bound names used in Ex. 1), the exploration agent has detected the feature

and will stay at the current node to reproduce new exploration

agents sent to the neighborhood. If H is outside this interval,

the agent will migrate to a different neighbor node and restart

exploration (diffusion).

The calculation of H is performed as a distributed

calculation of partial sum terms by sending out child explorer

agents to the neighborhood, which themselves can send out more

agents until the boundary of the region R is reached. Each

child agent returns to its origin node and hands over the partial

sum term to its parent agent, as shown in Fig. 8. Because a

node in the region R can be visited by more than one child

agent, the first agent reaching a node sets a marking MARK.

If another agent finds this marking, it will immediately return

to the parent. This multi-path visiting has the advantage of

an increased probability of reaching nodes with missing

(non-operating) communication links (see Fig. 8). A deliver agent,

created by the node agent, ﬁnally delivers exploration results

to interested nodes by using directed diffusion approaches, not

discussed here.
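The distributed summation with markings can be illustrated by a small sequential Python model. It is a sketch under several assumptions: child agents are modelled as recursive calls instead of concurrently migrating agents, the network is a dictionary mapping (x, y) to a sensor value, `broken_links` is a hypothetical set of non-operating links, and the origin's own term is counted so that the result matches Eq. 2 on a fully connected network.

```python
DIRS = {"NORTH": (0, 1), "SOUTH": (0, -1), "EAST": (1, 0), "WEST": (-1, 0)}
BACK = {"NORTH": "SOUTH", "SOUTH": "NORTH", "EAST": "WEST", "WEST": "EAST"}


def explore(S, origin, R, delta, broken_links=frozenset()):
    """Sum the partial terms of H by sending 'child agents' along all links."""
    s0 = S[origin]
    marked = {origin}  # nodes carrying the MARK set by the first visitor

    def link(a, b):
        # A link exists if the neighbor exists and the link is operating.
        return b in S and frozenset((a, b)) not in broken_links

    def visit(node, backdir, dx, dy):
        if node in marked:
            return 0  # marking found: return to the parent immediately
        marked.add(node)
        h = 1 if abs(S[node] - s0) <= delta else 0
        return h + spawn(node, backdir, dx, dy)

    def spawn(node, backdir, dx, dy):
        total = 0
        for d, (ox, oy) in DIRS.items():
            nx, ny = dx + ox, dy + oy  # displacement relative to the origin
            nxt = (node[0] + ox, node[1] + oy)
            inbound = abs(nx) <= R and abs(ny) <= R
            if d != backdir and inbound and link(node, nxt):
                total += visit(nxt, BACK[d], nx, ny)
        return total

    return 1 + spawn(origin, None, 0, 0)


field = {(x, y): 10 for x in range(5) for y in range(5)}
h_full = explore(field, (2, 2), R=1, delta=2)
# One missing link: the node (2, 3) is still reached over another path.
one_broken = frozenset({frozenset(((2, 2), (2, 3)))})
h_one_broken = explore(field, (2, 2), R=1, delta=2, broken_links=one_broken)
# Both usable links of (3, 3) are missing: its term is lost.
isolated = frozenset({frozenset(((3, 3), (3, 2))), frozenset(((3, 3), (2, 3)))})
h_isolated = explore(field, (2, 2), R=1, delta=2, broken_links=isolated)
```

With a single missing link the result is unchanged because the affected node is reached over an alternative path. Only when a node is cut off from all paths inside the region is its partial term lost.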

Ex. 1 shows the AAPL behavior speciﬁcation for the

exploration agent. The agent behavior is partitioned into nine

activities and two signal handlers. If a sensor node agent

observes an increased sensor value, it creates a new explorer

agent that enters the start activity (lines 8–19). Each explorer

agent is initialized on creation with two parameter arguments:

a direction and a radius value. The ﬁrst agent created by

the sensor node has no speciﬁc direction. Child agents with

a speciﬁc direction move to the respective node (line 11).

In line 18, the transition move → percept_neighbor

is created (all existing transitions starting from activity move

are deleted ﬁrst). The start activity transitions to the percept

activity, which creates child agents (lines 44–46). Forked

agents inherit all parent data and the current transition network

conﬁguration. For this, in line 56 the transition percept →

move is established (and inherited), but after forking it is reset in

TABLE I

HIGH-LEVEL AND GATE-LEVEL SYNTHESIS RESULTS FOR ONE SENSOR NODE

lines 61 and 62 for the parent agent behavior, which awaits the

return of all child agents and a decision for behavior selection

(reproduce/diffuse).

The child agents enter the move (lines 20–25) activity

after forking and will migrate in the speciﬁc direction to the

neighbor node. Finally, the percept_neighbor activity is

reached, which performs the local calculation (line 55) if there

was no marking found, and ﬁnally stores the partial result in

the tuple database. Further child agents are sent out if the

boundary of the ROI is still not reached.

Otherwise the agent goes back to its origin (parent)

by entering the goback activity, which performs the migration

(lines 66–68) after updating its h value from the tuple

database. Once the returning agent has arrived, it delivers its

h value by adding it to the local H value stored in the database

(lines 71 and 72) and raises the WAKEUP signal to notify the

parent, which causes the parent's signal handler to be entered

(lines 77–79).

If there is enough input and all child agents have returned

(or a time-out has occurred, handled by the signal handler

TIMEOUT, lines 80 and 81), the exploration agent enters either

the diffuse or the reproduce activity.
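The waiting of the parent agent for its children can be modelled with a pending-children counter, mirroring the roles of `enoughinput`, WAKEUP, and TIMEOUT in Ex. 1. The Python sketch below is a simplification under stated assumptions: the class `ParentExplorer` is hypothetical, and the partial result is passed directly to the handler, whereas in Ex. 1 the child adds it to the H tuple and the handler reads it from the tuple database.

```python
class ParentExplorer:
    """Illustrative model of the parent's WAKEUP/TIMEOUT handling."""

    def __init__(self, eta_min, eta_max):
        self.eta_min, self.eta_max = eta_min, eta_max
        self.h = 0
        self.enoughinput = 0  # number of child agents still expected
        self.timer_running = False

    def fork_children(self, count):
        self.enoughinput += count
        self.timer_running = count > 0

    def on_wakeup(self, partial_h):
        # A child has returned and delivered its partial sum term.
        self.h += partial_h
        self.enoughinput -= 1
        if self.enoughinput < 1:
            self.timer_running = False  # all children are back

    def on_timeout(self):
        # Stop waiting for children that were lost in the network.
        self.enoughinput = 0
        self.timer_running = False

    def next_activity(self):
        if self.enoughinput >= 1:
            return None  # still waiting for children
        if self.eta_min <= self.h <= self.eta_max:
            return "reproduce"
        return "diffuse"


parent = ParentExplorer(eta_min=3, eta_max=6)
parent.fork_children(3)
parent.on_wakeup(2)
waiting = parent.next_activity()
parent.on_wakeup(1)
parent.on_wakeup(1)
after_all_returned = parent.next_activity()

lossy = ParentExplorer(eta_min=3, eta_max=6)
lossy.fork_children(3)
lossy.on_wakeup(1)
lossy.on_timeout()  # two children never return
after_timeout = lossy.next_activity()
```

The time-out path shows why the approach tolerates lost agents: the parent decides on the partial sum collected so far instead of blocking forever.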

Diffusion and reproduction are limited by a lifetime

(decreased each time an explorer agent is replicated or

migrates, lines 27 and 36).

A. Synthesis and Simulation

The agent behavior speciﬁcation was synthesized to a digital

logic hardware implementation (single SoC) and a simulation

model with equal functional behavior suitable for the MAS

simulator environment SeSAm [9]. The suitability of the

self-organizing approach for feature detection is supported by

simulation results shown in Fig. 9 for two different sensor

network situations, each consisting of a 10 by 10 network with

autonomous sensor nodes. Each node is connected with up to

four neighbors. One situation creates signiﬁcant sensor values

arranged in a bounded cluster region, for example, caused

by mechanical forces applied to the structure, and the other

situation creates signiﬁcant sensor values scattered around the

network without any correlation, for example, caused by noisy

or damaged sensors.

In the first, clustered situation, the explorer agents are

able to detect the bounded region feature for the two

separated regions (indicated by the change of the agent colour

to black). Due to the reproduction behavior, there are several

agents at one location, as shown in the right agent density contour

plot. In the second, unclustered situation, the explorer agents

do not find the feature and vanish due to their limited lifetime.

The feature search is controlled by a set of parameters:

{δ, interval bounds (ETAMIN and ETAMAX in Ex. 1), lifetime, search radius R}.

The synthesis results of the hardware implementation

for one sensor node are shown in Table I and are in

accordance with the resource estimation from Sec. IV. The

AAPL specification was compiled to the ConPro programming

model and synthesized to an RTL implementation,

creating VHDL models. Two different target technologies

were synthesized using gate-level synthesis: 1. FPGA,

Xilinx XC3S1000 device target using the Xilinx ISE 9.2 software;

2. ASIC, standard cell LSI10K library using the

Synopsys Design Compiler software. The agent processing

architecture consisted of the activity process chain for the

explorer and node agent, the agent manager, the tuple-

space database (supporting two- and three-dimensional

tuples with integer type values), and the communication

unit.

This case study showed ﬁrstly the suitability of the multi-

agent-based approach for feature detection in large scale

sensor networks, for example used in real-time structural

health monitoring for sensor data ﬁltering, and secondly

the suitability of the proposed agent modelling and syn-

thesis approach for single System-on-Chip microchip-level

implementations.

Ex. 1: Excerpt of the AAPL speciﬁcation for agent class

Explore implementing a feature extraction agent with dis-

tributed directed diffusion and self-organizing behavior.

type keys={ADC,FEATURE,H,MARK};direction={..}

signal WAKEUP,TIMEOUT; val RADIUS := 4; ...

agent explore(dir:direction,

radius: integer[1..16]) =

var dx,dy:integer[-100..100];

live:integer[0..15]; ......

var* s: integer[0..1023]; ......

activity start =

dx := 0; dy := 0; h:= 0;

if dir <> ORIGIN then

moveto(dir);

case dir of

| NORTH -> backdir := SOUTH

| SOUTH -> ......

else

live := MAXLIVE; backdir := ORIGIN

group := random(integer[0..1023]);

transition*(move,percept_neighbor);

out(H,id(self),0); rd(ADC,s0?)

activity move =

case dir of

| NORTH -> backdir := SOUTH; incr(dy)

| SOUTH -> backdir := NORTH; decr(dy)

| WEST -> ....

moveto(dir)

activity diffuse =

decr(live); rm(H,id(self),?);

if live > 0 then

case backdir of

| NORTH -> dir :=

random({SOUTH,EAST,WEST})

| SOUTH -> ....

else kill(ME)

activity reproduce =

var n:integer;

decr(live);

if live > 0 then

for nextdir in direction do

if nextdir <> backdir and

link?(nextdir) then

fork(nextdir,radius)

transition*(reproduce,stay)

activity percept = -- Master perception --

enoughinput := 0; transition*(percept,move);

for nextdir in direction do

if nextdir <> backdir and

link?(nextdir)then

incr(enoughinput);fork(nextdir,radius)

transition*(percept,diffuse, (h<ETAMIN or

h > ETAMAX) and enoughinput < 1);

transition+(percept,reproduce,h>=ETAMIN and

h <= ETAMAX and enoughinput < 1);

timer+(TMO,TIMEOUT)

activity percept_neighbor =

if not exist?(MARK,group) then

mark(TMO,MARK,group); enoughinput:= 0;

rd(ADC,s?); out(H,id(self),calc());

transition*(percept_neighbor,move);

for nextdir in direction do

if nextdir <> backdir and

inbound(nextdir) and

link?(nextdir) then

incr(enoughinput);fork(nextdir,radius)

transition*(percept_neighbor,goback,

enoughinput < 1);

timer+(TMO,TIMEOUT)

else

transition*(percept_neighbor,goback) end

activity goback =

h:= 0; try_in(0,H,id(self),h?);

moveto(backdir);

activity deliver =

var v:integer;

in(H,id(parent),v?); out(H,id(parent),h+v);

send(id(parent),WAKEUP); kill(ME)

activity stay =

rm(H,id(self),?);

n:=0; try_in(0,FEATURE,n?);

out(FEATURE,n+1)

handler WAKEUP =

decr(enoughinput); try_rd(0,H,id(self),h?);

if enoughinput < 1 then timer-(TIMEOUT)end

handler TIMEOUT =

enoughinput := 0; again := true

function calc():integer=

if abs(s-s0) <= DELTA then return 1

else return 0

function inbound(nextdir:direction):bool=

case nextdir of

| NORTH -> return (dy < RADIUS)

| SOUTH -> ......

transitions =

start -> percept; percept -> move;

move -> percept_neighbor;

VI. CONCLUSION

A novel design approach using mobile agents for reli-

able distributed and parallel data processing in low-resource

networks with embedded hardware nodes was introduced.

A multi-agent programming language AAPL provides com-

putational statements and statements for agent creation, inher-

itance, mobility, interaction, reconﬁguration, and information

exchange, based on the agent behavior partitioning in an activ-

ity graph, which can be directly synthesized to the microchip

level by using a high-level synthesis approach and ﬁnite state

machines on RT level.

Agent interaction is delivered by a simple but powerful tuple

database approach. The tuple-space is a central part of the

agent’s belief, and contributes to the decision making process

of agents. Agents can be created dynamically at runtime by

other agents.
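The tuple database operations used in Ex. 1 (`out`, `try_rd`, `try_in`, `rm`) can be approximated by a small in-memory model. The Python sketch below is an assumption-laden illustration, not the synthesized database: only the non-blocking operations are modelled, `None` stands in for a formal parameter such as `h?`, and the class name `TupleSpace` is hypothetical.

```python
class TupleSpace:
    """Illustrative node-level tuple space with pattern matching."""

    def __init__(self):
        self._tuples = []

    @staticmethod
    def _match(pattern, candidate):
        # None acts as a wildcard (formal parameter) in a pattern.
        return len(pattern) == len(candidate) and all(
            p is None or p == v for p, v in zip(pattern, candidate)
        )

    def out(self, *values):
        """Store a tuple."""
        self._tuples.append(tuple(values))

    def try_rd(self, *pattern):
        """Return the first matching tuple without removing it, or None."""
        for candidate in self._tuples:
            if self._match(pattern, candidate):
                return candidate
        return None

    def try_in(self, *pattern):
        """Return and remove the first matching tuple, or None."""
        found = self.try_rd(*pattern)
        if found is not None:
            self._tuples.remove(found)
        return found

    def rm(self, *pattern):
        """Remove all matching tuples."""
        self._tuples = [c for c in self._tuples if not self._match(pattern, c)]


# Delivery of a partial result, following the idea of the deliver activity.
space = TupleSpace()
parent_id, child_h = 7, 2
space.out("H", parent_id, 3)                 # parent's current H value
old = space.try_in("H", parent_id, None)     # take the tuple out
space.out("H", parent_id, old[2] + child_h)  # put the updated sum back
updated = space.try_rd("H", parent_id, None)
```

Taking the tuple out before writing the updated value back keeps exactly one H tuple per parent, which is what allows several returning children to accumulate their terms one after another.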

This proposed agent processing architecture implements

a resource- and speed-optimized virtual machine consisting

of a reconﬁgurable pipelined communicating process chain.

Only one virtual machine is required for each agent class

that is to be supported on a particular network node. The

pipeline approach enables concurrent agent processing and

advanced resource sharing. Replication of activity processes

can increase the computational performance signiﬁcantly, for

example, based on timed Petri-Net analysis.

Unique identiﬁcation of agents does not require unique

absolute node identiﬁers or network addresses, a prerequisite

for loosely coupled and dynamic networks (due to failures,

reconfiguration, or expansion).

Reconﬁguration of the activity transition network supported

on programming level offers agent behavior adaptation at

runtime based on the data state of the agent resulting from

environmental changes like partial hardware or interconnect

failures or based on learning and improved knowledge base.

The transitional conﬁguration can be inherited by child agents.

Finally, improved resource sharing for parallel processing is

offered.
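The run-time reconfiguration of the transition network and its inheritance by child agents can be sketched in Python. The model rests on assumptions: `replace` follows the description of `transition*` in Sec. V (all existing transitions starting from the activity are deleted first), `add` is assumed to correspond to `transition+`, and the class `TransitionNetwork` with its `fork` method is hypothetical.

```python
class TransitionNetwork:
    """Illustrative activity transition table with conditional transitions."""

    def __init__(self, table=None):
        # activity -> list of (target activity, condition or None)
        self._table = {k: list(v) for k, v in (table or {}).items()}

    def replace(self, src, dst, cond=None):
        # Delete all transitions starting from src, then add the new one.
        self._table[src] = [(dst, cond)]

    def add(self, src, dst, cond=None):
        self._table.setdefault(src, []).append((dst, cond))

    def next(self, src, state):
        # The first transition whose condition holds is taken.
        for dst, cond in self._table.get(src, []):
            if cond is None or cond(state):
                return dst
        return None  # no transition enabled: the agent waits

    def fork(self):
        # A child inherits a copy of the current configuration.
        return TransitionNetwork(self._table)


ETAMIN, ETAMAX = 3, 6  # hypothetical parameter values

net = TransitionNetwork()
net.replace("percept", "move")
child = net.fork()  # the child inherits percept -> move

# After forking, the parent resets its own transitions.
net.replace(
    "percept", "diffuse",
    lambda s: (s["h"] < ETAMIN or s["h"] > ETAMAX) and s["enoughinput"] < 1,
)
net.add(
    "percept", "reproduce",
    lambda s: ETAMIN <= s["h"] <= ETAMAX and s["enoughinput"] < 1,
)

child_next = child.next("percept", {"h": 0, "enoughinput": 0})
parent_wait = net.next("percept", {"h": 5, "enoughinput": 2})
parent_hit = net.next("percept", {"h": 5, "enoughinput": 0})
parent_miss = net.next("percept", {"h": 1, "enoughinput": 0})
```

Because the child works on a copy, the parent's later reconfiguration does not affect the behavior inherited by the child.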

A case study implementing a self-organizing multi-agent

system in a sensor network demonstrated the suitability of

the proposed programming model, processing architecture,

and synthesis approach. Migration of agents requires only

the transfer of the control and data space of an agent using

messages. The agent behavior is ﬁxed and bound to each node.

The high-level synthesis tool enables the synthesis of different

output models from a common programming source, includ-

ing hardware, software, and simulation models delivering an

advanced design methodology for functional testing.

REFERENCES

[1] Y. Meng, “An agent-based reconﬁgurable system-on-chip architecture

for real-time systems,” in Proc. 2nd ICESS, Dec. 2005, pp. 166–173.

[2] M. Guijarro, R. Fuentes-Fernandez, and G. Pajares, “A multi-

agent system architecture for sensor networks,” in Multi-Agent

Systems—Modeling, Control, Programming, Simulations and Applica-

tions, F. Alkhateeb, Ed. Rijeka, Croatia: InTech, 2011.

[3] M. Lückenhaus and W. Eckstein, “A multi-agent based system for

parallel image processing,” Proc. SPIE, vol. 3166, pp. 21–30, Sep. 1997.

[4] S. Bosse and F. Pantke, “Distributed computing and reliable communication

in sensor networks using multi-agent systems,” Prod. Eng., vol. 7,

no. 1, pp. 43–51, Jan. 2013.

[5] X. Zhao, S. Yuan, Z. Yu, W. Ye, and J. Cao, “Designing strategy for

multi-agent system based large structural health monitoring,” Expert

Syst. Appl., vol. 34, no. 2, pp. 1154–1168, Feb. 2008.

[6] F. Pantke, S. Bosse, D. Lehmhus, and M. Lawo, “An artiﬁcial intel-

ligence approach towards sensorial materials,” in Proc. 3rd Int. Conf.

Future Comput. Technol. Appl., Sep. 2011.

[7] H. Peine and T. Stolpmann, “The architecture of the ara platform for

mobile agents,” in Mobile Agents (Lecture Notes in Computer Science).

New York, NY, USA: Springer-Verlag, 1997.

[8] A. I. Wang, C. F. Sørensen, and E. Indal, “A mobile agent architecture for

heterogeneous devices,” in Proc. Wireless Opt. Commun., 2003, pp. 1–7.

[9] F. Klügel, “SeSAm: Visual programming and participatory simulation

for agent-based models,” in Multi-Agent Systems—Simulation and Appli-

cations, A. M. Uhrmacher and D. Weyns, Eds. Boca Raton, FL, USA:

CRC Press, 2009.

[10] S. Bosse, “Hardware-software-co-design of parallel and distrib-

uted systems using a unique behavioural programming and multi-

process model with high-level synthesis,” Proc. SPIE, vol. 8067,

pp. 80670G-1–80670G-13, Apr. 2011.

[11] S. Napagao, B. Auffarth, and N. Ramirez. (2007). Agent Language

Analysis: 3-APL [Online]. Available:

http://www-lehre.inf.uos.de/~bauffart/mas_3apl.pdf

[12] M. Ebrahimi, M. Daneshtalab, P. Liljeberg, J. Plosila, and H. Tenhunen,

“Agent-based on-chip network using efﬁcient selection method,” in Proc.

IEEE/IFIP 19th Int. Conf. VLSI-SoC, Oct. 2011, pp. 284–289.

[13] F. G. McCabe and K. L. Clark, “APRIL—Agent process interaction

language,” in Intelligent Agents (Lecture Notes in Computer Science),

vol. 890. M. Wooldridge and N. R. Jennings, Eds. New York, NY, USA:

Springer-Verlag, 1995.

[14] I. del Campo, K. Basterretxea, J. Echanobe, G. Bosque, and F. Doctor,

“A system-on-chip development of a neuro–fuzzy embedded agent for

ambient-intelligence environments,” IEEE Trans. Syst., Man, Cybern.,

B, Cybern., vol. 42, no. 2, pp. 501–512, Apr. 2012.

[15] W. Lang, F. Jakobs, E. Tolstosheeva, H. Sturm, A. Ibragimov, A. Kesel,

et al., “From embedded sensors to sensorial materials—The road to

function scale integration,” Sens. Actuators A, Phys., vol. 171, no. 1,

pp. 3–11, 2011.

[16] S. Bosse, F. Pantke, and S. Edelkamp, “Robot manipulator with emergent

behaviour supported by a smart sensorial material and agent systems,”

in Proc. SSI Conf., Mar. 2013.

[17] J. Liu, Autonomous Agents and Multi-Agent Systems. Singapore: World

Scientiﬁc, 2001.

[18] C. Muldoon, G. O’Hare, M. O’Grady, and R. Tynan, “Agent migration

and communication in WSNs,” in Proc. 9th PDCAT, Dec. 2008,

pp. 425–430.

[19] R. Tynan, D. Marsh, D. O’Kane, and G. O’Hare, “Agents for wire-

less sensor network power management,” in Proc. ICPP Workshops,

Jun. 2005, pp. 413–418.

[20] H. Naji, “Creating an adaptive embedded system by applying multi-

agent techniques to reconﬁgurable hardware,” Future Generat. Comput.

Syst., vol. 20, no. 6, pp. 1055–1081, Aug. 2004.

Stefan Bosse studied physics at the University of

Bremen. He received the Ph.D. degree in physics

from the University of Bremen, in 2002.

In 2004, he joined the Department of Mathemat-

ics and Computer Science and the working group

robotics. Since 2002, he has been involved in parallel

and distributed systems, sensorial materials, circuit

design, computer aided design, and artificial intelligence.

Since 2008, he has conducted and contributed

to different projects in the ISIS Sensorial Materials

Scientific Centre, pushing interdisciplinary research,

and recently joined the ISIS Council.

He acts as a Reviewer for several international journals (ACM TODAES,

IEEE Sensors), is a Guest Editor of journals (IEEE Sensors, Elsevier Mecha-

tronics), and a member of the international conference program and organizing

committees (SysInt).