Biofractal: Life is the non-random replication of randomly varying replicators<br />
2013-07-03<br />
Electro-Flock: Modelling Flocks using Simple Electro-Magnetism<br />
<h2>
Flocks are easy to model if you use physics instead of biology</h2>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-8FOjj89yfxQ/UdAy6MKfq-I/AAAAAAAAIUA/WwI7vV_avWs/s457/electro-flock-1.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="253" src="http://2.bp.blogspot.com/-8FOjj89yfxQ/UdAy6MKfq-I/AAAAAAAAIUA/WwI7vV_avWs/s320/electro-flock-1.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">Look - it's all done with magnets!</td></tr>
</tbody></table>
<b>Electro-Flock</b> is a simple flocking algorithm that relies only on <a href="https://en.wikipedia.org/wiki/Coulomb's_law">Coulomb's Law</a>, the single physical equation that describes how electrically charged particles interact. The project uses CoffeeScript, jQuery, HTML5 and Processing.js.<br />
<br />
I started this project with the feeling that flocking algorithms were too specific. I wanted to see if I could write a really simple algorithm that didn't know it was meant to be a flocking algorithm. I had a feeling that electro-magnetism might just be the key to a really fundamental, self-organising flock. Was this feeling justified? Judge for yourself: <a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">Play with the Electro-Flock Demo</a><br />
<br />
<a name='more'></a><br />
<b><span style="color: #f1c232;">What did you use to build Electro-Flock? </span></b><br />
For a complete description of the technology stack please see the prequel post:<br />
<ul>
<li><a href="http://biofractal.blogspot.co.uk/2013/05/self-organising-fun-force-directed.html" target="_blank">Self Organising Fun : A Force Directed Graph in CoffeeScript</a></li>
</ul>
<br />
<b><span style="color: #f1c232;">Where can I see the demo and get the code?</span></b><br />
<ul>
<li><a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">Play with the Electro-Flock Demo</a> </li>
<li><a href="https://github.com/biofractal/force-directed-graph" target="_blank">Get all the code from GitHub</a></li>
</ul>
<b><span style="color: #6aa84f;"><br /></span></b>
<b><span style="color: #f1c232;">Not another Flocking Algorithm surely?</span></b><br />
<div class="separator" style="clear: both; text-align: center;">
</div>
It has been done before, many times, but something has always bothered me about the fluffy view of flocking. Being a firm advocate of evolutionary biology, I imagine Mother Nature as a cruel and selfish parent.<br />
<br />
I think that <i>apparent </i>co-operation is most often an illusion created by constrained individualism; in other words, individuals acting to serve their own goals in competition with others often end up acting in ways that just happen to <i>look like</i> they are co-operating and coordinating. In fact I had a feeling that the last thing most flock members would want to do is be part of a flock, or worse, part of a bait-ball! (see below for a definition and picture of a bait-ball)<br />
<br />
All this pontificating finally led me to two interesting questions:<br />
<ol>
<li>What would happen if I were to create a selfish bot that actively spurns flocking, a bot that attempts to stay as far as possible from all other bots?</li>
<li>What would happen if a collection of these selfish bots, an <i>anti-flock</i>, started to pursue their own selfish goals in competition with other identical bots?</li>
</ol>
The answer, as you can see from the <a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">demo</a>, turns out to be amazing. Despite themselves the collection of entirely selfish individual bots magically cohere to form beautiful, fluid, dynamic flocks and tight little bait-balls.<br />
<br />
<b><span style="color: #f1c232;">Does it really need magic though?</span></b><br />
Happily this <i>anti-flock</i> effect is simple to achieve without the aid of magic. Instead it relies on the single, prosaic principle of <i>electro-magnetism</i>. In fact the entire demo consists of nothing more than a combination of various types of magnet - yet the results are complex, very realistic and entirely <i>emergent</i>.<br />
<br />
<b><span style="color: #f1c232;">How do you make an anti-flock out of magnets?</span></b><br />
The rest of this post will try to explain why anti-flocks work but before delving into any more theory you should definitely play with the <a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">demo</a>. And remember - all the code is available from <a href="https://github.com/biofractal/force-directed-graph" target="_blank">GitHub</a> so you can see how to do it for yourself.<br />
<div>
<div>
<ul>
</ul>
</div>
<div>
<h3>
The Electro-Flock Demo - A Quick Start Guide</h3>
<div>
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://4.bp.blogspot.com/-wuWBbf1mQIc/UdA4DzeMvDI/AAAAAAAAIUQ/Jmxvrv3j-Yo/s441/electro-flock-2.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://4.bp.blogspot.com/-wuWBbf1mQIc/UdA4DzeMvDI/AAAAAAAAIUQ/Jmxvrv3j-Yo/s1600/electro-flock-2.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The top tool-bar</td></tr>
</tbody></table>
<br />
The top tool-bar allows you to set up your flocking environment. Choose the number of bots and, crucially, the number of attractors they will be chasing (see below for an explanation of why the attractors are important). You can also add some obstacles for the bots to avoid. All these things are just different types of magnets.<br />
<br /></div>
<div>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://1.bp.blogspot.com/-L-Sizeo4MSU/UdBYIMQxLuI/AAAAAAAAIWI/KX29JnVAWTI/s382/electro-flock-3.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="40" src="http://1.bp.blogspot.com/-L-Sizeo4MSU/UdBYIMQxLuI/AAAAAAAAIWI/KX29JnVAWTI/s320/electro-flock-3.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The bottom tool-bar</td></tr>
</tbody></table>
<br />
<div class="separator" style="clear: both; text-align: center;">
</div>
</div>
<div class="separator" style="clear: both; text-align: center;">
</div>
<div>
The bottom tool-bar allows you to switch off both the attractors and the obstacles. Switching off the attractors allows you to see the bots in their natural <i>anti-flock</i> state where they try their best to avoid flocking (see below for a picture of this). You can also reset the obstacles to mix things up. The <i>Help </i>link brings you back to this page.<br />
<h3>
Definitions and Discussion</h3>
<b><span style="color: #f1c232;">A Flock</span></b><br />
Normally a flock is defined as a number of birds that are feeding, resting or travelling together. I would say that a flock is a collection of individuals determined to achieve their own goals whilst avoiding others that are attempting to do the same.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://4.bp.blogspot.com/-jLqtwCYvpgA/UdA8VvmiSHI/AAAAAAAAIUw/3Eh85JV7R6o/s800/flock1.jpg" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="262" src="http://4.bp.blogspot.com/-jLqtwCYvpgA/UdA8VvmiSHI/AAAAAAAAIUw/3Eh85JV7R6o/s320/flock1.jpg" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A flock of birds</td></tr>
</tbody></table>
<br />
<b><span style="color: #f1c232;">A Bait-Ball</span></b><br />
A bait-ball occurs when small fish swarm in a tightly packed spherical formation about a common centre. It is commonly described as a <i>last-ditch defensive measure adopted by small schooling fish when they are threatened by predators. </i>I say this is wrong and that a bait-ball occurs when individual fish all selfishly attempt to get as far away as possible from a common set of predators.<br />
<br /></div>
<div>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-WJC51hmNPII/UdA-R0aJ0vI/AAAAAAAAIVA/aW0OEce1jmw/s1600/BaitBall.jpeg" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="180" src="http://2.bp.blogspot.com/-WJC51hmNPII/UdA-R0aJ0vI/AAAAAAAAIVA/aW0OEce1jmw/s320/BaitBall.jpeg" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The more predators, the tighter the ball</td></tr>
</tbody></table>
</div>
<div>
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-31vLB90vIM0/UpN88DjUuXI/AAAAAAAAIbk/meSo-ZE_2Co/s1600/fish-shark-1.jpg" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="320" src="http://2.bp.blogspot.com/-31vLB90vIM0/UpN88DjUuXI/AAAAAAAAIbk/meSo-ZE_2Co/s320/fish-shark-1.jpg" width="287" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A constrained, two dimensional bait-ball</td></tr>
</tbody></table>
<div>
<br />
<b><span style="color: #f1c232;">What is your Point?</span></b></div>
<div>
Simple really. A flock is not a coordinated thing. A bait-ball is not a cooperative thing. These phenomena emerge coincidentally whenever a collection of genetically similar actors find themselves selfishly pursuing their individual goals in the same location. Birds and fish are individualists; they are not members of a flock. The flock is simply a <i>coincidence </i>of time and space.<br />
<br />
Yes, birds of a feather do flock together but not because they want to. Instead they flock simply because they all want <i>the same things at the same time</i> and so end up trying to occupy the same coordinates in space-time.<br />
<br />
Clearly two things cannot occupy the same space-time coordinates so the individuals just do the best they can whilst selfishly optimising their individual trajectories - each chasing down whatever is attracting them or evading whatever is repulsing them whilst keeping the maximum distance from each other.<br />
<br />
Therefore the effect that we pattern-seeking humans see as a flock or bait-ball does not require any form of co-operation, coordination or communication. Instead it is simply the result of a group of individuals all trying to be in exactly the same place at the same time but <i>failing</i>. The only thing keeping the flock from imploding in a flurry of feathers or fish-scales is the mutual repulsion the individuals have for each other.<br />
<br />
This is the opposite of the usual way we imagine flocks. <br />
<br />
We normally think that the flock has some property of <i>coherence</i>, something that holds the flock together, but that is wrong. The flock is always trying to explode apart; the flock members want to be as far apart from each other as they can be, but since they are effectively identical (genetically) they are all attracted and repulsed to the same degree by the same things.<br />
<br />
This creates two opposing forces:<br />
<ol>
<li>The repulsion all bots feel for all other bots </li>
<li>The shared attraction bots feel to the things in the environment</li>
</ol>
And the critical, dynamic balance point between these two opposing forces is a flock.<br />
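In code, that balance point is nothing more than a per-bot sum of inverse-square pushes and pulls. Here is a minimal JavaScript sketch of the idea (the demo itself is written in CoffeeScript; the names and constants below are mine, purely illustrative):

```javascript
// Net force on one bot: summed repulsion from every other bot plus
// summed attraction toward every attractor. All names are illustrative.
function pairForce(k, from, to) {
  // Inverse-square force on `to`, pointing away from `from` when k > 0.
  const dx = to.x - from.x, dy = to.y - from.y;
  const d2 = Math.max(dx * dx + dy * dy, 1e-6); // avoid divide-by-zero
  const d = Math.sqrt(d2);
  return { x: (dx / d) * (k / d2), y: (dy / d) * (k / d2) };
}

function netForce(bot, bots, attractors) {
  const total = { x: 0, y: 0 };
  for (const other of bots) {
    if (other === bot) continue;
    const f = pairForce(+200, other, bot); // positive k: bots repel bots
    total.x += f.x; total.y += f.y;
  }
  for (const attractor of attractors) {
    const f = pairForce(-400, attractor, bot); // negative k: pulled toward attractor
    total.x += f.x; total.y += f.y;
  }
  return total;
}
```

The flock is whatever arrangement makes these sums settle; move one attractor and every bot's balance shifts.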
<br />
<b><span style="color: #f1c232;">Um .. OK. So why all this talk of magnets?</span></b><br />
If I am correct then I should be able to model all this really easily. On the one hand I say that each bot should be repulsed by all other bots. On the other hand each bot should be attracted by elements of its environment. For a bird-flock that might be a<i> positive attraction</i> to a number of suitable roosting spots; for a fish bait-ball that might be a <i>negative attraction</i> to a number of predators.<br />
<br />
All this attraction, both positive and negative, can be easily modelled using magnets, or more specifically using <a href="https://en.wikipedia.org/wiki/Coulomb's_law" target="_blank">Coulomb's Law</a> which is a law of physics that describes the electrostatic interaction between electrically charged particles.<br />
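Coulomb's law is short enough to show in full. A minimal JavaScript sketch (the demo's real code is CoffeeScript and lives on GitHub, so the force constant and names here are illustrative only):

```javascript
// Coulomb's law in 2D: F = k * q1 * q2 / d^2, directed along the line
// between the two charges. Like charges repel, unlike charges attract.
// K and the property names are invented for this sketch.
const K = 500; // arbitrary force constant, tuned by eye for a simulation

function coulombForce(a, b) {
  // Force exerted on particle `a` by particle `b`.
  const dx = a.x - b.x;
  const dy = a.y - b.y;
  const distSq = Math.max(dx * dx + dy * dy, 1e-6); // avoid divide-by-zero
  const dist = Math.sqrt(distSq);
  const magnitude = (K * a.charge * b.charge) / distSq;
  // Unit vector from b to a, scaled by the force magnitude: a positive
  // magnitude (like charges) pushes `a` directly away from `b`.
  return { x: (dx / dist) * magnitude, y: (dy / dist) * magnitude };
}
```

Flip the sign of one charge and the same function gives you attraction; that single sign flip is the difference between a bot, an attractor and a predator.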
<br />
<b><span style="color: #f1c232;">Bots are magnets - right. So what happens when you try it out?</span></b><br />
Unsurprisingly if you model the bots as magnets that simply repel each other you end up with a pure <i>anti-flock</i>, something like this:<br />
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://3.bp.blogspot.com/-ppxIFu_xL08/UdBGjgbZQQI/AAAAAAAAIVQ/veQsZpI8p98/s430/electro-flock-4.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="186" src="http://3.bp.blogspot.com/-ppxIFu_xL08/UdBGjgbZQQI/AAAAAAAAIVQ/veQsZpI8p98/s320/electro-flock-4.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The bots fly away from each other just as fast as they can</td></tr>
</tbody></table>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://1.bp.blogspot.com/-iTdWqppmFfg/UdBG9OnaFgI/AAAAAAAAIVY/lqhbVzBCHmU/s429/electro-flock-5.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="187" src="http://1.bp.blogspot.com/-iTdWqppmFfg/UdBG9OnaFgI/AAAAAAAAIVY/lqhbVzBCHmU/s320/electro-flock-5.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">Until they are equally spaced as far as possible</td></tr>
</tbody></table>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://1.bp.blogspot.com/-08thyBW8H24/UdBHW-Qj6lI/AAAAAAAAIVg/QdE-vWF3dvc/s241/electro-flock-6.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://1.bp.blogspot.com/-08thyBW8H24/UdBHW-Qj6lI/AAAAAAAAIVg/QdE-vWF3dvc/s241/electro-flock-6.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">You can see this for yourself if you switch the attractors off in the <a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">demo</a></td></tr>
</tbody></table>
<br />
The next step is to add some more magnets called <i>attractors</i>. The attractors positively attract bots to themselves, causing the bots to chase them. And wouldn't you know it, as soon as you add attractors into the mix something amazing happens - the anti-flock turns into a beautiful, flowing, dynamic <i>flock</i>, and the more attractors you add, the tighter and more coherent the flock becomes.<br />
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-B-f_g1HeIkw/UdBjOyZ_XQI/AAAAAAAAIWY/YlKXttsiCqM/s431/electro-flock-7.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="240" src="http://2.bp.blogspot.com/-B-f_g1HeIkw/UdBjOyZ_XQI/AAAAAAAAIWY/YlKXttsiCqM/s320/electro-flock-7.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A loose flock created by two attractors</td></tr>
</tbody></table>
<br />
<div class="separator" style="clear: both; text-align: center;">
</div>
<i><br /></i>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://4.bp.blogspot.com/-82AtpksonBU/UdBKyIHlMGI/AAAAAAAAIV4/IJGBV9cc1yw/s432/electro-flock-8.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="235" src="http://4.bp.blogspot.com/-82AtpksonBU/UdBKyIHlMGI/AAAAAAAAIV4/IJGBV9cc1yw/s320/electro-flock-8.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A tight bait-ball style flock created by twenty attractors</td></tr>
</tbody></table>
<i><br /></i>
<b><span style="color: #f1c232;">It works! That's great. But remind me - what is going on?</span></b><br />
It's so simple - bots are repelled by bots and attracted to attractors - that's it.<br />
<br />
This creates a system that seeks a balance point where the bots are as close as they can be to an attractor whilst being as far away as possible from each other. If the attractors were stationary then this balance point would be a static cloud of bots. However (and I didn't mention this before) the attractors are also repelled by the bots. So as the bots try to chase them down the attractors try to escape. Thus the balance point of the system is forever changing. Adding more than one attractor makes the balance point both dynamic and complex and the result is a beautiful flowing cloud of bots, in other words, a flock.<br />
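That ever-moving balance point can be sketched as a single simulation step, using simple Euler integration (again illustrative JavaScript with invented names and constants, not the demo's CoffeeScript source):

```javascript
// One frame of the chase: bots accelerate toward the attractors and
// away from each other, while the attractors accelerate away from the
// bots - so the system's balance point never settles.
function inverseSquare(k, from, to) {
  // Inverse-square force on `to`, pointing away from `from` when k > 0.
  const dx = to.x - from.x, dy = to.y - from.y;
  const d2 = Math.max(dx * dx + dy * dy, 1e-6), d = Math.sqrt(d2);
  return { x: (dx / d) * (k / d2), y: (dy / d) * (k / d2) };
}

function step(bots, attractors, dt) {
  for (const b of bots) {
    let fx = 0, fy = 0;
    for (const other of bots) {
      if (other === b) continue;
      const f = inverseSquare(+100, other, b); // bot-bot repulsion
      fx += f.x; fy += f.y;
    }
    for (const a of attractors) {
      const f = inverseSquare(-300, a, b); // bot is drawn to attractor
      fx += f.x; fy += f.y;
    }
    b.vx += fx * dt; b.vy += fy * dt;
  }
  for (const a of attractors) {
    let fx = 0, fy = 0;
    for (const b of bots) {
      const f = inverseSquare(+100, b, a); // attractor flees the bots
      fx += f.x; fy += f.y;
    }
    a.vx += fx * dt; a.vy += fy * dt;
  }
  for (const p of [...bots, ...attractors]) { p.x += p.vx * dt; p.y += p.vy * dt; }
}
```

Run this in a loop and the perpetual chase emerges: the bots never catch the attractors, and the attractors never shake off the bots.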
<br />
<b><span style="color: #f1c232;">It looks like a flock sure but come on, birds and fish don't work like this do they?</span></b><br />
Don't they? Maybe we have been making our explanations of flocking more complex and specific than they need to be because we have been thinking too much like biologists. Thinking like a physicist, that is, in terms of<i> electrically charged particles</i>, means we get the same (or better) results using a single, incredibly simple formula - Coulomb's law. <a href="http://en.wikipedia.org/wiki/Occam's_razor" target="_blank">Occam's razor</a> surely applies.<br />
<br />
<b><span style="color: #f1c232;">Go on then, tell me what you really think.</span></b><br />
Thinking electrically then - birds flock not because they want to flock or try to flock but because each individual bird finds itself flying through a type of five dimensional <a href="http://en.wikipedia.org/wiki/Gauge_theory" target="_blank">Gauge Field</a>. That's right - five dimensions: the four dimensions of space-time plus a bird-perceived fifth dimension of <i>environmental attractiveness</i>. It's just like flying through an electro-magnetic field with a compass.<br />
<br />
At each point in space-time the bird experiences a greater or lesser degree of attraction to any given direction of flight and simply responds accordingly, like iron filings lining up in a magnetic field. The varying field strengths throughout four dimensional space-time are a product of everything in the bird's environment, for example the desirability of roosting spots, the presence of predators, the availability or otherwise of food sources, the wind direction - you name it, it's an infinity of causes.<br />
<br />
Birds of the<i> same species</i> will perceive and react to their attractor-field in essentially the same way and so they will end up trying to be in the same place at the same time. Since this is impossible it implies that birds must avoid each other, in other words that birds are repulsed by each other. Thus we have the tension between the two forces of attraction and repulsion - and a flock is the dynamically balanced result.<br />
<br />
The same goes for a bait ball. The attractor field in this case is <i>negative</i>. Instead of being positively attracted to elements in their environment, the fish in a bait-ball are repulsed by predators - but the direction of attraction does not matter as the effect is just the same. Lots of functionally identical fish all experiencing the same attractor-field all want to go to the same place at the same time but they can't because of all the other fish that are doing the exact same thing.<br />
<br />
The bait ball is a pure and therefore very clear demonstration of the attractor field effect. Individually each fish acutely senses the varying degrees of repulsion as it travels through points in attractor-space-time. This has the effect of aligning the fish to the underlying attractor field lines and, in so doing, maximising the distance between the fish and the predators. With enough predators arranged around them in three dimensional space the least repulsive location becomes a perfect singularity, a point in five dimensional attractor-space-time around which all the fish align themselves with a precision that is distorted only by the presence of the other fish.<br />
<br />
This bait-ball effect is exactly what we see happening with the <a href="http://fdg.staging.goodpractice.net/flock.html" target="_blank">demo</a> when we add lots of attractors.<br />
<br />
<b><span style="color: #f1c232;">I sense a summary...</span></b><br />
Don't think of a fluffy, co-operating flock. Instead think of electrically charged particles moving through an electro-magnetic field. Swap the word 'magnetism' for 'environmental attractiveness' (both positive and negative) and just use the standard formula as described by Coulomb. What you get is flocking!<br />
<br />
This strikes me as a beautiful and potentially useful simplification of the current descriptions of natural flocking behaviours and that, for me at least, serves to make this apparently complex but actually fundamentally simple phenomenon even more awe inspiring.</div>
</div>
2013-05-04<br />
Self Organising Fun : A Force Directed Graph in CoffeeScript<br />
<h2>
Emergent Self Organising Behaviour using CoffeeScript, jQuery and Processing.js</h2>
<b><span style="color: #6aa84f;">What is a Force Directed Graph and why bother spending time coding one in CoffeeScript? </span></b><br />
A Force Directed Graph is a collection of nodes and links that self-organises until its nodes are as far apart as possible and the links do not cross. They appeal because <i>self-organisation</i> is just so intrinsically fascinating and, more professionally, because the project allowed me to bring together a whole set of exciting web technologies including CoffeeScript, jQuery, HTML5 and Processing.js.<br />
<br />
Before reading any further why not <a href="http://fdg.staging.goodpractice.net/forcedirectedgraph.html" target="_blank">Play with the Force Directed Graph Demo</a><br />
<br />
Or if you really want to blow your mind why not take a look at the innovative and beautiful flocking system based solely on the principles of electromagnetism <a href="http://biofractal.blogspot.co.uk/2013/07/electro-flock-modelling-flocks-using.html" target="_blank">Electro-Flock: Modelling Flocks using Simple Electro-Magnetism</a>.<br />
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://3.bp.blogspot.com/-J-J8-Tm1tHU/UYjGxVuiotI/AAAAAAAAH4U/gf_xGvoyRtQ/s1600/fdg_swarm-ordered.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://3.bp.blogspot.com/-J-J8-Tm1tHU/UYjGxVuiotI/AAAAAAAAH4U/gf_xGvoyRtQ/s1600/fdg_swarm-ordered.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">They did it all by themselves</td></tr>
</tbody></table>
<b><span style="color: #6aa84f;">How did it turn out? </span></b><br />
I started this project wondering if CoffeeScript would be worth learning and finished it vowing never again to write another line of naked JavaScript. I also found the combination of CoffeeScript and jQuery to be a powerful and elegant solution to the problem of coding against the browser. Finally, using the Processing.js library coupled to an HTML5 canvas tag meant I could tap the client's GPU and so achieve the smooth visualisation of forces and vectors I wanted.<br />
<a name='more'></a><br />
<ul>
<li><a href="http://fdg.staging.goodpractice.net/forcedirectedgraph.html" target="_blank">Play with the Force Directed Graph Demo</a> </li>
<li><a href="https://github.com/biofractal/force-directed-graph" target="_blank">Get all the code from GitHub</a></li>
</ul>
<br />
For more information on Force Directed Graphs please see the Definitions and Discussion section below.<br />
<div>
<div>
<h3>
Technology Stack</h3>
<ul>
<li><a href="http://www.microsoft.com/visualstudio/eng/downloads#d-2012-express" target="_blank">Visual Studio 2012 Express</a><br />Free editor. No extensions supported</li>
<li><a href="http://www.microsoft.com/visualstudio/eng/downloads#d-2012-express" target="_blank">Jurassic Coffee</a><br />A CoffeeScript compiler and linker. I run it as a build event.</li>
<li><a href="http://coffeescript.org/" target="_blank">CoffeeScript</a><br />The shorter, better version of JavaScript</li>
<li><a href="http://jquery.com/" target="_blank">jQuery</a><br />A JavaScript library for dealing with the browser DOM</li>
<li><a href="http://processingjs.org/" target="_blank">Processing.js</a><br />A JavaScript library for high-performance 2D drawing onto an HTML5 canvas</li>
<li><a href="http://twitter.github.io/bootstrap/" target="_blank">Bootstrap</a><br />A CSS library so I don't have to think about it.</li>
<li><a href="http://en.wikipedia.org/wiki/HTML5" target="_blank">html5</a><br />It's got the new Canvas element that allows for modern animation in the browser.</li>
</ul>
</div>
<div>
<h3>
The Demo</h3>
The <a href="http://fdg.staging.goodpractice.net/forcedirectedgraph.html" target="_blank">demo</a> allows you to play with Force Directed Graphs of various sorts. Here is a quick run-down.<br />
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://1.bp.blogspot.com/-jeg7k7CVHA8/UYP1a-o90gI/AAAAAAAAH4E/P2DAqLEPyHI/s1600/fdg_click-on-screen.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://1.bp.blogspot.com/-jeg7k7CVHA8/UYP1a-o90gI/AAAAAAAAH4E/P2DAqLEPyHI/s1600/fdg_click-on-screen.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">This is very important</td></tr>
</tbody></table>
The first thing - don't overlook clicking on the screen. This toggles a Mouse attractor on and off. The effect is dramatic as you convert a Force Directed Graph into a poor man's flocking algorithm. It's also handy for generally mixing things up.<br />
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://4.bp.blogspot.com/-yOo-BFu43vY/UYPSbh5BSHI/AAAAAAAAH3E/DHzPIdtQhfM/s1600/fdg_top-menu.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="28" src="http://4.bp.blogspot.com/-yOo-BFu43vY/UYPSbh5BSHI/AAAAAAAAH3E/DHzPIdtQhfM/s320/fdg_top-menu.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The top tool-bar</td></tr>
</tbody></table>
<div>
The first drop-down determines the number of copies of each graph to generate. Each copy will repel the others so multiple copies can introduce some interesting, higher-order patterns.<br />
<br />
The second drop-down determines the type of graph to be generated. Each graph type differs in the number of nodes, the interconnections between the nodes and the node mass. The physical constants that determine magnetic repulsion and spring strength remain the same for all.<br />
<br />
The third drop-down allows you to change the behaviour of a node as it encounters the edge of the drawing area.<br />
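For illustration, two common edge behaviours might look like this in JavaScript (the mode names here are hypothetical, not necessarily the options the demo offers):

```javascript
// Two of the usual edge behaviours for a node leaving the drawing area:
// "bounce" reflects the node's velocity at the wall; "wrap" teleports
// it to the opposite edge. Names and shapes are illustrative.
function handleEdge(node, width, height, mode) {
  if (mode === "bounce") {
    if (node.x < 0 || node.x > width) {
      node.vx = -node.vx;
      node.x = Math.min(Math.max(node.x, 0), width); // clamp back inside
    }
    if (node.y < 0 || node.y > height) {
      node.vy = -node.vy;
      node.y = Math.min(Math.max(node.y, 0), height);
    }
  } else if (mode === "wrap") {
    // Double-modulo keeps negative coordinates positive.
    node.x = ((node.x % width) + width) % width;
    node.y = ((node.y % height) + height) % height;
  }
}
```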
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-eRgAYaBGA0U/UYPSTHPCDsI/AAAAAAAAH28/eHw6JCP2wHw/s1600/fdg_bottom-menu.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="42" src="http://2.bp.blogspot.com/-eRgAYaBGA0U/UYPSTHPCDsI/AAAAAAAAH28/eHw6JCP2wHw/s320/fdg_bottom-menu.png" width="320" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">The bottom tool-bar</td></tr>
</tbody></table>
<div>
The <i>Noise </i>drop-down allows you to introduce random noise into the mix. Force Directed Graphs will often beach themselves on the desolate shores of sub-optimality: these locally optimal solutions trap the pattern just short of the globally optimal one. Randomness can help by jiggling the system past the local optima so it can keep falling towards perfection.<br />
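The jiggling itself can be as simple as a small random kick to each node's velocity every frame while noise is switched on. A JavaScript sketch (names are mine, not the project's):

```javascript
// Random jitter to shake a graph out of a local optimum: add a small
// uniform random kick in [-magnitude, +magnitude] to each node's
// velocity. The magnitude knob mirrors a Noise-style setting.
function applyNoise(nodes, magnitude) {
  for (const node of nodes) {
    node.vx += (Math.random() * 2 - 1) * magnitude;
    node.vy += (Math.random() * 2 - 1) * magnitude;
  }
}
```

The kicks average out to zero, so they disturb the layout without biasing it in any particular direction.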
<br />
The <i>Springs </i>check-box toggles the springs on and off. How do the nodes act when they are unfettered?<br />
<br />
The <i>Magnets </i>check-box toggles the magnetic repulsor fields on and off. <br />
<br />
The <i>Help </i>link brings you back to this page.<br />
<h3>
Resources</h3>
<br />
<ul>
<li><a href="http://natureofcode.com/book/chapter-1-vectors/" target="_blank">The Nature of Code by Daniel Shiffman</a><br />This was my number 1 resource. It is well written and engaging but most of all it tells you everything you need to know about vectors, forces, springs and magnets and how to code them up.</li>
<li><a href="http://www.brad-smith.info/blog/archives/129" target="_blank">A Force-Directed Diagram Layout Algorithm</a><br />Another force-directed example this time written in C#. I reckon I probably stole some code from here.</li>
<li><a href="http://processing.org/learning/topics/springs.html" target="_blank">Springs</a><br />An example of how to code various springs in Processing.js, the 2D drawing package I used.</li>
</ul>
<br />
<h3>
Definitions and Discussion</h3>
<b><span style="color: #6aa84f;">Graph</span></b><br />
A picture of an interconnected set of items where each item is represented by a node and each connection by a link. <br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://3.bp.blogspot.com/-UPCu0ckpyAk/UYPVocWVTgI/AAAAAAAAH34/HAZPEL1SCV8/s1600/fdg_graph.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://3.bp.blogspot.com/-UPCu0ckpyAk/UYPVocWVTgI/AAAAAAAAH34/HAZPEL1SCV8/s1600/fdg_graph.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A simple, static Graph</td></tr>
</tbody></table>
<br />
<b><span style="color: #6aa84f;">Force Directed Graph</span></b><br />
A graph that self-organises until its nodes are as far apart as possible and the links do not cross. The nodes repel one another whereas the links act like springs.</div>
<div>
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://4.bp.blogspot.com/-MnNALYeFr_U/UYPVoYkT_II/AAAAAAAAH30/cco8TUayDIE/s1600/fdg_blossom-ordered.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://4.bp.blogspot.com/-MnNALYeFr_U/UYPVoYkT_II/AAAAAAAAH30/cco8TUayDIE/s1600/fdg_blossom-ordered.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A dynamic Force Directed Graph achieves a state of cybernetic balance</td></tr>
</tbody></table>
<br />
<span style="color: #6aa84f;"><b>No Really, What is a Force Directed Graph?</b></span></div>
<div>
Imagine a single tennis ball connected to ten otherwise identical balls by equal lengths of string. How to disentangle this?<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://2.bp.blogspot.com/-1uakWoSVKUk/UYPVpKKVrhI/AAAAAAAAH3o/O-YwhEZLQq8/s1600/fdg_spokes-tangled.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://2.bp.blogspot.com/-1uakWoSVKUk/UYPVpKKVrhI/AAAAAAAAH3o/O-YwhEZLQq8/s1600/fdg_spokes-tangled.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">Triangular tennis balls?</td></tr>
</tbody></table>
<br />
You could take the brute force approach and sit cross-legged unpicking the knots and arranging the balls. Or you could just let the whole thing self-organise until no balls touch and no strings cross.<br />
<br />
<table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody>
<tr><td style="text-align: center;"><a href="http://1.bp.blogspot.com/-MMZVL8CXiC4/UYPVo6ttCkI/AAAAAAAAH3s/GVDkpzWYI1E/s1600/fdg_spokes-ordered.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" src="http://1.bp.blogspot.com/-MMZVL8CXiC4/UYPVo6ttCkI/AAAAAAAAH3s/GVDkpzWYI1E/s1600/fdg_spokes-ordered.png" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;">A globally optimal solution</td></tr>
</tbody></table>
<br />
You may think that a system of tennis balls and string cannot self-organise but that is not true. This system, like all physical systems, will organise itself until it reaches a low-energy state. It will stop self-organising the moment no energy-lowering moves remain, in other words, when every remaining move implies a rise in the energy state of the system.<br />
<br />
The problem: For tennis balls and string this final state is vastly more likely to be a tangled mess rather than a pleasing, symmetrical pattern. We <i>could </i>re-engineer the system using exotic materials but doing this in the real world would be impractical.<br />
<br />
The trick is to simulate a node-and-link system (a graph) so that its lowest energy state happens to resemble the pattern you want. Such a system should then untangle itself and, like a ball rolling downhill, self-organise toward the desired pattern.<br />
<br />
But we don’t want to just define a low energy state and have the system go there. That’s no fun. Instead we want to simulate a real system, albeit one with impractical physical properties. It is the nature of these properties, their interactions, and the final dynamic balance point that determines the final pattern. Once modelled, we should be able to just sit back and watch as the complex behaviour spontaneously emerges, powered only by the laws of pseudo-thermodynamics.<br />
<br />
To achieve this we must replace the tennis balls with imaginary <i>repulsor </i>magnets. The closer these magnets get to each other the greater the repulsive force they feel. The links are replaced with <i>springs</i>. As a link is stretched so it pulls with ever increasing force on the magnets at either end. All that remains is to tweak the strength of the springs and magnets until a nice, critical balance is reached.<br />
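The whole scheme fits in one update function. Below is a minimal Python sketch of a single tick (the demo itself is CoffeeScript/Processing.js, and the function and parameter names here are my own inventions): every pair of nodes feels a Coulomb-style 1/d&#178; push apart, every link pulls like a Hooke spring, and damped integration bleeds energy out of the system so it can settle.

```python
import math

def layout_step(positions, edges, repulsion=1000.0, stiffness=0.05,
                rest_length=50.0, damping=0.85):
    """One tick of a force-directed layout (a sketch, not the demo's code).

    positions: dict node -> [x, y]; edges: list of (a, b) node pairs.
    """
    forces = {n: [0.0, 0.0] for n in positions}
    nodes = list(positions)
    # Repulsor magnets: the closer two nodes get, the harder the push apart.
    for i, a in enumerate(nodes):
        for b in nodes[i + 1:]:
            dx = positions[a][0] - positions[b][0]
            dy = positions[a][1] - positions[b][1]
            d = max(math.hypot(dx, dy), 1e-6)
            f = repulsion / (d * d)
            fx, fy = f * dx / d, f * dy / d
            forces[a][0] += fx
            forces[a][1] += fy
            forces[b][0] -= fx
            forces[b][1] -= fy
    # Springs: the more a link is stretched, the harder it pulls back.
    for a, b in edges:
        dx = positions[b][0] - positions[a][0]
        dy = positions[b][1] - positions[a][1]
        d = max(math.hypot(dx, dy), 1e-6)
        f = stiffness * (d - rest_length)
        fx, fy = f * dx / d, f * dy / d
        forces[a][0] += fx
        forces[a][1] += fy
        forces[b][0] -= fx
        forces[b][1] -= fy
    # Damped integration: the system sheds energy each tick and settles.
    for n in nodes:
        positions[n][0] += damping * forces[n][0]
        positions[n][1] += damping * forces[n][1]
    return positions
```

Iterate <i>layout_step</i> until the movements die away and the graph is left at, or near, its lowest-energy arrangement.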
<br />
A system like this will seek a low energy state just as any real physical system would. The combination of the magnets repelling plus the springs attracting makes the system <i>spontaneously </i>untangle and re-order. To stretch the analogy - the system is powered by virtual kinetic energy harvested from virtual potential energy as it spontaneously tumbles down an imaginary energy gradient. The process stops when the system finally achieves the lowest energy state, which we have cunningly engineered to be both ordered and aesthetically pleasing.<br />
<br />
It’s not magic but it sometimes looks that way. The fascination of watching Force Directed Graphs gracefully disentangle themselves may well stem from the fact that real-life systems overwhelmingly tend not to be ordered at low energy states. Disordered states so vastly outnumber ordered states that our evolutionary expectations assume that any non-living system’s low energy state will be disordered, and this is usually correct. It is not true for Force Directed Graphs, however, and so, as they spontaneously transition from disorder to order, they create a strong illusion of purposeful behaviour; sometimes they seem almost eerily life-like.<br />
<br />
A Force Directed Graph is therefore a computer representation of an edge-case physical system. It models a system of repulsor-nodes and springy-links with the final shape of the optimally ordered pattern being determined by the interconnections and the precise balance of the opposing forces of attraction and repulsion.<br />
<br />
And finally, don't forget to take a look at the exciting electromagnetic flocking system <a href="http://biofractal.blogspot.co.uk/2013/07/electro-flock-modelling-flocks-using.html" target="_blank">Electro-Flock: Modelling Flocks using Simple Electro-Magnetism</a>.</div>
</div>
biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com1tag:blogger.com,1999:blog-6927558.post-41383064217814454332012-09-10T12:34:00.004+01:002013-05-04T17:13:13.137+01:00Goodbye OpenRasta - Hello Nancy<br />
This is a painful post to write as I still feel a strong residual loyalty to <a href="http://openrasta.org/" target="_blank">OpenRasta</a> (OR); however, I have now moved over to <a href="http://nancyfx.org/" target="_blank">Nancy</a> and so I would like to give you my reasons why. These are not technical reasons, far from it; instead I give you nothing more than a sorrowful account of feelings hurt, a comparison between the brutal, distant and dominant love provided by OR versus the beautiful, seductive and submissive framework that is sweet Nancy.<br />
<br />
<a name='more'></a><br /><br />
To establish my bona fides: The pain I have experienced using OR cannot be easily dismissed, the problems were not simply due to a lack of familiarity or a sanguine welcome to the razor's edge of open-source development. I have now delivered two projects with OR as well as spending time (and money) with <a href="http://serialseb.blogspot.co.uk/" target="_blank">Sebastien Lambla</a> (Seb) taking his REST course down in London - and I think that's enough experience to get the gist of the framework, even if I am not exactly a glassy-eyed contributor.<br />
<br />
The final result, in my opinion, is this: OR was almost great, and may yet be great still, but it has been hamstrung by the side of the tracks that spawned it. It was not born of the grotesque, flabby, Microsoft world (that pays my bills), but sprang forth from the lean, hyper-cool loins of the command-line world where Linux is something other than a Charlie Brown character and eidetic memories for endless strings of command-line Linear B are <i>de rigueur</i>.<br />
<br />
And I almost made it. I almost stopped being a child-coder with Microsoft chocolate smeared around my mouth, I almost became an ethically-focused, command-line vegan coder because OR, for a while, bridged those two worlds. But then Seb took his eye off the ball and, like Macbeth, the vaunting ambition that made him great led to his downfall. And fall down he did, back into the world of black windows with strange commands, the no-mouse, no-GUI world, the <a href="http://en.wikipedia.org/wiki/Essenes" target="_blank">Essene</a> asceticism of keyboard short-cuts, and instead of continuing with OR, pushing hard to fix its early-years verbosity and birthing difficulties, he wandered away to become an absent father.<br />
<br />
Perhaps this is how open-source is meant to work, perhaps it was now down to the acolytes to push the code forward, and for a long while I hoped so, but I am not a source-code GitHub-jockey, I want something I can install and use. And yes, I know, I should be cool with incanting my own assemblies from magical hieroglyphic streams but my flesh is weak. I don't really care what a <b>make.bat</b> is - something to do with Windows 3.1 if memory serves - which it doesn't usually.<br />
<br />
So when it came to being a player, a contributor, an open-source-head, I failed, and I wept as I failed, I raged against the purity of others as I failed, but I failed all the same. OR worked, yes it did, and compared to MVC (oh, I am not that bad come on, MVC, really ... anybody? ...) it was a revelation of RESTful cleanliness, but it did not work well enough, not for an MS fan-boi coder like me, with my fat behind and nervous giggle, sweating in a wannabe black turtle-neck, stretched and lumpy, splattered with dribbles of full-fat Starbucks coffee. The marathon-running dream receded with each month that went by without an official OpenRasta release, the vegan dream with each cheeseburger I crammed into my eggy-bearded mouth, the Linux dream when I realised I couldn't even master the miniature piano, far less remember <span style="font-family: courier, lucidatypewriter, monospace; font-size: 15px;">du -s * | sort -nr</span>.<br />
<br />
Personally I think Seb got bored and needed a new challenge. OpenWrap (his package manager) was a brilliant idea but MS got there second with the inferior <a href="http://nuget.org/" target="_blank">Nuget</a> and, if Seb had not been so pure, he would have realised that his true calling was to follow through with his beautiful, but flawed, OR and thus raise himself as the RESTful saviour for soft-palmed coders like me who just don't get why icons are bad, coders who guiltily stuff their mouths with creamy Nuget packages because it's so easy, coders who right-click on their projects and select 'Manage Nuget' even though they know that for some vague, undefined reason this is meant to be bad, bad, bad.<br />
<br />
Then Nancy sashayed into the room and one look was all it took. She was just so <i>easy</i>. Everything just worked (as it would have <i>just worked</i> in OR if Seb had not wandered back into the desert seeking further visions). The Nancy site looked great (as the OR site would have looked if such things had any interest to hard-muscled, flinty-eyed Linux coders). And now I love Nancy, I just can't help it, she is beautiful and she is getting better all the time - with regular updates to help smooth my brow and keep my underpants clean. She smiles with concerned understanding at my code-smells and quietly helps me to REST at night, she doesn't even seem to mind my guilty habits, my perverse but harmless need to update my project with her latest code, even as I cast a side-ways glance before furtively click-click-clicking my dirty little mouse.<br />
<br />
I know I am wrong, I know that OR and OpenWrap are probably better, but I also think that none of that matters for the majority of coders like me, who want to impress the boss with the sites they make, coders who try to care about the command line and silently weep tears of acid guilt because it <i>makes no sense</i>, coders who cannot resist unstopping their ears because they <i>want </i>to hear the sirens calling, calling us back to the colours and the pictures and the pretty-little-clicky-things, coders who will work with impure products just as long as it makes their grubby lives that little bit easier.<br />
<br />
I am sorry Seb, I really am. I think you are great, a visionary, but your eye wandered from the little people and I felt unloved. I am just a humble C# coder, I need my Visual Studio crutch because I love all those code colours and I am addicted to the soft whisperings of my metham-resharper. The command-line is a cold place to be and I need warm cuddles - I think you lost me when you forgot that.biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com7tag:blogger.com,1999:blog-6927558.post-44372743644692328862012-05-09T16:19:00.004+01:002013-05-04T17:13:36.560+01:00RavenDB or SQL Server? Which one should I use?SQL Server is not a database for storing anything except ad-hoc reporting data. It is splendid for that: ad-hoc reports, data-mining, real time relationship discovery - and it's optimal for these uses because way back when Codd designed the rules, that is what it was designed for (see <a href="http://biofractal.blogspot.co.uk/2009/02/data-driven-conspiracy.html" target="_blank">The Data Driven Conspiracy</a> for more details).<br />
<div>
<br />
Of course you can store other types of data in a relational database. For example you can serialise your domain state to a SQL Server database and retrieve that state at a later date, but this is a terrible use of a relational system. So bad, in fact, that whole layers of code (so-called 'data-layers', or DALs), many thousands of lines, are required to make this kind of task remotely do-able, testable and maintainable.</div>
<div>
<br />
The great, decades-long confidence trick has been for the SQL-based retailers to convince us that there was no better way to store hierarchical / document data other than real-time conversion between radically different data-structures. DALs and the latest ORMs have all been trumpeted but, in the end, these are nothing more than codecs to thunk your bits back and forth between different data organisations - different patterns on the disk.</div>
<div>
<br />
Madness!</div>
<div>
<br />
<a name='more'></a><br /><br />
So forget all the itsy-bitsy, angels-on-a-pinhead arguments about atomicity, consistency, isolation, durability and all that jazz. The elephant in the room is that if you are using SQL Server and you are not a pharmaceutical company that is dynamically mining data-cubes for hidden inferences within large populations (are you?) then you are using the wrong technology to store your data.<br />
<br />
If, on the other hand, you are a humble application developer - a coder who, like most of us, simply wishes to serialize domain-state in a way that does not mean learning a whole new set of ideologies and syntax then, for goodness sake, do not use a relational database. Just don't.<br />
<br />
RavenDB? I have been using this for a while (having evolved through DB4O and CouchBase) and I can say that it does what it says on the tin. It stores my stuff and gives me it back when I ask. I did not have to write any data-layers nor use a third party ORM. I did not have to learn a structured query language and I did not have to bolt on extras to make Full Text search work (RavenDB is based on Lucene so it all 'just works'). So far so good.<br />
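The "stores my stuff and gives me it back" idea is worth making concrete. The toy below is a Python sketch of document-style persistence in general - it is emphatically <i>not</i> RavenDB's API (RavenDB is a C# client), and every name in it is my own invention - but it shows why no DAL or ORM is needed: whole objects go in, whole objects come out.

```python
import json

class DocumentStore:
    """A toy in-memory document store, purely for illustration."""

    def __init__(self):
        self._docs = {}

    def store(self, doc_id, obj):
        # Serialise the domain state as one document:
        # no tables, no joins, no object-relational mapping.
        self._docs[doc_id] = json.dumps(obj)

    def load(self, doc_id):
        return json.loads(self._docs[doc_id])

db = DocumentStore()
db.store("orders/1", {"customer": "Ada", "lines": [{"sku": "x1", "qty": 3}]})
order = db.load("orders/1")
```

The hierarchical order comes back exactly as it went in; the "codec" between the domain shape and the stored shape is a single serialisation call.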
<br />
But really, does it matter what you use as long as it is, for the task at hand, easy to code, fast and efficient? For normal application development RavenDB is all of these things whereas SQL Server is not. That is not to say that relational DBs are bad - let's not blame the victim - I fully acknowledge that for certain specialised developments you may indeed need something a little more ... exotic, such as a relational datastore.<br />
<br />
But just in case you are still clinging to the wreckage, you need to ask yourself this: If we had all been using simple, fast, elegant datastores that closely fitted our needs as application developers, and then I wrote an opinion piece urging you to adopt SQL Server then you would either laugh at me or pity me, since that is the rational response. One thing is for sure - you would not swap to SQL Server. Thus we demonstrate that inertia alone is the guiding force that keeps SQL Server tacked into the mainstream. Inertia and ignorance. Inertia, ignorance and corporate greed ....</div>
biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com1tag:blogger.com,1999:blog-6927558.post-3694825355532072522012-05-09T14:03:00.000+01:002013-05-04T17:21:13.913+01:00Agile and Cost Breakdowns<br />
At the beginning of an Agile project it is often very tricky to provide a cost breakdown for individual items of work. This is because central to the Agile methodology is the demand that, at the start of a project, everybody admits they do not really know what details to expect! This is a brave thing to do. It is brave of a developer to admit that the implementation details of a proposed project are not clear. It is brave of a client to accept that they do not really know, in real-life detail, what it is they want. The payback for all this honesty is a much better chance that the project will be useful, delivered on time and delivered within budget.<br />
<br />
<a name='more'></a><br /><br />
That said, many clients require the sort of detailed costing that Agile does not supply. In response it has become common to adopt the following approach: each unit of work is assigned a number of points, where the more points a task carries the more difficult it is considered to be. This number encapsulates all the experience of the developers, their previous projects and their expectations for the current project, but it does not directly relate to cost. Instead, as the project unfolds, the developers keep careful track of the effort required to deliver each unit of work. This creates a relationship between the points and the actual, real-life cost of delivering those points for the current project under the current conditions. As the project continues, the mapping between the abstract points and the concrete costs becomes ever more accurate and useful. This is in direct contrast to other project management techniques where, as the project unfolds, the predictions made at the start become ever more inaccurate and useless.<br />
<br />
This still does not answer the client’s needs, however. They need a hard and fast prediction, an amount of cash per unit of work, a crystal ball that allows them to peer six months into the future and follow the convoluted, unpredictable, surprise-strewn trajectory of a hand-made, complex software artefact that, as yet, nobody can truthfully define in detail. As unfair as it may seem, the business demands an answer to the following question: “How much will it cost to build something you don’t really understand, given that you don’t know how long it is going to take and given that what is possible and what is desired will certainly change along the way?”. The answer has to be “I don’t know”; any other answer is a guess.<br />
<br />
Can we do better than guessing? Yes, a little. We know the total (estimated) number of points. From this we can generate a cost estimate, informed by experience, for a total cost expressed as a min->max range. A detailed breakdown of costs can now be derived (for what it is worth) by simply dividing the range values by the total number of points to give a min->max range per point. From this a cost value per unit of work can be estimated. For example:<br />
<br />
<span style="font-family: 'Courier New', Courier, monospace; font-size: x-small;">Total Points for Project = 100 </span><br />
<span style="font-family: 'Courier New', Courier, monospace; font-size: x-small;">Estimated Total Cost = 30K -> 45K </span><br />
<span style="font-family: 'Courier New', Courier, monospace; font-size: x-small;">Estimated Cost per Point = [30K / 100] -> [45K / 100]</span><br />
<span style="font-family: 'Courier New', Courier, monospace; font-size: x-small;"> = <b>£300</b> -> <b>£500</b> per point</span><br />
<br />
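The arithmetic above is simple enough to sketch in code. A few illustrative lines of Python (the function name and figures are just the worked example restated, not part of any real tool):

```python
def cost_per_point(total_points, min_cost, max_cost):
    """Spread an estimated total-cost range evenly across the points,
    giving a (min, max) cost for a single point."""
    return min_cost / total_points, max_cost / total_points

# 100 points, total estimated at 30K -> 45K
lo, hi = cost_per_point(100, 30_000, 45_000)
# lo, hi == (300.0, 450.0); a 5-point unit of work is then
# estimated at somewhere between 5 * lo and 5 * hi.
```

As the project delivers real points at real cost, the same division can be re-run against actuals to tighten the range.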
This estimate will be wrong but, because this is an Agile project, as time goes by the estimated costs will become steadily more accurate. This is good, because most of the cost overruns on a traditional product come near the end. In contrast, the final stages of an Agile project are the least risky and the most productive. This is when everybody finally understands what the project is about, the client knows what is possible, the developer knows what is required and given this shared knowledge, both realise there is more that should or can be done – budget allowing.biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com2tag:blogger.com,1999:blog-6927558.post-84108865607078874382010-04-02T14:43:00.036+01:002013-05-04T17:21:42.586+01:00Predicting complex, dynamic systems or why Stalin, Hitler and stock market analysts have failed usThe ability to predict a future event lies at the heart of what it is to be human. A hunter predicting the presence of animals at a waterhole can drastically optimise the chances of a successful kill. Empathetically predicting a rival's or ally's actions can determine the outcome in hierarchical social competition. Predicting in advance the consequences of leaving a small fire unquenched can save the lives of your family. The need to predict the consequences of complex, dynamic systems has been a primary driver of human evolution, both biological and technical.<br />
<br />
What does it mean to predict something? It is a simple question with a relatively straightforward answer but its misapplication can have profound and disturbing consequences. Mistaking the systems that can or cannot be usefully predicted is a core political and financial flaw of the last 100 years that has led to spectacularly negative consequences, from the rise of totalitarianism, both right and left, to financial bubbles whose froth still fizzes through our financial institutions today.<br />
<br />
<a name='more'></a><br /><br />
Predicting an event is easy - all you need is a model. Simply ensure the model's initial state resembles reality and run it for as long as you need. When the model stops, its final state is your prediction. <br />
<br />
If making a "model whose initial state resembles reality" sounds a bit complicated then, fortunately for us, all humans come with some pretty good models right out of the box. As babies we all quickly boot up an hard-wired physics engine in our heads. As your hand collides with a glass of water and you see it tip over the edge of the table it is a matter of a few trillion, near instantaneous yet autonomous calculations for you to know, with crashing certainty, the fate of the glass. There may be the odd slightly surprising variation, the glass may not break, it may teeter then just recover from the edge of catastrophe, but as sure as apples it will not fall half way to the floor, stop and then float back to its original position. Philosophers may argue that you can't tell what the glass will do until you try but evolution makes the assumption that gravity is universal and so we have evolved an extremely powerful, ultra-fast physical modelling system that allows us to determine the likely environmental effect of a given cause. Without this internal modelling ability we would be sea anemones, which is perhaps a little cruel to anemones who, with their reflex withdrawal of their tentacles when touched are also making cause and effect assumptions with associated energy expenditure cost / benefit ratios all encoded in their simple neural nets. It is almost safe to say that if something is alive then it is in the business of modelling reality because, to remain alive, the future must, to some extent, be predicted.<br />
<br />
Prediction is a fundamental property of life and so it is fortunate that making predictions for simple linear systems is really quite easy - the model must simply be able to run faster than reality. Imagine firing a cannon and then sitting down with pencil and paper to work out the distance the cannon ball will travel. If you know the equations (the model) and have practised the arithmetic beforehand then you could probably calculate the answer before the ball lands and so predict the future. If you have to first remember the equations from your school physics class, then labour over the arithmetic, the ball may land before you have finished your calculation and you may as well use a measuring tape to find the actual answer. What makes a dynamic prediction worthwhile is that it happens before the event itself, and for that to occur the model must be able to run faster than the reality it is modelling.<br />
<br />
For a model to run faster than reality it must be different from reality. If you were to model a portion of reality with absolute fidelity then it would be running at real-time speed. Thus a predictive model must take short cuts, it must round numbers up or down, it must miss out extraneous details and ignore apparently unconnected variables. A model is therefore <span style="font-style: italic;">compressed</span>; it is squeezed of its useless details to leave the essential, highly optimised set of relationships that describes a precise chain of cause and effect. The equations for the trajectory of a cannon ball work very well and give answers that are more than good enough, yet they do not include any information on the type, weight or nutritional content of the cannon manufacturer's favourite breakfast cereal. Not including this information saves valuable time and enables dynamic predictions about firing distances to be made before the actual event occurs.<br />
<br />
This is a very seductive idea. It would seem that in order to predict something accurately then all that is required is a sufficiently accurate model. Certainly Stalin and Hitler thought so. Interestingly, both Stalin and Hitler lived in Vienna at exactly the same time, just a few years before the outbreak of the First World War, and whilst there they both wrote about walking in the same park and how they enjoyed watching the spectacle of Emperor Franz Josef I clattering past in his gilded carriage surrounded by his glittering horse guards. From this imperial demonstration, and many other life experiences, they both drew the same lessons concerning the absolute centralisation of state power. They wanted to create a bureaucratic model of both a society and the economy upon which it depends; then, using this model, a tiny elite could efficiently predict and so command the real society and economy, thus satisfying the needs of the people. <br />
<br />
What was behind this revolutionary notion and was it coincidence that the idea seemed to ripen at just that time? In those hopeful, pre-war days it seemed as though science had amply demonstrated that careful observation allowed for the creation of genuinely accurate models, that these models then allowed for useful predictions and, finally, that those predictions allowed complicated systems, such as manufacturing processes, to be accurately and profitably controlled. Empires were forged on the back of this assumption and both Stalin and Hitler simply extended it to come to an identical conclusion: that with enough observations, card indexing and information, an entire economy and an entire society could be usefully modelled and so predicted and so controlled - scientifically - from the top down. <br />
<br />
After leaving Vienna they both set about creating a compressed model of their societies. Vast armies of bureaucrats and technicians set about gathering and collating data to create a huge, hive-like interactive model of the economy and the society. The human cogs in the state calculating machine churned their way through mountains of detail. They gathered data from the secret police, the shop floor and factory gate to feed the model and so generate predictions for future consumption. These would then become production targets. The targets would be met or not, the differences observed and fed back into the model, future targets created - and so the whole system would hum along in a dynamic, cybernetic balance freeing up the workers to enjoy a völkisch idyll or a glorious dictatorship-of-the-proletariat depending on your taste in utopian fantasy. <br />
<br />
Were the great political and economic modellers of the 20th Century a success? No, the goddess of the eternal court of history did not acquit them. Their models failed, their societies required repression to keep the elites in power and dystopia reigned - but why? The historically accurate answer is, as you would expect, complex and multi-stranded but at its heart lies the misapplication of predictive modelling, specifically a missing piece of the explanatory jigsaw: the notion of chaotic systems. It is not that the science of the early 20th Century got it fundamentally wrong. Modelling does allow predictions and predictions do enable the exertion of control, however this process cannot be usefully applied to every complex system. There exists a certain type of complex, dynamic system that is not amenable to prediction and it is these systems that today we label as <span style="font-style: italic;">chaotic</span>.<br />
<br />
Consider a system that is not chaotic. Such a system guarantees that for every observable cause there will be an equivalent observable effect and that cause and effect scale together in a linear fashion. These systems can certainly be modelled and therefore predicted. Firing a cannon ball at a certain velocity will cause it to travel a certain distance. The distance the ball travels scales well with the initial muzzle velocity: the faster it is travelling, the further it will go. <br />
<br />
Now imagine a cannon where this cause and effect linkage did not scale in a simple linear fashion. Firing the cannon with a little gunpowder causes the ball to dribble out of the muzzle (as expected), a grain or two more powder and the ball sails into the heavens (a shock), add a bucket load more and the ball lands just a few feet away (profoundly disturbing). In a non-linear system the results are not random but the primary cause, in this case the amount of gunpowder, cannot be used to usefully predict the primary effect, the distance travelled by the cannon ball. This is because non-linear systems are sensitive to much more than their primary cause. The results they produce are combinations of many subtle interactions that feed back into each other, amplifying some minuscule causes and attenuating other gross causes to create predictive confusion. In the case of the non-linear cannon it may be that the temperature of the barrel affects the trajectory and therefore, without realising it, the amount of gunpowder used in the previous firings is feeding back into the performance of the current firing, causing complex results. Eliminate this variable and you discover that another subtle set of interactions takes its place, and another, and another. The system is perfectly rational and explicable; it's just that the amount of detail required to usefully model it is intractable. <br />
<br />
Not all non-linear systems are chaotic, some may just be complex, and the recent rise of staggering computational power has allowed more of these complex systems to be usefully predicted. What defines a truly chaotic system is its extreme, perhaps infinite, sensitivity to any change in its state. Modelling such a system now becomes officially impossible because, as we have seen, all models are short cuts. A model must be compressed in order to be predictive and, since it is compressed, it is different from the reality it is modelling. Yet for a chaotic system <span style="font-style: italic;">any</span> deviation from reality will cause an unpredictable outcome. Surely, though, it is just a matter of the degree of precision? If the model is not working because it is imprecise then simply add more precision. This is the basic flaw ignored by all experts who purport to predict and control complex, dynamic systems such as economies and societies.<br />
<br />
To give a fundamental example, a model cannot represent a number to infinite precision; at some stage a model's numerical values must be rounded either up or down. Applied to a model of a chaotic system, this deviation from reality is deadly. The rounding error, no matter how infinitesimal, will be churned through internal feedback loops within the system, amplified and attenuated. It is like taking a nice round lump of pizza dough and drawing two spots on it side by side. Now stretch the dough (amplification) and then fold it back on itself (attenuation), stretch and fold. The dough never breaks, the system is never disjointed, but after a few cycles where are the two spots? Just about the only thing you can predict is that they are not still side by side.<br />
<br />
Now imagine a number held to two digits after the decimal point, say 1.23, and multiply it by ten: the result must be written 12.3X, and the model has to supply the new final digit X. Is it zero or some other digit? There is nothing special about zero; it is just a guess at the true value with a one in ten chance of being correct. Now divide 1.23 by 100 to give 0.01 and then multiply by 100 to give 1.00: the trailing digits have been destroyed and information is lost. This is the same as stretching and folding the dough. Positive and negative feedback loops in a model constrained by finite precision must lead to information generation and information destruction, and since all models must enforce a finite precision, the stretching and folding of finite precision numbers must perturb a model's trajectory through the space of all possible results and cause it to veer away from reality.<br />
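This stretch-and-fold amplification of rounding error is easy to demonstrate. The sketch below is illustrative (it is not from the original post): it iterates the logistic map, a textbook chaotic system, from two starting values that differ only beyond the sixth decimal place, and the trajectories soon bear no resemblance to each other.

```python
# Iterate the logistic map x -> r*x*(1-x) from two starting values that
# differ only past the sixth decimal place and track the growing gap.
r = 4.0
x_full = 0.123456789
x_rounded = round(x_full, 6)  # the model's finite precision: detail beyond
                              # six decimal places is rounded away

gaps = []
for step in range(50):
    x_full = r * x_full * (1 - x_full)
    x_rounded = r * x_rounded * (1 - x_rounded)
    gaps.append(abs(x_full - x_rounded))

print(f"initial gap: {abs(0.123456789 - round(0.123456789, 6)):.1e}")
print(f"largest gap in the last 20 steps: {max(gaps[-20:]):.3f}")
```

However many extra decimal places you keep, a longer run restores the divergence; extra precision only buys time, as the text argues.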
<br />
It must be appreciated that upping the precision of a model cannot solve this problem. It simply does not matter how small the error is, the feedback loops will eventually inflate it and so cause the model to spiral away into an alternative, non-useful, reality. The more accurate you make the model, the more useful time you buy, but always and eventually the model must deviate from reality simply because it is a finite model and not the infinitely detailed reality it pretends to represent - the map is not the terrain. <br />
<br />
This then is the reason top-down, command economies always fail and why Stalin and Hitler are not revered today as our economic and social saviours. The reality of an economy or society is a web of interlinked interactions within which feedback loops, both positive and negative, nested and isolated, exist in rampant abundance. This means that models of such systems, with their inevitable short cuts, must ultimately fail in their predictions. A chaotic system cannot be controlled by commands from the top because the models required to generate the commands cannot be compressed without radically disconnecting their predictions from reality. This is the lesson that the totalitarians of the 20th Century have bequeathed to us. Have we taken it to heart?<br />
<br />
The recent financial crash has shown that, at least in the realm of predicting the stock market, we have not. It is actually trivially simple to make money from the stock market, a sure-fire thing, and here is how you do it. Pick a global company with an instantly recognisable name whose shares are traded daily. Now send off 1024 spam emails, half predicting that the stock price will rise and half predicting it will fall. Wait a week and send out 512 mails to the people who received your previous correct prediction, as before half predict up, half predict down. Repeat the process for the 256 recipients who have now received two correct predictions, then again for the 128 who have now received three correct predictions, 64 for four correct predictions, 32 for five, 16 for six, 8 for seven, 4 for eight correct predictions. These last four will think either that you are a magician or that you have insider knowledge. They may even have already made themselves very rich, entirely legitimately, from your astonishing ability to predict the market. You now ask the last four to send you £1000 for your final prediction with the promise that, if you get it wrong, you will pay them back - the recipients simply cannot lose. You receive £4000 and send back £2000. You make £2000 and walk away. Rinse and repeat.<br />
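The arithmetic behind the scheme is nothing more than repeated halving, as this illustrative sketch shows:

```python
# Each round, half the mails predict "up" and half "down"; whatever the
# market actually does, exactly half the recipients receive a correct
# prediction and stay in the pool for the next round.
recipients = 1024
rounds = 0
while recipients > 4:
    recipients //= 2
    rounds += 1

print(rounds, recipients)  # after 8 rounds, 4 people have seen 8 correct calls
```

No market knowledge is involved at any point; the scheme simply manufactures a small group for whom chance alone looks like prophecy.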
<br />
If that sounds far fetched then you have never worked in an investment bank. This is pretty much how they make their money. They simply employ enough traders and analysts so that some of them are right for long periods of time and so look like they can predict the system. There will always be more than enough traders who are currently on a lucky run, so investors can be deluded into thinking they are magicians or, in our scientific times, have some esoteric, possibly algorithmic system that gives them the ability to predict the movements of the market. Of course they cannot predict those movements no matter how often their predictions turn out to be correct. They cannot predict the chaotic market, for all the reasons we have just seen about finite precision models attempting to simulate an incompressible, chaotic system, yet they appear able to for the same reason the spam trick works. <br />
<br />
The global economic system is the very definition of a chaotic, complex, dynamic system and to think that it can be modelled and so predicted is our 21st Century hubris that simply invites Nemesis to do her worst - and so she does - conjuring for our arrogance stock market analysts with their liquid promises just as she conjured, in the last century, Stalin and Hitler with their utopian visions. <br />
<br />
In short - <span style="font-style: italic;">Beware the prophet bearing finite precision models</span>.biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com0tag:blogger.com,1999:blog-6927558.post-77100863575599694842009-09-07T15:19:00.053+01:002013-05-04T17:21:59.432+01:00An Introduction to Applied Evolutionary Metaheuristics<div>
<b>Jonathan Anderson</b></div>
<div>
<br /></div>
First delivered by me at "Selected Topics on Complex Systems Engineering" an international symposium held at Morelia, Mexico in October 2008. It was subsequently published in the <i>European Journal of Operational Research : Applications of metaheuristics</i><br />
<br />
<a href="http://docs.google.com/present/view?id=dhck8ddg_132fs45f2ff"><b>View slide show</b></a><br />
<h4>
Abstract</h4>
This paper introduces some of the main themes in modern evolutionary algorithm research while emphasising their application to problems that exhibit real-world complexity. Evolutionary metaheuristics represent the latest breed of biologically inspired computer algorithms that promise to usefully optimise models that display fuzzy, complex and often conflicting objectives. Until recently, evolutionary algorithms have circumvented much of this complexity by defining a single objective to be optimised. Unfortunately nearly all real-world problems do not compress neatly to a single optimisation objective especially when the problem being modelled is non-linear. Recent research into multi-objective evolutionary metaheuristic algorithms has demonstrated that this single-objective constraint is no longer necessary and so new opportunities have opened up in many fields including environmental health and sustainability.<br />
<div>
<br /></div>
<div>
With their proven ability to simultaneously optimise multiple, conflicting objectives, evolutionary metaheuristics appear well suited to tackle ecological problems. Such algorithms deliver a range of optimal trade-off solutions that allow an appropriate profit / cost balance to be selected according to the decision maker's imperatives. This paper concludes with an examination of a powerful multi-objective evolutionary algorithm called IC-SPEA2 (Martínez-García & Anderson, 2007) and its application to a real world problem namely the maximisation of net revenue for a beef cattle farm running on temperate pastures and fodder crops in Chalco, Mexico State. Some counter-intuitive results and their impact on the farm's overall sustainability are discussed.<br />
<a name='more'></a><br />
<h4>
What is a Metaheuristic?</h4>
A heuristic is a 'rule of thumb': an algorithm that provides a solution to a problem without considering whether the solution is formally optimal but which will, nonetheless, tend to be good enough for real-world application.<br />
<br />
<a href="http://1.bp.blogspot.com/_kbCCKz8F0ok/SqgXcH62dRI/AAAAAAAAAHA/g9WdDoVx2lQ/s1600-h/good_enough_3.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5379575526823720210" src="http://1.bp.blogspot.com/_kbCCKz8F0ok/SqgXcH62dRI/AAAAAAAAAHA/g9WdDoVx2lQ/s320/good_enough_3.jpg" style="cursor: hand; cursor: pointer; height: 315px; width: 313px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 1</b>: The law of diminishing returns indicates that perfection is not a realistic goal.</span><br />
<br />
A metaheuristic is an algorithm that synthesises two or more heuristics into a single compound algorithm. A metaheuristic algorithm is therefore a heuristic that relies on other heuristics, either as sub-components or outsourced to black-box functions.<br />
<br />
An example of a metaheuristic is found in neural networks. Researchers found that manually training neural networks was inefficient (Alba &amp; Marti, 2006). This issue has been addressed by employing search heuristics, including evolutionary algorithms, to discover appropriate training regimes for the networks (Alba et al). The compound algorithm that results from the synthesis of a neural network and an evolutionary search heuristic is a metaheuristic algorithm.<br />
<h4>
No Free Lunch</h4>
It may seem as though metaheuristics have the potential to create a general or universal search / optimisation mechanism to solve complex problems for which no problem-specific heuristic currently exists. However a theoretical stumbling block stands in the way of this goal.<br />
<br />
The No Free Lunch Theorem states that: "any two algorithms are equivalent when their performance is averaged across all possible problems" (Wolpert &amp; Macready 1995).<br />
<br />
This implies that for any optimisation algorithm, gaining additional performance over one class of problems is exactly paid for in performance over another class. A universal problem solver is therefore not possible.<br />
<br />
Despite this lack of universality metaheuristics remains a vibrant topic of research with many exciting approaches under investigation:<br />
<ul>
<li>Simulated annealing</li>
<li>Ant colony optimization</li>
<li>Harmony search</li>
<li>Evolutionary algorithms</li>
</ul>
<h4>
What is an Evolutionary Algorithm?</h4>
Being alive is a very complex objective, perhaps the most complex there is, and yet solutions to this problem exist in astonishing variety and subtlety.<br />
If we cast biology in the role of an optimisation strategy then we can say that biology seeks optimal solutions to the problem of being alive. If, as computer scientists, we could unlock the process by which this most complex of combinatorial-optimisation problems is successfully solved then we would surely have found a powerful heuristic tool.<br />
The process by which biological complexity emerges is Evolution by Natural Selection. In his book “The Blind Watchmaker”, professor Richard Dawkins provides a precise definition of this algorithmic process, precise enough for eventual translation into computer code:<br />
"<i>Natural Selection is the result of the non-random replication of randomly varying replicators</i>" (Dawkins. 1986)<br />
Natural selection discovers optimal replicators via iterative replication, random variation and environmentally guided selection. If being alive is the problem then, according to Dawkins, the non-random replication of randomly varying replicators is the solution. An evolutionary algorithm is simply this definition converted directly into computer code.<br />
<h4>
A General Evolutionary Algorithm</h4>
<ol>
<li><i>Initialisation</i><br />First an initial population must be created. This population will contain replicators whose characteristics have been randomly generated.<br /><br /><a href="http://4.bp.blogspot.com/_kbCCKz8F0ok/SqgXA4TilII/AAAAAAAAAG4/Vklrl-XlqXk/s1600-h/general_ea_2.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5379575058775839874" src="http://4.bp.blogspot.com/_kbCCKz8F0ok/SqgXA4TilII/AAAAAAAAAG4/Vklrl-XlqXk/s320/general_ea_2.jpg" style="cursor: hand; cursor: pointer; height: 320px; width: 233px;" /></a><br /><span class="Apple-style-span" style="font-size: small;"><b>Figure 2</b>: The main loop at the heart of every evolutionary algorithm.</span><br /></li>
<li><i>Fitness assignment</i><br />Each replicator must be evaluated and assigned a fitness value according to a problem-specific definition of fitness.<br /></li>
<li><i>Environmental selection</i><br />A subset of the fittest replicators is selected to be used as the breeding stock when breeding a new population of child replicators. The selection process is deterministic and based on the problem specific fitness values assigned previously.<br /></li>
<li><i>Termination</i><br />If the stopping condition is met then stop.<br /></li>
<li><i>Breeding and Variation</i><br />Breed the fittest to create a new population of child replicators. Breeding mainly involves the mixing of parent characteristics, with multiple parents producing a single child, plus an additional chance of a random mutation slightly altering randomly selected child characteristics.<br /></li>
<li><i>Go to Fitness Assignment</i></li>
</ol>
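The loop above can be condensed into a short, self-contained sketch. This is illustrative rather than taken from the paper: it evolves binary genomes against a deliberately trivial fitness function, the count of 1s in the genome (commonly called OneMax), purely to make the initialise / evaluate / select / breed cycle concrete. The termination test here is simply a fixed generation count, and all parameters are made up.

```python
import random

GENOME_LEN, POP_SIZE, GENERATIONS, MUTATION_RATE = 32, 40, 60, 0.02

def fitness(genome):
    # problem-specific fitness assignment: here simply the number of 1s
    return genome.count("1")

def breed(mum, dad):
    # one-point crossover plus a small per-character chance of mutation
    locus = random.randrange(1, GENOME_LEN)
    child = mum[:locus] + dad[locus:]
    return "".join(
        ("1" if c == "0" else "0") if random.random() < MUTATION_RATE else c
        for c in child
    )

# 1. Initialisation: replicators with randomly generated characteristics
population = ["".join(random.choice("01") for _ in range(GENOME_LEN))
              for _ in range(POP_SIZE)]

for generation in range(GENERATIONS):
    # 2 & 3. Fitness assignment and environmental selection: keep the fittest half
    breeding_stock = sorted(population, key=fitness, reverse=True)[:POP_SIZE // 2]
    # 5. Breeding and variation: refill the population from the breeding stock
    population = [breed(random.choice(breeding_stock), random.choice(breeding_stock))
                  for _ in range(POP_SIZE)]

best = max(population, key=fitness)
print(fitness(best), "of", GENOME_LEN)
```

With these parameters the population reliably climbs from an average of around half 1s towards an all-1s genome within a few dozen generations.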
Over many generations the population will efficiently search the replicator state-space, the n-dimensional set of possible replicator states where n = the number of mutable replicator characteristics, producing replicators with ever increasing fitness (Goldberg 1989).<br />
<br />
Another way of conceiving the evolutionary process is to imagine a complex process deconstructed into a set of rules that together make up a model of that process. The purpose of the evolutionary algorithm is therefore to optimise this model. A model will have many variables that together define its current state at any given point in time. These are usually called the model's decision values. A collection of decision values represents the variability between different instances of a model.<br />
<br />
When optimising a model certain characteristics are selected to be the targets of optimisation. These are usually called the objective values. The purpose of the model is to convert decision values into objective values according to the model's internal rules. The objective values are then used by the evolutionary algorithm to calculate a fitness value. Thus a replicator is a set of decision values with an associated set of objective values. The model is the set of environment rules within which the replicator finds its physical expression. The evolutionary algorithm is the set of rules that evaluates, selects and breeds future generations of replicators ready for their brief flicker of life within the model.<br />
<br />
Replicators consist of a series of values, analogous to genes, and so can be conveniently encoded as a set of characters, or string, which is analogous to the genome. Often binary is used to encode the replicator values and so the genome will consist of a long string of binary '1' and '0' characters. This makes it relatively easy to manipulate the genome, breaking it into pieces and splicing those pieces back together using standard string handling techniques common to most computer languages, for example during breeding, when parental values are mixed together to create a novel child using a mechanism called crossover.<br />
<h4>
The Crossover Mechanism</h4>
During breeding the characteristics of two selected parents are mixed together using a mechanism called crossover to create a novel child. The selection process depends on the fitness values of the parents where the fitter the parent the more likely it is to be selected. This means that the average fitness of the population tends to increase over the generations.<br />
<ul>
<li>Select two parents</li>
<li>Pick a locus somewhere on the parental genome</li>
<li>Split both the parent's genomes at the selected locus</li>
<li>Take the first portion of the first parent's genome and join it to the second portion of the second parent's genome to create a new child genome</li>
</ul>
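Those four steps translate almost directly into string-slicing code. A minimal sketch (the parent genomes are made up for illustration):

```python
import random

def one_point_crossover(parent_a, parent_b):
    # pick a locus, split both genomes there, and join the head of the
    # first parent to the tail of the second to create the child genome
    locus = random.randrange(1, len(parent_a))
    return parent_a[:locus] + parent_b[locus:]

child = one_point_crossover("11111111", "00000000")
print(child)  # a run of 1s followed by a run of 0s, e.g. "11100000"
```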
<a href="http://2.bp.blogspot.com/_kbCCKz8F0ok/SqUmnZbVfaI/AAAAAAAAAGA/TM21BW9do78/s1600-h/dhck8ddg_124czz4w4fr_b.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5378747788246220194" src="http://2.bp.blogspot.com/_kbCCKz8F0ok/SqUmnZbVfaI/AAAAAAAAAGA/TM21BW9do78/s400/dhck8ddg_124czz4w4fr_b.png" style="cursor: hand; cursor: pointer; height: 158px; width: 300px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 3</b>: Illustration of the crossover mechanism that recombines parental characteristics to create novel child combinations.</span><br />
<div>
<br />
The mixing of parental values does not affect the values themselves but it does create novel combinations of pre-existing values. This means that short combinations of values which happen to result in higher fitness are preserved between generations. During crossover multiple, high fitness values are moved as a single unit or schema (Goldberg. 1989) between parents and their offspring.</div>
<div>
<h4>
Implicit Parallelism</h4>
As a result of selection and crossover, fit building blocks or schemata are propagated from generation to generation. Each population member thus provides multiple points at which the state-space is being sampled and tested. Since a population has many members and each member is sampling the state-space with many schemata, the resulting search is implicitly parallel (Goldberg 1989).<br />
<br />
This multiple parallel sampling of the state-space is a unique feature of evolutionary algorithms and it dramatically leverages the efficiency of the search without requiring any special book keeping, processor or memory overheads.<br />
<br />
When attempting to solve NP-hard problems, increasing the number of values that define a replicator exponentially increases the size of the state-space to be searched (NP-hard is a class of problem whose solution time is believed to grow exponentially with problem size). Evolutionary algorithms display a remarkable insensitivity to the inflation of their target state-space. This insensitivity is due to implicit parallelism. As the number of replicator variables grows then so must the length of the string used to encode them. This simply creates more schemata and so increases the degree of implicit parallelism. In this way an evolutionary algorithm leverages the inflation of the state-space to amplify the effects of implicit parallelism and so mitigates the effects of state-space inflation.<br />
<h4>
Single Objective Evolutionary Algorithms</h4>
Single objective problems are problems whose objectives can be collapsed or aggregated into a single overall objective to be maximised or minimised. A standard NP-hard benchmark single objective problem is the travelling salesperson problem or TSP, which states: given a number of cities and the distance from any city to any other city, what is the shortest round-trip route that visits each city exactly once and then returns to the starting city? Exact, non-heuristic algorithms will give precisely the shortest possible route. For example, various branch-and-bound algorithms can be used to process TSPs containing up to 60 cities, whereas progressive improvement algorithms work well for up to 200 cities (Dekker 2008).<br />
<br />
Evolutionary algorithms do not necessarily return the shortest route but they do promise to return a route that is sufficiently short to be useful. The payback for accepting this heuristic uncertainty is the insensitivity to state-space inflation. Evolutionary algorithms can usefully process TSPs up to 100,000 cities and beyond (Dekker 2008).<br />
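As an illustrative sketch (the city coordinates are randomly generated, not drawn from Dekker), a TSP replicator can be encoded as a permutation of city indices, with fitness derived from the round-trip length the permutation describes:

```python
import math
import random

# made-up problem instance: 100 cities scattered on the unit square
cities = [(random.random(), random.random()) for _ in range(100)]

def route_length(route):
    # total round-trip distance: visit every city once, return to the start
    return sum(math.dist(cities[route[i]], cities[route[(i + 1) % len(route)]])
               for i in range(len(route)))

# an evolutionary algorithm would breed and mutate permutations like this,
# treating shorter routes as fitter replicators
route = list(range(len(cities)))
random.shuffle(route)
print(f"random route length: {route_length(route):.2f}")
```

Because the genome is a permutation, crossover and mutation operators must preserve the visit-each-city-once property, which is why permutation-specific operators are used for TSPs in practice.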
<br />
Single-objective evolutionary algorithms allow single objective problems to be usefully solved. By contrast, multi-objective problems do not resolve to a single global optimum but instead to a potentially large set of trade-off solutions. Finding this set of optimal trade-offs requires a more powerful class of evolutionary algorithm.<br />
<h4>
What is a Multi-Objective Problem?</h4>
A multi-objective problem is characterised by having two or more conflicting objectives. If objectives do not conflict then they can always be aggregated back into a single compound objective. It is the conflict between different objectives that defines a multi-objective problem and creates the additional complexity that defeats the single-objective evolutionary algorithm approach.</div>
<div>
<br />
Multi-objective problems can be found in various fields:<br />
<ul>
<li>Product and process design</li>
<li>Finance</li>
<li>Aircraft design</li>
<li>Oil and gas industry</li>
<li>Vehicle design</li>
</ul>
</div>
<div>
Or wherever decisions are taken where trade-offs exist between two or more conflicting objectives:<br />
<ul>
<li>Maximising performance and minimising fuel consumption</li>
<li>Maximising strength and minimising weight</li>
</ul>
<br />
<a href="http://2.bp.blogspot.com/_kbCCKz8F0ok/SqgZuh-9lNI/AAAAAAAAAHQ/VxN0TVy2ZIA/s1600-h/strength_weight_1.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5379578042081187026" src="http://2.bp.blogspot.com/_kbCCKz8F0ok/SqgZuh-9lNI/AAAAAAAAAHQ/VxN0TVy2ZIA/s320/strength_weight_1.jpg" style="cursor: hand; cursor: pointer; height: 294px; width: 320px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 4</b>: A real world evolutionary solution to the strength versus weight contradiction.</span><br />
<br />
Multi-objective problems cannot be resolved to a single solution that simultaneously optimises all of the objectives. As soon as two or more conflicting objectives are present then the single global optimum solution disappears to be replaced by a set of non-dominated or Pareto-optimal solutions.<br />
<br />
A solution is considered Pareto-optimal if every objective has been optimised to such an extent that attempting to further optimise one of the objectives will result in the degradation of the other objectives. Therefore every Pareto-optimal solution represents an optimal trade-off between the conflicting objectives. Since there are many combinations of objective trade-off there are also many different Pareto-optimal solutions.<br />
<br />
The members of a Pareto-optimal set cannot be distinguished in terms of optimality but they can be distinguished in terms of their perceived utility. In order to make a choice between Pareto-optimal solutions external information must be employed. This information can come directly from a human decision maker via a software interface or by encoding the known preferences of a human decision maker as a set of post-optimisation rules.<br />
<br />
Solving problems with multiple conflicting objectives is challenging. The simplest approach is to construct a single aggregate objective function by assigning each objective a scalar weight and combining the weighted objectives into a single function that can be solved by any single-objective optimisation algorithm (Messac, Ismail-Yahaya &amp; Mattson, 2003). The problem with this approach is that the globally optimal solution obtained depends strongly on the values of the arbitrary objective weights. If a higher weight is specified for one objective relative to the others then the optimal solution will be one that favours that objective over the others. Solutions obtained using the weighted sum are always Pareto-optimal, but establishing a meaningful combination of objective weightings can be very challenging (Messac et al).<br />
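A minimal sketch of the weighted-sum approach (the candidate solutions and weights are invented for illustration) shows how the 'optimal' solution swings with the arbitrary choice of weights:

```python
# (illustrative data) three candidate designs trading strength against weight
solutions = [
    {"name": "light", "strength": 2.0, "weight_kg": 1.0},
    {"name": "balanced", "strength": 5.0, "weight_kg": 4.0},
    {"name": "strong", "strength": 9.0, "weight_kg": 9.0},
]

def aggregate(sol, w_strength, w_weight):
    # maximise strength, minimise weight: the weight term enters negatively
    return w_strength * sol["strength"] - w_weight * sol["weight_kg"]

# the "optimal" design changes with the arbitrary choice of weights
favouring_strength = max(solutions, key=lambda s: aggregate(s, 0.9, 0.1))
favouring_lightness = max(solutions, key=lambda s: aggregate(s, 0.1, 0.9))
print(favouring_strength["name"], favouring_lightness["name"])  # strong light
```

Each choice of weights picks out one Pareto-optimal solution, but nothing in the problem itself says which weighting is the meaningful one.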
<h4>
Multi-Objective Evolutionary Algorithms</h4>
Evolutionary algorithms are well-suited for optimisation problems involving several conflicting objectives. A multi-objective evolutionary algorithm simultaneously optimises multiple conflicting objectives for a given problem, resulting in a Pareto-optimal set of solutions where each solution represents an optimal trade-off between the conflicting objectives. Although various evolutionary approaches to multi-objective optimisation are capable of searching for multiple solutions concurrently in a single run, in general the discovery of the Pareto-optimal set of solutions involves two implicit goals (Zitzler, Laumanns &amp; Thiele, 2001):<br />
<ol>
<li>Minimise the distance from the evolving Pareto front to the undiscovered Pareto-optimal set.</li>
<li>Maximise the diversity of the generated solutions.</li>
</ol>
<h4>
Minimise the Distance</h4>
The set of intermediate solutions generated as the algorithm progresses is called the Pareto front. It is an n-dimensional surface (where n is the number of objectives) whose shape evolves through state-space as the algorithm discovers ever more optimal solutions (Zitzler et al)<br />
<br />
Over time the shape of the Pareto front approaches the shape of the Pareto-optimal set or the set of solutions that fully dominates all other possible solutions (Zitzler et al). One solution is said to fully dominate another if each of its objective values is superior (smaller if the objective is being minimised or greater if the objective is being maximised).<br />
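The dominance test is as simple in code as in prose. A minimal sketch, assuming every objective is being minimised:

```python
def fully_dominates(a, b):
    # solution a fully dominates solution b when every one of a's
    # objective values is superior (smaller, since all are minimised here)
    return all(x < y for x, y in zip(a, b))

print(fully_dominates((1.0, 2.0), (3.0, 4.0)))  # True
print(fully_dominates((1.0, 5.0), (3.0, 4.0)))  # False: a trade-off, so
                                                # neither solution dominates
```

The second pair illustrates the essence of a multi-objective problem: when each solution beats the other on a different objective, neither dominates and both belong on the front.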
<br />
To guide the evolving Pareto front, multi-objective algorithms can employ a technique called elitism whereby the best solutions found to date are retained in a secondary population called the archive. An elite solution is replaced in the archive when another solution is discovered that fully dominates it. This avoids the population being dominated by solutions that improve one objective at the expense of another.<br />
<h4>
Maximise the Diversity</h4>
As the algorithm progresses the elite members of the archive tend to converge on a series of points along the non-dominated front creating clusters of solutions. Such clusters indicate a lack of diversity within the emerging Pareto front. Solution diversity is important as rarefied areas of the front will remain poorly investigated with the potential to miss interstitial solutions that may be good enough for application in the real world.<br />
<br />
To prevent this gradual loss of diversity modern algorithms tend to incorporate a pruning mechanism whereby solutions that lie too close to their neighbours are removed or pruned to leave room for additional novel solutions. The distance between solutions is calculated in terms of the position in the multi-dimensional state-space with solutions that appear in dense clusters identified and removed (Zitzler et al).<br />
<br />
<a href="http://3.bp.blogspot.com/_kbCCKz8F0ok/SqUnC-8Yd2I/AAAAAAAAAGI/CjHmkNztGqM/s1600-h/dhck8ddg_125wcbqb6hj_b.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5378748262173407074" src="http://3.bp.blogspot.com/_kbCCKz8F0ok/SqUnC-8Yd2I/AAAAAAAAAGI/CjHmkNztGqM/s400/dhck8ddg_125wcbqb6hj_b.png" style="cursor: hand; cursor: pointer;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 5</b>: The pruning mechanism. The non-dominated set is shown on the left. The centre indicates those solutions selected for removal. The final, pruned set is shown on the right. This figure has been adapted from Zitzler et al (p. 8).</span><br />
<br />
The pruning mechanism needs to be carefully designed if good solutions, especially good outrider solutions lying at the edges of the Pareto front, are not to be discarded. The goal is to maintain solution diversity by maximising the spread of solutions across the Pareto front with each solution evenly spaced with respect to its neighbours (Zitzler et al).<br />
<h4>
Stopping the Algorithm</h4>
Due to the intractability of NP-hard, real-world problems the globally optimal Pareto set is typically unknown. This implies that it is impossible to know when the optimisation process should end since there is no metric to determine whether the current Pareto set is the unknown global optimum set. However this is a common feature of heuristic algorithms and is usually resolved by accepting that solutions only need to be good enough and not necessarily optimal in the formal sense.<br />
<h4>
What is a Constraint?</h4>
All real world problems are constrained by physical limits. Solutions that lie outside of these limits are infeasible. Constraints model the physical limits of a problem and therefore a constraint defines a component of solution feasibility. If a solution violates one or more of its constraints then the solution is considered infeasible (Wright J. A., Loosemore H. 2001).<br />
<br />
<a href="http://2.bp.blogspot.com/_kbCCKz8F0ok/Sqgd6KavBwI/AAAAAAAAAHo/U87A8jFs1PI/s1600-h/constraint.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5379582639960164098" src="http://2.bp.blogspot.com/_kbCCKz8F0ok/Sqgd6KavBwI/AAAAAAAAAHo/U87A8jFs1PI/s320/constraint.jpg" style="cursor: hand; cursor: pointer; height: 320px; width: 313px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 6</b>: Constraints are the rules that bound the feasible set of solutions. All solutions that do not satisfy the constraints are considered infeasible.</span><br />
<br />
Where advanced knowledge of the problem domain is available it is often possible to define constraints and so confer upon a solution a measure of feasibility. For example, the Travelling Salesperson Problem typically does not consider the needs of the salesperson; in real life, however, a salesperson will have a physical limit to the distance they can travel before they must return home. In other words there is an upper limit to the route length and any route that exceeds this length must be considered infeasible. The longer a route is over the upper limit, the greater the degree to which the constraint is violated. This allows infeasible routes to be compared and ranked according to their degree of infeasibility.<br />
<h4>
The Problem with Constraints</h4>
Since infeasible solutions will vastly outnumber feasible solutions, throwing away infeasible solutions as they are discovered forces the algorithm to spend most of its time randomly walking through the state-space trying to find a population of feasible solutions to get the ball rolling. The algorithm is being presented with a cliff-face to climb at the outset of any run.<br />
<br />
A more effective approach is to provide a mechanism whereby the algorithm can evolve its way from infeasibility to feasibility, converting the cliff-face into a gradual incline. A good way to create this gradual slope is to add an infeasibility objective to the problem (Wright et al). An infeasibility objective is a single measure of a solution's infeasibility, treated as an independent objective by a multi-objective evolutionary algorithm. The measure should represent both the number of active constraints and the extent to which each constraint is violated. A measure with these properties is the sum of the normalised constraint violation values for all violated constraints (Wright et al). The infeasibility objective is set to be minimised: the smaller the objective value, the more feasible the solution. For feasible solutions the infeasibility objective should equal zero.<br />
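A minimal sketch of such an infeasibility measure, assuming each constraint's violation can be normalised by a known upper bound (the normalisation bounds here are illustrative, not part of the Wright &amp; Loosemore formulation):

```python
def infeasibility_objective(violations, max_violations):
    """Single infeasibility measure: the sum of normalised constraint
    violation values over all violated constraints. Zero means the
    solution is feasible; smaller is better (minimised).

    violations     -- raw violation value per constraint (0 if satisfied)
    max_violations -- assumed normalisation bound per constraint
    """
    return sum(v / m for v, m in zip(violations, max_violations) if v > 0)

# A feasible solution scores zero; violating two of three constraints
# scores the sum of the two normalised violations.
assert infeasibility_objective([0, 0, 0], [10, 10, 10]) == 0
assert infeasibility_objective([5, 0, 5], [10, 10, 10]) == 1.0
```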
<br />
With the infeasibility objective in place it is not necessary for a given solution to be feasible in order for it to take part in the ongoing evolutionary process. This allows a population of infeasible solutions to be generated at the start of a run and used to slowly evolve towards a set of feasible solutions.<br />
<h4>
Applying an Evolutionary Metaheuristic</h4>
The hybrid metaheuristic algorithm IC-SPEA2 was first proposed by Martínez-García &amp; Anderson (2005). It consists of an infeasibility-constrained (IC) version of the Strength Pareto Evolutionary Algorithm 2 (SPEA2) (Zitzler et al).<br />
<br />
The core algorithm, SPEA2, is a powerful multi-objective evolutionary metaheuristic that evolves a set of initially infeasible solutions towards the feasible Pareto-optimal set using elitism, a fine-grained fitness assignment strategy and a sophisticated pruning mechanism for maintaining solution diversity without losing outriders.<br />
<br />
The state-space is constrained by the addition of an infeasibility objective that forces the algorithm to evolve and maintain a feasible set of solutions. At the end of each optimisation run the non-dominated Pareto-optimal set of solutions is presented to the human decision maker for evaluation and selection.<br />
<br />
<b>SPEA2 Main Loop</b> (Zitzler et al)</div>
<div>
<b><br /></b><span class="Apple-style-span" style="font-size: small;">Input: N population size<br />N̄ archive size<br />T maximum number of generations<br />Output: ND (non-dominated set)</span></div>
<div>
<br />
<i><b>Step 1</b>. Initialization</i><br />
Generate an initial population P0 and create the empty archive P̄0 = ∅. Set t = 0.<br />
<br />
<i><b>Step 2</b>. Fitness assignment</i><br />
Calculate fitness values of the individuals in Pt and P̄t.<br />
<br />
<i><b>Step 3</b>. Environmental selection</i><br />
Copy all non-dominated individuals in Pt and P̄t to P̄(t+1). If the size of P̄(t+1) exceeds N̄ then reduce P̄(t+1) by means of the truncation operator; otherwise, if the size of P̄(t+1) is less than N̄, fill P̄(t+1) with dominated individuals from Pt and P̄t.<br />
<br />
<i><b>Step 4</b>. Termination</i><br />
If t ≥ T or another stopping condition is satisfied then set ND to the set of decision vectors represented by the non-dominated individuals in P̄(t+1). Stop.<br />
<br />
<i><b>Step 5</b>. Mating selection</i><br />
Perform binary tournament selection with replacement<sup>3</sup> on P̄(t+1) in order to fill the mating pool.<br />
<br />
<i><b>Step 6</b>. Variation</i><br />
Apply crossover and mutation operators to the mating pool and set P(t+1) to the resulting population. Increment generation counter (t = t + 1) and go to Step 2.<br />
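The six steps above can be sketched in Python. This is a deliberately simplified illustration, not the published algorithm: real SPEA2 assigns fitness from strength values plus a k-th nearest-neighbour density estimate and truncates the archive with a distance-based operator, whereas this sketch uses a raw dominance count for fitness and random truncation. The four callback parameters are hypothetical problem-specific hooks.

```python
import random

def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all objectives minimised)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def spea2(evaluate, new_individual, mutate, crossover,
          pop_size=20, archive_size=10, max_gens=50):
    """Simplified sketch of the SPEA2 main loop."""
    population = [new_individual() for _ in range(pop_size)]  # Step 1
    archive = []
    for t in range(max_gens):
        union = population + archive
        # Step 2: fitness = how many members of the union dominate you
        counts = [sum(dominates(evaluate(o), evaluate(i)) for o in union)
                  for i in union]
        # Step 3: environmental selection -- keep non-dominated, pad or truncate
        nondom = [i for i, c in zip(union, counts) if c == 0]
        if len(nondom) > archive_size:
            archive = random.sample(nondom, archive_size)  # stands in for truncation
        else:
            ranked = sorted((c, n, i)
                            for n, (i, c) in enumerate(zip(union, counts)) if c > 0)
            archive = nondom + [i for _, _, i in ranked[:archive_size - len(nondom)]]
        # Step 5: binary tournament selection with replacement on the archive
        def tournament():
            a, b = random.choice(archive), random.choice(archive)
            ca = sum(dominates(evaluate(o), evaluate(a)) for o in union)
            cb = sum(dominates(evaluate(o), evaluate(b)) for o in union)
            return a if ca <= cb else b
        # Step 6: variation -- crossover then mutation fills the next population
        population = [mutate(crossover(tournament(), tournament()))
                      for _ in range(pop_size)]
    # Step 4: return the non-dominated set held in the final archive
    return [i for i in archive
            if not any(dominates(evaluate(o), evaluate(i)) for o in archive)]
```

For example, minimising the two conflicting objectives f1(x) = x² and f2(x) = (x − 2)² returns a set of mutually non-dominated trade-off solutions rather than a single optimum.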
<h4>
Testing the Algorithm: The 0/1 Knapsack Problem</h4>
The 0/1 knapsack problem is a classic test problem in combinatorial optimisation (Goddard). It seeks the best choice of essential equipment that can fit into one knapsack to be carried on a trip. Given a set of items, each with a weight and a profit value, determine the combination of items to pack into the knapsack so that the total weight is less than or equal to the knapsack's capacity and the total profit is as large as possible. The problem is called a “0/1” problem because each item must be entirely accepted or rejected; that is, you cannot sub-divide an item.<br />
<h4>
Problem Description</h4>
Given a knapsack with maximum capacity and a set of items where each item has some weight and profit value, what items should be packed into the knapsack to achieve the maximum profit for the minimum weight?<br />
<br />
<a href="http://3.bp.blogspot.com/_kbCCKz8F0ok/Sqgck7-9xfI/AAAAAAAAAHg/oYqqH3tYMxM/s1600-h/knapsack_2.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5379581175796712946" src="http://3.bp.blogspot.com/_kbCCKz8F0ok/Sqgck7-9xfI/AAAAAAAAAHg/oYqqH3tYMxM/s400/knapsack_2.jpg" style="cursor: hand; cursor: pointer; height: 299px; width: 400px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 7</b>: The knapsack test rig showing a perfect Pareto-optimal front curving through the 2D state-space. Note the lack of clumping. The red dots are solutions made infeasible by a constrained sack size.</span><br />
<br />
Problem Objectives:<br />
<ul>
<li>Maximise Profit</li>
<li>Minimise Weight</li>
</ul>
Since each item has both a positive profit and a positive weight, adding more items increases the profit, which is desired, but also increases the weight, which is not desired. Conversely, removing items decreases the weight, which is desired, but also decreases the profit, which is not desired. In other words, the objectives are in conflict.<br />
<br />
Conflicting objectives mean that no single knapsack represents the best possible combination of both weight and profit. Instead a range of non-dominated knapsacks, the Pareto-optimal set, is required to represent the full range of trade-offs between weight and profit, leaving the external human decision maker to choose according to their own preferences.<br />
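The trade-off can be made concrete with a brute-force sketch that enumerates every packing of a toy instance and keeps only the non-dominated ones. The item weights and profits below are made up for illustration; brute force is obviously only viable for tiny instances.

```python
from itertools import chain, combinations

def pareto_knapsacks(items, capacity):
    """Enumerate every feasible packing of a small 0/1 knapsack and keep
    the non-dominated (Pareto-optimal) ones: packings for which no other
    feasible packing is both no heavier and no less profitable, and
    strictly better on at least one of the two objectives."""
    # items: list of (weight, profit); a packing is a subset of item indices
    packings = chain.from_iterable(
        combinations(range(len(items)), r) for r in range(len(items) + 1))
    feasible = []
    for p in packings:
        w = sum(items[i][0] for i in p)
        v = sum(items[i][1] for i in p)
        if w <= capacity:
            feasible.append((w, v, p))
    def dominated_by(a, b):
        # True if packing b dominates packing a (minimise weight, maximise profit)
        return b[0] <= a[0] and b[1] >= a[1] and (b[0] < a[0] or b[1] > a[1])
    return [a for a in feasible if not any(dominated_by(a, b) for b in feasible)]

items = [(2, 3), (3, 4), (4, 5), (5, 8)]  # hypothetical (weight, profit) pairs
for w, v, p in sorted(pareto_knapsacks(items, capacity=9)):
    print(f"weight={w} profit={v} items={p}")
```

Each printed packing is one point on the Pareto front: the decision maker, not the algorithm, picks the preferred weight/profit trade-off.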
<h4>
Applying the IC-SPEA2 Algorithm: A Mexican Beef Cattle Farm</h4>
A farm is a dynamic system emerging from the imperatives of the environment coupled to the experience of the farmer and the way the available resources are manipulated when trying to achieve multiple and most probably conflicting objectives. A farmer's capacity to act is always bounded by multiple trade-offs, environmental constraints and shifting measures of success (Martínez-García A, Anderson J. 2005).<br />
<br />
To explore the problem of farm profitability, a model of a Mexican beef cattle farm running on temperate pastures and fodder crops was created. The model served as a black-box objective function, taking a set of decision variables and converting these values, via the complex, iterative model rules, into multiple objective values for evaluation by IC-SPEA2, the multi-objective evolutionary algorithm.<br />
<a href="http://3.bp.blogspot.com/_kbCCKz8F0ok/SqUndIETd0I/AAAAAAAAAGQ/WkeaTEcUo1k/s1600-h/dhck8ddg_126ch4zvjc8_b.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5378748711299151682" src="http://3.bp.blogspot.com/_kbCCKz8F0ok/SqUndIETd0I/AAAAAAAAAGQ/WkeaTEcUo1k/s400/dhck8ddg_126ch4zvjc8_b.png" style="cursor: hand; cursor: pointer; height: 340px; width: 400px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Figure 3</b>: An illustration of the relationships between the farm manager, the farm, the farm model and the IC-SPEA2 algorithm used to generate Pareto optimal farming strategies</span><br />
<br />
The purpose of the modelled beef farm is to make money by converting food into live animal weight. The farmer can control the rate of weight gain by varying the amount and type of food included in the diet formulation for each of the twelve months of the season. The different food types have different nutritional benefits and purchase/production costs, which vary throughout the season.<br />
<br />
The optimisation objectives for the farm were:<br />
<ol>
<li><i>Maximise net revenue</i></li>
<li><i>Minimise the cost of the diet</i><br />This conflicts with maximising net revenue since cheap food tends to have low nutritional value.</li>
<li><i>Maximise average daily weight gain</i><br />This conflicts with minimising diet costs since increasing the weight gain requires the foods with the highest nutritional value, resulting in increased diet costs. It may also conflict with maximising net revenue since heavier animals lead to higher variable costs.</li>
</ol>
The farm manager was asked to suggest an optimal strategy based on his experience and perceived objectives. He suggested the use of as much pasture as possible to feed the herd, supplementing this with alfalfa only when seasonal shortages of pasture occurred, and further only using silage and stubble when both pasture and alfalfa were in short supply.<br />
<br />
In other words the farmer suggests feeding his animals in such a way as to maximise the weight gain while minimising the diet costs. If these two objectives are achieved then it seems perfectly reasonable to assume that the third objective, maximising net revenue, will also be achieved, since fatter animals should sell for more (Martínez-García et al).<br />
<h4>
Interpreting the Results – A Counter Intuitive Strategy is Discovered</h4>
<a href="http://1.bp.blogspot.com/_kbCCKz8F0ok/SqUno6YK3PI/AAAAAAAAAGY/NLgcbpIMW-Q/s1600-h/dhck8ddg_127g6trhndp_b.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"><img alt="" border="0" id="BLOGGER_PHOTO_ID_5378748913782807794" src="http://1.bp.blogspot.com/_kbCCKz8F0ok/SqUno6YK3PI/AAAAAAAAAGY/NLgcbpIMW-Q/s400/dhck8ddg_127g6trhndp_b.png" style="cursor: hand; cursor: pointer; height: 254px; width: 343px;" /></a><br />
<span class="Apple-style-span" style="font-size: small;"><b>Table 9</b>: Martínez-García et al (pp. 647)</span><br />
<br />
IC-SPEA2 returned multiple, distinct Pareto-optimal strategies, of which two were of particular interest:<br />
<ul>
<li><i>Solution 2</i><br />The strategy that gave the highest net revenue.</li>
<li><i>Solution 1</i><br />The strategy that gave the highest live weight gain.</li>
</ul>
These results were counter-intuitive as it was assumed that the strategy that gave the highest live weight gain would be the same strategy that gave the highest net revenue.<br />
Even the farmer assumed that the goal was to make the animals as heavy as possible since heavy animals sell for more money.<br />
<br />
Yet IC-SPEA2 found a strategy that gave the highest net revenue while delivering lighter animals. This has potentially profound implications for the sustainable management of the farm.<br />
<h4>
The Implications for Farm Sustainability</h4>
Among the most surprising discoveries resulting from using IC-SPEA2 were the counter-intuitive nature of the solutions, their diversity and the fact that such a set allows for a compromise between the manager's goals and the precautionary principle<sup>4</sup>.<br />
<br />
This point is demonstrated by the solution with the highest net revenue, in which the animals eat less. Maintaining net revenue while reducing live weight gain correspondingly reduces the stress on the available pasture and allows the farmer to meet his profit obligations while operating the farm below its maximum carrying capacity, a compromise between the farmer's objectives and the sustainability of the farm.<br />
<br />
Furthermore, IC-SPEA2 showed the potential usefulness of on-demand optimisation, displaying an ability to simulate and optimise a large, non-linear model with an intractable state-space using a cheap laptop computer. Martínez-García et al (2005) state that:<br />
<br />
<i>... as self-generated, cognitive systems with humans at their core, the main technical problem for achieving sustainable [farming systems] is how to enhance the decision-makers skills for choosing appropriate courses of action, in real time, in response to their own internal, dynamic purposes, while increasing their number of choices to face complex environmental conditions.</i><br />
<br />
This suggests farmers making regular use of real-time evolutionary metaheuristic software may be able to respond to changes in their dynamic farm systems and so perform frequent, incremental, strategic re-evaluations in the field. These small adjustments measured against the backdrop of a shifting reality may well buffer against the over-shoot and performance fluctuations that can prove damaging to the sustainability of a complex, dynamic farm system.</div>
<div>
<br />
Finally it suggests that the ability of multi-objective evolutionary heuristics to generate a Pareto set of multiple, optimal choices gives the farmer the control and scope to dynamically re-balance the farm's objectives to maintain its sustainability in a measured and controlled manner even as the environmental conditions fluctuate and the farmer's measures of success evolve.<br />
<h4>
References</h4>
Alba, Enrique, Marti, Rafael (Eds). 2006. Metaheuristic Procedures for Training Neural Networks. Operations Research/Computer Science Interfaces Series, Vol.35<br />
<br />
Dawkins, R. 1996. The Blind Watchmaker. UK: Penguin Books<br />
<br />
Dekker, M.D. 2008. Travelling salesman problem. http://en.wikipedia.org/wiki/Travelling_salesman_problem<br />
<br />
Goddard, S. Dynamic programming 0-1 Knapsack problem. CSCE 310J. Data Structures & Algorithms. http://www.cse.unl.edu/goddard/Courses/CSCE310J/Lectures/Lecture8-DynamicProgramming.pdf<br />
<br />
Goldberg, D. E. (1989). Genetic Algorithms in Search, Optimization, and Machine Learning. Reading, MA: Addison-Wesley.<br />
<br />
Martínez-García A, Anderson J. (2005). Cárnico-ICSPEA2 – A metaheurístic co-evolutionary navigator for a complex co-evolutionary farming system. European Journal of Operational Research 179 (2007) 634–655<br />
<br />
Messac, A., Ismail-Yahaya, A., and Mattson, C.A. (2003) The Normalized Normal Constraint Method for Generating the Pareto Frontier, Structural and Multidisciplinary Optimization, Vol. 25, No. 2, 2003, pp. 86-98.<br />
<br />
Wolpert, D.H., Macready, W.G. (1995), No Free Lunch Theorems for Search, Technical Report SFI-TR-95-02-010 (Santa Fe Institute).<br />
<br />
Wright J. A., Loosemore H. (2001). An Infeasibility Objective for Use in Constrained Pareto Optimization. EMO 2001: 256-268<br />
<br />
Zitzler, Eckart and Laumanns, Marco and Thiele, Lothar (2001) SPEA2: Improving the Strength Pareto Evolutionary Algorithm. Evolutionary Methods for Design, Optimisation, and Control.<br />
<h4>
Notes</h4>
1 A general optimisation algorithm where all possible solutions are processed and discarded in blocks if they lie outside the estimated minimum and maximum bounds of the quantity being optimised.<br />
<br />
2 Progressive improvement algorithms follow a direction of improvement until no further optimisation can be made. With luck the global optimum is now located.<br />
<br />
3 Two random solutions are selected from the population and the more fit of the two wins the tournament. A clone of the winning solution is made leaving the original in the population so it can continue to take part in future tournaments.<br />
<br />
4 A moral and political principle that invokes caution to avoid damaging the environment in ways that cannot be reversed where the scientific evidence is not conclusive.</div>
</div>
<br />
<br />
<iframe frameborder="0" height="342" src="http://docs.google.com/present/embed?id=dhck8ddg_132fs45f2ff" width="410"></iframe>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com4tag:blogger.com,1999:blog-6927558.post-12109503030358898112009-08-09T15:58:00.033+01:002009-08-11T13:05:15.199+01:00The Broken Waterfall<blockquote><p><i>The traditional </i>predictive<i> approach to project management is being rejected in favour of an </i>adaptive<i> or </i><span><i>Agile </i></span><i>approach.</i></p><p><i><span class="Apple-style-span" style="font-style: normal; "><i>This is not a matter of buzz-words or faddish management technologies; instead it is a genuine commitment to help clients get the software they </i><span><i>actually want</i></span><i> - on time and within budget.</i></span></i></p></blockquote><h2><span class="Apple-style-span" style="font-size:medium;">The Problem</span></h2><p>There is a problem with the delivery of software. The more complex a project the greater the chance the project will be delivered over budget and behind schedule. As a project grows in complexity there comes a point where this potential for failure becomes almost a guarantee. Most experienced project managers understand this and strain their sinews to prevent it from happening and most experienced programmers have lived through the intense disappointment of seeing their work fail to achieve its initial promise. Yet time and again, despite the best efforts of genuinely talented and motivated people, software projects are delivered late, cost too much and do not function as the client expected - Why is this?</p><p>For each failed software project the problem typically turns out to be <em>the plan</em>. Now that may seem trivially obvious. 
Looking back over a failed project it is easy to suggest that if only the plan had been more precise then the project could have been more controlled and so more successful.</p><p>This is not correct.</p><p>The problem does not lie in the quality of the planning; the problem lies in the type of plan, specifically the attempt to create an <em>up-front</em> plan that covers the entire project life-cycle. This is not so obvious - how can you run a project without deciding what you need <em>up-front</em>?</p><p>To understand why up-front planning impedes the successful delivery of quality software it is first necessary to understand what is meant by a plan in this traditional sense, and then see how this concept can be dispensed with and replaced with a new type of planning mechanism.</p><h2><span class="Apple-style-span" style="font-size:medium;">What's in a Plan</span></h2><p>At the start of a traditional project there is the familiar <em>requirements-capture</em> phase. This typically involves the writing of various specifications: a user specification that outlines the requirements in the language of the client, a functional specification that outlines the requirements in the language of the programmer and then perhaps a fully detailed technical specification that describes the requirements in a pseudo programming language.</p><p>Once complete, these detailed specifications provide the basis for all future work. They allow predictions to be made about the project's costs as well as its anticipated schedule. Specification documents also serve a secondary function. They give both the client and the engineers a form of 'contract' that, upon project delivery, allows everybody to compare what was promised with what was actually delivered.</p><p>This up-front planning process is often called the 'waterfall' model; it is a highly structured methodology that steps through requirements-capture, analysis, design, coding, and testing in a strict, pre-planned sequence. 
Progress is generally measured in terms of deliverable artifacts: requirement specifications, design documents, test plans and code reviews.</p><h2><span class="Apple-style-span" style="font-size:medium;">The Waterfall is Broken </span></h2><p>There are good reasons why traditional, up-front planning fails. Unfortunately these reasons tend to make both clients and engineers feel uncomfortable so they are rarely spoken out loud.</p><p>Firstly, up-front planning means that the specification documents are written before any software is built. Experts, using all their intellectual powers and experience, attempt to <em>imagine</em> the software and in doing so mentally traverse all of its myriad details. Since no software has yet been built, the hypothetical assertions contained within these documents cannot be tested experimentally. In science an hypothesis that cannot be tested is called <em>pseudo-science</em> and by the same token a specification whose assumptions cannot be tested should be considered <em>pseudo-planning</em>. </p><p>Secondly, at the start of any reasonably complex project there is always an inescapable knowledge gap. This gap exists between:</p><ul><br /><li>The business knowledge brought by the client</li><li>The technical knowledge brought by the engineers</li></ul><p>To begin with these two bodies of knowledge do not mix well as the clients do not really understand the language of software engineering and the engineers do not really understand the language of the client's specific business. This will change as time goes on and eventually the distinct bodies of information will mix and become one shared information landscape. However, at the start of a project when traditional up-front planning occurs, this inevitable knowledge gap leads to two critical and incorrect assumptions:</p><p style="padding-left: 30px;"><strong>1. 
The client knows what they want their new software to do<br /><span style="font-weight: normal;">Many clients come to a project with a good idea of what they want; perhaps they have spent time and effort working this out, perhaps they have a legacy system that shows them much of what they want and what they do not want. However at the start of a project the client cannot know what they want in sufficient detail to create a <em>complete</em> and <em>precise</em> plan. They can provide a business vision and they can provide business constraints but they cannot state in detail the processes required to deliver their vision because they have not yet absorbed the necessary details of the engineering environment. A superficial understanding can be gleaned during the initial planning meetings but this will not produce a sufficient understanding of the software they are commissioning. </span></strong></p><p style="padding-left: 30px;"><strong>2. The engineers know how to implement the client's business vision</strong><br />Many engineers come to a project with a good idea of how to build business systems. They will have spent considerable time and effort building other, perhaps similar systems. However at the start of a project engineers cannot know how to implement the precise details of a specific business application because they have not yet absorbed the detailed business knowledge brought by the client. A superficial understanding can be gleaned during the initial planning meetings but this will not produce a sufficient understanding of the software they are being asked to deliver.</p><p>Predictive planning fails because an accurate plan requires a genuine, non-superficial understanding of both the client's business knowledge and the engineer's technical knowledge. Traditional specifications are created at the start of a project when both parties have not had enough time to come to such an understanding. 
It takes much effort to synthesize the two bodies of knowledge into a coherent whole, far more than can reasonably be assigned during the requirements-capture phase. </p><p>This means that plans created at the start of the project cannot be more than partially informed guesswork. Given that the nature of complex systems makes them particularly sensitive to changes in small details, a plan for a complex system created with incomplete knowledge must perforce be a recipe for failure by degrees.</p><p>Does this really make up-front planning redundant? Is there a way to make the synthesis of the client and technical knowledge more efficient, perhaps by using advanced planning software? If this could be achieved then perhaps the planners could write effective up-front specifications that lead to accurate long-term costings and schedules.</p><p>Unfortunately there is another, more fundamental reason why detailed specifications must fail - regardless of their precision.</p><p>A specification is a description that attempts to outline features and functions in a natural language such as English. Yet software is actually written in the very precise syntax of a machine language. Engineers know that only computer code can truly express the details of a software vision; a natural language specification cannot be logically accurate enough. This means that natural language specifications must leave many implementation details open to interpretation, forcing the engineer to skilfully choose from a set of implied options. Yet complex systems are sensitive to precisely these sorts of technical details: different choices will lead to different systems and, as often as not, unfulfilled client expectations.</p><p>Therefore, even where a specification guesses correctly, the natural language descriptions will contain subtle choices and hidden contradictions. 
It is only when the fuzzy language of the specification is transformed into the precise reality of the code that these choices and contradictions become apparent.</p><p>This leads to a profound truth about the nature of specifications: <em>Greater precision does not lead to greater control.</em> Instead the greater the precision the more varied and subtle the choices and contradictions become.</p><h2><span class="Apple-style-span" style="font-size:medium;">Planning For Success</span></h2><p>Understanding these fundamental flaws at the heart of traditional software delivery, many forward-looking managers and engineers are now moving towards a new project control methodology. In contrast to up-front or predictive planning this new methodology uses repeated bursts of short-term <em>adaptive</em> planning.</p><p><em>Agile Software Development</em> throws out long-term planning and with it the traditional concept of a specification. Instead agile projects start with everybody discussing and sharing a simple vision of the end product. The vision is really no more than a mission statement that, at this early stage, explicitly removes the need for engineers to fully understand the business and for the client to fully understand the technology.</p><p>This means that an agile project can get started almost straight away, with the absolute minimum of requirements-capture. Instead of a long, costly and ultimately self-defeating planning phase, the engineers get to work building the first version (iteration) of what will become a <em>rolling beta</em>. Armed with a very short term plan covering just one or two weeks of work, the engineers build the first iteration and deliver it to the client for discussion and criticism. The rolling-beta is still only a sketch, an outline of the most important functions and how they might fit together. 
Mistakes and incorrect assumptions will have been made, indeed given the knowledge gap they cannot be avoided, but the mistakes are identified and quickly eliminated as the rolling-beta is regularly assessed by the client and engineers in close collaboration.</p><p>Once the first iteration is signed-off then the process begins again, a new short term plan is created and work begins on the second iteration. This <em>iterative</em> development continues and as the knowledge gap closes so the requirements and hence the software become ever more detailed and coherent. </p><h2><span class="Apple-style-span" style="font-size:medium;">Embracing Function Creep</span></h2><p>As this hands-on process continues the client comes to properly understand the technical environment, what is expensive and what is possible, and as their knowledge grows so they begin to see new possibilities. </p><p>Clients changing their minds or adding new features during development is traditionally called <em>function creep</em> and remains the enemy of traditional planners. Yet to suppress this is to deny that clients can learn and modify their expectations as they see their software progressing. Rather than trying to ignore the client's input, the agile iterative process welcomes it as new and valuable knowledge. </p><p>Thus the client is encouraged to re-specify their product as it is being written. This is the ultimate guarantee that, in the end, the client will be satisfied. It is hard for a client to be surprised or disappointed with their software if they have played an active part in designing and deciding the goals at each iteration.</p><p>Equally, as the iterative process progresses the engineers will also come to a genuine understanding of the business. This allows the engineers to discuss the business processes with the client in a manner that allows a useful exchange of knowledge to take place. 
Questions to the client can be appropriately framed using the business terminology both the client and the engineers now share. Since the frequent iterations and short-term planning means that any incorrect business assumptions are quickly discovered, such mistakes can be corrected with the minimum of effort. </p><p>Engineers too, once they come to a genuine understanding of the business, can start to usefully contribute to the re-specification of the rolling-beta. New ideas and inspirations, whatever their source, can be welcomed, discussed and possibly incorporated as the software adapts over time.</p><h2><span class="Apple-style-span" style="font-size:medium;">Job Satisfaction</span></h2><p>In summary, an agile software system evolves under the twin constraints of the client's business vision and the engineering environment's technical limitations. As the client and engineers come to a mutual understanding so new ideas bubble up and are incorporated as bad old ideas are identified and discarded. Before starting each iteration everybody discusses, negotiates and quickly reaches an understanding of what is <em>actually required</em> to fulfil the next set of short-term goals. </p><p></p><blockquote><i>Thus an agile system organically grows its natural complexity out of a fundamental simplicity. 
As a result there are fewer surprises, the project risks are minimised and the client is more likely to get software that works.</i></blockquote><p></p><p><br /></p>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com7tag:blogger.com,1999:blog-6927558.post-47219640082621866502009-04-28T20:38:00.166+01:002010-08-27T16:33:51.923+01:00Domain Driven RIA: Managing Deep Object Graphs in Silverlight-3<em><span class="Apple-style-span" style="font-size:large;">Using RIA Services, can a simple n-tier application manage a deep object graph with eager fetching, lazy loading and silverlight databinding?</span></em><div><br /></div><div><br /><object width="400" height="300"><param name="allowfullscreen" value="true"><param name="allowscriptaccess" value="always"><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=4589802&server=vimeo.com&show_title=1&show_byline=1&show_portrait=0&color=&fullscreen=1"><embed src="http://vimeo.com/moogaloop.swf?clip_id=4589802&server=vimeo.com&show_title=1&show_byline=1&show_portrait=0&color=&fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="400" height="300"></embed></object><br /><br /><div><span class="Apple-style-span" style="font-weight: bold;">Downloads</span><br /><ul><li><a href="https://www.assembla.com/code/biofractal/subversion/nodes/trunk/Blog/DomainDrivenRIA/DeepLazyBinding">The Source Code from Assembla</a><br /></li><li><a href="http://cid-07794e6d29b7e05c.skydrive.live.com/self.aspx/.Public/DeepLazyBinding.mp4?sa=84332759">The screencast: A detailed walk-through</a> [duration 28 minutes]</li></ul><div><br /><span class="Apple-style-span" style="color: rgb(204, 0, 0);"><span class="Apple-style-span" style="font-weight: bold;">Note</span></span>: If you have no experience with RIA Services then you may prefer to start with my previous demo, <a href="http://biofractal.blogspot.com/2009/03/domain-driven-db4o-silverlight-3-ria.html">A 
Domain-Driven DB4O Silverlight-3 RIA</a>, which has links to RIA Services documentation and Microsoft presentations to get you started.<br /><br /><strong>Introduction</strong><br />RIA Services is a Rich Internet Application (RIA) framework that promises to streamline n-tier Line of Business application development. Reading through the <a href="http://download.microsoft.com/download/F/B/8/FB8CA635-296B-487F-965C-8148F08B5319/riaservicesoverviewpreview.pdf">RIA documentation</a> and listening to the <a href="http://videos.visitmix.com/MIX09/T40F">RIA team's presentations</a> I was struck by two things:<div><ul><li>How potentially useful this framework was. <br /></li><li>How skewed the material was in favour of a <a href="http://en.wikipedia.org/wiki/Data-driven_design">data-driven design</a> approach<br /></li></ul><div>In this post I want to investigate how RIA Services can be used in a Domain-Driven context with a special focus on how it can help with the eager and lazy loading of domain entities.<br /><br /><span class="Apple-style-span" style="font-weight: bold;">Where is the Database?</span></div><div>I have chosen not to use a relational database in this example. This is because I want to ensure that my domain instance data can be easily stored and retrieved in the most efficient and maintainable <span class="Apple-style-span" style="font-style: italic;">domain-centric</span> manner. I have therefore elected to use an object datastore, in this case <a href="http://www.db4o.com/">DB4O</a>, which provides all the ease, speed and functionality I need. 
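To give a sense of how lightweight this is, here is a minimal sketch of basic DB4O usage. It is illustrative only, not code from the demo project, and the file name and values are invented:

```csharp
// Minimal DB4O usage sketch (illustrative; file name and values invented).
// No schema, no mapping, no DTOs - plain objects go straight in.
using (var db = Db4oFactory.OpenFile("Example.db4o"))
{
    var user = new User { Id = Guid.NewGuid(), Name = "alice" };
    db.Store(user); // persists the object and its reachable graph

    // A native query is just an ordinary C# predicate.
    var matches = db.Query<User>(u => u.Name == "alice");
}
```

The demo project uses the client/server variant, Db4oFactory.OpenServer, which appears in the ServerOpen() method later on. 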
For more information see:</div><div><div><ul><li><a href="http://biofractal.blogspot.com/2009/02/irrational-love-of-relational.html">An Irrational Love of the Relational</a><br /></li><li><a href="http://biofractal.blogspot.com/2009/02/data-driven-conspiracy.html">The Data Driven Conspiracy</a><br /></li></ul></div><div><strong>The Technology Stack</strong><ul><li><a href="http://silverlight.net/getstarted/silverlight3/default.aspx">Silverlight 3</a><br />Handles the client-side application logic and user interface</li><li><a href="http://www.nikhilk.net/RIA-Services-MIX09.aspx">RIA Services</a><br />Provides the client<->server interaction and client-side domain</li><li><a href="http://www.db4o.com/">DB4O</a><br />A server-side datastore for domain entity de/serialization</li></ul><strong>The Software</strong><br />Here is a sneak preview of the software in action.<br /><br /><a href="http://1.bp.blogspot.com/_kbCCKz8F0ok/Sfi_kdkln7I/AAAAAAAAACc/lNKf-0xnMhY/s1600-h/DeepLazyBinding_UI.png"><img style="TEXT-ALIGN: center; MARGIN: 0px auto 10px; WIDTH: 400px; DISPLAY: block; HEIGHT: 314px; CURSOR: hand" id="BLOGGER_PHOTO_ID_5330220792127266738" border="0" alt="" src="http://1.bp.blogspot.com/_kbCCKz8F0ok/Sfi_kdkln7I/AAAAAAAAACc/lNKf-0xnMhY/s400/DeepLazyBinding_UI.png" /></a><br /><br /><strong>The Objectives</strong><br />Using a combination of RIA Services and DB4O I want to test the following:<ul><li>Server - When I fetch an instance of the <a href="http://www.lostechies.com/blogs/jimmy_bogard/archive/2008/05/20/entities-value-objects-aggregates-and-roots.aspx">aggregate root</a> class I expect its inner hierarchy to be <span class="Apple-style-span" style="font-style: italic;">eagerly fetched.</span><br /><br /></li><li>Client - I want certain collections to be <span class="Apple-style-span" style="font-style: italic;">lazy-loaded</span> and so remain unloaded until they are requested.<br /><br /></li><li>I do not expect to write my own <a 
href="http://en.wikipedia.org/wiki/Windows_Communication_Foundation">WCF Service</a> nor do I want to write any <a href="http://en.wikipedia.org/wiki/Data_transfer_object">Data Transfer Objects</a> (DTOs).<br /><br /></li><li>I want to databind my domain entities to silverlight controls. I expect the controls to correctly display my eagerly fetched data as well as handle lazy-loaded data.<br /><br /></li><li>Finally, I want to prove that new domain entities can be created on the client and efficiently serialized to the server-side data-store as a batched <a href="http://framework.zend.com/wiki/display/ZFDEV/Unit+of+Work+pattern">unit-of-work</a>.</li></ul><br /><strong>The Domain</strong><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_kbCCKz8F0ok/SgiSZi7tgXI/AAAAAAAAADM/7D3cCE1q2s8/s1600-h/DeepLazyBindingDomain.png"><img style="cursor:pointer; cursor:hand;width: 219px; height: 383px;" src="http://2.bp.blogspot.com/_kbCCKz8F0ok/SgiSZi7tgXI/AAAAAAAAADM/7D3cCE1q2s8/s400/DeepLazyBindingDomain.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5334674726191989106" /></a><br /><span><span><br />I have a small hierarchical domain consisting of a single <span class="Apple-style-span" style="font-style: italic;">User</span> <a href="http://domaindrivendesign.org/node/88">aggregate</a> root that bounds a one-to-many inner collection of <span class="Apple-style-span" style="font-style: italic;">Holding</span> <a href="http://domaindrivendesign.org/node/109">Entities</a> that each contain a further collection of <span class="Apple-style-span" style="font-style: italic;">Transaction</span> entities. 
</span></span></div><div><br /></div><div>The test domain therefore consists of the following hierarchy:</div><div><ul><li><span class="Apple-style-span" style=""><span class="Apple-style-span" style="font-weight: bold;">User.Holdings[n].Transactions[n]</span></span><br /></li></ul></div><div>Here is the code for the domain hierarchy.<br /></div><div><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 3: </span><span class="kwrd">public</span> <span class="kwrd">abstract</span> <span class="kwrd">class</span> Entity</pre><pre><span class="lnum"> 4: </span>{</pre><pre class="alt"><span class="lnum"> 5: </span> [Key]</pre><pre><span class="lnum"> 6: </span> <span class="kwrd">public</span> Guid Id { get; set; }</pre><pre class="alt"><span class="lnum"> 7: </span>}</pre><pre><span class="lnum"> 8: </span> </pre><pre class="alt"><span class="lnum"> 9: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> User : Entity, IAggregateRoot</pre><pre><span class="lnum"> 10: </span>{</pre><pre class="alt"><span class="lnum"> 11: </span> <span class="kwrd">public</span> <span class="kwrd">string</span> Name { get; set; }</pre><pre><span class="lnum"> 12: </span> <span class="kwrd">public</span> <span class="kwrd">string</span> Password { get; set; }</pre><pre class="alt"><span class="lnum"> 13: </span> <span class="kwrd">private</span> List<Holding> _holdings = <span class="kwrd">new</span> List<Holding>();</pre><pre><span class="lnum"> 14: </span> <span class="kwrd">public</span> List<Holding> Holdings</pre><pre class="alt"><span class="lnum"> 15: </span> {</pre><pre><span class="lnum"> 16: </span> get { <span class="kwrd">return</span> <span class="kwrd">this</span>._holdings; }</pre><pre class="alt"><span class="lnum"> 17: </span> set { <span class="kwrd">this</span>._holdings = <span class="kwrd">value</span>; }</pre><pre><span class="lnum"> 18: </span> }</pre><pre class="alt"><span class="lnum"> 19: 
</span>}</pre><pre><span class="lnum"> 20: </span> </pre><pre class="alt"><span class="lnum"> 21: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> Holding : Entity</pre><pre><span class="lnum"> 22: </span>{</pre><pre class="alt"><span class="lnum"> 23: </span> <span class="kwrd">public</span> Guid UserId { get; set; }</pre><pre><span class="lnum"> 24: </span> <span class="kwrd">public</span> <span class="kwrd">string</span> Symbol { get; set; }</pre><pre class="alt"><span class="lnum"> 25: </span> <span class="kwrd">private</span> List<Transaction> _transactions = <span class="kwrd">new</span> List<Transaction>();</pre><pre><span class="lnum"> 26: </span> <span class="kwrd">public</span> List<Transaction> Transactions</pre><pre class="alt"><span class="lnum"> 27: </span> {</pre><pre><span class="lnum"> 28: </span> get { <span class="kwrd">return</span> <span class="kwrd">this</span>._transactions; }</pre><pre class="alt"><span class="lnum"> 29: </span> set { <span class="kwrd">this</span>._transactions = <span class="kwrd">value</span>; }</pre><pre><span class="lnum"> 30: </span> }</pre><pre class="alt"><span class="lnum"> 31: </span>}</pre><pre><span class="lnum"> 32: </span></pre><pre class="alt"><span class="lnum"> 33: </span><span class="kwrd">public</span> <span class="kwrd">class</span> Transaction : Entity</pre><pre><span class="lnum"> 34: </span>{</pre><pre class="alt"><span class="lnum"> 35: </span> <span class="kwrd">public</span> Guid HoldingId { get; set; }</pre><pre><span class="lnum"> 36: </span> <span class="kwrd">public</span> TransactionType Type { get; set; }</pre><pre class="alt"><span class="lnum"> 37: </span> <span class="kwrd">public</span> <span class="kwrd">int</span> Quantity { get; set; }</pre><pre><span class="lnum"> 38: </span> <span class="kwrd">public</span> <span class="kwrd">decimal</span> Price { get; set; }</pre><pre class="alt"><span class="lnum"> 39: 
</span>}</pre></div><div><br /><span class="Apple-style-span" style="font-weight: bold; ">Domain Loading Strategy</span></div><div><ul><li><span class="Apple-style-span" style="font-style: italic; ">Server</span><br />Fetching a User should eagerly fetch all of its dependent Holdings. Each Holding should eagerly fetch all its dependent Transactions.<br /><br /></li><li><span class="Apple-style-span" style="font-style: italic; ">Client</span><br />Fetching a User should eagerly fetch all of its dependent Holdings. However, due to the potential for large numbers of Transactions, each Holding should <span class="Apple-style-span" style="font-style: italic; ">not </span>fetch any Transactions; instead, the Transactions collection must be <span class="Apple-style-span" style="font-style: italic; ">lazy-loaded</span>.</li></ul></div><br /><strong>The Datastore Setup</strong></div><div>Before plunging into the RIA Services code I want to show you just how easy it is to use the <a href="http://www.db4o.com/">DB4O </a>object database. </div><div><br /></div><div>In the web.config there are two application settings (shown below). </div><div><ol><li><span class="Apple-style-span" style="font-style: italic;">DataFile.Name</span><br />Specifies the name of the DB4O datastore file held in the App_Data folder<br /><br /></li><li><span class="Apple-style-span" style="font-style: italic;">DataFile.GenerateSampleData</span><br />Determines whether the datastore is reset with newly generated sample data whenever the Cassini web application is re-started (useful for testing). 
<br /></li></ol></div><div><span class="Apple-style-span" style="color: rgb(204, 0, 0);"><span class="Apple-style-span" style="font-weight: bold;">Important</span></span>: Ensure the <span class="Apple-style-span" style="font-style: italic;">DataFile.GenerateSampleData</span> setting is <span class="Apple-style-span" style="font-weight: bold;">false </span>if you want to retain any changes between application runs.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span><appSettings></pre><pre><span class="lnum"> 2: </span> <add key=<span class="str">"DataFile.Name"</span> <span class="kwrd">value</span>=<span class="str">"DataStore.db4o"</span>/></pre><pre class="alt"><span class="lnum"> 3: </span> <add key=<span class="str">"DataFile.GenerateSampleData"</span> <span class="kwrd">value</span>=<span class="str">"true"</span>/></pre><pre><span class="lnum"> 4: </span></appSettings></pre><pre class="alt"><span class="lnum"> 5: </span></pre><pre><span class="lnum"> 6: </span><span class="kwrd">public</span> <span class="kwrd">static</span> <span class="kwrd">void</span> ServerOpen()</pre><pre class="alt"><span class="lnum"> 7: </span>{</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">if</span> (db4oServer != <span class="kwrd">null</span>)</pre><pre class="alt"><span class="lnum"> 9: </span> {</pre><pre><span class="lnum"> 10: </span> <span class="kwrd">return</span>;</pre><pre class="alt"><span class="lnum"> 11: </span> }</pre><pre><span class="lnum"> 12: </span> </pre><pre class="alt"><span class="lnum"> 13: </span> var filename = Path.Combine(HttpContext.Current.Server.MapPath(<span class="str">"~/App_Data"</span>), ConfigFileName);</pre><pre><span class="lnum"> 14: </span> </pre><pre class="alt"><span class="lnum"> 15: </span> var generateSampleData = <span class="kwrd">bool</span>.Parse(GenerateSampleData);</pre><pre><span class="lnum"> 16: </span> <span class="kwrd">if</span> (generateSampleData && 
File.Exists(filename))</pre><pre class="alt"><span class="lnum"> 17: </span> {</pre><pre><span class="lnum"> 18: </span> File.Delete(filename);</pre><pre class="alt"><span class="lnum"> 19: </span> }</pre><pre><span class="lnum"> 20: </span> db4oServer = Db4oFactory.OpenServer(GetConfig(), filename, 0);</pre><pre class="alt"><span class="lnum"> 21: </span> <span class="kwrd">if</span> (generateSampleData)</pre><pre><span class="lnum"> 22: </span> {</pre><pre class="alt"><span class="lnum"> 23: </span> SampleData.Generate();</pre><pre><span class="lnum"> 24: </span> }</pre><pre class="alt"><span class="lnum"> 25: </span>}</pre></div><br />In order to create the server-side eager fetch strategy outlined above, the DB4O datastore requires some configuration. The following <span class="Apple-style-span" style="font-style: italic;">GetConfig()</span> method shows the Domain being scanned for types that implement <span class="Apple-style-span" style="font-style: italic;">IAggregateRoot </span>with DB4O instructed to automatically fetch, save and delete the inner dependencies for those types.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span><span class="kwrd">private</span> <span class="kwrd">static</span> IConfiguration GetConfig()</pre><pre><span class="lnum"> 2: </span>{</pre><pre class="alt"><span class="lnum"> 3: </span> var config = Db4oFactory.NewConfiguration();</pre><pre><span class="lnum"> 4: </span> config.UpdateDepth(2);</pre><pre class="alt"><span class="lnum"> 5: </span> var types = Assembly.GetExecutingAssembly().GetTypes();</pre><pre><span class="lnum"> 6: </span> <span class="kwrd">for</span> (var i = 0; i < types.Length; i++)</pre><pre class="alt"><span class="lnum"> 7: </span> {</pre><pre><span class="lnum"> 8: </span> var type = types[i];</pre><pre class="alt"><span class="lnum"> 9: </span> <span class="kwrd">if</span> (type.GetInterface(<span class="kwrd">typeof</span> (IAggregateRoot).Name) == <span 
class="kwrd">null</span>)</pre><pre><span class="lnum"> 10: </span> {</pre><pre class="alt"><span class="lnum"> 11: </span> <span class="kwrd">continue</span>;</pre><pre><span class="lnum"> 12: </span> }</pre><pre class="alt"><span class="lnum"> 13: </span> var objectClass = config.ObjectClass(type);</pre><pre><span class="lnum"> 14: </span> objectClass.CascadeOnUpdate(<span class="kwrd">true</span>);</pre><pre class="alt"><span class="lnum"> 15: </span> objectClass.CascadeOnActivate(<span class="kwrd">true</span>);</pre><pre><span class="lnum"> 16: </span> objectClass.CascadeOnDelete(<span class="kwrd">true</span>);</pre><pre class="alt"><span class="lnum"> 17: </span> objectClass.Indexed(<span class="kwrd">true</span>);</pre><pre><span class="lnum"> 18: </span> }</pre><pre class="alt"><span class="lnum"> 19: </span> <span class="kwrd">return</span> config;</pre><pre><span class="lnum"> 20: </span>}</pre></div><br /><strong>RIA Services</strong><br />N-Tier applications are defined by the machine boundary that exists between the client and the server. Getting to grips with RIA Services begins by understanding how it tries to help you write applications that span that machine boundary. </div><div><br /></div><div>As you write your server-side domain code RIA Services tries to discover the way you <span class="Apple-style-span" style="font-style: italic;">intend to use</span> this domain on the client. As it does so it generates a client-side version of your domain that fulfils those <span class="Apple-style-span" style="font-style: italic;">intentions</span>. 
This means that you do not need to write a client-side version of your domain in order to use its features on the client, nor do you need to write any explicit mechanism for transferring domain instance data across the machine boundary (no WCF, no DTOs).</div><div><br />RIA Services discovers your intentions via a combination of <span class="Apple-style-span" style="font-style: italic;">Convention </span>and <span class="Apple-style-span" style="font-style: italic;">Metadata</span>. For example, I intend to utilize my User class on the client and so I need to be able to fetch User instances from the data store. This implies that somewhere I must write a server-side service method to perform the User fetch.<br /><br />RIA Services simply asks that I put that User fetch service method in a class that derives from the RIA <span class="Apple-style-span" style="font-style: italic;">DomainService</span> class and that I follow some simple naming rules for the method signature. For more information on these conventions see <a href="http://download.microsoft.com/download/F/B/8/FB8CA635-296B-487F-965C-8148F08B5319/riaservicesoverviewpreview.pdf">.NET RIA Services Overview for Mix 2009 Preview</a>.</div><div><br /></div><div>If I follow the prescribed <span class="Apple-style-span" style="font-style: italic;">conventions</span> then RIA will be able to determine that I intend to utilize the User class on the client and so generate a client-side version of my User class. 
This generated version is not the same class as my 'real' server-side User class; it only has as much or as little functionality as I decide to share (see later), but it does allow the client code to operate as if I had access to the User class, so I can use it in my silverlight code.<br /><br />This is what the conventional User fetch method looks like.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span>[EnableClientAccess]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">class</span> DataStore : DomainService</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> </pre><pre class="alt"><span class="lnum"> 5: </span> <span class="kwrd">public</span> IQueryable<User> GetUser(<span class="kwrd">string</span> name, <span class="kwrd">string</span> password)</pre><pre><span class="lnum"> 6: </span> {</pre><pre class="alt"><span class="lnum"> 7: </span> <span class="kwrd">using</span> (var db = DataService.GetSession<User>())</pre><pre><span class="lnum"> 8: </span> {</pre><pre class="alt"><span class="lnum"> 9: </span> <span class="kwrd">return</span> db.GetList(x => x.Name.Equals(name) && x.Password.Equals(password)).AsQueryable();</pre><pre><span class="lnum"> 10: </span> }</pre><pre class="alt"><span class="lnum"> 11: </span> }</pre><pre><span class="lnum"> 12: </span> ... other code</pre><pre class="alt"><span class="lnum"> 13: </span>}</pre></div><br />The presence of this method stimulates RIA into generating a client-side version of my User class; however, it will only carry over simple properties such as the <span class="Apple-style-span" style="font-style: italic;">User.Name </span><span class="Apple-style-span" style="">and </span><span class="Apple-style-span" style="font-style: italic;">User.Password</span>. 
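Conceptually, the generated client-side User looks something like the sketch below. This is only an illustration of its shape, not the actual RIA-generated code (which carries extra validation and change-tracking plumbing):

```csharp
// Illustrative shape of the RIA-generated client-side entity (NOT the
// real generated code). Property setters raise change notifications so
// that silverlight databinding stays in sync with edits.
public partial class User : INotifyPropertyChanged
{
    public event PropertyChangedEventHandler PropertyChanged;

    private string _name;
    public string Name
    {
        get { return this._name; }
        set { this._name = value; this.Raise("Name"); }
    }

    private string _password;
    public string Password
    {
        get { return this._password; }
        set { this._password = value; this.Raise("Password"); }
    }

    private void Raise(string propertyName)
    {
        var handler = this.PropertyChanged;
        if (handler != null) handler(this, new PropertyChangedEventArgs(propertyName));
    }
}
```

Note that, at this point, there is no Holdings property at all. 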
So what happens if I want to make client-side use of a more complex property such as the <span class="Apple-style-span" style="font-style: italic;">User.Holdings</span> collection? </div><div><br /></div><div>This is a new intention so I must tell RIA about it. Only then can RIA generate the appropriate client-side code to fulfil the new intention.<br /><br />This is achieved in two steps.<ol><li>The Holding class must define a UserId property. When a new Holding is instantiated this property must be set to the Id of its parent User</li><br /><li>The User.Holdings Collection must be decorated with the appropriate attributes.</li></ol>To decorate a server-side domain entity with attributes targeted to client-side behaviour seems impure but fortunately RIA provides a pattern that brushes it all under the carpet and allows you to retain your domain-driven dignity. </div><div><br /></div><div>First you must ensure the main User class is <span class="Apple-style-span" style="font-style: italic;">partial</span>. This allows you to create a new partial User segment in a separate code file called <span class="Apple-style-span" style="font-style: italic;">User.meta.cs</span>. You can then add the following code to that file. 
In this way you can keep all the RIA meta-data tucked away in their own partial file segments.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span>[MetadataType(<span class="kwrd">typeof</span> (UserMetadata))]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> User</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> <span class="kwrd">internal</span> <span class="kwrd">sealed</span> <span class="kwrd">class</span> UserMetadata</pre><pre class="alt"><span class="lnum"> 5: </span> {</pre><pre><span class="lnum"> 6: </span> [Include]</pre><pre class="alt"><span class="lnum"> 7: </span> [Association(<span class="str">"User_Holdings"</span>, <span class="str">"Id"</span>, <span class="str">"UserId"</span>)]</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">public</span> List<Holding> Holdings { get; set; }</pre><pre class="alt"><span class="lnum"> 9: </span> }</pre><pre><span class="lnum"> 10: </span>}</pre></div><br />You will note there are two attributes being used here. What are they doing? </div><div><ul><li><span class="Apple-style-span" style="font-style: italic;">[Association]</span><br />This attribute is informing RIA that the Holdings collection can be reconstructed on the client by comparing the User.Id to the Holding.UserId. When these match the Holding belongs to the collection. <br /><br /></li><li><span class="Apple-style-span" style="font-style: italic;">[Include]</span><br />This attribute is more mysterious. Perhaps, like me, you might assume it means "Include this property in the generated code". This is not correct. In fact it means "Automatically recreate this collection on the client", in other words the client-side collection will be eagerly fetched and made available without any further intervention on your part. 
This is the behaviour we want for the User.Holdings collection and gives us our first clue about how we might set up the lazy loading for the Holding.Transactions collection.</li></ul></div><div><br /></div><div>RIA allows us to define the shape of our hierarchy on the client using a combination of <span class="Apple-style-span" style="font-style: italic;">convention </span>for the fetch method signatures and <span class="Apple-style-span" style="font-style: italic;">metadata </span>using the [Include] + [Association] attributes. But a class must also define functionality or it is just a DTO. </div><div><br /></div><div>Can I pick and choose the functions I want to appear in the client-side versions of my domain entities?<br /><br /><strong>Sharing Domain Functions</strong><br />On the client I want to add a new Holding to my User.Holdings collection. Being a conscientious domain-driven coder, I want to ensure that my code follows the <a href="http://en.wikipedia.org/wiki/Law_Of_Demeter">Law of Demeter</a>, which means I cannot reach into the Holdings collection directly like this:<br /><br /><span class="Apple-style-span" style="font-style: italic;">User.Holdings.Add(...)</span><br /><br />Instead, I need to write a method to do this for me:<br /><br /><span class="Apple-style-span" style="font-style: italic;">User.AddHolding(...)</span><br /><br />This is easy to write for my server-side domain, but if I intend the same features to be available on the client I must tell RIA Services about those intentions and so allow it to generate the appropriate client-side code.<ol><li>Ensure the class with shared features is partial<br /><br /></li><li>Put the shared code in a partial segment stored in a code file called <span class="Apple-style-span" style="font-style: italic;">MyClass.shared.cs<br /><br /></span></li><li>Decorate the shared methods with the [Shared] attribute<br /><br /></li></ol>Here is the code for the shared AddHolding method held in the <span 
class="Apple-style-span" style="font-style: italic;">User.shared.cs</span> file.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> User</pre><pre><span class="lnum"> 2: </span>{</pre><pre class="alt"><span class="lnum"> 3: </span> [Shared]</pre><pre><span class="lnum"> 4: </span> <span class="kwrd">public</span> Holding AddHolding(Holding holding)</pre><pre class="alt"><span class="lnum"> 5: </span> {</pre><pre><span class="lnum"> 6: </span> <span class="kwrd">this</span>.Holdings.Add(holding);</pre><pre class="alt"><span class="lnum"> 7: </span> <span class="kwrd">return</span> holding;</pre><pre><span class="lnum"> 8: </span> }</pre><pre class="alt"><span class="lnum"> 9: </span>}</pre></div><strong><br />More Shared Code</strong><br />When I create a new Holding I would prefer to use a <a href="http://en.wikipedia.org/wiki/Factory_method">factory method</a> found in my DomainFactory class. This is a useful method so I want it to be available on the client as well as the server. 
As it happens the Factory class also contains a number of methods I would like to share, so instead of creating a partial class and sharing out individual methods as before I can just share the entire Factory class.<br /><br />The following code is held in a file <span class="Apple-style-span" style="font-style: italic;">DomainFactory.shared.cs<br /><br /></span><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span>[Shared]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">class</span> DomainFactory</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> </pre><pre class="alt"><span class="lnum"> 5: </span> [Shared]</pre><pre><span class="lnum"> 6: </span> <span class="kwrd">public</span> <span class="kwrd">static</span> User User(<span class="kwrd">string</span> name, <span class="kwrd">string</span> password)</pre><pre class="alt"><span class="lnum"> 7: </span> {</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">return</span> <span class="kwrd">new</span> User</pre><pre class="alt"><span class="lnum"> 9: </span> {</pre><pre><span class="lnum"> 10: </span> Id = Guid.NewGuid(),</pre><pre class="alt"><span class="lnum"> 11: </span> Name = name,</pre><pre><span class="lnum"> 12: </span> Password = password,</pre><pre class="alt"><span class="lnum"> 13: </span> };</pre><pre><span class="lnum"> 14: </span> }</pre><pre class="alt"><span class="lnum"> 15: </span> </pre><pre><span class="lnum"> 16: </span> [Shared]</pre><pre class="alt"><span class="lnum"> 17: </span> <span class="kwrd">public</span> <span class="kwrd">static</span> Holding Holding(User user, <span class="kwrd">string</span> symbol)</pre><pre><span class="lnum"> 18: </span> {</pre><pre class="alt"><span class="lnum"> 19: </span> <span class="kwrd">return</span> <span class="kwrd">new</span> Holding</pre><pre><span class="lnum"> 20: </span> {</pre><pre class="alt"><span class="lnum"> 21: </span> Id = 
Guid.NewGuid(),</pre><pre><span class="lnum"> 22: </span> UserId = user.Id,</pre><pre class="alt"><span class="lnum"> 23: </span> Symbol = symbol</pre><pre><span class="lnum"> 24: </span> };</pre><pre class="alt"><span class="lnum"> 25: </span> }</pre><pre><span class="lnum"> 26: </span> </pre><pre class="alt"><span class="lnum"> 27: </span> [Shared]</pre><pre><span class="lnum"> 28: </span> <span class="kwrd">public</span> <span class="kwrd">static</span> Transaction Transaction(Holding holding, TransactionType type, <span class="kwrd">int</span> quantity, <span class="kwrd">decimal</span> price)</pre><pre class="alt"><span class="lnum"> 29: </span> {</pre><pre><span class="lnum"> 30: </span> <span class="kwrd">return</span> <span class="kwrd">new</span> Transaction</pre><pre class="alt"><span class="lnum"> 31: </span> {</pre><pre><span class="lnum"> 32: </span> Id = Guid.NewGuid(),</pre><pre class="alt"><span class="lnum"> 33: </span> HoldingId = holding.Id,</pre><pre><span class="lnum"> 34: </span> Type = type,</pre><pre class="alt"><span class="lnum"> 35: </span> Quantity = quantity,</pre><pre><span class="lnum"> 36: </span> Price = price</pre><pre class="alt"><span class="lnum"> 37: </span> };</pre><pre><span class="lnum"> 38: </span> }</pre><pre class="alt"><span class="lnum"> 39: </span>}</pre></div><strong><br />Some Client-Side Code</strong><br />Now we have informed RIA about our intentions it is time to see some client-side code that shows the resulting RIA generated client domain in use. 
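With the shared factory and the shared AddHolding method in place, client code can grow the object graph exactly as server code would, then push everything back in one batch. The following snippet is hypothetical (the symbol, figures and enum value are invented):

```csharp
// Hypothetical client-side usage (values invented): build new entities
// with the shared DomainFactory, attach them through the shared domain
// method, then submit all pending changes as a single unit of work.
var holding = user.AddHolding(DomainFactory.Holding(user, "ACME"));
holding.Transactions.Add(DomainFactory.Transaction(holding, TransactionType.Buy, 100, 12.34m));
this._dataStore.SubmitChanges(); // batched round trip; raises Submitted when done
```

SubmitChanges batches every pending insert, update and delete into a single asynchronous round trip to the server. 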
This code is taken from the silverlight application that accompanies the web application.<br /><br />First of all, here is the code that does some setup and then the initial fetch for the User.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span><span class="kwrd">public</span> HomePage()</pre><pre><span class="lnum"> 2: </span>{</pre><pre class="alt"><span class="lnum"> 3: </span> <span class="kwrd">this</span>.InitializeComponent();</pre><pre><span class="lnum"> 4: </span> <span class="kwrd">this</span>._dataStore.Submitted += <span class="kwrd">this</span>.DataStoreSubmitted;</pre><pre class="alt"><span class="lnum"> 5: </span> <span class="kwrd">this</span>._dataStore.Loaded += <span class="kwrd">this</span>.DataStoreLoaded;</pre><pre><span class="lnum"> 6: </span> <span class="kwrd">this</span>._dataStore.LoadUser(<span class="str">"biofractal"</span>, <span class="str">"x"</span>, <span class="kwrd">null</span>, <span class="str">"LoadUser"</span>);</pre><pre class="alt"><span class="lnum"> 7: </span> <span class="kwrd">this</span>.Holdings.SelectionChanged += <span class="kwrd">this</span>.Holdings_SelectionChanged;</pre><pre><span class="lnum"> 8: </span>}<span class="lnum"> </span> </pre><pre><span class="lnum"> 12: </span><span class="kwrd">private</span> <span class="kwrd">void</span> DataStoreLoaded(<span class="kwrd">object</span> sender, LoadedDataEventArgs e)</pre><pre class="alt"><span class="lnum"> 13: </span>{</pre><pre><span class="lnum"> 14: </span> var userState = e.UserState;</pre><pre class="alt"><span class="lnum"> 15: </span> <span class="kwrd">if</span>(userState==<span class="kwrd">null</span>)</pre><pre><span class="lnum"> 16: </span> {</pre><pre class="alt"><span class="lnum"> 17: </span> <span class="kwrd">return</span>;</pre><pre><span class="lnum"> 18: </span> }</pre><pre class="alt"><span class="lnum"> 19: </span> <span class="kwrd">switch</span> (userState.ToString())</pre><pre><span class="lnum"> 20: 
</span> {</pre><pre class="alt"><span class="lnum"> 21: </span> <span class="kwrd">case</span> <span class="str">"LoadUser"</span>:</pre><pre><span class="lnum"> 22: </span> var user = e.LoadedEntities.First() <span class="kwrd">as</span> User;</pre><pre class="alt"><span class="lnum"> 23: </span> <span class="kwrd">this</span>.User.DataContext = user;</pre><pre><span class="lnum"> 24: </span> <span class="kwrd">this</span>.Holdings.ItemsSource = user.Holdings;</pre><pre class="alt"><span class="lnum"> 25: </span> <span class="kwrd">break</span>;</pre><pre><span class="lnum"> 26: </span> }</pre><pre class="alt"><span class="lnum"> 27: </span>}</pre></div><br />The _dataStore variable references an instance of the DataStore class which is derived from the RIA client-side <span class="Apple-style-span" style="font-style: italic;">DomainContext</span> class. This class is auto-generated by RIA Services. It is the primary RIA generated artefact.<br /><br />The <span class="Apple-style-span" style="font-style: italic;">DataStore.LoadUser()</span> calls the <span class="Apple-style-span" style="font-style: italic;">GetUser()</span> service method on the server. This is an asynchronous service call so the return must be caught in the <span class="Apple-style-span" style="font-style: italic;">DataStore.Loaded()</span> event handler. Here the silverlight controls can be data-bound to their data sources and, because the User.Holdings collection was decorated with the [Include] attribute, RIA will ensure that it is automatically fetched. Using the Holdings collection as a binding data source will therefore display the correct list of Holdings for the current User without requiring an explicit fetch.<br /><br /><strong>Lazy Loading the Transactions</strong><br />In contrast to the User.Holdings collection, the Holding.Transactions collection is not automatically loaded when the User is initially fetched. 
Instead the client-side domain behaviour requires that the Transactions collection is lazy-loaded on demand. How is this achieved using RIA Services?<br /><br />As before, the metadata is used to inform RIA of our intentions. The [Association] attribute is again used to decorate the collection definition in a partial class segment held in a distinct code file (<span class="Apple-style-span" style="font-style: italic;">Holding.meta.cs</span>). However, this time there is no [Include] attribute.<br /><br /></div><div><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span>[MetadataType(<span class="kwrd">typeof</span> (HoldingMetadata))]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> Holding</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> <span class="kwrd">internal</span> <span class="kwrd">sealed</span> <span class="kwrd">class</span> HoldingMetadata</pre><pre class="alt"><span class="lnum"> 5: </span> {</pre><pre><span class="lnum"> 6: </span> <span class="preproc">#region</span> Properties</pre><pre class="alt"><span class="lnum"> 7: </span></pre><pre><span class="lnum"> 8: </span> [Association(<span class="str">"Holding_Transactions"</span>, <span class="str">"Id"</span>, <span class="str">"HoldingId"</span>)]</pre><pre class="alt"><span class="lnum"> 9: </span> <span class="kwrd">public</span> List<Transaction> Transactions { get; set; }</pre><pre><span class="lnum"> 10: </span> </pre><pre class="alt"><span class="lnum"> 11: </span> <span class="preproc">#endregion</span></pre><pre><span class="lnum"> 12: </span> }</pre><pre class="alt"><span class="lnum"> 13: </span>}</pre></div><br />As a result, RIA Services will generate the appropriate client-side code for manipulating Transactions. However, because there is no [Include] attribute, RIA will not automatically fetch the members of a Transactions collection when 
its parent Holding is instantiated.<br /><br />To manually load a list of Transactions it is necessary to write a parameterized server-side service method to perform the datastore lookup.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span>[EnableClientAccess]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">class</span> DataStore : DomainService</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> ... other code</pre><pre class="alt"><span class="lnum"> 5: </span> </pre><pre><span class="lnum"> 6: </span> <span class="kwrd">public</span> IQueryable<Transaction> GetTransactionsForHolding(Guid holdingId)</pre><pre class="alt"><span class="lnum"> 7: </span> {</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">using</span> (var db = DataService.GetSession<Transaction>())</pre><pre class="alt"><span class="lnum"> 9: </span> {</pre><pre><span class="lnum"> 10: </span> <span class="kwrd">return</span> db.GetList(x => x.HoldingId.Equals(holdingId)).AsQueryable();</pre><pre class="alt"><span class="lnum"> 11: </span> }</pre><pre><span class="lnum"> 12: </span> }</pre><pre class="alt"><span class="lnum"> 13: </span>}</pre></div><br />The<span class="Apple-style-span" style="font-style: italic;"> GetTransactionsForHolding(...)</span> method is scanned by RIA Services causing it to generate a client-side equivalent method on the DataStore class. This can then be used in client-side code to fetch a set of Transactions belonging to a specified Holding. The code below shows this happening. 
The call is being made within the SelectionChanged event of the Accordion control.<br /><br /><div class="csharpcode"><pre class="alt"><span class="lnum"> 1: </span><span class="kwrd">private</span> <span class="kwrd">void</span> Holdings_SelectionChanged(<span class="kwrd">object</span> sender, SelectionChangedEventArgs e)</pre><pre><span class="lnum"> 2: </span>{</pre><pre class="alt"><span class="lnum"> 3: </span> <span class="kwrd">if</span> (e.AddedItems.Count == 0)</pre><pre><span class="lnum"> 4: </span> {</pre><pre class="alt"><span class="lnum"> 5: </span> <span class="kwrd">return</span>;</pre><pre><span class="lnum"> 6: </span> }</pre><pre class="alt"><span class="lnum"> 7: </span> var holding = e.AddedItems[0] <span class="kwrd">as</span> Holding;</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">if</span> (holding == <span class="kwrd">null</span> || holding.Transactions.Count > 0)</pre><pre class="alt"><span class="lnum"> 9: </span> {</pre><pre><span class="lnum"> 10: </span> <span class="kwrd">return</span>;</pre><pre class="alt"><span class="lnum"> 11: </span> }</pre><pre><span class="lnum"> 12: </span> <span class="kwrd">this</span>._dataStore.LoadTransactionsForHolding(holding.Id);</pre><pre class="alt"><span class="lnum"> 13: </span>}</pre></div><br />When an Accordion item is opened by a user click it fires the SelectionChanged event above. The newly selected Holding is extracted from the Accordion and its Holding.Id is passed into the RIA generated <span class="Apple-style-span" style="font-style: italic;">LoadTransactionsForHolding(...)</span> method. This automatically calls the <span class="Apple-style-span" style="font-style: italic;">GetTransactionsForHolding(...)</span> service method which returns the appropriate list of Transactions for the specified Holding.Id.<br /><br />Where do these Transactions go? 
How is it that simply calling this method automatically fills the correct Holding.Transactions collection and displays that collection in the data-bound Accordion?<br /><br />The returned Transactions are loaded into a flat list generated and maintained by RIA Services. When a Holding.Transactions collection is requested, RIA will dynamically create and return the correct list of Transactions as a consequence of the information specified in the [Association] attribute. This is why each Transaction needs a HoldingId and each Holding a UserId. Finally, because RIA-generated collections are <a href="http://weblogs.asp.net/joelvarty/archive/2008/11/17/silverlight-databinding-the-observable-collection.aspx">ObservableCollections</a>, changes automatically prompt any data-bound containers to refresh themselves.<br /><br />This means that a call to the <span class="Apple-style-span" style="font-style: italic;">LoadTransactionsForHolding()</span> method will set off a chain of events that results in the lazy-loading of the selected list of Holding.Transactions and its subsequent display in the newly expanded Accordion item.<br /></div></div></div><br /><strong>Creating and Saving Domain Instances</strong><br />RIA Services makes creating and saving new domain instances particularly easy. Once again the process begins with a statement of intention. This time RIA must be informed of our intention to add new Holdings to the User.Holdings collection and new Transactions to the Holding.Transactions collection. This is achieved by adding service methods whose signatures follow the convention shown below. 
<br /><br /><div class="csharpcode"><br /><pre class="alt"><span class="lnum"> 1: </span>[EnableClientAccess]</pre><pre><span class="lnum"> 2: </span><span class="kwrd">public</span> <span class="kwrd">class</span> DataStore : DomainService</pre><pre class="alt"><span class="lnum"> 3: </span>{</pre><pre><span class="lnum"> 4: </span> ...other code</pre><pre class="alt"><span class="lnum"> 5: </span> </pre><pre><span class="lnum"> 6: </span> <span class="kwrd">public</span> <span class="kwrd">void</span> CreateHolding(Holding holding)</pre><pre class="alt"><span class="lnum"> 7: </span> {</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">using</span> (var db = DataService.GetSession<User>())</pre><pre class="alt"><span class="lnum"> 9: </span> {</pre><pre><span class="lnum"> 10: </span> var user = db.GetFirst(x => x.Id.Equals(holding.UserId));</pre><pre class="alt"><span class="lnum"> 11: </span> user.AddHolding(holding);</pre><pre><span class="lnum"> 12: </span> db.Save(user);</pre><pre class="alt"><span class="lnum"> 13: </span> }</pre><pre><span class="lnum"> 14: </span> }</pre><pre class="alt"><span class="lnum"> 15: </span> </pre><pre><span class="lnum"> 16: </span> <span class="kwrd">public</span> <span class="kwrd">void</span> CreateTransaction(Transaction transaction)</pre><pre class="alt"><span class="lnum"> 17: </span> {</pre><pre><span class="lnum"> 18: </span> <span class="kwrd">using</span> (var db = DataService.GetSession<Holding>())</pre><pre class="alt"><span class="lnum"> 19: </span> {</pre><pre><span class="lnum"> 20: </span> var holding = db.GetFirst(x => x.Id.Equals(transaction.HoldingId));</pre><pre class="alt"><span class="lnum"> 21: </span> holding.AddTransaction(transaction);</pre><pre><span class="lnum"> 22: </span> db.Save(holding);</pre><pre class="alt"><span class="lnum"> 23: </span> }</pre><pre><span class="lnum"> 24: </span> }</pre><pre class="alt"><span class="lnum"> 25: </span>}</pre></div><br />Adding these service methods 
tells RIA that we intend to add new Holdings and Transactions via client-side code. Without these methods any attempt to add an item will result in a runtime error. For example, if the <span class="Apple-style-span" style="font-style: italic;">CreateHolding()</span> method above is commented out and a new Holding is added to the User.Holdings collection via client-side code, the following error is displayed.<br /><br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_kbCCKz8F0ok/SgGowVmm1-I/AAAAAAAAAC8/as4mwtKRMSY/s1600-h/edit_error.png"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 235px;" src="http://3.bp.blogspot.com/_kbCCKz8F0ok/SgGowVmm1-I/AAAAAAAAAC8/as4mwtKRMSY/s400/edit_error.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5332728982169638882" /></a><br /><br /><span class="Apple-style-span" style="font-weight: bold;">Serializing New Entities</span><br /><div>Domain entities added on the client are not automatically serialized to the server-side data-store. Instead RIA services keeps track of the changes you have made so that when a save is requested only the changes are submitted for server-side serialization. 
</div><div><br /></div><div>This is a good example of the <a href="http://martinfowler.com/eaaCatalog/unitOfWork.html">Unit of Work</a> pattern. In this way RIA minimises traffic over the wire and gives you much more flexibility to roll back or cancel changes, save on demand, or save automatically at timed intervals.<br /><br />The following code shows how to add and save new domain items.<br /><br /><div class="csharpcode"><br /><pre class="alt"><span class="lnum"> 1: </span><span class="kwrd">public</span> <span class="kwrd">partial</span> <span class="kwrd">class</span> HomePage : Page</pre><pre><span class="lnum"> 2: </span>{</pre><pre class="alt"><span class="lnum"> 3: </span> <span class="kwrd">private</span> <span class="kwrd">readonly</span> DataStore _dataStore = <span class="kwrd">new</span> DataStore();</pre><pre><span class="lnum"> 4: </span> <span class="kwrd">private</span> ProgressDialog _progressDialog;</pre><pre class="alt"><span class="lnum"> 5: </span> </pre><pre><span class="lnum"> 6: </span> <span class="kwrd">public</span> HomePage()</pre><pre class="alt"><span class="lnum"> 7: </span> {</pre><pre><span class="lnum"> 8: </span> <span class="kwrd">this</span>.InitializeComponent();</pre><pre class="alt"><span class="lnum"> 9: </span> <span class="kwrd">this</span>._dataStore.Submitted += <span class="kwrd">this</span>.DataStoreSubmitted;</pre><pre><span class="lnum"> 10: </span> </pre><pre class="alt"><span class="lnum"> 11: </span> ...other code</pre><pre><span class="lnum"> 12: </span> }</pre><pre class="alt"><span class="lnum"> 13: </span> </pre><pre><span class="lnum"> 14: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> ShowProgressDialog(<span class="kwrd">string</span> message)</pre><pre class="alt"><span class="lnum"> 15: </span> {</pre><pre><span class="lnum"> 16: </span> <span class="kwrd">this</span>._progressDialog = <span class="kwrd">new</span> 
ProgressDialog(message);</pre><pre class="alt"><span class="lnum"> 17: </span> <span class="kwrd">this</span>._progressDialog.Show();</pre><pre><span class="lnum"> 18: </span> }</pre><pre class="alt"><span class="lnum"> 19: </span> </pre><pre><span class="lnum"> 20: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> DataStoreSubmitted(<span class="kwrd">object</span> sender, SubmittedChangesEventArgs e)</pre><pre class="alt"><span class="lnum"> 21: </span> { </pre><pre><span class="lnum"> 22: </span> <span class="kwrd">if</span> (e.EntitiesInError.Count() != 0)</pre><pre class="alt"><span class="lnum"> 23: </span> {</pre><pre><span class="lnum"> 24: </span> <span class="kwrd">this</span>._progressDialog.ShowError();</pre><pre class="alt"><span class="lnum"> 25: </span> }</pre><pre><span class="lnum"> 26: </span> <span class="kwrd">else</span></pre><pre class="alt"><span class="lnum"> 27: </span> {</pre><pre><span class="lnum"> 28: </span> <span class="kwrd">this</span>._progressDialog.Close();</pre><pre class="alt"><span class="lnum"> 29: </span> }</pre><pre><span class="lnum"> 30: </span> }</pre><pre class="alt"><span class="lnum"> 31: </span> </pre><pre><span class="lnum"> 32: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> SubmitChanges_Click(<span class="kwrd">object</span> sender, RoutedEventArgs e)</pre><pre class="alt"><span class="lnum"> 33: </span> {</pre><pre><span class="lnum"> 34: </span> <span class="kwrd">this</span>.ShowProgressDialog(<span class="str">"Saving Changes..."</span>);</pre><pre class="alt"><span class="lnum"> 35: </span> <span class="kwrd">this</span>._dataStore.SubmitChanges();</pre><pre><span class="lnum"> 36: </span> }</pre><pre class="alt"><span class="lnum"> 37: </span> </pre><pre><span class="lnum"> 38: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> NewHolding_Click(<span class="kwrd">object</span> sender, RoutedEventArgs e)</pre><pre class="alt"><span 
class="lnum"> 39: </span> {</pre><pre><span class="lnum"> 40: </span> var user = ((User) <span class="kwrd">this</span>.User.DataContext);</pre><pre class="alt"><span class="lnum"> 41: </span> <span class="kwrd">if</span> (user == <span class="kwrd">null</span>)</pre><pre><span class="lnum"> 42: </span> {</pre><pre class="alt"><span class="lnum"> 43: </span> <span class="kwrd">return</span>;</pre><pre><span class="lnum"> 44: </span> }</pre><pre class="alt"><span class="lnum"> 45: </span> user.Holdings.Add(DomainFactory.Holding(user, NewHoldingSymbol.Text));</pre><pre><span class="lnum"> 46: </span> <span class="kwrd">this</span>.Holdings.SelectedItem = <span class="kwrd">this</span>.Holdings.Items[<span class="kwrd">this</span>.Holdings.Items.Count-1];</pre><pre class="alt"><span class="lnum"> 47: </span> }</pre><pre class="alt"><span class="lnum"> 49: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> Buy_Click(<span class="kwrd">object</span> sender, RoutedEventArgs e)</pre><pre><span class="lnum"> 50: </span> {</pre><pre class="alt"><span class="lnum"> 51: </span> var holding = ((Button) e.OriginalSource).DataContext <span class="kwrd">as</span> Holding;</pre><pre><span class="lnum"> 52: </span> <span class="kwrd">if</span> (holding == <span class="kwrd">null</span>)</pre><pre class="alt"><span class="lnum"> 53: </span> {</pre><pre><span class="lnum"> 54: </span> <span class="kwrd">return</span>;</pre><pre class="alt"><span class="lnum"> 55: </span> }</pre><pre><span class="lnum"> 56: </span> holding.AddTransaction(DomainFactory.Transaction(holding, TransactionType.Buy, 42, 0.42m));</pre><pre class="alt"><span class="lnum"> 57: </span> }</pre><pre><span class="lnum"> 58: </span> </pre><pre class="alt"><span class="lnum"> 59: </span> <span class="kwrd">private</span> <span class="kwrd">void</span> Sell_Click(<span class="kwrd">object</span> sender, RoutedEventArgs e)</pre><pre><span class="lnum"> 60: </span> {</pre><pre class="alt"><span 
class="lnum"> 61: </span> var holding = ((Button) e.OriginalSource).DataContext <span class="kwrd">as</span> Holding;</pre><pre><span class="lnum"> 62: </span> <span class="kwrd">if</span> (holding == <span class="kwrd">null</span>)</pre><pre class="alt"><span class="lnum"> 63: </span> {</pre><pre><span class="lnum"> 64: </span> <span class="kwrd">return</span>;</pre><pre class="alt"><span class="lnum"> 65: </span> }</pre><pre><span class="lnum"> 66: </span> holding.AddTransaction(DomainFactory.Transaction(holding, TransactionType.Sell, 42, 0.42m));</pre><pre class="alt"><span class="lnum"> 67: </span> }</pre><pre><span class="lnum"> 68: </span> </pre><pre class="alt"><span class="lnum"> 69: </span> ...other code</pre><pre><span class="lnum"> 70: </span> </pre><pre class="alt"><span class="lnum"> 71: </span>}</pre></div><br />This code shows how to add new domain items to their correct location in the domain hierarchy using the <span class="Apple-style-span" style="font-style: italic;">shared </span>DomainFactory class discussed earlier. These changes are then asynchronously submitted as a batched <span class="Apple-style-span" style="">unit of work</span> to the server, displaying a progress dialog to keep the user informed. The return is trapped so that the progress dialog can be dismissed and any errors displayed.<br /><br /><strong>The Verdict</strong><br />How did RIA Services and DB4O manage?</div><div><div><ul><li><span><span><span class="Apple-style-span" style="font-style: italic;">Server - When I fetch an instance of the aggregate root class I expect its inner hierarchy to be eagerly fetched.</span><br /><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-style: normal; ">The server-side domain de/serialisation behaviour was handled by DB4O. 
As an object database, DB4O makes it simple to create this behaviour using a few lines of initialisation code.<br /><span class="Apple-style-span" style="font-style: italic;"><br /></span></span></span></span></span></li><li><span class="Apple-style-span" style="font-style: italic;">Client - I want certain collections to be lazy-loaded and so remain unloaded until they are requested.</span><br />RIA Services provides a set of attributes that allow both eager and lazy loading to be specified as client-side behaviour and wired up with minimal code.<br /><br /></li><li><span class="Apple-style-span" style=""></span><span><span><span class="Apple-style-span" style="font-style: italic;">I do not expect to write my own WCF Service nor do I want to write any Data Transfer Objects (DTOs).</span></span></span><span class="Apple-style-span" style="font-style: italic;"></span><br />RIA Services replaces the explicit WCF layer with an implicit data transfer layer via its DomainService class and the data manipulation methods you write to extend it. <br /><br /></li><li><span class="Apple-style-span" style="font-style: italic;">I want to databind my domain entities to Silverlight controls. I expect the controls to correctly display my eagerly fetched data as well as handling lazy-loaded data.</span><br />Because RIA Services generates its own observable collections, the Silverlight databinding flows smoothly with little intervention. 
The lazy loading of new data stimulates the Silverlight-bound controls to refresh and so display changes as they occur.<br /><br /></li><li><span><span><span class="Apple-style-span" style="font-style: italic;">Finally, I want to prove that new domain entities can be created on the client and efficiently serialized to the server-side data-store as a batched unit-of-work.</span></span></span><br />RIA Services implements a Unit of Work pattern that allows only those items that have been changed to be batched and serialized to the server-side data-store when required.<br /></li></ul></div><br />I think that RIA Services plus DB4O performed well in handling the demands of my simple Line of Business Rich Internet Application. I would certainly recommend you try it out for yourself to see what you think. Good luck.<div><br /></div></div></div></div></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com5tag:blogger.com,1999:blog-6927558.post-51102332546351823132009-03-31T20:54:00.024+01:002011-05-18T17:19:49.633+01:00A Domain-Driven, DB4O Silverlight RIA<div><span class="Apple-style-span" style="font-style: italic; "><span class="Apple-style-span" style="font-size:large;">Build a </span><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">Domain Driven</span></span><span class="Apple-style-span" style="font-size:large;"> Rich Internet Application using Silverlight, RIA Services and DB4O</span></span></div><div style="text-align: center;"><br /><ul><li style="text-align: left;"><a href="http://www.assembla.com/code/biofractal/subversion/nodes/trunk/Blog/DomainDrivenRIA/GettingStarted">Download the Example Solution for Silverlight 3</a><br /></li><br /><li style="text-align: left;"><a href="http://cid-9803e97b07345d0c.office.live.com/self.aspx/.Public/DomainDrivenRIA-GettingStarted.zip">Download the 
Example Solution for Silverlight 4</a><br /></li><br /></ul></div>I was <a href="http://biofractal.blogspot.com/2009/03/silverlight-and-object-databases.html">recently ranting</a> about Silverlight 2 and my annoyance with the WCF layer needed to serialize object instance data to the server. Well, Microsoft must have had their mind-reading machines turned up high because the newly announced Mix09 Silverlight 3 & RIA Services preview has solved nearly all the problems I was having.<div><br /><div>It's the RIA (<a href="http://en.wikipedia.org/wiki/Rich_Internet_application">Rich Internet Application</a>) Services that has the real wow-factor. This is Silverlight finally growing up. Instead of creating Frankenstein client-server apps crudely stitched together with WCF, RIA Services allows you to treat your client and server as one, almost seamless application. You share domain design <span class="Apple-style-span" style="font-style: italic;">intentions</span> between the client and the server so that your domain acts as you intended regardless of which side of the machine boundary you are on. This is how RIA development was always meant to be. </div><div><br /></div><div>You can download all the software you need from the <a href="http://silverlight.net/getstarted/silverlight3/default.aspx">main Silverlight 3 site</a>. I found the best way to get started was to watch the following Mix09 presentations and then read the <a href="http://download.microsoft.com/download/F/B/8/FB8CA635-296B-487F-965C-8148F08B5319/riaservicesoverviewpreview.pdf">RIA Services Overview</a>.</div><div><ul><li><a href="http://videos.visitmix.com/MIX09/T40F">Building Amazing Business Centric Applications with Microsoft Silverlight 3</a><br /><span class="Apple-style-span" style="font-style: italic;">"Come hear how simple it is to build end-to-end data-intensive Silverlight applications with the new set of features in Silverlight 3 and .NET RIA Services. 
Explore Silverlight improvements that help to enable rapid development for business applications and to make your development process more productive"</span><br /><br /></li><li><a href="http://videos.visitmix.com/MIX09/T41F">Building Data-Driven Applications in ASP.NET and Silverlight</a><br /><span class="Apple-style-span" style="font-style: italic;">"Learn how Microsoft is simplifying the traditional n-tier application pattern by bringing together ASP.NET and Silverlight. Learn about patterns for working with data, implementing reusable and independently testable application logic, and application services that readily scale with growing requirements"</span></li></ul></div><div><div>Of course Microsoft just could not help plugging their dystopian Data-Driven-Design vision. Look at the title of that second presentation for goodness' sake. </div><div><br /></div><div>To counter this, and to provide a solid Domain-Driven template for future Silverlight RIA apps, I have created an example that does away with the monolithic SqlServer and Igor, the Entity Framework, in favour of a light, fast object database repository that promotes good Code Cohesion, Separation of Concerns and Inversion of Control.</div><div><br /></div><div>Here is the tech I use in the example:</div><div><ul><li><a href="http://silverlight.net/getstarted/silverlight3/default.aspx">Silverlight 3</a><br /></li><li><a href="http://www.nikhilk.net/RIA-Services-MIX09.aspx">RIA Services</a><br /></li><li><a href="http://www.db4o.com/">DB4O</a><br /></li><li><a href="http://structuremap.sourceforge.net/QuickStart.htm">StructureMap</a></li></ul></div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">An Overview of the Domain Driven RIA Example</span></span></div><div>This example does just enough to highlight the primary functions of RIA Services as I see them.</div><div><ol><li>You can define and use your domain objects 
on the server and then effectively re-use domain logic on the client. <br /><br /></li><li>You can transmit domain objects between the client and server without polluting your domain or requiring an additional DTO transformation layer.<br /></li></ol></div><img src="http://4.bp.blogspot.com/_kbCCKz8F0ok/SdKBCzuYM8I/AAAAAAAAAB0/QzZO6l2SnRI/s400/SolutionExplorer.gif" style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 177px; height: 400px;" border="0" alt="" id="BLOGGER_PHOTO_ID_5319455995122430914" /><div>The example is a stripped-down version of an app I am currently writing, so I have left in all the domain structure even if it is not actually used in the example. This makes the application ready to go if you want to start fleshing it out with your own code.<br /></div><div><br /></div><div style="text-align: left;">The functionality is quite simple. You are presented with a Silverlight 3 navigation application. There is a sign-in link that takes you to a view containing username and password fields plus sign-in and register buttons. </div><div style="text-align: left;"><br /></div><div>The code works as you would expect, with the extra that typing into the <span class="Apple-style-span" style="font-style: italic; "><span class="Apple-style-span" style="font-weight: bold; ">Name</span></span><span class="Apple-style-span" style="font-weight: bold; "> </span>field triggers a 1-second timer that will check (on the server) that the name is unique. Clicking the <span class="Apple-style-span" style="font-style: italic; "><span class="Apple-style-span" style="font-weight: bold; ">Register</span> </span>button creates a new user in the server-side DB4O database. 
Clicking the <span class="Apple-style-span" style="font-style: italic; "><span class="Apple-style-span" style="font-weight: bold; ">Sign In</span></span> button checks the db for the supplied credentials and, if they exist, will return the appropriate User instance and cleverly navigate you back to the home view. <br /></div><div style="text-align: center;"><br /></div><div>If you have watched the videos you might at this point think I have cheated and used the built-in Authentication Domain Service that comes with RIA. Not so. That service relies on having a fat SQLServer file squatting in your solution and that is no good for a Domain-Driven purist. Instead I have implemented a very simple custom identity authentication that could easily be linked up to either Forms or Windows authentication in the usual way.<br /></div><div style="text-align: center;"><br /></div><img src="http://2.bp.blogspot.com/_kbCCKz8F0ok/SdKDJtP_-JI/AAAAAAAAAB8/gmhzySwZQjE/s400/Sign-In.gif" style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 266px;" border="0" alt="" id="BLOGGER_PHOTO_ID_5319458312666740882" /><div style="text-align: center;"><br /></div><div style="text-align: center;"><a href="http://www.assembla.com/code/biofractal/subversion/nodes/trunk/Blog/DomainDrivenRIA/GettingStarted">Download the Example Solution From Assembla</a><br /></div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">NB:</span> If you get an error about the URI Prefix then you need to reset the startup application to <span class="Apple-style-span" style="font-style: italic;">DomainDrivenRIA.Web</span> and the startup page to <span class="Apple-style-span" style="font-style: italic;">DomainDrivenRIA.SilverlightTestPage.aspx<br /></span></div><div><span class="Apple-style-span" style="font-style: italic;"><br 
/></span></div></div></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com18tag:blogger.com,1999:blog-6927558.post-48888227704540177782009-03-17T20:58:00.046+00:002010-08-27T16:41:18.406+01:00Convert dotnet assemblies to Silverlight<div><span style="font-size:130%;">The SilverLighter</span></div><div><ul><li><span class="Apple-style-span" style=" "><span class="Apple-style-span" style="font-size:small;"><a href="https://www.assembla.com/code/biofractal/subversion/nodes/trunk/Blog/Silverlighter/Silverlighter.Setup/Debug">Download Installer</a></span></span><span class="Apple-style-span" style="font-size:small;"><br /></span></li><li><span class="Apple-style-span" style=" "><span class="Apple-style-span" style="font-size:small;"><a href="https://www.assembla.com/code/biofractal/subversion/nodes/trunk/Blog/Silverlighter">Download Source</a></span></span></li><li><span class="Apple-style-span" style=" "><span class="Apple-style-span" style="font-size:small;"><a href="http://tortoisesvn.net/"><span class="Apple-style-span" style="font-size:small;">SVN</span></a><span class="Apple-style-span" style="font-size:small;"> http://subversion.assembla.com/svn/biofractal/trunk/Blog/Silverlighter</span></span></span></li></ul></div><div>I have recently been trying to convert an object database (<a href="http://www.db4o.com/">db4o</a> and <a href="http://www.neodatis.org/">NeoDatis</a>) for use in a Silverlight application. Along the way I discovered a few interesting things. For example, I found that although Visual Studio will not allow you to reference dotnet assemblies from a Silverlight application or class library, this restriction is actually a bit heavy-handed. 
Sometimes it is very useful to use a known Silverlight-safe dotnet assembly, as long as you take responsibility for your own actions.</div><div><br /><img src="http://4.bp.blogspot.com/_kbCCKz8F0ok/ScAPp0AuBKI/AAAAAAAAABc/CYHBsYfexnE/s320/Silverlighter_1.png" id="BLOGGER_PHOTO_ID_5314264771307898018" style="FLOAT: left; MARGIN: 0px 10px 10px 0px; WIDTH: 320px; CURSOR: hand; HEIGHT: 184px" alt="" border="0" />To enable the re-use of dotnet assemblies in Silverlight I wrote a handy WPF application called "The Silverlighter". This tool allows you to convert any dotnet assembly into a Silverlight assembly ready for use in your Silverlight applications or class libraries. </div><div><br />The knowledge I needed to write The Silverlighter was gleaned primarily from an excellent article by David Betz called <a href="http://www.netfxharmonics.com/2008/12/Reusing-NET-Assemblies-in-Silverlight">Reusing .NET Assemblies in Silverlight</a>. It clearly explains how the dotnet-to-Silverlight conversion process works and why it is not a hack, and it offers a fascinating insight into the similarities between the dotnet and Silverlight assemblies. It is well worth a read.</div><div><br />But before you get carried away (I know I did) and imagine that you are just a click away from re-using your favourite 3rd party assemblies, a word or two of caution. </div><div><br /></div><div>Unfortunately, just because you can reference an assembly from Silverlight does not mean it will work with Silverlight. If you have read David Betz's article you will know that Silverlight uses a distinct set of System.* assemblies (v 2.0.5.0). These assemblies do not contain all the features of their equivalent dotnet assemblies. 
For example, the following collection types are missing in Silverlight:</div><ul><li>ArrayList</li><li>Hashtable</li><li>SortedList</li><li>NameValueCollection</li></ul><div>Instead, Silverlight only allows generic collections to be used, which is a good thing unless your referenced ex-dotnet assembly happens to use one or more of the missing types, in which case your code will blow up with an error similar to "<span class="Apple-style-span" style="font-style: italic;">cannot load type ArrayList from assembly System 2.0.5.0</span>"<br /></div><br /><div>Replacing these missing collections with the generic equivalents (you can decompile most assembly code using <a href="http://www.red-gate.com/products/reflector/">Reflector</a>) is actually quite trivial; however, more serious problems are lurking. Silverlight is also missing core features such as:</div><ul><li><a href="http://www.lhotka.net/weblog/SilverlightSerializer.aspx">BinarySerializer</a> and the [Serializable] attribute</li><li>Threading.SetData</li><li><a href="http://dotnetslackers.com/Silverlight/re-90376_Silverlight_2_and_Sockets.aspx">TCP Socket related features</a></li></ul><div>You can see that Silverlight has removed or redesigned features that are security risks. Again, that is a good thing, until you try to re-use assemblies that use these features - then boom. 
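The collection swap mentioned above is usually mechanical. As a hedged sketch (the class and data are invented for illustration), a dotnet member built on the non-generic Hashtable ports to its generic equivalent like this:

```csharp
using System.Collections;
using System.Collections.Generic;

public static class CollectionPort
{
    // Original dotnet code: non-generic Hashtable, missing from Silverlight's System 2.0.5.0
    public static Hashtable LegacyAges()
    {
        var ages = new Hashtable();
        ages.Add("ann", 34);
        ages.Add("bob", 41);
        return ages;
    }

    // Silverlight-safe rewrite: the generic Dictionary<TKey, TValue>.
    // Similarly: ArrayList -> List<T>, NameValueCollection -> Dictionary<string, string>
    public static Dictionary<string, int> PortedAges()
    {
        var ages = new Dictionary<string, int>();
        ages.Add("ann", 34);
        ages.Add("bob", 41);
        return ages;
    }
}
```

The rewrite itself is this direct; the tedium is in finding every usage inside the decompiled IL.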
</div><p>Anyway, The Silverlighter app does some cool IL trickery, so if you have dotnet dlls you know are fine and just want to use them in Silverlight without any nonsense from Visual Studio, then it might be just the thing you need.</p><p><span style="font-size:130%;">Additional Notes</span></p><p><span style="font-size:100%;">There are a few options available to help tweak the functionality.</span></p><div><a href="http://4.bp.blogspot.com/_kbCCKz8F0ok/ScAXYFYRHiI/AAAAAAAAABk/rY1v3UeHQxU/s1600-h/Silverlighter_2.png"><img id="BLOGGER_PHOTO_ID_5314273262825446946" style="FLOAT: left; MARGIN: 0px 10px 10px 0px; WIDTH: 320px; CURSOR: hand; HEIGHT: 184px" alt="" src="http://4.bp.blogspot.com/_kbCCKz8F0ok/ScAXYFYRHiI/AAAAAAAAABk/rY1v3UeHQxU/s320/Silverlighter_2.png" border="0" /></a></div><br /><div>You can choose to convert only selected System assemblies, although it is recommended that you just leave them all selected unless you have a good reason not to.</div><div> </div><br /><div>The "Recursively process dependent assemblies" feature will, when checked, pick out references from the IL to non-System assemblies (your own or 3rd party references) and recursively convert these to Silverlight-compliant assemblies as well. The entire dependency tree will be processed in this way.<br /><br />Finally, the path to ILdasm.exe is exposed, just in case you have it at a different location on your system. If you don't have ILdasm.exe anywhere you can get it by installing the <a href="http://www.microsoft.com/downloads/details.aspx?FamilyID=fe6f2099-b7b4-4f47-a244-c96d69c35dec&displaylang=en">.NET Framework 2.0 Software Development Kit (SDK)</a></div><div><br /></div><div>If you notice any bugs or want to add new features then please feel free to check out the source code and make the updates. Just paste the SVN url into your <a href="http://tortoisesvn.net/">TortoiseSVN</a> repo-browser, check out the source code under subversion and you are off and running. 
Good luck.</div><div><br /></div><div><span class="Apple-style-span" style="font-size:small;">SVN URL = http://subversion.assembla.com/svn/biofractal/trunk/Blog/Silverlighter</span></div><div><br /></div><div><br /></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com14tag:blogger.com,1999:blog-6927558.post-23249701258255479522009-03-16T21:46:00.008+00:002009-04-01T08:47:40.515+01:00Silverlight and Object Databases<div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="color: rgb(204, 0, 0);">Stop Press</span></span>: Most of the issues below have now been resolved with the release of Silverlight-3 & RIA Services. See the following post for more details - <a href="http://biofractal.blogspot.com/2009/03/domain-driven-db4o-silverlight-3-ria.html">A Domain-Driven, DB4O Silverlight-3 RIA</a></div><div><br /></div><div>I recently decided to write myself a quick Silverlight application for a bit of fun. I wanted an app that could grab stock quotes, do a calculation of my current losses and maybe have some nice blue-gel buttons and what-not. A few evenings of light coding pleasure and a good break from all that architectural stuff. Naturally it did not quite work out that way.<br /><div><br /></div><div>I got a wireframe working pretty quickly but that was not good enough. I wanted to serialize my data so I could start to get some fancy Silverlight graphs with bouncy bars. So I found myself grabbing multiple quotes and using the data to instantiate domain objects in the Silverlight client. Now what? I needed to serialize these instances on the web server. So how do I do that?</div><br /><div>I had two options:</div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">1</span>. Use isolated storage on the client, then use Mesh to sync the data. The only problem with this approach is that the last sentence is the total extent of my knowledge. 
I want a slope, not a cliff face.</div><br /><div><span class="Apple-style-span" style="font-weight: bold;">2</span>. Use a good old database on the server, why not, and while I am about it let's make that an object database so we don't have to concern ourselves with all that old-fashioned NHibernate, ORM tosh (ah - how quickly I become intolerant).<br /></div><br /><div>I chose option #2. And right there it started to get annoying. What I really wanted to do was just use my ODB in the Silverlight client code as if I was coding normally. But of course the Silverlight sandbox had lots to say about that.</div><br /><div>To get around this you must write a WCF service layer. Call this an <span class="Apple-style-span" style="font-style: italic;">API</span> and you will feel much better because that makes it sound quite cool and computery. You need this comfort blanket because you are about to climb into a time machine and go back 10 years to when you were a script kiddie banging out ASP code. </div><div><br /></div><div>This WCF 'API' service is going to be the only means you have to serialize data and transmit it between the Silverlight client and the web server. The [ServiceContract] will require [OperationContract] methods to cover every possible action, just like writing all those CRUD sprocs for a relational DB. Of course you could break this jumbled, untestable mess into smaller WCF contracts but that is just more fantasy to cover the poor design, like putting prefixes on your sproc names so they group together to simulate a business layer.</div><div><br /></div><div>No, the real solution is to do away with the WCF data-layer before you even start. Now don't get me wrong, WCF is fine and very useful but I think it is an abuse to use it as a domain serialization interface. We just got rid of RDB DALs, so why create another DAL for SaaS?</div><br /><div>What is needed is a grown-up client/server relationship. That would be much better. 
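The WCF 'API' comfort blanket described above tends to look something like this. A hedged sketch only - the service, DTO and method names are invented - but it shows the one-operation-per-action shape that mirrors a pile of CRUD sprocs:

```csharp
using System.Runtime.Serialization;
using System.ServiceModel;

// A DTO whose only job is to carry domain state across the wire
[DataContract]
public class QuoteDto
{
    [DataMember] public string Symbol { get; set; }
    [DataMember] public decimal Price { get; set; }
}

// The 'API': every possible action needs its own operation on the contract
[ServiceContract]
public interface IQuoteService
{
    [OperationContract] void StoreQuote(QuoteDto quote);
    [OperationContract] QuoteDto GetQuote(string symbol);
    [OperationContract] void DeleteQuote(string symbol);
}
```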
Then, instead of writing a sprawling WCF thunking layer replete with DTO auto-mappers and all that junk paraphernalia, you get a nice generic client-centric solution with code like this:</div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"> myODB.Store(myObject);</span></div><div><span class="Apple-style-span" style="font-weight: bold;"> myODB.Commit();</span></div><div><br /></div><div>This focuses coding attention away from the boilerplate WCF data transformation layer and back on to the business-useful world of the Domain - where it belongs.</div><br /><div>By the way, you <span class="Apple-style-span" style="font-weight: bold;">can </span>use dotnet assemblies in a Silverlight project. It needs a few quick (but legitimate) 'fixes' to the IL to get them past the Visual Studio fascist guards. I am finishing off a tool that automates the IL work for you and makes it nice and easy to convert dotnet assemblies into Silverlight reference-ready assemblies. I will post it here soon.</div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="color: rgb(204, 0, 0);">Stop Press</span></span>: You can now download the IL Tool mentioned above. See the following blog post for more details - <a href="http://biofractal.blogspot.com/2009/03/convert-dotnet-assemblies-to.html">Convert dotnet assemblies to Silverlight</a></div><div><br /></div><div></div></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com0tag:blogger.com,1999:blog-6927558.post-16205231111539089702009-02-27T12:01:00.015+00:002009-03-04T12:12:36.935+00:00Generic Lists of Anonymous Type<p>[cross-posted to <a href="http://blog.stormid.com/archive/2009/02/27/generic-lists-of-anonymous-type.aspx">StormId blog</a>]</p> <p>Anonymous types can be very useful when you need a few transient classes for use in the middle of a process. 
</p> <p>Of course you could just write a class in the usual way but this can quickly clutter up your domain with class definitions that have little meaning beyond the scope of their transient use as part of another process. </p> <p>For example, I often use anonymous types when I am generating reports from my domain. The snippet below shows me using an anonymous type to store data values that I have collected from my domain.</p> <div class="csharpcode"><pre class="alt"><span class="kwrd">for</span> (var k = 0; k < optionCount; k++)</pre><pre class="alteven">{</pre><pre class="alt"> var option = options[k];</pre><pre class="alteven"> var optionTotal = results[option.Id];</pre><pre class="alt"> var percent = (questionTotal > 0) ? ((optionTotal/(<span class="kwrd">float</span>)questionTotal) * 100): 0;</pre><pre class="alteven"> reportList.Add(<span class="kwrd">new</span></pre><pre class="alt"> {</pre><pre class="alteven"> Diagnostic = diagnostic.Name, </pre><pre class="alt"> Question = question.Text, </pre><pre class="alteven"> Option = option.Text, </pre><pre class="alt"> Count = optionTotal, </pre><pre class="alteven"> Percent = percent</pre><pre class="alt"> });</pre><pre class="alteven">}</pre></div><p>Here I am generating a report on the use of diagnostics (a type of survey). It shows how often each option of each question in each diagnostic has been selected by a user, both count and percent.</p><p>You can see that the new anonymous type instance is being added to a list called <strong>reportList</strong>. 
This list is strongly typed, as can be seen in this next bit of code, where I order the list using LINQ.</p><div class="csharpcode"><pre class="alt">reportList = reportList</pre><pre class="alteven">    .OrderBy(x => x.Diagnostic)</pre><pre class="alt">    .ThenBy (x => x.Question)</pre><pre class="alteven">    .ThenBy (x => x.Percent)</pre><pre class="alt">    .ToList();</pre></div><p>This is where the problem comes in: how is it possible to create a strongly typed (generic) list for an anonymous type? The answer is to use a generics trick, as the following code snippet shows.</p><div class="csharpcode"><pre class="alt"><span class="kwrd">public</span> <span class="kwrd">static</span> List<T> MakeList<T>(T example)</pre><pre class="alteven">{</pre><pre class="alt">    <span class="kwrd">return</span> <span class="kwrd">new</span> List<T>();</pre><pre class="alteven">}</pre></div><p>The <strong>MakeList</strong> method takes in a parameter of type <strong><T></strong> and returns a generic list of the same type. Since this method will accept any type, we can pass in an anonymous type instance with no problems. The next snippet shows this happening.</p><div class="csharpcode"><pre class="alt">var exampleReportItem = <span class="kwrd">new</span></pre><pre class="alteven">    {</pre><pre class="alt">        Diagnostic = <span class="kwrd">string</span>.Empty, </pre><pre class="alteven">        Question = <span class="kwrd">string</span>.Empty, </pre><pre class="alt">        Option = <span class="kwrd">string</span>.Empty, </pre><pre class="alteven">        Count = 0, </pre><pre class="alt">        Percent = 0f</pre><pre class="alteven">    };</pre><pre class="alt">var reportList = MakeList(exampleReportItem);</pre></div><p>So here is the context for all these snippets. 
The following code gathers my report data and stores it in a strongly typed list containing a transient anonymous type.</p><div class="csharpcode"><pre class="alt">var exampleReportItem = <span class="kwrd">new</span></pre><pre class="alteven"> {</pre><pre class="alt"> Diagnostic = <span class="kwrd">string</span>.Empty, </pre><pre class="alteven"> Question = <span class="kwrd">string</span>.Empty, </pre><pre class="alt"> Option = <span class="kwrd">string</span>.Empty, </pre><pre class="alteven"> Count = 0, </pre><pre class="alt"> Percent = 0f</pre><pre class="alteven"> };</pre><pre class="alt">var reportList = MakeList(exampleReportItem);</pre><pre class="alteven"><span class="kwrd">for</span> (var i = 0; i < count; i++)</pre><pre class="alt">{</pre><pre class="alteven"> var diagnostic = diagnostics[i];</pre><pre class="alt"> var questionCount = diagnostic.Questions.Count;</pre><pre class="alteven"> <span class="kwrd">for</span> (var j = 0; j < questionCount; j++)</pre><pre class="alt"> {</pre><pre class="alteven"> var question = diagnostic.Questions[j];</pre><pre class="alt"> var questionTotal = results[question.Id];</pre><pre class="alteven"> var options = question.Options;</pre><pre class="alt"> var optionCount = options.Count;</pre><pre class="alteven"> <span class="kwrd">for</span> (var k = 0; k < optionCount; k++)</pre><pre class="alt"> {</pre><pre class="alteven"> var option = options[k];</pre><pre class="alt"> var optionTotal = results[option.Id];</pre><pre class="alteven"> var percent = (questionTotal > 0) ? 
((optionTotal/(<span class="kwrd">float</span>)questionTotal) * 100): 0;</pre><pre class="alt">            reportList.Add(<span class="kwrd">new</span></pre><pre class="alteven">                {</pre><pre class="alt">                    Diagnostic = diagnostic.Name, </pre><pre class="alteven">                    Question = question.Text, </pre><pre class="alt">                    Option = option.Text, </pre><pre class="alteven">                    Count = optionTotal, </pre><pre class="alt">                    Percent = percent</pre><pre class="alteven">                });</pre><pre class="alt">        }</pre><pre class="alteven">    }</pre><pre class="alt">}</pre></div><p>Perhaps you are wondering how the type of the anonymous <strong>exampleReportItem</strong> is the same as the type of the anonymous object I add to the <strong>reportList</strong>?</p><p>This works because of the way the type identities are assigned for anonymous types. If two anonymous types share the same public signature, that is, if their property names and types are the same and appear in the same order (you can't have methods on anonymous types), then the compiler treats them <em>as the same type</em>.</p><p>This is how the <strong>MakeList</strong> method can do its job. The <strong>exampleReportItem</strong> instance sent to the <strong>MakeList</strong> function has exactly the same properties as the anonymous type added to the generic <strong>reportList</strong><em>.</em> Because they have the same signature, they are recognised as the same anonymous type and all is well.</p><p><br /></p>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com0tag:blogger.com,1999:blog-6927558.post-30731045691511979852009-02-18T20:50:00.031+00:002009-03-05T11:24:39.679+00:00The Data Driven ConspiracyIn a previous post called <a href="http://biofractal.blogspot.com/2009/02/irrational-love-of-relational.html">An Irrational Love of the Relational</a> I asked whether the relational database (RDB) is the best data store for an object-oriented application. 
Along the way I mentioned that RDBs tend to make coders adopt a <span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style=""><a href="http://en.wikipedia.org/wiki/Data-driven_design">data driven design</a></span><span class="Apple-style-span" style="font-style: normal;"> approach instead of the more object-oriented </span><span class="Apple-style-span" style=""><a href="http://en.wikipedia.org/wiki/Domain_driven_design">domain driven design</a></span><span class="Apple-style-span" style="font-style: normal;">. </span></span><div><br /></div><div><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-style: normal;">That got me thinking: "</span><span class="Apple-style-span" style="">Why has the RDB remained at the heart of our technology stack despite a steadily growing demand for a domain-centric design methodology?"</span></span></div><div><br /></div><div>I think the answer to that comes in two parts.</div><div><ol><li>Historical</li><li>Conspiratorial<br /></li></ol></div><div><span class="Apple-style-span" style="font-size:large;"><span class="Apple-style-span" style="font-weight: bold;">1. Historical</span></span></div><div><a href="http://en.wikipedia.org/wiki/Edgar_F._Codd">Edgar Codd</a>, an employee of IBM in the 1970's, was getting frustrated at the lack of a <span class="Apple-style-span" style="font-style: italic;">search feature</span> that would allow him to quickly retrieve information from his IBM mainframe hard-drive array. So he worked out a cool way of structuring the data on the drive that would allow <span class="Apple-style-span" style="font-style: italic;">queries</span> to be constructed and so allow for ad hoc data retrieval. He released a paper describing this work called <span class="Apple-style-span" style="font-style: italic;">A Relational Model of Data for Large Shared Data Banks </span><span class="Apple-style-span" style="">[1]. 
</span></div><div><br /></div><div><span class="Apple-style-span" style="">In this paper he described, for the first time, all the features of an RDB that we find so familiar today. Data normalisation, columns, rows, tables, foreign keys <span class="Apple-style-span" style="font-style: italic;">etc</span>. <a href="http://en.wikipedia.org/wiki/Codd's_12_rules">Codd's rules</a> laid out clearly what constituted a true relational database and allowed manufacturers to create their own RDB management systems. </span></div><div><br /></div><div>This tech was very cool and so much better than anything that had gone before. It allowed for data-integrity, which helped the coders, and it facilitated a whole new class of program that relied on queries that sliced and diced huge data-sets, revealing subtle and surprising inter-connections. </div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">2. Conspiratorial</span></span></div><div>By the 1980's most young coders were convinced that relational data and the RDB was <span class="Apple-style-span" style="font-style: italic;">the </span><span class="Apple-style-span" style="font-style: italic;">answer </span><span class="Apple-style-span" style=""><span class="Apple-style-span" style="font-style: italic;">to life, the universe and everything</span></span><span class="Apple-style-span" style="font-style: italic;">. <span class="Apple-style-span" style="font-style: normal;">N</span><span class="Apple-style-span" style="font-style: normal; ">ew companies grew up around the RDB; <span class="Apple-style-span" style="font-weight: bold;">Microsoft </span>got in on the act with SqlServer and began, slowly, to dominate the market. Time passed and eventually those young turks grew older, their beards grew greyer and, like the RDB software they depended on, they became bloated. 
Then, just when people were forgetting that data could be stored in any other way, the first cracks began to appear in the RDB monolith. </span></span></div><div><br /></div><div>It started when <a href="http://en.wikipedia.org/wiki/Object_orientation">object-orientation</a> finally broke free of the university labs and escaped into the wider coding world. Programs written as a hierarchical collection of objects (an <span class="Apple-style-span" style="font-style: italic;">object domain</span>) had been around since the 1960's, when a pair of Norwegian academics invented a language called <a href="http://en.wikipedia.org/wiki/Simula">Simula-67</a>. But this technology did not really take off until the 1990's with the widespread adoption of <a href="http://en.wikipedia.org/wiki/C%2B%2B">C++</a>.</div><div><br /></div><div>As soon as business coders started to regularly cast complex business systems into objects, they began to notice a fundamental problem. Hierarchical object domains of de-normalised state data do not look anything like the relational, normalised data used by RDBs.</div><div><br /></div><div>But the companies who provided the RDB monolith software and the generation of coders who had invested their youth evangelising RDB tech could not accept the <a href="http://en.wikipedia.org/wiki/Cognitive_dissonance">cognitive dissonance</a> this observation created. It became imperative to the profits of the RDB suppliers and the reputations of the RDB evangelists that some way to <span class="Apple-style-span" style="font-style: italic;">ignore the problem</span> was found. 
Thus the<span class="Apple-style-span" style="font-style: italic; "> data-driven design </span><span class="Apple-style-span" style="">methodology</span><span class="Apple-style-span" style="font-style: italic; "> </span>was born.</div><div><br /></div><div>Data-driven design demands that the<span class="Apple-style-span" style="font-style: italic;"> relational database </span>must be designed first. Only then can the database be <span class="Apple-style-span" style="font-style: italic;">translated </span>into the radically different structure of the object domain, and to perform this translation we must write not one but <span class="Apple-style-span" style="font-style: italic;">two </span>additional layers of logic.</div><div><ol><li><a href="http://en.wikipedia.org/wiki/Stored_procedures">Stored Procedures</a> (Sprocs)<br /></li><li>The <span class="Apple-style-span" style="font-style: italic; "><a href="http://en.wikipedia.org/wiki/Data_access_layer">Data Access Layer</a> (<span class="Apple-style-span" style="font-style: normal; ">DAL)</span></span><br /></li></ol></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">Stored Procedures</span></span></div><div>The original query language for Codd's model was called SEQUEL, which IBM reworked and cheekily named the 'new' language SQL.</div><div><br /></div><div>SQL is an excellent language for constructing data queries and hence mining data sets. SQL also has a range of management features - you can Add, Edit and Delete the structured data on the disk, and the RDB data integrity rules will help prevent the data getting messed up. 
These management features are nice but they are not the core purpose of SQL, which remains the ability to <span class="Apple-style-span" style="font-style: italic;">perform complex data retrieval by exploiting the relationship</span><span class="Apple-style-span" style="font-style: italic;">s</span> created in the structured data. </div><div><br /></div><div>Yet it is precisely these data management features (add, delete, etc.), along with some typically trivial queries, that are used 99% of the time by object-oriented coders in order to save and retrieve (serialize) their object state data. </div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">Data Access Layer</span></span></div><div>Communicating with sprocs from code is a complicated business and requires its own set of coding techniques, object libraries and tools. These all come together in the data access layer (DAL). The DAL is the code that <span class="Apple-style-span" style="font-style: italic;">marshals </span>object state to and from the Sprocs. </div><div><br /></div><div>This code is typically complex and fragile. As the application requirements evolve, or bugs are discovered and fixed, so the RDB tables, columns etc must be updated to reflect those design changes. This is the heart of <span class="Apple-style-span" style="font-style: italic;">data driven design. 
</span>It forces a cascade of changes to the sprocs and then changes to the DAL code before finally allowing the class design to be updated.</div><div><br /></div><div style="text-align: center;">Data-Driven Design = <span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-style: normal; "><span class="Apple-style-span" style="font-style: italic; ">Change</span> <span class="Apple-style-span" style="font-style: italic;">DB </span><span class="Apple-style-span" style="font-style: italic; ">-> <span class="Apple-style-span" style="font-style: normal; "><span class="Apple-style-span" style="font-style: italic; ">Sprocs -> <span class="Apple-style-span" style="font-style: normal; "><span class="Apple-style-span" style="font-style: italic; ">DAL -> <span class="Apple-style-span" style="">Class</span></span></span></span></span></span></span></span></div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:large;">The Enlightenment</span></span></div><div>At last things are changing. Leading the way is <span class="Apple-style-span" style="font-style: italic;">Domain Driven Design</span>. This is, at its heart, simply a statement of the obvious - that the best way to design an object hierarchy is to design the constituent objects. </div><div><br /></div><div style="text-align: center;">Domain Driven Design = <span class="Apple-style-span" style="font-style: italic; "><span class="Apple-style-span" style="font-style: normal; "><span class="Apple-style-span" style="font-style: italic; ">Change</span> </span><span class="Apple-style-span" style="">Class</span></span></div><div style="text-align: left;"><br /></div><div style="text-align: left;">The domain driven enlightenment has been born out of the fundamental realisation that the old ways of writing software just <span class="Apple-style-span" style="font-style: italic;">do not work</span>. 
Replacing them is <a href="http://en.wikipedia.org/wiki/Agile_software_development">Agile</a>, a coding methodology that demands that we <span class="Apple-style-span" style="font-style: italic;">refactor</span>, <span class="Apple-style-span" style="font-style: italic;">evolve </span>and <span class="Apple-style-span" style="font-style: italic;">simplify. <span class="Apple-style-span" style="font-style: normal; ">These Agile<span class="Apple-style-span" style="font-style: italic; "> </span>concepts are the very antithesis of data driven design, with its multi-layered, stultifying, <a href="http://en.wikipedia.org/wiki/Baroque">baroque</a> complexity. </span></span></div><div style="text-align: left;"><br /></div><div style="text-align: left;"><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-style: normal; ">Thus Agile <span class="Apple-style-span" style="font-style: italic;">demands </span>that we throw out the DAL, the Sprocs and the RDB because they are not, and never were, an appropriate <span class="Apple-style-span" style="font-style: italic;">minimal solution</span> to the problem of object state serialisation.</span></span></div><div style="text-align: left;"><br /></div><div style="text-align: center;"><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-size:large;">Object state serialisation </span></span><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-size:large;">does not</span></span></span><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-size:large;"> imply a relational database</span></span><br /></div><div style="text-align: left;"><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">References</span></div><div><ol><li>Codd, E.F. (1970). 
"A Relational Model of Data for Large Shared Data Banks" In: <i>Communications of the ACM</i> 13 (6): 377–387.<br /></li></ol></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com3tag:blogger.com,1999:blog-6927558.post-62023331667854314602009-02-13T16:20:00.066+00:002009-02-19T16:02:18.613+00:00An Irrational Love of the Relational<div style="text-align: center;"><span class="Apple-style-span" style="font-style: italic; font-weight: bold; "><br />Is a relational database the best data-store for serializing object-graphs?</span><br /></div><div><div><span class="Apple-style-span" style="font-style: italic; font-weight: bold;"><br /></span></div><div><span class="Apple-style-span" style="font-weight: bold;">Caveats</span></div><div><ol><li>This applies to coders who employ <a href="http://en.wikipedia.org/wiki/Agile_software_development">Agile</a>, <a href="http://en.wikipedia.org/wiki/Domain_driven_design">Domain Driven Design</a><br /></li><li>The application you are building is not a <a href="http://en.wikipedia.org/wiki/Data_mining">data mining</a> app<br /></li></ol><div>Imagine building an object-oriented application. You find that you need to save and subsequently retrieve an object's state (serialize and de-serialize your object graph). Is a relational database (RDB) the best data store for you?</div><div><br /></div><div>The answer seems pretty clear to me - <span class="Apple-style-span" style="font-style: italic;">Relational databases are just about the worst kind of data store for storing an hierarchical object-graph</span>. </div><div><br /></div><div>Of course an RDB can store object state. Mapping-tables, foreign keys and normalisation can all be used to <span class="Apple-style-span" style="font-style: italic;">project</span> your natural object hierarchy into tables, rows and columns in the way that a globe can be projected onto the flat page of an atlas. 
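To make that projection concrete, here is a small, invented example: the graph version of an order owns its lines directly, while the relational projection flattens that ownership into a second table and a foreign key, leaving mapping code to fold the rows back into objects:

```csharp
using System.Collections.Generic;

// The natural object graph: an Order owns its lines directly.
public class OrderLine
{
    public string Product;
    public int Quantity;
}

public class Order
{
    public int Id;
    public List<OrderLine> Lines = new List<OrderLine>();
}

// The relational projection spreads the same state across two tables:
//   Order(Id)   OrderLine(Id, OrderId, Product, Quantity)
// and the mapping code must fold the flat rows back into the graph.
public static class OrderMapper
{
    public static Order FromRows(int orderId, IEnumerable<object[]> lineRows)
    {
        var order = new Order { Id = orderId };
        foreach (var row in lineRows)   // each row: { Product, Quantity }
        {
            order.Lines.Add(new OrderLine
            {
                Product = (string)row[0],
                Quantity = (int)row[1]
            });
        }
        return order;
    }
}
```

Every class in the graph needs its own slice of this translation code, in both directions.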
</div><div><br /></div><div>But the map is not the terrain: the RDB projection is nothing less than a <span class="Apple-style-span" style="font-style: italic;">distortion</span> of the object graph. Translating between the object-graph and its distorted, relational representation requires work - and not just computer cycles, although it needs plenty of those, but also the hand-rolled code / bugs required to manage the transformation of data to and from the RDB object-store.</div><div><br /></div><div>Traditionally this code was encapsulated into a bespoke Data Access Layer (DAL). It makes me shudder to remember just how much of my life I have wasted writing and debugging DAL code. But all that wasted time is not the critical problem; the real kicker is that a DAL severely limits how <a href="http://en.wikipedia.org/wiki/Agile_software_development">Agile</a> you can be. </div><div><br /></div><div>Take a typical agile process, the quick refactoring of a class definition, say the addition of a new public property.</div><div><ul><li>Add new property to class<br /></li><li>Add equivalent mapping to DAL class<br /></li><li>Add equivalent parameter to SPROC<br /></li><li>Add equivalent column to RDB table<br /></li></ul></div><div>Notice how that list feels back to front? Surely life would be easier if I did things the other way around?</div><div><ul><li>Add new column to RDB table</li><li>Add equivalent parameter to SPROC<br /></li><li>Add equivalent mapping to DAL class<br /></li><li>Add equivalent property to class<br /></li></ul></div><div>This shows up another problem with using RDBs - it promotes <span class="Apple-style-span" style="font-style: italic;">data-driven design</span>. 
</div><div><ul>[<a href="http://en.wikipedia.org/wiki/Data-driven_design">Data Driven Design</a>] = Design your objects by designing <span class="Apple-style-span" style="font-style: italic;">tables</span><br /></ul><ul><span><span>[<a href="http://en.wikipedia.org/wiki/Domain_driven_design">Domain Driven Design</a>] = Design your objects by designing <span class="Apple-style-span" style="font-style: italic;">objects</span></span></span></ul></div>To me the <span class="Apple-style-span" style="">natural</span> way to design a domain is to play with the objects, but an RDB + DAL approach flips this natural flow on its head and makes you design backwards. It makes you design the data-store <span class="Apple-style-span" style="font-style: italic;">before </span>the domain, from tables -> objects.</div><div><br /><div>Data-driven design means that you do all the work up-front (table, sproc, DAL then finally object), which greatly increases the cost of experimentation. This <span class="Apple-style-span" style="font-style: italic;">ossifies the design process</span>. Data-driven design strongly inhibits a successful design evolving out of a series of cheap experiments. This is why agile coders tend to use domain-driven design.</div><div><br /></div><div>So why suffer all this RDB pain? Why not use a data store whose intrinsic architecture fits the structure of an object graph and does not require piles of buggy DAL code just to satisfy the basics of object serialization? DBAs often cite two main reasons:</div><div><ol><li>You can run reports that cut across the object graph<br /></li><li>You can keep your data <span class="Apple-style-span" style="font-style: italic;">application agnostic</span>, allowing future applications to use the data</li></ol></div><div><span class="Apple-style-span" style="font-weight: bold;">Reason 1</span> - Because I am not writing a data-mining app I know my report designs in advance. Therefore I don't need ad-hoc, dynamic reports. 
Since my reports are pre-defined they can be represented in my object-graph as serializable report objects; a report is then just a filtered collection of these objects. </div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">Reason 2</span> - I adhere to strict <a href="http://en.wikipedia.org/wiki/You_Ain't_Gonna_Need_It">YAGNI</a> (you aren't going to need it) principles; therefore <span class="Apple-style-span" style="font-style: italic;">I only write code to do the job</span>. No matter how tempting it is, I do not write code <span class="Apple-style-span" style="font-style: italic;">just in case</span> <span class="Apple-style-span" style="font-style: italic;">it might be needed in the future</span>. </div><div><br /></div><div>So what can the modern object-oriented coder do to make life a bit easier?</div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">Use an Object Database</span></div><div>You can use a database that has been specifically designed to store object state data. They are called <a href="http://en.wikipedia.org/wiki/ODBMS">object databases</a>. I have played around with the open source object database called <a href="http://www.db4o.com/">DB4Objects</a>, which I found to be very fast and easy to use (as easy as NHibernate), but at the time its development was in a state of rapid flux. If you have used an object database then please leave a comment.</div><div><br /></div><div><span class="Apple-style-span" style=""><span class="Apple-style-span" style="font-weight: bold;">Scrap the DAL</span></span></div><div>Object Relational Mappers (ORMs), e.g. <a href="http://en.wikipedia.org/wiki/NHibernate">NHibernate</a>, allow you to<span class="Apple-style-span" style="font-style: italic;"> scrap the DAL</span>. ORMs automate, as far as possible, the cruddy DAL code and get you closer to the agile ideal of fast and cheap refactoring. 
You can create new classes, add and remove interface elements, and the ORM will take care of adding new tables and columns. You need never write another object persistence save / update / delete SPROC again (almost). </div><div><br /></div><div>Whilst much better than writing DAL code, ORMs are not perfect and are not pain free. Every once in a while you have to go to the DB and mess around. This might seem trivial but you will be amazed how quickly all that DB experience evaporates. </div><div><br /></div><div>Also, to ORM-enable your objects you need to somehow provide the mapping information that links the objects to their equivalent tables. These days this can be handled by marking up your classes with fairly simple [attribute] metadata, but it is still a surprising amount of work to keep the attributes up to date.</div><div><br /></div><div>So ORMs are not the final answer because they are really just papering over the underlying problem: A<span class="Apple-style-span" style="font-style: italic;"> relational database is <span class="Apple-style-span" style="font-weight: bold;">not </span>the best data store for serializing object-graphs</span>.</div><div><br /></div></div></div>biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com1tag:blogger.com,1999:blog-6927558.post-61147658153263421712009-02-11T20:28:00.023+00:002009-02-19T16:02:36.075+00:00Code Coverage: How much is enough?How much code should you cover with unit tests? How much code coverage is enough?<br /><br />This was a question that came up in our regular technical team meeting down the pub the other day. I have been pretty evangelical about unit tests ever since the penny dropped about a year ago, but it has to be said that I am struggling to stick to the high moral standards set by the 100% code-coverage purists.<br /><br />You all know my grubby little excuses. 
Unless you live in some coding Utopia there is always time pressure, last-minute changes that need to be deployed now, flashes of code-god inspiration that demand input <span style="font-style:italic;">right now</span> because they are just too exciting to wait, etc. I admit it - my flesh is weak.<br /><br />But should I be so hard on myself? Is 100% really worth aiming for? At what point does the law of diminishing returns kick in? Nobody I have spoken to seems to have any evidence (as opposed to anecdotal opinion) one way or the other. It boils down to unsupported assertions like "<span class="Apple-style-span" style="font-style: italic;">you should at least aim for > 95% code coverage</span>".<br /><br />I wonder if there is a more efficient way to balance the time spent on unit tests against the value those tests return. Fighting entropy requires a lot of work, so perhaps we can reduce the work by breaking the problem into smaller pieces and creating a <span class="Apple-style-span" style="font-style: italic;">hierarchy of code coverage</span>.<br /><ul><li><span class="Apple-style-span" style="font-weight: bold;">Public Interfaces</span> <br />These should have unit tests created for each possible interaction with the interface members. There should be an iron rule that public interfaces have 100% code coverage. <br /><br /></li><li><span class="Apple-style-span" style="font-weight: bold; ">Black Box Code<br /><span class="Apple-style-span" style="font-weight: normal; ">The plumbing code that supports the public interface functionality will be covered by tests as required by TDD (test driven development), but new tests should only be written for new bug fixes and any obvious TDD development. 
</span></span></li></ul><br />In a perfect world I accept that 100% is the ideal, but in a world where I need to make money for my company, <span class="Apple-style-span" style="font-style: italic;">100% code coverage 100% of the time</span> is surely too much.<br /><br />100% coverage for selected, critical code and significantly less elsewhere might just be the way to go.biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com0tag:blogger.com,1999:blog-6927558.post-31190644892117075352009-02-10T21:42:00.002+00:002011-09-06T10:42:48.094+01:00Agile Technical SpecificationsA technical specification document attempts to bridge the gap between a business-driven functional specification and the raw code pouring from a developer's mind.<br />
<br />
Yet technical specifications now belong to a bygone, pre-Agile age. Back then business people and technical managers collaborated to produce a functional specification intended to describe all the features of the final system. The technical specification was then derived from this functional specification. It was a structured document that described the system's architecture and intended features in unambiguous terms.<br />
<br />
It is now understood that this top-down process is unrealistic. Agile is, in part, a response to the divergence between pragmatic coding and 'command-economy' management. Agile accepts that nobody can really specify a software system before the first versions of the code are written. Nowadays we realise that code must evolve alongside the specification with each informing the other and the whole informing the client in a system of mutual feedback loops. The rigid, top-down information flows of yesterday have given way to a much more flexible and dynamic approach to system specification, planning and construction.<br />
<br />
But you don't know what you have until you lose it. Now that technical specifications have gone the way of the Dodo, I can't help thinking that they were not entirely evil after all. If we imagine a utopian technical specification that really did describe a complete system with no mistakes, then this would surely be a very useful document.<br />
<br />
Is there any way we can get to an Agile version of the technical specification? Can we achieve a technical specification that delivers usefully specific technical information whilst still being agile, flexible and evolutionary?<br />
<br />
I think so. I think that if I was given a set of automatically generated, pre-defined class interfaces and a corresponding set of unit tests then together these would constitute an Agile Technical Specification.<br />
<br />
<span style="font-size: small; font-weight: bold;">[Public Interfaces] + [Unit Tests] = [Agile Technical Specification]</span><br />
<br />
Given a set of interfaces, my job as a coder would be to create an equivalent set of domain classes that implemented those interface contracts.<br />
<br />
Given a set of unit tests my job would be to use all my creative powers to flesh out my domain classes until those unit tests passed.<br />
<br />
If all the unit tests passed then I would have satisfied the functional requirements and this functionality would, perforce, be presented via a public API specified by the interfaces. In other words I would have translated my Agile Technical Specification into working code that both defined and self-certified its own features.<br />
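As a sketch of what such a specification pair might look like (all names invented, Java for brevity): the generated interface fixes the public API, the generated test fixes the behaviour, and my only job is to write an implementation that makes the test pass.

```java
// Hypothetical "specification pair": a generated interface plus a generated
// test. Any implementation that satisfies both has met the specification.
class AgileSpec {
    // Generated from the user-story setup: the public contract.
    interface IOrderCalculator {
        double totalWithDiscount(double subtotal, double discountRate);
    }

    // The coder's contribution: flesh out the contract until the tests pass.
    static class OrderCalculator implements IOrderCalculator {
        public double totalWithDiscount(double subtotal, double discountRate) {
            return subtotal * (1.0 - discountRate);
        }
    }

    // Generated from the user-story constraints: the executable specification.
    static boolean specPasses() {
        IOrderCalculator calc = new OrderCalculator();
        return calc.totalWithDiscount(100.0, 0.25) == 75.0
            && calc.totalWithDiscount(100.0, 0.0) == 100.0;
    }
}
```

Note that the implementation class is the only part a human writes; the interface and the test arrive as inputs, not outputs, of the coding process.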
<br />
So how do we 'generate' the interfaces and unit tests that comprise an Agile Technical Specification? Well, we certainly don't want to be writing these by hand. That would just put us back in the same old boat, where time constraints cause the code to relentlessly drift away from the specification. Instead these artefacts need to be automatically generated if they are to be of any use.<br />
<br />
If they are to be auto-generated then what specifies and drives the auto-generation? The answer is an Agile Functional Specification. Technical specifications are always derived from functional specifications; the difference is that now this process will be automatic.<br />
<br />
Fortunately Agile Functional Specifications already exist. They are the sets of user stories that evolve in conjunction with the client or their business representatives. This implies that as the user stories evolve, so too will the Technical Specification, as an automatic derivative.<br />
<br />
To achieve this the user stories must be written in a context-aware structured syntax, in other words a domain specific language (DSL). Then it will be possible to consume the user stories and from them auto-generate both the interfaces and the unit tests required to create an Agile Technical Specification. The interfaces are derived from the user story setups and the unit tests from the user story 'Where' clauses (constraints).<br />
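As a toy illustration of that derivation step (the story syntax and the generation rules here are entirely hypothetical, and a real generator would need a proper DSL parser), a structured user story can be mechanically turned into an interface member:

```java
// Toy illustration only: invented story syntax, invented derivation rules.
// A context-aware, structured user story yields an interface mechanically.
class StoryCodegen {
    // e.g. "Given a Basket, When AddItem, Then ItemCount increases"
    static String interfaceMemberFor(String story) {
        String[] parts = story.split(",");
        String subject = parts[0].replace("Given a", "").trim();  // "Basket"
        String action = parts[1].replace("When", "").trim();      // "AddItem"
        return "interface I" + subject + " { void " + action + "(); }";
    }
}
```

The 'Then' clause, not used here, is what would drive the generated unit test: it states the constraint the implementation must satisfy.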
<br />
<span style="font-weight: bold;">References</span><br />
<ul>
<li><a href="http://www.agilemodeling.com/essays/agileDocumentation.htm">Agile Documentation</a></li>
<li><a href="http://www.agilemodeling.com/essays/tagri.htm">TAGRI </a>(they aren't going to read it)</li>
</ul>
biofractalhttp://www.blogger.com/profile/11229842750489025189noreply@blogger.com3