World of Seven - Viz: SLS Engineering Suite

To address the SLS Design and Tuning Problem, one needs to analyze the SLS behavior (which is closely related to its performance). However, analyzing SLS behavior is difficult. For example, if we look at this RunLog file (a file that records the search information every iteration) alone, we probably will not get many information from it. Thus people devise SLS analysis techniques that can be classified as either statistical techniques or information visualization techniques. With such analysis, people hope to gain insights on how to further improve the SLS algorithms.

We propose FLST visualization as an extension cum combination of the existing Fitness Distance Correlation (FDC) and Run Time Distribution (RTD) analysis. The basic ingredients to form this visualization are the fitness landscape components (search space, distance function, and objective function) of the COP instance and the solutions visited by an individual SLS runs. Obviously, visualizing exponentially large fitness landscape is not trivial. We explain the FLST visualization ideas using the series of illustrations below:

Explanation	Illustration
We visualize fitness landscape of a COP instance like this mountain range picture: *search space (very big), is visualized as a collection of a lot of points (solutions) *distance function spatially separates one mountain (solution) and the other mountains *objective function determines the height of each mountain (solution) *global optima is the highest mountain (solution with the best objective value) *local optima is high mountains but not the highest Notes: This fitness landscape formulation was proposed by P. Merz in his PhD thesis. This definition is slightly different with the one in H.H. Hoos and T. Stuetzle's book: SLS: Foundations and Applications where the fitness landscape is defined as <search space, neighborhood function, and objective function>. We do not use neighborhood function as it will cause our FLST visualization to be unstable (changing every SLS iteration).
We visualize the search trajectory of an individual SLS run as a movement of that SLS on the fitness landscape of the COP instance being attacked. The movement can be due to a local move within a local neighborhood or a non local move due to strong diversification mechanism. The objective is to find the global optima: imagine that you are one tiny human in this mountain range and can only see the surroundings within radius 1 km (local view) and you need to navigate locally to find the highest mountain. However, without any 'reference point', it is quite hard to describe/explain what is going on here (see the picture on the right and try your best to explain the movement)... Notes: This search trajectory formulation is defined in SLS: Foundations and Applications.
Now suppose that we record the solution denoted by the yellow rectangle (see the picture on the right). We can now describe the same search trajectory above as follows: "The search trajectory once hits the yellow rectangle solution, then it moves somewhere else, then after certain number iterations, it hits the yellow rectangle solution again. Is this a solution cycling phenomenon? Is the SLS trapped?". See that now we can say more things with the existence of a reference point.
To build the FLST visualization, we gather a constant amount of high quality (usually local optima), diverse, frequently visited solutions in the fitness landscape using the SLS algorithms themselves! We know that we cannot expect to record all points (exponential space!) therefore we should expect to miss some good points (look at the small blue arrows pointing at the other two mountain peaks at the background). Nevertheless, if the anchor points collected are reasonably good, diverse, and -that is- the important ones, we can say that we have a reasonable approximation of the fitness landscape. In order to collect these points, we run the SLS with different configurations, with longer run times, and let it loose. It will then sample various points in the search space. We then filter the interesting points which we called: the Anchor Points, abbreviated as APs.
We can also add quality information by labeling these anchor points with color+shape: black dot-very bad, yellow rectangle-bad, green triangle-medium, blue circle-good. We use both color and shape as color alone is hard to be distinguished in black and white scientific papers! Remember that not all scientific publications out there are in color at this point of time.
So now, we have this Fitness Landscape visualization based on these 4 selected APs. With these APs, we can now describe the search trajectory: In the picture on the right, the pink search trajectory encounters solution cycling in yellow/black (poor) APs. It fails to reach the better green/blue (better) APs.
And for this picture on the right, the blue search trajectory performs a diversification strategy after hitting an AP (local optima). From this visualization, we see that it manages to reach the better green/blue APs and we can say that it performs better than the pink search trajectory above. This is our proposed FLST visualization. If you have any comments, please don't hesitate to email the main author: stevenhalim at gmail dot com

Basic Ideas