Introduction to Chaos, Bounds, and Randomness
Have you ever noticed that events seem to occur in bunches? This is more than your imagination. A few decades ago, this phenomenon was analyzed by Benoit Mandelbrot, an engineer and mathematician who worked for International Business Machines (IBM). Mandelbrot noticed that similar patterns are often found in apparently unrelated phenomena, such as the fluctuations of cotton prices and the distribution of personal incomes. His work gave birth to the science of chaos theory.
Was Andrew "Due"?
In the early summer of 1992, south Florida hadn't had a severe hurricane since Betsy in 1965. The area around Miami gets a minimal hurricane once every 7 or 8 years on the average, and an extreme storm once or twice a century. Was Miami ''due'' for a hurricane in the early 1990s? Was it ''about time'' for a big blow? Some people said so. By now you should know enough about probability to realize that 1992 was no more or less special, in that respect, than any other year. In fact, as the hurricane season began in June of that year, the experts predicted a season of below-normal activity.
The so-called ''law of averages'' (which is the basis for a great deal of misinformation and deception) seemed to get its justice on August 24, 1992. Hurricane Andrew tore across the southern suburbs of Miami and the Everglades like a cosmic weed-whacker, and became the costliest hurricane ever to hit the United States up to that date. Did the severity of Andrew have anything to do with the lack of hurricanes during the previous two and a half decades? No. Did Andrew's passage make a similar event in 1993 or 1994 less likely than it would have been if Andrew had not hit south Florida? No. There could have been another storm like Andrew in 1993, and two more in 1994. Theoretically, there could have been a half dozen more like it later in 1992!
Have you ever heard about a tornado hitting some town, followed three days later by another one in the same region, and four days later by another, and a week later by still another? Have you ever flipped a coin for a few minutes and had it come up ''heads'' 18 times in a row, even though you'd normally have to flip it for days to expect such a thing to happen? Have you witnessed some vivid example of ''event-bunching,'' and wondered if anyone will ever come up with a mathematical theorem that tells us why this sort of thing seems to happen so often?
Slumps and Spurts
Athletes such as competitive swimmers and runners know that improvement characteristically comes in spurts, not smoothly with the passage of time. An example is shown in Fig. 7-7 as a graph of the date (by months during a hypothetical year) versus time (in seconds) for a hypothetical athlete's l00-meter (100-m) freestyle swim. The horizontal scale shows the month, and the vertical scale shows the swimmer's fastest time in that month.

Fig. 7-7. Monthly best times (in seconds) for a swimmer whose specialty is the 100-m freestyle, plotted by month for a hypothetical year.
Note that the swimmer's performance does not improve for a while, and then suddenly it does. In this example, almost all of the improvement occurs during the summer training season. That's not surprising, but another swimmer might exhibit performance that worsens during the same training season. Does this irregularity mean that all the training done during times of flat performance is time wasted? The coach will say no! Why does improvement take place in sudden bursts, and not gradually with time? Sports experts will tell you they don't know. Similar effects are observed in the growth of plants and children, in the performance of corporate sales departments, and in the frequency with which people get sick. This is ''just the way things are.''
Correlation, Coincidence, or Chaos?
Sometime in the middle of the 20th century, a researcher noticed a strong correlation between the sales of television sets and the incidence of heart attacks in Great Britain. The two curves followed remarkably similar contours. In fact the shapes of the graphs were, peak-for-peak and valley-for-valley, almost identical.
It is tempting to draw hasty conclusions from a correlation such as this. It seems reasonable to suppose that as people bought more television sets, they spent more time sitting and staring at the screens; this caused them to get less exercise; the people's physical condition therefore deteriorated; this rendered them more likely to have heart attacks. But even this argument, if valid, couldn't explain the uncanny exactness with which the two curves followed each other, year after year. There would have been a lag effect if television-watching really did cause poor health, but there was none.
Do television sets emit electromagnetic fields that cause immediate susceptibility to a heart attack? Is the programming so terrible that it causes immediate physical harm to viewers? Both of these notions seem ''far-out.'' Were the curves obtained by the British researcher coincident for some unsuspected reason? Could it be that people who had heart attacks were told by their doctors to avoid physical exertion while recovering, and this caused them to buy television sets to help pass the time? Or was the whole thing a coincidence? Was there no true correlation between television sales and heart attacks, a fact that would have become apparent if the experiment had continued for decades longer or had involved more people?
Now consider this if you dare: Could the correlation between television sales and heart attacks have taken place as a result of some unfathomable cosmic consonance, even in the absence of a cause-and-effect relationship?
Do scientists sometimes search for nonexistent cause-and-effect explanations, getting more and more puzzled and frustrated as the statistical data keeps pouring in, demonstrating the existence of a correlation but giving no clue as to what is responsible for it? Applied to economic and social theory, this sort of correlation-without-causation phenomenon can lead to some scary propositions. Is another world war, economic disaster, or disease pandemic inevitable because that's ''just the way things are''? Chaos theory suggests that the answer to some of these questions is yes!
Scale-Recurrent Patterns
Benoit Mandelbrot noticed that patterns tend to recur over various time scales. Large-scale and long-range changes take place in patterns similar to those of small-scale and short-term changes. Events occur in bunches; the bunches themselves take place in similar bunches following similar patterns. This effect exists both in the increasing scale and in the decreasing scale.
Have you noticed that high, cirrostratus clouds in the sky resemble the clouds in a room where someone has recently lit up a cigar? Or that these clouds look eerily like the interstellar gas-and-dust clouds that make up diffuse nebulae in space? Patterns in nature often fit inside each other as if they were nested geometric shapes, as if the repetition of patterns over scale takes place because of some principle ingrained in nature itself. This is evident when you look at the so-called Mandelbrot set (Fig. 7-8) using any of the numerous zooming programs available on the Internet. This set arises from a simple mathematical formula, yet it is infinitely complicated. No matter how much it is magnified – that is, however closely we zoom in on it – new patterns appear. There is no end to it! Yet the patterns show similarity at all scales.

The images in Fig. 7-8 were generated with a freeware program called Fractint. This program was created by a group of experts called the Stone Soup Team. The program itself is copyrighted, but images created by any user become the property of that user.
The Maximum Unswimmable Time
If our hypothetical swimmer keeps training, how fast will he eventually swim the 100-m freestyle? We already know that he can do it in a little more than 48 seconds. What about 47 seconds? Or 46 seconds? Or 45 seconds? There are obvious lower bounds to the time in which the 100-m freestyle can be swum by a human. It's a good bet that no one will ever do it in 10 seconds. How about 11 seconds? Or 12? Or 13? How about 20 seconds? Or 25? Or 30? If we start at some ridiculous figure such as 10 seconds and keep increasing the number gradually, we will at some point reach a figure – let's suppose for the sake of argument that it is 41 seconds – representing the largest whole number of seconds too fast for anyone to swim the 100-m freestyle.
Once we have two whole numbers, one representing a swimmable time (say 42 seconds) and the next smaller one representing an unswimmable time (say 41 seconds), we can refine the process down to the tenth of a second, and then to the hundredth, and so on indefinitely. There is some time, exact to however small a fraction of a second we care to measure it, that represents the maximum unswimmable time (MUST) that a human being can attain for the 100-m freestyle swim. Figure 7-10 shows an educated estimate (translation: wild guess) for this situation.

No one knows the exact MUST for the 100-m freestyle, and a good argument can be made for the assertion that we cannot precisely determine it. But such a time nevertheless exists. How do we know that there is a MUST for the 100-m freestyle, or for any other event in any other timed sport? A well-known theorem of mathematics, called the theorem of the greatest lower bound, makes it plain: ''If there exists a lower bound for a set, then there exists a greatest lower bound (GLB) for that set.'' A more technical term for GLB is infimum. In this case, the set in question is the set of ''swimmable times'' for the 100-m freestyle. The lower bounds are the ''unswimmable times.''
What's the probability that a human being will come to within a given number of seconds of the MUST for the 100-m freestyle in, say, the next 10 years, or 20 years, or 50 years? Sports writers may speculate on it; physicians may come up with ideas; swimmers and coaches doubtless have notions too. But anyone who makes a claim in this respect is only guessing. We can't say ''The probability is 50% that someone will swim the 100-m freestyle in so-and-so seconds by the year such-and-such.'' Remember the old probability fallacy from Chapter 3! For any theoretically attainable time, say 43.50 seconds, one of two things will happen: someone will swim the 100-m freestyle that fast someday, or else no one will.
The Butterfly Effect
The tendency for small events to have dramatic long-term and large-scale consequences is called the butterfly effect. It gets its name from a hypothetical question that goes something like this: Can a butterfly taking off in China affect the development, intensity, and course of a hurricane 6 months later in Florida? At first, such a question seems ridiculous. But suppose the butterfly creates a tiny air disturbance that produces a slightly larger one, and so on, and so on, and so on. According to butterfly-effect believers, the insect's momentary behavior could be the trigger that ultimately makes the difference between a tropical wave and a killer cyclone.
We can never know all the consequences of any particular event. History happens once and only once. We can't make repeated trips back in time and let fate unravel itself multiple times, after tweaking this or that little detail. But events can conspire, or have causative effects over time and space, in such a manner as to magnify the significance of tiny events in some circumstances. There are computer models to show it.
Suppose you go out biking in the rain and subsequently catch a cold. The cold develops into pneumonia, and you barely survive. Might things have turned out differently if the temperature had been a little warmer, or if it had rained a little less, or if you had stayed out for a little less time? There is no practical way to tell which of these tiny factors are critical and which are not. But computer models can be set up, and programs run, that in effect ''replay history'' with various parameters adjusted. In some cases, certain variables have threshold points where a tiny change will dramatically affect the distant future.
Scale Parallels
In models of chaos, patterns are repeated in large and small sizes for an astonishing variety of phenomena. A good example is the comparison of a spiral galaxy with a hurricane. The galaxy's stars are to the hurricane's water droplets as the galaxy's spiral arms are to the hurricane's rainbands. The eye of the hurricane is calm and has low pressure; everything rushes in towards it. The water droplets, carried by winds, spiral inward more and more rapidly as they approach the edge of the eye. In a spiral galaxy, the stars move faster and faster as they fall inward toward the center. A satellite photograph of a hurricane, compared with a photograph of a spiral galaxy viewed face-on, shows similarities in the appearance of these systems.
Air pressure and gravitation can both, operating over time and space on a large scale, produce the same kind of spiral. Similar spirals can be seen in the Mandelbrot set and in other similar mathematically derived patterns. The Spiral of Archimedes (a standard spiral easily definable in analytic geometry) occurs often in nature, and in widely differing scenarios. It's tempting to believe that these structural parallels are more than coincidences, that there is a cause-and-effect relationship. But what cause-and-effect factor can make a spiral galaxy in outer space look and revolve so much like a hurricane on the surface of the earth?
The Malthusian Model
Chaos theory has been applied in grim fashion to describe the characteristics of the earth's population growth. Suppose we want to find a function that can describe world population versus time. The simplest model allows for an exponential increase in population, but this so-called Malthusian model (named after its inventor, Thomas Malthus) does not incorporate factors such as disease pandemics, world wars, or the collision of an asteroid with the planet.
The Malthusian model is based on the idea that the world's human population increases geometrically in the way bacteria multiply, while the world's available supply of food and other resources increase arithmetically. It is easy to see that a pure Malthusian population increase can only go on for a certain length of time. When a certain critical point is reached, the population will no longer increase, because the earth will get too crowded and there won't be enough resources to keep people alive. What will happen then? Will the population level off smoothly? Will it decline suddenly and then increase again? Will it decline gradually and then stay low? The outcome depends on the values we assign to certain parameters in the function we ultimately find that describes population versus time.
A Bumpy Ride
The limiting process for any population-versus-time function depends on the extent of the disparity between population growth and resource growth. If we consider the earth's resources to be finite, then the shape of the population-versus-time curve depends on how fast people reproduce until a catastrophe occurs. As the reproduction rate goes up – as the ''function is driven harder'' – the time period until the first crisis decreases, and the ensuing fluctuations become more and more wild. In the worst cases, the predictions become dire indeed.
The Malthusian equation for population increase is:
where n is a whole number starting with n = 0, and r is a factor that represents the rate of population increase. (This is not the same r factor that represents correlation, defined earlier in this chapter.) Statisticians, social scientists, biologists, mathematicians, and even some politicians have run this formula through computers for various values of r, in an attempt to predict what would happen to the world's population as a function of time on the basis of various degrees of ''population growth pressure.'' It turns out that a leveling-off condition occurs when the value of r is less than about 2.5. The situation becomes more complicated and grotesque with higher values of r. As the value of the r factor increases, the function is ''driven harder,'' and the population increases with greater rapidity – until a certain point in time. Then chaos breaks loose.
According to computer models, when the r factor is low, the world population increases, reaches a peak, and then falls back. Then the population increases again, reaches another peak, and undergoes another decline. This takes place over and over but with gradually diminishing wildness. Thus, a damped oscillation occurs in the population function as it settles to a steady state (Fig. 7-11A).

In real life, the r factor can be kept low by strict population control and public education. Conversely, the r factor could become higher if all efforts at population control were abandoned. Computers tell us with unblinking screens what they ''think'' will happen then. If the value of r is large enough, the ultimate world population does not settle down, but oscillates indefinitely between limiting values. The amplitude and frequency of the oscillation depends on how large the r factor is allowed to get (Fig. 7-11B). At a certain critical value for the r factor, even this vestige of orderliness is lost, and the population-versus-time function fluctuates crazily, never settling into any apparent oscillation frequency, although there are apparent maximum and minimum limits to the peaks and valleys (Fig. 7-11C).
A graph in which the world's ultimate human population is plotted on the vertical (dependent-variable) axis and the r factor is plotted on the horizontal (independent-variable) axis produces a characteristic pattern something like the one shown in Fig. 7-12. The function breaks into oscillation when the r factor reaches a certain value. At first this oscillation has defined frequency and amplitude. But as r continues to increase, a point is reached where the oscillation turns into noise. As an analogy, think about what happens when the audio gain of a public-address system is increased until feedback from the speakers finds its way to the microphone, and the speakers begin to howl. If the audio gain is turned up higher, the oscillations get louder. If the system is driven harder still, the oscillations increase in fury until, in the absence of all restraint, the system roars like thunder.

Does the final population figure in the right-hand part of Fig. 7-12 truly represent unpredictable variation between extremes? If the computer models are to be believed, it does. By all indications, the gray area in the right-hand part of Fig. 7-12 represents a sort of randomness.
What is Randomness?
In statistical analysis, there is often a need to obtain sequences of values that occur at random. What constitutes randomness? Here's one definition that can be applied to single-digit numbers:
- A sequence of digits from the set {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} can be considered random if and only if, given any digit in the sequence, there exists no way to predict the next one.
At first thought, the task of generating a sequence of random numbers in this way seems easy. Suppose we chatter away, carelessly uttering digits from 0 to 9. Everyone has a leaning or preference for certain digits or sequences of digits, such as 5 or 58 or 289 or 8827. If a sequence of digits is truly random, then over long periods a given digit x will occur exactly 10% of the time, a given sequence xy will occur exactly 1% of the time, a given sequence xyz will occur exactly 0.1% of the time, and a given sequence wxyz will occur exactly 0.01% of the time. These percentages, over time, should hold for all possible sequences of digits of the given sizes, and similar rules should hold for sequences of any length. But if you speak or write down or keypunch digits for a few days and record the result, it's a good bet that this will not be the case. (Your imagination may seem wild to you, but there is some order to it no matter what.)
Here's another definition of randomness. This definition is based on the idea that all artificial processes contain inherent orderliness:
- In order for a sequence of digits to be random, there must exist no algorithm capable of generating the next digit in a sequence, on the basis of the digits already generated in that sequence.
According to this definition, if we can show that any digit in a sequence is a function of those before it, the sequence is not random. This rules out many sequences that seem random to the casual observer. For example, we can generate the value of the square root of 2 (or 21/2) with an algorithm called extraction of the square root. This algorithm can be applied to any whole number that is not a perfect square. If we have the patience, and if we know the first n digits of a square root, we can find the (n + 1)st digit by means of this process. It works every time, and the result is the same every time. The sequence of digits in the decimal expansion of the square root of 2, as well as the decimal expansions of π, e, or any other irrational number, is the same every time a computer grinds it out. The decimal expansions of irrational numbers are therefore not random-digit sequences.
If the digits in any given irrational number fail to occur in a truly random sequence, where can we find digits that do occur randomly? Is there any such thing? If a random sequence of digits cannot be generated by an algorithm, does this rule out any thought process that allows us to identify the digits? Are we looking for something so elusive that, when we think we've found it, the very fact that we have gone through a thought process to find it proves that we have not? If that is true, how is the statistician to get hold of a random sequence that can actually be used?
In the interest of practicality, statisticians often settle for pseudorandom digits or numbers. The prefix pseudo- in this context means ''pretend'' or ''for all practical purposes.'' Computer algorithms exist that can be used to generate strings of digits or numbers that can be considered random in most real-world applications.
The Net to the Rescue
You can search the Internet and find sites with information about pseudorandom and random numbers. There's plenty of good reading on the Web (as well as plenty of nonsense), and even some downloadable programs that can turn a home computer into a generator of pseudorandom digits. For a good start, go to the Google search engine at www.google.com, bring up the page to conduct an advanced search, and then enter the phrase ''random number generator.'' Be careful what you download! Make sure your anti-virus program is effective and up to date. If you are uneasy about downloading stuff from the Web, then don't do it.
A safer way to get random digits is a site maintained by the well-known and respected mathematician Dr. Mads Haahr of Trinity College in Dublin, Ireland. It can be brought up by pointing your Web browser to www.random.org. The author describes the difference between pseudorandom and truly random numbers. He also provides plenty of interesting reading on the subject, and links to sites for further research.
Dr. Haahr's Web site makes use of electromagnetic noise to obtain real-time random-number and pseudorandom-number sequences. For this scheme to work, there must exist no sources of orderly noise near enough to be picked up by the receiver. Orderly noise sources include internal combustion engines and certain types of electrical appliances such as old light dimmers. The hissing and crackling that you hear in a radio receiver when it is tuned to a vacant channel is mostly electromagnetic noise from the earth's atmosphere and from outer space. Some electrical noise also comes from the internal circuitry of the radio.
Chaos, Bounds, and Randomness Practice Problems
Practice 1
Does the tendency of athletic performance to occur in ''spurts'' mean that a gradual improvement can never take place in any real-life situation? For example, is it impossible that the curve for the swimmer's times (Fig. 7-7) could look like either the solid or dashed lines in Fig. 7-9 instead?

Fig. 7-7. Monthly best times (in seconds) for a swimmer whose specialty is the 100-m freestyle, plotted by month for a hypothetical year. Illustration for Practice 1 and 2.

Fig. 7-9. Illustration for Practice 1 and 2.
Solution 1
Gradual, smooth improvement in athletic performance is possible. Either of the graphs (the straight, dashed line or the solid, smooth curve) in Fig. 7-9 could represent a real-life situation. But orderly states of affairs such as this are less common than more chaotic types of variations such as that shown in Fig. 7-7.
Practice 2
What is the fastest possible time for the 100-m freestyle swimming event, as implied by the graphs in Fig. 7-7 or Fig. 7-9?

Fig. 7-7. Monthly best times (in seconds) for a swimmer whose specialty is the 100-m freestyle, plotted by month for a hypothetical year. Illustration for Practice 1 and 2.

Fig. 7-9. Illustration for Practice 1 and 2.
Solution 2
Neither of these graphs logically implies that there is a specific time that represents the fastest possible 100-m swim. It can be mathematically proven that there exists a maximum unswimmable time for this event. But determining the actual time quantitatively, and then proving that the determination is valid, is another problem.
Practice 3
Can the MUST scenario, in which there is a greatest lower bound, apply in a reverse sense? Can there be, for example, a minimum unattainable temperature (MUTT) for the planet earth?
Solution 3
Yes. The highest recorded temperature on earth, as of this writing, is approximately 58°C (136°F). Given current climatic conditions, it's easy to imagine an unattainable temperature, for example, 500°C. We then start working our way down from this figure. Clearly, 400°C is unattainable, as is 300°C, and also 200°C (assuming runaway global warming doesn't take place, in which our planet ends up with an atmosphere like that of Venus). What about 80°C? What about 75°C? A theorem of mathematics, called the theorem of the least upper bound, makes it plain: ''If there exists an upper bound for a set, then there exists a least upper bound (LUB) for that set.'' This means there is a MUTT for our planet, given current climatic conditions. Figuring out an exact number, in degrees, for the MUTT is another problem.
Practice 4
Name two poor sources of pseudorandom digits, and two good sources, that can be obtained by practical means.
Solution 4
The digits of a known irrational number such as the square root of 20 represent an example of a poor source of pseudorandom digits. These digits are predestined (that is, they already exist and are the same every time they are generated). In addition, they can be produced by a simple machine algorithm.
The repeated spinning of a wheel or rotatable pointer, calibrated in digits from 0 to 9 around the circumference, is another example of a poor way to get a pseudorandom sequence. The wheel can never be spun forcefully enough to randomize the final result. In addition, friction is not likely to be uniform at all points in the wheel bearing's rotation, causing it to favor certain digits over others.
A good source of pseudorandom digits can be obtained by building a special die that has the shape of a regular dodecahedron, or geometric solid with 12 identical faces. The faces can be numbered from 0 to 9, with two faces left blank. This die can be repeatedly thrown and the straight-upward-facing side noted. (If that side is blank, the toss is ignored.) The results over time can be tallied as pseudorandom digits. The die must have uniform density throughout, to ensure that it doesn't favor some digits over others.
Another good source of pseudorandom digits is a machine similar to those used in some states' daily televised lottery drawings. A set of 10 lightweight balls, numbered 0 through 9, are blown around by powerful air jets inside a large jar. To choose a digit, the lid of the jar is opened just long enough to let one of the balls fly out. After the digit on the ''snagged'' ball is noted, the ball is put back into the jar, and the 10 balls are allowed to fly around for a minute or two before the next digit is ''snagged.''
Practice problems for these concepts can be found at:
Correlation, Causation, Order, and Chaos Practice Test
Add your own comment