Generating Techniques¶

Terrain generation is hardly uncharted territory. In fact, this project is meant to be less about breaking new ground than it is integrating and implementing techniques to make a "good game" or "good simulation," ideally one that's easy enough to use that it can be repurposed and modified for other games, other simulations, and other environments.

Let's begin by going over some of the popular techniques.

Noise¶

"Noise," in the technical sense we're using here, means a continuous function in one or more dimensions, that exhibit some random or pseudo-random properties.

For one-dimensional noise, think of it as a line that "wanders" between two values (typically either -1 and 1 or else 0 and 1) in an unpredictable fashion. "Continuous" here means that if you were drawing the line, it could be done without lifting your pencil from the paper.

Of course, computers aren't great at continuous things, and this kind of continuity is broken if you're not thinking of the function as an infinitely-divisible line over real numbers. In real implementations, this function is "sampled" at discrete, usually fixed intervals. So long as the intervals are small compared the to average distances over which the function changes value ("period" or the "inverse frequency", if we think of the function as a signal), an illusion of continuity is maintained. If the interval approaches or becomes longer than the period, the resulting output will show moire effects or break down altogether. This is the same effect that makes fans and propellers appear to slow, then spin backwards when seen on film.

Probably the most famous of the computational noise generation functions is Perlin noise. Developed for the movie Tron, it's been around for decades, and won its creator an Oscar for screen effects that used it.

Perlin noise generates "slow," smooth changes in value over the X axis. These values are typically scaled from their [0,1] range into whatever is useful for a given application, and sampled at different frequencies in order to speed up or slow down the rate of change.

Early in its existence, it was noticed that Perlin noise at certain frequencies tends to look like a natural "ridge line" when viewed edge-on. This effect is significantly magnified by sampling the Perlin noise function at several different rates (often called "octaves" in the literature), and adding or subtracting the resulting curves.

This effect persists when Perlin noise is extended to two dimensions. When two-dimensional Perlin noise is generated over a range of (x, y) values, it produces a sort of "lumpy" or "cloudlike" pattern (very visible if you make an image of it, with each pixel given a greyscale value corresponding to the Perlin value at that position). Again, sampling at different frequencies and adding the results together makes the clouds "fluffier."

If you take that two-dimensional Perlin bitmap, and treat it as a heightmap, where the value indicates the "height" in the third dimension, you get something that often bears a remarkable similarity to natural terrain. Messing with the scale, frequency, number of octaves, and interpretation can produce a wide variety of terrain effects, from rolling plains through dunes, hills, and mountains. A huge amount of the "procedurally generated" terrains you see online are created directly from Perlin noise or a similar function.

The Problems with Perlin Noise¶

Randomness¶

If the solution were "Perlin noise produces great terrains--we're done," this would be a short document, indeed. But there are some downsides.

The first is technical. Perlin noise describes a single, fixed function. Assuming no bugs, your Perlin noise generator and my Perlin noise generator will generate the same function every time. If you use it to generate a terrain, it will generate the same terrain each run.

Pseudo-random number generation has had this problem for ages, of course, and the solution there is to introduce a seed value, effectively a developer-provided starting point for the sequence (often itself derived from some near-random value like the microsection portion of the current clock time).

Perlin noise is unseeded, but we can fake the seed by simply offsetting the input value by a "seed" distance in the X or (X,Y) dimensions, basically just moving the "origin" of the Perlin function somewhere else.

There's a second issue with randomness, although it's not obvious until you look at some Perlin noise results for a while. There's a distinct "grid" to it--that is, terrain features tend to appear in horizontal and vertical lines. Whether or not this is a problem depends on the implementation; the "grid" is generally visible only when large amounts of the terrain are visible at once, so for a game where the player spends most of his or her time on the ground, it may never be obvious.

Simplex noise¶

Both of these problems are solved in a more modern noise function, called Simplex (also developed by Ken Perlin). First, it takes an explicit seed, so you can generate repeatable or non-repeatable output as desired. Second, it's built over a hexagonal structure rather than a gridded one, and with more deviation from those directions. This generates far fewer visible "lines" in the resulting noise. Simplex is also somewhat more computationally efficient, so it takes less computer time to generate.

A few specific implementations and uses for Simplex noise were covered by a U.S. patent, but there are a large number of non-infringing libraries and implementations available (e.g. "OpenSimplex"), and in any case the patent expired in early 2022. Generally speaking, it's almost always better to use simplex noise rather than Perlin.

Natural-ish?¶

The biggest problem with both Perlin and Simplex noise is that while it generates some of the shapes and structures of natural terrain, the terrains described are only superficially "natural." A geologist looking at noise-generated terrain won't be fooled, and even laypeople will tend to find it "boring" compared to real-world terrains. "Sharp" features like geologically young mountain peaks, deep canyons, badlands, cliffs, and other features where sudden discontinuities are common aren't produced at all by most noise-based implementations, and require a lot of tweaking to achieve even if you're trying for them. And of course, in the typical implementations, these are generating height maps over a 2D grid, so there are never any overhangs at all.

The last problem can be addressed in part by moving to 3D noise, which is how games like Minecraft produce their bridges, overhangs, underground caves, and other features. But even if you ignore the blocky-ness of Minecraft, its terrains are actually less natural looking than 2D implementations: it covers more of nature, but at the cost of producing a lot of things nature never could. And this sort of terrain virtually requires an expensive voxel implementation because of it's three-dimensional nature.

In the real world, mountain ranges tend to have foothills because colliding plates ripple like two pieces of gelatin shoved together. Canyons are carved by rivers, and often still have them in the bottom. Desert dunes are formed by the effects and direction of wind. "Sharp" features in climatologically wet areas will tend to crumble and produce debris fields at the bottom. Cliffs next to oceans or lakes will be undercut over time, resulting in sea caves or just collapse. Hurricanes and tsunamis re-shape low-lying islands. Unstable geology falls under gravity, structures become smoother and flatter as they (geologically) age. Hot spots in the mantle produce "arcs" of volcanic mountains, caulderas, or islands as continental plates move over them.

Noise-based algorithms deal with precisely none of that. They generate terrains that make sense locally but not globally. One site discussing perlin-based terrains put it succinctly as "the terrain has no history."

None of this is to say that noise-based terrains have no place. Sometimes realism isn't important. Sometimes local is enough. And almost always, you can use noise-based data to add realistic detail to terrains generated by other methods.

Midpoint Displacement¶

Another popular noise-like algorithm is "Midpoint Displacement." There are numerous variations of this, but in general, the algorithm is:

On a very coarse grid, choose (or randomize) some known positions to be your minimum and maximum heights. These could, for example, be mountaintops, sea floors, etc.
Subdivide the grid in both dimensions by 2x, basically adding a new position between each existing one in both dimensions.
Set the height of that new position to the average of the 2/4 positions around it, plus or minus a small random displacement.
Repeat until you're at the desired resolution, possibly halving the displacement each time.

This is a little more complex in practice than it looks like, since some of the positions need to be handled slightly differently than the others (there's a center square in each quad that has no neighbors with existing values when created), but it's relatively simple. ("Diamond square" is a popular specific algorithm, there are others, as well.)

This doesn't generate quite as nice looking a terrain as the perlin/simplex versions, but it has the advantage that you can force certain positions to have certain values: if you want to insure that the edges of your map are below sea level, for example, or that there are mountains/continents in specific places or quantities.

On the other hand, it shares a fair amount of disadvantages with the other noise methods: the terrain tends to be either smoother or rougher in places than seems "natural," and it in general tends toward the smooth. If you want a deep canyon with steep walls, for example, you'll need to place the high walls and the deep floor next to each other "manually," since they algorithm will almost always produce relatively shallow slopes.

Also, despite seeming fairly fractal, there's really only two scales: the initial "gross features" one and the detailed "rough surface" one. Everything else tends to get smoothed out between them. It's pretty good (maybe better than any of the other algorithms) at creating the rough structure of continents and seas, but is much poorer at things like mountain ranges and hills, unless you provide them in the initial data (which sort of defeats the point).

Voronoi¶

Voronoi diagrams/Voronoi maps are a sort of "region map." A bunch of "seed points" are randomly distributed over a surface, and then every other point is assigned to the "region" containing the closest seed point to it. The region edges are polygons, whose edges are along the exact midpoint between each pair of seed points. This process effectively divides space into a random set of convex polygons.

For example, using Alex Beutel's online Voronoi generator here, we can create an example diagram:

Example Voronoi diagram

The seed points here are the black dots, and the various colored polygons are their associated regions. Note that because they are distributed randomly, the seed points are often quite far from the center of regions they define. Also note that every polygon is convex, and all of the edges are straight lines. The size of the regions is defined by the density of the corresponding seed points; in the diagram above, sparser points on the left tend to lead to larger polygons than on the right.

Despite their apparent simplicity, Voronoi diagrams can be used in a number of ways for terrain generation, as we'll see later.

Height Map: Using the seed points as mountain "peaks" and defining either a fixed slope or one dependent on the polygon size can give you a base for mountain ranges and passes (the mountain passes correspond to the polygon edges, which are the low points of the height maps). Mountains generated in this way are unnaturally conical and "spiky," so this would generally be the starting point for an algorithm that then roughens the surface by adding noise, erosion, etc.

Elevation Generation: Let's take the above map and manipulate it a little bit. If we take every polygon that touches the edge and make it blue, then every polygon that does not touch a blue one white, and the remainder green, you get this:

Same Voronoi diagram, but with coloration showing "distance" from the edge of the map.

If you squint at that hard enough, you can see an island surrounded by water, with central snowy mountain peaks. Using the "distance" (here meaning smallest number of polygons to an edge) to generate elevation zones can give you, again, a starting point for other generation techniques. To get more elevation levels and rougher coastlines, start with more seed points. Amit Patel has a frequently-referenced online article where he takes this technique a lot farther, which you can find here: http://www-cs-students.stanford.edu/~amitp/game-programming/polygon-map-generation/

Others: There are lots of other possibilities. Andy Lo uses them to define continental plates: https://squeakyspacebar.github.io/2017/07/12/Procedural-Map-Generation-With-Voronoi-Diagrams.html, then models the "stress" along their edges as they push together or pull apart. Voronoi maps can be used to place moisture maps, biomes, political divisions, or what have you. Empyrion appears to use a similar technique to determine zone of control around planetary bases (which act as the seed points). Basically any time you've got "zones of influence" that can't overlap, you're ending up with a Voronoi diagram or something like it under the covers.

Natural Processes¶

The natural world is shaped by continental drift, volcanism, gravity, water, and wind (and the occasional drive-by attack by comets and asteroids). To one degree or another, all of these are relatively simple phenomenon that interact in decidedly non-simple ways to produce the astounding diversity of our planet's geography.

This sort of "simple rules to produce non-simple emergent behavior" is the whole idea of procedural generation, and of course it's ideal for computational approximation.

Continental Drift¶

Continental Drift, or more accurately Plate Tectonics, is the science that described the large-scale movement of sections of the Earth's crust, called plates. While it's accepted today as the best scientific description of planetary geology, it's surprisingly young--there was considerable debate about it as late as the 1960's.

For our purposes, we can simplify the ideas to this, while losing some details: The surface of some Earth-like planet is divided into large regions which float on top of the "liquid" mantle. These plates move about, very slowly (centimeters per year). Interesting things happen at the edges:

Where two plates push into each other, the earth buckles up and you get mountain ranges. This isn't a stable situation (mountains can't grow indefinitely), and the heavier plate will be pushed under the lighter one, a process called "subduction." If one or both the the plates are oceanic rather than landmasses, the physics are a little different and you'll generally get a trench or deep ocean valley instead (often with a line of volcanos or volcanic islands some distance back along the uplifted side).
Where two plates pull apart, you get significant volcanism, and the formation of rift valleys (on land) and oceanic ridges at sea, where volcanism effectively produces "new" seafloor to fill in the gaps.
Where plates slide along each other, you get buildups and releases of friction, which cause earthquakes, generally in combination with one of the other two scenarios. The edges of plates are fault lines.

From an implementation standpoint, we can do things as complicated as measuring stresses (see the Andy Lo article above), or as simple as just realizing that mountain ranges, rifts, and trenches tend to follow plate boundaries, and that the interiors of plates distant from the edges tend toward flatness.

Plate tectonics as it occurs on Earth may not be a universal phenomenon; science is ongoing about its existence (or lack of same) on other worlds.

Volcanism¶

Volcanos are a more local effect, especially if you broaden the definition to anytime magma, ash, or hot materials are brought to the surface. They occur around continental plate edges (e.g. the Pacific "Ring of Fire"), often on the uplifted (higher) side of a subduction zone. Those tend to be the clustered ones you see in mountain ranges, like the numerous volcanoes in the Cascade range of the U.S. Pacific Northwest. But volcanoes can form anywhere that there's a sufficient "hot spot" below the earth's crust. Hot spot volcanoes like this are often responsible for island chains--for example, Hawaii's islands were formed by repeated volcanic eruptions of a hot spot as the tectonic plate moved over it. They can also be absolutely massive, such as the super-volcanos that formed the Yellowstone caldera and Lake Toba.

Erosion¶

Gravity, wind, and water all combine into erosion; the sometimes gradual, sometimes sudden process by which parts of the terrain are separated from their initial position, carried away (or just dropped), and deposited somewhere else. Water is perhaps the most powerful of these effects; it can dissolve materials, wear them away though battering, seep into and widen cracks as temperatures change, and carry the resulting debris substantial distances. At colder temperatures, ice is an even more powerful source of erosion.

These changes take place over timescales ranging from seconds in the case of landslides and floods through millennia in the case of mountains flattening. Generally speaking, we can't hope to simulate the actual processes, just the net results. That usually means removing height from one location and adding some or all of it to another. A crude form of this is the Gaussian Blur we use so often to smooth algorithmic maps -- it generally degrades sharp peaks and fills in sharp valleys.

Erosion is often tied to steepness. Avalanches, landslides, hydraulic shearing, and similar techniques rely on significant grades. They tend to steepen the terrain at the point of loss, and make it shallower at the point of deposition. Tidal erosion (where water hits shores) behaves very differently if it hits cliffs (generating sea caves and overhangs, and sometimes causing the higher terrain to fall into the sea) rather than shallower shores (where it will deposit beaches and generally both flatten the terrain and make the undersea portion shallower for some distance). Rivers run faster, narrower, and cut deeper in steep terrains, then spread out, slow down, and deposit over much larger areas when they hit shallower plains.