Editors of prominent mathematics journals are used to fielding grandiose claims from obscure authors, but this paper was different. Written with crystalline clarity and a total command of the topic's current state of the art, it was evidently a serious piece of work, and the Annals editors decided to put it on the fast track.
Just three weeks later -- a blink of an eye compared to the usual pace of mathematics journals -- Zhang received the referee report on his paper.
"The main results are of the first rank," one of the referees wrote. The author had proved "a landmark theorem in the distribution of prime numbers."
Rumors swept through the mathematics community that a great advance had been made by a researcher no one seemed to know -- someone whose talents had been so overlooked after he earned his doctorate in 1992 that he had found it difficult to get an academic job, working for several years as an accountant and even in a Subway sandwich shop.
"Basically, no one knows him," said Andrew Granville, a number theorist at the Universite de Montreal. "Now, suddenly, he has proved one of the great results in the history of number theory."
Reminds me of a certain patent clerk and his theories about time and space. History doesn't repeat itself, but it does rhyme. (via @daveg)
In August of 2012, mathematician Shinichi Mochizuki posted a series of four papers online that purported to prove the ABC Conjecture, "a famed, beguilingly simple number theory problem that had stumped mathematicians for decades". Then, nothing. Or nearly nothing.
The problem, as many mathematicians were discovering when they flocked to Mochizuki's website, was that the proof was impossible to read. The first paper, entitled "Inter-universal Teichmuller Theory I: Construction of Hodge Theaters," starts out by stating that the goal is "to establish an arithmetic version of Teichmuller theory for number fields equipped with an elliptic curve...by applying the theory of semi-graphs of anabelioids, Frobenioids, the etale theta function, and log-shells."
This is not just gibberish to the average layman. It was gibberish to the math community as well.
"Looking at it, you feel a bit like you might be reading a paper from the future, or from outer space," wrote Ellenberg on his blog.
But seeming jibberish by a genius might just be solid mathematics, but Mochizuki isn't doing much to help other mathematicians confirm or refute his assertions. Which raises an interesting point: mathematics isn't all just logic and truth...there's a social element to it as well.
"You don't get to say you've proved something if you haven't explained it," she says. "A proof is a social construct. If the community doesn't understand it, you haven't done your job."
In 2006, Garth Sundem and John Tierney published an equation in the NY Times that attempted to predict celebrity marriage crackups using a few metrics: age, fame, sexiness, etc. The pair recently modified the equation based on the evidence of the last five years and surprisingly, the equation is simpler.
What went right with them -- and wrong with our equation? Garth, a self-professed "uber-geek," has crunched the numbers and discovered a better way to gauge the toxic effects of celebrity. Whereas the old equation measured fame by counting the millions of Google hits, the new equation uses a ratio of two other measures: the number of mentions in The Times divided by mentions in The National Enquirer.
"This is a major improvement in the equation," Garth says. "It turns out that overall fame doesn't matter as much as the flavor of the fame. It's tabloid fame that dooms you. Sure, Katie Holmes had about 160 Enquirer hits, but she had more than twice as many NYT hits. A high NYT/ENQ ratio also explains why Chelsea Clinton and Kate Middleton have better chances than the Kardashian sisters."
Garth's new analysis shows that it's the wife's fame that really matters. While the husband's NYT/ENQ ratio is mildly predictive, the effect is so much weaker than the wife's that it's not included in the new equation. Nor are some variables from the old equation, like the number of previous marriages and the age gap between husband and wife.
You are comfortable with feeling like you have no deep understanding of the problem you are studying. Indeed, when you do have a deep understanding, you have solved the problem and it is time to do something else. This makes the total time you spend in life reveling in your mastery of something quite brief. One of the main skills of research scientists of any type is knowing how to work comfortably and productively in a state of confusion.
The distance between the metal bands holding the cylindrical structure together decreases from top to bottom because the pressure the water exerts increases with depth. The top band only needs to fight against the water at the very top of the tower but the bottom bands have to hold the entire volume from bursting out.
And in celebration, this is my new favorite fact about pi: we have calculated pi out to over 6.4 billion digits but only 39 of them are needed to calculate the circumference of a circle as big as the universe "with a precision comparable to the radius of a hydrogen atom". (via @santheo)
This equation's initial purpose, he wrote, was to put meaningful prices on the terrestrial exoplanets that Kepler was bound to discover. But he soon found it could be used equally well to place any planet-even our own-in a context that was simultaneously cosmic and commercial. In essence, you feed Laughlin's equation some key parameters -- a planet's mass, its estimated temperature, and the age, type, and apparent brightness of its star -- and out pops a number that should, Laughlin says, equate to cold, hard cash.
At the time, the exoplanet Gliese 581 c was thought to be the most Earth-like world known beyond our solar system. The equation said it was worth a measly $160. Mars fared better, priced at $14,000. And Earth? Our planet's value emerged as nearly 5 quadrillion dollars. That's about 100 times Earth's yearly GDP, and perhaps, Laughlin thought, not a bad ballpark estimate for the total economic value of our world and the technological civilization it supports.
Nothing in the news media yet, but many folks on Twitter and colleague Nassim Taleb are reporting that the father of fractal geometry is dead at age 85. We're not there yet, but someday Mandelbrot's name will be mentioned in the same breath as Einstein's as a genius who fundamentally shifted our perception of how the world works.
This 45-minute documentary on Andrew Wiles' proof of Fermat's Last Theorem is surprisingly powerful and emotional. Give it until 1:45 or so and you'll want to watch the whole thing. The film is not really about math; it's about all of those movie trailer cliches -- "one man!", "finds the truth!", "fights the odds!", etc. -- except that this is actually true and poignant.
With Friedman's work, it seems Gödel's delayed triumph has arrived: the final proof that if there is a universal grammar of numbers in which all facets of their behaviour can be expressed, it lies beyond our ken.
But don't worry..."the most severe implications are philosophical". Phew?
After analyzing dozens of Hollywood films, a team of researchers has found evidence that the visual rhythm of movies at the shot level matches a pattern called the 1/f fluctuation, the same pattern that is found in dozens of natually occurring phenomena, including the length of the human attention span.
These results suggest that Hollywood film has become increasingly clustered in packets of shots of similar length. For example, action sequences are typically a cluster of relatively short shots, whereas dialogue sequences (with alternating shots and reverse-shots focused sequentially on the speakers) are likely to be a cluster of longer shots. In this manner and others, film editors and directors have incrementally increased their control over the visual momentum of their narratives, making the relations among shot lengths more coherent over a 70-year span.
I was really into fractals in college (I know...) when I was making rave flyers (I know!) for a friend's parties in Iowa (I know! I know! Shut up already!). Anyway, the thing that I really used to love doing with this fractal application that I had on my computer was zooming in to different parts of the familiar Mandelbrot set as far as I could. I never got very far...between 5 or 6 zooms in, my Packard Bell 486/66 (running Windows 3.11) would buckle under the computational pressure and hang. Therefore, I absolutely love this extremely deep HD zoom into the Mandelbrot set:
Just how deep is this computational rabbit hole?
The final magnification is e.214. Want some perspective? a magnification of e.12 would increase the size of a particle to the same as the earths orbit! e.21 would make a particle look the same size as the milky way and e.42 would be equal to the universe. This zoom smashes all of them all away. If you were "actually" traveling into the fractal your speed would be faster than the speed of light.
After awhile, the self-similarity of the thing is almost too much to bear; I think I went into a coma around 5:00 but snapped to in time for the exciting (but not unexpected) conclusion. Full-screen in a dark room is recommended.
I'll be writing about the elements of mathematics, from pre-school to grad school, for anyone out there who'd like to have a second chance at the subject -- but this time from an adult perspective. It's not intended to be remedial. The goal is to give you a better feeling for what math is all about and why it's so enthralling to those who get it.
More subject blogs like this, please. There are lots of art, politics, technology, fashion, economics, typography, photography, and physics blogs out there, but almost none of them appeal to the beginner or interested non-expert. (thx, steve)
The ham sandwich theorem is also sometimes referred to as the "ham and cheese sandwich theorem", again referring to the special case when n = 3 and the three objects are
1. a chunk of ham, 2. a slice of cheese, and 3. two slices of bread (treated as a single disconnected object).
The theorem then states that it is possible to slice the ham and cheese sandwich in half such that each half contains the same amount of bread, cheese, and ham. It is possible to treat the two slices of bread as a single object, because the theorem only requires that the portion on each side of the plane vary continuously as the plane moves through 3-space.
No idea how this is related to the I Cut You Choose conundrum.
If you and someone else hate the same third person, but like each other, balance theory says you're golden -- all three can persist without changing their opinions. On the other hand, if all three of you despise the others, it's an unstable triad, as well as a wildly common plot point for crime movies. While there are numerous resolutions -- one person changes his preference toward another, a relationship tie is cut -- another route back to stability, albeit a messy one, is the gunning down of at least one person.
Arbesman has some videos and stills on his web site from the movies mentioned in the article as well as the relevant mathematical materials.
Let's say, for example, you want to bet on one of the highlights of the British sporting calendar, the annual university boat race between old rivals Oxford and Cambridge. One bookie is offering 3 to 1 on Cambridge to win and 1 to 4 on Oxford. But a second bookie disagrees and has Cambridge evens (1 to 1) and Oxford at 1 to 2.
Each bookie has looked after his own back, ensuring that it is impossible for you to bet on both Oxford and Cambridge with him and make a profit regardless of the result. However, if you spread your bets between the two bookies, it is possible to guarantee success (see diagram, for details). Having done the calculations, you place £37.50 on Cambridge with bookie 1 and £100 on Oxford with bookie 2. Whatever the result you make a profit of £12.50.
I say relatively because there are literally millions of pages on the web just about blackjack statistics. For instance, it's easy to see how you'll lose money playing blackjack in the long run -- card counting aside -- by looking at this house edge calculator. The only real advantage to the player occurs with a one-deck shoe and a bunch of other pro-player rules, which I imagine are difficult to find at the casinos. (via big contrarian)
Now, here's the part that really boggled me: the Consumption/Waste idea is a 1:1 correspondence (something in yields something out), what mathematicians call a linear function. The Parabola idea connects, pretty obviously, with parabolas -- now we're looking at x raised to the power of two. Annular Systems are modeled by circles which are given in analytic geometry by equations with both x^2 and y^2. Limits and Infinity, of course, become necessary in order to find the area of shapes under curves like parabolas and three-dimensional projections of circles.
Whoa. That is a tiny bit mind-blowing...do I really have time for a reread right now? (thx, nick)
Watch as David Attenborough signals his interest in mating with a male cicada. Scientists think that cicadas have 13- or 17-year mating cycles because, being prime numbers, those periods are not divisible by those periods of potential predators. From Stephen J. Gould:
Many potential predators have 2-5-year life cycles. Such cycles are not set by the availability of cicadas (for they peak too often in years of nonemergence), but cicadas might be eagerly harvested when the cycles coincide. Consider a predator with a life-cycle of five years: if cicadas emerged every 15 years, each bloom would be hit by the predator. By cycling at a large prime number, cicadas minimize the number of coincidences (every 5 x 17, or 85 years, in this case). Thirteen- and 17-year cycles cannot be tracked by any smaller number.
How big a role does randomness play in our lives? Do we live in a world of magic and meaning or ... is it all just chance and happenstance? To tackle this question, we look at the role chance and randomness play in sports, lottery tickets, and even the cells in our own body. Along the way, we talk to a woman suddenly consumed by a frenzied gambling addiction, two friends whose meeting seems purely providential, and some very noisy bacteria.
Regarding the game of Who Can Name the Bigger Number?, Scott Aaronson shows that while 9^9^9^9 might cut the mustard in the first couple of rounds, the numbers and the notation used to express them get much more complicated.
Exponentials are familiar, relevant, intimately connected to the physical world and to human hopes and fears. Using the notational systems I'll discuss next, we can concisely name numbers that make exponentials picayune by comparison, that subjectively speaking exceed 9^9^9^9 as much as the latter exceeds 9.
Geoffrey West of the Santa Fe Institute and his colleagues Jim Brown and Brian Enquist have argued that a 3/4-power law is exactly what you'd expect if natural selection has evolved a transport system for conveying energy and nutrients as efficiently and rapidly as possible to all points of a three-dimensional body, using a fractal network built from a series of branching tubes -- precisely the architecture seen in the circulatory system and the airways of the lung, and not too different from the roads and cables and pipes that keep a city alive.
Joe liked the idea of measuring how long this number would be if it were set in type, which immediately called into question the choice of font. The number's length would depend chiefly on the width of the font selected, and even listener-friendly choices like Times Roman and Helvetica would produce dramatically different outcomes. Small eccentricities in the design of a particular number, such as Times Roman's inexplicably scrawny figure one, would have huge consequences when multiplied out to this length. But even this isn't the hairy part. Where things get difficult, as always, is in the kerning.
In some cases, properly kerning the number resulted in a difference of more than 1000 feet for 12 pt. text.
In the complex formula L represents the number of lumps in the batter and C equals its consistency. The letter F stands for the flipping score, k is the ideal consistency and T is the temperature of the pan. Ideal temp of pan is represented by m, S is the length of time the batter stands before cooking and E is the length of time the cooked pancake sits before being eaten. The closer to 100 the result is -- the better the pancake.
However, a commenter notes:
According to that formula, if you left the pancake batter standing for ten years, (s-e) would be large, and so the pancake would be near perfect. If you let it stand for the same time as you left the pancake to cool, (s-e) would be zero and the pancake would be infinitely bad.
In The Method, Archimedes was working out a way to compute the areas and volumes of objects with curved surfaces, which was also one of the problems that motivated Newton and Leibniz. Ancient mathematicians had long struggled to "square the circle" by calculating its exact area. That problem turned out to be impossible using only a straightedge and compass, the only tools the ancient Greeks allowed themselves. Nevertheless, Archimedes worked out ways of computing the areas of many other curved regions.
The same thing happened: something would look good at first and then turn out to be horrifying. For example, there was a book that started out with four pictures: first there was a windup toy; then there was an automobile; then there was a boy riding a bicycle; then there was something else. And underneath each picture it said, "What makes it go?"
I thought, "I know what it is: They're going to talk about mechanics, how the springs work inside the toy; about chemistry, how the engine of the automobile works; and biology, about how the muscles work."
It was the kind of thing my father would have talked about: "What makes it go? Everything goes because the sun is shining." And then we would have fun discussing it:
"No, the toy goes because the spring is wound up," I would say. "How did the spring get wound up?" he would ask.
"I wound it up."
"And how did you get moving?"
"And food grows only because the sun is shining. So it's because the sun is shining that all these things are moving." That would get the concept across that motion is simply the transformation of the sun's power.
DARPA is soliciting research proposals for people wishing to solve one of twenty-three mathematical challenges, many of which deal with attempting to find a mathematical basis underlying biology.
What are the Fundamental Laws of Biology?: This question will remain front and center for the next 100 years. DARPA places this challenge last as finding these laws will undoubtedly require the mathematics developed in answering several of the questions listed above.
- The air in the Empire State Building weighs about 4 million pounds.
- The energy consumption of the world's population will be greater than the energy coming from the sun in less than 500 years. (Peak photons?)
What's surprising about such estimates is how often they are very close to the reality. This is especially true in a multi-step approximation, where over- and underestimates at various steps tend to cancel each other out, usually resulting in something not too far off from the truth.
Both Microsoft and Google use questions like these as part of their job interview process. We did a bunch of them in my freshman physics class; I loved them.
Banknote patterns fascinate me. I can get lost for hours in all the details, seeing how the patterns fit together, how the lettering works, the tiny security 'flaws' -- they're amazing. Central to banknote designs are Guilloche patterns, which can be created mechanically with a geometric lathe, or more likely these days, mathematically. The mathematical process attracted me immediately as I don't have a geometric lathe and nor do I have anywhere to put one. I do, however, have a computer, and at the point I first started playing with the designs (mid-2004) Illustrator and Photoshop had gained the ability to be scripted.
In case you're wondering, the most densely populated block group is one in New York County, New York -- 3,240 people in 0.0097 square miles, for about 330,000 per square mile. The least dense is in the North Slope Borough of Alaska -- 3 people in 3,246 square miles, or one per 1,082 square miles. The Manhattan block group I mention here is 360 million times more dense than the Alaska one; population densities vary over a huge range.
That's approximately the same range from the height of an iPod to the diameter of the Earth. (via fakeisthenewreal)
Benoit Mandelbrot and Paola Antonelli talk about, among other things, fractals, self-similarity in architecture, algorithms that could specify the creation of entire cities, visual mathematics, and generalists.
This has been for me an extraordinary pleasure because it means a certain misuse of Euclid is dead. Now, of course, I think that Euclid is marvelous, he produced one of the masterpieces of the human mind. But it was not meant to be used as a textbook by millions of students century after century. It was meant for a very small community of mathematicians who were describing their works to one another. It's a very complicated, very interesting book which I admire greatly. But to force beginners into a mathematics in this particular style was a decision taken by teachers and forced upon society. I don't feel that Euclid is the way to start learning mathematics. Learning mathematics should begin by learning the geometry of mountains, of humans. In a certain sense, the geometry of...well, of Mother Nature, and also of buildings, of great architecture.
Speaking of the Yankees, Derek Jeter always seems to get a lot of credit for those four World Series victories in five years but a quick look at the OBP stats for those years shows that Bernie Williams was the engine driving that offense. Jeter's a little overrated maybe?
Called "Hilbert" after the influential German mathematician, David Hilbert, the newly licensed software will be browser accessible and, utilizing AJAX technologies, will emulate the desktop version of the software with remarkable fidelity. "The magic of AJAX will allow OST to combine or 'mash-up' Mathematica with other web-based technologies to deliver and support high quality science and mathematics courses online such as the Calculus&Mathematica courses currently taught through NetMath at the University of Illinois and other universities," explains Scott Gray, Director of the O'Reilly School of Technology.
Hilbert should be available before the end of the year.
Infinite Jest once again proved finite, although it's taken me since August to get through it. This book was such a revelation the first time through that I was afraid of a reread letdown but I enjoyed it even more this time around...and got much more out of the experience too.
MICHAEL SILVERBLATT: I don't know how, exactly, to talk about this book, so I'm going to be reliant upon you to kind of guide me. But something came into my head that may be entirely imaginary, which seemed to be that the book was written in fractals.
DAVID FOSTER WALLACE: Expand on that.
MS: It occurred to me that the way in which the material is presented allows for a subject to be announced in a small form, then there seems to be a fan of subject matter, other subjects, and then it comes back in a second form containing the other subjects in small, and then comes back again as if what were being described were -- and I don't know this kind of science, but it just -- I said to myself this must be fractals.
DFW: It's -- I've heard you were an acute reader. That's one of the things, structurally, that's going on. It's actually structured like something called a Sierpinski Gasket, which is a very primitive kind of pyramidical fractal, although what was structured as a Sierpinski Gasket was the first- was the draft that I delivered to Michael in '94, and it went through some I think 'mercy cuts', so it's probably kind of a lopsided Sierpinski Gasket now. But it's interesting, that's one of the structural ways that it's supposed to kind of come together.
MS: "Michael" is Michael Pietsche, the editor at Little, Brown. What is a Sierpinski Gasket?
DFW: It would be almost im- ... I would almost have to show you. It's kind of a design that a man named Sierpinski I believe developed -- it was quite a bit before the introduction of fractals and before any of the kind of technologies that fractals are a really useful metaphor for. But it looks basically like a pyramid on acid --
To answer Silverblatt's question, a Sierpinski Gasket is constructed by taking a triangle, removing a triangle-shaped piece out of the middle, then doing the same for the remaining pieces, and so on and so forth, like so:
The result is an object of infinite boundary and zero area -- almost literally everything and nothing at the same time. A Sierpinski Gasket is also self-similar...any smaller triangular portion is an exact replica of the whole gasket. You can see why Wallace would have wanted to structure his novel in this fashion.
What's sort of great about it is that it will happen to everybody if you live long enough. If you were born in 2000, it happens instantaneously. The people who were born at the end of the century have to take care of themselves.
Basically, as the leaf grows it is constrained to a 2-d surface, but the cells of some leaves reproduce fast enough to require more surface area than a pi-r-squared plane surface can provide. Its only recourse is to buckle out-of-plane, giving the wrinkles. Since the exuberant growth continues as the leaf grows outward, the buckling process repeats and you get the multi-scale (ripples on ripples on ripples) shape that you see in kale and daffodils.
Cadaeic Cadenza is a 3834-word story by Mike Keith where each word in sequence has the same number of letters as the corresponding digit in pi. (thx, mark, who has more info on constrained writing) Related: The Feynman point is the sequence of six 9s which begins 762 digits into pi. "[Feynman] once stated during a lecture he would like to memorize the digits of pi until that point, so he could recite them and quip 'nine nine nine nine nine nine and so on.'"
And if that weren't enough excitement for one day, it's also Pi Day. (Whoa, the Pi Day web site uses Silkscreen!) I bet the Pi Dayers are really looking forward to 2015 when they can extend the fun to two additional decimal places.
Update:Jeff writes: "Way back when we only used film I learned you could tell before looking at the photo whether someone blinked by asking them what color was the flash. If it was white or bluish white, then their eyes were open. If it was orange, then their eyes were closed and they had 'seen' the flash through their eyelids."
A look at Saks Fifth Avenue's new logo and identity. The identity system consists of cutting up the logo into patterns....98,137,610,226,945,526,221,323,127,451,938,506, 431,029,735,326,490,840,972,261,848,186,538, 906,070,058,088,365,083,852,800,000,000,000 possible patterns.
Last month I covered the hubbub surrounding the still-potential proof of the Poincare conjecture. The best take on the situation was a New Yorker article by Sylvia Nasar and David Gruber, detailing the barest glimpse of the behind-the-scenes workings of the mathematics community, particularly those involving Grigory Perelman, a recluse Russian mathematician who unveiled his potential Poincare proof in 2002 and Shing-Tung Yau, a Chinese mathematician who, the article suggested, was out for more than his fair share of the credit in this matter.
After declining the Fields Medal, the Nobel Prize of mathematics, Perelman has quit mathematics and lives quietly in his native Russia. Yau, however, is upset at his portrayal (both literally and literary) in the New Yorker article and has written a letter to the New Yorker asking them to make a prominent correction and apologize for an illustration of Yau that accompanied the article. From the letter:
I write in the hope of enlisting your immediate assistance, as well as the assistance of The New Yorker, in undoing, to the extent possible, the literally world-wide damage done to Dr. Yau's reputation as a result of the publication of your article. I also write to outline for you, on a preliminary basis, but in some detail, several of the more egregious and actionable errors which you made in the article, and the demonstrably shoddy "journalism" which resulted in their publication.
The letter, addressed to the two authors as well as the fact-checker on the article and CC'd to David Remnick and the New Yorker's general counsel, runs 12 pages, so you may want to have a look at the press release instead. A webcast discussing all the details of the letter is being held at noon on September 20...information on how to tune in will be available at Dr. Yau's web site. (thx, david)
As I mentioned yesterday, the New Yorker published an article by Sylvia Nasar1 and David Gruber about the recent proof of the Poincare Conjecture2. (Previous coverage in the NY Times and the Guardian.) The article, which is unavailable from the New Yorker's web site (they've now made it available), contains the only interview I've seen with Grigory Perelman, the Russian mathematician who published a potential proof of the conjecture in late 2002, gave a series of lectures in the US, and then went back to Russia. Since then, he hasn't communicated with anyone about the proof, has quit mathematics, and recently refused the Fields Medal, the most prestigious award that mathematics has to offer, saying:
It was completely irrelevent for me. Everybody understood that if the proof is correct then no other recognition is needed.
Meanwhile, a Chinese group of mathematicians, led by Shing-Tung Yau3, are claiming that Perelman's proof was too complicated and are offering a reworked proof instead of Perelman's. That is, they're claiming the first complete proof of the conjecture. Yau The active director of Yau's mathematics institute explained the relative contributions thusly:
Hamilton contributed over fifty per cent; the Russian, Perelman, about twenty five per cent; and the Chinese, Yau, Zhu, and Cao et al., about thirty per cent. (Evidently, simple addition can sometimes trip up even a mathematician.)
Clearly the Chinese gave more than 100% in solving this proof, but Yau is regarded by some mathematicians as attempting to grab glory that does not belong to him. John Morgan, a mathematician at Columbia University, says:
Perelman already did it and what he did was complete and correct. I don't seen anything that [Yau et al.] did different.
Yau wants to be associated with the proof of the Poincare Conjecture, to have China associated with it, and for his student, Zhu, to be elevated in status by it. The $1 million in prize money for the proof of the conjecture offered by the Clay Mathematics Institute can't be far from Yau's mind as well. For his part, Grigory Perelman won't say whether he'll accept the prize money until it is offered. Stay tuned, I guess.
 Poincare (properly written as Poincaré) is pronounced Pwan-cah-RAY, not Poyn-care as I said it up until a few weeks ago. ↩
 Yau proved a conjecture by Eugenio Calabi which gave birth to a highly useful mathematical structure called a Calabi-Yau manifold; Yau won the Fields Medal for it. The C-Y manifold is important in string theory and Andrew Wiles used it as part of his proof of Fermat's Last Theorem. In short, Yau is a mathematical stud, no question. ↩
Benford's Law describes a curious phenomenon about the counterintuitive distribution of numbers in sets of non-random data:
A phenomenological law also called the first digit law, first digit phenomenon, or leading digit phenomenon. Benford's law states that in listings, tables of statistics, etc., the digit 1 tends to occur with probability ~30%, much greater than the expected 11.1% (i.e., one digit out of 9). Benford's law can be observed, for instance, by examining tables of logarithms and noting that the first pages are much more worn and smudged than later pages (Newcomb 1881). While Benford's law unquestionably applies to many situations in the real world, a satisfactory explanation has been given only recently through the work of Hill (1996).
I first heard of Benford's Law in connection with the IRS using it to detect tax fraud. If you're cheating on your taxes, you might fill in amounts of money somewhat at random, the distribution of which would not match that of actual financial data. So if the digit "1" shows up on Al Capone's tax return about 15% of the time (as opposed to the expected 30%), the IRS can reasonably assume they should take a closer look at Mr. Capone's return.
Since I installed Movable Type 3.15 back in March 2005, I have been using its "post to the future" option pretty regularly to post my remaindered links...and have been using it almost exclusively for the last few months. That means I'm saving the entries in draft, manually changing the dates and times, and then setting the entries to post at some point in the future. For example, an entry with a timestamp like "2006-02-20 22:19:09" when I wrote the draft might get changed to something like "2006-02-21 08:41:09" for future posting at around 8:41 am the next morning. The point is, I'm choosing basically random numbers for the timestamps of my remaindered links, particularly for the hours and minutes digits. I'm "cheating"...committing post timestamp fraud.
That got me thinking...can I use the distribution of numbers in these post timestamps to detect my cheating? Hoping that I could (or this would be a lot of work wasted), I whipped up a MT template that produced two long strings of numbers: 1) one of all the hours and minutes digits from the post timestamps from May 2005 to the present (i.e. the cheating period), 2) and one of all the hours and minutes digits from Dec 2002 - Jan 2005 (i.e. the control group). Then I used a PHP script to count the numbers in each string, dumped the results into Excel, and graphed the two distributions together. And here's what they look like, followed by a table of the values used to produce the chart:
As expected, 1 & 2 show up less than they should during the cheating period, but not overly so. The real fingerprint of the crime lies with the 8s. The number 8 shows up during the cheating period ~64% more than expected. After thinking about it for awhile, I came up with an explanation for the abundance of 8s. I often schedule posts between 8am-9am so that there's stuff on the site for the early-morning browse and I usually finish off the day with something between 6pm-7pm (18:00 - 19:00). Not exactly the glaring evidence I was expecting, but you can still tell.
The obvious next question is, can this technqiue be utilized for anything useful? How about detecting comment, trackback. or ping spam? I imagine IPs and timestamps from these types of spam are forged to at least some extent. The difficulties are getting enough data to be statistically significant (one forged timestamp isn't enough to tell anything) and having "clean" data to compare it against. In my case, I knew when and where to look for the cheating...it's unclear if someone who didn't know about the timestamp tampering would have been able to detect it. I bet companies with services that deal with huge amounts of spam (Gmail, Yahoo Mail, Hotmail, TypePad, Technorati) could use this technique to filter out the unwanted emails, comments, trackbacks, or pings...although there's probably better methods for doing so.
 I've been doing this to achieve a more regular publishing schedule for kottke.org. I typically do a lot of work in the evening and at night and instead of posting all the links in a bunch from 10pm to 1am, I space them out over the course of the next day. Not a big deal because increasing few of the links I feature are time-sensitive and it's better for readers who check back several times a day for updates...they've always got a little something new to read.
 You'll also notice that the distributions don't quite follow Benford's Law either. Because of the constraints on which digits can appear in timestamps (e.g. you can never have a timestamp of 71:95), some digits appear proportionally more or less than they would in statistical data. Here's the distribution of digits of every possible time from 00:00 to 23:59:
Here's a sampling of the rest of the AIGA Design Conference, stuff that I haven't covered yet and didn't belong in a post of it's own:
Juan Enriquez gave what was probably my favorite talk about what's going on in the world of genetics right now. I've heard him give a variation of this talk before (at PopTech, I think). He started off talking about coding systems and how when they get more efficient (in the way that the Romance languages are more efficient than Chinese languages), the more powerful they become in human hands. Binary is very powerful because you can encode text, images, video, etc. using just two symbols, 1 and 0. Segue to DNA, a four symbol language to make living organisms...obviously quite powerful in human hands.
Enriquez: All life is imperfectly transmitted code. That's what evolution is, and without the imperfections, there would be no life. The little differences over long periods of time are what's important.
Enriquez again: The mosquito is a flying hypodermic needle. That's how it delivers malaria to humans. We could use that same capability for vaccinating cows against disease.
1. Design is the easy part. 2. Learn from your clients, bosses, collaborators, and colleagues. 3. Content is king. 4. Read. Read. Read. 5. Think first, then design. 6. Never forget how lucky you are. Enjoy yourself.
Nicholas Negroponte: If programmers got paid to remove code from sofware instead of writing new code, software would be a whole lot better.
Negroponte also shared a story about outfitting the kids in a school in Cambodia with laptops; the kids' first English word was "Google", and from what Negroponte said, that was followed closely by "Skype". He also said the children's parents loved the laptops because at night, it was the brightest light in the house.
Ben Karlin and Paula Scher on the challenges of making America, The Book: Books are more daunting than doing TV because print allows for a much greater density of jokes. In trying to shoot the cover image, they found that bald eagles cannot be used live for marketing or advertising purposes. The solution? A golden eagle and Photoshop. And for a spread depicting all the Supreme Court Justices in the buff, they struggled -- even with the Web -- to find nude photos of older people until they found a Vermont nudist colony willing to send them photos because they were big fans of The Daily Show.
Bill Strickland blew the doors off the conference with his account of the work he's doing in "curing cancer" -- his term for revitalizing violent and crime-ridden neighborhoods -- in Pittsburgh. I can't do justice to his talk, so two short anecdotes. Strickland said he realized that "poor people never have a nice day" so when he built his buildings in these poor black neighbohoods, he put nice fountains out front so that people coming into the building know that they're entering a space where it's possible to have a good day. Another time, a bigwig of some sort was visiting the center and asked Strickland about the flowers he saw everywhere. Flowers in the hood? How'd these get here? Strickland told him "you don't need a task force or study group to buy flowers" and that he'd just got in his car, bought some flowers, brought them back, and set them around the place. His point in all this was creating a place where people feel less dissimilar to each other...black, white, rich, poor, everybody has a right to flowers and an education and to be treated with respect and to have a nice day. You start treating people like that, and surprise!, they thrive. Strickland's inner city programs have produced Fulbright Scholars, Pulitzer Prize winners, and tons of college graduates.
And some final thoughts from others at the conference. Peter Merholz says that "form-makers", which make up the vast majority of the AIGA audience, "are being passed by those who are attempting to use design to serve more strategic ends". (That's an interesting thought...) A pair of reviews from Speak Up: Bryony was a bit disappointed with the opening Design Gala but left, like everyone else, in love with emcee John Hockenberry while Armin noted that the preservation of digital files is a big concern for museums in building a collection of graphic design pieces...in 35 years, how are you going load that Quark file or run that Flash movie?
The list of the 100 greatest theorems in mathematics is topped by The Irrationality of the Square Root of 2 from that nutball Pythagoras. Jesus, who does Godel have to sleep with to get higher on this list...I mean, all the man did was destroy math! (I know, I know, oversimplification, please don't send me any email....) (via cyn-c)
The competitive Scrabble world is starting to see some top-notch players for whom English is not their native language. At he highest level of competition, "Scrabble's secret is that it's a math game: board geometry, strategic decision making, probability and chance." And sometimes it's better not knowing English so the player can focus solely on the memorization of patterns and gameplay. Interesting stuff.
xThink Calculator is a math calculation program that recognizes handwritten input from a Tablet PC (check out the screenshots). Pretty darn nifty and reminiscent of Denim, a tool for UI design. (thx nick)
"The hairy ball theorem of algebraic topology states that, in layman's terms, 'one cannot comb the hair on a ball in a smooth manner'". Heh. Looks like Wikipedia has some new measures in placeto deal with spam/trolls: "This page has been protected from editing to deal with vandalism."
They wonder whether the digits contain a hidden rule, an as yet unseen architecture, close to the mind of God. A subtle and fantastic order may appear in the digits of pi way out there somewhere; no one knows. No one has ever proved, for example, that pi does not turn into nothing but nines and zeros, spattered to infinity in some peculiar arrangement. If we were to explore the digits of pi far enough, they might resolve into a breathtaking numerical pattern, as knotty as "The Book of Kells," and it might mean something. It might be a small but interesting message from God, hidden in the crypt of the circle, awaiting notice by a mathematician.
The Chudnovsky article also reminds me of Contact by Carl Sagan in which pi is prominently featured as well.
According to Wolfram Research's Mathworld, the current world record for the calculation of digits in pi is 1241100000000 digits, held by Japanese computer scientists Kanada, Ushio and Kuroda. Kanada is named in the article as the Chudnovskys main competitor at the time.
(Oh, and as for patterns hidden in pi, we've already found one. It's called the circle. Just because humans discovered circles first and pi later shouldn't mean that the latter is derived from the former.)
DFW is a favorite of mine, but I was disappointed in Everything and More. Perhaps I wasn't part of the intended audience, but with an interest in all things Wallace, a college degree in physics, a general interest in mathematics, and avid reader of popular science books, if not me, then for whom was this book written?
Mostly I was bothered by Wallace's signature writing style, which usually challenges the reader in delightful ways. In E&M, he ratcheted his style up to such a degree that it became as obfuscating as the math he was trying to explain. Not that he should have used only words of four letters or less, but a greater degree of clarity and simplicity would have been appreciated to let the parodoxical beauty and the beautiful paradox of transfinite math show (which Jim Holt did more successfully than Wallace in his New Yorker review of the book).