Advertise here with Carbon Ads

This site is made possible by member support. ❤️

Big thanks to Arcustech for hosting the site and offering amazing tech support.

When you buy through links on kottke.org, I may earn an affiliate commission. Thanks for supporting the site!

kottke.org. home of fine hypertext products since 1998.

🍔  💀  📸  😭  🕳️  🤠  🎬  🥔

kottke.org posts about weblogs

A list of the most-linked blogs from

A list of the most-linked blogs from September 2000 (scroll for the notes at the end). Metalog and Metalog Ratings comprised the first weblog tracking/ranking system, predating most of the current crop by at least a year or two.


Short interview with Moulitsas Zuniga, founder of Daily Kos.

Short interview with Moulitsas Zuniga, founder of Daily Kos.


Most comments

After linking to a particularly active thread on a politics blog, Chris asks:

What is the record for the most amount of comments left on a blog?

The Matrix Reloaded thread (it actually spans two threads because MT was beginning to buckle under the pressure) got 1767 comments in six months. MetaFilter’s longest thread has 1729 comments. I’ve seen 1000+ comment threads on Dooce and political blogs like Daily Kos probably have 1000+ comments threads all the time. This Engadget thread has 3324 comments. Slashdot’s thread on the end of the 2004 Presidential election garnered 5687 comments. (This SpyMac forum thread seems to have about 167,000 comments, but it’s not a blog and seems like cheating because it was an attempt at the longest thread ever.)

Any other contenders? Digg? Huffington Post?


Speaking of wine blogs, Wine Library has

Speaking of wine blogs, Wine Library has a video blog about wine. Not sure about the spit bucket thing on camera tho… (thx, erik)


Beer as in journalism

Glenn Reynolds makes an interesting analogy about journalism and beer making in his new book:

Without formal training and using cheap equipment, almost anyone can do it. The quality may be variable, but the best home-brews are tastier than the stuff you see advertised during the Super Bowl. This is because big brewers, particularly in America, have long aimed to reach the largest market by pushing bland brands that offend no one. The rise of home-brewing, however, has forced them to create “micro-brews” that actually taste of something. In the same way, argues Mr Reynolds, bloggers—individuals who publish their thoughts on the internet—have shaken up the mainstream media (or MSM, in blogger parlance).

What, no “drunk on power” quip? Curiously, the Economist piece fails to mention the name of Reynolds’ book, An Army of Davids, although it appears over in the right sidebar, almost camouflaged as an ad.


The Pour is a wine blog by

The Pour is a wine blog by the NY Times wine guy, Eric Asimov. Asimov joins Frank Bruni on the food and bev blogging front for the Times. The Pour includes a list of links to other wine blogs and resources as well. Nicely done.


Lots of links about the Internet of

Lots of links about the Internet of Things, objects that blog, spimes, and Everyware. Cyberdyne Systems, here we come…blogging pigeons will beget blogging F-16s faster than you’d think.


Jeremy Keith on comments: “I’d like to

Jeremy Keith on comments: “I’d like to propose a corollary of Sturgeon’s Law for blogs: Comments should be disabled 90% of the time.”


Under Odysseus is a weblog written by

Under Odysseus is a weblog written by Eurylochus, a Greek participating in the Trojan War. “There was a lot of shit-talking. Hector kept shouting that Ajax wasn’t much of a substitution for Achilles. Ajax would respond that Hector was just flattering himself.” (thx, mark)


Jonathan Crowe ran an Olympics-themed weblog for

Jonathan Crowe ran an Olympics-themed weblog for Athens 2004 and Torino 2006. Interestingly, the 2004 version got a lot more traffic, but more recent one made him more money via Google AdSense. “Whether [the increase is] due to better ad block positioning, ‘better’ ads (more on-target or more lucrative), a ‘better’ audience, or simply a more mature advertising network, I have no idea.”


Slashlinks

Ben Engebreth, a compadre of mine at the Eyebeam OpenLab, has released Slashlinks, a tool for automatically mirroring links from del.icio.us to your personal web site. At first glance it might sound like a simple archiving tool, a way to get your data out of del.icio.us, but what it actually does is reproduces your del.icio.us links on your web site.

Check out Ben’s links for an example. If you click on a tag name, you can see that not only the links but the underlying tag structure has been reproduced locally. Once the links are on your site, you can style them how you wish (as Ben has), publish them where you want, etc. And Slashlinks will also keep your local links fresh…if you keep using the publishing tools at del.icio.us to add links, they will automagically show up on your site.


Malcolm Gladwell has a blog.

Malcolm Gladwell has a blog.


Oh, what a year

One year ago today, I asked the readers of kottke.org to become micropatrons and support my efforts in producing the site for a year. Over the course of three weeks, people generously sent in their financial support[1], giving me enough to pay my salary for the entire year[2] and not have to bug you about it every few days.

So the year is up and I’ve been trying to think about what to say on this occasion for, oh, about six months now, but I’m undecided even now. I guess I’ll start with the important bit.

I’m not going to be asking for contributions again. Part of it has to do with the reasons outlined at the bottom of this post. I haven’t grown traffic enough or developed a sufficient cult of personality to make the subscription model a sustainable one for kottke.org…those things just aren’t interesting to me.

The other big reason is that my life has changed a lot in the past year. Growing a new business with a novel (or at least challenging) business model requires lots of time and energy to build the necessary momentum…basically approaching it with a startup mentality: long hours, work on the weekends, less time to spend with family and friends, making work the #1 priority, etc. My (unstated) intention from the beginning was to approach the site as a startup, but along the way life intervened (in a good way) and I couldn’t focus on it as much as I wanted to. The site became a normal job, a 9-to-5 affair, which meant that I could keep up with it, but growth was hard to come by.

So what’s going to happen with kottke.org? I’m not quite sure at this point. In the short term, it’s going to be taking a back seat to some other things going on in my life. Longer term, who knows? I might look for other ways to fund my efforts on the site or maybe it goes back to being more of a hobby. But there will be posts and links and other things here almost daily, just like there have been for almost 8 years now.

And that leaves approximately everything else, if anything, unsaid. If you’re curious about something related to the end of the micropatron experiment, send me an email with your question. I’ll choose the most interesting and/or representative ones and post my responses to them in a future entry. I’ll give special consideration to questions from micropatrons. Or post your thoughts to your blog, send me a link, and I’ll compile those as well. And as always, your feedback is appreciated via email. (And sorry in advance if I can’t respond to your questions individually, although I’ll try my best.)

[1] Again, thanks to everyone who contributed for their support. In this age of ad-supported media, it means a great deal to me that you felt strongly enough about kottke.org to support it directly. I’d also like to thank Eyebeam, the companies and people who contributed the fund drive gifts, thelist, Jonah, and Meg for their help and support.

[2] Since everyone and their uncle has been asking, about 1450 micropatrons contributed $39,900 over the past year…99.9% of that coming during the 3 week fund drive.


Catching cheaters with Benford’s Law

Benford’s Law describes a curious phenomenon about the counterintuitive distribution of numbers in sets of non-random data:

A phenomenological law also called the first digit law, first digit phenomenon, or leading digit phenomenon. Benford’s law states that in listings, tables of statistics, etc., the digit 1 tends to occur with probability ~30%, much greater than the expected 11.1% (i.e., one digit out of 9). Benford’s law can be observed, for instance, by examining tables of logarithms and noting that the first pages are much more worn and smudged than later pages (Newcomb 1881). While Benford’s law unquestionably applies to many situations in the real world, a satisfactory explanation has been given only recently through the work of Hill (1996).

I first heard of Benford’s Law in connection with the IRS using it to detect tax fraud. If you’re cheating on your taxes, you might fill in amounts of money somewhat at random, the distribution of which would not match that of actual financial data. So if the digit “1” shows up on Al Capone’s tax return about 15% of the time (as opposed to the expected 30%), the IRS can reasonably assume they should take a closer look at Mr. Capone’s return.

Since I installed Movable Type 3.15 back in March 2005, I have been using its “post to the future” option pretty regularly to post my remaindered links…and have been using it almost exclusively for the last few months[1]. That means I’m saving the entries in draft, manually changing the dates and times, and then setting the entries to post at some point in the future. For example, an entry with a timestamp like “2006-02-20 22:19:09” when I wrote the draft might get changed to something like “2006-02-21 08:41:09” for future posting at around 8:41 am the next morning. The point is, I’m choosing basically random numbers for the timestamps of my remaindered links, particularly for the hours and minutes digits. I’m “cheating”…committing post timestamp fraud.

That got me thinking…can I use the distribution of numbers in these post timestamps to detect my cheating? Hoping that I could (or this would be a lot of work wasted), I whipped up a MT template that produced two long strings of numbers: 1) one of all the hours and minutes digits from the post timestamps from May 2005 to the present (i.e. the cheating period), 2) and one of all the hours and minutes digits from Dec 2002 - Jan 2005 (i.e. the control group). Then I used a PHP script to count the numbers in each string, dumped the results into Excel, and graphed the two distributions together. And here’s what they look like, followed by a table of the values used to produce the chart:

Catching cheaters

Digit   5/05-now   12/02-1/05   Difference
131.76%33.46%1.70%
211.76%14.65%2.89%
310.30%9.96%0.34%
410.44%9.58%0.86%
510.02%10.52%0.51%
64.83%5.40%0.57%
75.66%4.96%0.70%
87.62%4.65%2.97%
97.60%6.81%0.79%

As expected, 1 & 2 show up less than they should during the cheating period, but not overly so[2]. The real fingerprint of the crime lies with the 8s. The number 8 shows up during the cheating period ~64% more than expected. After thinking about it for awhile, I came up with an explanation for the abundance of 8s. I often schedule posts between 8am-9am so that there’s stuff on the site for the early-morning browse and I usually finish off the day with something between 6pm-7pm (18:00 - 19:00). Not exactly the glaring evidence I was expecting, but you can still tell.

The obvious next question is, can this technqiue be utilized for anything useful? How about detecting comment, trackback. or ping spam? I imagine IPs and timestamps from these types of spam are forged to at least some extent. The difficulties are getting enough data to be statistically significant (one forged timestamp isn’t enough to tell anything) and having “clean” data to compare it against. In my case, I knew when and where to look for the cheating…it’s unclear if someone who didn’t know about the timestamp tampering would have been able to detect it. I bet companies with services that deal with huge amounts of spam (Gmail, Yahoo Mail, Hotmail, TypePad, Technorati) could use this technique to filter out the unwanted emails, comments, trackbacks, or pings…although there’s probably better methods for doing so.

[1] I’ve been doing this to achieve a more regular publishing schedule for kottke.org. I typically do a lot of work in the evening and at night and instead of posting all the links in a bunch from 10pm to 1am, I space them out over the course of the next day. Not a big deal because increasing few of the links I feature are time-sensitive and it’s better for readers who check back several times a day for updates…they’ve always got a little something new to read.

[2] You’ll also notice that the distributions don’t quite follow Benford’s Law either. Because of the constraints on which digits can appear in timestamps (e.g. you can never have a timestamp of 71:95), some digits appear proportionally more or less than they would in statistical data. Here’s the distribution of digits of every possible time from 00:00 to 23:59:

1 - 25.33
2 - 17.49
3 - 12.27
4 - 10.97
5 - 10.97
6 - 5.74
7 - 5.74
8 - 5.74
9 - 5.74


Decent article about blogs (a rarity these

Decent article about blogs (a rarity these days) from the Financial Times. “Each blogger was his, or her, own printing press, spontaneously exercising their freedom to criticise. Which is great. But along the way, opinion became the new pornography on the internet.”


You’re Safired!

Wes Felter calls for the ass fact-checking of William Safire over the latter’s article in the NY Times about blog jargon and he’s not wrong. Wes correctly notes the etymology of “weblog” and “blog” and hopefully the people responsible for things like the AP Style Guide, English dictionaries, and influential columns like On Language will, at some point, do the 20 minutes of research necessary to convince them and the unwashed journalist masses that “blog” is not and was never short for “web log”.

Safire also gets tripped up on where the word “blogosphere” came from. While William Quick’s usage in 2002 popularized the term, Brad Graham first used the term in 1999.


Not fit to print

Earlier today I posted a link to Frank Bruni’s new food blog over at the NY Times. At the same time, I added a comment to this post about how restaurant reservations work here in NYC. I went back to see if there was any further conversation and my comment had been deleted (or had otherwise disappeared). Not such a good start. I’ve resubmitted the comment…we’ll see how long it lasts.


Between the Squibs is a blog highlighting

Between the Squibs is a blog highlighting articles from the Complete New Yorker DVD set.


NY Times food critic Frank Bruni has

NY Times food critic Frank Bruni has a new blog where he’s going to write about some of the stuff that happens during his eating week that doesn’t make it into the newspaper. Here’s the intro post.


The Dumpster — a new project by

The Dumpster — a new project by Golan Levin — is a “portrait of romantic breakups collected from blogs in 2005.


Keynoting(!) at SXSW 2006

Through an improbable series of clerical errors, I am scheduled to participate in a “keynote conversation” about professional blogging with Heather Armstrong at SXSW in Austin, Texas next month. Armstrong, so the story goes, got fired for blogging at work and was rewarded with a loving husband, cutie-pie daughter, photogenic dog, several television appearances, hundreds of media mentions, and a new job — talking about poop all day — that supports her entire family. And so but by the way, she’s also headlining the entire SXSW Festival along with Rock and Roll Hall of Famer Neil Young. Which makes me approximately chopped liver. When I told Meg about the headlining thing, she said, “boy, that conversation had better be good”. Pressure’s on, Heather.

To sum up, a piece of chopped liver will be having a chat with a nice lady from Utah next month about blogging for groceries. Should be fun.


DFL is a blog highlighting the last

DFL is a blog highlighting the last place finishers in Olympic events. Eddie the Eagle should be the site’s mascot.


Skiing the online slopes

Since I’ve been skiing a little bit recently (for the first time in years), I decided to check out what was happening online in the skiing world. Specifically I wondered if there were any ski blogs out there and if the many ski magazines offer online archives of their content.

Just like every other topic under the sun, skiing is well covered in blog land; no chance for fresh tracks here. A couple of quick searches uncovered blogs about backcountry skiing, New England skiing, ski adventures from around the country, skiing products and fashion, Colorado skiing, an attempt to ski 120 days of powder, Euro-centric skiing, and even a skiing videoblog.

Most of the skiing blogs I found focus on their respective author’s adventures on the slopes. If someone wanted to start a skiing meta-blog (blogging not just skiing adventures but other skiing-related topics and pointing to other people’s adventures), would there be enough good information out there to point to? The magazine racks of ski country convenience stores are filled with all kinds of periodicals about skiing…how much of that content is online? From what I can tell, the skiing magazines do offer content on their sites, but not necessarily from the pages of their print magazines. Both SKI Magazine and Skiing Magazine have archived print articles on their sites, but only from June 2005 and earlier. Both have other resources like forums, skiing news, resort details, videos, and online-only features. Neither site is organized particularly well for quick information perusal and retrieval. Skipressworld offers PDF versions of their entire print magazine online, including the current issue. Powder magazine has some online archives as well as online-only features like videos and message boards.

And so on…Google News is currently featuring over 10,000 articles about skiing (although much of that is due to the impending Winter Olympics), Flickr has thousands of skiing photos, and nearly all the ski areas an resorts have web sites on which you can check the current conditions, the lines at the chairlift via webcams, and trail maps. Killington is even doing podcasts.

So there’s lots of skiing info out there. I know there must be a few skiers among the kottke.org readership…what are your favorite skiing sites and resources online?


This. Is. Hilarious.

This. Is. Hilarious.


Blogs versus the NY Times in Google

In 2002, Dave Winer of Scripting News and Martin Nisenholtz of the New York Times made a Long Bet about the authority of weblogs versus that of NY Times in Google:

In a Google search of five keywords or phrases representing the top five news stories of 2007, weblogs will rank higher than the New York Times’ Web site.

I decided to see how well each side is doing by checking the results for the top news stories of 2005. Eight news stories were selected and an appropriate Google keyword search was chosen for each one of them. I went through the search results for each keyword and noted the positions of the top results from 1) “traditional” media, 2) citizen media, 3) blogs, and 4) nytimes.com. Finally, the scores were tallied and an “actual” winner (blogs vs. nytimes.com) and an “in-spirit” winner (any traditional media source vs. any citizen media source) were calculated. (For more on the methodology, definitions, and caveats, read the methodology section below.)

So how did the NY Times fare against blogs? Not very well. For eight top news stories of 2005, blogs were listed in Google search results before the Times six times, the Times only twice. The in-spirit winner was traditional media by a 6-2 score over citizen media. Here the specific results:

1) Hurricane Katrina hits New Orleans.
Search term: “hurricane katrina”

3. Top citizen media result (Wikipedia)
13. Top media result (CNN)
56. Top NY Times mention (NY Times).
61. Top blog result (Kaye’s Hurricane Blog)

Winner (in spirit): Citizen media
Winner (actual): NY Times

2) Big changes in the US Supreme Court (Rhenquist dies, O’Conner retires, Roberts appointed Chief Justice, Harriet Miers rejected).
Search term: “harriet miers”

4. Top media result (Washington Post)
5. Top citizen media result (Wikipedia)
8. Top NY Times mention (NY Times)
11. Top blog result (TalkLeft)

Winner (in spirit): Media
Winner (actual): NY Times

3) Terrorists bomb London, killing 52.
Search term: “london bombing”

1. Top media result (CNN)
2. Top citizen media result (Wikipedia)
21. Top blog result Schneier on Security
No NY Times article appears in the first 100 results.

Winner (in spirit): Media
Winner (actual): Blogs

4) First elections in Iraq after Saddam.
Search term: “iraq election”

1. Top media result (BBC News)
6. Top blog result (Iraq elections newswire)
6. Top citizen media result (Iraq elections newswire)
14. Top NY Times mention (NY Times)

Winner (in spirit): Media
Winner (actual): Blogs

5) Terri Schiavo legal fight and death.
Search term: “terri schiavo”

2. Top blog result (Abstract Appeal)
2. Top citizen media result (Abstract Appeal)
4. Top media result (CNN)
65. Top NY Times mention (NY Times)

Winner (in spirit): Citizen media
Winner (actual): Blogs

6) Pope John Paul II dies and Cardinal Joseph Ratzinger appointed Pope Benedict XVI.
Search term: “pope john paul ii death”

1. Top media result (CNN)
3. Top citizen media result (Wikipedia)
58. Top blog result (The Pope Blog: Pope Benedict XVI)
No NY Times article appears in the first 100 results.

Winner (in spirit): Media
Winner (actual): Blogs

7) The Israeli withdrawal from the Gaza Strip.
Search term: “gaza withdrawal”

1. Top media result (Worldpress.org)
31. Top blog result (Simply Appalling)
31. Top citizen media result (Simply Appalling)
No NY Times article appears in the first 100 results.

Winner (in spirit): Media
Winner (actual): Blogs

8) The investigation into the Valerie Plame affair, Judith Miller, Scooter Libby indicted, etc..
Search term: “scooter libby indicted”:

1. Top media result (CNN)
15. Top blog result (Seven Generational Ruminations)
15. Top citizen media result (Seven Generational Ruminations)
43. Top NY Times mention (NY Times)

Winner (in spirit): Media
Winner (actual): Blogs

And just for fun here’s a search for “judith miller jail” (not included in the final tally):

1. Top media result (Washington Post)
3. Top blog result (Gawker)
3. Top citizen media result (Gawker)
No NY Times article appears in the first 100 results (even though there are several matching articles on the Times site).

In covering the jailing of their own reporter, the Times lagged in the Google results behind such informational juggernauts as Drinking Liberally, GOP Vixen, and Feral Scholar.

Winner (in spirit): Media
Winner (actual): Blogs

Here’s the overall results, excluding the Judith Miller search:

Overall winner (in spirit): Media (beating citizen media 6-2).
Overall winner (actual): Blogs (beating the NY Times 6-2).

Some observations:

  • My feeling is that Mr. Nisenholtz will likely lose his bet come 2007. Even though the nytimes.com fares very well in getting linked to by the blogosphere, it does very poorly in Google. This isn’t exactly surprising given that most NY Times articles disappear behind a paywall after a week and some of their content (TimesSelect) isn’t even publicly accessible at all. Also, I didn’t look too closely at the HTML markup of the NY Times, but it could also be that it’s not as optimized for Google as well as that of some weblogs and other media outlets.
  • “www.nytimes.com” has a PageRank of 10/10, higher than that of “www.cnn.com” (9/10), yet stories from CNN consistently appeared higher in the search results than those from the Times. The Times clearly has overall authority according to Google, but when it comes to specific instances, it falls short. In some cases, a NY Times story didn’t even appear in the first 100 search results for these keyword searches.
  • By 2007, it may be difficult to differentiate a blog from a traditional media source. All of the Gawker and Weblogs, Inc. sites are presented in a blog format and are referred to as blogs but otherwise how are they distinguishable from traditional media? Engadget paid to send 12 people to cover the CES technology conference, probably as many or more than the Times sent. The Sundance film festival was heavily covered by paid writers for both companies as well. In the spirit in which this bet was made, I’d have a hard time counting any of their sites as blogs. (And what about kottke.org? I get paid to write it. Am I still a member of the citizen media or have I crossed over?)
  • Choosing appropriate news stories and keywords for those stories was difficult in some cases. Katrina was a no-brainer, but was the Terri Schiavo story really one of the top eight news stories of 2005? Resolving the methodology for this bet in 2007 will be tricky. I wonder how the Long Bets Foundation will handle its determination of the victory.
  • Wikipedia does very well in Google results for topical search terms. Overall, traditional media still dominates (in first appearance as well as number of results), but blogs and Wikipedia do very well in some instances.
  • What do these results mean? Probably not a whole lot. Nisenholtz asserts that “[news] organizations like the Times can provide that far more consistently than private parties can” while Winer says that “in five years, the publishing world will have changed so thoroughly that informed people will look to amateurs they trust for the information they want”. It’s difficult to draw any conclusions on this matter based on these results. Contrary to what most people believe, PageRank has a bias, a point of view. That POV is based largely (but not entirely) on what people are linking to. As someone said in the discussion of this bet, this bet is about Google more than influence or reputation, so these results probably tell us more about how Google determines influence on a keyword basis rather than how readers of online informational sources value or rate those sources. Do web users prefer the news coverage of blogs to that of the NY Times? I don’t think you can even come close to answering that question based on these results.

Methodology and caveats

The eight news stories were culled from various sources (Lexis-Nexis, Wikipedia, NY Times) and narrowed down to the top stories that would have been prominently covered in both the NY Times and blogs.

The keyword phrase for each of the eight stories was selected by the trial and error discovery of the shortest possible phrase that yielded targeted search results about the subject in question. In some cases, the keyword phrase chosen only returned results for a part of a larger news story. For instance, the phrase “pope john paul” was not specific enough to get targeted results, so “pope john paul ii death” was used, but that didn’t give results about the larger story of his death, the conclave to select a new pope, and the selection of Cardinal Joseph Ratzinger as Pope Benedict XVI. In the case of “katrina”, that single keyword was enough to produce hundreds of targeted search results for both Hurricane Katrina and its aftermath. Keyword phrases were not tinkered with to promote or demote particular types of search results (i.e. those for blogs or nytimes.com); they were only adjusted for the relevence of overall results.

The searches were all done on January 27, 2006 with Google’s main search engine, not their news specific search.

Since the spirit of the bet deals with the influence of traditional media versus that of citizen-produced media, I tracked the top traditional media (labeled just “media” above) results and the top citizen media results in addition to blog and nytimes.com results. For the purposes of this exercise, relevent results were those that linked to pages that an interested reader would use as a source of information about a news story. For citizen media, this meant pages on Wikipedia, Flickr (in some cases), weblogs, message boards, wikis, etc. were fair game. For traditional media, this meant articles, special news packages, photo essays, videos, etc.

In differentiating between “media” & citizen media and also between relevent and non-relevent results, in only one instance did this matter. Harriet Miers’s Blog!!!, a fictional satire written as if the author were Harriet Miers, was the third result for this keyword phrase, but since the blog was not a informational resource, I excluded it. In all other cases, it was pretty clear-cut.


High volume flow

David Carr wrote an article for the NY Times about the Washington Post’s recent decision to close down comments on their blog when one of their threads turned ugly. As the article points out, the issue of web sites having problems dealing with feedback (particularly published feedback like comments) is not localized to mainstream media publications:

Mickey Kaus of kausfiles.com, which does not carry comments, said that “the world is crying out for the jerk-zapper,” although he added that he thought that The Washington Post’s Web site overreacted. BoingBoing, a heavily trafficked “directory of wonderful things,” shut down its comments section last year. “We took a lot of heat over it,” said Xeni Jardin, a founder of the site. “But until we are able to come up with a better comments system - most of what is out there is too crude - it is not worth the trouble.

If you’re wondering why the comments on kottke.org aren’t on more often, this is the reason.[1] This site is a one-person operation and even though I work on it full-time, I don’t have the throughput to manage a lot of threads. Comment gardening (as I call it) is hard work if you want to maintain an appropriate level of discourse. And as Xeni said, the current technological and user experience solutions suck. Approved commenting, sign-in to comment, Slashdot-like comment moderation…they all have their problems.

As an experiment back in October, I opened the comments on all threads on kottke.org for a little over a week. During that time, I kept track of my comment gardening duties, basically everything I did to keep those threads clear of trolling, flaming, off-topic comments, and the like. The only thing I didn’t record was how many times per day I checked for activity in all the open threads — every 15-30 minutes or so while I was awake (~8am to midnight) — because I would have been too busy recording the checking to actually do the checking. At one point, I had almost 60 simultaneous threads open and was spending half my day keeping up with all of them.

After more than a week, I stopped recording everything…even though most of the threads were still open and the comments, flames, trolls, and spam kept pouring in. But the resulting document will still give you some idea of what’s involved with opening comments on kottke.org. I would love better tools to deal with this because I enjoy having comments open on the site and so do my readers. But for now, I think it’s a better use of my time to focus on other aspects of the site and open comments when I feel a particular post would benefit from them.

[1] You can’t imagine the reasons I’ve heard about why comments are off on kottke.org. Most of them are variations on the theme of: “All the big bloggers have their comments turned off because they’re too stuck-up and self-important to care what their readers have to say, those arrogant bastards. They can’t stand people disagreeing with them.” And so on.


Quality editorial

Two weeks ago, I wrote:

In terms of editorial and quality, I am unconvinced that a voting system like Digg’s can produce a quality editorial product.

Lloyd Shepherd, Deputy Director of Digital Publishing at Guardian Unlimited, has been thinking along similar lines:

Everything we do to “edit” the [Guardian Unlimited] site seeks to keep a balance between editorial instinct and the desires of the audience, and that, in doing that, we may be reflecting the “community” more fairly, both mathematically and ethically, than the likes of digg.

So how do you reflect the community more fairly? Paging Mr. Surowiecki:

In order for a crowd to be smart, [Surowiecki] says it needs to satisfy four conditions: 1. Diversity, 2. Independence, 3. Decentralization, and 4. Aggregation.

Much of the online media we’re familiar with uses a mix of humans and automated systems to perform the aggregating task. Human editors choose the stories that will run in the newspaper (drawing from a number of sources of information as Lloyd illustrated), blog authors select what links and posts to put on their blog (by reading other blogs & media outlets, listening to reader feedback, and sifting through already aggregated sources like del.icio.us or Digg), and the editors of Slashdot filter through hundreds of reader submissions a day to create Slashdot’s front page. Google News uses technology to decide which stories are important, based primarily on what the publishers are publishing. Digg and del.icio.us rely almost entirely on the crowd to submit and determine by a simple vote what stories go on its front page.

Some of these methods work better than others for different tasks. The product of 50,000 diverse, independent, decentralized bloggers is probably more editorially interesting, fair, and complete than that of 50,000 diverse, independent, decentralized Digg users, but the Digg vote & tally approach is less time-intensive for all concerned and the information flows faster. A site like Slashdot sits in the middle…it’s a little slower than Digg but offers a more consistent editorial product. A hybrid Digg+Slashdot approach (which is not unlike the one used by individual bloggers) would be for Digg to produce a “Digg digest”, a human selected (could use simple voting or let the most highly respected community members choose) collection of the best stories of the day that incorporates what was said in the comments and around the web as well. Actually, I think if you wanted to start a blog that did this, it would do very well.


I just found the most niche weblog

I just found the most niche weblog ever: Hay in Art, which consists of pictures of art that feature hay in them.


Last 100 posts, part 6

[This is a semi-regular feature following up on stuff I’ve posted here recently.]

As expected, the Digg vs. Slashdot post got featured on Digg but not on Slashdot. In my analysis, I noted:

The Digg link happened late Saturday night in the US and the Slashdot link occurred midday on Sunday. Traffic to sites like Slashdot and Digg are typically lower during the weekend than during the weekday and also less late at night. So, Digg might be at somewhat of a disadvantage here and this is perhaps not an apples to apples comparison.

Several folks complained about this, some saying that it invalided the whole thing. The Digging of the DvS piece gives us another look at the Digg effect, from right in the middle of a weekday. Digg #2 was dugg 1441 times, got 98 comments, and sent around 10,200 people to kottke.org. By contrast, Digg #1 was dugg 1387 times, garnered 65 comments, and sent ~20,000 people to kottke.org. Digg #1 was actually more successful in driving traffic to kottke.org on a Saturday night than Digg #2 on a Thursday afternoon. Here’s a graph that compares the three events:

Digg #1 vs Digg #2

It’s hard to see the exact effect of Digg #2 on this graph (I forgot to grab a screenshot of the bandwidth graph when it happened, so all I have is the historical wide view), but it doesn’t stand out that much from what happened the previous day (each one of those “bumps” is a day) and didn’t have much of an effect beyond the initial spike. However, judging from the traffic that the individual Digg pages drove to kottke.org (Digg #1: 4525 people; Digg #2: 2668 people), it looks like the iPod feature was more interesting to the Digg audience than the Digg v. Slashdot post (which makes sense). So, still not exactly a fair comparison and raises more questions than provides answers.

The James Frey thread ended up with almost 950 comments before I shut it down because of redundancy and a lot of nastiness on the part of a few participants. The kottke.org record for most comments on a post is nearly 1800 on this post about The Matrix Reloaded (continued here)….that conversation, while nerdy, was a lot more civil.

After reading some of those comments and other things written about the controversy (but without having read the book), my take on Frey is that memories are subjective and readers need to cut authors some slack on that when writing memoirs. However, Frey stepped over the line in manufacturing situations that didn’t happen and deserves the backlashing he’s now receiving. My favorite observation on this whole deal was made by Stephen on a mailing list we’re both on. In a 2003 interview for The Observer, Frey said:

I don’t give a fuck what Jonathan Safran whatever-his-name or what David Foster Wallace does. I don’t give a fuck what any of those people do. I don’t hang out with them, I’m not friends with them, I’m not part of the literati…A book [Eggers’ AHBWOSG] that I thought was mediocre was being hailed as the best book written by the best writer of my generation. Fuck that. And fuck him and fuck anybody who says that. I don’t give a fuck what they think about me.

To Oprah on Larry King last week, Frey had this to say:

I admire you tremendously and thank you very much for your support. And, you know, it’s — I’m still incredibly honored to be associated with you, and I will for the rest of my life. Thank you.

The man knows who buttered his bread, that’s for sure. Oh, and The Onion’s take is good too. “Accounts of assault with a deadly weapon, narcotics possession, and incitement of riot actually happened during 2002 Grand Theft Auto session.”

Several folks picked up on the year in cities meme…check out the trackbacks on my post and on IceRocket for a bunch of other people’s lists.

Many didn’t realize that my letter to Apple Support was a joke. Sure, I had post-MacWorld gadget lust, but my new Powerbook is great, does everything I want, and I don’t really want the new one. Besides, everyone knows you don’t buy the first version of new Apple hardware…I’m waiting until they work all the kinks out. Here’s a not-so-positive review of the MacBook Pro announcement at Unsanity.

More chatter about the new corporate logos for Kodak, Intel, UPS, and AT&T.


Knickers, the lingerie weblog, has compiled a

Knickers, the lingerie weblog, has compiled a lingerie shopping guide for Valentine’s Day with breakdowns by size and style (from virgin to vixen). Possibly NSFW.