Patrick Ruffini: SiteMeter inadvertently inflating Kos’s traffic by 60%?
posted at 2:08 pm on October 3, 2007 by Allahpundit
Share on Facebook | regular view
Even if he’s right, dKos is still doing as much as us and MM put together. Is he right? I’ve heard other people complain, as Ruffini does, that a front-page link at dKos makes for a surprisingly small spike but that may be less suggestive of hits being overcounted than readers pouring into individual diaries instead of the front page. The crux of his argument is that SiteMeter only “remembers” 100 visitors at any one time, which means that if you click over to dKos and then wait for a few seconds while 100 new people stream in, the next click you make will count as a totally separate visit. So assuming that dKos receives 100 new visitors every 12 seconds, you could conceivably account for five visits yourself in the span of one minute if you space out your clicks appropriately. To wit:
Then it hit me: SiteMeter only accounts for the last 100 visitors individually. On a site like Daily Kos, the 100th most recent visitor could have been 15 seconds ago. If you are the 101st most recent visitor and you click on a new page, you are counted as a new unique visitor in SiteMeter’s all important count. On a normal site, this wouldn’t matter, since it’s highly unlikely you’ll stick around long enough to have 100 others show up after you. On a site with hundreds of thousands of page views a day, it’s extremely likely you will.
Ruffini’s talking about visits, specifically. According to SiteMeter, though, “visits” are defined as “a series of page views by one person with no more than 30 minutes in between page views.” In other words, if Ruffini’s right, you’re going to have overcounting on any site that attracts more than 100 visits in any 30-minute span.* Granted, most blogs get small enough traffic that that shouldn’t be a problem but SM’s been the industry standard for big blogs too since 2002. I find it hard to believe they wouldn’t have either corrected this glitch yet or at least noted it somewhere for the benefit of new customers. In fact, compare SM’s free basic service to the premium service: for $6.95 a month, SM will provide details for the last 4,000 visits, not the last 100. Which means, if Ruffini’s glitch theory is correct, that a premium account will result in a much lower (and more accurate) visit count than a free, basic account. That’s an odd feature to include in your pay service, especially when bloggers depend upon higher visit counts for ad rates.
He makes a good point, though, about how visits shrink to impossibly small durations according to SM’s metrics as sites get bigger. I’m not sure how to explain that other than the way he does. I’ll leave you with this — Dan Riehl’s side by side comparisons of dKos, Instapundit, National Review, and MM using the independent (and imperfect) metrics of Alexa. As you’ll see from his last graph, dKos’s “reach” — defined as the percentage of global Internet users who visit a site according to Alexa’s best guesstimates — is roughly three times that of MM and InstaGlenn. If Ruffini’s SM glitch theory is correct, MM’s “real” traffic numbers are on the order of 130,000 visits a day. That suggests, but of course doesn’t prove, that Kos’s traffic is quite a bit higher than the “real” number of 283,000 that Ruffini attributes to him. Here’s another graph of dKos and Instapundit head to head. Glenn’s “real” traffic would probably be 150,000 or so according to Ruffini’s theory. Kos is still way up:

Exit question: Am I right or did I miss something?
________
* Actually, this is incorrect. You’re going to have a risk of overcounting in these circumstances, but whether any overcounting actually occurs will depend on whether there are repeat visitors during that 30-minute period and how long they took between clicks.
You must be logged in to post a comment.

















Blowback
Note from Hot Air management: This section is for comments from Hot Air's community of registered readers. Please don't assume that Hot Air management agrees with or otherwise endorses any particular comment just because we let it stand. A reminder: Anyone who fails to comply with our terms of use may lose their posting privilege.
Trackbacks/Pings
Trackback URL
Comments
Comment pages:
Trust figures from the likes of them? Are you kidding me?
Two words.
Ron Paul
bbz123 on October 3, 2007 at 2:13 PM
I found it interesting that the charts on Dan Riehl’s site show all of the sites having nearly identical spikes and pretty much the exact same time.
doriangrey on October 3, 2007 at 2:16 PM
Question: Who cares?
Hoodlumman on October 3, 2007 at 2:17 PM
It’s fairly easy to fudge the numbers depending on site design. Only the hosts and dKos have an idea of a truly accurate count.
On a side note you might want to read Charles Johnson’s tear down of site meter and it’s nefarious cookie use.
bj1126 on October 3, 2007 at 2:18 PM
Advertisers who are deciding how much to pay for a spot on a site, and politicians who are deciding whether to listen to Kos or blow him off.
see-dubya on October 3, 2007 at 2:19 PM
As I said in the headlines, pretty minor complaint here.
these free stat services are for advertisers only, they aren’t even really meant to be used for hard data. Use awstats or something for that.
Per Alexa. This is a poor estimation as well. Alexa only measures traffic based upon who has the toolbar installed. And the only group who has it installed is webmasters.
lorien1973 on October 3, 2007 at 2:21 PM
What I’ve never figured out is why most visitors to my site have a site length visit of zero seconds. These people are speed readers.
see-dubya on October 3, 2007 at 2:21 PM
Probably folks like me that block sitemeter’s cookies. Sorry. Want me to be more selective?
desertdweller on October 3, 2007 at 2:23 PM
Sitemeter can only count duration if a visitor clicks a second time. 0 seconds is a entry click with no follow up click within the site.
a4g on October 3, 2007 at 2:25 PM
If traffic is the name of the game, then who’s to say that the DKos numbers aren’t being “helped” by scores of people who click on the site all day, not actually reading, but deliberately driving up the counter? Think about the people who are demonstrably manipulating digg and youtube…
RushBaby on October 3, 2007 at 2:25 PM
I always hit the “back” button as soon as I realize that I have accidentally gone to your site, see-dubya;)
Troy Rasmussen on October 3, 2007 at 2:25 PM
Kos also has urchin installed on the site (google analytics). Might be cool to compare the two and see what you come up with. Or test hotair’s traffic with analytics or awstats and see the difference as well.
lorien1973 on October 3, 2007 at 2:25 PM
That sounds rezonable. Another mystery cleared up; I’m going to go throw some ropes around in some boxes and see if they form knots.
see-dubya on October 3, 2007 at 2:28 PM
Does Townhall’s extremely annoying habit of splitting articles into two pages factor into this at all?
James on October 3, 2007 at 2:32 PM
see-dubya,
It takes me less than a second to digest all the content on a web sites’ front page…..that’s why I personally account for (approx.) 11.785% of all internet traffic.
ChrisM on October 3, 2007 at 2:32 PM
I think it’s all smoke and no fire. Sitemeter is notoriously unreliable. It doesnt, for example, count visitors from your RSS feeds. That’s a huge underreport right there.
Really, this is a topic where Ruffini is punching above his weight. He’s not a techie.
Meryl Yourish on October 3, 2007 at 2:36 PM
Kos’s mirror probably says “Objects In Mirror Are Larger Than They Appear” at the bottom, too.
ReubenJCogburn on October 3, 2007 at 2:37 PM
SiteMeter used to show DailyKos getting dozens of hits from… DailyKos. Yep. Internal hits. Diaries, too. Now it just shows dozens of “unknown” hits. My guess is that DK’s hits are still primarily internal and Kos just found a way to keep them from showing up on SiteMeter.
JinxMcHue on October 3, 2007 at 2:43 PM
Ahhh… but Kos has used these stats to get more attention from Politicians.
So what if they still have a large percentage of internet users, whats important is not the percentage, but the total numbers when talking to politicians.
If their numbers are inflated… which I would guess they are, then they are potraying themselves as being representative of more voters than they are…
Its called fraud… if they know it…
This really needs to be checked out.
Romeo13 on October 3, 2007 at 3:10 PM
Does Site Meter have any ability to distinguish between a bunch of clicks made by a bot from the same computer (with the appropriate timing) and separate users from separate computers or are they all just site clicks?
Speakup on October 3, 2007 at 3:20 PM
Looks like right now half of Hotair’s entry pages are coming internally. It’s over 90% for dailykos, so yeah, their visit figures are meaningless.
pedestrian on October 3, 2007 at 3:22 PM
I never liked Sitemeter. Too limiting.
Stat Counter and Google Analytics are much better, as they provide more detailed results in a user friendly way.
Darnell Clayton on October 3, 2007 at 3:25 PM
I agree that Alexa stats are even more squirrely and less transparent than those of SiteMeter. Alexa does not even disclose total traffic counts that it relies upon, probably because those counts would reveal how many visitors it is missing. Alexa’s visit tracking system may contain some of the same errors as SiteMeter’s.
Besides, the fact that Alexa’s statistics are derived from use of its toolbar, which is popular primarily with techies, guarantees that Alexa’s statistics will be badly distorted in favor of whichever websites happen to be favored by that community.
These may be the best tools available so far, but Patrick Ruffini has raised an important question that SiteMeter needs to address — either by showing that its tracking system does not double- or triple-count visitors to high-volume websites during the same visit based on each page view, or by upgrading its software so that it can more reliably track traffic on high-volume websites.
Gina Cobb on October 3, 2007 at 3:47 PM
None of the visit counters our remote traffic monitors do the job.
The only accurate way is from the server logs themselves.
There is a lot of available software to calculate and consolidate data from the logs of multiple servers necessary for a high traffic site.
CommentGuy on October 3, 2007 at 4:46 PM
Let us recall the lessons of Occam’s Razor: perhaps it is my site that is siphoning off Mr. Kos’ traffic.
bosteed on October 3, 2007 at 4:49 PM
Daily Kos has a habit of inflating numbers.
SoulGlo on October 3, 2007 at 4:55 PM
A quick note for those who look at these stats for potential advertising. No smart advertiser would go by sitemeter when determining where to advertise. I mean the only reason you use something like that, is to make sure that folks see the highest, unverifiable numbers and them going ga ga over the potential reach.
Believe it or not, I came across a site that was advertising more than 10 million visitors per day. Except their numbers are HITS, not unique visitors. Big difference.
Bottom line, install an analytics software and check uniques. If you’re looking advertise, go with a good ad agency, or a well known site who’s advertising is third-party managed. Those are generally most reliable, but more expensive.
Right Tracker on October 3, 2007 at 4:56 PM
This info might give me some relief. I honestly thought a very large number of Americans had truly gone
around the bend, but, if this holds up, than the advertisers
have gotten hit bad, politicians and lobbyists also. I check with Hotair several times a day all day to keep up to date so does that inflate HA’s numbers? To get back to my first thoughts, I feel much better thinking only l/3 of the citizens are crazy loons (this includes the Ron Paul groupies too).
sharinlite on October 3, 2007 at 5:02 PM
Does it matter how long your stay on a site? I tend to leave browser windows open overnight and I don’t know if this is getting AP closer to his iPhone.
MamaAJ on October 3, 2007 at 5:39 PM
see-dubya on October 3, 2007 at 2:28 PM
.
try to pull them out gently with out problem and BOOM, Knots!
shooter on October 3, 2007 at 5:39 PM
No, it doesn’t matter — unless the site uses auto-refresh, which we do not.
Allahpundit on October 3, 2007 at 5:39 PM
Good Q, anyone know?
shooter on October 3, 2007 at 5:40 PM
So what kind of activity constitutes or determines a “visit length”?
shooter on October 3, 2007 at 5:43 PM
You’re right. It’s a classic trick and even his own members don’t know what he’s doing.
Connie on October 3, 2007 at 6:55 PM
Yeah, Kos’ massive traffic really haleped them with
SenatorLeft-Wing Loser Lamont.BKennedy on October 3, 2007 at 7:20 PM
It seems someone with a paid Sitemeter account and a very active site can test this point by registering for an unpaid account, adding the second counter to the page for a few days, and comparing the counts. The staff at Sitemeter may be willing to help set up and monitor that test or a better-devised one, unless they already have solid information to settle reasonable questions. It will probably be good for them to take an interest in this matter; it seems an enterprising trial lawyer would.
Kralizec on October 3, 2007 at 9:05 PM
I made the comment that follows on Meryl Yourish’s post. Yourish moderates comments before allowing them to appear on the site, so I’m cross-posting the comment here rather than risk its loss.
Kralizec on October 3, 2007 at 10:19 PM
I think the bottom line is that Kos is getting more
free commercial air time.
canopfor on October 3, 2007 at 11:06 PM
Question: If Kos throws a temper tantrum and yells “Action Alert!”… but no one hears him… does it make a sound?
Answer: Who cares?
BadBrad on October 4, 2007 at 7:23 AM
This one is easy. They’re not tracking how long your browser stays open on the site, they’re tracking the time between your first page view and your last page view, within a “visit” (which for Sitemeter, is any series of page views with no more than 30 minutes between the two.). So if you click at noon, then on another page at 12:10pm, then go to the grocery store, your visit will have lasted 10 minutes.
“Zero second” page views happen when the time between your first page view and your last page view is zero. That is, your first page view is your last page view, because you only viewed one page. So as the average visit length approaches zero, that is a sign of your inability to retain visitors. But don’t take it as a literal — because even for the “zero seconds” page views, the person probably spent more time than that. What would be better is to say “24% of visitors viewed more than one page. The average time spent on the site for these visitors was 4 minutes” — that is, don’t count 0 second views.
As for the “SiteMeter only tracks 100 visitors” thing by Ruffini — he’s misreading what SiteMeter says. The limit is on tracking visitor details. He’s assuming that if they’re not showing him details for more than 100 visitors, they’re not tracking more than 100 visitors. All it takes to track visitors is the storing of two things: an IP address, and a timestamp.
Yourish is right. And Ruffini is out of his depth. The only reason this is getting any traction at all is because it calls into question Daily Kos traffic numbers. If Gore had won in FL in 2000 by a big margin, no one on the left would have been complaining about “disenfranchisement.” Cui bono?
Mark Jaquith on October 4, 2007 at 11:54 AM
Crow for dinner: Ruffini was right, even if he didn’t have the evidence in hand for his first post. Turns out the “visits” statistic as tracked by SiteMeter is not accurate for high traffic sites and their FAQ misrepresents how “visits” expire (by only mentioning the 30 minute timeout).
Mark Jaquith on October 8, 2007 at 5:56 PM
Comment pages: