The Big Red Scary Numbers, or “Analytics Says Traffic is Down”

September 2nd, 2010

“Analytics says traffic is down! I thought SEO was supposed to INCREASE our traffic?”

UntouchableMarketing gets an email like the above (paraphrased) every once in a while. The first few times this happens it can be nerve-wracking, but I’m more jaded now and I know that 99% of the time there is nothing to worry about.

Google Analytics has attempted to boil down all the power the Urchin platform provides into digestible pieces for the lay web-user. The most prominent of these reductions is the dashboard view, showing the trailing 30-days performance of your website(s), in terms of visits, time on site, bounce rate, and completed goals.

The Google Analytics dashboard.  What a pretty green view!

The Google Analytics dashboard. What a pretty green view!

Rounding out this summary is the “% Change” figure, a bolded number which is either red or green depending on if traffic, as compared to the preceding trailing-30, is down or up respectively.

Whoa!  Those are some scary, red, bold numbers!

Whoa! Those are some scary, red, bold numbers!

Guess which number most users instinctively fixate on when they log in to analytics?

In most cases, when our clients witness the Big, Red, Scary Numbers, the explanation falls into one of three categories:

1. We recently added a filter to remove their office IP address, from which a large number of requests are made to the site daily. The filters don’t apply retroactively, so the previous month’s traffic looks greater in comparison.

2. The client had an article published on a popular site like Techcrunch or the New York Times which generated a large spike in traffic to their site for one or two days, which is counted toward the previous month’s total.

3. The client’s clients make use of pages on the client site to do things like: sell items from a store, register people for an event, or sign up for email/SMS updates. A client with a large user-base may have made use of the site for a big event, which has now passed.

How to Check if the Big Red Number is Due to an Aberration, or is Actually Affected by Lower Search Volume

Open analytics, click on ‘View Report’ from the dashboard. On the left side, click on ‘Traffic Sources’ then ‘Keywords’.

If you only want to check Organic Traffic, click on ‘Non-Paid’ just under the graph, to the left.

Change the date to a longer range and select “Monthly” for the graph aggregation.

Scroll to the bottom of the list of keywords and find the ‘Filter Keyword’ box. Change the first box to ‘Excluding’ and in the second box type in some words that you would consider “brand terms” e.g. your copany name, your website. Put them between parens and separate with pipes, like so:

Click ‘Go’ and the graph will change to show you all of the people who used non-branded keywords to search and find your site, in the selected months. The graph should be pointing up and to the right if your search marketing strategy is performing well.

Simulated pushpins added to help you follow the instructions above.

Simulated pushpins added to help you follow the instructions above.

Don’t be afraid of the Big Red Numbers, but make sure you find the right cause!

Tags for This Post: scary numbers, spike, web user, google, techcrunch, new york times, sms updates, digestible pieces, dashboard view, bold numbers

Benford’s Law and Click-Through Rates on SERPs

February 4th, 2010

The other day, as I perused the latest “nerd news” from news.ycombinator.com I came across a very interesting blog post that captured my attention. It was a summary of the properties of Benford’s Law.

By way of brief introduction, Frank Benford first became aware of this property while working at General Electric, when he noticed that pages of a book containing logarithmic tables showed much more wear on the page for the number 1, than they did for other numbers.

After considerable research, he found that any man-made data has a tendency for the leading digit of each number to be a 1 around 30% of the time, and a 9 only about 4% of the time. This is true with tax returns, baseball stats, and just about any other data you can name. This is very odd, because one would assume that the numbers 1-9 follow a normal distribution (and they do, when generated randomly).

This article stuck in my mind, and somewhere over the last couple days it connected with something else that had been bothering me.

Benford and Search

By now, you probably know about the research that was done on the AOL Study Data that gave us a baseline for click-through rates on search results. This study showed that the number 1 result on a page gets clicked on about 35% of the time. The graph looks something like this:

AOL data CTR

Although this data has been very useful, what has always bugged me about it is that the study was only performed on the first 10 results, and so the only way to extrapolate a formula would be to graph the results, add a trendline, and use the equation of the trendline to predict other values greater than 10.

But take a look at this graph of numbers following Benford’s law (which has a known equation):

Frank Benford's Discovery_1265314649134

Eerily similar isn’t it? Putting the numbers side by side really makes it obvious that there is an interesting correlation between Benford’s law and SERPs behavior.

Result # % of total traffic Benford’s
1 35.64% 30.10%
2 17.82% 17.61%
3 11.88% 12.49%
4 8.91% 9.69%
5 7.13% 7.92%
6 5.94% 6.69%
7 5.09% 5.80%
8 4.45% 5.12%
9 3.96% 4.58%
10 3.56% 4.14%

Hammering Out the Details

First I took the equation for Benford’s Law and fed it the inputs of all numbers from 1 to 100. Unfortunately, beginning at #11, the numbers depart from what my gut feeling is on the actual click-through for pages beyond 1.

For example, the equation tells us we could expect a ranking at #11 to receive roughly a 3% CTR. Obviously, this seems high for the first result on the second page. But what if we assume that a similar number of people will click on the 10th result as will click on the ‘Next Page’ button?

Point of Clarification: The original algorithm provides for a 4.14% CTR on the 10th result. Assuming that another 4.14% of people will scroll down the page and click on ‘Next Page’ we use that percentage as the total amount for the next page.

Download an XLS file with the comparison and formulas.

By applying the same equation to the percentage of people who are predicted to click on #10, we see that roughly 1.14% of searchers will be predicted to click on #11. That seems reasonable, doesn’t it? I would say about 1 out of 100 times I will hit the second page of results and click on one of the listings.

Using this formula, it is predicted that searchers will basically not go past result #110, which is a pretty good prediction based on everything I’ve ever seen about search.

It’s All Voodoo

Of course, much of this is just arbitrary tinkering with numbers. But there is a great body of evidence that says that Benford’s law has applications to any human system of data.

Because search engines are merely aggregating data and applying a formula for ranking, it makes sense that it may be subject to some of the same underlying laws as the stock market, baseball stats, and other masses of human data.

Tags for This Post: baseball stats, trendline, logarithmic tables, aol study, normal distribution, couple days, nerd news, frank benford