The Most Popular Fonts On The Web: A Study
If you’ve ever worked on a web site, you already know that choosing the right fonts is one of the most important aesthetic decisions in the design of a site. But, like all aesthetic decisions, it is a highly subjective.
I decided to try to bring a little bit of objectivity into the equation by finding out, empirically, what fonts the most popular sites on the web are using today.
I wrote a Python program that crawls the front page of the 100,000 most popular web sites according to Alexa’s top sites list. It parses the HTML using BeautifulSoup, and parses all in-line and linked CSS stylesheets using cssutils. It then looks for
font-family rules in the CSS rules, and stores the normalized form of each font in those rules in order. The result is kept in a SQLite database for analysis.
This all sounds pretty straightforward, but in fact it took me two weeks to build a crawler that doesn’t choke on all the crazy crap people throw at browsers. I will write up some of the more interesting cases in a separate post.
The final crawl success rate was about 96%. The size of all HTML and stylesheets downloaded was about 30GB.
The Most Popular First-Choice Fonts
First, let’s take a look at the first-choice/most preferred fonts, i.e., the ones that appear first in
font-family rules. These fonts most closely represent the intention of the web site designers without compatibility compromises.
For each font, we calculate the number of distinct web sites that have at least one CSS
font-family rule that lists the font as the first choice. This means that if a site has two rules, one listing Arial and another listing Times, it will count towards both. Thus, the numbers add up to much higher than the total number of sites.
Without further ado, here’s the breakdown of the top fonts on the web:
(The chart is interactive - hover/click to see actual numbers.)
A couple of observations:
Sans-serif fonts dominate the web. The top 25 fonts list only includes 4 serif fonts, compared to 16 sans-serif fonts. The most popular serif-font, Georgia, ranks #4 on the list with about 20% of sites.
Monospace fonts get no love. Most sites don’t bother specifying a custom monospace font; the most common monospace font specification just uses the browser default (12%). The most popular monospace font, Monaco, is featured on 7.2% of sites. Both of these are quite high, in fact, considering that we only crawl the front page of these sites.
The most popular fonts of each family:
Sans-serif: Arial (#1, 62%)
Serif: Georgia (#4, 20%)
Monospace: Monaco (#11, 7.2%)
Old Microsoft fonts still reign over the web. The top non-Microsoft font, Helvetica Neue, ranks #6 with an impressive 18% of top web sites.
Font Awesome is awesome. About 4.6% of the top 100K sites already use it for universal icons.
The Chinese web is on the rise. 3 out of the top 25 fonts are Chinese fonts, compared to 21 Latin fonts (and 1 symbol font). No other scripts made their way into the list.
What If We Considered…
Fonts in all positions: (graph) not much difference.
The top 1,000 web sites only: (graph) even more Arial (#1, 74%).
The Most Popular Heading Fonts
Next, let’s look at first-choice fonts used in headings or titles.
I used a really crude metric to determine whether a CSS rule matches a heading: whether the CSS selector is for an
H6 tag or contains the strings "heading" or "title". While this finds only a subset of actual headings, it is not a bad approximation as it matches rules on about 58% of the top 100,000 web sites.
Let’s start with the graph:
Header fonts are more diverse. While Microsoft fonts still reign supreme, a number or more exotic fonts are also on the list. In terms of distribution, there’s a much longer tail. It may be a sign that designers pay a lot more attention to fonts used in headings.
Serif fonts are more popular in headings than elsewhere. While Arial still claims to top spot with 27.31% of sites, the top serif font, Georgia, rises to 2nd place with 9% of sites.
No monospace fonts in headings. Not surprising.
Among Chinese fonts, 宋体 is more commonly used for body text, while 雅黑 (or Yahei) is more commonly used for heading text.
Arial and friends are the most hated fonts ever. Quoting The Scourge of Arial:
Despite its pervasiveness, a professional designer would rarely - at least for the moment - specify Arial.
And yet, Arial is still the default font on the vast majority of sites on the web, followed closely by its friends. Why?
Quoting Stop Using Arial & Helvetica:
Some people actually have a reason to use them but most use it mindlessly - just because everyone else does. Often, no thought is given to design of the site, let alone typography.
This is pretty sad. Paraphrasing Stop Using Lame Fonts, a good font stack has the potential to really make a site design shine, and it’s a shame web designers aren’t exploiting this opportunity.
There are a couple of caveats with this dataset.
First, I am only surveying the landing page of each site. For many sites, notably those whose main interface is hidden behind a login (Facebook, Evernote, etc.), we may not be finding the styles that matter most to users. However, I figured since it would be pretty poor design to build a landing page that is aesthetically inconsistent with the rest of the site, it is not very likely that the font selection on the landing page would be too different from the fonts used elsewhere on the site. Of course, without creating fake accounts on these sites, we have no way to verify.
The numbers are not weighted by prominence on the web pages. One could argue that the font of the main body text on a page should carry more weight than that of the tiny disclaimer text at the bottom which no one reads. It would be tricky to determine what the right weight function is though.
The numbers are not weighted by the prominence of the web sites. For example, we could make it so that Google’s use of Arial would get more weight than some random obscure site’s use of Arial, as Google’s choice impacts many more users and was probably the result of deliberation by a team of expert designers. Again, it’s not entirely obvious how the weights should be assigned.
I am only looking at CSS rules in
<style> tags or linked stylesheets. That means I am ignoring
style="…" attributes in tags,
I can think of quite a few other useful questions one might find the answer to from this dataset:
What are the most common font pairings?
What are the popular choices for heading fonts, given that I’ve chosen font X as my body text font?
What are the most common fonts on news sites/forums/productivity web apps/social media sites?
Is there any correlation between font choice and site popularity?
What do you think?
Please feel free to download the top 100 first-choice fonts as a CSV file.