Metrics and Analytics, Research, Social Role, Technology, World of Tomorrow

A Trillion URLs

Google in 1998Image via Wikipedia

According to a recent blog post at Google, the number of distinct URLs they have found and indexed online just crossed the 1 trillion threshold. This is an increase from 1998, when Google started with an index of 26 million pages. Google also claims that they are seeing a few billion URLs added every day.

Something the blog post mentions that I hadn’t thought about is how the Web is now throwing off URLs automatically. For instance, blog sites create a new calendar entry every day. There is really no upward limit on the number of URLs that Google will have to index in order to have a comprehensive accounting of the Web.

But, as TechCrunch has pointed out, discovering all these URLs and actually storing information about them are two different things. Google probably only stores about 40 billion URLs, after eliminating spam, duplicates, and other noise.

Also, is Google actually the most complete index of the Web? Apparently not as of this week, as Cuil (pronounced “cool,” but it doesn’t work for me) unveiled its search engine, which it promotes as the world’s biggest. Using it, I think maybe they focused on scale before they focused on usability.

Zemanta Pixie

About Kent Anderson

I am the CEO/Publisher of the Journal of Bone & Joint Surgery, Inc. Prior to this, I was an executive at the New England Journal of Medicine. I also was Director of Medical Journals at the American Academy of Pediatrics.

Discussion

No comments yet.

Leave a Reply

Fill in your details below or click an icon to log in:

Gravatar
WordPress.com Logo

Please log in to WordPress.com to post a comment to your blog.

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

Side Dishes by Stewart Wills

Find Posts by Category

Find Posts by Date

July 2008
S M T W T F S
« Jun   Aug »
 12345
6789101112
13141516171819
20212223242526
2728293031  

The Scholarly Kitchen on Twitter

SSP_LOGO
The mission of the Society for Scholarly Publishing (SSP) is "[t]o advance scholarly publishing and communication, and the professional development of its members through education, collaboration, and networking." SSP established The Scholarly Kitchen blog in February 2008 to keep SSP members and interested parties aware of new developments in publishing.
......................................
The Scholarly Kitchen is a moderated and independent blog. Opinions on The Scholarly Kitchen are those of the authors. They are not necessarily those held by the Society for Scholarly Publishing nor by their respective employers.
Follow

Get every new post delivered to your Inbox.

Join 344 other followers