June 28, 2007

The Wayback Machine

Despite its name, the Wayback Machine is not a time travel machine from a science fiction movie or a television cartoon. Instead, it is an archive of Internet pages.

Would you like to look at a Web page as it existed several years ago? Perhaps you want to look for information that was available on the Web at one time but has since disappeared. The Wayback Machine may be the tool you need. Now you can surf the Web as it was.

The Internet Archive, working with Alexa Internet, has created the Wayback Machine. This free service makes it possible to surf pages stored in the Internet Archive's web archive.

The Internet Archive Wayback Machine contains almost 2 petabytes of data and is currently growing at a rate of 20 terabytes per month. This eclipses the amount of text contained in the world's largest libraries, including the Library of Congress. (A petabyte is one million gigabytes or one billion megabytes.) The Wayback Machine is the largest such database in the world, containing multiple copies of the entire publicly available web, even bigger than Google's huge database. Google typically stores one copy of each web site whereas the Wayback Machine stores multiple copies. The Wayback Machine presently stores 85 billion web pages. That is one huge disk farm!
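To put those numbers in perspective, here is a small back-of-the-envelope calculation. It is only a sketch: it uses the decimal definitions given above (one petabyte equals one million gigabytes) and rounds "almost 2 petabytes" up to exactly 2, so the results are approximate. The variable names are my own.

```python
# Decimal storage units, matching the definition in the article.
GIGABYTE = 10**9
TERABYTE = 10**12
PETABYTE = 10**15

# Figures quoted in the article ("almost 2 petabytes" rounded to 2).
archive_bytes = 2 * PETABYTE
growth_bytes_per_month = 20 * TERABYTE

# Size of the archive expressed in gigabytes.
archive_gigabytes = archive_bytes // GIGABYTE

# Monthly growth as a fraction of the current archive size.
monthly_growth_fraction = growth_bytes_per_month / archive_bytes

# Months needed to add one more petabyte at the quoted rate.
months_per_petabyte = PETABYTE // growth_bytes_per_month

print(archive_gigabytes, "gigabytes in the archive")
print(f"{monthly_growth_fraction:.0%} growth per month")
print(months_per_petabyte, "months to add another petabyte")
```

In other words, at the quoted rate the archive grows by roughly one percent of its size each month.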

The Wayback Machine collects only publicly accessible Web pages. You will not find pages that require a password, pages tagged for "robot exclusion" by their owners, pages that appear only after a person fills in and submits a form, or pages on secure servers. You typically will not find pages created within the past six months, but older pages are available, usually back to 1996.
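"Robot exclusion" normally means a robots.txt file placed on the Web site by its owner. The sketch below uses Python's standard urllib.robotparser module to check whether a given crawler would be allowed to fetch a page. The crawler name ia_archiver, the example.com addresses, and the sample rules are assumptions made up for illustration; they are not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents: shut out one crawler entirely and
# keep every other crawler away from the /private/ directory.
EXAMPLE_ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""


def allows_crawler(robots_txt, user_agent, page_url):
    """Return True if the robots.txt text lets user_agent fetch page_url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, page_url)


if __name__ == "__main__":
    page = "http://www.example.com/index.html"
    for agent in ("ia_archiver", "SomeOtherBot"):
        verdict = "allowed" if allows_crawler(EXAMPLE_ROBOTS_TXT, agent, page) else "excluded"
        print(agent, "is", verdict, "for", page)
```

A site with rules like these would be skipped by the excluded crawler while remaining visible to others, which is one reason a page you remember may be missing from the archive.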

I used the Wayback Machine this week to look at some Web pages that I have been maintaining for years, some of which are not connected with genealogy. It was interesting to look at some of my older HTML work. I also looked at some of today's more popular genealogy Web sites. I must say that Ancestry.com has come a long way from their home page of October 28, 1996! See http://web.archive.org/web/19961028055925/http://www.ancestry.com/.
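The address above shows how archived pages are named: the prefix http://web.archive.org/web/, then a 14-digit capture time (year, month, day, hour, minute, second), then the original address. Here is a minimal sketch that takes such an address apart and puts one together. The function names are my own, and the code does not contact the archive, so it cannot tell you whether a snapshot for a given time actually exists.

```python
from datetime import datetime

WAYBACK_PREFIX = "http://web.archive.org/web/"
TIMESTAMP_FORMAT = "%Y%m%d%H%M%S"  # e.g. 19961028055925


def parse_snapshot_url(snapshot_url):
    """Split a snapshot address into (capture time, original address)."""
    if not snapshot_url.startswith(WAYBACK_PREFIX):
        raise ValueError("not a Wayback Machine snapshot address")
    remainder = snapshot_url[len(WAYBACK_PREFIX):]
    # Everything before the first slash is the timestamp; the rest is the
    # original address, which contains slashes of its own.
    timestamp, _, original_url = remainder.partition("/")
    return datetime.strptime(timestamp, TIMESTAMP_FORMAT), original_url


def build_snapshot_url(captured, original_url):
    """Build a snapshot address from a capture time and an original address."""
    return WAYBACK_PREFIX + captured.strftime(TIMESTAMP_FORMAT) + "/" + original_url


if __name__ == "__main__":
    url = "http://web.archive.org/web/19961028055925/http://www.ancestry.com/"
    captured, original = parse_snapshot_url(url)
    print("Captured:", captured.isoformat())
    print("Original:", original)
```

Running it on the Ancestry.com address confirms the date quoted above: October 28, 1996.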

The Wayback Machine stores all the text of standard HTML pages. Graphic images may or may not be stored. Fancier Web pages, using XML or JavaScript, probably will not be found in the Wayback Machine.

The Wayback Machine is an excellent tool for tracking down the information you remember as "I saw it once on a web site." You can search sites for information posted years ago and perhaps no longer available today. It can also be a source of amusement as you see "how far we have come." Check out your site or your society's site from years ago!

You can search the 2-petabyte Web archive of the Wayback Machine at: http://www.archive.org/

Comments

Hmmmm, I tried our web page and it didn't have anything. It's over 6 months--over a year, really. Guess we're still the young kids on the block.

Happy Dae.
http://www.ShoeStringGenealogy.com/ssg1.htm

I teach an Adult Education Class at the local High School and found the WAYBACK MACHINE most interesting. I printed it off to share with students. Thank you.

Thanks for the news about Archives! I found my old web page, and my picture! I thought that it was gone forever... pleasant memories. And, I've sent it to my grandkids. Great to know that it's stored somewhere - forever?

If you want your site to appear in the Wayback Machine, you can submit it to http://www.alexa.com/site/help/webmasters#crawl_site

Archive.org has more than just web pages, by the way. For example, I just searched the out-of-copyright texts that they've digitized (http://www.archive.org/details/texts) for the keyword genealogy and got 358 results.

-dallan

Many thanks, Dallan. I shall do just that. (Gosh, to be a part of history!)

Happy Dae.
http://www.ShoeStringGenealogy.com/ssg1.htm

