The WayBack Machine- Internet Archives and Research

7 Comments December 11, 2008

Archives on the World-Wide Web are now starting to hold some weight in their own right. With such a short lifespan of barely 10 years, the Web had not been considered a tool for serious researchers when it came to archival content.

However, as the metabolism of the Internet is at least 3 times faster than that of the average person, we see more happening faster, and more content available. Without restriction, this WWW machine is receiving content by the giga-truckloads 24/7. Good, bad, and useful for Investigators.

The trouble is that as quickly as this content rolls in it overwrites the current material. That can make things difficult for the historical researcher who is looking for yesterday’s news, a copy of the website from last year, or material that has since been erased from the screen. Investigators can utilize historical information for locating

- A photo that was once present, but since removed.
- Content, views, writings and opinions which may have been rescinded by a particular author or group.
- Products being sold illegally, pulled when they realized they were being watched.
- Old job ads, company statements, company affiliations, or product lines

Solutions are found in at least two services, free and easy to use…

Google.com as always comes through with some sort of solution. From the results of your standard search in Google.com you will see cache results. The cache is a snapshot of the site, when Google.com indexed it for their servers. Unfortunately, no dates are given so you are never sure of when that site was scanned and indexed. Using Google.com and its cache option are perfect when you are not sure of where your answer will appear. You are not familiar with the web address, or starting point. If you have your address though, you can narrow your google.com cache search by limiting to the site: function. Hence site: anyaddress.com + term. You have to add a term, use something common such as contact if you aren’t sure of anything specific, however names work very well.

If you know the website you are examining, waste no time in visiting the WayBack Machine located at http://www.archive.org

You will see on Archive.org a simple to use plug and play search box where you enter the address and click go. After the http:// enter the address you wish to view older pages on. We tried enron.com to see what was available. Visiting the Enron.com website today you will see content involving the Chapter 11 filings for the company, and post investigations and fallout documents. It’s current, it’s relevant, yet the information today does not tell me what happened 3 years ago, as told through their website. I would prefer to see Internet press releases, photos of executives shaking hands with vendors, and the “about us” section offering the biographies of their top employees prior to the collapse of their empire.

After typing in www.enron.com we see the results are as early as December 10, 1997. Clicking on the link brings you to the actual site from that date in time. The links will work! So if you were trying to find press releases, department heads, affiliations, and any sort of detail you would locate on a current site, but with the older data.

With a company as large as Enron, you can most likely find information in other sources as well. But when investigating a small firm, a one man operation with a website and not a good deal of national press, the WayBack Machine really offers some incredible leads.
.
For example, we checked out La Strada Ristorante, a Chicago based restaurant at random. In particular, we looked at their Wedding Package offers. The web site address is http://www.lastradaristorante.com/weddingpackages.html. Today you can order the Prime Rib dinner for $89.25. After running this address through the WayBack Machine we see the price in 2000 was $67.50 (including horseradish sauce). We also noticed in 2000 it was Surf and Turf $78.95. Now it’s marketed as Filet Mignon and Lobster Tail at $108.00.

We expect to see price and products increase with business, yet we note the jazzier style of the presentation, the careful wording of the menu items, and a measurable increase in prices, which will help us to predict future pricing in the next 2 years. In other words, the WayBack Machine is helpful as a looking glass to the future, or competitive intelligence tool.

Some technical notes into the advanced features…

The asterisks we see next to some of the dates will tell us when WayBack noticed changes from one version of the site to another. This way you do not have to view each and every page looking for change. There is a comparison tool. After you run the initial search you can click on “Compare Archive Pages” at the top of the screen. Then you can choose two dates to compare against each other. What you see are strike outs right on the page, of the older text.
.
If you click on Advanced Features, you can narrow your search by year, convert your results into PDF files (which is helpful for reports and printing results), also a large assortment of check boxes for restraining or enhancing your search options.

WayBack Machine also offers a toolbar utility which is handy if you are visiting a website, you click on your toolbar and the Archive results screen pops up. So you can go from the actual site you are researching to its historical views in one click. Toolbar option is found at http://www.archive.org/web/web.php.

WayBack Machine is not only archiving the big companies and fortune 500 types. According to the site there are “30 billion web pages archived from 1996 to a few months ago.” What about searching for specific data from archived pages? Read our article on the Recall Machine! Combining the WayBack Machine with Google.com cache should give Investigators a descent start on archival research into Web sites.

Cynthia Hetherington
Author of Business Background Investigations (2007) and The Manual to Online Public Records (2008)
.
http://www.hetheringtongroup.com

Tags: , , ,

Leave us a Comment!

Comments (7)

Trackback URL | Comments RSS Feed

  1. David Curran says:

    Terrific article! Some sites such as eBay are difficult to do past research on simply because of the volume of information, but stolen items on eBay are not impossible to backtrack, especially for the previous 30 days.

    My tracking has been exclusively intellectual property which actually offers some advantages to the tracker, but the technique will work for both.

    In my experience the items I was tracking were unique enough that I found the (in this case) DVD bootleggers with little trouble. I did need to my eBay searches to the UK in my case because UK items don’t necessarily show up on general item searches. (ebay.co.uk) Anyone searching on eBay for an item might do well to set the search parameters to a specific search if a general search does not yield results. For example, you can set an eBay search to 200 miles of a specific city, with pick up as the only option. If eBay knows you don’t live within 200 miles of that city you may not find the item in your general search results unless you adjust your search results.

    (‘eBay Hacks’ by David A Karp is a good source for understanding and even programming eBay searches.)

    It is possible on eBay to search back through a lister’s items for the past 30 days. The link can be found on the ‘SITE’ menu, the link to which is on the bottom (and top) of most eBay pages. Go to the listings for ‘More Ways to find items’ and click on ‘completed listings.’

    Sometimes people listing items can obscure what the item was by making the item ‘private.’ There are ways to get around this. One pirate I was after made the items private but then also tried to modify feedback by only publishing good feedback. The item numbers were listed, and by inserting an item number in an eBay search box that item will appear unless it is older than 30 days.

    In a situation where I did not have the item numbers the feedback itself was a hint as to what the item might be. If you are an intellectual property rights owner or the agent of such you can file what is called a VERO complaint. (Just type VERO in eBay’s HELP box and all the forms you need will be made available to you.)

    As an authorized VERO agent once you are sure that the item is, in fact, a bootlegged intellectual item, such as a copyrighted DVD ( VERO covers more) you can file a report and ebay will cancel the item (even if the listing is already completed and the buyer has paid) and let the buyer or would-be buyer know the item has been canceled.

    So it was possible, to prevent bootleggers from profiting on bootlegged items, to contact buyers on an item with feedback that sounded like the item might be the one in question (only necessary if the item is marked private and you can’t find the item number.) and ask them (You are limited to only a few contacts with eBay members who have not purchased from you recently in any given day.) if it is, indeed, the item in question. I do point out that if it is, I will enable them to file a complaint to eBay (and especially PayPal if they paid that way,) that the item is bootlegged, and demand their money back.

    Then whether I knew the item number of found it out from a buyer I file a VERO. (You should file the VERO complaint only after you’ve given them this chance to demand their money back from PayPal, etc. or the fact that the item will simply vanish from eBay records might make it hard for them to file a claim for a return of funds.) I do tell them that they may keep the bootlegged item with the copyright owners blessing.

    This had greatly discouraged our British and Scottish bootleggers. But the searching system might also help in finding resellers of stolen items on eBay.

  2. David Curran says:

    Terrific article! Some sites such as eBay are difficult to do past research on simply because of the volume of information, but stolen items on eBay are not impossible to backtrack, especially for the previous 30 days.

    My tracking has been exclusively intellectual property which actually offers some advantages to the tracker, but the technique will work for both.

    In my experience the items I was tracking were unique enough that I found the (in this case) DVD bootleggers with little trouble. I did need to my eBay searches to the UK in my case because UK items don’t necessarily show up on general item searches. (ebay.co.uk) Anyone searching on eBay for an item might do well to set the search parameters to a specific search if a general search does not yield results. For example, you can set an eBay search to 200 miles of a specific city, with pick up as the only option. If eBay knows you don’t live within 200 miles of that city you may not find the item in your general search results unless you adjust your search results.

    (‘eBay Hacks’ by David A Karp is a good source for understanding and even programming eBay searches.)

    It is possible on eBay to search back through a lister’s items for the past 30 days. The link can be found on the ‘SITE’ menu, the link to which is on the bottom (and top) of most eBay pages. Go to the listings for ‘More Ways to find items’ and click on ‘completed listings.’

    Sometimes people listing items can obscure what the item was by making the item ‘private.’ There are ways to get around this. One pirate I was after made the items private but then also tried to modify feedback by only publishing good feedback. The item numbers were listed, and by inserting an item number in an eBay search box that item will appear unless it is older than 30 days.

    In a situation where I did not have the item numbers the feedback itself was a hint as to what the item might be. If you are an intellectual property rights owner or the agent of such you can file what is called a VERO complaint. (Just type VERO in eBay’s HELP box and all the forms you need will be made available to you.)

    As an authorized VERO agent once you are sure that the item is, in fact, a bootlegged intellectual item, such as a copyrighted DVD ( VERO covers more) you can file a report and ebay will cancel the item (even if the listing is already completed and the buyer has paid) and let the buyer or would-be buyer know the item has been canceled.

    So it was possible, to prevent bootleggers from profiting on bootlegged items, to contact buyers on an item with feedback that sounded like the item might be the one in question (only necessary if the item is marked private and you can’t find the item number.) and ask them (You are limited to only a few contacts with eBay members who have not purchased from you recently in any given day.) if it is, indeed, the item in question. I do point out that if it is, I will enable them to file a complaint to eBay (and especially PayPal if they paid that way,) that the item is bootlegged, and demand their money back.

    Then whether I knew the item number of found it out from a buyer I file a VERO. (You should file the VERO complaint only after you’ve given them this chance to demand their money back from PayPal, etc. or the fact that the item will simply vanish from eBay records might make it hard for them to file a claim for a return of funds.) I do tell them that they may keep the bootlegged item with the copyright owners blessing.

    This had greatly discouraged our British and Scottish bootleggers. But the searching system might also help in finding resellers of stolen items on eBay.

  3. LnddMiles says:

    The best information i have found exactly here. Keep going Thank you

  4. LnddMiles says:

    The best information i have found exactly here. Keep going Thank you

Leave a Reply