Talk:Internet Archive


I have created a userbox to help spread the word that the Internet Archive is a useful website archiving service. {{User Internet Archive}}

External links modified

Hello fellow Wikipedians,

I have just modified 4 external links on Internet Archive. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:

When you have finished reviewing my changes, you may follow the instructions on the template below to fix any issues with the URLs.

As of February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{sourcecheck}} (last update: 15 July 2018).

  • If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
  • If you found an error with any archives or the URLs themselves, you can fix them with this tool.

Cheers.—InternetArchiveBot (Report bug) 03:27, 12 April 2017 (UTC)

Please Update this article with these new figures

Hi, I'm the Director of Partnerships at IA. I noticed there are a lot of old facts and figures in this article. Here's a source with up-to-date information:

For instance, as of 2017 we have 30 petabytes of data.

Some good secondary sources (that were requested) include: Medium: "Never Trust a Corporation to do a Library's Job":

The New Yorker--Jill Lepore's "The Cobweb: Can the Web be Archived?"

Thanks for helping to make this more accurate.

best, Wendy Hanamura — Preceding unsigned comment added by Whanamura (talkcontribs) 21:39, 15 April 2017 (UTC)

Robots.txt to be ignored

It's unclear at this time exactly when this will apply to sites other than government ones, but they have announced on their blog that they are "looking to do this more broadly".

It may be worth mentioning this where the article currently gives a false sense of privacy in saying that robots.txt is obeyed. (talk) 19:09, 24 April 2017 (UTC)

> Just read this, I'm not sure. Neilc314 (talk) 05:34, 23 May 2018 (UTC)


Is GifCities notable enough to warrant its own section? --Nutshinou (talk) 12:13, 3 September 2018 (UTC)

As much as I love it, probably not. Isn't it enough to describe it as part of the GeoCities archival effort (which is perhaps most relevant for ArchiveTeam in a way)? --Nemo 18:38, 3 September 2018 (UTC)