Skip to main content
  1. Home
  2. Computing
  3. Web
  4. News

Internet Archive will ignore robots.txt files to keep historical record accurate

Add as a preferred source on Google

The Internet Archive has announced that going forward, it will no longer conform to directives given by robots.txt files. These files are predominantly used to advise search engines on which portions of the page should be crawled and indexed to help facilitate search queries.

In the past, the Internet Archive has complied with instructions laid out by robots.txt files, according to a report from Boing Boing. However, it has been decided that the way that these files are calibrated is often at odds with the service that the site sets out to provide.

Recommended Videos

“Over time we have observed that the robots.txt files that are geared toward search engine crawlers do not necessarily serve our archival purposes,” stated a blog post that the organization published last week. “Internet Archive’s goal is to create complete ‘snapshots’ of web pages, including the duplicate content and the large versions of files.”

Robots.txt files are increasingly being used to remove entire domains from search engines following their transition from a live, accessible site to a parked domain. If a site goes out of business, and is rendered inaccessible in this way, it also becomes unavailable for viewing via the Internet Archive’s Wayback Machine. The organization apparently receives queries about these sites on a daily basis.

The Internet Archive hopes that disregarding robots.txt files will help contribute to an accurate representation of prior points in the web’s history, removing their capacity to muddy the waters with instructions intended for search engines.

The organization has already ceased referring to robots.txt files on sites and pages related to the U.S. government and the U.S. military, to account for the enormous changes that can be made to domains between one administration and the next. This decision has caused no major problems, so there are high hopes that discontinuing the use of the files more broadly will be helpful.

Brad Jones
Brad is an English-born writer currently splitting his time between Edinburgh and Pennsylvania. You can find him on Twitter…
Layr is a new macOS clipboard manager that replaces hotkeys with trackpad gestures
This new Mac app opens clipboard history with a four-finger tap instead of a keyboard shortcut
Cursor open on Mac

macOS users already have several clipboard manager options, including Paste and Maccy. Most of them work well, but they are usually built around keyboard shortcuts. That is useful for keyboard-heavy users, but it can feel out of place for users who rely on the trackpad for most of their work.

Layr, a new clipboard manager from the developer behind Declutr, takes a different approach. Rather than assigning a keyboard shortcut to open the clipboard history, the app lets users bring up a clipboard overlay with a four-finger tap on the trackpad.

Read more
YouTube’s AI content labels are getting a much-needed makeover
No more hunting through descriptions. YouTube's AI labels are finally moving front and center.
YouTube ai declaration longform video

This year’s Google I/O marked the transition of Google from a search company to a fully AI-focused company. The company launched several AI tools, but the one that matters the most for YouTubers is Google Omni, built for video generation and editing. 

While tools like Omni lower the barrier for creators, which is a good thing, it also results in the platform being inundated with low-effort AI content. The company understands that this will annoy a large percentage of its users, so it has been asking creators to disclose AI-generated content since 2024. 

Read more
AI models have a religion favoritism problem, and new research exposes it
AI models are subtly steering users toward certain religions, and most people have no idea it's happening.
Artificial Intelligence

A new research consortium has found something worth paying attention to: when you ask AI about grief, love, loss, or moral decisions, it almost never brings religion into the conversation.

The Consortium for Evaluation of Faith and Ethics in AI (CEFE-AI), a collaboration among researchers at Brigham Young University, Baylor University, the University of Notre Dame, and Yeshiva University, published its findings this week at the Summit on AI Ethics in Athens, Greece.

Read more