Organization Type: EU Interinstitutional body
Country: Luxembourg
IIPC contact e-mail: OP-WEB-PRESERVATION@publications.europa.eu
Start Date: 1996 (first preserved capture); 2013 (web archiving program)
Archive interface language(s): English
Access methods: URL search, Full-text search
Harvesting methods: International domain, Event, Thematic
As part of its preservation activities, the Publications Office of the European Union crawls, curates and preserves the content and design of the websites of EU institutions, bodies and agencies, making them available for current and future generations. These websites are mostly hosted on the europa.eu domain and subdomains, and represent the core “European Union” collection of the EU web archive.
Other collections of note include:
- “Horizon 2020”, containing the websites from the Horizon 2020 EU research and innovation framework programme that ran from 2014 to 2020.
- ”Presidencies of the Council of the EU”, preserving the websites from the presidencies of the Council, that rotate among the EU member states every 6 months.
- “Publications”, containing the archives of European Union publications that were produced and disseminated in HTML format.
- “Brexit archive”, preserving a selective list of Europa web pages that existed in the lead-up to Brexit.
This activity began in 2013, with regular quarterly crawls of a continuously-evolving seed list covering the scope of our service. In addition, we perform ad hoc captures, in close collaboration with website owners, prior to major revisions or decommissioning of websites.
To complement our growing dataset, we integrated the content within scope that was captured prior to our active involvement during this last decade. As a result, our collection now spans from 1996 (in the days of the europa.eu.int domain) to today.
Access: The contents of the archive can be accessed from a dedicated page on the Publications Office portal: https://op.europa.eu/en/web/euwebarchive