LIBRARY OF CONGRESS

LIBRARY OF CONGRESSOrganization Type: National Library
Country: United States
IIPC Contact Email: webcapture@loc.gov


Library of Congress Web Archive

Start Date: 2000
Archive interface language(s): English
Access methods: URL Search, Alphabetic Browsing, Subject Browsing, Topical Collections
Harvesting methods: Selective, Event, Thematic

The Library of Congress Web Archives (LCWA) is composed of collections of archived web sites selected by subject specialists to represent web-based information on a designated topic. It is part of a continuing effort by the Library to evaluate, select, collect, catalog, provide access to, and preserve digital materials for future generations of researchers. In 2013, the Library launched a new interface that allows searching of a small set of collections alongside other library materials.

Web Archive Preservation Activities

  • The Library uses the Bagit Specification as a container for sub 1Terabyte bags and uses bit preservation with 2 long-term tape storage copies and 1 disk access copy.
  • At the present time, the Library has not migrated ARC files to WARC files and will undertake and evaluation of those ARCs for long-term preservation.