![]() |
![]() |
contact |
|
DownloadsIn the perspective of setting up a Web archiving chain, the following tools are recommended and used by members of the IIPC: AcquisitionHeritrix, an open-source, extensible, Web-scale, archiving quality Web crawler DeepArc, a portable graphical editor
which allow users to map a relational data model to an XML Schema and export
the database content into an XML document Curator ToolsWeb Curator Tool (WCT), a tool for managing the selective Webharvesting
process is designed for use in libraries and other collecting organisations,
and supports collection by non-technical users while still allowing complete
control of the Webharvesting process. The WCT is now available under the terms of the Apache Public License. NetarchiveSuite, a curator tool allowing librarians to define and control harvests of web material. The system scales from small selective harvests to harvests of entire national domains. The system is fully distributable on any number of machines and includes a secure storage module handling multiple copies of the harvested material as well as a quality assurance tool automating the quality assurance process. Collection storage and maintenanceBAT (BnFArcTools), an API for processing ARC, DAT or CDX files Access and finding aidsWayback, a tool that allows users to see archived versions of web pages across time. NutchWAX (Nutch with Web Archive eXtensions), a tool for indexing and searching Web archives using the Nutch search engine and extensions for searching Web archives WERA (WEb aRchive Access), a Web archive search and navigation application. WERA was built from the NWA Toolset, gives an Internet Archive Wayback Machine-like access to Web archives and allows full-text search. Xinq (XML INQuire), a search and browse tool for accessing an XML database |
| top | © 2004-2011 IIPC | copyright and privacy statements | credits |