The National Library of the Czech Republic has been building the archive of the Czech web since 2000. It deploys a combination of automated large-scale crawls of the TLD (.cz) and harvesting of selected "Czech" websites regardless of the domain. Recently, the library has been experimenting with automated crawls of "Czech" websites outside of the TLD. Automated crawls of .cz are conducted once or twice a year and selective harvests of about 1500 websites every two months. In addition, the library builds thematic or event driven collections. Access to the public part of the archive (from the selective harvests) is provided to anyone online via internet while the rest of the archive is available only to the library patrons onsite from the library building.
A global network of experts archiving the Web for future generations.