IIPC Training Session: Beginners 3A (Slides)

olga

  • File Size 661 KB
  • File Count 1
  • Create Date 02/02/2020
  • Last Updated 13/07/2021

Session 3A: Main Concepts and Technologies: Capture

The purpose of this module is to explain the concept of capture in web archiving and to provide an overview of some of the available tools to support getting started with a web archiving programme.

More information can be found on the training homepage.

Learning Objectives / Motivations

Attendees will be able to:

  • Describe the basic processes of web archiving
  • Describe the types of tools and how they relate to the basic processes
  • Choose web archiving tools based on their function and capabilities
  • Know where to find and download tools for web archiving

Target Audience

This module is aimed at non-technical practitioners who wish to become familiar with the process of capture in web archiving. The course is less suited for practitioners who wish to implement a long-term sustainable web archiving programme since advanced and robust technologies and tools will only be mentioned briefly.

The beginner course in general aims to provide a complete beginner’s level training course in web archiving. There are no prerequisites for understanding beyond a general familiarity with archives, libraries or information management, meaning that the course is suitable for absolute beginners. The course is divided into eight different sessions that each focus on one specific topic. By the end of the course, participants will have learnt why archiving the web is important, what web archives are, what the main concepts and technologies are, who the users are and what their needs are, how to identify risks and benefits, what the main approaches to web archiving are, how to write a web archiving policy and how to make the case for web archiving.

How to Use and Customize These Materials

Several of the training sessions contain discussion questions, which can be removed for brevity if needed. Likewise, there are many case study slides which could be removed or, more likely, replaced with examples relevant to the context in which the training is being delivered. The risk management exercise in this session contains a scenario based on a community archive; this might also be replaced if something else is more relevant to your context.

Speaker Notes

Downloadable version

Tools, References, and Related Resources

https://github.com/iipc/awesome-web-archiving
https://www.httrack.com/
https://www.gnu.org/software/wget/
https://github.com/internetarchive/heritrix3/wiki
https://github.com/internetarchive/umbra
https://github.com/internetarchive/brozzler
https://archive-it.org/
https://support.archive-it.org/hc/en-us/articles/216489103-Archive-It-Video-Curriculum-
https://webcuratortool.readthedocs.io/en/latest/index.html
https://sbforge.org/display/NAS/NetarchiveSuite
https://blog.webrecorder.io/2019/08/14/autopilot
https://guide.webrecorder.io/
https://webrecorder.io/
http://rhizome.org/
https://gwu-libraries.github.io/sfm-ui/
https://perma.cc/

Credits

Created by the IIPC Training Working Group and the Digital Preservation Coalition

