Programme

Watch WAC2025 recordings

Wednesday, 9 April 2025

*all times in CEST
09:40am OPENING REMARKS: Olga Holownia, IIPC & Jon Carlstedt Tønnessen, National Library of Norway
09:50am OPENING KEYNOTE: Libraries, Copyright, and Language Models
Javier de la Rosa, National Library of Norway
Chaired by Andrew Jackson, Digital Preservation Coalition
10:45am BREAK
10:55am

LIGHTNING SESSION #1
Chair: Ben Els, National Library of Luxembourg

Strategies and Challenges in the Preservation of Mexico’s Web Heritage: First Steps

Carolina Silva Bretón

National Library of Mexico, Mexico


Arquivo.pt Toolkit for Web Archiving

Daniel Gomes

Arquivo.pt, Portugal


Tracking the Political Representations of Life: Methodological Challenges of Exploring the BnF Web Archives

Guillaume Levrier1,2, Dorothée Benhamou-Suesser2

1: Centre de recherches politiques de Sciences Po (CEVIPOF, CNRS), France; 2: Bibliothèque nationale de France, France


Collaborative Curatorial Approaches of the Czech Web Archive Using the Example of Thematic Literary Collections

Marie Haškovcová

National Library of the Czech Republic, Czech Republic

LIGHTNING SESSION #2
Chair: Sawood Alam, Internet Archive

Modelling Archived Web Objects as Semantic Entities to Manage Contextual and Versioning Issues

Tom Storrar1, Manuela Pallotto Strickland2

1: The National Archives (UK), United Kingdom; 2: King's College London, United Kingdom


Modernizing Web Archives: The Bumpy Road Towards a General ARC2WARC Conversion Tool

Pedro Ortiz Suarez, Sebastian Nagel, Thom Vaughan

Common Crawl Foundation, United States of America


Poking Around in Podcast Preservation

Jasper Snoeren

Netherlands Institute for Sound and Vision, Netherlands


Automatic Clustering of Domains by Industry for Effective Curation

Thomas Smedebøl

Royal Danish Library, Denmark


Best Practice of Preserving Posts from Social Media Feeds

Magdalena Sjödahl

Arkiwera wcrify AB, Sweden

11:25am BREAK
11:55am

PANEL #1: Engaging Audiences
Chair: Eveline Vlassenroot, University of Ghent

“Beyond Preservation: Engaging Audiences and Researchers with Web Archives”

Eveline Vlassenroot1, Peter Mechant1, Friedel Geeraert2, Christina Vandendyck2, Cui Cui3,4, Beatrice Cannelli4, Anders Klindt Myrvoll5, Andrea Kocsis6

1: University of Ghent, Belgium; 2: KBR - Royal Library of Belgium, Belgium; 3: University of Sheffield, United Kingdom; 4: Bodleian Libraries, United Kingdom; 5: Royal Danish Library, Denmark; 6: National Library of Scotland, United Kingdom

SESSION #01: Tools Under Construction: Lessons Learned
Chair: Katherine Boss, National Library of Norway

Embedding the Web Archive in an Overall Preservation System

Hansueli Locher

Swiss National Library, Switzerland


UKWA Rebuild

Gil Hoggarth

British Library, United Kingdom


Under Construction: Web Archive of the German National Library

Natanael Arndt

German National Library, Germany

WORKSHOP #01: Exploring Dilemmas in the Archiving of Legacy Webportals: An Exercise in Reflective Questioning

Daniel Steinmeier, Sophie Ham

National Library of the Netherlands, Netherlands

1:00pm LUNCH
2:05pm

SESSION #02: Crawling Tools
Chair: László Tóth, National Library of Luxembourg

Lessons Learned Building a Crawler From Scratch: The Development and Implementation of Veidemann

Marius André Elsfjordstrand Beck

National Library of Norway, Norway


Experiences of Using in-House Developed Collecting Tool ELK

Lauri Ojanen

National Library of Finland, Finland


Better Together: Building a Scalable Multi-Crawler Web Harvesting Toolkit

Alex Dempsey, Adam Miller, Kyrie Whitsett

Internet Archive, United States of America


Lowering Barriers to Use, Crawling, and Curation: Recent Browsertrix Developments

Tessa Walsh, Ilya Kreymer

Webrecorder, United States of America

SESSION #03: Advocacy & User Engagement
Chair: Mark Phillips, University of North Texas Libraries

Insufficiency of Human-Centric Ethical Guidelines in the Age of AI: Considering Implications of Making Legacy Web Content Openly Accessible

Gaja Zornada, Boštjan Špetič

Computer History Museum Slovenia (Računalniški muzej), Slovenia


Web Archives for Music Research

Andreas Lenander Ægidius

Royal Danish Library, Denmark


IXP History Collection: Recording the Early Development of the Core of the Public Internet

Sharon Healy1, Gerard Best1, Lara Díaz Martínez2

1: Independent Researcher, Ireland; 2: University of Barcelona, Spain


Lost, but Preserved - A Web Archiving Perspective on the Ephemeral Web

Sawood Alam, Mark Graham

Internet Archive, United States of America

WORKSHOP #02: Web Archive Collections As Data

Gustavo Candela1, Chase Dooley2, Abbie Grotke2, Olga Holownia3, Jon Carlstedt Tønnessen4, Helena Byrne5, Emily Maemura6

1: University of Alicante, Spain; 2: Library of Congress, United States of America; 3: IIPC, United States of America; 4: National Library of Norway, Norway; 5: British Library, United Kingdom; 6: University of Illinois Urbana-Champaign, United States of America

3:40pm BREAK
4:40pm

POSTER SLAM
Chair: Olga Holownia, IIPC

‘We Are Now Entering the Pre-election Period’: Experimental Twitter Capture at The National Archives

Jake Bickford
The National Archives (UK), United Kingdom


The BnF DataLab Services and Tools for Researchers Working on Web Archives

Sara Aubry, Dorothée Benhamou-Suesser
Bibliothèque nationale de France, France


Designing Art Student Web Archives

Katherine Martinez
The New School, United States of America


Next Steps Towards A Formal Registry Of Web Archives For Persistent And Sustainable Identification

Eld Zierau
Royal Danish Library, Denmark


Using Web Archives to Construct the History of an Academic Field

Tegan Pyke
University of Bergen, Norway


Consortium on Electronic Literature (CELL)

Hannah Ackermans
University of Bergen, Norway


Arquivo.pt Annual Awards: A Glimpse

Daniel Gomes
Arquivo.pt, Portugal


Arquivo.pt API/Bulk Access and Its Usage

Vasco Rato, Daniel Gomes
Arquivo.pt, Portugal


Failed Capture or Playback Woes? A Case Study in Highly Interactive Web Based Experiences

Mari Allison
Smithsonian Libraries and Archives, United States of America


HAWathon: Participants' Experience

Ingeborg Rudomino, Anamarija Ljubek
National and University Library in Zagreb, Croatia


Supporting Best Practices for Archiving Social Media by Heritage Institutions in Flanders (and Beyond)

Ellen Van Keer1, Katrien Weyns2
1: meemoo, Flemish Institute for Archives, Belgium; 2: KADOC at Catholic University of Leuven, Belgium


Planning Web Archiving Within a Four-Year Scope: Making the New Collection Plan for the Years 2025-2028 in the National Library of Finland

Sanna Haukkala
National Library of Finland, Finland


Redirects Unraveled: From Lost Links to Rickrolls

Kritika Garg1, Sawood Alam2, Michele Weigle1, Michael Nelson1, Mark Graham2, Dietrich Ayala3
1: Old Dominion University, United States of America; 2: Internet Archive, United States of America; 3: Filecoin Foundation, Netherlands


Use of Screenshots as a Harvesting Tool for Dynamic Content and Use of AI for Later Data Analysis

Gaja Zornada, Boštjan Špetič
Computer History Museum Slovenia (Računalniški muzej), Slovenia


Asynchronous and Modular Pipelines for Fast WARC Annotation

Pedro Ortiz Suarez, Thom Vaughan
Common Crawl Foundation, United States of America


Politely Downloading Millions of WARC Files Without Burning the Servers Down

Pedro Ortiz Suarez, Thom Vaughan, Greg Lindahl
Common Crawl Foundation, United States of America


Robots.txt and Crawler Politeness in the Age of Generative AI

Sebastian Nagel, Thom Vaughan
Common Crawl Foundation, United States of America


Experiences Switching an Archiving Web Crawler to Support HTTP/2

Sebastian Nagel
Common Crawl Foundation, United States of America

4:40pm POSTER SESSION
7:00pm

DINNER

Pre-registration required for this event.