11:00am |
SESSION #1: AI & Machine Learning Chair: Andrea Goethals, National Library of New Zealand 11:00am - 11:20am Re-imagining Large-Scale Search & Discovery for the Library of Congress’s .gov Holdings recording | slides
Benjamin Lee Library of Congress, United States of America
11:20am - 11:40am Extending Classification Models with Bibliographic Metadata: Datasets and Results recording | slides
Mark Phillips1, Cornelia Caragea2, Seo Yeon Park2, Praneeth Rikka1, Saran Pandi2 1: University of North Texas, United States of America; 2: University of Illinois Chicago, United States of America
11:40am - 12:00pm Utilizing Large Language Models for Semantic Search and Summarization of International Television News Archives recording | slides
Sawood Alam1, Mark Graham1, Roger Macdonald1, Kalev Leetaru2 1: Internet Archive, United States of America; 2: GDELT Project, United States of America
12:00pm - 12:20pm MeshWARC: Exploring the Semantic Space of the Web Archive recording | slides
Amr Sheta2, Mohab Yousry2, Youssef Eldakar1 1: Bibliotheca Alexandrina, Egypt; 2: Alexandria University, Egypt
|
SESSION #2: Unique Content Chair: Meghan Lyon, Library of Congress 11:00am - 11:20am 80 Thousand Pages On Street Art: Exploring Techniques To Build Thematic Collections recording | slides
Ricardo Basílio Arquivo.pt, Portugal
11:20am - 11:40am Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements recording | slides
Christopher Rauch1, Mat Kelly1, Alexander Poole1, Michele C Weigle2, Michael L Nelson2, Travis Reid2 1: Drexel University, United States of America; 2: Old Dominion University, United States of America
11:40am - 12:00pm Working Together to Capture, Preserve and Provide Access to Digital Artworks recording | slides
Claire Newing1, Tom Storrar1, Patricia Falcao2, Sarah Haylett2, Jane Kennedy2 1: The National Archives, United Kingdom; 2: Tate, United Kingdom
12:00pm - 12:20pm Put it Back! Archived Memes in Context recording | slides
Valérie Schafer University of Luxembourg
|
WORKSHOP #5: Unlocking Access: Navigating Paywalls and Ensuring Quality in Web Crawling (Behind Paywall Websites - Crawl, QA & More) 11:00am - 12:20pm
Anders Klindt Myrvoll1, Thomas Martin Elkjær Smedebøl1, Samuli Sairanen2, Joel Nieminen2, Antares Reich3, László Tóth4 1: Royal Danish Library; 2: National Library of Finland; 3: Austrian National Library; 4: National Library of Luxembourg
|
1:40pm |
SESSION #3: Contextual Chair: Moises Rockembach, University of Coimbra 1:40pm - 2:00pm Averting the “Digital Dark Age”: The Digital Preservation Moment and the Birth of Modern Web Archiving, 1994-1996 recording | slides
Ian Milligan University of Waterloo, Canada
2:00pm - 2:20pm The Form Of Websites: Studying The Formal Development Of Websites, The Case Of Professional Danish Football Clubs 1996-2021 recording | slides
Niels Brügger Aarhus University, Denmark
2:20pm - 2:40pm Challenges of Putting Web Archives in a Comprehensive Context: the Case of Vdl.lu recording
Carmen Noguera University of Luxembourg
|
SESSION #4: Delivery & Access Chair: Lauren Ko, University of North Texas Libraries 1:40pm - 2:00pm Renascer Project Brings Back Old Websites at Arquivo.pt recording | slides
Ricardo Basílio Arquivo.pt, Portugal
2:00pm - 2:20pm Preserving the Uncrawlable: Serving the Server recording | slides
Andrew McDonnell University of Kentucky, United States of America
2:20pm - 2:40pm Lost and Found in Cyberspace: Reconstructing MultiTorg recording | slides
Jon Carlstedt Tønnessen National Library of Norway
2:40pm - 3:00pm Towards Multi-Layered Access with Automatic Classification recording | slides
Jon Carlstedt Tønnessen, Thomas Langvann National Library of Norway
|
WORKSHOP #6: Browser-Based Crawling For All: Introduction to Quality Assurance with Browsertrix Cloud 1:40pm - 3:00pm recording | slides
Andrew Jackson1, Anders Klindt Myrvoll2, Ilya Kreymer3, Tessa Walsh4, Henry Wilkinson4 1: Digital Preservation Coalition, United Kingdom; 2: Royal Danish Library; 3: Webrecorder, United States of America; 4: Webrecorder, Canada
|
3:30pm |
SESSION #5: Collaborations Chair: Helena Byrne, The British Library 3:30pm - 3:50pm LGBT+ and Religion: Queering Web Archive Research recording | slides
Jesper Verhoef Erasmus University Rotterdam, Netherlands
3:50pm - 4:10pm Enter The Trading Zone: When Web Archivists And Researchers Meet To Explore Transnational Events In Archived Web Collections recording | slides
Susan Aasman1, Anat Ben-David2, Niels Brügger3 1: University of Groningen, The Netherlands; 2: Open University of Israel; 3: Aarhus University, Denmark
4:10pm - 4:30pm Web Archiving, Open Access, & Multi-Custodialism recording | slides
Monica Westin1, Jefferson Bailey2 1: Internet Archive, United Kingdom; 2: Internet Archive, United States of America
|
SESSION #6: Legal & Ethical Chair: Jeffrey van der Hoeven, National Library of the Netherlands 3:30pm - 3:50pm Intellectual Property & Privacy Concerns of Web Harvesting in the EU recording | slides
Anastasia Nefeli Vidaki Vrije Universiteit Brussel, Belgium
3:50pm - 4:10pm DSM to the Rescue? Implications of the new EU Copyright Directive for Social Media Archiving: the Case of the Belgian Transposition and the Cultural Heritage Archives in Flanders recording | slides
Ellen Van Keer meemoo, Belgium
4:10pm - 4:30pm Digital Legal Deposit Beyond the Web recording | slides
Vladimir Tybin National Library of France
|
PANEL #1: “Can we capture this?”: Assessing Website Archivability Beyond Trial and Error Chair: Martin Klein, Los Alamos National Laboratory 3:30pm - 4:30pm recording | slides
Meghan Lyon1, Calum Wrench2, Tom Storrar3, Nicholas Taylor4 1: Library of Congress, United States of America; 2: MirrorWeb, United Kingdom; 3: The National Archives, United Kingdom; 4: Los Alamos National Laboratory, United States of America
|
4:40pm |
LIGHTNING TALKS Chair: Martin Klein, Los Alamos National Laboratory
Generative AI In Streamlining Web Archiving Workflows recording | slides Lok Hei Lui University of Toronto, Canada
Scaling Web Archiving: The Challenges of Deduplication recording | slides Alex Dempsey Internet Archive, United States of America
WARC-ing Legacy Archived Web Sites recording | slides Annabel Walz Archive of Social Democracy (Friedrich-Ebert-Stiftung), Germany
Describing Collections with Datasheets for Datasets recording | slides Emily Maemura1, Helena Byrne2 1: University of Illinois, United States of America; 2: British Library, United Kingdom
Visualizing the Web History of the Pandemic COVID-19 Outbreak In México: Last Stage recording | slides Carolina Silva Bretón National Library of México / Bibliographical Research Institute, Mexico
Towards A Formal Registry Of Web Archives For Persistent And Sustainable Identification recording | slides Eld Zierau1, Jon Tonnessen2, Anders Myrvoll1 1: Royal Danish Library; 2: National Library of Norway
Digital Storytelling: Creating An Exhibition of Web-born and Mobile Narratives recording | slides Ian Cooke, Giulia Carla Rossi British Library, United Kingdom
|
5:20pm |
DROP-IN TALKS Chair: Martin Klein, Los Alamos National Laboratory
Web Archives in Documenting AI History recording Juliette Benguigui Ecole nationale des chartes, France
Twenty-year Journey of the Croatian Web Archive (HAW): Key Milestones recording Inge Rudomino National and University Library in Zagreb, Croatia
ARK Persistent Identifier Use Cases in Web Archiving recording John Kunze ARK Alliance and Ronin Institute, United States of America
Web Archiving Update from KBR - Belgium recording Friedel Geeraert KBR, Belgium
Dealing with Donors of Web Archives: Challenges and Advice recording Lisa Lawlis Archives and Special Collections, Western University, Canada
History of Web Archive at UNHCR recording Ndahambelela Hertha IIPINGE United Nations High Commissioner for Refugees (UNHCR), Switzerland
Documenting the Smithsonian on the Web in 2020 recording Lynda Schmitz Fuhrig Smithsonian Libraries and Archives, United States of America
Bringing Web Archive to an Informational Commodity: The Standards Path recording Gregory Miura Université Bretagne Occidentale / ISO, France
Why Web Archive Performing Arts? recording Melissa Wertheimer Library of Congress, United States of America
A Comparative Study on ChatGPT: Generated vs. Manual Metadata Creation for Government Reports recording Peter Chan Stanford University, United States of America
|
5:30pm |
POSTER SESSION #1
Many Hands Make Light(er) Work: Collaborative Web Lifecycle Management Sara Day Thomson, Alice Austin, Stratos Filalithis, Bruce Darby University of Edinburgh, United Kingdom
How to Implement The Long-term Preservation of the Web? The Journey of the Publications Office of The European Union Corinne Frappart Publications Office of the European Union, Luxembourg
Digital Resources – Slovak Web Archive in Context with the New Legislation poster Jana Matúšková, Peter Hausleitner University Library in Bratislava, Slovak Republic
Why WARCs Are Complex But Not Messy Iris Geldermans National Library of the Netherlands
Report on the Scholarly Use of Web Archives Across Ireland: The Past, Present & Future(s) poster Helena Byrne1, Sharon Healy2 1: British Library, United Kingdom; 2: Independent Researcher, Ireland
Blog to Bytes: Exploring the UK Web Archive’s Blog Posts Through Text Analysis poster Helena Byrne, Carlos Lelkes-Raugal, Joan Francis British Library, United Kingdom
Podcasts Collection at the Bibliothèque Nationale de France: From Experimentation to the Implementation of a Functional Harvest poster Nola N'Diaye, Clara Wiatrowski National Library of France
|