Brief description of the project
BnF wishes to host a datathon on web archive collections from francophone national libraries with a legal deposit mission. The datathon will be led by Archives Unleashed and will use datasets from BnF, KBR and BnL.
Goals, outcomes, and deliverables
BnF will host a datathon on web archive collections from francophone national libraries with a legal deposit mission. The datathon will be led by Archives Unleashed and will use datasets from BnF, KBR and BnL. Our first goal is to promote our web archive collections and to encourage their use in research. To that end, we intend to give attendees the opportunity to use appropriate tools for data analysis, data visualization, and text and data mining. The audience should be as large and diverse as possible, provided that attendees are interested in archived web collections.
- Slideshows of the award-winning datathon projects.
- Reports on the event, published on the websites of Archives Unleashed, IIPC, BnF, KBR and BnL.
How the project furthers the IIPC strategic plan
The datathon will allow the three lead organizations to promote their web archive collections and will give attendees from various partner institutions the opportunity to experiment with text mining and data visualization on these collections. The datathon therefore supports objective 4 of the IIPC strategic plan: Partnerships and Outreach – To engage and support researcher involvement in, and benefit from, member activities.
Detailed description of the project
BnF plans to host an Archives Unleashed datathon in October 2020. The event will be an opportunity to involve a range of participants in text and data mining on web archive collections from francophone national libraries with a legal deposit mission. We intend to focus the datathon on legal deposit data, on francophone data, and on multilingual data with a francophone component. In addition, the datathon will be an opportunity to test the Archives Unleashed Toolkit on legal deposit collections and to identify the improvements needed, as part of the IIPC project on adapting Archives Unleashed Cloud for in-house use at national and university libraries.
Candidates from IIPC member institutions will be given priority for registration. Since we wish to promote our collections and attract a wider public, the event will also be open to candidates from other institutions.
The call for participation will be shared on the IIPC channels (mailing list and website). Applications will be submitted through an online form handled by BnF. Each of the three lead contributors (BnF, KBR, BnL) will select 5 attendees, and 15 further attendees will be recruited through the announcement on the IIPC website and mailing list. Each candidate must send a 250-word statement and a CV. A committee of two members from BnF and two from Archives Unleashed will make the final selection. The maximum number of attendees will be 30. Remote participation will not be allowed, and attendees will have to bring their own laptops.
Project schedule of completion
- Call for Participation: April 2020
- Registration & notice of acceptance: July 2020
- Event: October 2020
- Report and documentation: November 2020