Status: Past Project
| PROJECT CLOSURE REPORT
SYSTEM INSTALLATION GUIDE
The British Library has received previous funding from the IIPC and developed a prototype application called Twittervane, which is capable of analyzing Twitter feeds and determining which websites are shared most frequently around a given theme over a given time period. These websites can then be presented to curators as potential titles for web archiving, saving time and effort required for manual selection.
- It allows archiving institutions to respond to sudden or sporadic events more quickly;
- It uses popularity of websites as selection criteria and exploits the wisdom of the crowd, adding social aspects to web archiving.
Scope of the proposed project
- Improved documentation covering installation and basic usage.
- Improvement the usability of the current simplistic user interface to make it more intuitive and usable.
- Enable the prototype to store collected tweets in plain JSON files, so large-scale analysis can be performed.
- Put in place data cleaning capabilities to automatically remove old, analyzed tweets.
- Automate background processes to analyze and process captured tweets.
- Put in place management capability for the Twitter streaming process so it can be monitored and the system can respond appropriately in case of any problems. This will likely involve splitting the monitor function from the current prototype and implement it as a separate backend process. Finally, we will also ensure the development of the Twittervane software is made open source (Apache 2.0 license), and moved to GitHub.
- The project final report (containing the evaluations from the IIPC members), and
- The initial release of the Twittervane application itself (including documentation covering installation and basic usage).