Speaker: Jason R. Baron, Professor of the Practice, College of Information Studies, University of Maryland
Abstract: For over a decade, the US Archivist has been attempting to mandate a future start date for the accessioning of all permanent federal government records into the National Archives and Records Administration (NARA) solely in electronic or digital formats. Although the “no more hard copy records” date for doing so has been extended twice, in the interim much has been accomplished in promoting governmental policies that involve the management of hundreds of millions if not billions of electronic records in individual agencies, including most prominently the archiving of e-mail records. Three questions we should be asking: how will NARA meet the challenge of accessioning especially born digital records when the deadline does finally arrive? Are freedom of information laws up to the task of providing timely access in the future to those records? And how can AI tools assist in ensuring that public access to the government’s archives remains real and not just aspirational?
Bio: Jason R. Baron is a Professor of the Practice in the College of Information Studies at the University of Maryland. Between 2013 and 2020, he acted as Of Counsel to the eDiscovery & Information Governance group at Faegre Drinker, LLP. During his prior 33 years in government service, he served as the first appointed director of litigation at the National Archives and Records Administration, and before that as a trial lawyer and senior counsel at the Department of Justice. In those capacities, Mr. Baron acted as lead counsel on landmark lawsuits involving the preservation of White House email, and also played a leading role in improving federal electronic recordkeeping policies. He is a recipient of the international Emmett Leahy Award for his career contributions in records and information management. Mr. Baron received his B.A. magna cum laude with honors from Wesleyan University, and his J.D. from the Boston University School of Law.
Abstract: It seems likely that AI will, at some point in the relatively near future, provide originating organisations with the capability to re-aggregate records in any (or all) of their systems. The organisations concerned could then, if they wished, begin to apply access permissions and retention rules to records on the basis of an entirely different set of aggregations to those used by the end-users that created and received them.
This opens up a theoretical question:
will the coming of such a capability mean that the way that records are aggregated in the day-to-day systems that end-users used to create and receive them (document management systems, collaboration systems, email systems, chat systems etc.) cease to have anything more than a purely temporary significance in recordkeeping? OR
will that ‘original order’ of records instead place constraints and limits on the ways that original organisations can make use of artificial intelligence capabilities for improving the precision of the application of access permissions and retention rules?
This talk will explore the impact of AI on the application of access permissions and retention rules within digital corporate systems, on the basis of a set of thought experiments carried out in James’ doctoral research project.
Bio: James Lappin is a doctoral researcher with Loughborough University’s Centre for Information Management. He has worked in the field of archives and records management for thirty years as a practitioner, consultant, researcher, policy advisor, presenter, blogger, podcaster and cartoonist.
Abstract: Artificial Intelligence (AI) developments in the private sector have quickly permeated throughout all levels of society. Daily we are – either directly or indirectly – interacting with AI. Acknowledging the benefits that AI has brought to the private sector, particularly regarding efficiency gains, many have started to talk about the potential benefits that AI may have when applied in and by the public sector. Certainly, some benefits do exist. However, it is also important to realize that the public sector and the private sector are different, with different requirements, goals, and aims. While the private sector may be concerned with customer acquisition, maximizing profits, or market growth, the public sector from does not have the same motivations. Thus, the reasons for applying AI, the ways in which AI is applied, and the areas where AI can be applied will be similarly different. To maximize the impact that AI can have, these differences must be acknowledged. Starting from this need for differentiation, this talk will help to demystify AI, by defining and outlining what AI is and is not. Following this, the talk will highlight the necessary building blocks for the usage of AI in the public sector: technically, infrastructurally, and legislatively. Finally, the talk will conclude by discussing how the public sector can best use and build AI by discussing procurement best practices and presenting best practice case studies.
Bio: Keegan McBride is a departmental research lecturer in AI, Government, and Policy and Course Director of the Social Science of the Internet MSc program at the Oxford Internet Institute. He is an expert on the topic of digital government and has published research in leading journals and conferences on topics such as Artificial Intelligence in the public sector, open government data, government digitalization, and government interoperability and data exchange systems. In his research he aims to develop an understanding about the future trajectory of the state in the digital age by exploring the complex and co-evolutionary relationships between technology, society, and the state.
Our Workshop 2 will take place on the 4th of May of 2023 in London both in person, at The White Chapel building in East London, and online. The workshop will continue our discussions on AI applied to born-digital archives, particularly government archives.
Details about registration and the full program of speakers will be published soon.
Through a series of talks and a round table, this day long workshop delved into the challenges and opportunities that AI offers to the management and use of digital born archives.
The workshop was an hybrid event (in London and online).
Invited speakers included Professor Stephanie Decker and Dr Adam Nix from the University of Birmingham; Dr Jenny Bunn from The National Archives and University College London; and Dr Tony Russell-Rose from Goldsmiths University.
PLEASE SCROLL DOWN FOR ABSTRACTS, VIDEOS AND SLIDES.
Finding light in dark archives: Using AI to connect context and content in email.
Professor Stephanie Decker and Dr Adam Nix (University of Birmingham).
The practice of digital archival discovery is still emerging, and the approaches future research will take when using digital sources remain unclear. Archival practice has been shaped by paper-based, pre-digital sources and guides assumptions around how researchers will access and make use of such collections. Paradoxically, dealing with the increasing relevance of born-digital records is not helped by the fact that many born-digital collections remain dark, in part while questions of how they should be effectively made available are answered. Our research takes a user perspective on discovery within born-digital archives and seeks to promote more meaningful access to born-digital archives for researchers. In doing so, our work deals with the implications that unfamiliar archival technologies (including artificial intelligence) have on disciplinary traditions in the humanities and social science, with a specific focus on historical and qualitative approaches.
Our work in this area currently focuses on the issue of context within organisational email, and the challenges of searching and interpreting large bodies of email data. We are particularly interested in how effective machine-assisted search and multiple pathways for discovery can be used to open contextually opaque collections. Such access is likely to leverage a collection’s structural and content characteristics, as well as targeted archival selection and categorisation. We ultimately suggest that by combining relatively open user-led interfaces with pre-selective material, digital archives can provide environments suited to both the translation of existing research practices and the integration of more novel opportunities for discovery. Our presentation will summarise our progress in this area and reflect on the technical and methodological questions our work here has raised.
Putting principle into practice: Transparency, recordkeeping and AI.
Dr Jenny Bunn (The National Archives and University College London).
At the level of principle it is difficult to argue against the inherent good-ness of ideals such as transparency, accountability and fairness, but they have never been easy to put into practice. The increasingly advanced assistance technologies, generally placed under the label of AI, can now offer us further complicate this picture; offering as they do new possibilities for us to distance ourselves from both the consequences of our decisions and the very making of them. Recordkeeping has long acted to bridge this distance and this presentation will consider the new forms it may need to take to continue to ensure that accounts are rendered and explanations offered in the enduring spirit of transparency.
Searching, fast and slow: rethinking the query builder paradigm.
Dr Tony Russell-Rose (Reader in Computer Science, Goldsmiths University).
Knowledge workers such as information professionals, legal researchers and librarians need to create and execute search strategies that are comprehensive, transparent, and reproducible. The traditional solution is to use command-line query builders offered by proprietary database vendors. However, these are based on a paradigm that dates from the days when databases could be accessed only via text-based terminals and command-line syntax. In this talk, we explore alternative approaches based on a visual paradigm in which users express concepts as objects on an interactive canvas. This offers a more intuitive UX that eliminates error, makes the query semantics more transparent, and offers new ways to collaborate and share best practices.
Artificial Intelligence (AI) is increasingly necessary in support of records management decisions. The scale, rapidity and complexity required in modern records management operations all contribute to make AI increasingly relevant to practitioners. This talk will start by reviewing recent contributions at the intersection of records management and AI, as well as highlight the many challenges still lying ahead of us. Preliminary considerations on the preconditions and requirements of using AI in support of records management will be proposed for discussion. These will be grounded in concrete examples and in an appreciation of the limits and potential of modern-day AI technology.
He did his PhD at the Digital Humanities Laboratory of the EPFL in Lausanne, working on methods for text mining and citation analysis of scholarly publications, and is co-founder of Odoma, a start-up offering customised machine learning techniques in the cultural heritage domain. Giovanni was also a Co-investigator on the Living with Machines project and convenes the AI for Arts interest group at the Turing.
Prior to joining the UvA, Giovanni has been part of the Research Engineering Group of The Alan Turing Institute, and a researcher at Leiden University (CWTS), the Leibniz Institute of European History in Mainz, and the University of Oxford. He studied computer science (BSc) and history (BA, MA) in Udine, Milan, Padua and Venice in Italy.