*Workshop on Cyberinfrastructure and machine learning for digital libraries and archives* In conjunction with the Joint Conference on Digital Libraries 2018, Fort Worth Texas. https://urldefense.proofpoint.com/v2/url?u=https-3A__www.tacc.utexas.edu_conference_jcdl18&d=DwIFaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=b5NZPQUb9_r2rQ3Zd74ATT3aSs9yKyRnJLOhqJvd7fE&m=zqSKGfek7zcESN2kWflvTW41wJCa-PBfUvHDXOFBHVg&s=22vygEQI8Zd7M12QNq3W9z-JsQ-AOlkhvzFwGOLR6jo&e= *Agenda for June 3rd 2018* The workshop introduces a tryptic model that connects digital libraries and archives, cyberinfrastructure, and machine learning to stimulate research and implementation of automated methods to describe, represent, preserve, and facilitate access and reuse of large-scale scholarly data. Cyberinfrastructure refers to shared online research environments, backed up by advanced computing resources and supported by experts. Coupled with cyberinfrastructure, machine learning methods and tools can provide digital libraries and archives with powerful resources to enhance their ability to curate, organize, represent, and provide persistent access to large-scale collections, thus facilitating their discoverability and reuse. The papers and activities address the combination of cyberinfrastructure and machine learning throughout the lifecycle of digital collections; from data management planning, requirements gathering, description, preservation, access and publication. Bring your laptop for a day of lectures by remarkable researchers and for exciting hands-on exercises. 9:00-9:30 Conference chairs: Welcome and overview of the workshop’s agenda. 9:30-10:00 Matthew McEniry, Jessica Trelogan, and Santi Thompson: *Expanding Library Capacity and Facilitating Reuse through a Consortial Data Repository* 10:00-10:30 Coffee break. 10:30-11:00 Tanya Clement, Jon Dunn, Juliet Hardesty, Chris Lacinak and Amy Rudersdorf, *Audiovisual Metadata Platform Planning Project.* 11:00-11:30 Will Thomas, Benjamin Galewsky, Gregory Jansen, Sandeep Satheesan, Richard Marciano, Shannon Bradley, Jong Lee, Luigi Marini and Kenton McHenry, *Petabytes in Practice: Working with Collections as Data at Scale.* 11:30- 12:00 Maria Esteva, Hands-on tutorial:* A Method for Modeling Large-scale Data Requirements to Cyberinfrastructure and Machine Learning.* After introducing how data modelling was used in the design of the Digital Rocks Portal (https://urldefense.proofpoint.com/v2/url?u=https-3A__www.digitalrocksportal.org_&d=DwIFaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=b5NZPQUb9_r2rQ3Zd74ATT3aSs9yKyRnJLOhqJvd7fE&m=zqSKGfek7zcESN2kWflvTW41wJCa-PBfUvHDXOFBHVg&s=KL2PDSdt-Ch1YhJQGd0pHiKltB7ZmuMVJa_6HLqX-7k&e=), attendees working in multi-disciplinary groups will model large-scale data use cases including analysis, curation, access and publication functions and will map those to cyberinfrastructure. You are welcomed to share large scale data curation and analysis cases to discuss and resolve during the workshop. 12:00-1:30 Lunch Break 1:30 -2:00 Matt Lease: *What can Machine Learning and Crowdsourcing do for you? Exploring New Tools for Scalable Data Processing*. 2:00-2:30 Sachith Withana, Inna Kouper, and Beth Plale, *Data Capsule Appliance for Restricted Data in Libraries.* 2:30-3:00 Ruizhu Huang, Hands-on tutorial:* Machine Learning on Cyberinfrastructure* Attendees will get chance to learn how to log on to a supercomputer and start an interactive sessions using a big data cluster to explore how it can be used for a machine learning project. 3:00-3:30 Coffee Break 3:30-4:00 Dan Wu and Shaobo Liang, *Predicting Library OPAC Users’ Cross-device Transitions.* 4:00-4:30 Amit Gupta, Pankaj Jaiswal, Crispin Taylor, and Weijia Xu, *Improve Accessibility of Biology Papers through Integration of Domain Information Extraction in the Publication Pipeline.* 4:30-5:00 Closing Discussion Mark Conrad NARA Information Services Systems Engineering Division (IT) The National Archives and Records Administration Erma Ora Byrd Conference and Learning Center Building 494, Room 225 610 State Route 956 Rocket Center, WV 26726 Phone: 304-726-7820 Fax: 304-726-7802 Email: [log in to unmask] List archives at http://lists.ufl.edu/archives/recmgmt-l.html Contact [log in to unmask] for assistance To unsubscribe from this list, click the below link. If not already present, place UNSUBSCRIBE RECMGMT-L or UNSUB RECMGMT-L in the body of the message. mailto:[log in to unmask]