*Workshop on Cyberinfrastructure and machine learning for digital libraries
In conjunction with the Joint Conference on Digital Libraries 2018, Fort
*Agenda for June 3rd 2018*
The workshop introduces a tryptic model that connects digital libraries and
archives, cyberinfrastructure, and machine learning to stimulate research
and implementation of automated methods to describe, represent, preserve,
and facilitate access and reuse of large-scale scholarly data.
Cyberinfrastructure refers to shared online research environments, backed
up by advanced computing resources and supported by experts. Coupled with
cyberinfrastructure, machine learning methods and tools can provide digital
libraries and archives with powerful resources to enhance their ability to
curate, organize, represent, and provide persistent access to large-scale
collections, thus facilitating their discoverability and reuse.
The papers and activities address the combination of cyberinfrastructure
and machine learning throughout the lifecycle of digital collections; from
data management planning, requirements gathering, description,
preservation, access and publication. Bring your laptop for a day of
lectures by remarkable researchers and for exciting hands-on exercises.
9:00-9:30 Conference chairs: Welcome and overview of the workshop’s agenda.
9:30-10:00 Matthew McEniry, Jessica Trelogan, and Santi Thompson: *Expanding
Library Capacity and Facilitating Reuse through a Consortial Data
10:00-10:30 Coffee break.
10:30-11:00 Tanya Clement, Jon Dunn, Juliet Hardesty, Chris Lacinak and Amy
Rudersdorf, *Audiovisual Metadata Platform Planning Project.*
11:00-11:30 Will Thomas, Benjamin Galewsky, Gregory Jansen, Sandeep
Satheesan, Richard Marciano, Shannon Bradley, Jong Lee, Luigi Marini and
Kenton McHenry, *Petabytes in Practice: Working with Collections as Data at
11:30- 12:00 Maria Esteva, Hands-on tutorial:* A Method for Modeling
Large-scale Data Requirements to Cyberinfrastructure and Machine Learning.*
After introducing how data modelling was used in the design of the
Digital Rocks Portal (https://urldefense.proofpoint.com/v2/url?u=https-3A__www.digitalrocksportal.org_&d=DwIFaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=b5NZPQUb9_r2rQ3Zd74ATT3aSs9yKyRnJLOhqJvd7fE&m=zqSKGfek7zcESN2kWflvTW41wJCa-PBfUvHDXOFBHVg&s=KL2PDSdt-Ch1YhJQGd0pHiKltB7ZmuMVJa_6HLqX-7k&e=), attendees
working in multi-disciplinary groups will model large-scale data use cases
including analysis, curation, access and publication functions and will map
those to cyberinfrastructure. You are welcomed to share large scale data
curation and analysis cases to discuss and resolve during the workshop.
12:00-1:30 Lunch Break
1:30 -2:00 Matt Lease: *What can Machine Learning and Crowdsourcing do for
you? Exploring New Tools for Scalable Data Processing*.
2:00-2:30 Sachith Withana, Inna Kouper, and Beth Plale, *Data Capsule
Appliance for Restricted Data in Libraries.*
2:30-3:00 Ruizhu Huang, Hands-on tutorial:* Machine Learning on
Attendees will get chance to learn how to log on to a
supercomputer and start an interactive sessions using a big data cluster to
explore how it can be used for a machine learning project.
3:00-3:30 Coffee Break
3:30-4:00 Dan Wu and Shaobo Liang, *Predicting Library OPAC Users’
4:00-4:30 Amit Gupta, Pankaj Jaiswal, Crispin Taylor, and Weijia Xu, *Improve
Accessibility of Biology Papers through Integration of Domain Information
Extraction in the Publication Pipeline.*
4:30-5:00 Closing Discussion
NARA Information Services
Systems Engineering Division (IT)
The National Archives and Records Administration
Erma Ora Byrd Conference and Learning Center
Building 494, Room 225
610 State Route 956
Rocket Center, WV 26726
Email: [log in to unmask]
List archives at http://lists.ufl.edu/archives/recmgmt-l.html
Contact [log in to unmask] for assistance
To unsubscribe from this list, click the below link. If not already present, place UNSUBSCRIBE RECMGMT-L or UNSUB RECMGMT-L in the body of the message.
mailto:[log in to unmask]