ASA Connect

 View Only

10-Feb-2026 webinar (1pm ET): Dataverse: an Open-Source Platform for Research Data

  • 1.  10-Feb-2026 webinar (1pm ET): Dataverse: an Open-Source Platform for Research Data

    Posted 02-03-2026 12:46

    Dear Colleagues,

    Data Umbrella has this upcoming webinar, which is free and open to the public.

    About

    Prep Work:

    Abstract

    Research data is foundational to data science, analytics, and research across disciplines-but sharing, preserving, and reusing data effectively remains a challenge.

    In this Data Umbrella webinar, speakers from the Dataverse Project and the Global Dataverse Community Consortium (GDCC) will introduce Dataverse, a widely used, open source research data repository platform that supports the FAIR Guiding Principles. Dataverse enables researchers and institutions around the world to publish, preserve, cite, and reuse research data across disciplines.

    The session will begin with an overview of what research data is, why sharing it matters, and how research data repositories fit into today's data ecosystem. The presenters will then introduce the Dataverse software platform, highlighting key features such as data citation, metadata, versioning, APIs, and integrations that support reproducible and reusable research.

    The webinar will also spotlight the global Dataverse community and the role of the GDCC in coordinating collaboration, governance, and sustainability. Attendees will learn about community working groups, annual Dataverse Community Meetings, and regular community calls-low‑barrier ways for newcomers and experienced users alike to get involved.

    This session is designed for:

    • Data scientists and analysts

    • Researchers and students

    • Librarians, data stewards, and repository managers

    • Anyone interested in open science, open source, and research data infrastructure

    Whether you are looking to find and reuse high‑quality research data, share your own datasets, or contribute to an open source global community, this webinar will provide a practical and community‑focused introduction to Dataverse.

     

    Outline

    This webinar will cover:

    • Introduction to the Dataverse Project

      • What Dataverse is and why it matters

      • A brief history of the project and its growth into a global platform

      • How Dataverse supports FAIR (Findable, Accessible, Interoperable, Reusable) data principles

    • Research Data Sharing & the Repository Ecosystem

      • What research data is and why data sharing is critical for reproducible and efficient research

      • An overview of different types of research data repositories

      • Benefits and challenges of sharing and preserving research data

    • Dataverse in Practice

      • Key features of the Dataverse software platform

      • How data users, analysts, and researchers can find, cite, and reuse data

      • A look at Harvard Dataverse as one example of a Dataverse installation

    • The Global Dataverse Community Consortium (GDCC)

      • How the Dataverse global community is organized and supported

      • The role of GDCC in governance, collaboration, and sustainability

      • Working groups, community calls, and annual Dataverse Community Meetings

    • Getting Involved

      • Ways to engage with the Dataverse community

      • Contributing to open source software, documentation, and working groups

      • Resources for learning more and staying connected

    ----------------------------------------
    How to Join the Webinar
    ----------------------------------------
    You can join via your browser (no app download required). Use Chrome or Firefox. Pre-register for the webinar:
    https://www.bigmarker.com/neo4j/Data-Umbrella-Webinar

    Video Recording


    This event will be recorded and placed on our YouTube. We usually have it up within 24 hours of the event. Subscribe to our YT and set your notifications: https://www.youtube.com/c/DataUmbrella/

    About the Speaker(s)

    [1] Ceilyn Boyd

    Ceilyn Boyd is the Interim Director of Data Science and Product Research at Harvard University's Institute for Quantitative Social Sciences (IQSS). Previously, Ceilyn established and led the Harvard Library Research Data Services Program, which connects the Harvard community to resources and services throughout the research data lifecycle. Boyd holds a B.A. in linguistics from Stanford University, an M.A. in anthropology and women's studies from Brandeis University, and both an M.S. and Ph.D. in library and information science from Simmons University. Ceilyn's research focuses on modeling research data, the sociotechnical characteristics of research data repositories, and investigating how data curators identify, define, and repair research data within these repositories.

    [2] Philipp Conzett

    Philipp works at UiT The Arctic University of Norway as Senior Research Librarian and Head of DataverseNO Repository Management. He is currently chairing the Steering Committee of the Global Dataverse Community Consortium (GDCC).

     

    [3] Sonia Maria Barbosa

    Sonia is the Associate Director of Dataverse Support, Data Curation, and The Murray Research Archive. She collaborates with the Harvard Dataverse Project team to support users of the software and to direct the stewardship and governance of the Harvard Dataverse Repository. She holds a BA and BSN and has over 30 years of experience working in data curation, sensitive data sharing, and reuse. 





    ------------------------------
    Reshama Shaikh
    Statistician
    ------------------------------