Past workshops

Quick links:

PyRATES: Python and R Analysis of Time SerieS

Logo of Pyrates Workshop

June 3-6, 2024, Marina Del Rey, CA

This workshop focused on foundational training in Python and R for the geosciences, with an emphasis on time series analysis.

Outcome

Note that some of these repositories are kept private, awaiting for publication. To view the notebooks, we suggest the use of nbviewer.

Participant Name Affiliation Research Area Repository
Pranay Tirpude University of Delaware Paleoceanography and Paleoclimatology GitHub
Ruixia Bai University of California-Santa Barbara Volcanology, Geochemistry, and Petrology GitHub
Kathryn Chen Scripps Institution of Oceanography Ocean Sciences GitHub
Dewan Haque Louisiana State University Earth and Planetary Surface Processes GitHub
Natalie Teale Miami University Hydrology GitHub
Vasu Sreedevi Florida Institute of Technology Atmospheric Sciences GitHub
Jeng Hann Chong University of New Mexico Geodesy GitHub
Wenjia Li University of Idaho Geoinformatics GitHub
Victor Olawoyin Boston College Seismology GitHub
Somita Chaudhari University of Maryland Baltimore Geoinformatics GitHub

Workshop schedule

Day 1: Getting started (9am - 6pm)

Start End Session Speaker
9:00 10:00 Icebreaker, Overview of of FROGS and PyRATES, the LeapFROGS platform Deborah Khider
10:00 11:00 Reproducible Research: What is it, why is it important and how do we do it? Deborah Khider
11:00 11:30 Working through a reproducible example: the HadCRUT5 data All
11:30 12:00 BREAK ———
12:00 12:30 Reproducing a study in your field: getting started All
12:30 13:00 Reproducing a study in your field: getting started - Peer-to-peer feedback All
13:00 14:00 LUNCH On your own
14:00 16:00 Intro to Python or R Deborah Khider, David Edge, Julien Emile-Geay, Nick McKay
16:00 16:30 BREAK ———
16:30 17:30 Working Groups: Lightning Talks Deborah Khider, David Edge, Julien Emile-Geay, Nick McKay
17:30 18:00 Plenary debrief all

Day 2: Building Up (9am - 6pm)

Start End Session Speaker
9:00 11:00 Intro to Python or R/ Get started on your reproducibility study Deborah Khider, David Edge, Julien Emile-Geay, Nick McKay
11:00 11:30 BREAK ———
11:30 13:00 Concepts in data and software publishing Deborah Khider
13:00 14:00 LUNCH On your own
14:00 15:30 GitHub for software publishing and collaboration Deborah Khider
15:30 16:30 BREAK ———
16:30 18:00 Concepts in timeseries analysis and data processing Julien Emile-Geay, NickMcKay

Day 3: Roaring the Engine (9am - 6pm)

Start End Session Speaker
9:00 10:30 Measures of Association: correlation, regression, degrees of freedom Julien Emile-Geay, Nick McKay
10:30 11:30 BREAK ———
11:30 13:00 Significance and Surrogates Julien Emile-Geay, NickMcKay
13:00 14:00 LUNCH On your own
14:00 15:30 Spectral and Wavelet Analysis Julien Emile-Geay, David Edge
15:30 16:30 BREAK ———
16:30 18:00 Publishing Reproducible Workflows Deborah Khider

Day 4: Fireworks (9am - 12pm)

Start End Session Speaker
9:00 10:30 Preparing to publish your reproducibility study All
10:30 11:00 BREAK ———
11:00 12:00 Citing data and software in your publications Deborah Khider
12:00 12:15 BREAK ———
12:15 13:00 Wrap-up All

FAIRLeap: FAIR publishing in the geosciences

Logo of FAIRLeap Workshop

February 11-14, 2025, Virtual, Anywhere on Earth

This workshop focused on publishing all artifacts of research in a manner consistent with FAIR principles to ensure that science is reproducible.

Outcome

Note that some of these repositories are kept private, awaiting for publication. To view the notebooks, we suggest the use of nbviewer.

Participant Name Affiliation Research Area Repository
Kurt Lindberg University at Buffalo Paleoceanography and Paleoclimatology GitHub
Marion Dugue ETH Zurich Planetary Science GitHub

Schedule

Day 1: Introduction to FAIR Science Publishing

Start End Session Speaker
9:00 10:00 Icebreaker, Overview of FROGS and FAIRLeap, the LeapFROGS platform David Edge
10:00 10:30 Reproducible Research: What is it, why is it important and how do we do it? Deborah Khider
10:30 11:00 Concepts in software and data publishing Deborah Khider
11:00 11:30 Publishing reproducible workflows Deborah Khider
11:30 12:00 Citing data/software and getting started with open science Deborah Khider
12:00 13:00 LUNCH ———-
13:00 17:00 Asynchronous work session All

Day 2: Basics of GitHub for software and project management

Start End Session Speaker
9:00 9:30 Summary from Day 1 - Reproducibility Deborah Khider
9:30 10:00 What is Git? What is GitHub? David Edge
9:30 12:00 Live GitHub tour - creating a repository, branches, pull request, GitHub for project management, obtaining a DOI for your software Deborah Khider (Python) & David Edge (R)
12:00 13:00 LUNCH ———-
13:00 17:00 Asynchronous work session All

Day 3: Sharing reproducible workflows

Topics to be covered: Docker, Binder and myBinder, creating an environment or requirements file, GitHub actions, sharing your Docker container

Start End Session Speaker
9:00 9:30 Summary from Day 2 - Setting up a GitHub repository Deborah Khider
9:30 10:00 What are containers? Introduction to Docker and myBinder David Edge & Deborah Khider
10:00 12:00 Live container demo - creating a Docker container from your GitHub repository using GitHub actions and releasing notebooks on myBinder Deborah Khider (Python) & David Edge (R)
12:00 13:00 LUNCH ———-
13:00 17:00 Asynchronous work session All

Day 4: Fireworks

Start End Session Speaker
9:00 12:00 Presentations All
12:00 13:00 LUNCH ———-
13:00 17:00 Review of a colleague’s work - opening issues on GitHub All

Open Geoscience Hackathon

Logo of FAIRLeap Workshop

This workshop focused on packaging research software for the geosciences.

Outcome

Note that some of these repositories are kept private, awaiting for publication.

Participant Name Affiliation Research Area Repository
Kurt Lindberg University at Buffalo Paleoceanography and Paleoclimatology GitHub
Duyi Li University of Texas at Austin, Institute for Geophysics Cryosphere Sciences GitHub
Jean Costello Boston University GeoHealth GitHub
Preetika Kaur University of Wyoming Hydrology GitHub
Tanaya Gondhalekar University of Southern California Paleoceanography and Paleoclimatology GitHub
Surabhi Upadhyay Colorado School of Mines Hydrology GitHub
Juan S. Acero Triana UCR/ENSC Hydrology GitHub
Nelofar Qulizada University of Arkansas Science and Society GitHub
Lindsay Fitzpatrick University of Michigan Earth and Planetary Surface Processes GitHub
Bo Dong Lawrence Livermore National Laboratory Atmospheric Sciences GitHub
Zach Uhlmann McMillen, Inc. Geoinformatics GitHub

Schedule

Day 1: Basics

Start End Session Speaker
09:00 09:15 Welcome Deborah Khider
09:15 10:30 Basics of Packaging Nick McKay
10:30 10:45 BREAK ——–
10:45 12:00 Hands-on practice with toy package Deborah Khider
12:00 13:00 LUNCH ——–
13:00 17:00 Work on project ——–

Day 2: Documentation and testing

Start End Session Speaker
09:00 10:00 Writing a good documentation and testing Nick McKay
10:00 10:15 BREAK ——–
10:15 12:00 Hands-on practice with toy package Deborah Khider
12:00 13:00 LUNCH ——–
13:00 17:00 Work on project ——–

Day 3: Continuous integration and publishing your package

Start End Session Speaker
09:00 10:00 CI and publishing Nick McKay
10:00 10:15 BREAK ——–
10:15 12:00 Hands-on practice with toy package Deborah Khider
12:00 13:00 LUNCH ——–
13:00 17:00 Work on project ——–