This lesson is still being designed and assembled (Pre-Alpha version)

This lesson is part of The Carpentries Incubator, a place to share and use each other's Carpentries-style lessons. This lesson has not been reviewed by and is not endorsed by The Carpentries.

Managing Open and Reproducible Computational Projects: Glossary

Key Points

Introduction to this course	This material is developed for mid-career and senior researchers in biomedical and biosciences fields. This training aims to build a shared understanding and facilitate the integration of computational reproducibility in data science.
Better and faster research!	motivations
What is special in a data science project?	content organisation
Reproducibility	Version-controlled repositories help record different contributions and contributor information openly. Open Science is an umbrella term that covers different practices for research in the context of different research objects. Online persistent identifiers, such as Digital Object Identifiers (DOIs), are useful for releasing and citing different versions of research objects.
An introduction to version control	version control git
What IT tools can be used?	content organisation
Setting up a computational project	A shared repository with well-structured and organised files is crucial for starting a project. Documentation is as important as data and code for understanding the different aspects of the project and communicating about the research. Licensing and open science practices allow proper use and reuse of all research objects, and hence should be applied in computational research from the start.
Implementing tools and methods during the project	Make group leaders familiar with practices that are crucial for their teams to develop reproducible code. Encourage researchers to think about code reproducibility through quality checks, testing, and sharing their code as well as their research environment. Introduce Continuous Integration for automating the testing process.
Research Data Management	Good research data management practices ensure the findability of your research data. Storing, regularly backing up and archiving data prevent data loss. Sharing all types of research data transparently makes them easier for others to understand and reuse. Good data management also gives fair recognition to people generating, handling and using data, and further stimulates collaboration with others.
Fostering documentation	Version-controlled repositories help record different contributions and contributor information openly. Open Science is an umbrella term that covers different practices for research in the context of different research objects. Online persistent identifiers, such as Digital Object Identifiers (DOIs), are useful for releasing and citing different versions of research objects.
Scientific rigour with code	statistics
Coding basics	First key point. Brief Answers to questions. (FIXME)
Code testing and Review	There are many benefits of code review and this should be implemented and practised in research team culture as early and as frequently as possible. Synchronous code review creates opportunities for researchers to get feedback and learn from others in real-time. Asynchronous code review is a good practice when working with busy researchers or collaborators in different time zones.
Code Modularity
Publication and release	First key point. Brief Answer to questions. (FIXME)
Open Science Practices	Version-controlled repositories help record different contributions and contributor information openly. Open Science is an umbrella term that covers different practices for research in the context of different research objects. Online persistent identifiers, such as Digital Object Identifiers (DOIs), are useful for releasing and citing different versions of research objects.
Data and code citation	First key point. Brief Answer to questions. (FIXME)

Glossary

FIXME