- Website: https://www.eventbrite.co.uk/e/data-science-perspectives-phd-conference-registration-79365276679
- Location: Newcastle, UK
- Date: 10-11 March 2020
- The Turing Way: guide to reproducible, ethical and collaborative data science and research
- Malvika Sharan, Kirstie Whitaker
- Presentation: https://zenodo.org/record/3706577
- Shared HackMd for 100 min relay
- The Alan Turing Institute, London, UK
Reproducible and ethical research is necessary to ensure that scientific work can be trusted. Funders and stakeholders are beginning to require that publications and research outreach include access to the underlying data and analysis code. The goal is to ensure that all results can be independently verified and built upon in future work. This is sometimes easier said than done. Sharing data and code requires an understanding of data management, library sciences, software development, and continuous integration techniques: skills that are not widely taught or expected of academic researchers and data scientists. The Turing Way is an open-source, community-led and community-develop book project that aims to bring together researchers to develop data science practices "too easy not to do". The handbook in its current state includes training material on version control, analysis testing, and open and transparent communication with its users.
A diverse set of community members working across different disciplines and career stages within data research and training have contributed to the development of this project. The collaboration and contributions are facilitated on an online project repository (https://github.com/alan-turing-institute/the-turing-way) through in-person contribution workshops (book dash events) and online co-working calls (collaboration cafes). The Turing Way team members have also delivered a series of workshops on GitHub, git, Jupyter Notebook, BinderHub, Binder, and basic programming to impart skills and confidence to empower new contributors, and thus, to sustain an open-source ecosystem. The Turing Way project will expand into a series of books, which, in addition to reproducibility, will capture the vast amounts of knowledge of data science practices in the areas of project design, communication and outreach, collaboration, and ethics.
In this workshop, we will introduce the project in a short talk on The Turing Way followed by an interactive session where the participants will learn about existing chapters and will gain practical skills to work on an online repository that they can use, share, develop or modify for their own research. All participants will leave the session knowing that "Every Little Helps" when making their work intentionally designed to be open and reproducible. The session facilitator will help participants in identifying their personal route of contribution to an open-source project while improving The Turing Way handbook for future readers.