Software Carpentry - Overview Woods Hole Scientific Community

Software Carpentry Team

November, 2013

Copy This Lecture!







Creative Commons License
Software Carpentry Overview by Software Carpentry is licensed under a Creative Commons Attribution 3.0 Unported License.

More About Software Carpentry

History

What We Teach

What We Actually Teach

How to THINK like a programmer

Who We Teach

Who We Are

Our Goals for You

We will take you on a tour of:

Some High-Level Advice

Be fluent in multiple languages

You speak multiple languages when interacting with a computer. Choosing to use a new tool, library, or language can be similar to learning a new language:

Use domain specific languages and libraries to increase your expressivity

Make it work right first, make it fast later.

Use REPL Environments for Development

REPL (read-eval-print-loop) environments tighten the coupling between the code you write and the results you see, increasing productivity.

REPL non-REPL
Visual Basic Supercomputers
Command Line C
IPython and Python C++
MATLAB Java
Mathematica Fortran
LISP

Don't Repeat Yourself (or Others)

Automate common actions by saving simple blocks of code into scripts

Refactor commonly used blocks of code into functions

Group commonly used functions into libraries

Reduce Complexity

Basic strategies

Back up your data!

Use version control for checkpointing and collaboration

We will learn more about working with git and GitHub today

Verify and Validate your Code

Principles of verification and validation

Document your Computational Work

Principles of documentation

Schedule

Today

Closing Thoughts

Aim for reproducibility

References and Further Reading

Books

Pragmatic Programmer, The: From Journeyman to Master

Andrew Hunt and David Thomas

ISBN 978-0132119177

Describes the important principles and practices of being an effective programmer, instead of teaching a specific language or technique

Code Complete Second Edition

Steve McConnell

ISBN 978-0735619678

Focuses on principles of software construction, with attention to skills, testing, and design.

Verification and Validation in Scientific Computing

William L. Oberkampf and Christopher J. Roy

ISBN 978-0521113601

Focuses on verification and validation of numerical solutions to models described by systems of partial differential and integral equations.

Research Literature

Programming Languages for Scientific Computing

Matthew G. Knepley

Preprint: http://arxiv.org/pdf/1209.1711.pdf

Gives an overview of modern programming languages and techniques such as code generation, templates, and mixed-language designs. This is a preprint, so expect some rough spots.

Two Solitudes

Greg Wilson

Slides: http://www.slideshare.net/gvwilson/two-solitudes

Describes Greg's journey as a scientist and leader for the Software Carpentry project, provides some insight into the differences between industry and academics.

Best Practices for Scientific Computing

D. A. Aruliah, C. Titus Brown, Neil P. Chue Hong, Matt Davis, Richard T. Guy, Steven H. D. Haddock, Katy Huff, Ian Mitchell, Mark Plumbley, Ben Waugh, Ethan P. White, Greg Wilson, Paul Wilson

Preprint: http://arxiv.org/abs/1210.0530

Good summary paper of many fundamental practices for working with and developing scientific software. This is a preprint, so expect some rough spots.

Web References

What Every Computer Scientist Should Know About Floating-Point Arithmetic

David Golberg

Web article: http://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html

Introduction to the IEEE floating-point standard, its implications, and many of the common pitfalls when using floating-point numbers in scientific computing

The Research Software Engineer

Rob Baxter, Neil Chue Hong, Dirk Gorissen, James Hetherington, and Ilian Todorov

Web article: http://dirkgorissen.com/2012/09/13/the-research-software-engineer

Discussion of the current challenges to scientific software engineering as a profession.

Science Code Manifesto

http://sciencecodemanifesto.org

Publicly signed commitment to clear licensing and curation of software associated with research publications.

You sometimes need geeks. You never need dorks.