Introduction

My parents are both physical geographers, which meant on family trips they always pointed out the scars of an old earth weathered over large timescales. They showed my brothers and me the basalt pillars that turned into the black, sandy beaches, toe-searingly hot in the sun, of the crater lake where we swam every summer; the huge grooves carved into the rock faces of the Alps by glaciers that long ago filled the valleys where we went hiking; the Rhine river depositing the clay once used by the brickworks in our hometown. And in the rocks that my father collected, sometimes a fossil of an animal or a plant.

I don’t remember ever not having the vague notion that, over large time scales, species become other species, or go extinct and become fossils. This is what evolution is to most people: a story of things appearing and disappearing, from trilobites to cavemen, all very long ago and neatly organized into eras that you’re supposed to memorize. Only as an undergraduate student, near graduation, did I come to realize that evolution is not actually a story: it's a process, which creates the patterns we see around us in nature.

In my research at the time I was trying to show how primates had evolved to fill up different ecological niches to various levels of specialization, thinking that there might be simple rules that underlie this. My adviser taught me to look at this diversity as the outcome of a repeated natural experiment: from a common ancestor, primate lineages had to come up with different, evolved solutions under different pressures. Or, sometimes, the same solution independently arrived at multiple times, under the same pressures. By drawing the right comparisons across combinations of solutions, the rules that led to them should reveal themselves.

Because all living and extinct species are related to one another we have to be careful with drawing comparisons among them: not every pattern that we see in nature is the result of solutions repeatedly arrived at by evolution. At least some similarities across species are the result of a solution arrived at just once, in a common ancestor, whose characteristics simply stuck around in its descendants. Where we might think there is a general rule, there’s actually just a single event that happened before the tree of life branched out further into new species.

The methods that are used to reconstruct the shape of the tree of life and to place evolutionary changes on this tree are my main scientific interest. To pursue this interest in the way that I want I have had to pick up the nerd skills that broadly have to do with how we digitally represent, describe, combine and analyze observations we make - at the level of DNA all the way to that of individuals and groups. I have so far been able to apply these techniques to interesting questions about evolution in life forms as diverse as primates, pitcher plants, snakes, tomatoes, birds, orchids, and numerous insects.

Recent funding sources

Funding for my research comes from a disparate number of sources. In the last years, these included:

Publications

Mutanen, M., Kivelä, S.M., Vos, R.A., Doorenweerd, C., Ratnasingham, S., Hausmann, A., Huemer, P., Dincă, V., van Nieukerken, E.J., Lopez-Vaamonde, C., Vila, R., Aarvik, L., Decaëns, T., Efetov, K.A., Hebert, P.D.N., Johnsen, A., Karsholt, O., Pentinsaari, M., Rougerie, R., Segerer, A., Tarmann, G., Zahiri, R., Godfray, H.C.J. 2016. Species-Level Para-and Polyphyly in DNA Barcode Gene Trees: Strong Operational Bias in European Lepidoptera. Systematic Biology [10.1093/sysbio/syw044]

Pentinsaari, M., Vos, R.A., Mutanen, M. 2016. Algorithmic single-locus species delimitation: effects of sampling effort, variation and non-monophyly in four methods and 1870 species of beetles. Molecular Ecology Resources [10.1111/1755-0998.12557]

R.A. Vos. 2016. Ten simple rules for managing high-throughput nucleotide sequencing data. bioRxiv [10.1101/049338]

Schilthuizen, M., Pimenta, L., Lammers, Y., Steenbergen, P., Flohil, M., Beveridge, N., van Duijn, P., Meulblok, M., Sosef, N., van de Ven, R., Werring, R., Beentjes, K., Meijer, K., Vos, R., Vrieling, K., Gravendeel, B., Choi, Y., Verpoorte, R., Smit, C., L. Beukeboom. 2016. Incorporation of an Invasive Plant Into a Native Insect Herbivore Food Web. PeerJ [10.7717/peerj.1954]

Koureas, D., Hardisty, A., Vos, R.A., Agosti, D., Arvanitidis, C., Bogatencov, P., Buttigieg, P.L., De Jong, Y., Horvath, F., Gkoutos, G., Groom, Q.J., Kliment, T., Kõljalg, U., Manakos, I., Marcer, A., Marhold, K., Morse, D., Mergen, P., Penev, L., Pettersson, L.B., Svenning, J.-C., Van de Putte, A., Smith, V.S. 2016. Unifying European Biodiversity Informatics (BioUnify). Research Ideas and Outcomes 2: e7787 [10.3897/rio.2.e7787]

Employment history

Lecturer

Onwards from 2017 I coordinate the MSc course "Methods in Biodiversity Analysis" at the Institute of Biology, Leiden University.

Researcher, bioinformaticist

Starting February 1st 2012 I am employed as a researcher in bioinformatics at NCB Naturalis.

Marie Curie Research Fellow

Starting from Fall 2009 to Fall 2011, I was employed as a research fellow at the University of Reading as a member of the Reading Evolutionary Biology Group working with Mark Pagel on natural selection in Primates, developing a comparative framework integrating molecules and natural history.

Bioinformatics consultant

Starting from Fall 2008 till November 2010, I have worked on the TreeBASE project as an outside consultant for the University of Pennsylvania.

Postdoctoral fellow

Starting from 2006 till Fall 2009, I have worked at the University of British Columbia, as an international collaborator on the CIPRES project, contributing to architecture and database design.

Research assistant

From January 2001 through August 2006 I have worked intermittently as a research assistant at Simon Fraser University on a number of projects involving phylogenetic inference.

Software

I am an active contributor of open source bioinformatics software. In the past, I have developed educational software (web-based, for BioMedia, the Zoological Museum of Amsterdam, Simon Fraser University; and CD-ROM based for the AMSTEL institute). The projects listed below – for which I am the primary author – are current and ongoing. Their combined worth is estimated by an independent open source code analysis group at nearly $9 million. My programming skills include Java (Spring, Hibernate), Perl, C, web standards (XML/XSLT/XSD/HTML/CSS), ontologies (OWL, protege), databases (MySQL, PostgreSQL, dbxml, DB2), collaborative development best practices (svn, test-driven development) and design patterns.

TreeBASE

TreeBASE is a relational database of phylogenetic information hosted by the San Diego Supercomputing Center. In the Fall of 2008 I joined the development project to redesign TreeBASE from the ground up. I presently volunteer some of my time as the developer of the web service API.


Bio::Phylo

Bio::Phylo is a collection of modules for phylogenetic analysis and biodiversity assessments. Primary author: Rutger Vos, with contributions from Aki Mimoto, Klaas Hartmann, Jason Caravas and Mark Jensen.


NeXML

NeXML is an emerging data exchange standard for phylogenetics and bioinformatics.


CIPRES

The CyberInfrastructure for Phylogenetic Research is an NSF-sponsored 5 year project to develop an architecture for data exchange and distributed analysis. I am the designer of the perl5 branch of the infrastructure.


COPE

An implementation of OMG-compliant CORBA ORB and related architecture, COPE was originally conceived by Bart Schuller, I am the current maintainer.


Exception::Class::TCF

A port of Java/C++ style exception handling.

Teaching

I enjoy teaching. In addition to the longer courses below I have given guest lectures for Shad Valley, taught seminars on bioinformatics at Simon Fraser University and given various conference talks at the SSE Evolution Conference and the Pacific Ecology Conference. In my classes I seek to combine new media with my lectures. I consistently receive positive student evaluations. Recently, I have also mentored two graduate students (Klaas Hartmann and Jason Caravas) for the NeXML project, and one student (Anurag Priyam) for the NeXML project, both with funding by Google.

Invited speaker, lecturer

In 2007, 2008, 2009 and 2010 I have taught a course on computational phyloinformatics hosted the first two years at the National Evolutionary Synthesis Center in the US, in 2009 at the Gulbenkian Institute in Lisbon, Portugal and in 2010 at the Beijing Genomics Institute in Shenzhen, China.

Teaching assistant

For various courses at Simon Fraser University I have given lectures and organized seminars, notably for a 4th level Biodiversity course (Fall semesters 2001 and 2004), a 1st level Introduction to Biology course (Fall semester 2001) and a 4th level Evolution course (Spring semester 2002).

For an international master course on phylogenetics I developed and taught a web-based lab on the systematics of lemurs at the Zoological Museum of Amsterdam (Fall semester 1999).

Services to the scientific community

Below are some of the things I've volunteered to do over the last few years:

  • Review Editor of Frontiers in Bioinformatics and Computational Biology
  • Programme committee member IEEE e-Science 2011 Workshop on Interoperability in Scientific Computing, Stockholm, 5th December 2011
  • Board member of the Phyloinformatics Research Foundation.
  • Co-PI (with Arlin Stoltzfus and Enrico Pontelli) of the NESCent Hackathons, Interoperability, Phylogenetics working group and conference series (2010-2012)
  • Co-PI (with Arlin Stoltzfus) of the NESCent Evolutionary Informatics working group and conference series (2007-2009)
  • Co-organizer (with Mark Holder, Arlin Stoltzfus, Jason Stajich, Aaron Mackey, Hilmar Lapp and Todd Vision) of the 1st NESCent Phyloinformatics Hackathon (2006).
  • Co-chair (with Patrik Nosil), organizing committee for the 24th Annual Pacific Ecology and Evolution Conference (2003).
  • Webmaster for Scientists For Species, which successfully lobbied for a strengthened Species At Risk Act (SARA; passed by Canadian Senate December 16th, 2002).
  • Guest speaker for Shad Valley on Human Evolution and on Phylogenetics.
  • Member of the Doctoral Students Advisory Group for Burnaby Mountain College.
  • Co-organizer departmental seminar series, Simon Fraser University.
  • Organizer of SFU seminar series on Perl programming for biologists.
  • Reviewer for Phylogenetic supertrees: Combining information to reveal the Tree of Life, The American Naturalist, Systematic Biology, Evolution, Bioinformatics, iEvoBio, BMC Bioinformatics, Molecular Phylogenetics and Evolution, Evolutionary Genomics: statistical and computational methods and the Journal of Biomedical Semantics.

My affiliations

Through my work I am affiliated in one form or another with various research projects and research institutes. Among these are currently:

Past affiliations include:

  • Reading Evolutionary Biology Group, Mark Pagel's lab at the University of Reading, where I have done my Marie Curie Fellowship.
  • The CyberInfrastructure for Phylogenetic Research (CIPRES), a research project dedicated to developing - among other things - a distributed architecture for phylogenetic inference. I contributed to this project by developing a client side and server side API to communicate through the architecture in perl.
  • The FAB* lab, a collective of labs at Simon Fraser University whose joint focus is on organic evolution. FAB* stands for Felix Breden, Arne Mooers, Bernie Crespi and Mike Hart, the main labs involved. I did my doctoral research in the FAB* lab - Arne Mooers was my thesis adviser.
  • Interdisciplinary Research in the Mathematical and Computational Sciences (IRMACS), a research school at Simon Fraser University. IRMACS is where I defended my dissertation - it has great videoconferencing facilities. Most of Arne Mooers's graduate students now hold office there.