🔗 Why open data growth At rOpenSci we try to make it easier for people to use open data and contribute open data to the community. The question often arises: How much open data do we have? Another angle on this topic is: How much is open data growing? We provide access to dozens of data respositories through our various packages. We asked many of them to share numbers on the amount of data they have, and if possible, growth of their data holdings through time....
You only know two things about Docker. First, it uses Linux containers. Second, the Internet won’t shut up about it. – attributed to Solomon Hykes, Docker CEO 🔗 So what is Docker? Docker is a relatively new open source application and service, which is seeing interest across a number of areas. It uses recent Linux kernel features (containers, namespaces) to shield processes. While its use (superficially) resembles that of virtual machines, it is much more lightweight as it operates at the level of a single process (rather than an emulation of an entire OS layer)....
I’m very pleased to announce that rOpenSci has signed a comprehensive fiscal sponsorship agreement with the NumFocus foundation, a 501(c)3 nonprofit that supports R&D for open source scientific software projects. We are delighted to be in the company of esteemed projects such as IPython and Julia that share our goal of promoting reproducible research practices across many scientific communities and developing a rich ecosystem of tools for open scientific computing....
The week after labor day, we had the pleasure of attending the NCEAS open science codefest event in Santa Barbara. It was great to meet folks like the new arrivals at the expanding Mozilla Science Lab, Bill Mills and Abby Cabunoc (Bill even already has a great post up about the codefest), and see old friends from NCEAS and DataONE, among many more. This 2.5 day event ran smoothly thanks to the leadership of Matt Jones....
The Open Tree of Life project aims to synthesize our combined knowledge of how organisms relate to each other, and make the results available to anyone who wants to use them. At present, the project contains data from more than 4,000 published phylogenies, which combine with other data sources to make a tree that covers 2.5 million species. In September, the Open Tree of Life team are holding a hackathon to develop tools that use the project’s web services to extract, annotate and add data....