Large-scale scientific computing work often runs on clusters with petabytes of attached storage and specialized networking. Arvados is a free software platform to store and analyze large data sets, emphasizing reproducibility and compatibility across deployments. It's licensed under the GNU Affero General Public License version 3, with SDKs under the Apache License 2.0. This talk will provide a technical introduction to Arvados, describe how research projects like the Personal Genome Project have used it, and suggest other applications.
Speakers: Brett Smith Ward Vandewege