Case study: Workshop machine learning with Apache Spark
Research datasets are growing rapidly in size and complexity. This makes it difficult to process and edit them with standard applications and systems. Opensource applications such as Hadoop and Apache Spark offer an interesting alternative to data-intensive research. SURFsara has developed a workshop to make this technology accessible.