What is Apache Hadoop Submarine?

Submarine is a new subproject of Apache Hadoop.

Submarine allows infrastructure engineers and data scientists to run deep learning applications (TensorFlow, PyTorch, etc.) on a resource management platform such as YARN.

Run on existing cluster

Submarine supports YARN as its resource scheduling framework, with support for other projects such as Kubernetes planned for the future.

Various frameworks

Multiple machine learning frameworks are supported, such as TensorFlow, PyTorch, and MXNet.

Everything about ML

Submarine is not just a machine learning engine.

It covers the entire machine learning process: algorithm development, batch model training, incremental model training, online model serving, and model management.

Questions and Answers

Regarding developer/user resources such as the mailing list, community calls, and how to contribute

Please refer to the wiki for all of these questions.