Submarine is a new subproject of Apache Hadoop.
Submarine allows infrastructure engineers and data scientists to run deep learning applications (TensorFlow, PyTorch, etc.) on a resource management platform such as YARN.
Submarine currently supports YARN as its resource scheduling framework, and support for other frameworks such as Kubernetes is planned for the future.
Multiple machine learning frameworks are supported, such as TensorFlow, PyTorch, and MXNet.
Submarine is not just a machine learning engine.
It covers the entire machine learning process: algorithm development, batch model training, incremental model training, online model serving, and model management.