Parallel execution for h2o AutoML?

Question

Is it possible to kick off h2o's AutoML to train and evaluate models in parallel either locally or on a cluster, using something like joblib or dask?

You may have trouble finding someone who knows both H2O and Dask well enough to answer this. Perhaps you should try something and report back on the specific problems you encounter. - mdurant

Tom Tom · Accepted Answer · 2018-07-30T18:50:24

Closing out this question, since it has migrated and more appropriately belongs elsewhere (i.e. the Google Group). Here's where the topic can be found. Also, a reference to the H2O posting guidelines is here.

I'll recap the highlights here:

Nothing is currently known to do this out of the box, but:
H2O builds one model at a time, although building each model is itself a parallel operation.
You can potentially parallelize by pointing multiple API clients at the same cluster, with each client operating on different entries in the search space.
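
As a rough illustration of the "multiple API clients" idea, here is a minimal sketch, not a tested recipe. It assumes each client is a separate Python process that is given a hypothetical client index, takes its own slice of a hyperparameter grid, and connects to an H2O cluster that is already running and already holds the training frame. The grid values, the cluster URL, the frame ID, and the helper names (`expand_grid`, `entries_for_client`, `train_entries`) are all assumptions made for illustration, and GBM is used only as an example algorithm. Whether this actually improves throughput depends on the cluster, since each model build is already parallel.

```python
from itertools import product


def expand_grid(grid):
    """Turn {"param": [values...]} into a list of parameter dicts (one per combination)."""
    keys = sorted(grid)
    return [dict(zip(keys, values)) for values in product(*(grid[k] for k in keys))]


def entries_for_client(entries, client_index, n_clients):
    """Round-robin split: client i takes entries i, i + n_clients, i + 2 * n_clients, ..."""
    return entries[client_index::n_clients]


def train_entries(entries, cluster_url, frame_id, response):
    """Train one GBM per entry on an existing cluster (assumed to be running already)."""
    # Imported here so the partitioning helpers above work without h2o installed.
    import h2o
    from h2o.estimators import H2OGradientBoostingEstimator

    h2o.connect(url=cluster_url)      # attach to the shared cluster, do not start a new one
    frame = h2o.get_frame(frame_id)   # frame is assumed to be loaded on the cluster already
    predictors = [c for c in frame.columns if c != response]

    results = []
    for params in entries:
        model = H2OGradientBoostingEstimator(**params)
        model.train(x=predictors, y=response, training_frame=frame)
        results.append((params, model.model_id))
    return results
```

Each client would then run something like `train_entries(entries_for_client(expand_grid(grid), i, n), url, frame_id, response)` with its own index `i`, for example from separate terminals or separate joblib/dask workers. The round-robin split guarantees that no two clients train the same entry.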
