We've got Mesos/Storm deployed via Marathon and it's been working quite nicely, but when the scheduler fails or is killed, Storm is brought back up fresh, without any of its topologies.
How do others handle this? We ended up writing Marathon app definitions for each topology that check with Storm whether that topology is running and submit it if it's not (see the sketch below), but it feels clunky. Is there a simpler way people are using to bootstrap Mesos/Storm with topologies on launch?
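For reference, a minimal sketch of the kind of per-topology bootstrap check we mean. It assumes the Storm UI REST API is reachable at its `/api/v1/topology/summary` endpoint; the topology name, jar path, main class, and UI host below are placeholders, not anything this repo ships:

```python
#!/usr/bin/env python3
"""Bootstrap task run by a Marathon app: submit a topology if it is not already running."""
import json
import subprocess
import time
from urllib.request import urlopen

STORM_UI = "http://storm-ui.example.com:8080"        # placeholder
TOPOLOGY_NAME = "my-topology"                         # placeholder
TOPOLOGY_JAR = "/opt/topologies/my-topology.jar"      # placeholder
MAIN_CLASS = "com.example.MyTopology"                 # placeholder


def topology_is_running(name):
    """Return True if a topology with this name appears in the Storm UI summary."""
    with urlopen(STORM_UI + "/api/v1/topology/summary") as resp:
        summary = json.load(resp)
    return any(t.get("name") == name for t in summary.get("topologies", []))


def submit_topology():
    """Shell out to the storm CLI to submit the topology."""
    subprocess.check_call(["storm", "jar", TOPOLOGY_JAR, MAIN_CLASS, TOPOLOGY_NAME])


if __name__ == "__main__":
    if not topology_is_running(TOPOLOGY_NAME):
        submit_topology()
    # Sleep so Marathon sees a long-running task instead of restarting the
    # check in a tight loop after it exits.
    while True:
        time.sleep(3600)
```

The trailing sleep (or some periodic re-check) is only there because Marathon restarts finished tasks; that's part of why the whole approach feels clunky.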
Currently I deploy the Storm/Mesos Nimbus on a dedicated node. Others use external volumes and reservations with Marathon, because Nimbus stores some of its state locally (including the frameworkID, which should really be fixed). With HA Nimbus in 1.0.0 there may be better options to explore as we start looking at upgrading the framework.
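A rough sketch of the volumes-with-Marathon variant, assuming Marathon 1.x local persistent volumes (`persistent` volume plus `residency`) and posting to Marathon's `/v2/apps` endpoint. The command, paths, and resource sizes are placeholders, not a recommended configuration for this framework:

```python
#!/usr/bin/env python3
"""Register a Nimbus app in Marathon with a local persistent volume for its state dir."""
import json
from urllib.request import Request, urlopen

MARATHON = "http://marathon.example.com:8080"  # placeholder

nimbus_app = {
    "id": "/storm/nimbus",
    "cmd": "bin/storm nimbus",                 # placeholder launch command
    "cpus": 1.0,
    "mem": 2048,
    "instances": 1,
    "container": {
        "type": "MESOS",
        "volumes": [
            {
                # Point storm.local.dir at this path so the frameworkID and
                # submitted-topology state survive task restarts.
                "containerPath": "storm-local",
                "mode": "RW",
                "persistent": {"size": 1024},
            }
        ],
    },
    # Keep the task pinned to its reserved resources/volume if it is lost.
    "residency": {"taskLostBehavior": "WAIT_FOREVER"},
    "upgradeStrategy": {"minimumHealthCapacity": 0, "maximumOverCapacity": 0},
}

req = Request(
    MARATHON + "/v2/apps",
    data=json.dumps(nimbus_app).encode(),
    headers={"Content-Type": "application/json"},
)
urlopen(req)
```

The persistent volume effectively pins Nimbus to one agent, which is roughly equivalent to the dedicated-node approach but managed through Marathon.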
erikdw changed the title from "Persisting topologies through Mesos/Storm migrations" to "Persisting topologies upon MesosNimbus restart when using Marathon" on Nov 6, 2016
This relates to #173 and #174. We need to document the recommended approach for using storm-mesos with Marathon. That will take a bit of time as @JessicaLHartog and I look into this.