I have installed DCOS with one agent and 3 masters and tried installing ArangoDB but it is failing to deploy arangodb.
Below is the config seen as per the log.
ArangoDB Image: arangodb/arangodb-mesos:3.0
Mode: cluster
Asynchronous replication flag: 0
SecondariesWithDBservers: 0
CoordinatorsWithDBservers: 0
SecondarySameServer: 0
ArangoDBForcePullImage: 1
ArangoDBPrivilegedImage: 0
Minimal resources agent: mem():2048;cpus():0.25;disk(*):2048
Minimal resources DBserver: mem():4096;cpus():1;disk(*):4096
Minimal resources secondary DBserver: mem():4096;cpus():1;disk(*):4096
Minimal resources coordinator: mem():4096;cpus():1;disk(*):1024
Number of agents in agency: 3
Number of DBservers: 2
Number of coordinators: 2
zookeeper: zk://master.mesos:2181/arangodb3
And below is the error seen in the log file.
0901 07:07:34.769537 23 CaretakerCluster.cpp:422] planned agent instances: 3, running agent instances: 1
I0901 07:07:34.769601 23 Caretaker.cpp:400] Declining offer e2301ebe-fff0-46a5-b71b-ef77b9a7a764-O11
I0901 07:07:37.474743 24 HttpServer.cpp:439] handling http request 'GET /v1/health.json'
I0901 07:07:40.802276 23 CaretakerCluster.cpp:416] And here the offer:
{"id":{"value":"e2301ebe-fff0-46a5-b71b-ef77b9a7a764-O12"},"framework_id":{"value":"37ac79b8-bc37-4493-9558-aa72638290db-0002"},"slave_id":{"value":"37ac79b8-bc37-4493-9558-aa72638290db-S0"},"hostname":"192.168.12.167","url":{"scheme":"http","address":{"hostname":"192.168.12.167","ip":"192.168.12.167","port":5051},"path":"/slave(1)","query":[]},"resources":[{"name":"ports","type":1,"ranges":{"range":[{"begin":1026,"end":2180},{"begin":2182,"end":3887},{"begin":3889,"end":5049},{"begin":5052,"end":8079},{"begin":8082,"end":8180},{"begin":8182,"end":17140},{"begin":17144,"end":32000}]},"role":""},{"name":"disk","type":0,"scalar":{"value":1.17866e+06},"role":""},{"name":"cpus","type":0,"scalar":{"value":7.5},"role":""},{"name":"mem","type":0,"scalar":{"value":12298},"role":""}],"attributes":[],"executor_ids":[]}
I0901 07:07:40.802320 23 CaretakerCluster.cpp:422] planned agent instances: 3, running agent instances: 1
I0901 07:07:40.802383 23 Caretaker.cpp:400] Declining offer e2301ebe-fff0-46a5-b71b-ef77b9a7a764-O12
I believe one agent server is also sufficient. Is it that number of the agents should also be 3 servers ?
Also need to know how to restart the entire cluster and single service if need be ? (Killing processes doesn't seem to be right way)
Can someone suggest what exactly needs to done here...
Thanks in advance!