We installed some Big Data components like Apache Hadoop, Spark, and Kafka differents virtual machines. To manage those VMs on production environment (some physical servers with their local storage and without SAN storage), I want to use OpenStack. Reading OpenStack documents I figure out that it is created of many different components each for a specific purpose. In addition, OpenStack needs some mandatory separate nodes like controller, compute and network (I'm not sure about the network node!). My questions are:
- What OpenStack components are needed for Big Data deployment?
- How many separate physical node (controller, compute and network) OpenStack needs for running on production (except resources node)?
- Can we run OpenStack in virtual machines like VirtualBox, just for test?