They dined on mince, and slices of quince,
Which they ate with a runcible spoon
from the poem by Edward Lear, "The Owl and the Pussy-Cat", which the reader is invited to ponder at leisure.
Meanwhile, as a Big Data trainer, I often need to create what I would call a "stoppable cluster" on AWS, one that I can "pause," or put to sleep for a while. The most obvious use of it is saving money while the students are away, so that instead of $100/day, I would pay $33 per day. That would be reason enough. However, at times, as a developer, I want to stop the cluster that I am running.
If you look in the literature, it will cite two obstacles:
- Ephemeral nodes disappear on stop/start on AWS; and
- IP assignments change.
You can fix both by (1) using root drive and EBS drives; and (2) assigning elastic IPs. Amazon will not let you use more than 5 elastic IPs, but you can call them and ask nicely, and they will give you 10.
Next, Hortonworks Ambari will check the cluster IP assignment and refuse to use the external IPs, even though Amazon promises you the right resolution:
We resolve an external DNS hostname to the public IP address of the instance outside the network of the instance, and to the private IP address of the instance from within the network of the instance.
alas, CM has resolved the IPs to the old internal ones!! And used that in the configuration.
Next installment - constructing proper clusters in the VPC and controlling the internal IP assignments.