Some nice choices there. In addition have a look at dataiku it will play nicer with spark,r,python and nosql than rapid miner.
Also maybe take a look at some of the newer streaming architectures since you mentioned spark it's worth getting into spark streaming or flink. Key architecture in this area is called the "kappa architecture" give it a Google.
Thanks for sharing!
I'm a Coldfusion evacuee myself. ;-) I've migrated to Elixir.
For some different thoughts on data and database design, have a look at this, it's a long but really great read. It has impacted how I think about data.
We're a place where coders share, stay up-to-date and grow their careers.
We strive for transparency and don't collect excess data.