Good question. It is used at startups too. Wherever there is a need to handle big data, Hadoop can be used.
So if you have an app or website that deals with a huge amount of data, you will likely need Hadoop. Do comment if you have more doubts.
Big data includes both structured and unstructured data.
It is the volume, variety, velocity, value and veracity of the data that decide whether it counts as big data or not.
I've seen people use HDP (Hortonworks Data Platform) to host their Hadoop cluster.
It is a self-hosted cluster. You can set it up on any cloud service provider, such as GCP, AWS or Azure.
I haven't come across shared hosting for it; let me know if you have used one.
Hope others can also weigh in and share their views and perspectives on this topic.
Thanks for being very informative (about HDP).
I am starting to feel like it might be meant for BI / Analytics / AI / ML. For getting employed, it might be good to learn, but for a small business, it might be better to rely on outsourcing or off-the-shelf software.
Also, I came across the Cassandra vs HBase vs MongoDB comparison. It seems like the HBase / Hadoop ecosystem might be one of the best.
Based on the cost of the software versus the storage needs, we need to weigh the tradeoff and decide whether Hadoop can be employed or not.
HBase is part of Hadoop, as displayed in the Hadoop ecosystem image placed in the article, whereas Cassandra and MongoDB are external storage systems.
Based on the CAP theorem, we need to decide which storage system to use. Thanks for sharing your interesting perspective.