I’m trying to sift through information on technologies. I have data stored in S3 that I want to analyze using EMR. However, when I try to research the pros and cons of Presto, Hive, Spark, or any other technology, I end up drowning in company-sponsored benchmark reports or papers written by people with clear biases.

So, my ask: Am I better off just experimenting with each tool, or do you have any suggested resources that offer opinions with substance, and not just marketing buzzwords?
