Tag Apache Spark

Is Scala a better choice than Python for Apache Spark?

From Marcin Mejran’s answer via Quora: If you mean the API then it depends. First of all, performance won’t most likely matter since it’s almost all Scala under the hood for Spark and you can always use more machines to make up for anything else. Learning curves can be overcome and Spark’s Scala API is rather simple. Ease of use is a toss up honestly and probably the key point to

Continue reading…

MemSQL with Nikita Shamgunov

http://traffic.libsyn.com/sedaily/memsql_nikita_2.mp3Podcast: Play in new window | DownloadMemSQL is a high-performance, in-memory database that combines the horizontal scalability of distributed systems with the familiarity of SQL. Nikita Shamgunov is co-founder and CTO of MemSQL. Questions What types of data does a user want to keep on disk versus on an in-memory database? How does MemSQL compare to MySQL? How do MemSQL users leverage Apache Spark? How does a user onboard with

Continue reading…