Big Data Processing, Apache Spark: Design Patterns
In my previous post, I demonstrated how Spark creates and serializes tasks. In this post, I show how to utilize this knowledge to construct Spark applications in a maintainable and upgradable way, where at the same time “task not serializable” exceptions are avoided. When I participated in a big data project, I needed to program … Read more