-
Notifications
You must be signed in to change notification settings - Fork 41
Spark
Apache Spark is an open-source unified analytics engine for large-scale data processing.
https://medium.com/google-cloud/dataproc-spark-cluster-on-gcp-in-minutes-3843b8d8c5f8
https://codelabs.developers.google.com/codelabs/spark-jupyter-dataproc#0
https://www.freecodecamp.org/news/what-is-google-dataproc/
https://cloud.google.com/blog/products/data-analytics/build-limitless-workloads-on-bigquery/
https://cloud.google.com/blog/products/data-analytics/making-serverless-spark-even-more-powerful
https://www.youtube.com/watch?v=IQfG0faDrzE4
https://www.datacamp.com/community/tutorials/apache-spark-tutorial-machine-learning
https://docs.scala-lang.org/tutorials/scala-for-java-programmers.html
https://github.com/apache/spark
https://blog.allegro.tech/2021/06/1-task-2-solutions-spark-or-beam.html
https://data-flair.training/blogs/comparison-apache-flink-vs-apache-spark/
https://ahana.io/learn/comparisons/spark-sql-vs-presto/
https://hudi.apache.org/docs/comparison/
https://github.com/apache/spark/tree/master/examples/src/main