spark catalyst 中的strategy 自定义研究
2016 Spark-Summit-EU-talk-by-Herman-van-Hovell 的实例:
val tableA: Dataset[Long] = spark.range(100000000).as('a)
val tableB: Dataset[Long] = spark.range(100000000).as('b)
val result =
tableA.join(tableB, tableA("id") === tableB("id"))
.groupBy()
.count()
result.count()
result.show()
result.explain(
