spark catalyst 中的strategy 自定义研究
2016 Spark-Summit-EU-talk-by-Herman-van-Hovell 的实例:
val tableA: Dataset[Long] = spark.range(100000000).as('a) val tableB: Dataset[Long] = spark.range(100000000).as('b) val result = tableA.join(tableB, tableA("id") === tableB("id")) .groupBy() .count() result.count() result.show() result.explain(