微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

java spark转换算子union、intersection、subtract

/**
 * # _*_ coding:utf-8 _*_
 * # Author:xiaoshubiao
 * # Time : 2020/5/14 8:33
 **/
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class union_test {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setMaster("local[*]").setAppName("union_test");
        JavaSparkContext sc = new JavaSparkContext(conf);
        List<String> list = Arrays.asList("a","b","c");
        List<String> list2 = Arrays.asList("d","e","f","h","a");
        JavaRDD<String> parallelize = sc.parallelize(list, 1);
        JavaRDD<String> parallelize2 = sc.parallelize(list2,1);
        parallelize.union(parallelize2).collect().forEach(x->System.out.println("union-并集结果:"+x));//是并集不去重的结果,在这a会出现两个
        parallelize.intersection(parallelize2).collect().forEach(x->System.out.println("intersection-交集结果:"+x));
        parallelize.subtract(parallelize2).collect().forEach(x->System.out.println("intersection-差集结果:"+x));

    }
}

 

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 [email protected] 举报,一经查实,本站将立刻删除。

相关推荐