
java - org.apache.hadoop.io.Text cannot be cast to org.apache.hive.hcatalog.data.HCatRecord

Reposted · Author: 行者123 · Updated: 2023-12-02 22:05:19

I wrote a script that fetches data from HBase, parses it, and then saves it into Hive, but I am getting this error:

org.apache.hadoop.mapred.YarnChild: Exception running child : java.lang.ClassCastException: org.apache.hadoop.io.Text cannot be cast to org.apache.hive.hcatalog.data.HCatRecord
at org.apache.hive.hcatalog.mapreduce.FileRecordWriterContainer.write(FileRecordWriterContainer.java:53)
at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.write(ReduceTask.java:558)
at org.apache.hadoop.mapreduce.task.TaskInputOutputContextImpl.write(TaskInputOutputContextImpl.java:89)
at org.apache.hadoop.mapreduce.lib.reduce.WrappedReducer$Context.write(WrappedReducer.java:105)
at org.apache.hadoop.mapreduce.Reducer.reduce(Reducer.java:150)
at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:171)
at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:627)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)

I know the problem is some silly mismatch between the reducer key/value types and job.setOutputKeyClass / job.setOutputValueClass, but I cannot find it :(. Please help me; here is my code:
public class DumpProductViewsAggHive extends Configured implements Tool {

    public static enum LOCAL_COUNTER {
        IGNORED, VALID, INVALID
    }

    private static final String NAME = "DumpProductViewsAggHive"; //Change the name of the job here
    private static final String SEPARATOR = "/t"; //Change the separator here

    private String dateFrom;   //Start date - HBase MR applicable
    private String dateTo;     //Ending date - HBase MR applicable
    private String fileOutput; //output file path
    private String table = "we_json"; //default HBase table
    private int caching = 500;        //default HBase caching

    /**
     * Map phase HBase
     */
    public static class MapHBase extends TableMapper<Text, Text> {
        private Text key_out = new Text();
        private Text value_out = new Text();

        private JSONParser parser = new JSONParser();
        private DateFormat formatter = new SimpleDateFormat("yyyyMMdd");
        private String day;
        private Date date = new Date();
        private Double ts = new Double(0);

        public void map(ImmutableBytesWritable row, Result value,
                Context context) throws IOException, InterruptedException {

            String b = new String(value.getValue(Bytes.toBytes("d"),
                    Bytes.toBytes("j")));
            JSONObject obj;

            try {
                obj = (JSONObject) parser.parse(b);
                if (obj.get("e").equals("pview_bcn")) {
                    ts = Double.parseDouble(obj.get("ts").toString());
                    ts = ts * 1000;
                    date.setTime(Math.round(ts));
                    day = formatter.format(date);

                    key_out.set(obj.get("sid").toString());
                    value_out.set(obj.get("variant_id") + SEPARATOR + obj.get("shop")
                            + SEPARATOR + obj.get("status") + SEPARATOR + day
                            + SEPARATOR + "D");
                    context.getCounter(LOCAL_COUNTER.VALID).increment(1);
                    context.write(key_out, value_out);
                } else {
                    context.getCounter(LOCAL_COUNTER.IGNORED).increment(1);
                }
            } catch (Exception pe) {
                // ignore value
                context.getCounter(LOCAL_COUNTER.INVALID).increment(1);
                return;
            }
        }
    }

    /**
     * Reduce phase
     */
    public static class Reduce extends Reducer<Text, Text, NullWritable, HCatRecord> {

        public void reduce (Iterable<Text> key, Text value, Context context)
                throws IOException, InterruptedException {

            Set<Text> sidSet = new HashSet<Text>();
            while (key.iterator().hasNext()) {
                sidSet.add(key.iterator().next());
            }
            String[] tokens = value.toString().split(SEPARATOR);

            HCatRecord record = new DefaultHCatRecord(6);
            record.set(0, tokens[0].toString());
            record.set(1, tokens[1].toString());
            record.set(2, tokens[2].toString());
            record.set(3, tokens[3].toString());
            record.set(4, tokens[4].toString());
            record.set(5, sidSet.size());
            context.write(NullWritable.get(), record);
        }
    }

    public void getParams(String[] otherArgs) throws ParseException {
        DateFormat formatter = new SimpleDateFormat("yyyyMMdd");
        Calendar cal = Calendar.getInstance();
        int i = 0;

        /*
         * Loop parameters
         */
        while (i < otherArgs.length) {
            // get parameter -d query only one day. HBase applicable.
            if (otherArgs[i].equals("-d")) {
                cal.setTime(formatter.parse(otherArgs[++i]));
                dateFrom = Long.toHexString(cal.getTimeInMillis() / 1000);
                cal.add(Calendar.DATE, 1);
                dateTo = Long.toHexString(cal.getTimeInMillis() / 1000);
                System.out.println("Day translated to start: " + dateFrom + "; End: " + dateTo);
            }
            // get start date -f parameter. HBase applicable.
            if (otherArgs[i].equals("-f")) {
                cal.setTime(formatter.parse(otherArgs[++i]));
                dateFrom = Long.toHexString(cal.getTimeInMillis() / 1000);
                System.out.println("From: " + dateFrom);
            }
            // get end date -t parameter. HBase applicable.
            if (otherArgs[i].equals("-t")) {
                cal.setTime(formatter.parse(otherArgs[++i]));
                dateTo = Long.toHexString(cal.getTimeInMillis() / 1000);
                System.out.println("To: " + dateTo);
            }

            // get output folder -o parameter.
            if (otherArgs[i].equals("-o")) {
                fileOutput = otherArgs[++i];
                System.out.println("Output: " + fileOutput);
            }

            // get caching -c parameter. HBase applicable.
            if (otherArgs[i].equals("-c")) {
                caching = Integer.parseInt(otherArgs[++i]);
                System.out.println("Caching: " + caching);
            }

            // get table name -tab parameter. HBase applicable.
            if (otherArgs[i].equals("-tab")) {
                table = otherArgs[++i];
                System.out.println("Table: " + table);
            }

            i++;
        }
    }

    /**
     *
     * @param fileInput
     * @param dateFrom
     * @param dateTo
     * @param job
     * @param caching
     * @param table
     * @throws IOException
     */
    public void getInput(String fileInput, String dateFrom, String dateTo, Job job, int caching, String table) throws IOException {
        // If the source is from HBase
        if (fileInput == null) {
            /**
             * HBase source
             */
            // If date is not defined
            if (dateFrom == null || dateTo == null) {
                System.err.println("Start date or End Date is not defined.");
                return;
            }
            System.out.println("HBase table used as a source.");
            Scan scan = new Scan(Bytes.toBytes(dateFrom), Bytes.toBytes(dateTo));
            scan.setCaching(caching);   // set Caching, when the table is small it is better to use bigger number. Default scan is 1
            scan.setCacheBlocks(false); // do not set true for MR jobs
            scan.addColumn(Bytes.toBytes("d"), Bytes.toBytes("j"));

            TableMapReduceUtil.initTableMapperJob(
                    table,          //name of table
                    scan,           //instance of scan
                    MapHBase.class, //mapper class
                    Text.class,     //mapper output key
                    Text.class,     //mapper output value
                    job);
        }
    }

    /**
     * Tool implementation
     */
    @SuppressWarnings("deprecation")
    @Override
    public int run(String[] args) throws Exception {

        // Create configuration
        Configuration conf = this.getConf();
        String databaseName = null;
        String tableName = "test";

        // Parse arguments
        String[] otherArgs = new GenericOptionsParser(conf, args).getRemainingArgs();
        getParams(otherArgs);

        // It is better to specify zookeeper quorum in CLI parameter -D hbase.zookeeper.quorum=zookeeper servers
        conf.set("hbase.zookeeper.quorum",
                "cz-dc1-s-132.mall.local,cz-dc1-s-133.mall.local,"
                + "cz-dc1-s-134.mall.local,cz-dc1-s-135.mall.local,"
                + "cz-dc1-s-136.mall.local");

        // Create job
        Job job = Job.getInstance(conf, NAME);
        job.setJarByClass(DumpProductViewsAggHive.class);

        // Setup MapReduce job
        job.setReducerClass(Reducer.class);
        //job.setNumReduceTasks(0); // If reducer is not needed

        // Specify key / value
        job.setOutputKeyClass(NullWritable.class);
        job.setOutputValueClass(DefaultHCatRecord.class);

        // Input
        getInput(null, dateFrom, dateTo, job, caching, table);

        // Output
        // Ignore the key for the reducer output; emitting an HCatalog record as value
        job.setOutputFormatClass(HCatOutputFormat.class);

        HCatOutputFormat.setOutput(job, OutputJobInfo.create(databaseName, tableName, null));
        HCatSchema s = HCatOutputFormat.getTableSchema(job);
        System.err.println("INFO: output schema explicitly set for writing:" + s);
        HCatOutputFormat.setSchema(job, s);

        // Execute job and return status
        return job.waitForCompletion(true) ? 0 : 1;
    }

    /**
     * Main
     * @param args
     * @throws Exception
     */
    public static void main(String[] args) throws Exception {
        int res = ToolRunner.run(new Configuration(), new DumpProductViewsAggHive(), args);
        System.exit(res);
    }

}

Best Answer

Similarly to the question I answered a few minutes ago, you have defined the reducer incorrectly. It should be:

@Override
public void reduce (Text key, Iterable<Text> values, Context context)
throws IOException, InterruptedException

Please use the @Override annotation so that the compiler catches this mistake for you.
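Because the posted reduce(Iterable&lt;Text&gt;, Text, Context) does not match the framework's signature, it never overrides Reducer.reduce(); the inherited identity implementation runs instead and writes the mapper's Text values straight to HCatOutputFormat, which is what produces the ClassCastException. For illustration only, here is a minimal sketch of the reducer with the corrected signature. It reuses the field layout from the question's code, but the actual aggregation (what to count per sid) is an assumption, not a verified fix:

    // Sketch only: corrected signature; aggregation logic is assumed, adapt to the real requirements.
    public static class Reduce extends Reducer<Text, Text, NullWritable, HCatRecord> {

        @Override
        public void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {

            // Count the value records seen for this sid and keep the last one for the output fields.
            int viewCount = 0;
            String[] tokens = null;
            for (Text value : values) {
                tokens = value.toString().split(SEPARATOR);
                viewCount++;
            }
            if (tokens == null || tokens.length < 5) {
                return; // nothing usable to emit for this key
            }

            HCatRecord record = new DefaultHCatRecord(6);
            record.set(0, tokens[0]);
            record.set(1, tokens[1]);
            record.set(2, tokens[2]);
            record.set(3, tokens[3]);
            record.set(4, tokens[4]);
            record.set(5, viewCount);
            context.write(NullWritable.get(), record);
        }
    }

Note also that the run() method in the question registers the base class with job.setReducerClass(Reducer.class); it needs to reference the custom class, job.setReducerClass(Reduce.class), otherwise the identity reducer is used regardless of how the reduce() signature is written.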

Regarding "java - org.apache.hadoop.io.Text cannot be cast to org.apache.hive.hcatalog.data.HCatRecord", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/24575639/
