spring - How to read only a subset of columns from a CSV file using Spring Batch FlatFileItemReader?

Reposted · Author: 行者123 · Updated: 2023-12-02 20:13:47

I have a reader that knows how to read a CSV file with 14 columns. I want it to accept files with many more columns (~500) and read only those 14. I understand the solution should involve a FieldSetMapper (according to this question: read only selective columns from csv file using spring batch), but I could not find a suitable example. Here is my current reader:

@Bean
public FlatFileItemReader<RowInput> csvRowsReader() {
    FlatFileItemReader<RowInput> reader = new FlatFileItemReader<>();
    Resource resource = new FileSystemResource(new File(FileManager.getInstance().getInputFileLocation()));
    reader.setResource(resource);

    reader.setLinesToSkip(1);
    reader.setLineMapper(new DefaultLineMapper<RowInput>() {{
        setLineTokenizer(new DelimitedLineTokenizer() {{
            setNames(new String[]{"Field_1", "Field_2", "Field_3", "Field_4", "Field_5",
                    "Field_6", "Field_7", "Field_8", "Field_9", "Field_10", "Field_11",
                    "Field_12", "Field_13", "Field_14"});
        }});
        setFieldSetMapper(new BeanWrapperFieldSetMapper<RowInput>() {{
            setTargetType(RowInput.class);
        }});
    }});

    return reader;
}

The exception is:

Caused by: org.springframework.batch.item.file.transform.IncorrectTokenCountException: Incorrect number of tokens found in record: expected 14 actual 544

The FieldSetMapper I tried to use:

public class InputFieldSetMapper implements FieldSetMapper<RowInput> {

    public RowInput mapFieldSet(FieldSet fs) {
        if (fs == null) {
            return null;
        }

        RowInput input = new RowInput();
        input.setField1(fs.readString("Field_1"));
        input.setField2(fs.readString("Field_2"));
        // and so on...

        return input;
    }
}
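For reference, a hand-written mapper like this would be wired into the reader in place of the BeanWrapperFieldSetMapper; a sketch of that wiring (the `tokenizer` variable is assumed to be the same DelimitedLineTokenizer configured in the reader above). Note that a mapper alone does not fix the token-count mismatch, because the IncorrectTokenCountException is thrown while tokenizing the line, before any FieldSetMapper runs:

```java
// Configuration fragment only (depends on the reader above, not self-contained):
// use the hand-written mapper instead of BeanWrapperFieldSetMapper.
reader.setLineMapper(new DefaultLineMapper<RowInput>() {{
    setLineTokenizer(tokenizer);                  // assumed: the tokenizer configured above
    setFieldSetMapper(new InputFieldSetMapper()); // custom mapper from this question
}});
```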

Best Answer

You need to set the includedFields property on the LineTokenizer to specify which fields to include when parsing the input file. In your case, it would look like this:

@Bean
public FlatFileItemReader<RowInput> csvRowsReader() {
    FlatFileItemReader<RowInput> reader = new FlatFileItemReader<>();
    Resource resource = new FileSystemResource(new File(FileManager.getInstance().getInputFileLocation()));
    reader.setResource(resource);

    reader.setLinesToSkip(1);
    reader.setLineMapper(new DefaultLineMapper<RowInput>() {{
        setLineTokenizer(new DelimitedLineTokenizer() {{
            setNames(new String[]{"Field_1", "Field_2", "Field_3", "Field_4", "Field_5",
                    "Field_6", "Field_7", "Field_8", "Field_9", "Field_10", "Field_11",
                    "Field_12", "Field_13", "Field_14"});
            setIncludedFields(0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13);
        }});
        setFieldSetMapper(new BeanWrapperFieldSetMapper<RowInput>() {{
            setTargetType(RowInput.class);
        }});
    }});

    return reader;
}
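Conceptually, includedFields tells the tokenizer to split the whole record and then keep only the listed column indices before the names are applied, so a 544-column record yields exactly 14 tokens. A minimal plain-Java sketch of that filtering (the `tokenizeIncluded` helper is hypothetical, not a Spring Batch API):

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class IncludedFieldsSketch {

    // Hypothetical helper (not a Spring Batch API): split the full record,
    // then keep only the column indices we were asked to include.
    static List<String> tokenizeIncluded(String line, int... includedFields) {
        String[] all = line.split(",");
        return Arrays.stream(includedFields)
                .mapToObj(i -> all[i])
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // A record with more columns than we care about; keep only 0 and 2.
        String line = "1,foo1,bar1,10,extra1,extra2";
        System.out.println(tokenizeIncluded(line, 0, 2)); // [1, bar1]
    }
}
```

With the filtering in place, the token count matches the 14 configured names, so the IncorrectTokenCountException no longer occurs.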

Edit: adding an example with non-sequential fields

import org.springframework.batch.core.Job;
import org.springframework.batch.core.JobParameters;
import org.springframework.batch.core.Step;
import org.springframework.batch.core.configuration.annotation.EnableBatchProcessing;
import org.springframework.batch.core.configuration.annotation.JobBuilderFactory;
import org.springframework.batch.core.configuration.annotation.StepBuilderFactory;
import org.springframework.batch.core.launch.JobLauncher;
import org.springframework.batch.item.ItemWriter;
import org.springframework.batch.item.file.FlatFileItemReader;
import org.springframework.batch.item.file.builder.FlatFileItemReaderBuilder;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.context.ApplicationContext;
import org.springframework.context.annotation.AnnotationConfigApplicationContext;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.core.io.ClassPathResource;

@Configuration
@EnableBatchProcessing
public class MyJob {

    @Autowired
    private JobBuilderFactory jobs;

    @Autowired
    private StepBuilderFactory steps;

    @Bean
    public FlatFileItemReader<Person> itemReader() {
        return new FlatFileItemReaderBuilder<Person>()
                .name("personItemReader")
                .resource(new ClassPathResource("persons.csv"))
                .delimited()
                .includedFields(new Integer[] {0, 2})
                .names(new String[] {"id", "lastName"})
                .targetType(Person.class)
                .build();
    }

    @Bean
    public ItemWriter<Person> itemWriter() {
        return items -> {
            for (Person item : items) {
                System.out.println("person = " + item);
            }
        };
    }

    @Bean
    public Step step() {
        return steps.get("step")
                .<Person, Person>chunk(1)
                .reader(itemReader())
                .writer(itemWriter())
                .build();
    }

    @Bean
    public Job job() {
        return jobs.get("job")
                .start(step())
                .build();
    }

    public static void main(String[] args) throws Exception {
        ApplicationContext context = new AnnotationConfigApplicationContext(MyJob.class);
        JobLauncher jobLauncher = context.getBean(JobLauncher.class);
        Job job = context.getBean(Job.class);
        jobLauncher.run(job, new JobParameters());
    }

    public static class Person {

        String id;
        String firstName;
        String lastName;
        int age;

        public Person() {
        }

        public String getId() {
            return id;
        }

        public void setId(String id) {
            this.id = id;
        }

        public String getFirstName() {
            return firstName;
        }

        public void setFirstName(String firstName) {
            this.firstName = firstName;
        }

        public String getLastName() {
            return lastName;
        }

        public void setLastName(String lastName) {
            this.lastName = lastName;
        }

        public int getAge() {
            return age;
        }

        public void setAge(int age) {
            this.age = age;
        }

        @Override
        public String toString() {
            return "Person{" +
                    "id='" + id + '\'' +
                    ", firstName='" + firstName + '\'' +
                    ", lastName='" + lastName + '\'' +
                    ", age=" + age +
                    '}';
        }
    }

}

The input file persons.csv is as follows:

1,foo1,bar1,10
2,foo2,bar2,20

This example shows how to map only the id and lastName fields.
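To make the column/name pairing concrete, here is a small standalone sketch of what indices {0, 2} and names {"id", "lastName"} do to the first record of persons.csv (the `bind` helper is hypothetical, for illustration only, not a Spring Batch API):

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class NameBindingSketch {

    // Hypothetical illustration (not a Spring Batch API) of the
    // includedFields + names combination: keep the listed column indices
    // and bind them to the given names, in order.
    static Map<String, String> bind(String line, int[] included, String[] names) {
        String[] columns = line.split(",");
        Map<String, String> fieldSet = new LinkedHashMap<>();
        for (int i = 0; i < included.length; i++) {
            fieldSet.put(names[i], columns[included[i]]);
        }
        return fieldSet;
    }

    public static void main(String[] args) {
        // First record of persons.csv from the answer above.
        Map<String, String> fs = bind("1,foo1,bar1,10",
                new int[]{0, 2}, new String[]{"id", "lastName"});
        System.out.println(fs); // {id=1, lastName=bar1}
    }
}
```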

Hope this helps.

About "spring - How to read only a subset of columns from a CSV file using Spring Batch FlatFileItemReader?": we found a similar question on Stack Overflow: https://stackoverflow.com/questions/52869410/

Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号