java - Compare two files and remove duplicates to merge only new content

Reposted by: 行者123  Updated: 2023-12-01 10:45:04

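I have two .ckl files that I am trying to read and compare, then append only the new information to the end and produce a new .ckl. There is a problem with the logic somewhere, because it just merges the two files without removing any duplicates.

My code is below...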

package hellockl;

import java.io.*;
import java.util.ArrayList;
import java.util.List;
import java.util.Scanner;


public class HelloCKL {

    /**
     * @param args the command line arguments
     */
    public static void main(String[] args) throws IOException {
        // TODO code application logic here
        String sourceFile1Path = "test.ckl";
        String sourceFile2Path = "test2.ckl";

        String mergedFilePath = "merged_test.ckl";

        // NEW
        ArrayList<String> list = makeVulnList(sourceFile1Path);

        File[] files = new File[2];
        files[0] = new File(sourceFile1Path);
        files[1] = new File(sourceFile2Path);

        File mergedFile = new File(mergedFilePath);

        mergeFiles(files, mergedFile, list);
    }

    public static void mergeFiles(File[] files, File mergedFile, ArrayList<String> list) {

        FileWriter fstream = null;
        BufferedWriter out = null;
        try {
            fstream = new FileWriter(mergedFile, true);
            out = new BufferedWriter(fstream);
        } catch (IOException e1) {
            e1.printStackTrace();
        }

        for (File f : files) {
            System.out.println("merging: " + f.getName());
            FileInputStream fis;
            try {
                fis = new FileInputStream(f);
                BufferedReader in = new BufferedReader(new InputStreamReader(fis));

                String aLine;
                while ((aLine = in.readLine()) != null) {
                    // NEW
                    if (aLine.equals("<VULN>")) {
                        // save the lines from here til ATTRIBUTE_DATA
                        aLine += in.readLine();
                        aLine += in.readLine();
                        // grab the line that would have the name
                        String nameLine = in.readLine();
                        if (list.contains(nameLine)) {
                            // need to advance the reader past the end of this VULN
                            while (!(aLine.equals("</VULN>"))) {
                                aLine = in.readLine();
                            }
                            continue; // this would skip the writing out to file part
                        }
                        aLine += nameLine; // concat this and go on as usual
                    }
                    // END NEW
                    out.write(aLine);
                    out.newLine();
                }

                in.close();
            } catch (IOException e) {
                e.printStackTrace();
            }
        }

        try {
            out.close();
        } catch (IOException e) {
            e.printStackTrace();
        }

    }

    // This should build a list of lines that hold the vulnerability names. It looks for a
    // <VULN> tag using a Scanner and saves the <ATTRIBUTE_DATA>V - 3</ATTRIBUTE_DATA> line
    // into an ArrayList. Then we can use that list to check whether another line
    // already exists.
    public static ArrayList<String> makeVulnList(String sourceFile1Path) {
    //    System.out.println("IN MAKE VULN LIST");
        ArrayList<String> list = new ArrayList<String>();
        Scanner scanner = new Scanner(sourceFile1Path);
        while (scanner.hasNextLine()) {
            String line = scanner.nextLine();
            System.out.println("on line: " + line);
            if (line.equals("<VULN>")) {
                System.out.println("match!!! : " + line);
                line = scanner.nextLine();
                line = scanner.nextLine();
                line = scanner.nextLine();
                list.add(line);
                System.out.println("adding to list : " + line);
            }
        }
        return list;
    }
}

The .ckl files are not exactly short, but I can attach one if it would help.

Thanks.

Best answer

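I suggest reading all the lines from both files and adding them to a Set (a Set does not allow duplicate items). Then iterate over the Set and write its items to the merged file.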
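The Set-based suggestion could be sketched roughly as follows. This is only a minimal illustration, not the answerer's code: the class name SetMergeSketch and both method names are hypothetical, UTF-8 encoding is an assumption, and a LinkedHashSet is used (rather than a plain HashSet) so that the first occurrence of each line keeps its original position.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper class; the names here are not from the original post.
final class SetMergeSketch {

    // Collects the lines of every source in order and keeps only the
    // first occurrence of each distinct line.
    static List<String> uniqueLines(List<List<String>> sources) {
        Set<String> seen = new LinkedHashSet<>(); // preserves insertion order
        for (List<String> lines : sources) {
            seen.addAll(lines);
        }
        return new ArrayList<>(seen);
    }

    // Reads each source file (assumed to be UTF-8), de-duplicates line by
    // line, and writes the result, replacing any existing merged file.
    static void mergeFiles(List<Path> sources, Path merged) throws IOException {
        List<List<String>> contents = new ArrayList<>();
        for (Path source : sources) {
            contents.add(Files.readAllLines(source, StandardCharsets.UTF_8));
        }
        Files.write(merged, uniqueLines(contents), StandardCharsets.UTF_8);
    }
}
```

With the file names from the question, a call might look like SetMergeSketch.mergeFiles(Arrays.asList(Paths.get("test.ckl"), Paths.get("test2.ckl")), Paths.get("merged_test.ckl")). Note that this removes duplicates per line, so structural lines that legitimately repeat (for example every </VULN> closing tag) are collapsed as well; for a .ckl it may work better to apply the same Set idea to whole <VULN> blocks, keyed by the name line. Separately, new Scanner(sourceFile1Path) in the question scans the characters of the path string itself rather than the file's contents, so the list it builds stays empty and nothing is skipped; new Scanner(new File(sourceFile1Path)) would read the file.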

Regarding java - Compare two files and remove duplicates to merge only new content, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/34226553/
