java - 使用线程同时散列(sha1)多个文件-6ren

java - 使用线程同时散列(sha1)多个文件

转载作者：搜寻专家更新时间：2023-11-01 03:55:31

我有N个大文件(不少于250M)要散列。这些文件在 P 物理驱动器上。

我想用最多 K 个 Activity 线程同时对它们进行哈希处理，但我不能对每个物理驱动器进行超过 M 个文件的哈希处理，因为它会减慢整个过程(我运行了一个测试，解析 61 个文件，并使用 8 个线程比 1 个线程慢；文件几乎都在同一个磁盘上)。

我想知道最好的方法是什么:

我可以使用 Executors.newFixedThreadPool(K)
然后我会使用计数器来确定是否应该添加新任务来提交任务。

我的代码是:

int K = 8;
int M = 1;
Queue<Path> queue = null; // get the files to hash
final ExecutorService newFixedThreadPool = Executors.newFixedThreadPool(K);
final ConcurrentMap<FileStore, Integer> counter = new ConcurrentHashMap<>();
final ConcurrentMap<FileStore, Integer> maxCounter = new ConcurrentHashMap<>();
for (FileStore store : FileSystems.getDefault().getFileStores()) {
  counter.put(store, 0);
  maxCounter.put(store, M);
}
List<Future<Result>> result = new ArrayList<>();
while (!queue.isEmpty()) {
  final Path current = queue.poll();
  final FileStore store = Files.getFileStore(current);
  if (counter.get(store) < maxCounter.get(store)) {
    result.add(newFixedThreadPool.submit(new Callable<Result>() {

      @Override
      public Entry<Path, String> call() throws Exception {
        counter.put(store, counter.get(store) + 1);
        String hash = null; // Hash the file
        counter.put(store, counter.get(store) - 1);
        return new Result(path, hash);
      }

    }));
  } else queue.offer(current);
}