
java - File operations with Akka Actors

Reposted. Author: 行者123. Updated: 2023-12-02 17:11:41

What advantage does using Akka Actors offer over plain file operations? I tried to measure the time taken to analyze a log file. The task is to find the IP addresses that logged in more than 50 times and print them. The plain file-operation version turned out to be faster than the Akka Actor version. Why is that?

Using plain file operations

public static void main(String[] args) {
    //long startTime = System.currentTimeMillis();
    File file = new File("log.txt");
    Map<String, Long> ipMap = new HashMap<>();

    try {
        FileReader fr = new FileReader(file);
        BufferedReader br = new BufferedReader(fr);
        String line = br.readLine();

        while (line != null) {
            int idx = line.indexOf('-');
            String ipAddress = line.substring(0, idx).trim();
            long count = ipMap.getOrDefault(ipAddress, 0L);
            ipMap.put(ipAddress, ++count);
            line = br.readLine();
        }

        System.out.println("================================");
        System.out.println("||\tCount\t||\t\tIP");
        System.out.println("================================");

        // Closing the BufferedReader also closes the wrapped FileReader
        br.close();

        // Use a LinkedHashMap so the sorted insertion order is preserved
        Map<String, Long> result = new LinkedHashMap<>();

        // Sort by value and put it into the "result" map
        ipMap.entrySet().stream()
                .sorted(Map.Entry.<String, Long>comparingByValue().reversed())
                .forEachOrdered(x -> result.put(x.getKey(), x.getValue()));

        // Print only if count > 50
        result.entrySet().stream().filter(entry -> entry.getValue() > 50).forEach(entry ->
                System.out.println("||\t" + entry.getValue() + " \t||\t" + entry.getKey())
        );

        // long endTime = System.currentTimeMillis();
        // System.out.println("Time: " + (endTime - startTime));

    } catch (IOException e) {
        e.printStackTrace();
    }
}
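As an aside, the counting loop above can be written more defensively with try-with-resources and Map.merge. The following is a minimal, self-contained sketch of the same logic (the class name PlainIpCount and the temp-file demo in main are illustrative, not part of the original question):

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class PlainIpCount {
    // Same per-line counting as the question's code, but the reader is
    // closed automatically even if an IOException is thrown mid-file,
    // and lines without a '-' are skipped instead of crashing.
    public static Map<String, Long> count(Path logFile) throws IOException {
        Map<String, Long> ipMap = new HashMap<>();
        try (BufferedReader br = Files.newBufferedReader(logFile)) {
            String line;
            while ((line = br.readLine()) != null) {
                int idx = line.indexOf('-');
                if (idx != -1) {
                    ipMap.merge(line.substring(0, idx).trim(), 1L, Long::sum);
                }
            }
        }
        return ipMap;
    }

    public static void main(String[] args) throws IOException {
        // Demo with a throwaway log file
        Path tmp = Files.createTempFile("log", ".txt");
        Files.write(tmp, List.of("9.9.9.9 - GET /", "9.9.9.9 - GET /a", "8.8.8.8 - GET /"));
        System.out.println(count(tmp));
        Files.delete(tmp);
    }
}
```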

Using Actors:
1. The Main Class
public static void main(String[] args) {
    long startTime = System.currentTimeMillis();

    // Create the actor system
    ActorSystem akkaSystem = ActorSystem.create("akkaSystem");

    // Create the first actor based on the specified class
    ActorRef coordinator = akkaSystem.actorOf(Props.create(FileAnalysisActor.class));

    // Create a message containing the file path
    FileAnalysisMessage msg = new FileAnalysisMessage("log.txt");

    // Send a message to start processing the file. 'ask' is asynchronous
    // and returns a Future that completes within the given timeout.
    Timeout timeout = new Timeout(6, TimeUnit.SECONDS);
    Future<Object> future = Patterns.ask(coordinator, msg, timeout);

    // Process the results
    final ExecutionContext ec = akkaSystem.dispatcher();
    future.onSuccess(new OnSuccess<Object>() {
        @Override
        public void onSuccess(Object message) throws Throwable {
            if (message instanceof FileProcessedMessage) {
                printResults((FileProcessedMessage) message);

                // Stop the actor system
                akkaSystem.shutdown();
            }
        }

        private void printResults(FileProcessedMessage message) {
            System.out.println("================================");
            System.out.println("||\tCount\t||\t\tIP");
            System.out.println("================================");

            Map<String, Long> result = new LinkedHashMap<>();

            // Sort by value and put it into the "result" map
            message.getData().entrySet().stream()
                    .sorted(Map.Entry.<String, Long>comparingByValue().reversed())
                    .forEachOrdered(x -> result.put(x.getKey(), x.getValue()));

            // Print only if count > 50
            result.entrySet().stream().filter(entry -> entry.getValue() > 50).forEach(entry ->
                    System.out.println("||\t" + entry.getValue() + " \t||\t" + entry.getKey())
            );
            long endTime = System.currentTimeMillis();
            System.out.println("Total time: " + (endTime - startTime));
        }

    }, ec);
}

2. The FileAnalysisActor class

public class FileAnalysisActor extends UntypedActor {

    private Map<String, Long> ipMap = new HashMap<>();
    private long fileLineCount;
    private long processedCount;
    private ActorRef analyticsSender = null;

    @Override
    public void onReceive(Object message) throws Exception {
        /*
         This actor can receive two different messages, FileAnalysisMessage or LineProcessingResult;
         any other type will be discarded via the unhandled method
        */
        //System.out.println(Thread.currentThread().getName());
        if (message instanceof FileAnalysisMessage) {

            List<String> lines = FileUtils.readLines(new File(
                    ((FileAnalysisMessage) message).getFileName()));

            fileLineCount = lines.size();
            processedCount = 0;

            // store a reference to the original sender to send back the results later on
            analyticsSender = this.getSender();

            for (String line : lines) {
                // create a new actor for each line of the log file
                Props props = Props.create(LogLineProcessor.class);
                ActorRef lineProcessorActor = this.getContext().actorOf(props);

                // send a message to the new actor with the line payload
                lineProcessorActor.tell(new LogLineMessage(line), this.getSelf());
            }

        } else if (message instanceof LineProcessingResult) {

            // a result message is received after a LogLineProcessor actor has finished processing a line
            String ip = ((LineProcessingResult) message).getIpAddress();

            // increment the ip counter
            Long count = ipMap.getOrDefault(ip, 0L);
            ipMap.put(ip, ++count);

            // if the file has been processed entirely, send a termination message to the main actor
            processedCount++;
            if (fileLineCount == processedCount) {
                // send done message
                analyticsSender.tell(new FileProcessedMessage(ipMap), ActorRef.noSender());
            }

        } else {
            // ignore any other message type
            this.unhandled(message);
        }
    }
}

3. The LogLineProcessor class

public class LogLineProcessor extends UntypedActor {

    @Override
    public void onReceive(Object message) throws Exception {
        if (message instanceof LogLineMessage) {
            // Which data does each actor process?
            //System.out.println("Line: " + ((LogLineMessage) message).getData());
            // Uncomment this line to see the thread number and the actor name relationship
            //System.out.println("Thread [" + Thread.currentThread().getId() + "] handling [" + getSelf().toString() + "]");

            // get the message payload; this will be just one line from the log file
            String messageData = ((LogLineMessage) message).getData();

            int idx = messageData.indexOf('-');
            if (idx != -1) {
                // get the ip address
                String ipAddress = messageData.substring(0, idx).trim();

                // tell the sender that we got a result using a new type of message
                this.getSender().tell(new LineProcessingResult(ipAddress), this.getSelf());
            }
        } else {
            // ignore any other message type
            this.unhandled(message);
        }
    }
}

The message classes

1. FileAnalysisMessage

public class FileAnalysisMessage {

    private String fileName;

    public FileAnalysisMessage(String file) {
        this.fileName = file;
    }

    public String getFileName() {
        return fileName;
    }
}

2. FileProcessedMessage

public class FileProcessedMessage {

    private Map<String, Long> data;

    public FileProcessedMessage(Map<String, Long> data) {
        this.data = data;
    }

    public Map<String, Long> getData() {
        return data;
    }
}
3. LineProcessingResult

public class LineProcessingResult {

    private String ipAddress;

    public LineProcessingResult(String ipAddress) {
        this.ipAddress = ipAddress;
    }

    public String getIpAddress() {
        return ipAddress;
    }
}

4. LogLineMessage

public class LogLineMessage {

    private String data;

    public LogLineMessage(String data) {
        this.data = data;
    }

    public String getData() {
        return data;
    }
}

I am creating one actor for every line of the file.

Best answer

With any concurrency framework there is always a trade-off between the amount of concurrency deployed and the complexity involved in each unit of concurrency. Akka is no exception.

In your non-akka approach, the sequence of steps for each line is relatively simple:

  1. Read a line from the file
  2. Split the line on '-'
  3. Put the IP address into a HashMap and increment its count

By comparison, your akka approach is much more complicated for each line:

  1. Create an Actor
  2. Create a LogLineMessage message
  3. Send the message to the actor
  4. Split the line on '-'
  5. Create a LineProcessingResult message
  6. Send the message back to the coordinating actor
  7. Put the IP address into a HashMap and increment its count

If we naively assume that each of the steps above takes the same amount of time, then you would need 2 threads with akka just to run as fast as 1 thread without akka.

Make each unit of concurrency do more work

Instead of 1 actor per 1 line, have each actor process N lines into its own sub-HashMap (e.g., each actor processes 1000 lines):

public class LogLineMessage {

    private String[] data;

    public LogLineMessage(String[] data) {
        this.data = data;
    }

    public String[] getData() {
        return data;
    }
}

The actor then no longer sends back something as simple as a single IP address. Instead, it sends back a hash of counts for its subset of lines:

public class LineProcessingResult {

    private HashMap<String, Long> ipAddressCount;

    public LineProcessingResult(HashMap<String, Long> count) {
        this.ipAddressCount = count;
    }

    public HashMap<String, Long> getIpAddressCount() {
        return ipAddressCount;
    }
}

And the coordinating actor can be responsible for combining all of the sub-counts:

// inside of FileAnalysisActor
else if (message instanceof LineProcessingResult) {
    HashMap<String, Long> localCount = ((LineProcessingResult) message).getIpAddressCount();

    localCount.forEach((ipAddress, count) ->
        ipMap.put(ipAddress, ipMap.getOrDefault(ipAddress, 0L) + count)
    );
}

You can then vary N to find where you get the best performance on your particular system.
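The batching itself is independent of Akka. A minimal sketch of splitting the lines into batches of at most N, so that one message carries a whole batch (the class name BatchSplitter is illustrative):

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class BatchSplitter {
    // Split a list of lines into batches of at most n elements each;
    // each batch would become the payload of one LogLineMessage.
    public static List<List<String>> split(List<String> lines, int n) {
        List<List<String>> batches = new ArrayList<>();
        for (int i = 0; i < lines.size(); i += n) {
            batches.add(lines.subList(i, Math.min(i + n, lines.size())));
        }
        return batches;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("a - 1", "b - 2", "c - 3", "d - 4", "e - 5");
        // 5 lines with n = 2 yield 3 batches; the last batch holds the single leftover line
        System.out.println(split(lines, 2));
    }
}
```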

Don't read the entire file into memory

Another disadvantage of your concurrent solution is that it first reads the entire file into memory. This is unnecessary and taxing for the JVM.

Instead, read the file N lines at a time. Once you have those lines in memory, spawn an actor as described above.

FileReader fr = new FileReader(file);
BufferedReader br = new BufferedReader(fr);

String[] lineBuffer = null;
int bufferCount = 0;
int N = 1000;

String line = br.readLine();

while (line != null) {
    if (0 == bufferCount) {
        lineBuffer = new String[N];
    } else if (N == bufferCount) {
        Props props = Props.create(LogLineProcessor.class);
        ActorRef lineProcessorActor = this.getContext().actorOf(props);

        lineProcessorActor.tell(new LogLineMessage(lineBuffer),
                this.getSelf());

        bufferCount = 0;
        continue;
    }

    lineBuffer[bufferCount] = line;
    line = br.readLine();
    bufferCount++;
}

// handle the final, possibly partial, buffer
if (bufferCount > 0) {
    Props props = Props.create(LogLineProcessor.class);
    ActorRef lineProcessorActor = this.getContext().actorOf(props);

    // copy only the filled slots so the actor never sees trailing nulls
    lineProcessorActor.tell(new LogLineMessage(Arrays.copyOf(lineBuffer, bufferCount)),
            this.getSelf());
}

This allows the file IO, the line processing, and the sub-map combining to all run in parallel.
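For comparison, the same chunk-then-merge design can be sketched without Akka at all, using only java.util.concurrent. This is a hedged, self-contained illustration of the technique (the class name ParallelIpCount and pool size of 4 are assumptions, not part of the answer):

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelIpCount {
    // Count the IP prefix (text before '-') within one chunk of lines;
    // this plays the role of the LogLineProcessor working on a batch.
    public static Map<String, Long> countChunk(List<String> chunk) {
        Map<String, Long> counts = new HashMap<>();
        for (String line : chunk) {
            int idx = line.indexOf('-');
            if (idx != -1) {
                counts.merge(line.substring(0, idx).trim(), 1L, Long::sum);
            }
        }
        return counts;
    }

    // Split into chunks of n, process them on a thread pool, then merge
    // the per-chunk maps, mirroring the coordinating actor's role.
    public static Map<String, Long> countParallel(List<String> lines, int n)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Future<Map<String, Long>>> futures = new ArrayList<>();
            for (int i = 0; i < lines.size(); i += n) {
                List<String> chunk = lines.subList(i, Math.min(i + n, lines.size()));
                futures.add(pool.submit(() -> countChunk(chunk)));
            }
            Map<String, Long> total = new HashMap<>();
            for (Future<Map<String, Long>> f : futures) {
                f.get().forEach((ip, c) -> total.merge(ip, c, Long::sum));
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<String> lines = List.of("1.1.1.1 - x", "2.2.2.2 - y", "1.1.1.1 - z");
        System.out.println(countParallel(lines, 2));
    }
}
```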

Regarding "java - File operations with Akka Actors", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/48902089/
