
High-velocity streaming data enrichment with low-velocity/slow-changing data




My system consists of:



  • High-velocity telemetry data generated by IoT devices.

  • Relatively static/slow-changing reference/lookup data - Alarm Rules


Each IoT device has 0 or 1 Alarm Rules. An Alarm Rule has an average size of 1-2 KB.


Most Alarm Rules, once set, stay the same for weeks, months, or even a year or more.



Eventual consistency of Alarm Rules is also acceptable - if an Alarm Rule is edited, it is acceptable for the change to take effect within 15-30 minutes.


Question - What would be the best approach to enrich the device telemetry stream with alarm rules?


Option 1 - RichAsyncFunction + in-memory cache


Each time I receive a telemetry message from a device, I execute a RichAsyncFunction. It first checks whether the in-memory cache has the device's Alarm Rule. If the Alarm Rule is not found in the cache, a request is sent to the database. Cache entries expire after 30 minutes.
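
A minimal sketch of this approach, assuming a Caffeine cache and hypothetical Telemetry, AlarmRule, EnrichedTelemetry types plus an asynchronous database client (AsyncRuleDbClient) - none of these names come from the question itself:

    import org.apache.flink.configuration.Configuration;
    import org.apache.flink.streaming.api.functions.async.ResultFuture;
    import org.apache.flink.streaming.api.functions.async.RichAsyncFunction;

    import com.github.benmanes.caffeine.cache.Cache;
    import com.github.benmanes.caffeine.cache.Caffeine;

    import java.time.Duration;
    import java.util.Collections;
    import java.util.Optional;

    // Telemetry, AlarmRule, EnrichedTelemetry and AsyncRuleDbClient are hypothetical
    // placeholders for the application's own types and database access layer.
    public class AlarmRuleAsyncEnrichment
            extends RichAsyncFunction<Telemetry, EnrichedTelemetry> {

        private transient Cache<String, Optional<AlarmRule>> cache;
        private transient AsyncRuleDbClient dbClient;

        @Override
        public void open(Configuration parameters) {
            // Entries expire 30 minutes after being written, which matches the
            // 15-30 minute eventual-consistency window for rule edits.
            cache = Caffeine.newBuilder()
                    .expireAfterWrite(Duration.ofMinutes(30))
                    .maximumSize(100_000)
                    .build();
            dbClient = new AsyncRuleDbClient(); // hypothetical async client
        }

        @Override
        public void asyncInvoke(Telemetry t, ResultFuture<EnrichedTelemetry> result) {
            Optional<AlarmRule> cached = cache.getIfPresent(t.getDeviceId());
            if (cached != null) {
                result.complete(Collections.singleton(
                        new EnrichedTelemetry(t, cached.orElse(null))));
                return;
            }
            // Cache miss: fetch asynchronously, and also cache "no rule" (Optional.empty())
            // so devices without a rule don't hit the database on every message.
            dbClient.findRule(t.getDeviceId()).whenComplete((rule, err) -> {
                if (err != null) {
                    result.completeExceptionally(err);
                } else {
                    cache.put(t.getDeviceId(), rule);
                    result.complete(Collections.singleton(
                            new EnrichedTelemetry(t, rule.orElse(null))));
                }
            });
        }
    }

The function would then be attached with something like AsyncDataStream.unorderedWait(telemetryStream, new AlarmRuleAsyncEnrichment(), 5, TimeUnit.SECONDS, 100). Keep in mind that with parallelism > 1 each subtask holds its own cache, so the database still sees roughly one lookup per subtask per device per 30 minutes.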


Option 2 - KeyedProcessFunction + state object



Same logic as with Option 1, except that instead of using an in-memory cache, I store the Alarm Rule for each IoT device in ValueState<> and periodically refresh it using the ctx.timerService().register... scheduler (what happens if this gets called multiple times? Will the onTimer function also get triggered multiple times, or just once?).
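
A rough sketch of this option, again with hypothetical Telemetry, AlarmRule, EnrichedTelemetry types and a blocking RuleDbClient. On the timer question: Flink keeps at most one timer per key and timestamp, so registering a timer for the same timestamp repeatedly does not pile up callbacks - onTimer fires once per distinct (key, timestamp):

    import org.apache.flink.api.common.state.ValueState;
    import org.apache.flink.api.common.state.ValueStateDescriptor;
    import org.apache.flink.configuration.Configuration;
    import org.apache.flink.streaming.api.functions.KeyedProcessFunction;
    import org.apache.flink.util.Collector;

    // Telemetry, AlarmRule, EnrichedTelemetry and RuleDbClient are hypothetical placeholders.
    public class AlarmRuleStateEnrichment
            extends KeyedProcessFunction<String, Telemetry, EnrichedTelemetry> {

        private static final long REFRESH_INTERVAL_MS = 30 * 60 * 1000L; // 30 minutes

        private transient ValueState<AlarmRule> ruleState;
        private transient RuleDbClient dbClient;

        @Override
        public void open(Configuration parameters) {
            ruleState = getRuntimeContext().getState(
                    new ValueStateDescriptor<>("alarm-rule", AlarmRule.class));
            dbClient = new RuleDbClient(); // hypothetical blocking client
        }

        @Override
        public void processElement(Telemetry t, Context ctx, Collector<EnrichedTelemetry> out)
                throws Exception {
            AlarmRule rule = ruleState.value();
            if (rule == null) {
                // First message for this device (or state cleared): synchronous lookup.
                // (A production version would also remember "no rule" to avoid repeated lookups.)
                rule = dbClient.findRule(t.getDeviceId());
                ruleState.update(rule);
            }
            out.collect(new EnrichedTelemetry(t, rule));

            // Coalesce refreshes onto 30-minute boundaries. Because Flink deduplicates
            // timers per key and timestamp, re-registering the same timestamp is a no-op.
            long refreshAt = (ctx.timerService().currentProcessingTime() / REFRESH_INTERVAL_MS + 1)
                    * REFRESH_INTERVAL_MS;
            ctx.timerService().registerProcessingTimeTimer(refreshAt);
        }

        @Override
        public void onTimer(long timestamp, OnTimerContext ctx, Collector<EnrichedTelemetry> out)
                throws Exception {
            // Re-read the rule for this device so edits take effect within ~30 minutes.
            ruleState.update(dbClient.findRule(ctx.getCurrentKey()));
        }
    }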


Option 3 - CoProcessFunction/KeyedCoProcessFunction + 2 streams, one for telemetry, the second for alarm rules


This option offers the highest throughput and lowest latency. I would consume a Kafka topic for alarm rules and update ValueState<> with the stream data.
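
A sketch of the two-stream variant, assuming both streams are keyed by device id and using the same hypothetical Telemetry, AlarmRule, EnrichedTelemetry types as above:

    import org.apache.flink.api.common.state.ValueState;
    import org.apache.flink.api.common.state.ValueStateDescriptor;
    import org.apache.flink.configuration.Configuration;
    import org.apache.flink.streaming.api.functions.co.KeyedCoProcessFunction;
    import org.apache.flink.util.Collector;

    // Telemetry, AlarmRule, EnrichedTelemetry are hypothetical placeholder types.
    public class AlarmRuleCoEnrichment
            extends KeyedCoProcessFunction<String, Telemetry, AlarmRule, EnrichedTelemetry> {

        private transient ValueState<AlarmRule> ruleState;

        @Override
        public void open(Configuration parameters) {
            ruleState = getRuntimeContext().getState(
                    new ValueStateDescriptor<>("alarm-rule", AlarmRule.class));
        }

        // Telemetry stream: enrich with whatever rule is currently in state (may be null).
        @Override
        public void processElement1(Telemetry t, Context ctx, Collector<EnrichedTelemetry> out)
                throws Exception {
            out.collect(new EnrichedTelemetry(t, ruleState.value()));
        }

        // Rule stream: overwrite the state for this device key.
        @Override
        public void processElement2(AlarmRule rule, Context ctx, Collector<EnrichedTelemetry> out)
                throws Exception {
            ruleState.update(rule);
        }
    }

The wiring would be roughly telemetryStream.keyBy(Telemetry::getDeviceId).connect(ruleStream.keyBy(AlarmRule::getDeviceId)).process(new AlarmRuleCoEnrichment()), with the rule stream read from the alarm-rules Kafka topic.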


What's stopping me from implementing this solution is the Kafka topic message retention time. By default, Kafka messages have a 7-day retention time.


If I have alarm rule A for device B and I send it to the Kafka topic, and alarm rule A does not change over the next 7 days, the rule will no longer be visible on the 8th day. Basically, on the 8th day, when consuming messages from device B, the system won't see any alarm rules for that device.


I could increase the retention time to a longer period, but that does not seem reasonable. An alternative would be an external service that periodically re-emits all alarm rules to the Kafka topic, say, every 6-7 days.


Comments

You seem to be assuming that the ground truth for the rules must live outside of Flink. Why not keep the rule state in Flink (permanently)?

Wouldn't that still require the initial state and future updates to come from a Kafka topic/SQL database/external service? Or is it possible for my services (for example, a REST API) to "directly" talk to Flink and insert/update/delete state?

I would still use Kafka to decouple the external service from Flink, but the 7-day retention interval doesn't have to be a blocker, if you're willing to rely on checkpoints or savepoints to preserve the rules.

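To illustrate that suggestion with a minimal, assumed job skeleton: once the rules live in keyed Flink state and checkpointing is enabled, a restart resumes from the latest checkpoint or savepoint, so the enrichment no longer depends on old rule messages still being retained in Kafka:

    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class EnrichmentJob {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
            // Periodic checkpoints make the keyed rule state durable; on restart the job
            // resumes from the last checkpoint or savepoint instead of re-reading old
            // Kafka messages that may already have expired.
            env.enableCheckpointing(60_000); // every 60 seconds

            // ... build the keyed telemetry/rule streams and the enrichment operator here ...

            env.execute("telemetry-enrichment");
        }
    }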
