From the documentation, SpannerQueryDatabaseInstanceOperator accepts a query parameter. However, there is nothing as convenient as the PostgresOperator, which also accepts a parameters argument for using placeholders inside the query itself:
from airflow.providers.postgres.operators.postgres import PostgresOperator

get_birth_date = PostgresOperator(
    task_id="get_birth_date",
    postgres_conn_id="postgres_default",
    sql="SELECT * FROM pet WHERE birth_date BETWEEN SYMMETRIC %(begin_date)s AND %(end_date)s",
    parameters={"begin_date": "2020-01-01", "end_date": "2020-12-31"},
)
I am new to Airflow, but from reading a book on it and the documentation, the suggestion seems to be to avoid the PythonOperator where possible, as it can lead to defining business logic inside it rather than using Airflow for what it is designed for: orchestration.
So my questions are the following:
- How would you insert into Spanner values read from a previous task?
- I read that storing objects in XComs, or in Airflow itself, is not good practice for inter-task communication, but at the same time, if something has to be produced by one task and consumed by another, I don't see many alternatives to using XComs.
Thanks
Top answer:
Airflow leverages Jinja templating to parameterize queries. When you use Jinja, the parameterization is done by Airflow itself, and the rendered SQL statement is then submitted to the SQL engine for execution.
Some integrations/services have their own parameterization mechanisms; Airflow can support those as well, so the user can choose which one to use.
PostgresOperator can use the SQLAlchemy engine, so if you want that engine to render the statement, you can pass the variables to it using the parameters parameter. The answer at https://stackoverflow.com/a/72246305/14624409 shows how to use both options with a supported operator.
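For illustration, here is a minimal sketch of the two options side by side (the pet table, owner column, and values are made up for the example):

from airflow.providers.postgres.operators.postgres import PostgresOperator

# Option 1: the parameters argument; the database driver binds the
# placeholders, and the SQL text itself is never rewritten by Airflow.
get_pets_bound = PostgresOperator(
    task_id="get_pets_bound",
    postgres_conn_id="postgres_default",
    sql="SELECT * FROM pet WHERE owner = %(owner)s",
    parameters={"owner": "alice"},
)

# Option 2: Jinja templating; Airflow renders the value into the SQL
# text before submitting it to the engine (sql is a templated field).
get_pets_templated = PostgresOperator(
    task_id="get_pets_templated",
    postgres_conn_id="postgres_default",
    sql="SELECT * FROM pet WHERE owner = '{{ params.owner }}'",
    params={"owner": "alice"},
)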
In your case, SpannerQueryDatabaseInstanceOperator has query as a templated field, so you can simply use the Jinja engine with it.
For example:
from airflow.providers.google.cloud.operators.spanner import (
    SpannerQueryDatabaseInstanceOperator,
)

SpannerQueryDatabaseInstanceOperator(
    instance_id="my_instance",
    database_id="my_db",
    query="select {{ params.my_parameter }}",
    params={"my_parameter": 5},
    task_id="spanner_instance_query_task",
)
Which renders to select 5 when the task runs.
As for your questions:
How would you insert into Spanner values read from a previous task?
Simply use {{ ti.xcom_pull(task_ids='run_pod', key='return_value') }} in the SQL statement; it will be rendered by Jinja. task_ids is the task_id to pull the value from, and key is the identifier of the XCom (a task can push several XComs).
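Putting the pieces together, a minimal sketch of the two-task pattern (the instance, database, table name, and pushed value are hypothetical; the upstream task uses the TaskFlow API, so its return value lands in XCom automatically):

from datetime import datetime

from airflow import DAG
from airflow.decorators import task
from airflow.providers.google.cloud.operators.spanner import (
    SpannerQueryDatabaseInstanceOperator,
)

with DAG(dag_id="spanner_xcom_example", start_date=datetime(2023, 1, 1), schedule_interval=None):

    @task
    def compute_value():
        # Returned values are pushed to XCom under the key "return_value".
        return 42

    insert_row = SpannerQueryDatabaseInstanceOperator(
        task_id="insert_row",
        instance_id="my_instance",  # hypothetical instance
        database_id="my_db",        # hypothetical database
        # Jinja pulls the upstream XCom and renders it into the DML
        # statement before it is sent to Spanner.
        query=(
            "INSERT INTO my_table (id) "
            "VALUES ({{ ti.xcom_pull(task_ids='compute_value', key='return_value') }})"
        ),
    )

    compute_value() >> insert_row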
I read that storing objects in XComs, or in Airflow itself, is not good practice for inter-task communication, but at the same time, if something has to be produced by one task and consumed by another, I don't see many alternatives to using XComs.
XComs are meant to make small pieces of metadata accessible to other tasks. For example, you can pass along the count of records, but not the records themselves. If a downstream task needs access to a large dataset produced by an upstream task, store it in the cloud (S3, Google Cloud Storage, etc.): all tasks can access cloud storage, whereas the local disk of an Airflow worker is not shared between tasks, so you cannot rely on data stored on the Airflow disk being available to other tasks.
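A common way to reconcile the two points is to pass a reference through XCom instead of the data itself. A minimal sketch, with the bucket, path, and task names as placeholders:

from datetime import datetime

from airflow import DAG
from airflow.decorators import task

with DAG(dag_id="pass_reference_example", start_date=datetime(2023, 1, 1), schedule_interval=None):

    @task
    def extract():
        # Hypothetical: the task writes the full dataset to cloud storage
        # itself (e.g. with the google-cloud-storage client) and returns
        # only the object's URI.
        return "gs://my-bucket/exports/pets.csv"  # small string, fine for XCom

    @task
    def load(uri: str):
        # The downstream task receives only the URI through XCom and reads
        # the data directly from cloud storage, not from the worker's disk.
        print(f"Loading data from {uri}")

    load(extract())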