pyspark - to_timestamp 什么时候从 19xx 产生结果？-6ren

pyspark - to_timestamp 什么时候从 19xx 产生结果？

转载作者：行者123 更新时间：2023-12-04 10:53:13

26

4

PySpark 在什么条件/标准下以 dd-MMM-yy 格式转换日期(01-JAN-40) 至 1940-01-01 00:00:00.000而不是 2040-01-01 00:00:00.000 ?

from pyspark.sql import functions as psf
df.withColumn('my_date', psf.to_timestamp("my_date", "dd-MMM-yy"))

我运行的一些示例如下:

01-JAN-40 -> 1940-01-01 00:00:00.000
01-JAN-47 -> 1947-01-01 00:00:00.000
01-JAN-15 -> 2015-01-01 00:00:00.000
01-JAN-18 -> 2018-01-01 00:00:00.000
01-JAN-19 -> 2019-01-01 00:00:00.000
01-JAN-20 -> 2020-01-01 00:00:00.000

最佳答案

目前(Spark <= 2.4.4)，spark 正在使用 java SimpleDateFormat引擎盖下的类来解析字符串。来自 java 文档 here , 规定

For parsing with the abbreviated year pattern ("y" or "yy"), SimpleDateFormat must interpret the abbreviated year relative to some century. It does this by adjusting dates to be within 80 years before and 20 years after the time the SimpleDateFormat instance is created.

因此，如果您在 2019 年运行它，则最多 39 的所有内容都将在 20xx 中，其他所有内容都将在 19xx 中

关于pyspark - to_timestamp 什么时候从 19xx 产生结果？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/59365446/

26

4

0

文章推荐： SQL按值对人口进行排序并按值分组

文章推荐： python - django 应用程序的数据库设计

文章推荐： c++-cli - 如何在 c++/cli 中调用基类索引器属性？

postgresql - to_timestamp 存储时区数据
我有两列，一列名为“日期”，另一列名为“时间”。日期是日期数据类型，“时间”是字符数据类型。我正在使用以下查询来选择一个新的组合时间戳列 SELECT to_timestamp(concat_
postgresql to_timestamp 通过设计接受无效日期
我正在尝试将字符串验证为来自多个 CSV 的时间戳，并且简单地将它们转换为 timestamptz 将失败，因为无法强制使用唯一的日期时间格式: select '10/31/2010'::timest
javascript - postgresql to_timestamp 返回与时间戳表示的日期不同的日期
这是问题的复制: 我通过 JavaScript 获取现在的时间戳 var ts = +new Date // 1368971991090 console.log( new Date(136897199
pyspark - to_timestamp 什么时候从 19xx 产生结果？
PySpark 在什么条件/标准下以 dd-MMM-yy 格式转换日期(01-JAN-40) 至 1940-01-01 00:00:00.000而不是 2040-01-01 00:00:00.000
apache-spark - pyspark to_timestamp 不包括毫秒
我正在尝试格式化我的时间戳列以包含毫秒但没有成功。我怎样才能把我的时间格式化成这样 - 2019-01-04 11:09:21.152 ? 我查看了文档并遵循了 SimpleDataTimeForma
java - 将 To_TimeStamp 函数转换为 java
package testOnly; import java.sql.Timestamp; import java.text.SimpleDateFormat; import java.util.Dat
sql - 如何将格式传递给 postgresql 中的 to_timestamp？
我有以毫秒为单位的 utc epocha，我希望我的 sql 以特定日期格式返回结果日期。这行得通 SELECT to_timestamp(timestamp / 1000) as date
PostgreSQL - to_timestamp 未正确转换 unix 时间戳
我正在尝试获取当前的 UTC 时间，并将其插入到 PostgreSQL 时间戳中。但它不能正常工作。我正在使用以下命令: INSERT INTO public.rt_block_height VAL
sql - 如何在查询 to_timestamp Postgresql 中删除时区
我在 Postgresql 中查找，我想删除查询中的 +9 UTC 值。例如:在to_timestamp列中，我想去掉+09，只保留2016-02-26 00:23:44 值(value)。这是我
Pandas Period 到 to_timestamp 给我 TypeError
我有一个格式如下所示的 Pandas Dataframe: Month Count 2021-02 100 2021-03 200 其中“月份”列是使用 dt
oracle - ORA-01830: 日期格式图片在转换整个输入字符串之前结束，尽管使用 TO_TIMESTAMP
尽管使用了 TO_TIMESTAMP 函数，但我的查询(由应用程序触发时)仍无法执行并出现此错误。 INSERT INTO MY_TABLE_NAME ( UPDATED_DATE, CREA
postgresql - 使用 to_timestamp 函数解析 Twitter 时间戳时出现问题
我通过流式 API 下载了 Twitter 数据，并希望将数据导入 Postgres(9.3 版)以进行一些地理分析。解析 json 数据有效，但我无法将 Twitter 时间设置为正确的时间戳。这
sql - Postgresql to_timestamp 在包装 extract() 时返回不同的日期
导入脚本写得有点错误，导致时间戳被插入了 1000 倍。然而，将 to_timestamp 与 extract() 一起使用会导致大约一个月的日期，即使中间数字和转换看起来是正确的。 1) selec
sql - to_timestamp——以 04 或 05 结束
我有一个 pg 数据库，其中包含以下数据: (yyyymmdd) hour (hh) minute (mm) and second (ss) 全部在单独的字符串类型列中。我使用这样的函数将其转换为时
postgresql - Postgres-必须 to_timestamp() 忽略/不读取日期/时间字符串中间的特定字符
我有原始文本列，其值类似于“2012-07-26T10:33:34”和“2012-07-26T10:56:16”。在使用 Joda-Time 的 Java 中，我可以通过调用轻松地将其转换为日期/从
python - 属性错误 : 'numpy.int64' object has no attribute 'to_timestamp'
我正在尝试从 python 数据框中绘制时间序列。代码如下。 import requests from bs4 import BeautifulSoup import pandas as pd imp
sql - 如何将 DISTINCT 与 string_agg() 和 to_timestamp() 一起使用？
我想在一行中使用逗号分隔的唯一 from_date。所以我在 TO_TIMESTAMP() 中使用 distinct() 函数，但出现错误。 SELECT string_agg(TO_CHAR(TO
python - TypeError : Passing PeriodDtype data is invalid. 改为使用 `data.to_timestamp()`
如何将格式为 2014-09 的 date 列转换为格式为 2014-09-01 00:00:00.000 ？之前的格式由df['date'] = pd.to_datetime(df['date'])
mysql - 如何将 to_timestamp ('12-10-18 12:00:16.565736000 PM' ,'DD-MM-RR HH12:MI:SSXFF AM' ) 转换为 MySQL
我在 oracle 中有一个插入查询 --- Insert into sample (name,time) values ('RJ-valley',to_timestamp('12-10-18 12:

首页

博学

6Ren·AI

商城

pyspark - to_timestamp 什么时候从 19xx 产生结果？