python - 推断 Pandas DataFrame-6ren

python - 推断 Pandas DataFrame

转载作者：太空狗更新时间：2023-10-29 22:12:55

28

4

使用 Series.interpolate 很容易在 Pandas.DataFrame 中插入值，如何进行外推？

例如，给定一个如图所示的 DataFrame，我们如何将它外推 14 个月到 2014 年 12 月 31 日？线性外推法很好。

X1 = range(10)
X2 = map(lambda x: x**2, X1)
df = pd.DataFrame({'x1': X1, 'x2': X2},  index=pd.date_range('20130101',periods=10,freq='M'))

我认为必须首先创建一个新的 DataFrame，DateTimeIndex 从 2013-11-31 开始，再延长 14 个 M 时间段。除此之外，我被困住了。

最佳答案

使用 `DatetimeIndex` 索引外推 `DataFrame`

这可以通过两个步骤完成:

扩展DatetimeIndex
推断数据

扩展索引

用新的 DataFrame 覆盖 df，其中数据为 resampled到基于原始 index's start, period and frequency 的新扩展索引.这允许原始 df 来自任何地方，如 csv 示例中的情况。有了这个，列就很方便了 filled with NaNs !

# Fake DataFrame for example (could come from anywhere)
X1 = range(10)
X2 = map(lambda x: x**2, X1)
df = pd.DataFrame({'x1': X1, 'x2': X2},  index=pd.date_range('20130101',periods=10,freq='M'))

# Number of months to extend
extend = 5

# Extrapolate the index first based on original index
df = pd.DataFrame(
    data=df,
    index=pd.date_range(
        start=df.index[0],
        periods=len(df.index) + extend,
        freq=df.index.freq
    )
)

# Display
print df

    x1  x2
2013-01-31   0   0
2013-02-28   1   1
2013-03-31   2   4
2013-04-30   3   9
2013-05-31   4  16
2013-06-30   5  25
2013-07-31   6  36
2013-08-31   7  49
2013-09-30   8  64
2013-10-31   9  81
2013-11-30 NaN NaN
2013-12-31 NaN NaN
2014-01-31 NaN NaN
2014-02-28 NaN NaN
2014-03-31 NaN NaN

推断数据

大多数外推器都要求输入是数字而不是日期。这可以用

# Temporarily remove dates and make index numeric
di = df.index
df = df.reset_index().drop('index', 1)

查看此 answer了解如何使用 3^rd order polynomial 推断 DataFrame 每一列的值.

Snippet from answer

# Curve fit each column
for col in fit_df.columns:
    # Get x & y
    x = fit_df.index.astype(float).values
    y = fit_df[col].values
    # Curve fit column and get curve parameters
    params = curve_fit(func, x, y, guess)
    # Store optimized parameters
    col_params[col] = params[0]

# Extrapolate each column
for col in df.columns:
    # Get the index values for NaNs in the column
    x = df[pd.isnull(df[col])].index.astype(float).values
    # Extrapolate those points with the fitted function
    df[col][x] = func(x, *col_params[col])

一旦列被推断出来，把日期放回去

# Put date index back
df.index = di

# Display
print df

x1   x2
2013-01-31   0    0
2013-02-28   1    1
2013-03-31   2    4
2013-04-30   3    9
2013-05-31   4   16
2013-06-30   5   25
2013-07-31   6   36
2013-08-31   7   49
2013-09-30   8   64
2013-10-31   9   81
2013-11-30  10  100
2013-12-31  11  121
2014-01-31  12  144
2014-02-28  13  169
2014-03-31  14  196

关于python - 推断 Pandas DataFrame，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/34159342/

28

4

0

文章推荐： python - numpy.std 和 excel STDEV 函数之间有什么区别吗？

文章推荐： c# - 如何覆盖 c# 中 List() 中每个列表项的序列化名称？

文章推荐： c# - 使用 `String` 而不是 `Date` 类型时如何验证日期？

文章推荐： c# - LINQ使用C#问题

大语言模型提示技巧（五）－推断
大语言模型具备从文字中推断情感和主题的能力。这种能力可用于获知客户对产品评价的情感、新闻或媒体文章的主题或倾向等。大语言模型的这种推断能力可被应用于舆情分析等场景。推断可以看作是模型接收文本作为输入
c++ - 推断/删除模板模板参数的类型
当使用模板模板参数时，我如何推断或删除模板模板的模板类型？考虑以下 SSCCE: #include #include #include using namespace std; templat
scala - 推断 mixin 上的类型参数
假设我有一些特质: trait A[T] { def foo: T } 一个扩展它的类: class B[T](t: T) extends A[T] { def foo = t } 以及父特征的子特征
OCaml 的 rectype 推断
一边玩-rectypes在某些时候选择 OCaml 我只是迷路了。这个表达式几乎可以打字: # fun x -> x x;; - : ('a -> 'b as 'a) -> 'b = 但是这里 O
haskell - 推断 Eq 类型类
我正在编写一个类似 CRUD 的应用程序，并且通过主键进行大量查找(主键可以有不同的类型)。所以我定义了以下类型类: {-# LANGUAGE MultiParamTypeClasses #-} cl
ontology - 推断 Protege 中的逆属性
我已经创建了关系 A 'is functional parent of' B并定义 'has functional parent'作为 'is functional parent of' 的倒数. '
Kotlin 推断 JOOQ 方法错误
给定一个使用 Kotlin 版本 1.3.61 和 JOOQ 版本 3.13.1 的系统，这样的方法会构建 union正常查询: val selectCommonPart = coalesce
haskell - 推断 if ... then ... else 奇怪的行为
考虑以下错误代码: fun x = if (null x) then 0 else (take 50 x) : (fun (drop 50 x)) 我注意到，我可以毫无问题地将它加载到
haskell - 推断 lambda 表达式的类型
给定一个具有以下类型的函数 a: a::x -> Bool 和以下类型的另一个函数 b: b::Bool -> y 我正在尝试找出推断以下函数类型的步骤: c =\d -> d a b 有人可以帮助解
android - 推断——gradle 构建不工作
我正在尝试使用 Infer 工具来分析我的应用代码。我关注了these steps每次我尝试运行 infer -- gradle build 时，我都会收到以下错误: infer -- gradle
c++ - 推断 lambda 的类型衰减为函数指针
所以我制作了这个模板来定义内联仿函数: template struct AsFunctor { template std::invoke_result_t operator()(A
c++ - 推断 CRTP 中模板化成员函数的返回类型
是否可以推断 CRTP 基类中模板化成员函数的返回类型？虽然推断参数类型效果很好，但它因返回类型而失败。考虑以下示例。 #include template struct base { tem
python - 推断 Pandas DataFrame
使用 Series.interpolate 很容易在 Pandas.DataFrame 中插入值，如何进行外推？例如，给定一个如图所示的 DataFrame，我们如何将它外推 14 个月到 2014
scala - 推断 lambda 的参数类型(再次!)
我想知道为什么这不起作用(缺少参数类型)？ Seq(1,2,3).toSet.map(_ + 1) 但这确实: val foo = Seq(1,2,3).toSet foo.map(_ + 1)
shell - 推断 SQLite3 shell 工具返回的值的类型
我没有必要使用 SQLite3 shell 工具来维护一个小型数据库。我正在使用 -header -ascii标志，尽管据我所知，这适用于任何输出选择。我正在寻找一种方法来避免对返回的任何一个值的类型
reactjs - FlowType - 推断 React 组件的通用类型
我有以下组件 type PropTypes = { items: T[], header: (item: T) => React.Element, body: (item: T) => R
javascript - Eclipse/JSDT 中的类型声明/推断
我想在 Eclipse/JSDT 中指定实例变量的类型，如下例所示: /** * @constructor */ function A() { /** @type Node */
python - IDE 推断 Python 类型
我正在用 Python 编写一个方法，它看起来像这样: def rgb_to_grayscale(image): print(image.shape) pass 此处预期的类型是 nu
python - 推断 numpy 数组中最近的、较小的值
我有一个 my_values 数组，我正在尝试为其推断 true_values 数组中最接近、较小的值。使用下面的 find_nearest 函数并不能完成我想要的。我如何追加它以找到最近的、较小的值
c++ - 推断 std::array 大小？
在下面的代码中: template int b(int q, const std::array& types) { int r = q; for (int t : types)

首页

博学

6Ren·AI

商城