r - 如何 reshape WDI 数据-6ren

r - 如何 reshape WDI 数据

转载作者：行者123 更新时间：2023-12-02 08:38:08

25

4

我需要 reshape 从世界银行数据库下载的一些数据。但是我有一些困难。

目标是它看起来像这样:

year CH DE US
1980 17383.38 11746.40 12179.56
1981 15833.74 9879.46 13526.19
1982 16133.97 9593.66 13932.68
1983 16007.82 9545.86 15000.09
1984 15229.82 9012.48 16539.38

我使用以下代码下载数据。需要 WDI 和 RJSONO 包。

wdi <-  WDI(country = c("CH","DE","US"), indicator = "NY.GDP.PCAP.CD" ,start = 1980, end = 2010, extra = F)

然后我按照以下方式 reshape :

wdi2 <- reshape(wdi, direction = "wide", timevar="year", v.names="NY.GDP.PCAP.CD", idvar="country", drop="iso2c")

输出与我对其外观的预期不符:

> wdi2
             country NY.GDP.PCAP.CD.2010 NY.GDP.PCAP.CD.2009 NY.GDP.PCAP.CD.2008
    1    Switzerland            70572.66            65790.07            68555.37
    32       Germany            40163.82            40275.25            44132.04
    63 United States            46615.51            45305.05            46759.56 ...

这个好一点，但仍然不是我想要的:

> t(wdi2) 
                    1             32          63             
country             "Switzerland" "Germany"   "United States"
NY.GDP.PCAP.CD.2010 "70572.66"    "40163.82"  "46615.51"     
NY.GDP.PCAP.CD.2009 "65790.07"    "40275.25"  "45305.05"     
NY.GDP.PCAP.CD.2008 "68555.37"    "44132.04"  "46759.56"     
NY.GDP.PCAP.CD.2007 "59663.77"    "40402.99"  "46349.12"

wdi 对象如下所示:

> wdi
   iso2c       country NY.GDP.PCAP.CD year
1     CH   Switzerland      70572.657 2010
2     CH   Switzerland      65790.067 2009
3     CH   Switzerland      68555.372 2008
4     CH   Switzerland      59663.770 2007
...
30    CH   Switzerland      16219.906 1981
31    CH   Switzerland      17807.340 1980
32    DE       Germany      40163.817 2010
33    DE       Germany      40275.251 2009
34    DE       Germany      44132.042 2008
...
62    DE       Germany      11746.404 1980
63    US United States      46615.511 2010
64    US United States      45305.052 2009

最佳答案

又在电脑前了……所以这里有一个更新。

正如我在评论中提到的，“reshape2”中的 dcast 对此非常方便。如果您只是执行整形步骤，则可以从基于 R 的 xtabs 获得类似的功能。

x <- xtabs(NY.GDP.PCAP.CD ~ year + iso2c, wdi)
head(x)
#       iso2c
# year         CH        DE       US
#   1980 17807.34 11746.404 12179.56
#   1981 16219.91  9879.457 13526.19
#   1982 16527.46  9593.657 13932.68
#   1983 16398.24  9545.859 15000.09
#   1984 15601.26  9012.479 16539.38
#   1985 15748.95  9125.121 17588.81

xtabs 创建一个类“xtabs”的矩阵，所以要得到一个data.frame，换行as.data.frame.matrix 中的输出。

head(as.data.frame.matrix(x))
#            CH        DE       US
# 1980 17807.34 11746.404 12179.56
# 1981 16219.91  9879.457 13526.19
# 1982 16527.46  9593.657 13932.68
# 1983 16398.24  9545.859 15000.09
# 1984 15601.26  9012.479 16539.38
# 1985 15748.95  9125.121 17588.81

要回答您在评论中提出的其他问题:但是，仅使用 reshape 函数将数据直接转换为正确的格式，这不是一种更智能的方法吗？。答案是“是的。只需交换您在原始 reshape 尝试中使用的“idvar”和“timevar”。

y <- reshape(wdi[-2], direction = "wide", idvar="year", timevar="iso2c")
## Optional step to clean up the resulting names
names(y) <- gsub("NY.GDP.PCAP.CD.", "", names(y))
head(y)
#   year       CH       DE       US
# 1 2010 70572.66 40163.82 46615.51
# 2 2009 65790.07 40275.25 45305.05
# 3 2008 68555.37 44132.04 46759.56
# 4 2007 59663.77 40402.99 46349.12
# 5 2006 54140.50 35237.60 44622.64
# 6 2005 51734.30 33542.78 42516.39

使用 reshape 函数时，有时忽略参数名称的“id”和“time”部分并考虑它们的去向会有所帮助。 ID 变量通常组成一列，而时间变量通常分布很宽，每次一列。因此，即使我们可能将“国家/地区”视为实际 ID 变量，但对于您想要的数据格式，它更像是一个时间变量。

希望这对您有所帮助，即使您已经接受了答案:)

关于r - 如何 reshape WDI 数据，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/19207341/

25

4

0

文章推荐： c - fgets 没有得到最后一个字符

文章推荐： SQL - 为什么交易？

文章推荐： django - Django 是否原生支持迁移

文章推荐： objective-c - react 性 NSMutableDictionary？

reshape - 如何检查APL中的字符串是否被 reshape ？
如何检查字符串是否被 reshape ？示例:“aab”返回 0，因为“a”无法 reshape 为该字符串或任何其他更短的字符串。另一个例子是“aabbaab”返回 1，因为“aabb”可以被 r
reshape - Theano reshape
我无法清楚地理解theano的reshape。我有一个形状的图像矩阵: [batch_size, stack1_size, stack2_size, height, width] ，其中有 s
reshape - 如何检查APL中的字符串是否被 reshape ？
如何检查字符串是否被 reshape ？示例:“aab”返回 0，因为“a”无法 reshape 为该字符串或任何其他更短的字符串。另一个例子是“aabbaab”返回 1，因为“aabb”可以被 r
reshape - 如何像这样使用 python reshape 数据集
这是原始数据 a=[[1,2,3,4,5,6], [7,8,9,10,11,12]] 我想把它转换成这样的格式: b=[[1,2,3,7,8,9], [4,5,6,10,11,12]] a
python - 只是 reshape 和 reshape 和获得转置之间的区别？
我目前正在学习 CS231 作业，我意识到一些令人困惑的事情。在计算梯度时，当我第一次 reshape x 然后得到转置时，我得到了正确的结果。 x_r=x.reshape(x.shape[0],-1
r - 如何使用 reshape 包 reshape 此数据框
这个问题在这里已经有了答案: Reshaping multiple sets of measurement columns (wide format) into single columns (lon
当 reshape 无法猜测时变变量的名称时， reshape r 中的数据
我有一个包含超过 1500 列的宽格式数据集。由于许多变量都是重复的，我想将其 reshape 为长形式。然而，r 抛出一个错误: Error in guess(varying) : Failed
从长到宽 reshape 数据 - 了解 reshape 参数
我有一个长格式的数据框狗，我正在尝试使用 reshape() 函数将其重新格式化为宽格式。目前看起来是这样的: dogid month year trainingtype home scho
python - NumPy 使用 reshape 函数 reshape 数组
这个问题在这里已经有了答案: how to reshape an N length vector to a 3x(N/3) matrix in numpy using reshape (1 个回答)
python - 'numpy.reshape' 和 'ndarray.reshape' 如何等效？
我对 ndarray.reshape 的结构有疑问.我读过 numpy.reshape()和 ndarray.reshape是 python 中用于 reshape 数组的等效命令。据我所知，num
reshape - 在 Stata 中没有唯一的 "j"变量的情况下如何 reshape ？
所以这是我的麻烦:我想将一个长格式的数据文件改成宽格式。但是，我没有唯一的“j”变量；长格式文件中的每条记录都有几个关键变量。例如，我想这样做: | caseid | gender | age |
从 base reshape vs 从具有缺失值的 reshape2 reshape
Whis 这个数据框， df df id parameter visit value sex 1 01 blood V1 1 f 2 01 saliva V
python - reshape numpy 数组的列表，然后 reshape 回来
我有一个列表，其中包含几个不同形状的 numpy 数组。我想将这个数组列表 reshape 为一个 numpy 向量，然后更改向量中的每个元素，然后将其 reshape 回原始数组列表。例如: 输入
Python 使用 np.reshape 按特定顺序 reshape 数组
我有一个形状为 (1800,144) 的数组 (a) 其中 a[0:900,:] 都是实数，后半部分数组 a[900:1800,:] 全部为零。我想把数组的后半部分水平地放在前半部分旁边，然后将它们推
python - 在 Python 中使用 reshape reshape 数组
我有一个如下所示的数组: array([[0, 0, 0, 0, 0, 0, 0, 0], [1, 1, 1, 1, 1, 1, 1, 1], [2, 2, 2, 2, 2
python - 为什么 Tensorflow Reshape tf.reshape() 会破坏梯度流？
我正在创建一个 tf.Variable()，然后使用该变量创建一个简单的函数，然后我使用 tf.reshape() 展平原始变量，然后我在函数和展平变量之间使用了 tf.gradients()。为什么
python - 使用 array.reshape(-1, 1) reshape 数组
我有一个名为 data 的数据框，我试图从中识别任何异常价格。数据框头部看起来像: Date Last Price 0 29/12/2017 487.74 1 28/
python - 使用 numpy reshape 数组 - ValueError : cannot reshape array
我有一个 float vec 数组，我想对其进行 reshape vec.shape >>> (3,) len(vec[0]) # all 3 rows of vec have 150 columns
python - 在不使用 reshape 的情况下 reshape n 维数组的 View
tl;dr 我可以在不使用 numpy.reshape 的情况下将 numpy 数组的 View 从 5x5x5x3x3x3 reshape 为 125x1x1x3x3x3 吗？我想对一个体积(大小
reshape() function to make wide to long data(RESHAPE()函数使数据变宽变长)
set.seed(123)data <- data.frame(ID = 1:10, weight_hus = rnorm(10, 0, 1),

首页

博学

6Ren·AI

商城

r - 如何 reshape WDI 数据