使用重复列 reshape 数据-6ren

使用重复列 reshape 数据

转载作者：行者123 更新时间：2023-12-04 17:54:55

26

4

我正在尝试使用 reshape 来重组我的数据集。

这是我的数据的一个子集，它是一个 16 X 198 的数据框。每个奇数列是一个 16 年的列表，偶数列具有不同国家/地区的值。

   Algeria.x Algeria.y Argentina.x Argentina.y
1       1985     37.48        1985       27.86
2       1986     36.26        1986       27.52
3       1987     35.04        1987       27.18
4       1988     33.82        1988       26.84
5       1989     32.60        1989       26.50
6       1990        NA        1990       25.50
7       1991        NA        1991       24.50
8       1992        NA        1992       23.50
9       1993        NA        1993       22.50
10      1994        NA        1994       21.50
11      1995        NA        1995       22.12
12      1996        NA        1996       22.74
13      1997        NA        1997       23.36
14      1998        NA        1998       23.98
15      1999        NA        1999       24.60
16      2000        NA        2000          NA

我想 reshape 数据，使其具有三列。第一个是国名，第二个是年份，第三个是值。这将是一个 1584 x 3 的长矩阵。

最佳答案

我会使用 stack在将数据拆分为两个 data.frames 后，函数两次:一个用于年份，另一个用于值:

# split the data into two data.frames
years.df  <- df[, seq(from = 1, to = ncol(df), by = 2)]
values.df <- df[, seq(from = 2, to = ncol(df), by = 2)]

# remove ".x" and ".y" at the end of the country names
names(years.df)  <- sub("\\.x$", "", names(years.df))
names(values.df) <- sub("\\.y$", "", names(values.df))

# stack each data.frame into a two-column data.frame
years.stack  <- stack(years.df)
values.stack <- stack(values.df)

# gather everything into a single data.frame
final.df <- data.frame(country = years.stack$ind,
                       year    = years.stack$value,
                       value   = values.stack$value)
final.df
#      country year value
# 1    Algeria 1985 37.48
# 2    Algeria 1986 36.26
# 3    Algeria 1987 35.04
# 4    Algeria 1988 33.82
# 5    Algeria 1989 32.60
# 6    Algeria 1990    NA
# 7    Algeria 1991    NA
# 8    Algeria 1992    NA
# 9    Algeria 1993    NA
# 10   Algeria 1994    NA
# 11   Algeria 1995    NA
# 12   Algeria 1996    NA
# 13   Algeria 1997    NA
# 14   Algeria 1998    NA
# 15   Algeria 1999    NA
# 16   Algeria 2000    NA
# 17 Argentina 1985 27.86
# 18 Argentina 1986 27.52
# 19 Argentina 1987 27.18
# 20 Argentina 1988 26.84
# 21 Argentina 1989 26.50
# 22 Argentina 1990 25.50
# 23 Argentina 1991 24.50
# 24 Argentina 1992 23.50
# 25 Argentina 1993 22.50
# 26 Argentina 1994 21.50
# 27 Argentina 1995 22.12
# 28 Argentina 1996 22.74
# 29 Argentina 1997 23.36
# 30 Argentina 1998 23.98
# 31 Argentina 1999 24.60
# 32 Argentina 2000    NA

关于使用重复列 reshape 数据，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/9847230/

26

4

0

文章推荐： Vim:退出拆分窗口时删除缓冲区

文章推荐： expandoobject - 如何对包含 ExpandoObjects 的 List 进行排序

文章推荐： emacs - 在 global-set-key 中使用分号作为 .emacs 中的函数

reshape - 如何检查APL中的字符串是否被 reshape ？
如何检查字符串是否被 reshape ？示例:“aab”返回 0，因为“a”无法 reshape 为该字符串或任何其他更短的字符串。另一个例子是“aabbaab”返回 1，因为“aabb”可以被 r
reshape - Theano reshape
我无法清楚地理解theano的reshape。我有一个形状的图像矩阵: [batch_size, stack1_size, stack2_size, height, width] ，其中有 s
reshape - 如何检查APL中的字符串是否被 reshape ？
如何检查字符串是否被 reshape ？示例:“aab”返回 0，因为“a”无法 reshape 为该字符串或任何其他更短的字符串。另一个例子是“aabbaab”返回 1，因为“aabb”可以被 r
reshape - 如何像这样使用 python reshape 数据集
这是原始数据 a=[[1,2,3,4,5,6], [7,8,9,10,11,12]] 我想把它转换成这样的格式: b=[[1,2,3,7,8,9], [4,5,6,10,11,12]] a
python - 只是 reshape 和 reshape 和获得转置之间的区别？
我目前正在学习 CS231 作业，我意识到一些令人困惑的事情。在计算梯度时，当我第一次 reshape x 然后得到转置时，我得到了正确的结果。 x_r=x.reshape(x.shape[0],-1
r - 如何使用 reshape 包 reshape 此数据框
这个问题在这里已经有了答案: Reshaping multiple sets of measurement columns (wide format) into single columns (lon
当 reshape 无法猜测时变变量的名称时， reshape r 中的数据
我有一个包含超过 1500 列的宽格式数据集。由于许多变量都是重复的，我想将其 reshape 为长形式。然而，r 抛出一个错误: Error in guess(varying) : Failed
从长到宽 reshape 数据 - 了解 reshape 参数
我有一个长格式的数据框狗，我正在尝试使用 reshape() 函数将其重新格式化为宽格式。目前看起来是这样的: dogid month year trainingtype home scho
python - NumPy 使用 reshape 函数 reshape 数组
这个问题在这里已经有了答案: how to reshape an N length vector to a 3x(N/3) matrix in numpy using reshape (1 个回答)
python - 'numpy.reshape' 和 'ndarray.reshape' 如何等效？
我对 ndarray.reshape 的结构有疑问.我读过 numpy.reshape()和 ndarray.reshape是 python 中用于 reshape 数组的等效命令。据我所知，num
reshape - 在 Stata 中没有唯一的 "j"变量的情况下如何 reshape ？
所以这是我的麻烦:我想将一个长格式的数据文件改成宽格式。但是，我没有唯一的“j”变量；长格式文件中的每条记录都有几个关键变量。例如，我想这样做: | caseid | gender | age |
从 base reshape vs 从具有缺失值的 reshape2 reshape
Whis 这个数据框， df df id parameter visit value sex 1 01 blood V1 1 f 2 01 saliva V
python - reshape numpy 数组的列表，然后 reshape 回来
我有一个列表，其中包含几个不同形状的 numpy 数组。我想将这个数组列表 reshape 为一个 numpy 向量，然后更改向量中的每个元素，然后将其 reshape 回原始数组列表。例如: 输入
Python 使用 np.reshape 按特定顺序 reshape 数组
我有一个形状为 (1800,144) 的数组 (a) 其中 a[0:900,:] 都是实数，后半部分数组 a[900:1800,:] 全部为零。我想把数组的后半部分水平地放在前半部分旁边，然后将它们推
python - 在 Python 中使用 reshape reshape 数组
我有一个如下所示的数组: array([[0, 0, 0, 0, 0, 0, 0, 0], [1, 1, 1, 1, 1, 1, 1, 1], [2, 2, 2, 2, 2
python - 为什么 Tensorflow Reshape tf.reshape() 会破坏梯度流？
我正在创建一个 tf.Variable()，然后使用该变量创建一个简单的函数，然后我使用 tf.reshape() 展平原始变量，然后我在函数和展平变量之间使用了 tf.gradients()。为什么
python - 使用 array.reshape(-1, 1) reshape 数组
我有一个名为 data 的数据框，我试图从中识别任何异常价格。数据框头部看起来像: Date Last Price 0 29/12/2017 487.74 1 28/
python - 使用 numpy reshape 数组 - ValueError : cannot reshape array
我有一个 float vec 数组，我想对其进行 reshape vec.shape >>> (3,) len(vec[0]) # all 3 rows of vec have 150 columns
python - 在不使用 reshape 的情况下 reshape n 维数组的 View
tl;dr 我可以在不使用 numpy.reshape 的情况下将 numpy 数组的 View 从 5x5x5x3x3x3 reshape 为 125x1x1x3x3x3 吗？我想对一个体积(大小
reshape() function to make wide to long data(RESHAPE()函数使数据变宽变长)
set.seed(123)data <- data.frame(ID = 1:10, weight_hus = rnorm(10, 0, 1),

首页

博学

6Ren·AI

商城

使用重复列 reshape 数据