string - Referencing objects in R using a variable string


Reposted by: 行者123. Updated: 2023-12-01 10:05:59


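Edit: Thanks to those who have responded so far. I'm very much a beginner in R and have just taken on a large project for my MSc dissertation, so I'm a bit overwhelmed by the initial processing. The data I'm using looks like this (from WMO publicly available rainfall data):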

<pre><code>120 6272100 KHARTOUM 15.60 32.55 382 1899 1989 0.0
1899 0.03 0.03 0.03 0.03 0.03 1.03 13.03 12.03 9999 6.03 0.03 0.03
1900 0.03 0.03 0.03 0.03 0.03 23.03 80.03 47.03 23.03 8.03 0.03 0.03
1901 0.03 0.03 0.03 0.03 0.03 17.03 23.03 17.03 0.03 8.03 0.03 0.03
(...)
120 6272101 JEBEL AULIA 15.20 32.50 380 1920 1988 0.0
1920 0.03 0.03 0.03 0.00 0.03 6.90 20.00 108.80 47.30 1.00 0.01 0.03
1921 0.03 0.03 0.03 0.00 0.03 0.00 88.00 57.00 35.00 18.50 0.01 0.03
1922 0.03 0.03 0.03 0.00 0.03 0.00 87.50 102.30 10.40 15.20 0.01 0.03
(...)
</code></pre>

There are ~100 observation stations that I'm interested in, each of which has a different start and end date for rainfall measurements. They're formatted as above in a single data file, with stations separated by "120 (station number) (station name)".

I first need to separate this file by station, then extract March, April, May and June for each year, then take a total of these months for each year. So far I'm messing around with loops (as below), but I understand this isn't the right way to go about it and would rather learn a better technique.
Thanks again for the help!

(Original question:)
I've got a large data set containing rainfall by season for ~100 years over 100+ locations. I'm trying to separate this data into more manageable arrays, and in particular I want to retrieve the sum of the rainfall for March, April, May and June for each station for each year.
The following is a simplified version of my code so far:

<pre><code>a <- array(1,dim=c(10,12))
for (i in 1:5) {

 #all data:
 assign(paste("station_",i,sep=""), a)

 #march - june data:
 assign(paste("station_",i,"_mamj",sep=""), a[,4:7])
}
</code></pre>

So this gives me <code>station_(i)_mamj</code>, which contains the data for the months I'm interested in for each station. Now I want to sum each row of this array and enter it in a new array called <code>station_(i)_mamj_tot</code>. Simple enough in theory, but I can't work out how to reference <code>station_(i)_mamj</code> so that the value of <code>i</code> changes with each iteration. Any help is much appreciated!
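The literal lookup asked about here can be sketched in two ways. The block below is a minimal illustration, not the asker's final code: it assumes the 12 columns of `a` are January to December (so March to June are columns 3:6, whereas the question's code uses 4:7), and the names `mamj_name`, `stations` and `mamj_tot` are made up for the example. Option 1 uses `get()` to fetch an object by a name built with `paste()`; Option 2 avoids string-built names by keeping the stations in a named list.

```r
# Toy data: 10 years (rows) x 12 months (columns), as in the question.
a <- array(1, dim = c(10, 12))

# Option 1: build each name with paste() and look the object up with get().
for (i in 1:5) {
  mamj_name <- paste("station_", i, "_mamj", sep = "")   # hypothetical helper name
  assign(mamj_name, a[, 3:6])                            # assumption: columns 3:6 = March..June
  assign(paste(mamj_name, "_tot", sep = ""),
         rowSums(get(mamj_name)))                        # one total per year (row)
}

# Option 2: keep the stations in a named list instead of separate variables.
stations <- lapply(1:5, function(i) a)
names(stations) <- paste("station_", 1:5, sep = "")
mamj_tot <- lapply(stations, function(m) rowSums(m[, 3:6]))
```

With the list, a single station is reached as `mamj_tot[["station_3"]]` or `mamj_tot[[paste("station_", i, sep = "")]]`, so no `assign()`/`get()` round trip is needed.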

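Best answer

This is begging for a data frame; after that it is a one-liner with a power tool like ddply (from the plyr package):

tot_mamj <- ddply(rain[rain$month %in% 3:6,-2], 'year', colwise(sum))

This gives the March/April/May/June (M/A/M/J) totals by year: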

   year station_1 station_2 station_3 station_4 station_5 ...
1  1972  8.618960  5.697739 10.083192  9.264512 11.152378 ...
2  1973 18.571748 18.903280 11.832462 18.262272 10.509621 ...
3  1974 22.415201 22.670821 32.850745 31.634717 20.523778 ...
4  1975 16.773286 17.683704 18.259066 14.996550 19.007762 ...
...

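Here is the complete working code. We create a data frame whose column names are 'station_n', plus extra columns for year and month (a factor, or an integer if you're lazy; see the footnote). You can then do arbitrary analysis by month or year, using plyr's split-apply-combine paradigm: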

library(plyr) # for d*ply, summarise
#library(reshape) # for melt

# Parameterize everything here, it's crucial for testing/debugging
all_years <- 1970:2011
nYears <- length(all_years)
nStations <- 101
# We want station names as a character vector (as opposed to simple indices)
station_names <- paste('station_', 1:nStations, sep='')

# One row per (year, month) pair. The original year=rep(c(1970:2011),12) with
# month=1:12 recycled both vectors in step, so most year/month pairs never occurred.
rain <- data.frame(
  year=rep(all_years, each=12),
  month=rep(1:12, times=nYears)
)
# Fill in NAs for all data
rain[,station_names] <- as.numeric(NA)
# Make 'month' a factor, to prevent any numerical funny stuff e.g. accidentally 'aggregating' it
rain$month <- factor(rain$month)

# For convenience, store the row indices for all years, M/A/M/J
I.mamj <- which(rain$month %in% 3:6)

# Insert made-up seasonal data for M/A/M/J for testing... leave everything else NA intentionally
rain[I.mamj,station_names] <- c(3,5,9,6) * runif(4*nYears*nStations)

# Get our aggregate of MAMJ totals, by year
# The '-2' column index means: "exclude month, to prevent it also getting 'aggregated'"
excludeMonthCol <- -2
tot_mamj <- ddply(rain[rain$month %in% 3:6, excludeMonthCol], 'year', colwise(sum))

# voila!! Example output from the original post. The values are random, so a
# rerun will differ, and with the year/month grid built above the rows start at 1970.
#    year station_1 station_2 station_3 station_4 station_5
# 1  1972  8.618960  5.697739 10.083192  9.264512 11.152378
# 2  1973 18.571748 18.903280 11.832462 18.262272 10.509621
# 3  1974 22.415201 22.670821 32.850745 31.634717 20.523778
# 4  1975 16.773286 17.683704 18.259066 14.996550 19.007762
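For readers who prefer not to depend on plyr, the same by-year totals can be sketched in base R with `aggregate()`. This is an alternative technique, not what the answer uses; the data frame `rain_demo`, its two stations and the three years are made up for the example.

```r
# Small made-up data set in the same layout as 'rain': one row per year/month.
set.seed(1)
years <- 1970:1972
rain_demo <- data.frame(
  year      = rep(years, each = 12),
  month     = rep(1:12, times = length(years)),
  station_1 = runif(12 * length(years)),
  station_2 = runif(12 * length(years))
)

# Keep March..June, drop the month column, then sum every station column by year.
mamj <- rain_demo[rain_demo$month %in% 3:6, names(rain_demo) != "month"]
tot_mamj_base <- aggregate(. ~ year, data = mamj, FUN = sum)
```

One design point to keep in mind: the formula interface of `aggregate()` uses `na.action = na.omit` by default, so rows containing an `NA` are dropped before summing rather than producing an `NA` total.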

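As a footnote, before I converted month from numeric to factor, it was silently being 'aggregated' (until I added the '-2' exclude-column reference). Better still, once month is a factor it refuses point-blank to be aggregated and throws an error, which is what you want when debugging: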

 ddply(rain[rain$month %in% 3:6, ], 'year', colwise(sum))
Error in Summary.factor(c(3L, 3L, 3L, 3L, 3L, 3L), na.rm = FALSE) : 
  sum not meaningful for factors
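The answer starts from a ready-made data frame, so here is a minimal sketch of getting from the raw file layout in the question to M/A/M/J totals per station and year. It rests on assumptions the post does not confirm: every station block starts with a header line beginning with "120 ", each data line is a year followed by 12 monthly values, and 9999 marks a missing value. The names `parse_block` and `mamj_by_station` are hypothetical.

```r
# Two station blocks in the layout shown in the question (values copied from it).
# With a real file this would be something like readLines("rainfall.txt") (hypothetical path).
raw <- c(
  "120 6272100 KHARTOUM 15.60 32.55 382 1899 1989 0.0",
  "1899 0.03 0.03 0.03 0.03 0.03 1.03 13.03 12.03 9999 6.03 0.03 0.03",
  "1900 0.03 0.03 0.03 0.03 0.03 23.03 80.03 47.03 23.03 8.03 0.03 0.03",
  "120 6272101 JEBEL AULIA 15.20 32.50 380 1920 1988 0.0",
  "1920 0.03 0.03 0.03 0.00 0.03 6.90 20.00 108.80 47.30 1.00 0.01 0.03"
)

is_header <- grepl("^120 ", raw)   # assumption: station headers start with "120 "
block_id  <- cumsum(is_header)     # which station block each line belongs to

parse_block <- function(lines) {
  header <- strsplit(trimws(lines[1]), " +")[[1]]
  # One numeric row per year: the year followed by 12 monthly values.
  vals <- do.call(rbind, lapply(strsplit(trimws(lines[-1]), " +"), as.numeric))
  vals[vals == 9999] <- NA         # assumption: 9999 marks a missing value
  data.frame(
    station  = header[2],          # station number, kept as text
    year     = vals[, 1],
    # Column 1 is the year, so columns 4:7 are March..June.
    mamj_tot = rowSums(vals[, 4:7, drop = FALSE]),
    stringsAsFactors = FALSE
  )
}

mamj_by_station <- do.call(rbind, lapply(split(raw, block_id), parse_block))
```

Only the station number is taken from the header because station names may contain spaces (for example JEBEL AULIA), which makes the remaining header fields shift position. A year with a missing month in March to June ends up with an `NA` total, since `rowSums()` is called without `na.rm = TRUE`.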

For "string - Referencing objects in R using a variable string", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/10588008/
