letters [1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k-6ren">
gpt4 book ai didi

r - 按字母顺序排序时,字母 "y"位于 "i"之后

转载 作者:行者123 更新时间:2023-12-03 06:23:14 25 4
gpt4 key购买 nike

当使用函数sort(x)时,其中x是一个字符,字母“y”跳到中间,就在字母“i”之后:

> letters
[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s" "t"
[21] "u" "v" "w" "x" "y" "z"

> sort(letters)
[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "y" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s"
[21] "t" "u" "v" "w" "x" "z"

原因可能是我位于立陶宛,这是“类似立陶宛语”的字母排序,但我需要正常排序。如何在 R 代码中将排序方法更改回正常?

我在 Win7 上使用 R 2.15.2。

最佳答案

您需要更改 R 运行的区域设置。要么对整个 Windows 安装执行此操作(这似乎不是最佳),要么在 R session 中通过以下方式执行此操作:

Sys.setlocale("LC_COLLATE", "C")

您可以使用任何其他有效的区域设置字符串来代替“C”,但这应该会让您回到所需的字母排序顺序。

阅读?locales了解更多信息。

我认为值得注意的是姊妹函数 Sys.getlocale(),它查询区域设置参数的当前设置。因此你可以这样做

(locCol <- Sys.getlocale("LC_COLLATE"))
Sys.setlocale("LC_COLLATE", "lt_LT")
sort(letters)
Sys.setlocale("LC_COLLATE", locCol)
sort(letters)
Sys.getlocale("LC_COLLATE")

## giving:
> (locCol <- Sys.getlocale("LC_COLLATE"))
[1] "en_GB.UTF-8"
> Sys.setlocale("LC_COLLATE", "lt_LT")
[1] "lt_LT"
> sort(letters)
[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "y" "j" "k" "l" "m" "n"
[16] "o" "p" "q" "r" "s" "t" "u" "v" "w" "x" "z"
> Sys.setlocale("LC_COLLATE", locCol)
[1] "en_GB.UTF-8"
> sort(letters)
[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o"
[16] "p" "q" "r" "s" "t" "u" "v" "w" "x" "y" "z"
> Sys.getlocale("LC_COLLATE")
[1] "en_GB.UTF-8"

这当然是@Hadley's Answer 所显示的with_collat​​e() 在安装devtools 后会更简洁地执行的操作。

关于r - 按字母顺序排序时,字母 "y"位于 "i"之后,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/14458314/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com