ruby - How to find two elements of the same array that together contain all vowels

Reposted by: 行者123  Updated: 2023-12-04 23:11:48

I want to iterate over a given array, for example:

["goat", "action", "tear", "impromptu", "tired", "europe"]

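I want to look at all possible pairs.

The desired output is a new array containing every pair whose combined letters include all five vowels. Each such pair should be joined into a single element of the output array: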

["action europe", "tear impromptu"]

I tried the following code, but got this error message:

No implicit conversion of nil into string.

def all_vowel_pairs(words)
  pairs = []

  (0..words.length).each do |i|                       # iterate through words
    (0..words.length).each do |j|                   # for every word, iterate through words again
      pot_pair = words[i].to_s + words[j]         # build string from pair
      if check_for_vowels(pot_pair)               # throw string to helper-method.
        pairs << words[i] + " " + words[j]      # if it returns true, concatenate and push to output array "pairs"
      end
    end
  end
  pairs
end

# helper-method to check for if a string has all vowels in it
def check_for_vowels(string)
  vowels = "aeiou"
  founds = []
  string.each_char do |char|
    if vowels.include?(char) && !founds.include?(char)
      founds << char
    end
  end
  if founds.length == 5
    return true
  end
  false
end
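
A likely cause of the error, offered as a sketch rather than as part of the original question or answer: the range 0..words.length is inclusive, so its last index is words.length, where words[j] is nil, and adding nil to a String raises the TypeError shown above. Using an exclusive range and starting the inner loop after i avoids both the nil access and pairing a word with itself or visiting a pair twice. The method name all_vowel_pairs_fixed is hypothetical, and the helper is condensed from the one in the question.

```ruby
# Hypothetical corrected version of the question's method (the name is an assumption).
def all_vowel_pairs_fixed(words)
  pairs = []
  (0...words.length).each do |i|            # exclusive range: stops at the last valid index
    ((i + 1)...words.length).each do |j|    # start after i so each pair is visited once
      pairs << "#{words[i]} #{words[j]}" if check_for_vowels(words[i] + words[j])
    end
  end
  pairs
end

# Helper condensed from the question: true if the string contains all five vowels.
def check_for_vowels(string)
  "aeiou".each_char.all? { |vowel| string.include?(vowel) }
end

words = ["goat", "action", "tear", "impromptu", "tired", "europe"]
p all_vowel_pairs_fixed(words)
#=> ["action europe", "tear impromptu"]
```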

Best answer

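The following code is intended to provide an efficient way to construct the desired array when the number of words is large. Note that, unlike the other answers, it does not use the method Array#combination.
The first part of the Explanation section (below) gives an overview of the approach taken by the algorithm. The details are then filled in.
Code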

require 'set'

VOWELS = ["a", "e", "i", "o", "u"]
VOWELS_SET = VOWELS.to_set

def all_vowel_pairs(words)
  h = words.each_with_object({}) {|w,h| (h[(w.chars & VOWELS).to_set] ||= []) << w}
  h.each_with_object([]) do |(k,v),a|
    vowels_needed = VOWELS_SET-k
    h.each do |kk,vv|
      next unless kk.superset?(vowels_needed)
      v.each {|w1| vv.each {|w2| a << "%s %s" % [w1, w2] if w1 < w2}}
    end
  end
end

Example

words = ["goat", "action", "tear", "impromptu", "tired", "europe", "hear"]

all_vowel_pairs(words)
  #=> ["action europe", "hear impromptu", "impromptu tear"]
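
As a cross-check on this example, a brute-force version built on Array#combination (the method this answer deliberately avoids) should return the same pairs, though not necessarily in the same order. This is only a sketch: the name brute_force_pairs is hypothetical and is not one of the methods from the other answers.

```ruby
VOWELS = %w[a e i o u]

# Hypothetical baseline: test every unordered pair of words directly.
def brute_force_pairs(words)
  words.combination(2).filter_map do |w1, w2|
    first, second = [w1, w2].sort   # alphabetical order within a pair
    "#{first} #{second}" if (VOWELS - (w1 + w2).chars).empty?
  end
end

words = ["goat", "action", "tear", "impromptu", "tired", "europe", "hear"]
p brute_force_pairs(words).sort
#=> ["action europe", "hear impromptu", "impromptu tear"]
```

The baseline examines all n*(n-1)/2 pairs, which is the cost the grouping approach is meant to reduce for large word lists.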

Explanation
For the given example the steps are as follows.

VOWELS_SET = VOWELS.to_set
  #=> #<Set: {"a", "e", "i", "o", "u"}> 

h = words.each_with_object({}) {|w,h| (h[(w.chars & VOWELS).to_set] ||= []) << w}
  #=> {#<Set: {"o", "a"}>=>["goat"],
  #    #<Set: {"a", "i", "o"}>=>["action"],
  #    #<Set: {"e", "a"}>=>["tear", "hear"],
  #    #<Set: {"i", "o", "u"}>=>["impromptu"],
  #    #<Set: {"i", "e"}>=>["tired"],
  #    #<Set: {"e", "u", "o"}>=>["europe"]}

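It is seen that the keys of h are subsets of the five vowels. Each value is an array of the elements of words (that is, words) that contain the vowels given by the key and no other vowels. The values therefore collectively form a partition of words. When the number of words is large we would expect h to have 31 keys (2**5 - 1), one for each non-empty subset of the five vowels (a word containing none of these vowels would add one more key, the empty set).
We now loop through the key-value pairs of h. For each pair, with key k and value v, we determine the set of missing vowels (vowels_needed), then loop through those key-value pairs [kk, vv] of h for which kk is a superset of vowels_needed. All combinations of elements of v and vv are then added to the array being returned (after an adjustment to avoid counting each pair of words twice).
Continuing,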
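
The partition described above can also be seen with Enumerable#group_by, which builds the same kind of hash as the each_with_object call. This is a minimal sketch for illustration; the variable name by_vowels is an assumption and does not appear in the answer.

```ruby
require 'set'

VOWELS = %w[a e i o u]
words = ["goat", "action", "tear", "impromptu", "tired", "europe", "hear"]

# Group the words by the set of vowels each one contains.
by_vowels = words.group_by { |w| (w.chars & VOWELS).to_set }

by_vowels.each do |vowel_set, group|
  puts "#{vowel_set.to_a.sort.join} => #{group.inspect}"
end

# Every word lands in exactly one group, so the groups partition the input.
puts by_vowels.values.flatten.sort == words.sort
```

Sets with the same members are equal as hash keys regardless of insertion order, which is why "tear" and "hear" share a group.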

enum = h.each_with_object([])
  #=> #<Enumerator: {#<Set: {"o", "a"}>=>["goat"],
  #                  #<Set: {"a", "i", "o"}>=>["action"],
  #                  ...
  #                  #<Set: {"e", "u", "o"}>=>["europe"]}: 
  #     each_with_object([])>

The first value is generated by enum and passed to the block, and the block variables are assigned values:

(k,v), a = enum.next
  #=> [[#<Set: {"o", "a"}>, ["goat"]], []]

See Enumerator#next.
The individual variables are assigned values by array decomposition:

k #=> #<Set: {"o", "a"}> 
v #=> ["goat"] 
a #=> []
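
The same mechanics can be tried on a small hash. The data below is purely illustrative and not taken from the answer; the sketch shows what Enumerator#next returns for each_with_object on a hash and how the nested pattern (k, v), a unpacks it.

```ruby
h = { "x" => [1, 2], "y" => [3] }   # illustrative data only

enum = h.each_with_object([])
first = enum.next
p first
#=> [["x", [1, 2]], []]

# The parentheses unpack the inner [key, value] pair; a receives the memo array.
(k, v), a = first
p k  #=> "x"
p v  #=> [1, 2]
p a  #=> []
```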

The block calculations are now performed.

vowels_needed = VOWELS_SET-k
  #=> #<Set: {"e", "i", "u"}> 
h.each do |kk,vv|
  next unless kk.superset?(vowels_needed)
  v.each {|w1| vv.each {|w2| a << "%s %s" % [w1, w2] if w1 < w2}}
end

The word "goat" (v) has the vowels "o" and "a", so it can only be paired with words that contain the vowels "e", "i" and "u" (and possibly "o" and/or "a"). The expression

next unless kk.superset?(vowels_needed)

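skips those keys of h (kk) that are not supersets of vowels_needed. See Set#superset?. None of the words in words contains "e", "i" and "u", so the array a is unchanged.
The next element is now generated by enum and passed to the block, and the block variables are assigned values: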
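
A small sketch of how Set#superset? behaves for the "goat" case, where vowels_needed is {"e", "i", "u"}. The candidate sets are the keys of h from the example plus one made-up key (the last one) that would pass the test.

```ruby
require 'set'

vowels_needed = Set["e", "i", "u"]

candidates = [
  Set["a", "i", "o"],   # "action": missing "e" and "u"
  Set["e", "u", "o"],   # "europe": missing "i"
  Set["i", "o", "u"],   # "impromptu": missing "e"
  Set["e", "i", "u"]    # hypothetical key, not present in the example
]

candidates.each do |kk|
  puts "#{kk.to_a.join} superset? #{kk.superset?(vowels_needed)}"
end
```

Only the last, hypothetical key passes, which matches the observation that "goat" has no partner in the example.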

(k,v), a = enum.next
  #=> [[#<Set: {"a", "i", "o"}>, ["action"]], []] 
k #=> #<Set: {"a", "i", "o"}> 
v #=> ["action"] 
a #=> []

The block calculation begins:

vowels_needed = VOWELS_SET-k
  #=> #<Set: {"e", "u"}>

We see that h has only one key-value pair whose key is a superset of vowels_needed:

kk = %w|e u o|.to_set
  #=> #<Set: {"e", "u", "o"}> 
vv = ["europe"]

We therefore execute:

v.each {|w1| vv.each {|w2| a << "%s %s" % [w1, w2] if w1 < w2}}

which adds one element to a:

a #=> ["action europe"]

The clause if w1 < w2 ensures that "europe action" is not added to a later in the calculations.
If v (words containing "a", "i" and "o") and vv (words containing "e", "u" and "o") were instead:

v  #=> ["action", "notification"]
vv #=> ["europe", "route"]

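we would add "action europe", "action route" and "notification route" to a. ("europe notification" would be added later, when k #=> #<Set: {"e", "u", "o"}>.)
Benchmarks
I benchmarked my method against those suggested by others using @theTinMan's Fruity benchmark code. The only differences were the array of words to be tested and the addition of my method to the benchmark, which I named cary. For the array of words to be considered I selected 600 words at random from an English word file on my computer: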
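
The hypothetical case above can be checked by running the answer's method on just those four words. The method body is reproduced unchanged from the Code section so that the sketch is self-contained; only the input list is new.

```ruby
require 'set'

VOWELS = ["a", "e", "i", "o", "u"]
VOWELS_SET = VOWELS.to_set

# Reproduced from the answer's Code section.
def all_vowel_pairs(words)
  h = words.each_with_object({}) {|w,h| (h[(w.chars & VOWELS).to_set] ||= []) << w}
  h.each_with_object([]) do |(k,v),a|
    vowels_needed = VOWELS_SET-k
    h.each do |kk,vv|
      next unless kk.superset?(vowels_needed)
      v.each {|w1| vv.each {|w2| a << "%s %s" % [w1, w2] if w1 < w2}}
    end
  end
end

result = all_vowel_pairs(["action", "notification", "europe", "route"])
p result
#=> ["action europe", "action route", "notification route", "europe notification"]
```

The first three pairs are produced while k is {"a", "i", "o"}; "europe notification" appears last because it is produced while k is {"e", "u", "o"}.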

words = IO.readlines('/usr/share/dict/words', chomp: true).sample(600)
words.first 10
  #=> ["posadaship", "explosively", "expensilation", "conservatively", "plaiting",
  #    "unpillared", "intertwinement", "nonsolidified", "uraemic", "underspend"]

This array was found to contain 46,436 pairs of words that contain all five vowels.
The results are shown below.

compare {
  _viktor { viktor(words) }
  _ttm1 { ttm1(words) }
  _ttm2 { ttm2(words) }
  _ttm3 { ttm3(words) }
  _cary { cary(words) }
}

Running each test once. Test will take about 44 seconds.
_cary is faster than _ttm3 by 5x ± 0.1
_ttm3 is faster than _viktor by 50.0% ± 1.0%
_viktor is faster than _ttm2 by 30.000000000000004% ± 1.0%
_ttm2 is faster than _ttm1 by 2.4x ± 0.1

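I then compared cary with ttm3 for 1,000 randomly selected words. This array was found to contain 125,068 pairs of words that contain all five vowels. The results were as follows: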

Running each test once. Test will take about 19 seconds.
_cary is faster than _ttm3 by 3x ± 1.0

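To get a feel for the variability of the benchmark I ran this last comparison twice more, each time with a fresh random selection of 1,000 words. That gave the following results: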

Running each test once. Test will take about 17 seconds.
_cary is faster than _ttm3 by 5x ± 1.0

Running each test once. Test will take about 18 seconds.
_cary is faster than _ttm3 by 4x ± 1.0

As can be seen, there is considerable variation between samples.

Regarding ruby - How to find two elements of the same array that together contain all vowels, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/59347316/
