gpt4 book ai didi

Ruby Net::FTP 超时线程

转载 作者:数据小太阳 更新时间:2023-10-29 06:49:56 25 4
gpt4 key购买 nike

我试图通过使用线程 FTP 连接来加速多个 FTP 下载。我的问题是我总是有线程挂起。我正在寻找一种干净的方法来告诉 FTP 它需要重试 ftp 事务,或者至少知道 FTP 连接何时挂起。

在下面的代码中,我对 5/6 个单独的 FTP 连接进行线程处理,其中每个线程都有一个预期下载的文件列表。当脚本完成时,一些线程挂起并且无法加入。我使用变量@last_updated 来表示上次成功下载的时间。如果当前时间+20秒超过了@last_updated,就杀掉剩下的线程。有没有更好的办法?

threads = []
max_thread_pool = 5
running_threads = 0
Thread.abort_on_exception = true

existing_file_count = 0
files_downloaded = 0

errors = []
missing_on_the_server = []
@last_updated = Time.now

if ids.length > 0
ids.each_slice(ids.length / max_thread_pool) do |id_set|
threads << Thread.new(id_set) do |t_id_set|
running_threads += 1
thread_num = running_threads
thread_num.freeze
puts "making thread # #{thread_num}"
begin
ftp = Net::FTP.open(@remote_site)
ftp.login(@remote_user, @remote_password)
ftp.binary = true
#ftp.debug_mode = true
ftp.passive = false
rescue
raise "Could not establish FTP connection"
end
t_id_set.each do |id|
@last_updated = Time.now
rmls_path = "/_Photos/0#{id[0,2]}00000/#{id[2,1]}0000/#{id[3,1]}000/#{id}-1.jpg"
local_path = "#{@photos_path}/01/#{id}-1.jpg"
progress += 1
unless File.exist?(local_path)
begin
ftp.getbinaryfile(rmls_path, local_path)
puts "ftp reponse: #{ftp.last_response}"
# find the percentage of progress just for fun
files_downloaded += 1
p = sprintf("%.2f", ((progress.to_f / total) * 100))
puts "Thread # #{thread_num} > %#{p} > #{progress}/#{total} > Got file: #{local_path}"
rescue
errors << "#{thread_num} unable to get file > ftp response: #{ftp.last_response}"
puts errors.last
if ftp.last_response_code.to_i == 550
# Add the missing file to the missing list
missing_on_the_server << errors.last.match(/\d{5,}-\d{1,2}\.jpg/)[0]
end
end
else
puts "found file: #{local_path}"
existing_file_count += 1
end
end
puts "closing FTP connection #{thread_num}"
ftp.close
end # close thread
end
end

# If @last_updated has not been updated on the server in over 20 seconds, wait 3 seconds and check again
while Time.now < @last_updated + 20 do
sleep 3
end
# threads are hanging so joining the threads does not work.
threads.each { |t| t.kill }

最佳答案

对我来说有用的技巧是使用 ruby​​ 的 Timeout.timeout 来确保 FTP 连接没有挂起。

begin
Timeout.timeout(10) do
ftp.getbinaryfile(rmls_path, local_path)
end
# ...
rescue Timeout::Error
errors << "#{thread_num}> File download timed out for: #{rmls_path}"
puts errors.last
rescue
errors << "unable to get file > ftp reponse: #{ftp.last_response}"
# ...
end

挂起的 FTP 下载导致我的线程看起来挂起。现在线程不再挂起,我可以使用更合适的方式处理线程:

threads.each { |t| t.join }

而不是丑陋的:

# If @last_updated has not been updated on the server in over 20 seconds, wait 3 seconds and check again
while Time.now < @last_updated + 20 do
sleep 3
end
# threads are hanging so joining the threads does not work.
threads.each { |t| t.kill }

关于Ruby Net::FTP 超时线程,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/1230172/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com