gpt4 book ai didi

python - 我怎样才能在python cgi中找到上传的文件名

转载 作者:太空宇宙 更新时间:2023-11-04 11:03:29 25 4
gpt4 key购买 nike

我制作了如下所示的简单网络服务器。

import BaseHTTPServer, os, cgi
import cgitb; cgitb.enable()

html = """
<html>
<body>
<form action="" method="POST" enctype="multipart/form-data">
File upload: <input type="file" name="upfile">
<input type="submit" value="upload">
</form>
</body>
</html>
"""
class Handler(BaseHTTPServer.BaseHTTPRequestHandler):
def do_GET(self):
self.send_response(200)
self.send_header("content-type", "text/html;charset=utf-8")
self.end_headers()
self.wfile.write(html)

def do_POST(self):
ctype, pdict = cgi.parse_header(self.headers.getheader('content-type'))
if ctype == 'multipart/form-data':
query = cgi.parse_multipart(self.rfile, pdict)
upfilecontent = query.get('upfile')
if upfilecontent:
# i don't know how to get the file name.. so i named it 'tmp.dat'
fout = file(os.path.join('tmp', 'tmp.dat'), 'wb')
fout.write (upfilecontent[0])
fout.close()
self.do_GET()

if __name__ == '__main__':
server = BaseHTTPServer.HTTPServer(("127.0.0.1", 8080), Handler)
print('web server on 8080..')
server.serve_forever()

在BaseHTTPRequestHandler的do_Post方法中,成功获取到上传的文件数据。

但我不知道如何获取上传文件的原始名称。self.rfile.name 只是一个“套接字”如何获取上传的文件名?

最佳答案

您在那里使用的非常破损的代码作为起点(例如,查看 global rootnode,其中 无处 使用了名称 rootnode - - 显然是经过半编辑的源代码,而且做得很糟糕)。

无论如何,您在 POST 中使用什么形式的“客户端”?它如何设置 upfile 字段?

为什么不使用正常的 FieldStorage 方法,如 Python's docs 中所述?这样,您可以使用适当字段的 .file 属性来获取要读取的类文件对象,或者使用其 .value 属性将其全部读取到内存中,然后将其作为字符串获取,加上该字段的 .filename 属性以了解上传文件的名称。关于 FieldStorage 的更详细但简洁的文档是 here .

编辑:既然 OP 已经编辑了 Q 以澄清问题,我看到了问题:BaseHTTPServer does not set the environment according to CGI 规范,所以 cgi 模块不是很适合它。不幸的是,环境设置的唯一简单方法是从 CGIHTTPServer.py 窃取和破解一大段代码(不是为了重用,因此需要,叹息,复制和粘贴编码),例如……:

def populenv(self):
path = self.path
dir, rest = '.', 'ciao'

# find an explicit query string, if present.
i = rest.rfind('?')
if i >= 0:
rest, query = rest[:i], rest[i+1:]
else:
query = ''

# dissect the part after the directory name into a script name &
# a possible additional path, to be stored in PATH_INFO.
i = rest.find('/')
if i >= 0:
script, rest = rest[:i], rest[i:]
else:
script, rest = rest, ''

# Reference: http://hoohoo.ncsa.uiuc.edu/cgi/env.html
# XXX Much of the following could be prepared ahead of time!
env = {}
env['SERVER_SOFTWARE'] = self.version_string()
env['SERVER_NAME'] = self.server.server_name
env['GATEWAY_INTERFACE'] = 'CGI/1.1'
env['SERVER_PROTOCOL'] = self.protocol_version
env['SERVER_PORT'] = str(self.server.server_port)
env['REQUEST_METHOD'] = self.command
uqrest = urllib.unquote(rest)
env['PATH_INFO'] = uqrest
env['SCRIPT_NAME'] = 'ciao'
if query:
env['QUERY_STRING'] = query
host = self.address_string()
if host != self.client_address[0]:
env['REMOTE_HOST'] = host
env['REMOTE_ADDR'] = self.client_address[0]
authorization = self.headers.getheader("authorization")
if authorization:
authorization = authorization.split()
if len(authorization) == 2:
import base64, binascii
env['AUTH_TYPE'] = authorization[0]
if authorization[0].lower() == "basic":
try:
authorization = base64.decodestring(authorization[1])
except binascii.Error:
pass
else:
authorization = authorization.split(':')
if len(authorization) == 2:
env['REMOTE_USER'] = authorization[0]
# XXX REMOTE_IDENT
if self.headers.typeheader is None:
env['CONTENT_TYPE'] = self.headers.type
else:
env['CONTENT_TYPE'] = self.headers.typeheader
length = self.headers.getheader('content-length')
if length:
env['CONTENT_LENGTH'] = length
referer = self.headers.getheader('referer')
if referer:
env['HTTP_REFERER'] = referer
accept = []
for line in self.headers.getallmatchingheaders('accept'):
if line[:1] in "\t\n\r ":
accept.append(line.strip())
else:
accept = accept + line[7:].split(',')
env['HTTP_ACCEPT'] = ','.join(accept)
ua = self.headers.getheader('user-agent')
if ua:
env['HTTP_USER_AGENT'] = ua
co = filter(None, self.headers.getheaders('cookie'))
if co:
env['HTTP_COOKIE'] = ', '.join(co)
# XXX Other HTTP_* headers
# Since we're setting the env in the parent, provide empty
# values to override previously set values
for k in ('QUERY_STRING', 'REMOTE_HOST', 'CONTENT_LENGTH',
'HTTP_USER_AGENT', 'HTTP_COOKIE', 'HTTP_REFERER'):
env.setdefault(k, "")
os.environ.update(env)

这可以进一步简化,但需要花费一些时间和精力完成该任务:-(。

有了这个 populenv 函数,我们可以重新编码:

def do_POST(self):
populen(self)
form = cgi.FieldStorage(fp=self.rfile)
upfilecontent = form['upfile'].value
if upfilecontent:
fout = open(os.path.join('tmp', form['upfile'].filename), 'wb')
fout.write(upfilecontent)
fout.close()
self.do_GET()

...从此过上幸福的生活;-)。 (当然,使用任何像样的 WSGI 服务器,甚至是 the demo one 会容易得多,但是这个练习对 CGI 及其内部结构有指导意义;-)。

关于python - 我怎样才能在python cgi中找到上传的文件名,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/3380501/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com