python - Pandas read_html() missing columns

Reposted by: 行者123. Updated: 2023-11-28 02:46:21

I am using the following read_html() call to read a table (behind a paywall):

import pandas as pd

df = pd.read_html('http://markets.ft.com/data/equities/tearsheet/' +
                  'financials?s=BAG:LSE&subView=BalanceSheet&periodType=a')[0]

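It parses fine, except that the last two columns are missing. I am using the latest Anaconda release (Python 3.5, pandas 0.18.1, html5lib, BeautifulSoup4).

The beginning of the output looks like this: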

                Fiscal data as of Jan 30 2016  2016    2015    2014
                                      ASSETS   NaN     NaN     NaN
             Cash And Short Term Investments  6.80      25      13
                      Total Receivables, Net    50      49      45
                             Total Inventory    16      17      16

(too large to show in full)

The beginning of the HTML looks like this:

<table class="mod-ui-table">
            <thead>
                <tr>
                    <th class="mod-ui-table__header--text">Fiscal data as of Jan 30 2016</th>
                    <th>2016</th>
                    <th class="mod-ui-hide-xsmall">2015</th>
                    <th class="mod-ui-hide-xsmall">2014</th>
                    <th class="mod-ui-hide-xsmall">2013</th>
                    <th class="mod-ui-hide-xsmall">2012</th>
                </tr>
            </thead>
            <tr class="mod-ui-table__row--section-header">
                <th colspan="6">ASSETS</th>
            </tr>
            <tr class="mod-ui-table__row--striped">
                <th class="mod-ui-table__header--row-label">Cash And Short Term Investments</th>
                <td>6.80</td>
                <td class="mod-ui-hide-xsmall">25</td>
                <td class="mod-ui-hide-xsmall">13</td>
                <td class="mod-ui-hide-xsmall">0.91</td>
                <td class="mod-ui-hide-xsmall">8.29</td>
            </tr>
            <tr>
                <th class="mod-ui-table__header--row-label">Total Receivables, Net</th>
                <td>50</td>
                <td class="mod-ui-hide-xsmall">49</td>
                <td class="mod-ui-hide-xsmall">45</td>
                <td class="mod-ui-hide-xsmall">42</td>
                <td class="mod-ui-hide-xsmall">37</td>
            </tr>

The end of the HTML looks like this:

<tr class="mod-ui-table__row--highlight">
                    <th class="mod-ui-table__header--row-label">Total liabilities &amp; shareholders&#39; equity</th>
                    <td>269</td>
                    <td class="mod-ui-hide-xsmall">255</td>
                    <td class="mod-ui-hide-xsmall">227</td>
                    <td class="mod-ui-hide-xsmall">215</td>
                    <td class="mod-ui-hide-xsmall">196</td>
                </tr>
                <tr class="mod-ui-table__row--striped">
                    <th class="mod-ui-table__header--row-label">Total common shares outstanding</th>
                    <td>117</td>
                    <td class="mod-ui-hide-xsmall">117</td>
                    <td class="mod-ui-hide-xsmall">117</td>
                    <td class="mod-ui-hide-xsmall">117</td>
                    <td class="mod-ui-hide-xsmall">117</td>
                </tr>
                <tr>
                    <th class="mod-ui-table__header--row-label">Treasury shares - common primary issue</th>
                    <td>0</td>
                    <td class="mod-ui-hide-xsmall">0</td>
                    <td class="mod-ui-hide-xsmall">0</td>
                    <td class="mod-ui-hide-xsmall">0</td>
                    <td class="mod-ui-hide-xsmall">--</td>
                </tr>
            </table>

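If it is not obvious what might be going wrong, I would appreciate some pointers on how to start stepping through the read_html() code to find the root of the problem. I am currently new to Python/pdb.

Best answer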

It turns out that if you are not logged in to the FT website, you only get three years of data.
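One way to confirm this kind of cause is to look at the HTML the script actually received, rather than the HTML shown by a logged-in browser. The sketch below uses only the standard library and counts the header cells in the first `<thead>`. The helper `count_header_cells` is hypothetical, the `logged_in` sample is abbreviated from the markup in the question, and the `anonymous` sample is an assumption about what a non-logged-in response might look like.

```python
from html.parser import HTMLParser


class HeaderCounter(HTMLParser):
    """Counts <th> cells inside the first <thead> of a document."""

    def __init__(self):
        super().__init__()
        self.in_thead = False
        self.done = False
        self.count = 0

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if tag == "thead":
            self.in_thead = True
        elif tag == "th" and self.in_thead:
            self.count += 1

    def handle_endtag(self, tag):
        # Stop after the first header so row labels (<th> in body rows) are ignored.
        if tag == "thead" and self.in_thead:
            self.in_thead = False
            self.done = True


def count_header_cells(html):
    """Hypothetical helper: number of header cells in the first <thead>."""
    parser = HeaderCounter()
    parser.feed(html)
    return parser.count


# Abbreviated from the markup in the question (as seen in a logged-in browser).
logged_in = (
    "<table><thead><tr><th>Fiscal data as of Jan 30 2016</th><th>2016</th>"
    "<th>2015</th><th>2014</th><th>2013</th><th>2012</th></tr></thead>"
    "<tr><th>Total Receivables, Net</th><td>50</td></tr></table>"
)
# Assumed shape of the response served without a login (three years only).
anonymous = (
    "<table><thead><tr><th>Fiscal data as of Jan 30 2016</th><th>2016</th>"
    "<th>2015</th><th>2014</th></tr></thead>"
    "<tr><th>Total Receivables, Net</th><td>50</td></tr></table>"
)

print(count_header_cells(logged_in))   # 6
print(count_header_cells(anonymous))   # 4
```

If the count for the downloaded page already matches the number of columns in the DataFrame, the parser is not dropping anything and the difference lies in what the server sent.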

So I am now working out how to log in to the FT website (possibly using Twill).

There is a related question here
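As a rough illustration of the login idea (not the approach the answer ended up using, and not Twill), the sketch below builds a cookie-aware opener with the standard library so that a session established by a login request carries over to later page fetches. The login URL and the form field names `username` and `password` are hypothetical. A real site, FT included, may require CSRF tokens, redirects, or JavaScript that this does not handle.

```python
import http.cookiejar
import urllib.parse
import urllib.request


def make_opener():
    """Return an opener that stores cookies, so a login persists across requests."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def build_login_request(login_url, username, password):
    """Build a form POST. The field names are hypothetical, not FT's real ones."""
    form = urllib.parse.urlencode({"username": username, "password": password})
    return urllib.request.Request(login_url, data=form.encode("utf-8"), method="POST")


def fetch_html(opener, url):
    """Fetch a page through the cookie-aware opener and decode it as text."""
    with opener.open(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

In use, one would call `opener.open(build_login_request(...))` once, then pass the text returned by `fetch_html(opener, page_url)` to `pd.read_html` (wrapped in `io.StringIO` on recent pandas versions) instead of passing the URL directly.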

Regarding python - Pandas read_html() missing columns, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/41394409/
