database - postgresql 9.6.4 : timestamp range query on large table takes forever-6ren

database - postgresql 9.6.4 : timestamp range query on large table takes forever

转载作者：搜寻专家更新时间：2023-10-30 22:07:25

25

4

我需要一些帮助来分析在包含 8366.014.2 亿行的大型表上执行的查询的不良性能，根据系统负载，计算需要长达 25 分钟到一个多小时的时间。

我创建了下表，其中包含一个组合键和 3 个索引:

CREATE TABLE IF NOT EXISTS ds1records(
 userid INT DEFAULT 0,
 clientid VARCHAR(255) DEFAULT '',
 ts TIMESTAMP,
 site VARCHAR(50) DEFAULT '',
 code VARCHAR(400) DEFAULT '');

CREATE UNIQUE INDEX IF NOT EXISTS primary_idx ON records (userid, clientid, ts, site, code);
CREATE INDEX IF NOT EXISTS userid_idx ON records (userid);
CREATE INDEX IF NOT EXISTS ts_idx ON records (ts);
CREATE INDEX IF NOT EXISTS userid_ts_idx ON records (userid ASC,ts DESC);

在 spring 批处理应用程序中，我正在执行一个如下所示的查询:

SELECT * 
  FROM records 
 WHERE userid = ANY(VALUES (2), ..., (96158 more userids) ) 
  AND ( ts < '2017-09-02' AND ts >= '2017-09-01' 
        OR ts < '2017-08-26' AND ts >= '2017-08-25' 
        OR ts < '2017-08-19' AND ts >= '2017-08-18' 
        OR ts < '2017-08-12' AND ts >= '2017-08-11')

用户 ID 在运行时确定(ID 的数量介于 95.000 和 110.000 之间)。对于每个用户，我需要提取当天和前三个工作日的页面浏览量。查询总是返回 3-4M 行之间的行。

使用 EXPLAIN ANALYZE 选项执行查询返回以下执行计划。

Nested Loop  (cost=1483.40..1246386.43 rows=3761735 width=70) (actual time=108.856..1465501.596 rows=3643240 loops=1)
   ->  HashAggregate  (cost=1442.38..1444.38 rows=200 width=4) (actual time=33.277..201.819 rows=96159 loops=1)
     Group Key: "*VALUES*".column1
     ->  Values Scan on "*VALUES*"  (cost=0.00..1201.99 rows=96159 width=4) (actual time=0.006..11.599 rows=96159 loops=1)
   ->  Bitmap Heap Scan on records  (cost=41.02..6224.01 rows=70 width=70) (actual time=8.865..15.218 rows=38 loops=96159)
     Recheck Cond: (userid = "*VALUES*".column1)
     Filter: (((ts < '2017-09-02 00:00:00'::timestamp without time zone) AND (ts >= '2017-09-01 00:00:00'::timestamp without time zone)) OR ((ts < '2017-08-26 00:00:00'::timestamp without time zone) AND (ts >= '2017-08-25 00:00:00'::timestamp without time zone)) OR ((ts < '2017-08-19 00:00:00'::timestamp without time zone) AND (ts >= '2017-08-18 00:00:00'::timestamp without time zone)) OR ((ts < '2017-08-12 00:00:00'::timestamp without time zone) AND (ts >= '2017-08-11 00:00:00'::timestamp without time zone)))
     Rows Removed by Filter: 792
     Heap Blocks: exact=77251145
     ->  Bitmap Index Scan on userid_ts_idx  (cost=0.00..41.00 rows=1660 width=0) (actual time=6.593..6.593 rows=830 loops=96159)
           Index Cond: (userid = "*VALUES*".column1)

我已经调整了一些 Postgres 调优参数的值(不幸的是没有成功):

effective_cache_size=15GB(可能没用，因为查询只执行一次)
shared_buffers=15GB
work_mem=3GB

该应用程序运行计算量大的任务(例如数据融合/数据注入(inject))并消耗大约 100GB 内存，因此系统硬件足够大，具有 125GB RAM 和 16 个内核(操作系统:Debian)。

我想知道为什么 postgres 没有在其执行计划中使用组合索引 userid_ts_idx？由于索引中的时间戳列以相反的顺序排序，我希望 postgres 使用它来为查询的范围部分找到匹配的元组，因为它可以顺序地遍历索引直到条件 ts < '2017-09-02 00:00:00 成立并返回所有值直到条件满足 ts >= 2017-09-01 00:00:00。相反，postgres 使用昂贵的位图堆扫描，如果我理解正确的话，它会进行线性表扫描。是我错误地配置了数据库设置还是我有概念上的误解？

更新

不幸的是，评论中建议的 CTE 没有带来任何改进。位图堆扫描已被顺序扫描取代，但性能仍然很差。以下是更新后的执行计划:

Merge Join  (cost=20564929.37..20575876.60 rows=685277 width=106) (actual time=2218133.229..2222280.192 rows=3907472 loops=1)
  Merge Cond: (ids.id = r.userid)
  Buffers: shared hit=2408684 read=181785
  CTE ids
    ->  Values Scan on "*VALUES*"  (cost=0.00..1289.70 rows=103176 width=4) (actual time=0.002..28.670 rows=103176 loops=1)
  CTE ts
    ->  Values Scan on "*VALUES*_1"  (cost=0.00..0.05 rows=4 width=32) (actual time=0.002..0.004 rows=4 loops=1)
  ->  Sort  (cost=10655.37..10913.31 rows=103176 width=4) (actual time=68.476..83.312 rows=103176 loops=1)
    Sort Key: ids.id
    Sort Method: quicksort  Memory: 7909kB
    ->  CTE Scan on ids  (cost=0.00..2063.52 rows=103176 width=4) (actual time=0.007..47.868 rows=103176 loops=1)
  ->  Sort  (cost=20552984.25..20554773.54 rows=715717 width=102) (actual time=2218059.941..2221230.585 rows=8085760 loops=1)
    Sort Key: r.userid
    Sort Method: quicksort  Memory: 1410084kB
    Buffers: shared hit=2408684 read=181785
    ->  Nested Loop  (cost=0.00..20483384.24 rows=715717 width=102) (actual time=885849.043..2214665.723 rows=8085767 loops=1)
          Join Filter: (ts.r @> r.ts)
          Rows Removed by Join Filter: 707630821
          Buffers: shared hit=2408684 read=181785
          ->  Seq Scan on records r  (cost=0.00..4379760.52 rows=178929152 width=70) (actual time=0.024..645616.135 rows=178929147 loops=1)
                Buffers: shared hit=2408684 read=181785
          ->  CTE Scan on ts  (cost=0.00..0.08 rows=4 width=32) (actual time=0.000..0.000 rows=4 loops=178929147)
Planning time: 126.110 ms
Execution time: 2222514.566 ms

最佳答案

如果您将该时间戳转换为日期并改为按值列表过滤，您应该得到不同的计划。

CREATE INDEX IF NOT EXISTS userid_ts_idx ON records (userid ASC,cast(ts AS date) DESC);

SELECT *
  FROM records
 WHERE userid = ANY(VALUES (2), ..., (96158 more userids) )
  AND cast(ts AS date) IN('2017-09-01','2017-08-25','2017-08-18','2017-08-11');

它是否会表现更好取决于您的数据和日期范围，因为我发现在我的案例中，即使日期值覆盖了整个表，Postgres 也会继续使用该索引(因此 seq 扫描会更好)。

Demo

关于database - postgresql 9.6.4 : timestamp range query on large table takes forever，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/46654608/

25

4

0

文章推荐： Vue.js 更改 {{ }} 标签

文章推荐： python - AWS Redshift 数据处理

文章推荐： sql - 如何在 SQL 中有效地找到多个记录的运行最新更新？

.net - .Take() 还是不.Take()，这是个问题
我有一个代表每个年龄段的人口(M，F)的集合。为了通过时间来预测人口，我必须首先对女性进行计算，以便我可以根据男性出生率的统计常数来计算新出生的男性和女性的百分比。也就是说，我有一个包含每岁男性和
clojure - clojure 中的 take-while 和 take-nth
我正在尝试从队列中获取 n 条消息(使用 langohr)。我有一个工作版本，但我想知道是否有更好的 clojurist 方法来做到这一点: (def not-nil? (complement nil
postgresql - 分析: Why a query taking could take so long,好像成本不高？
我有这些结果用于分析一个简单的查询，该查询不会从少于 200 条记录的表中返回超过 150 条记录，因为我有一个存储最新值的表，而其他字段是数据的 FK . 更新:稍后查看同一查询的新结果。该网站未公
c# - 如何在 .Take() 之前获得总结果计数 - 但在使用 .Take() 时
我正在使用 .Take() 来获取固定数量的结果。获取 TotalCountBeforeTake 的最佳方法是什么(即好像我没有使用 .Take())？我可以在不运行查询两次的情况下获得 Tota
Java 泛型 : Class that takes a class that takes another class
我有一个 BatchConfigurable 类 public class BatchConfigurable() {} 我正在尝试为其编写一个包装器。这将是另一个类，它采用此类或任何扩展 Batch
c# - 使用 Take() 从字节数组中获取字节，在 Take() 之后无法转换回 byte[]
byte[] result = memStream.ToArray(); memStream.Close(); byte[] temp = r
c# - Enumerable.Repeat with Union and Take 不返回 'Take' 值
很简单的问题。我有一个值列表，我想用空值填充这些值，这样我总是会返回 X 个项目。 List list = new List() { 10, 20, 30 }; IEnumerable values
angular - take is not a function : Angular 7 | Observable. take 抛出运行时错误
我正在构建一个购物车，并且我使用了一个购物车服务，在该服务中我将数量分配给产品/将产品添加到购物车。除了使用 take 获取可观察项 $ 的第一个实例之外，还有其他方法吗？我正在正确导入 take
c# - array.Take(13).Skip(x) 减去 take
这是欧拉计划的问题 8。我试图通过数字数组foreach，每次跳过最后一个数字并拉接下来的13个相邻数字数组。我的代码: for(int x = 0; x product) {
css - 显示 : table not take the dynamic height take their children height
我有 3 个 div 元素，一个是父元素，另外两个是子元素: dinesh pathak and their css are: #table {
java - hudson : Builds Taking Longer time than What it takes by running it Locally
我在 Hudson 上发现了异常行为。Hudson 作业大约需要 25 分钟，而当我在本地运行相同的作业时，需要 9 分钟。我在这里缺少什么？我增加了 JAVA_OPTS、MAVEN_OPTS，甚至
f# - 漏洞？ Seq.take 10 运行良好，Seq.take 100 不起作用
let a = [1;2;3;] for i in (a |> Seq.take 10) do Console.WriteLine(i) for i in (a |> Seq.take 100) do
c# - 可以在 c# linq 的查询表达式中使用 take 而不是使用 .Take(x) 吗？
我正在尝试编写一些 LINQ To SQL 代码来生成类似 SQL 的代码 SELECT t.Name, g.Name FROM Theme t INNER JOIN ( SELECT TOP
c# - LINQ，简化表达式 - take while take of sum of taken does not exceed given value
给定这样的设置.. class Product { int Cost; // other properties unimportant } var products = new List
c# - 在 List 上使用 Take(...) 时，是否会在应用 Take(...) 之前返回整个列表？
我有一个 List 类型的元素 public class FriendList { public List friends { get; set; } // List
c# - LINQ ToList().Take(10) vs Take(10).ToList() 哪个生成更高效的查询
给定以下 LINQ 语句，哪个更有效？一个: public List GetLatestLogEntries() { var logEntries = from entry in db.Lo
kotlin - Kotlin 中 toList().take(10) 和 take(10).toList() 之间的区别
我只是在尝试新的 kotlin 语言。我遇到了生成无限列表的序列。我生成了一个序列并尝试打印前 10 个元素。但是下面的代码没有打印任何东西: fun main(args: Array) {
javascript - 在 saga : `while(true) take()` vs `while(take())` vs. `takeEvery()` 中监听 Action 的最佳方式
我见过 sagas 以 3 种方式监听 Action : 1。 while(true) take() function* onUserDetailsRequest() { while(true)
c# - 当 skip 不能被 take 整除时，通过 pageIndex/pageSize 分页实现 skip/take 分页
假设我有一些神奇的分页黑盒类，它使用 pageIndex 和 pageSize 检索数据，如下所示: public class PaginatedList { // ... // Ch
混帐 "discrepancy": `git diff` does not take commit messages into consideration but SHA-1 generation takes commit messages while computing SHA-1
我有两个 git 分支 b' 和 b" 具有完全相同的 SHA-1 和因此内容。我提交 b ' 并在提交时，我使用 -x 应用 cherry-pick 而不是 merge 或 rebase单个提交到我

首页

博学

6Ren·AI

商城

database - postgresql 9.6.4 : timestamp range query on large table takes forever