MySQL performance optimization: ORDER BY datetime field

Reposted by: IT老高. Updated: 2023-10-28 13:00:05

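I have a table with roughly 100,000 blog postings, linked through a 1:n relationship to a table with 50 feeds. When I query both tables with a SELECT statement ordered by a datetime field of the postings table, MySQL always uses filesort, which makes the query very slow (> 1 second). Here is the schema of the postings table (simplified):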

+---------------------+--------------+------+-----+---------+----------------+
| Field               | Type         | Null | Key | Default | Extra          |
+---------------------+--------------+------+-----+---------+----------------+
| id                  | int(11)      | NO   | PRI | NULL    | auto_increment |
| feed_id             | int(11)      | NO   | MUL | NULL    |                |
| crawl_date          | datetime     | NO   |     | NULL    |                |
| is_active           | tinyint(1)   | NO   | MUL | 0       |                |
| link                | varchar(255) | NO   | MUL | NULL    |                |
| author              | varchar(255) | NO   |     | NULL    |                |
| title               | varchar(255) | NO   |     | NULL    |                |
| excerpt             | text         | NO   |     | NULL    |                |
| long_excerpt        | text         | NO   |     | NULL    |                |
| user_offtopic_count | int(11)      | NO   | MUL | 0       |                |
+---------------------+--------------+------+-----+---------+----------------+

Here is the feeds table:

+-------------+--------------+------+-----+---------+----------------+
| Field       | Type         | Null | Key | Default | Extra          |
+-------------+--------------+------+-----+---------+----------------+
| id          | int(11)      | NO   | PRI | NULL    | auto_increment |
| type        | int(11)      | NO   | MUL | 0       |                |
| title       | varchar(255) | NO   |     | NULL    |                |
| website     | varchar(255) | NO   |     | NULL    |                |
| url         | varchar(255) | NO   |     | NULL    |                |
+-------------+--------------+------+-----+---------+----------------+

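Here is the query that takes more than 1 second to execute. Note that the post_date field has an index (the field is not listed in the simplified schema above), but MySQL does not use it to sort the postings table: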

SELECT 
    `postings`.`id`, 
    UNIX_TIMESTAMP(postings.post_date) as post_date, 
    `postings`.`link`, 
    `postings`.`title`, 
    `postings`.`author`, 
    `postings`.`excerpt`, 
    `postings`.`long_excerpt`, 
    `feeds`.`title` AS feed_title, 
    `feeds`.`website` AS feed_website
FROM 
    (`postings`)
JOIN 
    `feeds` 
ON 
    `feeds`.`id` = `postings`.`feed_id`
WHERE 
    `feeds`.`type` = 1 AND 
    `postings`.`user_offtopic_count` < 10 AND 
    `postings`.`is_active` = 1
ORDER BY 
    `postings`.`post_date` desc
LIMIT 
    15

The output of EXPLAIN EXTENDED shows that MySQL is using filesort:

+----+-------------+----------+--------+---------------------------------------+-----------+---------+--------------------------+-------+-----------------------------+
| id | select_type | table    | type   | possible_keys                         | key       | key_len | ref                      | rows  | Extra                       |
+----+-------------+----------+--------+---------------------------------------+-----------+---------+--------------------------+-------+-----------------------------+
|  1 | SIMPLE      | postings | ref    | feed_id,is_active,user_offtopic_count | is_active | 1       | const                    | 30996 | Using where; Using filesort |
|  1 | SIMPLE      | feeds    | eq_ref | PRIMARY,type                          | PRIMARY   | 4       | feedian.postings.feed_id |     1 | Using where                 |
+----+-------------+----------+--------+---------------------------------------+-----------+---------+--------------------------+-------+-----------------------------+

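When I remove the ORDER BY part, MySQL stops using filesort. If you have any ideas on how to optimize this query so that MySQL uses an index both to select and to sort the data, please let me know. I have already tried a few things suggested in various blog posts, such as creating a combined index on all WHERE/ORDER BY fields, but that did not work either.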

Best answer

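Create a composite index on postings (is_active, post_date), in that order.

It will be used both for filtering on is_active and for ordering by post_date.

MySQL should show the REF access method on this index in EXPLAIN EXTENDED.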
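One way to check this is to run EXPLAIN on a trimmed-down version of the query. The following is only a sketch: it assumes the composite index already exists under the name ix_active_date (the name used later in this answer), the table alias and the shortened select list are my own simplifications, and it uses plain EXPLAIN because the EXTENDED keyword was removed in MySQL 8.0.

```sql
-- Assumption: an index on postings (is_active, post_date) already exists,
-- named ix_active_date as in the CREATE INDEX statement later in this answer.
EXPLAIN
SELECT p.id, p.post_date
FROM postings AS p
WHERE p.is_active = 1
ORDER BY p.post_date DESC
LIMIT 15;

-- What to look for in the output (the optimizer makes the final choice):
--   type  : ref
--   key   : ix_active_date
--   Extra : no "Using filesort"
```

If the key column shows a different index, the optimizer judged another plan cheaper, and the FORCE INDEX comparison described further down becomes the more reliable test.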

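Note that you have a RANGE filter condition on user_offtopic_count. That is why a single index cannot be used both to filter on this field and to sort by another field.

Depending on how selective your condition on user_offtopic_count is (i.e. how many rows satisfy user_offtopic_count < 10), it may be more useful to create an index on user_offtopic_count and let the matching rows be sorted by post_date afterwards.

To do this, create a composite index on postings (is_active, user_offtopic_count) and make sure the RANGE access method is used on this index.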
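To get a feel for that selectivity before creating anything, a rough count can help. The query below is a sketch that uses only columns from the question; the two column aliases are names I made up for illustration.

```sql
-- Compare how many active rows exist with how many of them
-- also pass the range filter on user_offtopic_count.
SELECT
    COUNT(*)                      AS active_rows,             -- hypothetical alias
    SUM(user_offtopic_count < 10) AS active_low_offtopic_rows -- hypothetical alias
FROM postings
WHERE is_active = 1;
```

If active_low_offtopic_rows is a small fraction of active_rows, the range index leaves few rows to sort and the filesort is cheap. If the two numbers are close, the filter removes almost nothing, and reading rows in post_date order from the (is_active, post_date) index is likely to reach the 15-row limit quickly.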

Which index is faster depends on your data distribution. Create both indexes, FORCE each one in turn, and see which is faster:

CREATE INDEX ix_active_offtopic ON postings (is_active, user_offtopic_count);
CREATE INDEX ix_active_date ON postings (is_active, post_date);

SELECT 
    `postings`.`id`, 
    UNIX_TIMESTAMP(postings.post_date) as post_date, 
    `postings`.`link`, 
    `postings`.`title`, 
    `postings`.`author`, 
    `postings`.`excerpt`, 
    `postings`.`long_excerpt`, 
    `feeds`.`title` AS feed_title, 
    `feeds`.`website` AS feed_website
FROM 
    `postings` FORCE INDEX (ix_active_offtopic)
JOIN 
    `feeds` 
ON 
    `feeds`.`id` = `postings`.`feed_id`
WHERE 
    `feeds`.`type` = 1 AND 
    `postings`.`user_offtopic_count` < 10 AND 
    `postings`.`is_active` = 1
ORDER BY 
    `postings`.`post_date` desc
LIMIT 
    15;

/* This should show RANGE access with few rows and keep the FILESORT */

SELECT 
    `postings`.`id`, 
    UNIX_TIMESTAMP(postings.post_date) as post_date, 
    `postings`.`link`, 
    `postings`.`title`, 
    `postings`.`author`, 
    `postings`.`excerpt`, 
    `postings`.`long_excerpt`, 
    `feeds`.`title` AS feed_title, 
    `feeds`.`website` AS feed_website
FROM 
    `postings` FORCE INDEX (ix_active_date)
JOIN 
    `feeds` 
ON 
    `feeds`.`id` = `postings`.`feed_id`
WHERE 
    `feeds`.`type` = 1 AND 
    `postings`.`user_offtopic_count` < 10 AND 
    `postings`.`is_active` = 1
ORDER BY 
    `postings`.`post_date` desc
LIMIT 
    15;

/* This should show REF access with lots of rows and no FILESORT */
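Before timing the two forced variants, it can be useful to compare their plans side by side. The sketch below prefixes a trimmed version of the query with EXPLAIN. The aliases p and f and the shortened select list are my own simplifications, and the commented-out cleanup at the end assumes, purely for illustration, that ix_active_offtopic turned out to be the slower index.

```sql
-- Plan with the range index forced:
-- expect type = range on postings, plus "Using filesort".
EXPLAIN
SELECT p.id, p.post_date
FROM postings AS p FORCE INDEX (ix_active_offtopic)
JOIN feeds AS f ON f.id = p.feed_id
WHERE f.type = 1
  AND p.user_offtopic_count < 10
  AND p.is_active = 1
ORDER BY p.post_date DESC
LIMIT 15;

-- Plan with the date index forced:
-- expect type = ref on postings and no "Using filesort".
EXPLAIN
SELECT p.id, p.post_date
FROM postings AS p FORCE INDEX (ix_active_date)
JOIN feeds AS f ON f.id = p.feed_id
WHERE f.type = 1
  AND p.user_offtopic_count < 10
  AND p.is_active = 1
ORDER BY p.post_date DESC
LIMIT 15;

-- Hypothetical cleanup once a winner is known; uncomment the index that lost.
-- DROP INDEX ix_active_offtopic ON postings;
-- DROP INDEX ix_active_date ON postings;
```

Dropping the losing index keeps writes to postings cheaper, since every extra index has to be maintained on each insert and update.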

Regarding MySQL performance optimization: ORDER BY datetime field, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/714950/
