gpt4 book ai didi

MySQL Select with several join 首先对连接表进行全表扫描

转载 作者:行者123 更新时间:2023-11-29 02:50:58 26 4
gpt4 key购买 nike

我有以下查询:

SELECT 
Impressions.id AS `Impressions__id`,
Impressions.timestamp AS `Impressions__timestamp`,
Impressions.name AS `Impressions__name`,
Impressions.lat AS `Impressions__lat`,
Impressions.lng AS `Impressions__lng`,
Impressions.personas_count AS `Impressions__personas_count`,
Impressions.modified AS `Impressions__modified`,
Beacons.id AS `Beacons__id`,
Beacons.uuid AS `Beacons__uuid`,
Beacons.major AS `Beacons__major`,
Beacons.minor_dec AS `Beacons__minor_dec`,
Beacons.minor_hex AS `Beacons__minor_hex`,
Beacons.impressions_count AS `Beacons__impressions_count`,
Beacons.created AS `Beacons__created`,
Beacons.modified AS `Beacons__modified`,
Zones.id AS `Zones__id`,
Zones.location_id AS `Zones__location_id`,
Zones.beacon_id AS `Zones__beacon_id`,
Zones.fixture_no AS `Zones__fixture_no`,
Zones.placement AS `Zones__placement`,
Zones.floor AS `Zones__floor`,
Zones.impressions_count AS `Zones__impressions_count`,
Zones.ignore_further_incidents AS `Zones__ignore_further_incidents`,
Zones.is_reviewed AS `Zones__is_reviewed`,
Zones.review_date AS `Zones__review_date`,
Zones.created AS `Zones__created`,
Zones.modified AS `Zones__modified`,
Locations.id AS `Locations__id`,
Locations.retailer_id AS `Locations__retailer_id`,
Locations.google_place_id AS `Locations__google_place_id`,
Locations.regional_name AS `Locations__regional_name`,
Locations.location AS `Locations__location`,
Locations.store_no AS `Locations__store_no`,
Locations.lat AS `Locations__lat`,
Locations.lng AS `Locations__lng`,
Locations.address1 AS `Locations__address1`,
Locations.address2 AS `Locations__address2`,
Locations.address3 AS `Locations__address3`,
Locations.city AS `Locations__city`,
Locations.state AS `Locations__state`,
Locations.postal_code AS `Locations__postal_code`,
Locations.region_id AS `Locations__region_id`,
Locations.country_id AS `Locations__country_id`,
Locations.zones_count AS `Locations__zones_count`,
Locations.contacts_count AS `Locations__contacts_count`,
Locations.created AS `Locations__created`,
Locations.modified AS `Locations__modified`,
Devices.id AS `Devices__id`,
Devices.os AS `Devices__os`,
Devices.bluetooth_enabled AS `Devices__bluetooth_enabled`,
Devices.impressions_count AS `Devices__impressions_count`,
Devices.modified AS `Devices__modified`,
Regions.id AS `Regions__id`,
Regions.country_name AS `Regions__country_name`,
Regions.subdiv AS `Regions__subdiv`,
Regions.subdiv_name AS `Regions__subdiv_name`,
Regions.level_name AS `Regions__level_name`,
Regions.alt_names AS `Regions__alt_names`,
Regions.subdiv_star AS `Regions__subdiv_star`,
Regions.subdiv_id AS `Regions__subdiv_id`,
Regions.country_id AS `Regions__country_id`,
Regions.country_code_2 AS `Regions__country_code_2`,
Regions.country_code_3 AS `Regions__country_code_3`,
Countries.id AS `Countries__id`,
Countries.country_name AS `Countries__country_name`,
Countries.alt_names AS `Countries__alt_names`,
Countries.code2 AS `Countries__code2`,
Countries.code3 AS `Countries__code3`,
Countries.iso_cc AS `Countries__iso_cc`,
Countries.fips_code AS `Countries__fips_code`,
Countries.fips_country_name AS `Countries__fips_country_name`,
Countries.un_region AS `Countries__un_region`,
Countries.un_subregion AS `Countries__un_subregion`,
Countries.comments AS `Countries__comments`,
Countries.created AS `Countries__created`,
Countries.modified AS `Countries__modified`
FROM
impressions Impressions
inner join beacons Beacons ON Beacons.id = (Impressions.beacon_id)
inner JOIN zones Zones ON Zones.id = (Impressions.zone_id)
inner JOIN devices Devices ON Devices.id = (Impressions.device_id)
INNER JOIN locations Locations ON Locations.id = (Zones.location_id)
LEFT JOIN regions Regions ON Regions.id = (Locations.region_id)
LEFT JOIN countries Countries ON Countries.id = (Locations.country_id)
ORDER BY
Impressions.timestamp desc
LIMIT
15 OFFSET 15

此查询大约需要 6 秒才能运行。 EXPLAIN 输出如下:

+----+-------------+-------------+--------+---------------------------------------+----------------+---------+---------------------------------+-------+---------------------------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+-------------+--------+---------------------------------------+----------------+---------+---------------------------------+-------+---------------------------------+
| 1 | SIMPLE | Devices | ALL | PRIMARY | NULL | NULL | NULL | 43274 | Using temporary; Using filesort |
| 1 | SIMPLE | Impressions | ref | zone_idx,device_id_idx2,beacon_id_idx | device_id_idx2 | 8 | gen1_d2go.Devices.id | 3 | NULL |
| 1 | SIMPLE | Zones | eq_ref | PRIMARY,fk_location_idx,comp | PRIMARY | 8 | gen1_d2go.Impressions.zone_id | 1 | NULL |
| 1 | SIMPLE | Beacons | eq_ref | PRIMARY | PRIMARY | 8 | gen1_d2go.Impressions.beacon_id | 1 | NULL |
| 1 | SIMPLE | Locations | eq_ref | PRIMARY | PRIMARY | 8 | gen1_d2go.Zones.location_id | 1 | NULL |
| 1 | SIMPLE | Regions | eq_ref | PRIMARY | PRIMARY | 4 | gen1_d2go.Locations.region_id | 1 | NULL |
| 1 | SIMPLE | Countries | eq_ref | PRIMARY | PRIMARY | 4 | gen1_d2go.Locations.country_id | 1 | NULL |
+----+-------------+-------------+--------+---------------------------------------+----------------+---------+---------------------------------+-------+---------------------------------+
7 rows in set (0.00 sec)

我不明白为什么它支持对 Devices 表进行全面扫描。这些表都是索引的,ImpressionsDevicesCREATE语句如下:

展示次数

CREATE TABLE `impressions` (
`id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
`device_id` bigint(20) unsigned NOT NULL,
`beacon_id` bigint(20) unsigned NOT NULL,
`zone_id` bigint(20) unsigned NOT NULL,
`timestamp` datetime NOT NULL,
`google_place_id` bigint(20) unsigned DEFAULT NULL,
`name` varchar(60) DEFAULT NULL,
`lat` decimal(12,7) DEFAULT NULL,
`lng` decimal(12,7) DEFAULT NULL,
`personas_count` int(10) unsigned DEFAULT '0',
`created` datetime DEFAULT NULL,
`modified` datetime DEFAULT NULL,
PRIMARY KEY (`id`,`timestamp`),
KEY `zone_idx` (`zone_id`),
KEY `device_id_idx2` (`device_id`),
KEY `beacon_id_idx` (`beacon_id`),
KEY `timestamp_idx` (`id`,`timestamp`),
KEY `ALL` (`id`,`timestamp`,`name`,`lat`,`lng`,`personas_count`,`modified`),
CONSTRAINT `beacon_id` FOREIGN KEY (`beacon_id`) REFERENCES `beacons` (`id`) ON DELETE NO ACTION ON UPDATE NO ACTION,
CONSTRAINT `device2` FOREIGN KEY (`device_id`) REFERENCES `devices` (`id`) ON DELETE NO ACTION ON UPDATE NO ACTION,
CONSTRAINT `zone_FK` FOREIGN KEY (`zone_id`) REFERENCES `zones` (`id`) ON DELETE NO ACTION ON UPDATE NO ACTION
) ENGINE=InnoDB AUTO_INCREMENT=303907 DEFAULT CHARSET=utf8;

设备

CREATE TABLE `devices` (
`id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
`device_id` bigint(20) unsigned NOT NULL,
`advertiser_id` char(36) NOT NULL,
`os` varchar(80) DEFAULT NULL,
`bluetooth_enabled` tinyint(1) DEFAULT NULL,
`impressions_count` int(10) unsigned DEFAULT '0',
`created` datetime DEFAULT NULL,
`modified` datetime DEFAULT NULL,
PRIMARY KEY (`id`),
KEY `advertiser_idx` (`advertiser_id`),
KEY `ad_dev` (`device_id`,`advertiser_id`),
KEY `device_id` (`device_id`)
) ENGINE=InnoDB AUTO_INCREMENT=53628 DEFAULT CHARSET=utf8;

踢球者是:

当我在 FROM 印象之后使用 FORCE INDEX (timestamp_idx) 时,效果很好。它使用该索引,并在大约 0.078 秒内运行。我不知道为什么它试图避免使用该索引,或者根本不首先从该表中进行选择。

已更新

在 FORCE INDEX 中包含 EXPLAIN

Current database: gen1_d2go

+----+-------------+-------------+--------+------------------------------+---------------+---------+---------------------------------+------+-------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+-------------+--------+------------------------------+---------------+---------+---------------------------------+------+-------+
| 1 | SIMPLE | Impressions | index | NULL | timestamp_idx | 5 | NULL | 15 | NULL |
| 1 | SIMPLE | Zones | eq_ref | PRIMARY,fk_location_idx,comp | PRIMARY | 8 | gen1_d2go.Impressions.zone_id | 1 | NULL |
| 1 | SIMPLE | Beacons | eq_ref | PRIMARY | PRIMARY | 8 | gen1_d2go.Impressions.beacon_id | 1 | NULL |
| 1 | SIMPLE | Locations | eq_ref | PRIMARY | PRIMARY | 8 | gen1_d2go.Zones.location_id | 1 | NULL |
| 1 | SIMPLE | Regions | eq_ref | PRIMARY | PRIMARY | 4 | gen1_d2go.Locations.region_id | 1 | NULL |
| 1 | SIMPLE | Countries | eq_ref | PRIMARY | PRIMARY | 4 | gen1_d2go.Locations.country_id | 1 | NULL |
| 1 | SIMPLE | Devices | eq_ref | PRIMARY | PRIMARY | 8 | gen1_d2go.Impressions.device_id | 1 | NULL |
+----+-------------+-------------+--------+------------------------------+---------------+---------+---------------------------------+------+-------+
7 rows in set (0.01 sec)

最佳答案

Impressions 需要一个索引 starting with timestamp。这样,优化器将有望决定以 timestamp 顺序扫描 Impressions,从而避免排序等。

边课...您有 3 个以 id, timestamp 开头的索引。一个是 PRIMARY KEY。这意味着其他两个是不必要的。

因此可以获得额外的加速:

ALTER TABLE Impressions
DROP INDEX timestamp_idx, -- as already mentioned
DROP INDEX ALL, -- ditto
DROP PRIMARY KEY, -- to rearrange it
ADD PRIMARY KEY(timestamp, id), -- thus
ADD INDEX(id); -- and keep AUTO_INCREMENT happy

为什么?通过使 PK 以时间戳开始,查询可以扫描数据而不是在某些索引和数据之间跳转。这将加快相关查询的速度。警告:它可能会伤害其他查询。

其他注意事项...

CHAR(36) 闻起来像 UUID,对吗?但是对于 utf8,它恰好需要 108 个字节!。更改为 CHAR(36) CHARACTER SET ascii NOT NULL,这样它就只需要 36 个字节。 (或者您可以转换为 `BINARY(16) 以节省更多;但那是另一回事,需要更多代码。)

除非您有数十亿行,否则 BIGINT(8 字节)对于 id 来说太过分了。 INT UNSIGNED 只有 4 个字节。

更小以各种方式转化为更快。

关于MySQL Select with several join 首先对连接表进行全表扫描,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/35921916/

26 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com