MongoDB 数组或单独集合-6ren

MongoDB 数组或单独集合

转载作者：可可西里更新时间：2023-11-01 10:31:50

26

4

我有一个用户集合。每个用户可能有: - 大量关注者(100K+)并且可能正在关注大量其他用户。 - 大量收藏夹 - 查看的大量项目

我看到 2 个模式设计。关于查询，我需要找到用户关注的人我还需要知道给定用户的收藏夹和观看列表。所有列表(followers, following, favorites 必须有唯一条目

我试图通过 Google 查找类似的问题或主题，但找不到任何内容。

MongoDB 能否处理像这样的大型数组，或者我应该采用设计方法 2，将映射存储在单独的集合中，这样我就可以拥有无限数量的映射？

非常感谢您的宝贵意见。

我选择选项 2，因为它允许我拥有无限数量的映射。但在我走那条路之前，我想检查一下是否会有我不想要的问题。

从一种设计转向另一种设计的成本很高。

Design 1 (EMBEDDED ARRAY TO STORE MAPPINGS):
[
{
  user: bob, //(key)
  followers: ["Alex", "john", "steve", "mark", ... 200K+ entries]
  following: ["Mila", "mark", "Bill", "Joe", ... 100K+ entries]
  favorites: [ObjectI(1), ObjectId(2),...5K+ entries]
  watched: [ObjectI(4), ObjectId(5),...100K+ entries]
},
{
  user: Nick, //(key)
  followers: [bob", "kery", "Jery", "Tom", ... 200K+ entries]
  following: ["Tim", "Shane", "Sally", "Joe", ... 100K+ entries]
  favorites: [ObjectI(4), ObjectId(5),...5K+ entries]
  watched: [ObjectI(2), ObjectId(9),...100K + entries]
}
]

设计 2(单独的集合存储映射)

user_followers collection:
[
 { user: bob, follower: "Alex" }, //key: (user, follower)
 { user: bob, follower: "john"}, 
 { user: bob, follower: "steve"}, 
 { user: bob, follower: "mark"}
  ... 200K+ entries
]

user_following collection:
[
 { user: bob, following: "Mila"},  //key (user, following)
 { user: bob, following: "mark"},
 { user: bob, following: "Bill"}, 
 { user: bob, following: "Joe"},
 ... 100K+ entries
]

user_favorites collection:
[
 { user: bob, favorite: ObjectId(1)},
 { user: bob, favorite: ObjectId(3)},
 { user: bob, favorite: ObjectId(6)},
 ... 5k entries
},

最佳答案

Can MongoDB handle large array like these or I should go with design approach 2 where store the mapping in separate collections which allow me to have unlimited # of mappings?

在 MongoDB 中，文档可以是 at most 16 MB .对于您的第一个设计，您可能会达到我认为的极限。

但是关于第二种设计，在我看来 user_followers 和 user_following 集合只是重复相同的数据:如果 bob 正在关注 martha，那么 bob 是玛莎，所以你可以将这两个集合合并为一个，条目如 { followed: 'martha', follower: 'bob' }

更新

评论中有关于如何处理双向关系或查询索引的问题。

给定两个用户 bob 和 martha，他们可以没有任何关系，或者 bob 关注 martha，或者 martha 关注 bob，或者 bob 和 martha 相互关注，即三种不同的可能关系。

现在对于 bob 跟随 martha 的情况，followers 集合将是

[
  {
    followed: 'martha',
    follower: 'bob'
  }
]

对于 martha 跟随 bob 的情况，它将是

[
  {
    followed: 'bob',
    follower: 'martha'
  }
]

当两者互相跟随时

[
  {
    followed: 'martha',
    follower: 'bob'
  }, {
    followed: 'bob',
    follower: 'martha'
  }
]

此设计中唯一开销较大的操作在设计 1 和 2 中也是开销很大的，原因相同:我们需要隔离两个集合之间的公共(public)元素；该操作正在寻找双向关系(例如，鲍勃和玛莎互相跟随)。

就索引而言，只有两个有用，{ follower: 1, followed: 1 } 和 { followed: 1, follower: 1 } (两者都只对排序有用，因为这两者中的任何一个都涵盖所有过滤情况)。

现在回到设计 2，上面的用例应该是:

鲍勃跟着玛莎

user_followers

[
  {
    user: 'martha',
    follower: 'bob'
  }
]

user_following

[
  {
    user: 'bob',
    following: 'martha'
  }
]

玛莎跟随鲍勃

user_followers

[
  {
    user: 'bob',
    follower: 'martha'
  }
]

user_following

[
  {
    user: 'martha',
    following: 'bob'
  }
]

鲍勃和玛莎互相跟随

user_followers

[
  {
    user: 'bob',
    follower: 'martha'
  }, {
    user: 'martha',
    follower: 'bob'
  }
]

user_following

[
  {
    user: 'martha',
    following: 'bob'
  }, {
    user: 'bob',
    following: 'martha'
  }
]

现在我们可以看到，正如我所指出的，设计 2 将复制所有关注者信息，但绝对没有任何好处。

关于MongoDB 数组或单独集合，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/21980692/

26

4

0

文章推荐： mongodb - Casbah Mongo 作为 scala 数组 : is this the most elegant way?

文章推荐： node.js - Mongo shell 不显示内容

文章推荐： java - 如果 DBCursor 没有关闭怎么办？

文章推荐： python-3.x - tensorflow 1.8 与 python 3.6 在 windows64

mysql - WHERE(集合)IN(集合)
编辑:我似乎问错了这个问题。我正在尝试寻找一种方法来查询一个集合是否在另一个集合中可用。例如: SELECT * FROM something WHERE (1, 3) IN (1, 2, 3, 4
jquery - $.each([集合]) 与 $([集合]).each()
这两种方法似乎 produce the same results ，但我一直很难真正说服人们第二种方法有效，因为它显然并不为人所知。 // Create some data var foo = { '
kotlin - 为什么我们在Kotlin中仅使用[列表， map ，集合]集合？
我一直在学习Kotlin，并且遇到过Collections API。在Kotlin之前，我一直在学习Java，并且我知道Java中有很多不同类型的Collections API。例如，我们使用List
Java addAll(集合)与 new ArrayList(集合)
为什么我会得到不同的行为: Collection col2 = new ArrayList(col); 集合 col2 = new ArrayList(); col2.addAll(col) 我正在与
javascript: 集合[id][prop] = 集合[id][prop] || []//将空数组分配给对象时出现问题
所以我有一个代表专辑信息的 JSON 对象。给定“function updateRecords(id, prop, value)”我希望能够更新每个条目。正确的完成代码如下。我得到了指示，粗体部分，
java - 我需要哪种 Java 对象类型(集合/列表/集合/其他)？
我想存储一个对象集合，这些对象根据它们所代表的值进行键控。这些键可以重复。例如: [4] => Bob [5] => Mary [5] => Sue [9] => Steve [10] =>
collections - ArrayList(集合 c)VS HashSet(集合 c)
在检查 ArrayList API 时，我注意到一些看起来很奇怪的东西。确实，这里是 ArrayList 构造函数实现，其中 Collection 作为参数传递: public ArrayList(
collections - 如何在我的 YAML Swagger 定义中将属性类型定义为字符串列表(列表、集合、数组、集合)
我正在为 API 编写一个 swagger 定义文件。 API 是用于 GET 请求的 /path/to/my/api: get: summary: My Custom API d
java - 如何将 Scala 集合 Seq[(Int, Seq[String])] 转换为 Java 集合 List[(int, List[String])]？
我知道scala.collection包中有两个非常有用的对象，可以帮助我们实现这个目标: JavaConverters(如果我想明确说明并准确说明我要转换的内容) JavaConversions(如
Automapper 集合
我已经阅读了无数其他帖子，但似乎无法弄清楚发生了什么，所以是时候寻求帮助了。我正在尝试将包含集合的域实体映射到也包含集合的 dtos。这是一个原始示例； (我提前为代码墙道歉，我尽量保持简短):
Java:集合
我正在创建一个具有 ArrayList 的类，因此当我调用构造函数时，它会初始化该数组: public class ElementsList { private ArrayList list;
polymer this.$ 集合
我正在阅读事件指南和指南的开头，它说: You can also add an event listener to any element in the this.$ collection using
Python - 集合
我是 Python 新手，想知道如何使用键在字典中存储不同数据类型的列表例如 - {[Key1,int1,int1,String1] , [Key2,int2,int2,String2], [Key
C# 集合
int[] mylist = { 2, 4, 5 }; IEnumerable list1 = mylist; list1.ToList().Add(1); // why 1 does not get
Java 集合
我在 UI 表单中的每一行之后将以下内容添加到 HashMap 集合中声明 Map> map = new HashMap>(); List valSetOne = new ArrayList();
Java - 集合
我正在开发我的第一个 Java 项目，我有一个问题。问题应该很简单(虽然代码不是那么短，但没有理由被吓倒:))。我创建了一个基本的角色扮演游戏，并且有一个定义每个角色的抽象类“Character”。在
MongoDB 集合
我正在开发一款应用程序，可以为用户收集推文、Facebook 状态和 Facebook 照片。目前，用户确切地设定了他们希望这种收获发生的时间和时间，并且蜘蛛会在此期间拉取数据。 when 和 to
C# 集合？
有谁知道在 C# 中是否有与 Java 的 Set 集合等效的好方法？我知道您可以通过填充但忽略值来使用 Dictionary 或 HashTable 在某种程度上模仿集合，但这不是一种非常优雅的方式
oracle 集合
EXISTS 该函数返回集合中第一个元素的索引，如果集合为空，返回NULLNULLNULL Collecti
07、RDF 集合
RDF集合是通过属性 rdf:parseType="Collection" 来描述仅包含指定成员的组 rdf:parseType="Collection" 属

首页

博学

6Ren·AI

商城

MongoDB 数组或单独集合