MongoDb 与 Cassandra :Read/Write myths?-6ren

MongoDb 与 Cassandra :Read/Write myths?

转载作者：IT老高更新时间：2023-10-28 12:31:18

25

4

在网上阅读几篇关于 MongoDB 与 Cassandra 读/写性能的文章，

写

一般来说，Cassandra 的写入性能在数据量很大的情况下比 Mongo 的要好。请参阅下面的声明。

Cassandra's storage engine provides constant-time writes no matter how big your data set grows. Writes are more problematic in MongoDB, partly because of the b-tree based storage engine, but more because of the per database write lock.

这是我的问题:- 这个陈述仍然正确吗？据我了解，Mongo 支持每个文档而不是每个数据库的锁定。正确的？那么目前Cassandra在写性能上还是比Mongo更好吗？如果是，为什么？

阅读

一般来说，Mongo 的读取性能比 Cassandra 好，但我没有找到任何理由让 Mongo 的读取性能优于 Cassandra 的？

更新:-

来自 Jared Answer 的 this forum

Reads are more efficient in MongoDB's storage engine than they are in Cassandra. Cassandra's storage engine performs very well on writes because it stores data in an append only format. This makes great use of spinning disk drives that have poor seek times, but can do serial writes very quickly. But the downside is that when you do a read, you often need to scan through several versions of an object to get the most recent version to return to the caller. MongoDB updates data in place. This means it does more random IO when writes are processed, but it has the benefit of being faster when processing reads, since you can find the exact location of the object on disk in one b-tree lookup.

它帮助我了解 Cassandra 在删除/编辑现有记录时更快，因为它必须最后附加它，而不是像 Mongo 那样必须先搜索然后编辑它的地方编辑。这使得 cassandra 在编写方面比 Mongo 更好

但同样的事情让 Mongo 比 Cassandra 慢，因为 Cassandra 必须扫描同一记录的多个版本才能获取最新版本以返回给调用者

另一个原因是blog为什么 cassandra 写得更好

MongoDB with its “single master” model can take writes only on the primary. The secondary servers can only be used for reads. So essentially if you have three node replica set, only the master is taking writes and the other two nodes are only used for reads. This greatly limits write scalability. You can deploy multiple shards but essentially only 1/3 of your data nodes can take writes. Cassandra with its “multiple master” model can take writes on any server. Essentially your write scalability is limited by the number of servers you have in the cluster. The more servers you have in the cluster, the better it will scale.

来自同一个blog为什么 Mongo 在阅读方面比 cassandra 好

Secondary indexes are a first-class construct in MongoDB. This makes it easy to index any property of an object stored in MongoDB even if it is nested. This makes it really easy to query based on these secondary indexes. Cassandra has only cursory support for secondary indexes. Secondary indexes are also limited to single columns and equality comparisons. If you are mostly going to be querying by the primary key then Cassandra will work well for you.

最佳答案

问题的答案:是的。最新的 MongoDB 支持每个文档的锁。 https://docs.mongodb.com/manual/core/wiredtiger/

以下是写操作的基准:https://www.datastax.com/nosql-databases/benchmarks-cassandra-vs-mongodb-vs-hbase根据这些基准， Cassandra 在规模上表现更好(在集群中的节点数量较多时)。

希望对您有所帮助。

以下是有关您的问题的一些详细信息，可能也会有所帮助。

关于 Cassandra

Cassandra 正在使用针对大量写入进行了优化的 LSM-tree。 https://docs.datastax.com/en/cassandra/2.1/cassandra/dml/dml_manage_ondisk_c.html

一些细节:

执行写入时，数据会立即写入提交日志。提交日志是一种崩溃恢复机制。在写入提交日志之前，写入不会被视为成功。数据写入commit log后，写入memtable。在最新版本的 Cassandra 中，memtable 主要存储在 native 内存中，而不是 JVM 堆中。所以它也提高了性能。

当存储在 memtable 中的对象数量达到阈值时，memtable 的内容会在称为 SSTable 的文件中刷新到磁盘。然后创建一个新的内存表。一旦一个 memtable 被刷新到一个 SSTable，它就是不可变的。

向 Cassandra 写入值不需要任何类型的读取或查找，因为所有写入都是追加操作。

关于 MongoDB

默认情况下，MongoDB 使用 MMAPv1 存储引擎，该引擎使用 B-tree (https://docs.mongodb.com/manual/core/mmapv1/)，但最近版本的 MongoDB 使用 WiredTiger 存储引擎 (https://docs.mongodb.com/manual/core/wiredtiger/)，它也可以支持 LSM-tree。

关于锁:WiredTiger MongoDB 支持文档级锁，但 MMAPv1 支持集合级并发控制。

一些有用的文章:
https://dba.stackexchange.com/questions/121160/mongodb-mmapv1-vs-wiredtiger-storage-engines
https://docs.mongodb.com/manual/faq/concurrency/
https://www.percona.com/blog/2016/01/06/mongodb-revs-you-up-what-storage-engine-is-right-part-1/

关于MongoDb 与 Cassandra :Read/Write myths?，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/47970547/

25

4

0

文章推荐： c++ - 在 clang-format 中，惩罚有什么作用？

文章推荐： c - 是否可以确定持有互斥锁的线程？

文章推荐： c++ - 我需要一个互斥锁来阅读吗？

文章推荐： python - 滚动或滑动窗口迭代器？

rust - 为什么 Read::read 和 Read::read_exact 需要初始化传递给它们的缓冲区？
我有一个阅读器，其中包含有关 51*51 网格的信息，其中网格上的每个点都由 f32 表示。 .我想将这些数据读入一个向量，以便我可以轻松处理它: pub fn from_reader(reader:
sql-server - 为什么在 SQL Server 中首次执行查询时 'physical reads' 小于 'read-ahead reads' 和 'logical reads'？
我重新启动了 SQL Server 2005 并运行了统计 IO 的查询。我得到了这些结果:表“xxx”。扫描计数 1，逻辑读取 789，物理读取 3，预读读取 794，... 预读读取数是读取并放
lisp - defstruct - :read-only is not read only
在 CLHS 中，我为 :read-only x 读到:“当 x 为真时，这指定不能更改此插槽；它将始终包含构造时提供的值。” 我可以做到这一点(CCL、SBCL): CL-USER> (defstr
multithreading - “reads before reads”在内存排序中是什么意思？
让我们考虑一下这句话(Total Store Ordering): reads are ordered before reads, writes before writes, and reads be
rust - Read::read 是否保证附加数据而不覆盖任何现有数据？
我正在开发一个 SMTP 库，它使用缓冲读取器通过网络读取行。我想要一种安全的方式来从网络读取数据，而不依赖于 Rust 内部机制来确保代码按预期工作。具体来说，我想知道 Read trait 是否
Clojure & ClojureScript : clojure. core/read-string, clojure.edn/read-string 和 cljs.reader/read-string
我不清楚所有这些读取字符串函数之间的关系。嗯，很明显clojure.core/read-string可以读取 pr[n] 输出的任何序列化字符串甚至 print-dup .也很清楚clojure.ed
c - 如何使 read() 非阻塞并重置 read()
所以我做了这个功能，就像倒计时一样。我想在倒计时减少时读取命令。我的大问题是让 read() 在倒计时减少时等待输入。如您所见，我尝试使用 select() 但在第一个 printf 之后("time
echart报错Cannot read properties of undefined (reading ‘type‘)
这是我vue3+echart5 遇到的报错：Cannot read properties of undefined (reading ‘type‘) 这个问题需要搞清楚两个关键方法： toRaw：作
c - LLVM 内存依赖性分析中的 Read after Read 依赖性
下图中，左边是C代码，右边是未优化的LLVM IR形式。 The Figure 在 IR 上运行 MemoryDependenceAnalysis 可查找内存依赖性。原始代码及其 IR 等效代码中
bash - 为什么管道输入到 "read"仅在馈入 "while read ..."构造时才有效？
这个问题在这里已经有了答案: Read values into a shell variable from a pipe (17 个答案) 关闭 3 年前。我一直在尝试像这样从程序输出中读取环境变
c - 需要像 read() 这样的函数将整数数据读入缓冲区并获得与 read() 相同的缓冲区值
当我输入相同的整数时，如何将整数转换为与使用 read(0,buff,nbytes) 获得的缓冲区相同的值/编码字符？我正在尝试编写类似 read() 的东西，但用整数数据代替读取到缓冲区的文件描述符
linux - “read”命令不在“while read line”循环中执行
This question already has answers here: Closed 2 years ago. Read input in bash inside a while loop （
c# - 在调用 Read() 之前尝试访问字段无效，但我先调用了 Read()
我正在尝试处理来自 MySQL 数据库的一些数据(主要是 double 值)。我收到此错误消息: Invalid attempt to access a field before calling Re
java - DataInputStream.read() 与 DataInputStream.readFully()
我正在制作一个简单的 TCP/IP 套接字应用这样做有什么不同: DataInputStream in = new DataInputStream(clientSocket.getInputStre
java - HttpMessageNotReadableException : Could not read JSON: Read timed out
我操作API服务器。手机APP访问API服务器时，有时会出现该异常。我尝试在测试服务器上进行测试，但无法重现。(我改变了apache和tomcat的连接时间。) 有什么问题？？我该如何解决这个问
html - "Click here to read this article""Read More"为什么这些对屏幕阅读器不利？
我在段落末尾使用“阅读更多”只是为了提醒像P.T.O一样的用户为什么会有问题？最佳答案您必须明白，许多屏幕阅读器用户不会等到整个页面都读给他们听。他们使用键盘快捷键在页面中导航。 JAWS(可以
angular - 类型错误 : Cannot read properties of undefined (reading 'match' )
我已将我的 Angular 应用程序从 12 版本升级到 13 版本。我在单元测试运行期间开始遇到此错误。 Chrome Headless 94.0.4606.61 (Windows 10) AppC
angular - 类型错误 : Cannot read properties of undefined (reading 'pipe' )
我正在尝试为以下组件编写一个。我正在使用 queryParams 然后使用 switchmap 来调用服务。这是 url 的样子: http://localhost:4200/test-fee/det
javascript - 未捕获的类型错误 : Cannot read properties of undefined (reading 'remove' )
我的代码有什么问题？ Uncaught TypeError: Cannot read properties of undefined (reading 'remove') 和 Uncaught Typ
javascript - 类型错误 : Cannot read properties of undefined (reading 'requestContent' )
我在我的 React 应用程序中遇到了这个问题。类型错误:无法读取未定义的属性(读取“requestContent”) 我在我的应用程序中使用 commercejs。代码指向 isEmpty=!ca

首页

博学

6Ren·AI

商城

MongoDb 与 Cassandra :Read/Write myths?