使用mgo在MongoDB中进行有效的分页

^{I've searched and found no Go solution to the problem, not with or without using mgo.v2, not on StackOverflow and not on any other site. This Q&A is in the spirit of knowledge sharing / documenting.}

Let's say we have a users collection in MongoDB modeled with this Go struct:

type User struct {
    ID      bson.ObjectId `bson:"_id"`
    Name    string        `bson:"name"`
    Country string        `bson:"country"`
}

We want to sort and list users based on some criteria, but have paging implemented due to the expected long result list.

To achieve paging of the results of some query, MongoDB and the mgo.v2 driver package has built-in support in the form of Query.Skip() and Query.Limit(), e.g.:

session, err := mgo.Dial(url) // Acquire Mongo session, handle error!

c := session.DB("").C("users")
q := c.Find(bson.M{"country" : "USA"}).Sort("name", "_id").Limit(10)

// To get the nth page:
q = q.Skip((n-1)*10)

var users []*User
err = q.All(&users)

This however becomes slow if the page number increases, as MongoDB can't just "magically" jump to the x^th document in the result, it has to iterate over all the result documents and omit (not return) the first x that need to be skipped.

MongoDB provides the right solution: If the query operates on an index (it has to work on an index), cursor.min() can be used to specify the first index entry to start listing results from.

This Stack Overflow answer shows how it can be done using a mongo client: How to do pagination using range queries in MongoDB?

Note: the required index for the above query would be:

db.users.createIndex(
    {
        country: 1,
        name: 1,
        _id: 1
    }
)

There is one problem though: the mgo.v2 package has no support specifying this min().

How can we achieve efficient paging that uses MongoDB's cursor.min() feature using the mgo.v2 driver?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duanji1482 2016-11-16 14:38
关注
Unfortunately the mgo.v2 driver does not provide API calls to specify cursor.min().

But there is a solution. The mgo.Database type provides a Database.Run() method to run any MongoDB commands. The available commands and their documentation can be found here: Database commands

Starting with MongoDB 3.2, a new find command is available which can be used to execute queries, and it supports specifying the min argument that denotes the first index entry to start listing results from.

Good. What we need to do is after each batch (documents of a page) generate the min document from the last document of the query result, which must contain the values of the index entry that was used to execute the query, and then the next batch (the documents of the next page) can be acquired by setting this min index entry prior to executing the query.

This index entry –let's call it cursor from now on– may be encoded to a string and sent to the client along with the results, and when the client wants the next page, he sends back the cursor saying he wants results starting after this cursor.

Doing it manually (the "hard" way)

The command to be executed can be in different forms, but the command name (find) must be first in the marshaled result, so we'll use bson.D (which preserves order in contrast to bson.M):

limit := 10 cmd := bson.D{ {Name: "find", Value: "users"}, {Name: "filter", Value: bson.M{"country": "USA"}}, {Name: "sort", Value: []bson.D{ {Name: "name", Value: 1}, {Name: "_id", Value: 1}, }, {Name: "limit", Value: limit}, {Name: "batchSize", Value: limit}, {Name: "singleBatch", Value: true}, } if min != nil { // min is inclusive, must skip first (which is the previous last) cmd = append(cmd, bson.DocElem{Name: "skip", Value: 1}, bson.DocElem{Name: "min", Value: min}, ) }

The result of executing a MongoDB find command with Database.Run() can be captured with the following type:

var res struct { OK int `bson:"ok"` WaitedMS int `bson:"waitedMS"` Cursor struct { ID interface{} `bson:"id"` NS string `bson:"ns"` FirstBatch []bson.Raw `bson:"firstBatch"` } `bson:"cursor"` } db := session.DB("") if err := db.Run(cmd, &res); err != nil { // Handle error (abort) }

We now have the results, but in a slice of type []bson.Raw. But we want it in a slice of type []*User. This is where Collection.NewIter() comes handy. It can transform (unmarshal) a value of type []bson.Raw into any type we usually pass to Query.All() or Iter.All(). Good. Let's see it:

firstBatch := res.Cursor.FirstBatch var users []*User err = db.C("users").NewIter(nil, firstBatch, 0, nil).All(&users)

We now have the users of the next page. Only one thing left: generating the cursor to be used to get the subsequent page should we ever need it:

if len(users) > 0 { lastUser := users[len(users)-1] cursorData := []bson.D{ {Name: "country", Value: lastUser.Country}, {Name: "name", Value: lastUser.Name}, {Name: "_id", Value: lastUser.ID}, } } else { // No more users found, use the last cursor }

This is all good, but how do we convert a cursorData to string and vice versa? We may use bson.Marshal() and bson.Unmarshal() combined with base64 encoding; the use of base64.RawURLEncoding will give us a web-safe cursor string, one that can be added to URL queries without escaping.

Here's an example implementation:

// CreateCursor returns a web-safe cursor string from the specified fields. // The returned cursor string is safe to include in URL queries without escaping. func CreateCursor(cursorData bson.D) (string, error) { // bson.Marshal() never returns error, so I skip a check and early return // (but I do return the error if it would ever happen) data, err := bson.Marshal(cursorData) return base64.RawURLEncoding.EncodeToString(data), err } // ParseCursor parses the cursor string and returns the cursor data. func ParseCursor(c string) (cursorData bson.D, err error) { var data []byte if data, err = base64.RawURLEncoding.DecodeString(c); err != nil { return } err = bson.Unmarshal(data, &cursorData) return }

And we finally have our efficient, but not so short MongoDB mgo paging functionality. Read on...

Using github.com/icza/minquery (the "easy" way)

The manual way is quite lengthy; it can be made general and automated. This is where github.com/icza/minquery comes into the picture (disclosure: I'm the author). It provides a wrapper to configure and execute a MongoDB find command, allowing you to specify a cursor, and after executing the query, it gives you back the new cursor to be used to query the next batch of results. The wrapper is the MinQuery type which is very similar to mgo.Query but it supports specifying MongoDB's min via the MinQuery.Cursor() method.

The above solution using minquery looks like this:

q := minquery.New(session.DB(""), "users", bson.M{"country" : "USA"}). Sort("name", "_id").Limit(10) // If this is not the first page, set cursor: // getLastCursor() represents your logic how you acquire the last cursor. if cursor := getLastCursor(); cursor != "" { q = q.Cursor(cursor) } var users []*User newCursor, err := q.All(&users, "country", "name", "_id")

And that's all. newCursor is the cursor to be used to fetch the next batch.

Note #1: When calling MinQuery.All(), you have to provide the names of the cursor fields, this will be used to build the cursor data (and ultimately the cursor string) from.

Note #2: If you're retrieving partial results (by using MinQuery.Select()), you have to include all the fields that are part of the cursor (the index entry) even if you don't intend to use them directly, else MinQuery.All() will not have all the values of the cursor fields, and so it will not be able to create the proper cursor value.

Check out the package doc of minquery here: https://godoc.org/github.com/icza/minquery, it is rather short and hopefully clean.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

使用mgo在MongoDB中进行有效的分页 mongodb
2016-11-16 14:38

回答 1 已采纳 Unfortunately the mgo.v2 driver does not provide API calls to specify cursor.min(). But there is
使用mgo在MongoDB中插入数据 mongodb
2017-01-23 01:17

回答 1 已采纳 First, you seem to want the results sorted by Endpoint. If you don't specify any sort order when q
使用GoLang mgo将PDF保存在MongoDb中 jquery mongodb
2017-10-17 07:36

回答 1 已采纳 The file returned by Request.FormFile() is of type multipart.File which is: type File interface {
basicmgo:演示如何使用 mgo 进行基本的 mongodb 调用
2021-06-09 22:39

此源代码的使用受 GNU 许可证的约束，该许可证可在 LICENSE 句柄中找到。此应用程序提供了如何连接到 MongoDB 数据库并对其执行命令的示例。阿丹工作室12973 SW 112 ST，套房 153 佛罗里达州迈阿密 33186 安装...
使用mgo Golang从MongoDB子文档数组中解组 mongodb
2018-12-07 21:36

回答 1 已采纳 It's because you don't capture a single player, you capture players. like in the response from the
如何在Go和mgo中使用mongodb投影？ mongodb
2015-11-17 07:36

回答 1 已采纳 Would go with the Select method as the doc states that this enables selecting which fields should
无法使用mgo从MongoDB通过ObjectId获取数据 mongodb
2017-01-21 11:26

回答 1 已采纳 You can try this c.FindId(bson.M{"_id": bson.ObjectIdHex("56bdd27ecfa93bfe3d35047d")}) may be i
minquery：支持高效分页的MongoDB mgo查询（光标继续列出我们停下来的文档）
2021-02-04 06:00

最小查询支持高效分页的MongoDB / mgo查询（游标可在我们停下的地方继续列出文档）。注意：仅MongoDB 3.2和更高版本支持此软件包使用的功能...介绍假设我们在MongoDB中有一个使用以下Go struct建模的users集合： typ
使用Golang和mgo从Collection MongoDB中获取元素 mongodb
2016-02-27 15:29

回答 1 已采纳 First of all I agree with commenters above - you should add timestamp to your Connect structure. B
如何使用mgo在golang中编写mongodb搜索
2016-05-19 12:06

回答 1 已采纳 From $search I'm assuming you're trying to use a text index/search, but in your case that wouldn't
使用Go和mgo解析MongoDB结果 mongodb
2016-05-24 04:41

回答 1 已采纳 Remove the space in the middle of tags. Use bson:"firstName" instead of bson: "firstName" type P
Golang之使用mgo连接MongoDB
2019-10-12 18:10

RunFromHere的博客 session, err := mgo.Dial("localhost:27017") if err != nil { log.Println("err: ", err) return } defer session.Close() // Optional. Switch the session to a monotonic behavior. session.S...
使用mgo从Golang中的Mongodb中选择列 mongodb
2015-06-29 13:05

回答 3 已采纳 Use the query Select method to specify the fields to return: var result []struct{ Text string `bs
MongoDB 学习笔记3 - 使用 mgo 连接MongoDB
2020-04-07 12:35

张云飞VIR的博客 mgo：(发音为mango)是一个用于Go语言的MongoDB驱动程序，它在一个非常简单的API下实现了丰富和经过良好测试的特性选择，遵循了标准的Go习惯用法。突出特点：集群发现和通信：mgo提供自动化的集群拓扑发现和维护...
go-transaction-example:包含的示例可指导如何使用Golang在Mongodb上进行交易
2021-05-16 22:52

去交易的例子包含的示例可指导如何使用Golang进行交易。安装依赖库 $ go get github.com/globalsign/mgo/bson$ go get github.com/globalsign/mgo设想它演示了一个简单的服务器，可以为银行的付费用户提供服务。 ...
mgo：Go的MongoDB驱动程序
2021-02-02 17:14

支持的版本众所周知， mgo在MongoDB v3.0、3.2、3.4和3.6上可以很好地工作（并且已经针对它进行了集成测试）。 MongoDB 4.0目前处于试验阶段-我们很乐意接受PR，以帮助改善支持！变化修复了尝试在每个查询之前进行...
golang mongodb mysql_golang使用mgo连接MongoDB
2021-02-27 13:30

白小烨的博客现在MongoDB官方还没有推出关于官方支持的golang的driver,推荐使用的是mgo. mgo的详细文档说明：http://godoc.org/labix.org/v2/mgo 下面是我开发中自己写的一个用mgo连接MongoDB数据库的使github:https:...
mgo 实现mongodb中的sortByCount方法
2018-06-06 18:32

谁是人生一场梦的博客 mgo使用pipe来实现mongodb中的aggregation,所以在实现mongodb中的一些方法的时候就需要我们自己去拼接，现在使用go给大家分享一下我实现sortByCount的方法，大家可以自己自己实现一些其它的方法，如果有需要，我会...
Go实战--golang中使用MongoDB(mgo)
2017-07-14 13:29

一苇渡江694的博客昨天分享了golang如何操作redis数据库，那今天就介绍一下golang中如何使用mongodb数据库。何为MongoDB？简介 MongoDB 是由C++语言编写的，是一个基于分布式文件存储的开源数据库系统。在高负载的情况下，...
qmgo：Qmgo-MongoDB的Go驱动程序。它基于官方的mongo-go-driver，但像Mgo一样易于使用
2021-02-03 14:31

Qmgo是迁移的首选mgo新MongoDB driver用最少的代码改变。要求 -达到Go 1.10以上。 MongoDB 2.6及更高版本。产品特点对文档进行CRUD，并带有所有官方支持的选项排序，限制，计数，选择，区别交易次数钩子...
没有解决我的问题, 去提问

悬赏问题

¥15 R语言Rstudio突然无法启动
¥15 关于#matlab#的问题：提取2个图像的变量作为另外一个图像像元的移动量，计算新的位置创建新的图像并提取第二个图像的变量到新的图像
¥15 改算法，照着压缩包里边，参考其他代码封装的格式写到main函数里
¥15 用windows做服务的同志有吗
¥60 求一个简单的网页(标签-安全|关键词-上传)
¥35 lstm时间序列共享单车预测，loss值优化，参数优化算法
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？
¥15 有偿求跨组件数据流路径图
¥15 写一个方法checkPerson，入参实体类Person，出参布尔值

使用mgo在MongoDB中进行有效的分页

1条回答 默认 最新

Doing it manually (the "hard" way)

Using github.com/icza/minquery (the "easy" way)

悬赏问题

1条回答默认最新

Using `github.com/icza/minquery` (the "easy" way)