无法在mongodb中获得准确的聚合计数

So I have a large collection and I want to count the number of documents that all have the same company_id with other filters(either expired_at is greater than new Date() or expired_at doesn't exist in the document). Basically to count the number of active jobs belong to a company.

Here is what I have so far but the count result is way bigger than it should be. Can anybody tell me what is wrong with the query?

Maybe there are duplicates? If thats the case, how to group duplicates as one or avoid them when counting? Thanks.

db.jobs.aggregate([
{
    $match: {
        $and: [
            {company_id: ObjectId('524a09a44c9ff23382000037')},
            { $or: [ { expired_at: { $gt: new Date() } }, { expired_at: { $exists: false } } ]}
        ] 
    }
},
{
    $group: {
        _id: 1,   
        count: {$sum: 1}
    }
}
])

Actual code:

query := bson.M{
  "$and": []bson.M{
    bson.M{"company_id": id},
    bson.M{
      "$or": []bson.M{
        bson.M{"expired_at": bson.M{"$exists": false}},
        bson.M{"expired_at": bson.M{"$gt": bson.Now()}},
      },
    },
  },
}

counter := []bson.M{
  bson.M{"$match": query},
  bson.M{"$group": bson.M{"_id": 1, "count": bson.M{"$sum": 1}}},
}

var result struct {
  Count int `bson:"count"`
}
err := s.DB(session).C("jobs").Pipe(counter).One(&result)

This is what the schema looks like:

{
    "_id" : ObjectId("52683eceda9f660e1e000011"),

    "activated_at" : ISODate("2014-05-30T09:18:40.961Z"),

    "url" : "http://blahblah/jobid3939799-public-sector-oracle-federal-financials-senior-associate-jobs",

    "job_category_ids" : [ 
        "Accounting/Auditing", 
        "Finance", 
        "Information Technology", 
        "Consulting"
    ],
    "location" : {
        "full_address" : "McLean, VA",
        "pts" : [ 
            -77.1772604, 
            38.9338676
        ]
    },
    "created_at" : ISODate("2013-10-23T21:25:34.262Z"),
    "ref_id" : "42927BR-0",

    "company_id" : ObjectId("524a09a44c9ff23382000037"),

    "updated_at" : ISODate("2014-05-30T09:18:41.085Z"),

    "expired_at" : ISODate("2014-05-31T09:21:30.357Z")
}

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

如何从MongoDB集合中获取聚合 mongodb
2017-11-19 16:56

回答 1 已采纳 Using Distinct() What you want is easiest done using collection.distinct(). In MongoDB console it
如何在golang中进行Mongodb聚合 mongodb
2017-11-15 05:27

回答 2 已采纳 You're not checking the errors, that's your main problem. Pipe.All() returns an error which you gr
mongodb聚合在golang中给出错误 mongodb
2018-02-06 04:42

回答 1 已采纳 Hi this is a sample code I modified to use pipelines with MgO golang. Watch It and try to apply it
二、mongodb数据库系列——聚合操作 & 索引操作 & 权限管理
2020-07-10 17:01

小小白学计算机的博客一、mongodb的聚合操作学习目标了解 mongodb的聚合原理掌握 mongdb的管道命令掌握 mongdb的表达式 1 mongodb的聚合是什么聚合(aggregate)是基于数据处理的聚合管道，每个文档通过一个由多个阶段（stage）组成...
mongoDB,聚合中如何写判断if else mongodb 数据库
2023-01-19 22:00

回答 1 已采纳下面这篇文章,希望能给你带来帮助https://blog.csdn.net/weixin_39580031/article/details/122727009
在MongoDB中计数 mapreduce mongodb nosql php
2012-11-16 11:12

回答 3 已采纳 If you are looking to use map reduce for such a query then you are already doing it wrong. MR wou
大数据上的MongoDB聚合超时异常 mongodb php
2016-03-14 07:23

回答 1 已采纳 As I am using Doctrine MongoDB ODM module in my application I fixed my issue in the following way.
mongodb聚合查询优化_【MongoDB】MongoDB 性能优化 - BI查询聚合
2020-12-19 17:24

weixin_39876739的博客在BI服务中通过查询聚合语句分析定位慢查询/聚合分析，小结如下：慢查询定位:通过Profile分析慢查询对于查询优化：通过添加相应索引提升查询速度；对于聚合大数据方案:首先要说明的一个问题是，对于OLAP型的操作，...
无法在Golang中从MongoDB结果解码ObjectId子值 mongodb
2019-03-31 21:02

回答 2 已采纳 Thanks to this excellent tutorial and this anwser I was able to find the answer. I needed to se
PHP中的MongoDB聚合 mongodb php
2018-09-26 18:27

回答 1 已采纳 Thanks to Veeram, the solution was to use strtotime(.) instead of MongoDB\BSON\UTCDateTime(strtoti
mongodb中出现连接错误 linux mongodb 大数据有问必答
2022-01-07 09:13

回答 3 已采纳服务关了。然后你再怎么输命令肯定全报错啊。它不是在报：尝试重连失败？
mongodb高级聚合查询
2019-05-18 22:08

天意的博客在工作中会经常遇到一些mongodb的聚合操作，特此总结下。mongo存储的可以是复杂类型，比如数组、对象等mysql不善于处理的文档型结构，并且聚合的操作也比mysql复杂很多。注：本文基于 mongodb v3.6 目录 mongo与...
MongoDB查找并迭代与计数 mongodb
2018-11-29 11:56

回答 1 已采纳 How could the second version be an optimization over the first? Your first query retrieves a sing
MongoDB内容分享（七）：MongoDB 性能：查询聚合优化
2023-12-09 07:50

之乎者也·的博客索引的添加只是解决了针对索引字段查询的效率，但是并不能解决查询之后数据的聚合问题。毕竟是对于大量数据的操作，光从IO就已经远超通常的OLTP操作，所以要求达到OLTP操作的速度和并发是不现实的，也是没有意义的。...
mongodb的聚合操作
2021-07-01 11:17

卡布丶的博客 mongodb的聚合操作学习目标了解 mongodb的聚合原理掌握 mongdb的管道命令掌握 mongdb的表达式 1 mongodb的聚合是什么聚合(aggregate)是基于数据处理的聚合管道，每个文档通过一个由多个阶段（stage）组成的...
没有解决我的问题, 去提问

悬赏问题

¥15 winform的chart曲线生成时有凸起
¥15 msix packaging tool打包问题
¥15 finalshell节点的搭建代码和那个端口代码教程
¥15 用hfss做微带贴片阵列天线的时候分析设置有问题
¥15 Centos / PETSc / PETGEM
¥15 centos7.9 IPv6端口telnet和端口监控问题
¥120 计算机网络的新校区组网设计
¥20 完全没有学习过GAN，看了CSDN的一篇文章，里面有代码但是完全不知道如何操作
¥15 使用ue5插件narrative时如何切换关卡也保存叙事任务记录
¥20 海浪数据南海地区海况数据，波浪数据

码龄粉丝数原力等级 --

无法在mongodb中获得准确的聚合计数

0条回答默认最新

悬赏问题

无法在mongodb中获得准确的聚合计数

0条回答 默认 最新

悬赏问题

0条回答默认最新