使用Google Go的Goroutines创建贝叶斯网络

I have a large dataset of philosophic arguments, each of which connect to other arguments as proof or disproof of a given statement. A root statement can have many proofs and disproofs, each of which may also have proofs and disproofs. Statements can also be used in multiple graphs, and graphs can be analyzed under a "given context" or assumption.

I need to construct a bayesian network of related arguments, so that each node propagates influence fairly and accurately to it's connected arguments; I need to be able to calculate the probability of chains of connected nodes concurrently, with each node requiring datastore lookups that must block to get results; the process is mostly I/O bound, and my datastore connection can run asynchronously in java, go and python {google appengine}. Once each lookup completes, it propagates the effects to all other connected nodes until the probability delta drops below a threshold of irrelevance {currently 0.1%}. Each node of the process must calculate chains of connections, then sum up all the results across all queries to adjust validity results, with results chained outward to any connected arguments.

In order to avoid recurring infinitely, I was thinking of using an A*-like process in goroutines to propagate updates to the argument maps, with a heuristic based on compounding influence which ignores nodes once probability of influence dips below, say 0.1% . I'd tried to set up the calculations with SQL triggers, but it got complex and messy way too fast. Then I moved to google appengine to take advantage of asynchronous nosql, and it was better, but still too slow. I need to be run the updates fast enough to get a snappy UI, so when a user creates or votes for or against a proof or disproof, they can see the results reflected in UI immediately.

I think Go is the language of choice to support the concurrency I need, but I'm open to suggestions. The client is a monolithic javascript app that just uses XHR and websockets to push and pull argument maps {and their updates} in real time. I have a java prototype that can compute large chains in 10~15s, but monitoring of performance shows that most of my runtime is wasted in synchronization and overhead from ConcurrentHashMap.

If there are other highly-concurrent languages worth trying out, please let me know. I know java, python, go, ruby and scala, but will learn any language if it suits my needs.

Similarly, if there are open source implementations of huge Bayesian networks, please leave a suggestion.

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douxiuyu2028 2012-05-10 05:21
关注
I think it's a bit difficult to tell what you are asking about. Maybe you can elaborate on your question.

Goroutines are quite cheap, and are a perfect match for modern web applications which use XHR or Websockets heavily (and other I/O bound applications which have to wait for database responses and stuff like that). Additionally, the go runtime is also able to execute those goroutines in parallel, so that Go is also a good fit for CPU bound tasks, which should take advantage of multiple cores and the speed of a natively compiled language.

But you should also keep in mind, that goroutines and channels aren't for free. They still require some amount of memory and each synchronization point (e.g. a channel send or receive) comes with its cost. That's normally not a problem, since the synchronization is, in comparison to a database query for example, extremely cheap, but it might not be suited for building efficient Bayesian networks, especially if the actual work of each goroutine / node is negligible in comparison to the synchronization overhead.

Your primary goal for every concurrent program should be to avoid shared mutability as far as possible. So a Bayesian network modeled with goroutines and channels might be a good educational example and a great way to measure the performance of Go's channel implementation, but it's probably not the best fit for your problem.

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报
编辑

预览
轻敲空格完成输入
显示为

卡片

标题

链接
评论

按下Enter换行，Ctrl+Enter发表内容

编辑

预览

报告相同问题？

关注问题

go语言开发的风控决策引擎系统源码.zip
2022-06-09 14:42

1. **Go语言基础**：Go语言，也称为Golang，是由Google开发的一种静态类型的、编译型的、并发型且具有垃圾回收功能的编程语言。它的设计目标是提高开发效率和运行性能，特别适合构建网络服务和大规模并发系统。 2. ...
probab:自动从code.google.compprobab导出
2021-05-12 02:05

10. **并发编程**：Go语言的一个强项是其内置的并发模型，如果`probab`库考虑了并行计算，那么会涉及goroutines和channels的使用。总的来说，`probab`库是一个强大的工具，适用于那些需要在Go语言中进行概率计算和...
Go学习路线
2022-05-02 06:37

kgduu的博客图形语言 GraphJin - 用于 Postgres 的即时 GraphQL API。无需代码，将 GraphQL 编译为 SQL。 MTProto MTProto - 在纯 Go 上编写的 Telegram API 的完整本实现。天文学 go-fits - FITS（灵活图像传输系统）...
【Go入门】编程语言比较：Golang VS Python
2023-10-15 09:31

王多头发的博客 Asteroid 是该公司的 Wireguard 服务器...Go 或 Golang 由 Google 于 2007 年设计，与 C 相似，内存安全，具有垃圾收集功能，并且是结构类型的。事实上，它几乎可以执行所有可以想象到的任务，这是它最好的功能之一。
2018最新精选的Go框架，库和软件的精选列表二
2019-01-04 03:02

秋天的春的博客 2018最新精选的Go框架，库和软件的精选列表二地理地理工具和服务器 geocache - 适用于基于地理定位的应用程序的内存缓存。 pbf - OpenStreetMap PBF golang编码器/解码器。 S2几何 - Go中的S2几何库。 ...
【吐血整理】超全golang面试题合集+golang学习指南+golang知识图谱+成长路线一份涵盖大部分golang程序员所需要掌握的核心知识。
2021-01-11 04:37

小白debug的博客目录(善用Ctrl+F) 基础入门新手 Golang开发新手常犯的50个错误数据类型连nil切片和空切片一不一样都不清楚？...map不初始化使用会怎么样 map不初始化长度和初始化长度的区别 map承载多大，..
golang知识图谱
2021-09-06 09:01

csy2005csy的博客实现格式化的输入输出操作，其中的fmt.Printf()和fmt.Println()是开发者使用最为频繁的函数。 io 实现了一系列非平台相关的IO相关接口和实现，比如提供了对os中系统相关的IO功能的封装。我们在进行流式读写...
mglda:多颗粒LDA
2021-05-07 15:57

“Go”标签明确了实现mglda所使用的编程语言，Go语言由Google开发，被设计用于处理大量并发任务和大数据处理。在处理像mglda这样的大型主题建模任务时，Go的并发机制和内存管理能力可以显著提高程序性能，减少延迟，...
2018最新精选的Go框架，库和软件的精选列表二 https://awesome-go.com/
2019-01-25 00:56

sanshengshi134的博客 pbf - OpenStreetMap PBF golang编码器/解码器。 S2几何 - Go中的S2几何库。 Tile38 - 具有空间索引和实时地理围栏的地理位置数据库。去编译器编译工具转到其他语言。 gopherjs - 转到JavaScript的编译器。 llgo -...
Go 相关的框架，库和软件的精选清单
2020-07-03 01:37

baobaodqh的博客这是一个Go 相关的框架，库和软件的精选清单，引用自 awesome-go项目，并翻译补充而来这是一个Go 相关的框架，库和软件的精选清单，引用自 awesome-go项目，并翻译补充而来音频和音乐用于处理音频的库。 ...
没有解决我的问题, 去提问

使用Google Go的Goroutines创建贝叶斯网络

1条回答 默认 最新

1条回答默认最新