特定于体系结构的Golang函数的文档

I have a function that I would like to provide an assembly implementation for on amd64 architecture. For the sake of discussion let's just suppose it's an Add function, but it's actually more complicated than this. I have the assembly version working but my question concerns getting the godoc to display correctly. I have a feeling this is currenty impossible, but I wanted to seek advice.

Some more details:

The assembly implementation of this function contains only a few instructions. In particular, the mere cost of calling the function is a significant part of the entire cost.
It makes use of special instructions (BMI2) therefore can only be used following a CPUID capability check.

The implementation is structured like this gist. At a high level:

In the generic (non-amd64 case) the function is defined by delegating to addGeneric.
In the amd64 case the function is actually a variable, initially set to addGeneric but replaced by addAsm in the init function if a cpuid check passes.

This approach works. However the godoc output is crappy because in the amd64 case the function is actually a variable. Note godoc appears to be picking up the same build tags as the machine it's running on. I'm not sure what godoc.org would do.

Alternatives considered:

The Add function delegates to addImpl. Then we pull some similar trick to replace addImpl in the amd64 case. The problem with this is (in my experiments) Go doesn't seem to be able to inline the call, and the assembly is now wrapped in two function calls. Since the assembly is so small already this has a noticable impact on performance.
In the amd64 case we define a plain function Add that has the useAsm check inside it, and calls one of addGeneric and addAsm depending on the result. This would have an even worse impact on performance.

So I guess the questions are:

Is there a better way to structure the code to achieve the performance I want, and have it appear properly in documentation.
If there is no alternative, is there some other way to "trick" godoc?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duanpao9781 2018-05-20 13:17
关注
See math.Sqrt for an example of how to do this.

Write a stub function with the documentation

Write a generic implementation as an unexported function.

For each architecture, write a function in assembler that jumps to the unexported generic implementation or implements the function directly.

To handle the cpuid check, set a package variable in init() and conditionally jump based on that variable in the assembly implementation.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Golang函数参数中的函数数组
2019-07-05 09:00

回答 1 已采纳 The uploaderOptions argument of NewS3FileWriter need to be filled with slice of functions during c
结构成员的Golang函数指针
2016-04-01 02:20

回答 3 已采纳 You can even do it directly, without an interface: package main import "fmt" type A struct {
在C程序中使用golang函数
2019-07-12 08:14

回答 1 已采纳 There are a couple of issues here. First is the incompatibility of the types. Go will return a GoI
Golang面试题
2024-04-05 10:31

御风行云天的博客 Golang高频面试题
在函数调用中定义golang结构的成本
2019-01-17 13:48

回答 1 已采纳 Types are a compile-time concept and their scope won't (generally) affect runtime speed, since the
golang const 可以修饰函数形参吗 golang
2019-05-28 11:49

回答 1 已采纳不可以，const是定义和修饰常量的。用const修饰的量是不可以变的。函数里的形参你用const修饰，那调用函数时难道只能传递同一个对象常量或者值类型常量进被调用函数体吗？那这个函数参数还有什么
将json文件传递给GoLang函数
2018-04-18 21:27

回答 1 已采纳 If the structure of your JSON is not well defined and can change, that's the way to go: import (
Golang 基础与进阶知识点
2024-05-18 14:05

Lisongxi的博客 Go 语言的 GPM 调度模型是 Go ...实战参考G: 表示 Goroutine，每个 Goroutine 对应一个 G 结构体，G 存储 Goroutine 的运行堆栈、状态以及任务函数，可重用。G 并非执行体，每个 G 需要绑定到 P 才能被调度执行。P。
Golang中没有返回值函数
2018-02-23 13:46

回答 3 已采纳 The function here you have used func log(message string){ fmt.Println(message) } Actually r
Golang中的结构上的递归函数
2016-07-07 15:00

回答 1 已采纳 You just need to step higher and higher in the hierarchy until you reach the root. Assuming Widget
Golang函数参数无类型？
2016-06-13 02:45

回答 1 已采纳 Per https://tour.golang.org/basics/5: When two or more consecutive named function parameters s
golang面试题
2023-05-31 16:21

Damon-Rui的博客 golang 缺点 ①右大括号不允许换行，否则编译报错 ②不允许有未使用的包或变量 ③错误处理原始，虽然引入了defer、panic、recover处理出错后的逻辑，函数可以返回多个值，但基本依靠返回错误是否为空来判断函数是否...
是否所有Golang函数都将err作为第二个返回值返回
2019-04-02 09:12

回答 2 已采纳 Answering your question: Fortunately, Go prevents certain types of programmers errors. It just won
golang大厂面试2
2023-07-04 14:42

theo.wu的博客实现一个函数，有两个参数分别是升序的整数数组a和b，返回合并后的升序整数数组。理解不理解这些树的构造，是要解决什么问题？处理日志的时候如果发现突然量变大，该如何扩容让以前堆积的日志可以消耗掉？命令的时间...
golang后端面试宝典
2024-05-06 12:11

EssRt的博客缓存穿透缓存穿透是指...使用布隆过滤器（Bloom Filter）：布隆过滤器是一种空间效率很高的数据结构，可以判断一个元素是否在一个集合中。可以在访问缓存和数据库之前，先通过布隆过滤器判断数据是否存在。限制频率。
没有解决我的问题, 去提问

悬赏问题

¥15 做个有关计算的小程序
¥15 MPI读取tif文件无法正常给各进程分配路径
¥15 如何用MATLAB实现以下三个公式（有相互嵌套）
¥30 关于#算法#的问题：运用EViews第九版本进行一系列计量经济学的时间数列数据回归分析预测问题求各位帮我解答一下
¥15 setInterval 页面闪烁，怎么解决
¥15 如何让企业微信机器人实现消息汇总整合
¥50 关于#ui#的问题：做yolov8的ui界面出现的问题
¥15 如何用Python爬取各高校教师公开的教育和工作经历
¥15 TLE9879QXA40 电机驱动
¥20 对于工程问题的非线性数学模型进行线性化

特定于体系结构的Golang函数的文档

1条回答 默认 最新

悬赏问题

1条回答默认最新