Translating a Go assembly program to NASM

I came across the following Go code:

type Element [12]uint64

//go:noescape
func CSwap(x, y *Element, choice uint8)

//go:noescape
func Add(z, x, y *Element)

where the CSwap and Add functions are implemented in assembly and look like this:

TEXT ·CSwap(SB), NOSPLIT, $0-17

    MOVQ    x+0(FP), REG_P1
    MOVQ    y+8(FP), REG_P2
    MOVB    choice+16(FP), AL   // AL = 0 or 1
    MOVBLZX AL, AX              // AX = 0 or 1
    NEGQ    AX                  // RAX = 0x00..00 or 0xff..ff

    MOVQ    (0*8)(REG_P1), BX
    MOVQ    (0*8)(REG_P2), CX
    // Rest removed for brevity

TEXT ·Add(SB), NOSPLIT, $0-24

    MOVQ    z+0(FP), REG_P3
    MOVQ    x+8(FP), REG_P1
    MOVQ    y+16(FP), REG_P2

    MOVQ    (REG_P1), R8
    MOVQ    (8)(REG_P1), R9
    MOVQ    (16)(REG_P1), R10
    MOVQ    (24)(REG_P1), R11
    // Rest removed for brevity

I am trying to translate this assembly into a syntax that is more familiar to me (I think mine is closer to NASM); the code above is in Go assembler syntax. The Add function gave me little trouble, and my translation is correct according to my test results. It looks like this:

.text
.global add_asm
add_asm:
  push   r12
  push   r13
  push   r14
  push   r15

  mov    r8, [reg_p1]
  mov    r9, [reg_p1+8]
  mov    r10, [reg_p1+16]
  mov    r11, [reg_p1+24]
  // Rest removed for brevity

However, I have a problem translating the CSwap function. I have something like this:

.text
.global cswap_asm
cswap_asm:
  push   r12
  push   r13
  push   r14

  mov    al, 16
  mov    rax, al
  neg    rax

  mov    rbx, [reg_p1+(0*8)]
  mov    rcx, [reg_p2+(0*8)]

But this is not correct, as I get an error when compiling it. Any ideas how to translate the CSwap assembly above into something like NASM?

EDIT (SOLUTION):

Okay, after the two answers below, and some testing and digging, I found out that the code uses the following three registers for parameter passing:

#define reg_p1  rdi
#define reg_p2  rsi
#define reg_p3  rdx

Accordingly, rdx has the value of the choice parameter. So, all that I had to do was use this:

movzx  rax, dl // Get the lower 8 bits of rdx (reg_p3)
neg    rax

Using byte [rdx] or byte [reg_p3] was giving an error, but using dl seems to work fine for me.
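For reference, here is a minimal Go sketch of what the MOVBLZX / NEGQ pair (movzx / neg in the translation) computes. The names maskFromChoice and cswap are made up for illustration, and the swap loop is only an assumption about what the elided "rest removed for brevity" part does with the mask (a typical masked-XOR conditional swap); the original assembly may differ.

```go
package main

import "fmt"

// Element mirrors the type from the question.
type Element [12]uint64

// maskFromChoice models MOVBLZX AL, AX followed by NEGQ AX:
// zero-extend the byte, then negate, giving all zeros or all ones.
func maskFromChoice(choice uint8) uint64 {
	return -uint64(choice) // unsigned negation wraps around
}

// cswap is an ASSUMED reconstruction of the elided part of CSwap:
// it swaps x and y when choice is 1 and leaves them alone when it is 0,
// without branching on choice.
func cswap(x, y *Element, choice uint8) {
	mask := maskFromChoice(choice)
	for i := range x {
		t := mask & (x[i] ^ y[i])
		x[i] ^= t
		y[i] ^= t
	}
}

func main() {
	a, b := Element{1, 2}, Element{3, 4}
	cswap(&a, &b, 1)
	fmt.Printf("mask(0)=%#x mask(1)=%#x\n", maskFromChoice(0), maskFromChoice(1))
	fmt.Println(a[0], a[1], b[0], b[1])
}
```

This only works as intended if choice really is 0 or 1, which is what the comments in the Go assembly state.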

douniewei6346 2017-12-21 22:39
Basic docs about Go's asm: https://golang.org/doc/asm. It's not totally equivalent to NASM or AT&T syntax: FP is a pseudo-register name for whichever register it decides to use as the frame pointer (typically RSP or RBP). Go asm also seems to omit function prologue (and probably epilogue) instructions. As @RossRidge comments, it's more like an internal representation such as LLVM IR than true asm.

Go also has its own object-file format, so I'm not sure you can make Go-compatible object files with NASM.

If you want to call this function from something other than Go, you'll also need to port the code to a different calling convention. Go appears to be using a stack-args calling convention even for x86-64, unlike the normal x86-64 System V ABI or the x86-64 Windows calling convention. (Or maybe the instructions that mov function args into REG_P1 and so on disappear when Go builds this source for a register-arg calling convention?)

(This is why you had to use movzx eax, dl instead of loading from the stack at all.)

BTW, rewriting this code in C instead of NASM would probably make even more sense if you want to use it with C. Small functions are best inlined and optimized away by the compiler.

It would be a good idea to check your translation, or get a starting point, by assembling with the Go assembler and using a disassembler.

objdump -drwC -Mintel or Agner Fog's objconv disassembler would be good, but they don't understand Go's object-file format. If Go has a tool to extract the actual machine code or get it in an ELF object file, do that.

If not, you could use ndisasm -b 64 (which treats input files as flat binaries, disassembling all the bytes as if they were instructions). You can specify an offset/length if you can find out where the function starts. x86 instructions are variable length, and disassembly will likely be "out of sync" at the start of the function. You might want to add a bunch of single-byte NOP instructions (kind of a NOP sled) for the disassembler, so if it decodes some 0x90 bytes as part of an immediate or disp32 for a long instruction that was really not part of the function, it will be in sync. (But the function prologue will still be messed up).

You might add some "signpost" instructions to your Go asm functions to make it easy to find the right place in the mess of crazy asm from disassembling metadata as instructions. e.g. put a pmuludq xmm0, xmm0 in there somewhere, or some other instruction with a unique mnemonic that you can search for which the Go code doesn't include. Or an instruction with an immediate that will stand out, like addq $0x1234567, SP. (An instruction that will crash so you don't forget to take it out again is good here.)
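As a rough sketch of the signpost idea, the following Go program scans a byte image for the machine-code encoding of pmuludq xmm0, xmm0 (66 0F F4 C0) and reports the offsets where it occurs. The function name findSignposts and the in-memory sample image are made up for illustration; in practice you would read the bytes of the Go object file or binary instead.

```go
package main

import (
	"bytes"
	"fmt"
)

// signpost is the encoding of pmuludq xmm0, xmm0:
// 66 0F F4 /r with ModRM byte C0 (register-to-register, xmm0, xmm0).
var signpost = []byte{0x66, 0x0F, 0xF4, 0xC0}

// findSignposts returns every offset in image where the signpost bytes occur.
func findSignposts(image []byte) []int {
	var offsets []int
	start := 0
	for {
		i := bytes.Index(image[start:], signpost)
		if i < 0 {
			return offsets
		}
		offsets = append(offsets, start+i)
		start += i + 1
	}
}

func main() {
	// Hypothetical image: two NOPs, the signpost, then a RET.
	image := []byte{0x90, 0x90, 0x66, 0x0F, 0xF4, 0xC0, 0xC3}
	fmt.Println(findSignposts(image))
}
```

An offset found this way gives a place to start disassembling from; a match can still be a coincidence inside metadata, so check that the surrounding instructions look like your function.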

Or you could use gdb's built-in disassembler: add an instruction that will segfault (like a load from a bogus absolute address (movl 0, AX null-pointer deref), or a register holding a non-pointer value e.g. movl (AX), AX). Then you'll have an instruction-pointer value for the instructions in memory, and can disassemble from some point behind that. (Probably the function start will be 16-byte aligned.)

Specific instructions.

MOVBLZX AL, AX reads AL, so that's definitely an 8-bit source operand. The size of the AX destination is given by the L part of the mnemonic, meaning long (32 bits), as in GAS AT&T syntax. (The GAS mnemonic for that form of movzx is movzbl %al, %eax.) See "What does cltq do in assembly?" for a table of cdq / cdqe and their AT&T equivalents, and the AT&T / Intel mnemonics for the equivalent MOVSX instruction.

The NASM instruction you want is movzx eax, al. Using rax as the destination would be a waste of a REX prefix, because writing a 32-bit register already zeroes the upper half of the 64-bit register. Using ax as the destination would be a mistake: it wouldn't zero-extend into the full register, and would leave whatever garbage was in the upper bits. Go asm syntax for x86 is very confusing when you're not used to it, because AX can mean AX, EAX, or RAX depending on the operand size.
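To make the difference between the destination sizes concrete, here is a small Go model of the register-write rules on x86-64 (a sketch for illustration, not real asm; the helper names are made up). A write to a 32-bit register clears bits 32..63 of the full register, while a write to a 16-bit register leaves bits 16..63 untouched.

```go
package main

import "fmt"

// write16 models writing a 16-bit register such as ax:
// bits 16..63 of the full register are preserved.
func write16(reg uint64, v uint16) uint64 {
	return reg&^0xFFFF | uint64(v)
}

// write32 models writing a 32-bit register such as eax:
// bits 32..63 of the full register are cleared.
func write32(reg uint64, v uint32) uint64 {
	return uint64(v)
}

// movzxEaxAl models "movzx eax, al".
func movzxEaxAl(rax uint64) uint64 {
	return write32(rax, uint32(uint8(rax)))
}

// movzxAxAl models "movzx ax, al" (the mistaken variant).
func movzxAxAl(rax uint64) uint64 {
	return write16(rax, uint16(uint8(rax)))
}

func main() {
	rax := uint64(0xDEADBEEFCAFEF001) // garbage above the low byte, al = 0x01
	fmt.Printf("movzx eax, al -> %#x\n", movzxEaxAl(rax))
	fmt.Printf("movzx ax, al  -> %#x\n", movzxAxAl(rax))
}
```

With the ax form, a following neg rax would not produce the all-zeros or all-ones mask that the rest of CSwap depends on.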

Obviously mov rax, al isn't a possibility: Like most instructions, mov requires both its operands to be the same size. movzx is one of the rare exceptions.

MOVB choice+16(FP), AL is a byte load into AL, not an immediate move. choice+16 is an offset from FP. This syntax is basically the same as AT&T addressing modes, with FP as a register and choice as an assemble-time constant.

FP is a pseudo-register name. It's pretty clear that it should simply be loading the low byte of the 3rd arg-passing slot, because choice is the name of a function arg. (In Go asm, choice is just syntactic sugar, or a constant defined as zero.)

Before a call instruction, rsp points at the first stack arg, so that + 16 is the 3rd arg. It appears that FP is that base address (and might actually be rsp+8 or something). After a call (which pushes an 8 byte return address), the 3rd stack arg is at rsp + 24. After more pushes, the offset will be even larger, so adjust as necessary to reach the right location.
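The offset arithmetic above can be written out as a tiny Go helper. This is only a sketch under the answer's assumption that FP is the address of the first stack arg (where rsp pointed just before the call); rspOffset is a made-up name, and none of it applies if the args arrive in registers.

```go
package main

import "fmt"

const slot = 8 // size in bytes of a return address or a push on x86-64

// rspOffset converts an FP-relative argument offset (as in choice+16(FP))
// into an rsp-relative offset inside the callee, after the call has pushed
// the return address and the callee has done `pushes` 8-byte pushes.
func rspOffset(fpOffset, pushes int) int {
	return fpOffset + slot + pushes*slot
}

func main() {
	fmt.Println(rspOffset(16, 0)) // choice right after the call
	fmt.Println(rspOffset(16, 1)) // after one push, e.g. push rbp
	fmt.Println(rspOffset(16, 3)) // after three pushes, as in cswap_asm in the question
}
```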

If you're porting this function to be called with a standard calling convention, the 3 integer args will be passed in registers, with no stack args. Which 3 registers depends on whether you're building for Windows vs. non-Windows. (See Agner Fog's calling conventions doc: http://agner.org/optimize/)

BTW, a byte load into AL and then movzx eax, al is just dumb. Much more efficient on all modern CPUs to do it in one step with

movzx eax, byte [rsp + 24] ; or rbp+32 if you made a stack frame.

I hope the source in the question is from un-optimized Go compiler output? Or the assembler itself makes such optimizations?
(This answer was selected by the asker as the best answer.)
