yacc shift-reduce用于歧义lambda语法

I'm writing a grammar for a toy language in Yacc (the one packaged with Go) and I have an expected shift-reduce conflict due to the following pseudo-issue. I have to distilled the problem grammar down to the following.

start:
  stmt_list

expr:
  INT | IDENT | lambda | '(' expr ')' { $$ = $2 }

lambda:
  '(' params ')' '{' stmt_list '}'

params:
  expr | params ',' expr

stmt:
  /* empty */ | expr

stmt_list:
  stmt | stmt_list ';' stmt

A lambda function looks something like this:

map((v) { v * 2 }, collection)

My parser emits:

conflicts: 1 shift/reduce

Given the input:

(a)

It correctly parses an expr by the '(' expr ')' rule. However given an input of:

(a) { a }

(Which would be a lambda for the identity function, returning its input). I get:

syntax error: unexpected '{'

This is because when (a) is read, the parser is choosing to reduce it as '(' expr ')', rather than consider it to be '(' params ')'. Given this conflict is a shift-reduce and not a reduce-reduce, I'm assuming this is solvable. I just don't know how to structure the grammar to support this syntax.

EDIT | It's ugly, but I'm considering defining a token so that the lexer can recognize the ')' '{' sequence and send it through as a single token to resolve this.

EDIT 2 | Actually, better still, I'll make lambdas require syntax like ->(a, b) { a * b} in the grammar, but have the lexer emit the -> rather than it being in the actual source code.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douji5746 2016-09-04 17:32
关注
Your analysis is indeed correct; although the grammar is not ambiguous, it is impossible for the parser to decide with the input reduced to ( <expr> and with lookahead ) whether or not the expr should be reduced to params before shifting the ) or whether the ) should be shifted as part of a lambda. If the next token were visible, the decision could be made, so the grammar LR(2), which is outside of the competence of go/yacc.

If you were using bison, you could easily solve this problem by requesting a GLR parser, but I don't believe that go/yacc provides that feature.

There is an LR(1) grammar for the language (there is always an LR(1) grammar corresponding to any LR(k) grammar for any value of k) but it is rather annoying to write by hand. The essential idea of the LR(k) to LR(1) transformation is to shift the reduction decisions k-1 tokens forward by accumulating k-1 tokens of context into each production. So in the case that k is 2, each production P: N → α will be replaced with productions ^TN^U → ^Tα U for each T in FIRST(α) and each U in FOLLOW(N). [See Note 1] That leads to a considerable blow-up of non-terminals in any non-trivial grammar.

Rather than pursuing that idea, let me propose two much simpler solutions, both of which you seem to be quite close to.

First, in the grammar you present, the issue really is simply the need for a two-token lookahead when the two tokens are <kbd>)</kbd><kbd>{</kbd>. That could easily be detected in the lexer, and leads to a solution which is still hacky but a simpler hack: Return ){ as a single token. You need to deal with intervening whitespace, etc., but it doesn't require retaining any context in the lexer. This has the added bonus that you don't need to define params as a list of exprs; they can just be a list of IDENT (if that's relevant; a comment suggests that it isn't).

The alternative, which I think is a bit cleaner, is to extend the solution you already seem to be proposing: accept a little too much and reject the errors in a semantic action. In this case, you might do something like:

start: stmt_list expr: INT | IDENT | lambda | '(' expr_list ')' { // If $2 has more than one expr, report error $$ = $2 } lambda: '(' expr_list ')' '{' stmt_list '}' { // If anything in expr_list is not a valid param, report error $$ = make_lambda($2, $4) } expr_list: expr | expr_list ',' expr stmt: /* empty */ | expr stmt_list: stmt | stmt_list ';' stmt

Notes

That's only an outline; the complete algorithm includes the mechanism to recover the original parse tree. If k is greater than 2 then T and U are strings the the FIRST_k-1 and FOLLOW_k-1 sets.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

yacc shift-reduce用于歧义lambda语法
2016-09-02 23:51

回答 2 已采纳 Your analysis is indeed correct; although the grammar is not ambiguous, it is impossible for the p
使用yacc的模态解析器
2016-06-07 19:17

回答 1 已采纳 I have solved the problem by creating pseudo-symbols which I insert using the lexer: line : T
为什么在定义更多类型时会导致yacc解析器崩溃？
2015-02-17 06:10

回答 2 已采纳 "It panics,"Index out of range in yacc.go:891" " I got into the same problem because I needed mor
c语法分析器--采用bison(yacc)
2021-04-02 16:32

c语法分析器，采用bison2.1(yacc), flex(lex), 生成程序的语法树分析单个文件，不支持预处理, 不解析预处理符号# bison，flex工具在上传包内，语法见cgrammar-new.y，词法见input.lex 另附相关说明，本代码采用vs...
求大神帮我讲解下这个yacc关于后缀表达式的输出
2016-06-06 14:16

回答 1 已采纳 http://blog.csdn.net/xiaowei_cqu/article/details/7764913
在Delphi中解析PHP / JavaScript文档结构 php
2009-07-31 16:31

回答 2 已采纳 By "Lexing" your referring to Lexical Analysis, and there are some ancient tools which mostly stil
无法安装套件
2018-07-17 17:08

回答 1 已采纳 I was able to solve this by doing the following: cd /Users/pro/go/src/golang.org/x/tools (This i
Postgresql中yacc语法树冲突解决方法（shift/reduce conflicts）
2022-10-20 17:37

高铭杰的博客 Postgresql中语法树冲突解决方法（shift/reduce conflicts）
Jupyter Notebook的主题编码问题 python
2018-02-10 07:54

回答 1 已采纳看 https://jingyan.baidu.com/article/456c463b3f56690a58314406.html 第4步
如何在不使用go get的情况下安装Go应用？ github
2017-04-03 12:18

回答 2 已采纳 Downloading cockroachdb using go get I get a $GOPATH/src/github.com/cockroachdb/cockroach with a s
yacc语法分析和语义分析-编译原理
2018-07-07 11:27

来源于北邮编译原理作业，代码中的语法分析和语义分析均为基于yacc实现，文件中包括代码、文档、测试用例。可供yacc初学者学习参考。
YACC1-2020:另一台自定义计算机-2020版
2021-03-29 03:49

YACC1-2020（还有另一个定制CPU）最后，是时候尝试从基本TTL和其他“简单” IC（没有74ls181 ALU了，这会欺骗:-)来构建自定义CPU。 YACC1-2020提供了工具和资源，以学习计算机科学，从数字逻辑的基础到具有用户定义...
Yacc.rar_Yacc java _yacc class_yacc_分析器_语法_语法分析
2022-09-23 09:23

语法分析器生成工具 YACC 实例说明含源码运行程序
yacc.tar.Z_C- 语法分析_Syntactic analysis_yacc_yacc.tar_yacc工具
2022-09-19 12:30

伯克利的语法分析器的创建工具
yacc-dev.7z
2022-07-17 17:14

yacc-dev.7z
vscode-lex-flex-yacc-bison:VSCode中的Lex，Flex，Yacc和Bison的语法突出显示
2021-05-20 20:46

Lex Flex Yacc野牛Lex，Flex，Yacc和Bison的语法突出显示。此扩展基于以下扩展：概述编程语言的编译器或解释器通常分解为两部分：阅读源程序并发现其结构。处理此结构，例如生成目标程序。 Lex和Yacc可以生成解决...
基于lex和yacc的词法分析器+语法分析器(C语言编译器)【100012624】
2023-06-05 16:38

词法分析器的作用是读取源程序生成词法单元，并过滤掉注释和空白。项目中的词法分析使用了lex 。
yacc语法学习-part1
2021-03-14 14:24

CrazyPixel的博客 2. Yacc编译器将此文法说明文件.y转换成用C编写的语法分析器文件.tab.c，这个文件里至少应该包含语法分析驱动程序yyparse()以及LALR分析表。在这个文件里，语法分析驱动程序调用yylex()这个函数获取输入记号，每次...
基于lex和yacc的词法分析器+语法分析器.zip
2022-07-04 12:26

本编译器所支持的词法和语法请参考第二第三小节解压压缩包运行命令 unzip compiler.zip 进入文件夹运行命令 ./compiler test.cmm 其中test.cmm可以替换成其他文件如果报错，则输出错误行号输出语法树产生语法树...
LEX&&YACC--编译界的神
2021-09-07 11:41

aiyo_的博客 yacc是gnu开源的全文解析工具，lex用于词法解析，yacc用于语法解析。lex一般也称为token scanner/lexer，yacc称为parser generator(语法解析器生成器)。 lex&&yacc这两个工具已经很老了，现代版本的工具为...
没有解决我的问题, 去提问

悬赏问题

¥15 对于知识的学以致用的解释
¥50 三种调度算法报错有实例
¥15 关于#python#的问题，请各位专家解答！
¥200 询问：python实现大地主题正反算的程序设计，有偿
¥15 smptlib使用465端口发送邮件失败
¥200 总是报错，能帮助用python实现程序实现高斯正反算吗？有偿
¥15 对于squad数据集的基于bert模型的微调
¥15 为什么我运行这个网络会出现以下报错？CRNN神经网络
¥20 steam下载游戏占用内存
¥15 CST保存项目时失败

yacc shift-reduce用于歧义lambda语法

2条回答 默认 最新

Notes

悬赏问题

2条回答默认最新