在Go中解析格式化的字符串

The Problem

I have slice of string values wherein each value is formatted based on a template. In my particular case, I am trying to parse Markdown URLs as shown below:

- [What did I just commit?](#what-did-i-just-commit)
- [I wrote the wrong thing in a commit message](#i-wrote-the-wrong-thing-in-a-commit-message)
- [I committed with the wrong name and email configured](#i-committed-with-the-wrong-name-and-email-configured)
- [I want to remove a file from the previous commit](#i-want-to-remove-a-file-from-the-previous-commit)
- [I want to delete or remove my last commit](#i-want-to-delete-or-remove-my-last-commit)
- [Delete/remove arbitrary commit](#deleteremove-arbitrary-commit)
- [I tried to push my amended commit to a remote, but I got an error message](#i-tried-to-push-my-amended-commit-to-a-remote-but-i-got-an-error-message)
- [I accidentally did a hard reset, and I want my changes back](#i-accidentally-did-a-hard-reset-and-i-want-my-changes-back)

What I want to do?

I am looking for ways to parse this into a value of type:

type Entity struct {
    Statement string
    URL string
}

What have I tried?

As you can see, all the items follow the pattern: - [{{ .Statement }}]({{ .URL }}). I tried using the fmt.Sscanf function to scan each string as:

var statement, url string
fmt.Sscanf(s, "[%s](%s)", &statement, &url)

This results in:

statement = "I"
url = ""

The issue is with the scanner storing space-separated values only. I do not understand why the URL field is not getting populated based on this rule.

How can I get the Markdown values as mentioned above?

EDIT: As suggested by Marc, I will add couple of clarification points:

This is a general purpose question on parsing strings based on a format. In my particular case, a Markdown parser might help me but my intention to learn how to handle such cases in general where a library might not exist.
I have read the official documentation before posting here.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doujunchi1238 2018-01-03 16:25
关注
Note: The following solution only works for "simple", non-escaped input markdown links. If this suits your needs, go ahead and use it. For full markdown-compatibility you should use a proper markdown parser such as gopkg.in/russross/blackfriday.v2.

You could use regexp to get the link text and the URL out of a markdown link.

So the general input text is in the form of:

[some text](somelink)

A regular expression that models this:

\[([^\]]+)\]\(([^)]+)\)

Where:

\[ is the literal [

([^\]]+) is for the "some text", it's everything except the closing square brackets

\] is the literal ]

\( is the literal (

([^)]+) is for the "somelink", it's everything except the closing brackets

\) is the literal )

Example:

r := regexp.MustCompile(`\[([^\]]+)\]\(([^)]+)\)`) inputs := []string{ "[Some text](#some/link)", "[What did I just commit?](#what-did-i-just-commit)", "invalid", } for _, input := range inputs { fmt.Println("Parsing:", input) allSubmatches := r.FindAllStringSubmatch(input, -1) if len(allSubmatches) == 0 { fmt.Println(" No match!") } else { parts := allSubmatches[0] fmt.Println(" Text:", parts[1]) fmt.Println(" URL: ", parts[2]) } }

Output (try it on the Go Playground):

Parsing: [Some text](#some/link) Text: Some text URL: #some/link Parsing: [What did I just commit?](#what-did-i-just-commit) Text: What did I just commit? URL: #what-did-i-just-commit Parsing: invalid No match!
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

在Golang中解析格式化的字符串
2014-07-31 22:59

回答 1 已采纳 For example, package main import ( "errors" "fmt" "strconv" "strings" ) // http
在vue中解析字符串html html5 vue.js 前端
2023-04-11 16:41

回答 2 已采纳参考自 chatGPT：可以使用Vue的动态组件来实现解析字符串HTML并使事件有效。首先，将字符串HTML转换为Vue组件，可以使用Vue的compile函数来实现： import Vue from
在Go中以字符串形式返回格式化日期
2015-01-16 04:48

回答 1 已采纳 In you main function, you should use getCurrentTime() instead of getCurrentTime. Like this: fmt.P
[golang]-go中字符串格式化与fmt包简介
2021-01-30 22:58

alwaysrun的博客文章目录格式化符通用指针数值字符串与字节序列宽度与精度标识符占位符格式化错误GoStringer & StringerScanningPrintingErrorf fmt包中实现了格式化的I/O函数（类似Ｃ语言中的printf和scanf，但更加简单）。...
在go中解析证书字符串
2016-10-17 10:20

回答 1 已采纳 I found the error. The problem was in the source. As I was explaining, my cert string was "30 82 0
格式化输出字符串，控制长度，右边对齐，不足补星号 python 有问必答
2022-10-26 16:38

回答 4 已采纳 s=input('输入字符串：') print(s.rjust(8,'*'))
Linux下C++字符串格式化问题 c++ linux
2017-01-05 03:10

回答 4 已采纳 string类里没有printf()这个方法，printf()是C++标准库里面函数，直接调用就可以了
Go-geopattern-在golang中从一个字符串创建漂亮的生成图像模式
2019-08-14 03:59

在Golang中，`Go-geopattern`是一个开源库，它允许开发者从任何字符串生成美观的几何图案，这些图案可以用于网站背景、验证码、艺术作品等。这个库的灵感来源于JavaScript的`GeoPattern`库，但在Golang中提供了更...
java -给定格式化字符串，解析其中对应内容 java 有问必答
2021-11-26 11:20

回答 2 已采纳转换成json，然后用json工具解析出来，放入到map中 public static void main(String[] args) { String str = "are
使用localtime结合格式化字符串的方法显示当前日期时间 python
2022-05-15 15:48

回答 1 已采纳 import time tm = time.localtime() dt = time.strftime('%Y-%m-%d %H:%M:%S', tm) print('当前日期时间为：', d
将字符串转换为时间并在golang中解析
2016-11-02 19:31

回答 1 已采纳 You're not correctly providing the layout argument to Parse. You're supposed to be using Mon Jan 2
go语言中时间戳格式化的方法
2020-09-22 09:16

本文将详细介绍如何在Go语言中对时间戳进行格式化。首先，我们需要了解Go语言中的`time`包。`time`包提供了创建、解析、比较和格式化时间的功能。在Go语言中，我们可以通过`time.Now()`获取当前时间，而`time.Unix...
python中列表解析和格式字符串的使用问题 python 开发语言
2021-12-20 22:28

回答 2 已采纳 def pi10(): import math return ['{x:.{d}f}'.format(x=math.pi, d=n) for n in range(1,11)]
python格式化字符串format_Python 中格式化字符串 % 和 format 两种方法之间的区别
2020-11-28 21:33

weixin_39667509的博客 Python2.6引入了 format 格式化字符串的方法，现在格式化字符串有两种方法，就是 % 和 format ，具体这两种方法有什么区别呢？请看以下解析。# 定义一个坐标值c = (250, 250)# 使用%来格式化s1 = "敌人坐标：%s" % c...
48、Go语言秘籍：数组与字符串的转换技巧解析
2024-06-23 12:32

多多的编程笔记的博客本文深入探讨了Go语言中数组与字符串的转换方法，涵盖了fmt.Sprintf和strings.Split等关键函数的使用，同时强调了在转换过程中需要注意的细节和注意事项。通过实际应用场景和实用技巧，我们了解了数组与字符串之间的...
没有解决我的问题, 去提问

悬赏问题

¥15 乌班图ip地址配置及远程SSH
¥15 怎么让点阵屏显示静态爱心，用keiluVision5写出让点阵屏显示静态爱心的代码，越快越好
¥15 PSPICE制作一个加法器
¥15 javaweb项目无法正常跳转
¥15 VMBox虚拟机无法访问
¥15 skd显示找不到头文件
¥15 机器视觉中图片中长度与真实长度的关系
¥15 fastreport table 怎么只让每页的最下面和最顶部有横线
¥15 java 的protected权限，问题在注释里
¥15 这个是哪里有问题啊？

在Go中解析格式化的字符串

2条回答 默认 最新

悬赏问题

2条回答默认最新