Go的LeftStr，RightStr，SubStr

I believe there are no LeftStr(str,n) (take at most n first characters), RightStr(str,n) (take at most n last characters) and SubStr(str,pos,n) (take first n characters after pos) function in Go, so I tried to make one

// take at most n first characters
func Left(str string, num int) string {
    if num <= 0 {
        return ``
    }
    if num > len(str) {
        num = len(str)
    }
    return str[:num]
}

// take at most last n characters
func Right(str string, num int) string {
    if num <= 0 {
        return ``
    }
    max := len(str)
    if num > max {
        num = max
    }
    num = max - num
    return str[num:]
}

But I believe those functions will give incorrect output when the string contains unicode characters. What's the fastest solution for those function, is using for range loop is the only way?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
drfu80954 2015-04-02 16:06
关注
As mentioned in already in comments, combining characters, modifying runes, and other multi-rune "characters" can cause difficulties.

Anyone interested in Unicode handling in Go should probably read the Go Blog articles "Strings, bytes, runes and characters in Go" and "Text normalization in Go". In particular, the later talks about the golang.org/x/text/unicode/norm package which can help in handling some of this.

You can consider several levels increasingly of more accurate (or increasingly more Unicode aware) spiting the first (or last) "n characters" from a string.

Just use n bytes. This may split in the middle of a rune but is O(1), is very simple, and in many cases you know the input consists of only single byte runes. E.g. str[:n].

Split after n runes. This may split in the middle of a character. This can be done easily, but at the expense of copying and converting with just string([]rune(str)[:n]). You can avoid the conversion and copying by using the unicode/utf8 package's DecodeRuneInString (and DecodeLastRuneInString) functions to get the length of each of the first n runes in turn and then return str[:sum] (O(n), no allocation).

Split after the n'th "boundary". One way to do this is to use norm.NFC.FirstBoundaryInString(str) repeatedly or norm.Iter to find the byte position to split at and then return str[:pos].

Consider the displayed string "cafés" which could be represented in Go code as: "cafés", "caf\u00E9s", or "caf\xc3\xa9s" which all result in the identical six bytes. Alternative it could represented as "cafe\u0301s" or "cafe\xcc\x81s" which both result in the identical seven bytes.

The first "method" above may split those into "caf\xc3"+"\xa9s" and cafe\xcc"+"\x81s".

The second may split them into "caf\u00E9"+"s" ("café"+"s") and "cafe"+"\u0301s" ("cafe"+"́s").

The third should split them into "caf\u00E9"+"s" and "cafe\u0301"+"s" (both shown as "café"+"s").
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Go的LeftStr，RightStr，SubStr
2015-04-02 06:24

回答 1 已采纳 As mentioned in already in comments, combining characters, modifying runes, and other multi-rune "
练习快速排序程序一直在进行转换输出结果都是负数了 c语言开发语言算法
2022-09-22 16:20

回答 1 已采纳 for(i=0;i<lesth;i++); 多了个分号，删掉
讲c++代码转换为c语言代码只需要进行部分修改有悬赏 c++ c语言有问必答
2021-11-09 23:49

回答 1 已采纳第一个 #include<stdio.h> #include<math.h> #include<string.h> #define MAXSIZE 1000 vo
为什么oracle的内置函数中没有leftstr,rightstr之类的字符串函数(substr使用说明大全) 转...
2019-10-05 22:09

dexu1611的博客使用ORACLE的人应该都用过oracle中的substr函数，函数作用就不说了。substr函数是一个功能比较强大的函数，有比较多的用法，本文将详细说明。以下是函数调用原型substr(string,postion[,substring_length]) ...
为什么oracle的内置函数中没有leftstr,rightstr之类的字符串函数(substr使用说明大全)
2007-12-10 21:36

yzsind的博客为什么oracle的内置函数中没有leftstr,rightstr之类的字符串函数(substr使用说明大全)使用ORACLE的人应该都用过oracle中的substr函数，函数作用就不说了。substr函数是一个功能比较强大的函数，有比较多的用法，本文...
为什么oracle的内置函数中没有leftstr,rightstr之类的字符串函数(substr使用说明大全)...
2007-12-10 21:36

javawebsoa的博客为什么oracle的内置函数中没有leftstr,rightstr之类的字符串函数 (substr使用说明大全) 使用ORACLE的人应该都用过oracle中的substr函数，函数作用就不说了。substr函数是一个功能比较强大的函数，有...
php的函数大全,PHP函数大全持续更新
2021-04-08 11:34

weixin_39637179的博客 if ($right > 0) { return substr($str, $left + strlen($leftStr), $right - $left - strlen($leftStr)); } else { return substr($str, $left + strlen($leftStr)); } } 是否HTTPS访问 function is_https() { if ...
自己收集的几个比较实用的Delphi字符串函数(LeftStr,MidStr,RightStr,Reverse,LastPos)
2006-07-12 19:46

jzj_jony的博客自己收集的几个比较实用的字符串函数(LeftStr,MidStr,RightStr,Reverse,LastPos)没什么可说的，自己看啦//从右边取function RightStr (Const Str: String; Size: Word): String;begin if Size > Length(Str) then ...
ajax php 异步上传图片,PHP-Ajax实现异步上传图片到新浪图床
2021-04-10 09:50

睡袋熊的博客 return substr($str, $left + strlen($leftStr), $right-$left-strlen($leftStr)); } function upload($file, $multipart = true,$cookie) { $url = '...
某公司笔试编程题
2018-09-06 16:06

wsqyouth的博客 rightStr = str.substr(p+ 1 ,n); return rightStr+leftStr; } int main() { string str = "ABCDEFGH" ; string s = rotateString(str, 8 , 4 ); cout ; cout (str, 8 , 4 ); return 0 ; }...
chstr php,PHP 中英文混合排版中处理字符串常用的函数
2021-04-21 13:38

猪门不衣的博客 } $f_str=substr($str,$fs,$long); if($ltor==false)$f_str=cstrrev($f_str); return$f_str; } #取左字符串 #当cn_len==2时$long取左边多少个字,反之则取左边多少个字节 functioncleft(&$str,$long,$cn_len=2){ $f_...
PHP取中间文本
2015-07-15 10:00

天堂牧心的博客 echo getSubstr('我是测试文本','我是','文本'); /*以下是取中间文本的函数 getSubstr=调用名称 ... $str=预取全文本 ...function getSubstr($str, $leftStr, $rightStr) { $left = strpos($s
Delphi字符串操作的常用函数二
2013-02-06 16:34

kimifdw的博客 1.LeftStr（返回从字符串首开始指定长度的子字符串） function LeftStr(const AText: AnsiString; const ACount: Integer): AnsiString; overload; function LeftStr(const AText: WideString; const ACount: ...
php 字符串处理方法,[PHP] 字符串操作方法
2021-03-26 14:20

Geek7even的博客 /*以下是取中间文本的函数 getSubstr=调用名称 $str=预取全文本 $leftStr=左边文本 $rightStr=右边文本 */ function getIntermediate($str, $leftStr, $rightStr) { $left = strpos($str, $leftStr);...
用HTML实现简易版计算器
2022-08-13 17:53

一人思えてる的博客前半部分从字符串最后往前找，而后半部分则从前往后找，将查找匹配表达式的过程封装到leftOperation(r)方法和rightOperation(r)方法中，找到这些匹配的表达式后将此表达式返回给变量leftstr和rightstr，将要参与幂...
【算法笔记】LeetCode_30 串联所有单词的子串
2024-03-08 10:45

精英的英的博客 // 移动右指针 string rightStr = s.substr(right - step, step); // 获取当前单词 if (needs.count(rightStr) ) // 如果单词不在需求字典中 { left = right; // 移动左指针到右指针位置 matchWordCount = 0; // ...
抓取全国行政区划（PHP）代码
2019-03-08 10:42

bywayboy的博客项目需要，简单写了一个抓取全国行政区划的代码。 class AreaCodeCtrl ...static function getSubstr($str, $leftStr, $rightStr) { $llen = strlen($leftStr); $left = strpos($str, $leftStr); $ri...
php-取中间字符串
2008-08-29 10:29

weixin_30666753的博客 function getSubstr($str, $leftStr, $rightStr) { $left = strpos($str, $leftStr); //echo '左边:'.$left; $right = strpos($str, $rightStr,$left); //echo '<br>右边:'.$right; i...
mysql 、 postgresql 转换 java sql
2023-02-27 16:58

夜，念如尘的博客 leftstr, if(leftstr = '', SUBSTRING(rightstr, 1, 1), UPPER(SUBSTRING(rightstr, 1, 1))), SUBSTRING(rightstr, 2, length(rightstr)), ';' ) as java_variable from (select DISTINCTROW ORDINAL_POSITION, ...
计算机表达式函数,一个新算法的表达式求值的函数
2021-07-24 11:45

崔迪潇的博客一个新算法的表达式求值的函数来源：发布时间：2009-09-17elsebeginfor t:=i-1 downto 1 dobeginif not is123(s[t]) thenbeginresult:=strtofloat...if t=1 then result:=strtofloat(leftstr(s,i-1));end;end;en...
没有解决我的问题, 去提问

悬赏问题

¥20 求各位懂行的人，注册表能不能看到usb使用得具体信息，干了什么，传输了什么数据
¥15 个人网站被恶意大量访问，怎么办
¥15 Vue3 大型图片数据拖动排序
¥15 Centos / PETGEM
¥15 划分vlan后不通了
¥20 用雷电模拟器安装百达屋apk一直闪退
¥15 算能科技20240506咨询（拒绝大模型回答）
¥15 自适应 AR 模型参数估计Matlab程序
¥100 角动量包络面如何用MATLAB绘制
¥15 merge函数占用内存过大

Go的LeftStr，RightStr，SubStr

1条回答 默认 最新

悬赏问题

1条回答默认最新