帮助awk截断和填充

I have a long list of Unicode values that are semi-colon delimited. Here's an example:

E0027;TAG APOSTROPHE;Cf;0;BN;;;;;N;;;;;

All I need is the "E0027;" part.

So I first need to drop everything in the line AFTER the first semicolon, but in some cases the semicolon is after 4 digits, in other cases, (as above) it's after 5. If it were the same throughout I'd just truncate after a fixed number of chars. I've found lots of examples for doing various manipulations with awk but no regular expressions that seem to fit this particular case. Does anyone know what the proper syntax is? The logic is merely to keep everything BEFORE the first semicolon and to drop everything after it.

Then, for the resulting file, I need to add a leading 0 to the line if the number is only 4 chars. So for example:

8A9B;

Should become:

08A9B;

But the 5 digit values (such as the first example) should remain as is...no leading zero.

(Though would an extra leading zero make a difference if I'm using these values in HTML? Would it matter if I had:

&#x0E0027

Instead of:

&#xE0027

If these will be parsed identically by PHP and won't make a difference, I guess the latter part isn't so important (though with thousands of extra zeros it will bloat the size of the code.)

Thank you for any help in advance!

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

5条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dpv46227 2011-03-27 03:35
关注
awk -F';' '$0=length($1)<5?"0" $1 FS:$1 FS'

Proof of Concept

$ echo "8A9B;TAG APOSTROPHE;Cf;0;BN;;;;;N;;;;;" | awk -F';' '$0=length($1)<5?"0" $1 FS:$1 FS' 08A9B; $ echo "E0027;TAG APOSTROPHE;Cf;0;BN;;;;;N;;;;;" | awk -F';' '$0=length($1)<5?"0" $1 FS:$1 FS' E0027;
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(4条)

报告相同问题？

关注问题

帮助awk截断和填充 php
2011-03-27 03:06

回答 5 已采纳 awk -F';' '$0=length($1)<5?"0" $1 FS:$1 FS' Proof of Concept $ echo "8A9B;TAG APOSTROPHE;Cf;
Comm，awk替代php php
2013-04-30 22:27

回答 1 已采纳 By using these : read files into an array file function get difference between two arrays array_
sed命令和awk命令练习 linux
2018-12-28 17:06

回答 2 已采纳第一题答案： 1 This is the first line. 2 Hello, Everybody! 3 192.168.1.1 www..edu.cn 4 lijunhui:x:504:
Ctfshow web入门 PHP特性篇 web89-web151 全
2023-07-10 14:49

Jay 17的博客 Ctfshow web入门 PHP特性篇 web89-web151 全 web入门 PHP特性篇的wp都一把梭哈在这里啦~ 有点多，师傅们可以收藏下来慢慢看，写的应该挺全面的叭..... 有错误敬请斧正！
像sed / awk / grep一样帮助PHP数据修改 php
2010-04-29 22:20

回答 1 已采纳 You don't filter output. You use simple_html_dom to parse and manipulate that way. it really is mo
使用shell 或者awk 统计数据 linux
2022-05-17 16:13

回答 1 已采纳 # awk -F "," '{print $1,$5}' file |sed -e 's/{a=//g' -e 's/e=//g' -e 's/}//g' 22 5656 33 5346 44
利用sed和awk将列数据通过关键字转换为多列
2018-01-10 04:02

回答 4 已采纳 ![图片说明](https://img-ask.csdn.net/upload/201801/11/1515651157_581541.png) ``` #shell命令： #将每一
CTFSHOW-PHP特性
2021-10-09 21:53

_Monica_的博客 include("flag.php"); highlight_file(__FILE__); if(isset($_GET['num'])){ $num = $_GET['num']; if(preg_match("/[0-9]/", $num)){ die("no no no!"); } if(intval($num)){ echo $flag; } } 知识点： ...
linux命令之 awk 截取 linux
2021-09-15 16:01

回答 1 已采纳你可以先把你要的2个字段打印出来再用管道处理比如 awk '{print $1,$7}' t.txt 结果应该是 `COMCODE` '公司编码', 然后继续用awk 去掉符号，或者用其他命令
使用curl和awk获取本地服务器的外网IP地址 centos 服务器
2022-09-26 13:02

回答 3 已采纳你需要加个参数 -s 隐藏耗时 curl -s myip.ipip.net | awk '{print $2}'
Bash循环按文件计算php标签 bash php
2015-05-21 21:18

回答 2 已采纳 Any time you write a loop in shell just to manipulate text you have the wrong approach. In this ca
ctfshow-php特性系列
2021-04-10 17:57

multi4的博客文章目录WEB89-数组绕过WEB90-intval绕过WEB91-preg_match-换行绕过WEB92-...include("flag.php"); highlight_file(__FILE__); if(isset($_GET['num'])){ $num = $_GET['num']; if(preg_match("/[0-9]/", $num)){
执行 awk 出现下面问题
2016-04-20 09:39

回答 2 已采纳自己解决了自己解决了
Web RCE总结
2023-12-20 21:14

孟嘎嘎想学好Web的博客文章目录 RCE php常见函数与php性质文件包含漏洞漏洞分类本地文件包含漏洞远程文件包含漏洞伪协议 php://input php://filter data伪协议 file://伪协议利用XSS执行（还不会）代码执行漏洞过滤及bypass方式 ...
PHP 面试知识点整理归纳
2018-09-05 18:33

php小学一年级生的博客该篇文章是针对Github上wudi/PHP-Interview-Best-Practices-in-China资源的答案个人整理 lz也是初学者，以下知识点均为自己整理且保持不断更新，也希望各路大神多多指点，若发现错误或有补充，可直接comment，lz...
没有解决我的问题, 去提问

悬赏问题

¥15 javaweb登陆的网页为什么不能正确连接查询数据库
¥15 数学建模数学建模需要
¥15 已知许多点位，想通过高斯分布来随机选择固定数量的点位怎么改
¥20 nao机器人语音识别问题
¥15 怎么生成确定数目的泊松点过程
¥15 layui数据表格多次重载的数据覆盖问题
¥15 python点云生成mesh精度不够怎么办
¥15 QT C++ 鼠标键盘通信
¥15 改进Yolov8时添加的注意力模块在task.py里检测不到
¥50 高维数据处理方法求指导

帮助awk截断和填充

5条回答 默认 最新

Proof of Concept

悬赏问题

5条回答默认最新