将分隔符内的文本转换为有效的URL

I have to convert an old website to a CMS and one of the challenges I have is at present there are over 900 folders that contain up to 9 text files in each folder. I need to combine the up to 9 text files into one and then use that file as the import into the CMS.

The file concatenation and import are working perfectly.

The challenge that I have is parsing some of the text in the text file.

The text file contains a url in the form of

Some text [http://xxxxx.com|About something] some more text

I am converting this with this code

if (substr ($line1, 0, 7) !=="Replace") {
    $pattern = '/\\[/';
    $pattern2 = '/\\]/';
    $pattern3 = '/\\|/';
    $replacement = '<a href="';
    $replacement3 = '">';
    $replacement2='</a><br>';

    $subject = $line1;
    $i=preg_replace($pattern, $replacement, $subject, -1 );
    $i=preg_replace($pattern3, $replacement3, $i, -1 );
    $i=preg_replace($pattern2, $replacement2, $i, -1 );

    $line .= '<div class="'.$folders[$x].'">'.$i.'</div>' ;
}

It may not be the most efficient code but it works and as this is a one off exercise execution time etc is not an issue.

Now to the problem that I cannot seem to code around. Some of the urls in the text files are in this format

Some text [http://xxxx.com] some more text

The pattern matching that I have above finds pattern and pattern2 but as there is no pattern3 the url is malformed in the output.

Regular expressions are not my forte is there a way to modify what I have above or is there another way to get the correctly formatted url in my output or will I need to parse the output a second time looking for the malformed url and correct it before writing it to the output file?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doulin2555 2014-03-31 07:00
关注
You can use preg_replace_callback() to achieve this:

Find any string of the format [...]

Try to split them by the delimiter | using explode()

If the split array contains two pieces, then it means the [...] string contains two pieces: the link href and the link anchor text

If not, then it means the the [...] string contains only the link href part

Format and return the link

Code:

$input = <<<EOD Some text [http://xxxxx.com|About something] some more text Some text [http://xxxx.com] some more text EOD; $output = preg_replace_callback('#\[([^\]]+)\]#', function($m) { $parts = explode('|', $m[1]); if (count($parts) == 2) { return sprintf('<a href="%s">%s</a>', $parts[0], $parts[1]); } else { return sprintf('<a href="%1$s">%1$s</a>', $m[1]); } }, $input); echo $output;

Output:

Some text About something some more text
Some text http://xxxx.com some more text

Live demo
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

将分隔符内的文本转换为有效的URL php
2014-03-31 06:54

回答 1 已采纳 You can use preg_replace_callback() to achieve this: Find any string of the format [...] Try to
使用PHP使用SPACE分隔符将十进制转换为ASCII php
2016-03-06 03:06

回答 1 已采纳 This would be a situation where the strtok function actually could be used for something. The str
PHP - 将制表符分隔的TXT文件转换为CSV php
2016-08-07 02:34

回答 3 已采纳 fputcsv() only writes one line at a time. Not the whole file. You you need to loop through $data i
php将文本文件转换csv输出的方法
2020-12-18 16:08

这个类提供了转换成固定宽度的CSV文件,快速,简便的方法,它可将SplFileObject用于执行迭代,使它非常高效的一个迭代只知道当前成员,期权是提供给指定行字符和字段分隔符结束,This from CSV files.这个类是特别有用的,...
PHP - 将字符串转换为具有千位分隔符支持的整数 php
2017-01-02 20:53

回答 2 已采纳 You can use str_replace in PHP . like this. intval(str_replace('.','',$string)) Assuming you a
pl/sql文本导入器分隔符 oracle sql
2015-07-15 07:39

回答 2 已采纳 plsql可以设置的。 ![图片说明](https://img-ask.csdn.net/upload/201507/15/1436949955_106362.png)
java中设置千分位分隔符 eclipse java
2022-06-16 18:52

回答 1 已采纳 import java.util.*; public class T { public static void main(String[] args) { int ri, r
php html url编码转换,HTML URL编码
2021-04-21 23:48

heyonie的博客 URL——统一资源定位符Web浏览器通过URL从Web服务器上请求页面。URL就是网页的地址，如：http://yige.org。URL编码在因特网上传送URL，只能采用ASCII字符集。但由于URL常常包含ASCII字符集以外的字符，所以我们必须...
Python将字符串以分隔符分隔为列表元素 python
2022-05-17 16:37

回答 8 已采纳先用text.replace('\n\n', ',')把换行符换成逗号，再用text.split(',')用逗号作为分隔符。
PHP preg_split将分隔符保存在不同的元素中 php
2018-10-19 05:17

回答 2 已采纳 This will get you pretty close $page_content = 'the quick brown fox [[random text here]] and the
使用分隔符将CSV数据转换为array_map和str_getcsv php
2018-06-15 06:06

回答 2 已采纳 Try this: $rows = array_map(function($v){return str_getcsv($v, ";");}, file($filePath_product_nam
10、hive综合示例：数据多分隔符（正则RegexSerDe）、url解析、行列转换常用函数（case when、union、concat和explode）详细使用示例
2023-06-08 10:13

一瓢一瓢的饮 alanchanchn的博客一、多字节分隔符的三种方案 1、默认规则 Hive默认序列化类是LazySimpleSerDe，其只支持使用单字节分隔符（char）来加载文本数据，例如逗号、制表符、空格等等，默认的分隔符为”\001”。根据不同文件的不同分隔符...
如何将以冒号分隔的文本转化为json？ json python
2022-07-16 00:05

回答 1 已采纳做成嵌套字典，Pool的关键字作为第一层，然后Network到Available做成第二成关键字，第一层的key是第二成的关键字，第二成的key是那些值然后用json.dump来写入
awk 分隔符 多个空格_awk分隔符设定为多个字符或字符串
2021-01-17 14:23

weixin_39999222的博客 awk -F"[0][1]" '{}' 这种形式指定的分隔符是合并的关系，即以“01”作为一个字符为分隔符。故假如有test.txt文本文件只有一行:1. mail from: tomcat@gmail.com 2. subject:hello 3. data:2012-07-12 17:00 4. c...
先分号分隔然后逗号分割c语言,分隔符的用法
2021-05-22 17:51

炮弹喵的博客用分隔符标识文字分隔的位置，或在将文本转换为表格时，用其标识新行或新列的起始位置。而分隔符有哪些使用的技巧呢?以下是由学习啦小编整理关于分隔符的用法的内容，希望大家喜欢!分隔符的用法分页符在插入点处插入...
没有解决我的问题, 去提问

悬赏问题

¥15 没有证书，nginx怎么反向代理到只能接受https的公网网站
¥50 成都蓉城足球俱乐部小程序抢票
¥15 yolov7训练自己的数据集
¥15 esp8266与51单片机连接问题(标签-单片机|关键词-串口)（相关搜索：51单片机|单片机|测试代码）
¥15 电力市场出清matlab yalmip kkt 双层优化问题
¥30 ros小车路径规划实现不了，如何解决？(操作系统-ubuntu)
¥20 matlab yalmip kkt 双层优化问题
¥15 如何在3D高斯飞溅的渲染的场景中获得一个可控的旋转物体
¥88 实在没有想法，需要个思路
¥15 MATLAB报错输入参数太多

将分隔符内的文本转换为有效的URL

1条回答 默认 最新

悬赏问题

1条回答默认最新