Split a string and remember the split positions

Assume I have the following string:

I have | been very busy lately and need to go | to bed early

By splitting on "|", you get:

$arr = array(
  [0] => I have
  [1] => been very busy lately and need to go
  [2] => to bed early
)

The first split comes after 2 words, and the second split 8 words after that. I store the number of words in each segment: array(2, 8, 3). The segments are then imploded back into a single string, which is passed on to a custom string tagger:

tag_string('I have been very busy lately and need to go to bed early');
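
The bookkeeping described above (split on "|", record how many words each segment has, then join the segments again) could be sketched as follows. This is only an illustration: split_with_counts is a hypothetical helper name, not part of the original code, and it assumes that words are separated by whitespace.

```php
<?php
// Hypothetical helper: split on the delimiter and record the word count of each segment.
// Assumes words are separated by whitespace.
function split_with_counts(string $input, string $delimiter = '|'): array
{
    $pieces = array_map('trim', explode($delimiter, $input));
    $counts = [];
    foreach ($pieces as $piece) {
        $counts[] = count(preg_split('/\s+/', $piece, -1, PREG_SPLIT_NO_EMPTY));
    }
    return [$pieces, $counts];
}

[$pieces, $counts] = split_with_counts('I have | been very busy lately and need to go | to bed early');

// Join the segments into the plain string that would be handed to the tagger.
$plain = implode(' ', $pieces);

print_r($counts);      // word counts per segment: 2, 8, 3
echo $plain, PHP_EOL;  // I have been very busy lately and need to go to bed early
```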

I don't know exactly what the output of tag_string will be, except that the total number of words will remain the same. Examples of output would be:

I have-nn been-vb very-vb busy lately and-rr need to-r go to bed early-p
I-ee have been-vb very busy-df lately-nn and need-f to go to bed-uu early-yy

This will lengthen the string by an unknown number of characters. I have no control over tag_string. What I know is (1) the number of words will be the same as before and (2) the original string was split after the 2nd word and again 8 words later. I now need a way to explode the tagged string into the same array as before:

$string = "I have-nn been-vb very-vb busy lately and-rr need to-r go to bed early-p";
function split_string_again() {
  // split after 2nd, and thereafter after 8th word
}

With output:

$arr = array(
  [0] => I have-nn
  [1] => been-vb very-vb busy lately and-rr need to-r go
  [2] => to bed early-p
)

So, to be clear (I wasn't before): I cannot split by remembering the strpos, because the strpos values before and after the string goes through the tagger aren't the same. I need to count the number of words instead. I hope this makes things clearer :)
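
A small check makes the point concrete, using the first example tagger output from above: the character offset of a word moves once tags are added, while the whitespace-separated word count does not. The variable names are illustrative only.

```php
<?php
$plain  = 'I have been very busy lately and need to go to bed early';
$tagged = 'I have-nn been-vb very-vb busy lately and-rr need to-r go to bed early-p';

// Character offset of the first word of the second segment: shifts once tags are added.
$offsetBefore = strpos($plain, 'been');   // 7
$offsetAfter  = strpos($tagged, 'been');  // 10

// Whitespace-separated word counts: identical before and after tagging.
$wordsBefore = count(preg_split('/\s+/', $plain, -1, PREG_SPLIT_NO_EMPTY));   // 13
$wordsAfter  = count(preg_split('/\s+/', $tagged, -1, PREG_SPLIT_NO_EMPTY));  // 13

var_dump($offsetBefore === $offsetAfter); // bool(false)
var_dump($wordsBefore === $wordsAfter);   // bool(true)
```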

3 answers

dongyong6332 2012-02-14 01:51

Interesting question. Although I think the rope data structure still applies, it might be a little overkill, since word placement won't change. Here is my solution:

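$str = "I have | been very busy lately and need to go | to bed early";

// Count the words in each "|"-separated segment, e.g. array(2, 8, 3).
function get_breaks($str)
{
    $breaks = array();
    $arr = explode("|", $str);

    foreach($arr as $val)
    {
        $breaks[] = str_word_count($val);
    }

    return $breaks;
}

$breaks = get_breaks($str);

echo "<pre>" . print_r($breaks, true) . "</pre>";

// Drop the separators; this is the string that would go to the tagger.
$str = str_replace("|", "", $str);

// Cut a (possibly tagged) string back into segments using the stored word counts.
// str_word_count() keeps "-" inside a word, so "have-nn" still counts as one word.
function rebreak($str, $breaks)
{
    $return = array();
    $old_break = 0;

    // Format 1 returns the list of words instead of their count.
    $arr = str_word_count($str, 1);

    foreach($breaks as $break)
    {
        $return[] = implode(" ", array_slice($arr, $old_break, $break));

        $old_break += $break;
    }

    return $return;
}

// Untagged string: reproduces the original segments.
echo "<pre>" . print_r(rebreak($str, $breaks), true) . "</pre>";

// Tagged string: same segment boundaries, with the tags kept.
echo "<pre>" . print_r(rebreak("I have-nn been-vb very-vb busy lately and-rr need to-r go to bed early-p", $breaks), true) . "</pre>";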

Let me know if you have any questions, but it is pretty self-explanatory. There are definitely ways to improve this as well.
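
One possible improvement, offered as a sketch rather than a drop-in replacement: str_word_count() only treats letters (plus "'" and "-") as word characters, so a tag containing digits or other symbols would probably not survive intact. Splitting on whitespace avoids depending on which characters the tagger appends. The name rebreak_by_whitespace is hypothetical, and the code assumes the tagger never adds or removes whitespace inside a word.

```php
<?php
// Hypothetical variant of rebreak(): a "word" is any run of non-whitespace characters.
// Assumes the tagger keeps the number of whitespace-separated words unchanged.
function rebreak_by_whitespace(string $tagged, array $counts): array
{
    $words = preg_split('/\s+/', trim($tagged), -1, PREG_SPLIT_NO_EMPTY);

    // Refuse to guess if the tagger changed the number of words after all.
    if (count($words) !== array_sum($counts)) {
        throw new InvalidArgumentException('Word count changed; cannot restore the original split.');
    }

    $segments = [];
    $offset = 0;
    foreach ($counts as $count) {
        $segments[] = implode(' ', array_slice($words, $offset, $count));
        $offset += $count;
    }

    return $segments;
}

$segments = rebreak_by_whitespace(
    'I have-nn been-vb very-vb busy lately and-rr need to-r go to bed early-p',
    [2, 8, 3]
);
print_r($segments);
```

The explicit count check turns a silently wrong split into an error, which seems preferable when the tagger is a black box.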

This answer was selected by the asker as the best answer.

Question posted 2012-02-13 16:58, tagged php.