doushansu9012 2015-11-07 17:03
浏览 40

如何从Instagram标题中明智地删除所有尾随主题标签?

Many Instagram posts end with a plethora of hashtags, for example:

"This is one of the amazing Mountains you can find in the National Forest Park in #Zhangjiajie #Chinawhich is where James Cameron drew his inspiration for the flying mountains in #Avatar..

Credit: @phototravelnomads 
#pictoura #gydr 
#destinationearth #earthpix #ourlonelyplanet#wonderful_earthLife #timeoutsociety#fantastic_earthpics #liveoutdoors #igglobalclub#awesomeearth #mist_vision #earthdeluxe
# #worldbestgram #mthrworld #fantastic_earth#famouscaptures #destination_wow #dreamlifepix#wonderful_places #igworldclub #ig_global_life
#natureaddict #beautifuldestinations #traveler #guider#locals"

I'm looking to process the captions to remove the hashtag collection at the end, while leaving the rest intact. What would be a good approach to doing this? I'm sure I can figure out a brute force way, but I'm hoping to get some thoughts on an elegant solution. Doesn't have to be actual code. :)

Edit per burna's comment: The expected result would be:

"This is one of the amazing Mountains you can find in the National Forest Park in #Zhangjiajie #Chinawhich is where James Cameron drew his inspiration for the flying mountains in #Avatar..

Credit: @phototravelnomads"

Edit per Alan Moore's answer: This works quite well, but not in every situation. For instance, if the input text would be:

"This is one of the amazing Mountains you can find in the National Forest Park in #Zhangjiajie #Chinawhich is where James Cameron drew his inspiration for the flying mountains in #Avatar"

... it would be cut off from "#Zhangjiajie" on.

I'm thinking there's probably a bit more logic required, perhaps splitting the string into an array; checking if it ends in hashtags; if so then how many; if more than X (4?), cut it off from the first one in the last complete series.

  • 写回答

2条回答 默认 最新

  • dqf2015 2015-11-07 17:25
    关注

    If I understand correctly the following should work:

    $hashTag="pictoura #gydr 
    
    destinationearth #earthpix #ourlonelyplanet#wonderful_earthLife #timeoutsociety#fantastic_earthpics #liveoutdoors #igglobalclub#awesomeearth #mist_vision #earthdeluxe
    
     #worldbestgram #mthrworld #fantastic_earth#famouscaptures #destination_wow #dreamlifepix#wonderful_places #igworldclub #ig_global_life
    
    natureaddict #beautifuldestinations #traveler #guider#locals";
    
    echo preg_replace('/(#.*\s*)/','',$hashTag);
    

    That outputs:

    pictoura destinationearth natureaddict

    Good luck!!

    评论

报告相同问题?

悬赏问题

  • ¥15 2020长安杯与连接网探
  • ¥15 关于#matlab#的问题:在模糊控制器中选出线路信息,在simulink中根据线路信息生成速度时间目标曲线(初速度为20m/s,15秒后减为0的速度时间图像)我想问线路信息是什么
  • ¥15 banner广告展示设置多少时间不怎么会消耗用户价值
  • ¥16 mybatis的代理对象无法通过@Autowired装填
  • ¥15 可见光定位matlab仿真
  • ¥15 arduino 四自由度机械臂
  • ¥15 wordpress 产品图片 GIF 没法显示
  • ¥15 求三国群英传pl国战时间的修改方法
  • ¥15 matlab代码代写,需写出详细代码,代价私
  • ¥15 ROS系统搭建请教(跨境电商用途)