Today I decided to write a web bot (crawler, spider, etc.) in PHP that crawls only news websites. I started by reading articles about crawlers and then ran into this issue:
How can a bot recognize that a URL, post, article, or piece of text is news-related?
The only solution I came up with is to check the content for particular keywords, but I don't think that's a good, workable practice. At the very least, it's far from perfect.
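To make the keyword idea concrete, here is a minimal sketch of the kind of check I have in mind. The keyword list, the threshold, and the function name `looksLikeNews` are all made up for illustration; this is not a tested classifier.

```php
<?php
// Hypothetical keyword list, chosen only for illustration.
const NEWS_KEYWORDS = ['breaking', 'news', 'reported', 'headline', 'journalist', 'correspondent'];

/**
 * Naive check: counts how many distinct keywords occur in the text and
 * treats the text as "news" once that count reaches $threshold.
 * The default threshold of 2 is an arbitrary assumption.
 */
function looksLikeNews(string $text, int $threshold = 2): bool
{
    // Lowercase the text and split on anything that is not an ASCII letter or digit.
    $words = preg_split('/[^a-z0-9]+/', strtolower($text), -1, PREG_SPLIT_NO_EMPTY);

    // Keep only the keywords that actually appear among the words.
    $hits = array_intersect(NEWS_KEYWORDS, $words);

    return count($hits) >= $threshold;
}
```

A call like `looksLikeNews('Breaking news: storm hits the coast')` matches two keywords and passes, but so does `looksLikeNews('Good news: I am breaking in my new shoes')`, which is exactly the kind of false positive that makes me doubt this approach.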
So any ideas for better solutions would be appreciated.