I have the following data
<description><div dir="ltr" style="text-align: left;" trbidi="on"><div class="MsoNormal"><i><span style="font-family: Georgia, Times New Roman, serif; font-size: xx-small;">By Marina Correa</span></i></div><div class="MsoNormal"><i><span style="font-family: Georgia, Times New Roman, serif; font-size: xx-small;">Photography: Courtesy the architect</span><span style="font-family: Georgia, serif; font-size: 9pt;"><o:p></o:p></span></i></div><div class="MsoNormal"><br></div><table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"><tbody><tr><td style="text-align: center;"><a href="http://3.bp.blogspot.com/-D1JRy4epwOM/UooCcR-U7lI/AAAAAAAALyM/tDr2ezxnb-I/s1600/Prost_Beer_+House_AH_Design_Indiaartndesign.jpg" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img alt="Prost Beer House in Bengaluru, India,by AH design." border="0" src="http://3.bp.blogspot.com/-D1JRy4epwOM/UooCcR-U7lI/AAAAAAAALyM/tDr2ezxnb-I/s1600/Prost_Beer_+House_AH_Design_Indiaartndesign.jpg" title=""></a></td></tr><tr><td class="tr-caption" style="text-align: right;"><span style="font-family: Arial, Helvetica, sans-serif; font-size: xx-small;">.</span></td></tr></tbody></table><div class="MsoNormal"><br></div><div class="MsoNormal"></div><div style="text-align: justify;"><span style="font-family: Georgia, &#39;Times New Roman&#39;, serif;">Evolving from carnage of shipwrecked metal, the interiors of Prost Beer House in Bengaluru, India, make it an attention-grabbing drinking hole…</span></div></div><a href="http://inditerrain.indiaartndesign.com/2013/11/beerhouse-rock.html#more">Read more »</a><img src="http://feeds.feedburner.com/~r/IndiaArtNDesign/~4/jGC75D3KB0o" height="1" width="1"/></description>
however instead of "<" i have "& lt;" and instead of ">" i have "& gt;"
i need a regular expression to find the data not inside the html tags ie the actual text and not the names of the tags, class name etc...
for parsing the html with "<" and ">" i found this: (?<=^|>)[^><]+?(?=<|$)
although i dont know how to convert it to suit what i need. help is much appreciated