I'm trying to preserve the formatting
in table cells when I extract the contents of a <td>
cell.
What happens is if there are two lines of text (for e.g, an address) in the , the code may look like:
<td>
address line1<br>
1 address line2</td>
When colly extracts this, I get the following: address line1address line2
with no spacing or line breaks since all the html has been stripped from the text.
How can I work around / fix this so I receive readable text from the <td>