So we've got a text file that we're trying to upload to a database. We've managed to do that. However, the text file comes from Microsoft Office so it's got a load of horrible tags in. We convert the file by using mb_convert_encoding($content, "UTF-8", "UTF-16");
This allows us to strip the nasty tags etc.
However, even after conversion we are still getting characters like ⁂
, instead of "B. and †
instead of closing smart quotes (from office).
Is there any conversion method people have found that works? Please note I have also tried iconv and utf8_convert.
I've also had a search through other posts and I am yet to find the solution.
Thanks