Opened 8 years ago

Closed 8 years ago

Last modified 8 years ago

#125 closed defect (fixed)

HTML entities preventing proper translation

Reported by: Artem Russakovskii Owned by: Ofer Wald
Priority: major Milestone: 0.6 - Current major
Component: Parser Version: 0.6.5
Keywords: Cc: archon810@…

Description

This plugin is exactly what I'd been looking for. It's great in many ways, but I found this bug that makes translation subpar in certain cases.

I'm using Windows Live Writers, which, for example, likes to insert single quotes as ’ instead of '. Apparently, the translating backend doesn't account for that, so it treats cases like "weren’t" as 2 separate words, resulting in translations like this (into .ru): "Костюмы weren’T давая".

I think a simple fix would be to convert some of these "smart" characters (god, I hate those) before sending them to the translating engine, should be pretty easy to do - we just need a list of possible characters that look like other characters.

Thank you.

Change History (4)

comment:1 Changed 8 years ago by Artem Russakovskii

Example of various texts that exhibit the problem can be found at http://www.androidpolice.com or http://beerpla.net

comment:2 Changed 8 years ago by Artem Russakovskii

Cc: archon810@… added

comment:3 Changed 8 years ago by Ofer Wald

Milestone: Beta wordpress plugin (0.5)0.6
Resolution: fixed
Status: newclosed

Fixed by [559]

comment:4 Changed 8 years ago by Artem Russakovskii

Confirmed as working. Thank you :)

Note: See TracTickets for help on using tickets.