Opened 7 years ago

Closed 7 years ago

Last modified 7 years ago

#125 closed defect (fixed)

HTML entities preventing proper translation

Reported by: archon810 Owned by: ofer
Priority: major Milestone: 0.6 - Current major
Component: Parser Version: 0.6.5
Keywords: Cc: archon810@…

Description

This plugin is exactly what I'd been looking for. It's great in many ways, but I found this bug that makes translation subpar in certain cases.

I'm using Windows Live Writer, which, for example, likes to insert single quotes as ’ instead of '. Apparently, the translating backend doesn't account for that, so it treats cases like "weren’t" as two separate words, resulting in translations like this (into .ru): "Костюмы weren’T давая".

I think a simple fix would be to convert these "smart" characters (god, I hate those) to their plain equivalents before sending them to the translation engine. It should be pretty easy to do: we just need a list of characters that look like other characters.

Thank you.
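
The conversion suggested above could be sketched roughly as follows. This is only an illustration in Python, not the plugin's actual code or the fix that was committed: the function name `normalize_smart_chars` and the mapping `SMART_TO_PLAIN` are hypothetical, and the list of characters is an assumed starting point rather than a complete one.

```python
import html

# Hypothetical lookup table (an assumption, not the plugin's real list):
# typographic characters mapped to the plain characters they resemble.
SMART_TO_PLAIN = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark / curly apostrophe
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "-",    # em dash
    "\u2026": "...",  # horizontal ellipsis
}
_TABLE = str.maketrans(SMART_TO_PLAIN)


def normalize_smart_chars(text: str) -> str:
    """Return text with entities decoded and smart characters made plain."""
    # Decode entities such as &#8217; or &rsquo; into real characters first.
    # Note: this decodes every entity (including &lt; and &amp;), so it is
    # meant for plain text segments, not for raw HTML markup.
    decoded = html.unescape(text)
    # Then replace each smart character with its plain equivalent.
    return decoded.translate(_TABLE)


if __name__ == "__main__":
    print(normalize_smart_chars("weren&#8217;t"))
```

With this in place, a word like "weren’t" would reach the translation engine as a single token with a plain apostrophe, whether the curly quote arrives as a literal character or as an HTML entity.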

Change History (4)

comment:1 Changed 7 years ago by archon810

Examples of texts that exhibit the problem can be found at http://www.androidpolice.com or http://beerpla.net

comment:2 Changed 7 years ago by archon810

  • Cc archon810@… added

comment:3 Changed 7 years ago by ofer

  • Milestone changed from Beta wordpress plugin (0.5) to 0.6
  • Resolution set to fixed
  • Status changed from new to closed

Fixed by [559]

comment:4 Changed 7 years ago by archon810

Confirmed as working. Thank you :)
