Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Indexer iconv problem string conversion
07-27-2012, 01:29 PM,
#1
Question  Indexer iconv problem string conversion
Hello out there,

Lucene Indexer (for full text search) gives to me the following Notice on serveral uploaded documents:

PHP Notice: iconv(): Detected an illegal character in input string in ..../letodms/Zend/Search/Lucene/Analysis/Analyzer/Common/Text.php on line 58

According to that, the word list (viewed by fulltext index info form Admin-Menu) shows, that sometimes Transliteration (iconv ASCII//TRANSLIT) of german "Umlaute" e.g.

ä -> ae , ö -> oe, ü -> ue, ß -> ss

works "as designed", but sometimes it fails, truncating the Words in the index Wordlist, e.g "bersetzen" instead of "uebersetzen" as would be expected be, when the German "ü" correctly transliterated to "ue".

How to fix this behaviour ?

VG
Alex
Reply
07-28-2012, 12:13 AM,
#2
RE: Indexer iconv problem string conversion
(07-27-2012, 01:29 PM)AlSchedl Wrote: Hello out there,

Lucene Indexer (for full text search) gives to me the following Notice on serveral uploaded documents:

PHP Notice: iconv(): Detected an illegal character in input string in ..../letodms/Zend/Search/Lucene/Analysis/Analyzer/Common/Text.php on line 58

According to that, the word list (viewed by fulltext index info form Admin-Menu) shows, that sometimes Transliteration (iconv ASCII//TRANSLIT) of german "Umlaute" e.g.

ä -> ae , ö -> oe, ü -> ue, ß -> ss

works "as designed", but sometimes it fails, truncating the Words in the index Wordlist, e.g "bersetzen" instead of "uebersetzen" as would be expected be, when the German "ü" correctly transliterated to "ue".

How to fix this behaviour ?

Can you provide an example document to reproduce the error.

Uwe
Reply


Forum Jump:


Users browsing this thread: 1 Guest(s)

Contact Us | LetoDMS Community Forum | Return to Top | | Lite (Archive) Mode | RSS Syndication