Tags:
create new tag
, view all tags

Question

Hi, It seems that SearchEnginePluceneAddOn doesn't like special characters (with accents etc):

Can't add out-of-order term geÃxtrapoleerde lt
+geÃ\203Ã\203Ã\202Ã\203Ã\203Ã\202Ã\202Ã\203Ã\203Ã\203Ã\202Ã\202Ã\203Ã\202Ã\202Ã\203Ã\203Ã\203Ã\202Ã\203Ã\203Ã\202Ã\202Ã\202Ã
+\203Ã\203Ã\202Ã\202Ã\203Ã\202Ã\202ërodeerd (text lt text) at /usr/lib/perl5/site_perl/5.8.5/Plucene/Index/SegmentMerger.pm
+line 154
Can't add out-of-order term oriÃ\203Ã\203Ã\202Ã\203Ã lt
+oriÃ\203Ã\203Ã\202Ã\203Ã\203Ã\202Ã\202Ã\203Ã\203Ã\203Ã\202Ã\202Ã\203Ã\202Ã\202Ã\203Ã\203Ã\203Ã\202Ã\203Ã\203Ã\202Ã\202Ã\202Ã
+\203Ã\203Ã\202Ã\202Ã\203Ã\202Ã\202ënterend (text lt text) at /usr/lib/perl5/site_perl/5.8.5/Plucene/Index/SegmentMerger.pm
+line 154

I'm pretty new to this; could someone point me into the right direction for where to look / what to install?

Thanks!

Environment

TWiki version: TWikiRelease02Sep2004
TWiki plugins: DefaultPlugin, EmptyPlugin, InterwikiPlugin
Server OS: linux RH 9 Enterprise
Web server: Apache 1.3.33
Perl version: 5.8.5
Client OS: Win2k
Web Browser: IE6
Categories: Installation, Search, Internationalisation, Plugins

-- JosMaccabiani - 26 Aug 2005

Answer

ALERT! If you answer a question - or have a question you asked answered by someone - please remember to edit the page and set the status to answered. The status is in a drop-down list below the edit box.

I haven't used Plucene, but at first sight this looks like something is getting encoded to UTF-8 when it shouldn't be. InternationalisationEnhancements has some possibly useful links in its Unicode section, but some TWikiDebugging would be necessary.

-- RichardDonkin - 17 Sep 2005

I have no problems indexing and searching spanish/catalan topics/attachments (which have accents and other chars à é ï ...) with a Linux box (locale settings to en_US.ISO-8859-1) running both major TWiki versions (latest Cairo/Dakar)

The only pending issue is about special characters in comment field of attachments, which are not displayed properly. Plucene documentation is not clear enough about fields encoding ...

-- JoanMVigo - 14 Mar 2006

 
Topic revision: r4 - 2006-03-14 - JoanMVigo
 
Twitter Delicious Facebook Digg Google Bookmarks E-mail LinkedIn Reddit StumbleUpon    
  • Download TWiki
TWiki logo Powered by Perl Hosted by OICcam.com Ideas, requests, problems regarding TWiki? Send feedback. Ask community in the support forum.
Copyright © 1999-2012 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.