Ticket #2 (assigned defect)

Opened 3 years ago

Last modified 16 months ago

Search Engine has an issue with international characters

Reported by: MRAMBERG@… Owned by: klur
Priority: normal Milestone:
Component: openguides Version:
Severity: normal Keywords: hackfestsummer2007-reviewed
Cc:

Description

Sat May 22 17:01:40 2004 MRAMBERG - Ticket created [Reply] [Comment]

Subject: Search Engine has an issue with international characters

Namely, it doesn't support them, so searches for things like grünerløkka and grønland are moot.

Download (untitled) 95b

Sat May 22 17:14:21 2004 IVORW - Correspondence added [Reply] [Comment]

From: IVORW

[MRAMBERG - Sat May 22 17:01:40 2004]:

Namely, it doesn't support them, so searches for things like grünerløkka and grønland are moot.

I will look at expanding the allowed character set for search strings to take into account internationalization. We need to see if there are issues with Search::InvertedIndex?, and make sure that it tokenizes in an i18n way.

Ivor.

Download (untitled) 379b

Mon Dec 13 19:19:56 2004 DOM - Severity Normal added Sun Jan 9 10:02:22 2005 DOM - Comments added [Reply] [Comment]

[IVORW - Sat May 22 17:14:21 2004]:

[MRAMBERG - Sat May 22 17:01:40 2004]:

Namely, it doesn't support them, so searches for things like grünerløkka and grønland are moot.

I will look at expanding the allowed character set for search strings to take into account internationalization. We need to see if there are issues with Search::InvertedIndex?, and make sure that it tokenizes in an i18n way.

Work on this should probably be concentrated on Plucene - see other utf8 bugs in OpenGuides though.

Change History

Changed 3 years ago by dom

  • component set to openguides

Changed 16 months ago by dom

  • owner set to Nobody

Changed 16 months ago by klur

  • owner changed from Nobody to klur
  • status changed from new to assigned

Changed 16 months ago by Kake

  • keywords hackfestsummer2007-reviewed added

Changed 16 months ago by klur

Doing the search with accented characters: hotel/hôtel the keyword hôtel doesn't even appear in the search title, where hotel does. Otherwise accented characters are displayed properly using sqlite.

Note: See TracTickets for help on using tickets.