Chinese zoom search indexes entire sentences - performance?

Chris Middleton
Posts: 8
Joined: Wed May 16, 2018 7:25 pm


Unread post by Chris Middleton »

Zoom Search for a Chinese-language WebHelp creates an index (zoom_index.js) of entire sentences. Chinese does not use spaces between words. I do not know Chinese, but I imagine that splitting the text into words might be an unsolvable problem for a computer. Is that right?
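
To illustrate the effect described above, here is a minimal Python sketch of a naive tokenizer that splits only on whitespace and punctuation. It is not Zoom's actual indexing code; the function name `naive_tokens` and the sample sentences are assumptions made up for this example.

```python
import re

def naive_tokens(text: str) -> list[str]:
    """Split on whitespace and common ASCII/CJK punctuation.

    Hypothetical helper for illustration only, not Zoom's tokenizer.
    """
    return [t for t in re.split(r"[\s,.!?，。！？、]+", text) if t]

english = "Click the button to save the file."
chinese = "单击按钮以保存文件。"  # roughly the same sentence in Chinese

# English falls apart into individual words.
print(naive_tokens(english))
# Chinese has no spaces, so the whole sentence stays one "word".
print(naive_tokens(chinese))
```

With a tokenizer like this, almost every Chinese sentence becomes its own unique index entry, which is consistent with the index containing entire sentences.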

So my question is:
Once I get my 25,000-word help translated into Chinese, will the index file be huge, and will search response times be awful?
Tim Green
Site Admin
Posts: 23181
Joined: Mon Jun 24, 2002 9:11 am
Location: Bruehl, Germany


Unread post by Tim Green »

Hi Chris,

Unfortunately, Chinese is not search-index friendly. This is from the documentation of Wrensoft Zoom, which is the indexing engine bundled in Help+Manual:
  • There are several things you can do to make Zoom produce faster search results on your website. First of all, the most important factor is the size of your index file. ... This is especially true if your website is translated to Unicode languages such as Japanese, Chinese, Korean, and others. These webpages can not be successfully indexed and will only result in a larger, slower, and less effective index file.
The point there being that because of the factors you mention yourself, a Chinese index will by definition be large and cumbersome. That comes with the territory. If you buy a Great Dane puppy you're going to have a big dog... 8)

The only thing you can do is to maximize efficiency on those factors where it is possible:
  • Define the PHP or ASP platform in your skin (NOT in the project; those settings are ignored if you're using a skin!). Both are much more efficient and performant than the JavaScript option.
  • If possible, increase the minimum length for skipped words, so that more short words are left out of the index. The shorter the words you allow in, the larger your index will get.
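
As a rough illustration of the point about minimum word length, the following Python sketch builds a toy inverted index with a length threshold. This is not how Zoom is implemented; `build_index`, the `min_len` parameter, and the page names are hypothetical and exist only for this example.

```python
from collections import defaultdict

def build_index(pages: dict[str, str], min_len: int) -> dict[str, set[str]]:
    """Map each word of at least min_len characters to the pages containing it.

    Toy inverted index for illustration, not Zoom's index format.
    """
    index: dict[str, set[str]] = defaultdict(set)
    for name, text in pages.items():
        for word in text.lower().split():
            word = word.strip(".,!?")
            if len(word) >= min_len:
                index[word].add(name)
    return dict(index)

# Hypothetical sample pages.
pages = {
    "intro.htm": "Click OK to save the file.",
    "print.htm": "Go to the File menu to print it.",
}

all_words = build_index(pages, min_len=1)   # every word is indexed
long_words = build_index(pages, min_len=4)  # words under 4 characters are skipped

print(len(all_words), len(long_words))
```

Raising the threshold drops short, frequent words from the index, so it has fewer entries. The trade-off is that users can no longer search for those short words.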
Regards,
Tim (EC Software Documentation & User Support)

Chris Middleton
Posts: 8
Joined: Wed May 16, 2018 7:25 pm


Unread post by Chris Middleton »

Thanks for the info. This subject is closed.

And thanks, Tim, for your prompt, clear, and complete replies. You provide some of the best support I have experienced.

Chris