From:	 Kir Kolyshkin <kir@asplinux.ru>
To:	 lwn@lwn.net
Subject: ASPSeek progress (for "Development" section @lwn)
Date:	 Tue, 25 Sep 2001 13:43:58 +0400

We have recently released ASPSeek v.1.2.5. Among
other improvements, it introduces a new storage mode,
UTF-8 (a Unicode encoding that uses only 1 byte for
ASCII characters). A full set of manual pages is now
provided (they are also available online at
http://www.aspseek.org/manual.html). We have also
fixed numerous bugs, so this release should be even
more stable for users.
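
As a quick illustration of the UTF-8 property mentioned
above, here is a minimal Python sketch. It is not ASPSeek
code; the helper name utf8_length and the sample
characters are made up for this example. It only shows
that ASCII characters encode to a single byte, while
other characters need two or more.

```python
def utf8_length(ch: str) -> int:
    """Return how many bytes one character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


# Hypothetical sample characters, written as escapes so the
# source file itself stays pure ASCII.
samples = [
    "A",       # U+0041, ASCII letter
    "\u00e9",  # U+00E9, Latin small letter e with acute
    "\u042f",  # U+042F, Cyrillic capital letter Ya
    "\u4e2d",  # U+4E2D, a CJK ideograph
]

for ch in samples:
    # Print the code point rather than the character itself to
    # avoid depending on the terminal's encoding.
    print(f"U+{ord(ch):04X} takes {utf8_length(ch)} byte(s) in UTF-8")
```

Text that is mostly ASCII therefore takes about as much
space in UTF-8 as in a single-byte encoding.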

For those who are not familiar with ASPSeek, it is
search engine software, much like Google, but for
smaller volumes (up to several million URLs). It has
all the "bells and whistles" you can expect from a
decent search engine: phrase and boolean search,
word patterns, and the ability to limit a search to a
particular site, set of sites, or subsection of a site,
or to a particular part of HTML documents. Search
results can be sorted by relevance (advanced ranking
algorithms are used) or by date. ASPSeek can work with
multiple languages/encodings at once (including
multibyte encodings such as those used for Chinese).
Other features include stopwords and ispell support, a
charset and language guesser, HTML templates for
search results, excerpts, and query word highlighting.

ASPSeek is written in C++ using the STL. It
uses MySQL for data storage, but data that affects
search speed is stored in binary files. And yes,
ASPSeek is free software; it is available under the
GNU GPL.

Future plans for ASPSeek include further optimization
of internal data structures and algorithms for even
better performance, as well as developing some nifty
features.

Regards,
  Kir Kolyshkin, ASPSeek developer.

-- 
kir@asplinux.ru  ICQ 7551596  Phone +7 903 6722750
Reality always seems harsher in the early morning.
--