Search Tools, Indexing Tools
Site Search Tools :
Programs for implementing search functionality on the website. Most tools have built in indexing function while some are exclusively search centered. Features may include boolean searching, string based, date based searches and search rankings.
DataparkSearch Engine
DataparkSearch Engine is a full-featured open sources web-based search engine released under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system. DataparkSearch consists of two parts. The first part is indexing mechanism (indexer). Indexer walks over html hypertext references and stores found words and new references into database. The second part is web CGI front-end to provide search using data collected by indexer.
Key features
Support for http, https, ftp, nntp and news URL schemes.
htdb virtual URL scheme support for indexing SQL databases.
text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif mime types built-in support.
External parsers support for other document types.
Ability to index multilingual sites using content negotiation.
Searching all of the word forms using ispell affixes and dictionaries.
Fuzzy searching based on acronyms and abbreviations.
Stop-words, synonyms and acronyms lists.
Boolean query language support.
Neural network based Popularity Rank.
Results sorting by relevance, popularity rank, last modified time and by importance (a multiplication of relevance and popularity rank).
Various character sets support.
Accent insensitive search.
Phrases segmenting for Chinese, Japanese, Korean and Thai languages.
mod_dpsearch - search module for Apache web server.
Internationalized Domain Names support.
The Summary Extraction Algorithm.
Loading .....