-
A scholarly paper by Loren G. Terveen, William C. Hill, Brian Amento, David McDonald, and Josh Creter.
-
Cheshire II is a "Next-Generation Online Catalog and Full-Text Information Retrieval System." It features advanced IR techniques, including support for Boolean and probabilistic 'best match' ranked searching, SGML/XML as the primary data base format, and a client/server architecture that uses the Z39.50 Information Retrieval Protocol.
-
Full-text indexing of desktop documents for researchers, journalists, and historians with low indexing overhead 13 percent beyond document space. Also displays the most important words from each document. [Windows 95/98]
-
Search engine vendor of BRS/Search, a text based core product, and web enabled products.
-
Search software in 100% Java (J2EE) with parametric, natural language and full-text search capabilities.
-
Bankruptcy software in WordPerfect and MS-Word for legal professionals. Menu-driven data input and automatic form compilation in official forms typeset format for Chapters 7, 9, 11, 12, 13. Electronic filing (ECF) compatible.
-
Develops enterprise software that intelligently processes text-based information using automated information indexing and tagging.
-
Suite of search software products that finds information in multiple file formats and languages. Features product descriptions, evaluation version download, company profile and contact information.
-
Software for indexing and searching text documents, using full text and field based search, relevance ranked results, Boolean queries, and heterogeneous databases. Support for document types such as HTML, SGML, mail folders, and USMARC.
-
Developer of the KE Texpress database system, KE Texhtml WWW module, KE EMu (electronic museum management) and LifeData (vital statistics management).
-
Jakarta Lucene is a full-featured text search engine written entirely in Java, and it is an open source project available for free download from Apache Jakarta. The current goals of the project are primarily to provide application and also a platform for research.
-
Information and select sections of a book about indexing and compression techniques for documents and images. Also provides information about open source IR system released with the book.
-
Develops software and solutions for data mining, text analysis, and knowledge discovery.
-
Non-numerical information storage and retrieval software developed to allow institutions, especially in developing countries, to streamline their information processing activities.
-
Toolkit (SDK) for adding full-text indexing and searching capabilities to applications. Ported to a wide range of platforms and highly scalable. Designed for use in both large and small scale systems. Free evaluation download.
-
Supplier of information retrieval and collaborative software.
-
Provides document scanning, optical character recognition and full-text searching.
-
SWISH-Enhanced is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other text files.
-
SimpleScan Software, Inc - providing powerful, cost effective, enterprise wide document management software solutions.
-
Combine is an open system for harvesting and threshing (indexing) Internet resources.
-
Thunderstone has a number of full text search related products including their flagship text/relational database, Texis.
-
The tools they use at their site for sale. Demo version available for download.
-
Create and maintain a search engine using a perl script and database management tool.
-
Finds information in a related web of pages. Collects and indexes pages based on traversal of links or subdirectories. Create a context-sensitive search by category by linking to relevant pages.
-
Combined Computer Resources, Inc. (CCR), a software developer and integrator, specializes in customizing and integrating document imaging, COLD report management and workflow software products.
-
Zebra is a fulltext and free-text indexing and retrieval system that conforms to ANSI standard Z39.50. It is very good for indexing and searching highly structured data such as MARC records, and GILS records. The Zebra server is freely available for noncommercial applications.
-
Searches all popular file types, with features including hit highlighting, natural language, fuzzy, phonic, boolean, proximity, field, numeric range.
-
A complete indexing and searching system for a small domain or intranet. Source code provided under the GPL.
-
SGML/XML-savy structured fulltext engine and multi-protocol information management system. (BSn)