About

Unlike other information retrieval systems, Terrier learns from empirical data and adapts to the user’s information needs and queries. The framework offers a modular API for querying, supporting uses as diverse as experiments on standard test collections and the rapid deployment of Web, intranet and desktop search engines.

Terrier implements state-of-the-art indexing and retrieval functionalities, such as DFR and BM25F, as well as term dependence proximity models such as Markov Random Fields, and provides an ideal platform for the rapid development and evaluation of large-scale retrieval applications. Terrier offers outstanding performance relative to other current public technologies that aim to provide similar retrieval facilities, and is readily deployable on large-scale document collections. Search results can be viewed in a handy desktop search application, or online from a JSP web interface.

Key Benefits

  • High performance and scalable.
  • Capable of handling full-text indexing of large-scale document collections.
  • Provides multiple indexing strategies, such as multi-pass, single-pass and large-scale MapReduce indexing.
  • Implemented in cross-platform Java to ease portability between operating systems.
  • Multilingual: uses UTF internally and can support corpora written in languages other than English.
  • Supports HTML, PDF, Microsoft Office and source code files, and is easily extensible for new document types.

Applications

  • Building Web, intranet and desktop search engines
  • Performing information retrieval experiments
  • Full-text searching
  • Multilingual information retrieval
  • Query performance prediction
  • Adaptive retrieval approaches
  • XML searching
