Table of Contents
1 Data import
1.1 Database
2 Indexing & Query
2.1 Analyzers
2.1.1 Tokenizers
2.1.1.1 KeywordTokenizer
Treats the entire text field as a token.
2.1.2 Filters
2.2 Query methods
2.2.1 eDisMax
3 Schema
3.1 Copying fields
copyField
makes it so that all the data coming into the source
will be
copied over to dest
, before analysis happens. This allows for us to index
and query a field in multiple ways, e.g.:
<field name="name" type="text_general" indexed="true" stored="true" /> <field name="name_exact" type="string" indexed="true" stored="false" /> <copyField source="name" dest="name_exact" />
Here the document only stores the field name
but we can also query
using the field name_exact
, which will only match the entire names.
IMPORTANT: If dest
either takes multiple values from source
, or is
copied to from many sources, remember to set multiValued
to "true"
on
the dest
field.
3.2 Words
- document
- basic unit of information; consists of fields
- doc-values
- way of recording field values that is more efficient for some purposes, e.g. faceting and sorting
- faceting
- the arrangement of search results into categories based on indexed terms. Searchers are presented with the indexed terms, along with numerical counts of how many matching documents were found were each term.
- (no term)