Design decisions

#1: What to store

  • The index needs to store at least the words contained in each record (separated by punctuation characters).

  • Can also store frequency, position, and other meta-data.

#2: When to Update

  • Maintain auxiliary data structures to "stage" updates and then update the index in batches.

Last updated