using whoosh to search a pile of markdown
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Charles Reid 99d1206ed1 split alpha, update readme 7 years ago
fake-docs@d31168e813 update fake-docs 7 years ago
.gitignore use glob to add document titles/content 7 years ago
.gitmodules add fake-docs submodule 7 years ago
LICENSE 2 legit 2 init commit (license and readme) 7 years ago
Readme.md split alpha, update readme 7 years ago
alpha_create.py split alpha, update readme 7 years ago
alpha_search.py split alpha, update readme 7 years ago
quickstart_notes.md split off quick start notes from readme 7 years ago
try_whoosh.py try whoosh 7 years ago

Readme.md

whoosh search

using python whoosh to create a search engine for a pile of markdown documents.

quickstart notes

See quickstart_notes.md

fake-docs

We have the charlesreid1/fake-docs repo as a submodule of this repo.

alpha: simple markdown index

alpha: phase 1: build a simple search engine index for a pile of markdown files.

important notes:

  • there are many ways to implement a search query
  • when you create your schema you truly define everthing about the documents - including what is stored.

metadata in search:

  • to access metadata in a search, it must be STORED. example: when you do the search, if you want to have highlighting, every record should store its own full text.

  • alternatively, if the data can be obtained some other way on-the-fly, that would save memory/space.