||4 years ago|
|fake-docs@d31168e813||4 years ago|
|.gitignore||4 years ago|
|.gitmodules||4 years ago|
|LICENSE||4 years ago|
|Readme.md||4 years ago|
|alpha_create.py||4 years ago|
|alpha_search.py||4 years ago|
|quickstart_notes.md||4 years ago|
|try_whoosh.py||4 years ago|
using python whoosh to create a search engine for a pile of markdown documents.
We have the charlesreid1/fake-docs repo as a submodule of this repo.
alpha: simple markdown index
alpha: phase 1: build a simple search engine index for a pile of markdown files.
- there are many ways to implement a search query
- when you create your schema you truly define everthing about the documents - including what is stored.
metadata in search:
to access metadata in a search, it must be STORED. example: when you do the search, if you want to have highlighting, every record should store its own full text.
alternatively, if the data can be obtained some other way on-the-fly, that would save memory/space.