fix the way repo name label is handled

update how we store totals
remove print statement
2018-08-01 00:25:29 -07:00 · 2018-07-31 23:58:19 -07:00 · 2018-07-31 23:17:16 -07:00 · 2018-07-31 23:16:45 -07:00 · 2018-07-31 23:16:23 -07:00 · 2018-07-31 23:12:57 -07:00
21 changed files with 454 additions and 310 deletions
--- a/.gitmodules
+++ b/.gitmodules
@@ -0,0 +1,3 @@
 [submodule "mkdocs-material"]
 	path = mkdocs-material
 	url = https://git.charlesreid1.com/charlesreid1/mkdocs-material.git
--- a/Readme.md
+++ b/Readme.md
@@ -1,4 +1,4 @@
-# centillion
+# The Centillion
 **the centillion**: a pan-github-markdown-issues-google-docs search engine.
@@ -6,62 +6,40 @@
 the centillion is 3.03 log-times better than the googol.
 ![Screen shot of centillion](img/ss.png)
 ## what is it
-The centillion is a search engine built using [whoosh](#),
+The centillion is a search engine built using [whoosh](https://whoosh.readthedocs.io/en/latest/intro.html),
 a Python library for building search engines.
 We define the types of documents the centillion should index,
-and how, using what fields. The centillion then builds and
+what info and how. The centillion then builds and
-updates a search index.
+updates a search index. That's all done in `centillion_search.py`.
 The centillion also provides a simple web frontend for running
-queries against the search index.
+queries against the search index. That's done using a Flask server
 defined in `centillion.py`.
 The centillion keeps it simple.
-## work that is done
+## quickstart
-See [Workdone.md](Workdone.md)
+Run the centillion app with a github access token API key set via
 environment variable:
 ```
 GITHUB_TOKEN="XXXXXXXX" python centillion.py
 ```
 This will start a Flask server, and you can view the minimal search engine
 interface in your browser at <http://localhost:5000>.
 ## more info
 For more info see the documentation: <https://charlesreid1.github.io/centillion>
 ## work that is being done
 See [Workinprogress.md](Workinprogress.md) for details about
 route and function layout. Summary below.
 ### code organization
 centillion app routes:
 - home
    - if not logged in, landing page
    - if logged in, redirect to search
 - search
 - main_index_update
    - update main index, all docs period
 centillion Search functions:
 - open_index creates the schema
 - add_issue, add_md, add_document have three diff method sigs and add diff types
  of documents to the search index
 - update_all_issues or update_all_md or update_all_documents iterates over items
  and determines whether each item needs to be updated in the search index
 - update_main_index - update the entire search index
    - calls all three update_all methods
 - create_search_results - package things up for jinja
 - search - run the query, pass results to the jinja-packager
 ## work that is planned
 See [Workplanned.md](Workplanned.md)
--- a/Todo.md
+++ b/Todo.md
@@ -0,0 +1,7 @@
 # todo
 current problems:
 - some github issues have no title
 - github issues are just being re-indexed over and over
 - documents not showing up in results
--- a/Workinprogress.md
+++ b/Workinprogress.md
@@ -1,106 +0,0 @@
 # Components
 The components of centillion are as follows:
 - Flask application, which creates a Search object and uses it to search index
 - Search object, which allows you to create/update/search an index
 ## Routes layout
 Current application routes are as follows:
 - home -> search
 - search
 - update_index
 Ideal application routes (using github flask dance oauth):
 - home
    - if not logged in, landing page
    - if logged in, redirect to search
 - search
 - main_index_update
    - update main index, all docs period
 - delta_index_update
    - updates delta index, docs that have changed since last main index
 There should be one route to update the main index
 There should be another route to update the delta index
 These should go off and call the update index methods
 for each respective type of document/collection.
 For example, if I call `main_index_update` route it should
 - call `main_index_update` for all github issues
 - call `main_index_update` for folder of markdown docs
 - call `main_index_update` for google drive folder
 These are all members of the Search class
 ## Functions layout
 Functions of the entire search app:
 - create a search index
 - load a search index
 - call the search() method on the index
 - update the search index
 The first and last, creating and updating the search index,
 are of greatest interest.
 The Schema affects everything so it is hard to separate
 functionality into a main Search class shared by many.
 (Avoid inheritance/classes if possible.)
 current Search:
 - open_index creates the schema
 - add_issue or add_document adds an item to the index
 - add_all_issues or add_all_documents iterates over items and adds them to index
 - update_index_incremental - update the search index
 - create_search_results - package things up for jinja
 - search - run the query, pass results to the jinja-packager
 centillion Search:
 - open_index creates the schema
 - add_issue, add_md, add_document have three diff method sigs and add diff types
  of documents to the search index
 - update_all_issues or update_all_md or update_all_documents iterates over items
  and determines whether each item needs to be updated in the search index
 - update_main_index - update the entire search index
    - calls all three update_all methods
 - create_search_results - package things up for jinja
 - search - run the query, pass results to the jinja-packager
 Nice to have but focus on it later:
 - update_diff_issues or update_diff_md or update_diff_documents iterates over items
  and indexes recently-added items
 - update_diff_index - update the diff search index (what's been added since last
  time)
    - calls all three update_diff methods
 ## Files layout
 Schema definition:
 * include a "kind" or "class" to group objects
 * can provide different searches of different collections
 * eventually can provide user with checkboxes
--- a/centillion.py
+++ b/centillion.py
@@ -38,7 +38,7 @@ class UpdateIndexTask(object):
        from get_centillion_config import get_centillion_config
        config = get_centillion_config('config_centillion.json')
-        gh_token = os.environ['GITHUB_ACESS_TOKEN']
+        gh_token = os.environ['GITHUB_TOKEN']
        search.update_index_issues(gh_token, config)
        search.update_index_gdocs(config)
@@ -76,25 +76,26 @@ def search():
        parsed_query, result = search.search(query.split(), fields=[fields])
        store_search(query, fields)
-    total = search.get_document_total_count()
+    totals = search.get_document_total_count()
-    return render_template('search.html', entries=result, query=query, parsed_query=parsed_query, fields=fields, last_searches=get_last_searches(), total=total)
+    return render_template('search.html', 
-
+                           entries=result, 
-@app.route('/open')
+                           query=query, 
-def open_file():
+                           parsed_query=parsed_query, 
-    path = request.args['path']
+                           fields=fields, 
-    fields = request.args.get('fields')
+                           last_searches=get_last_searches(), 
-    query = request.args['query']
+                           totals=totals)
    call([app.config["EDIT_COMMAND"], path])
    return redirect(url_for("search", query=query, fields=fields))
@app.route('/update_index')
 def update_index():
    rebuild = request.args.get('rebuild')
    UpdateIndexTask(diff_index=False)
    flash("Rebuilding index, check console output")
-    return render_template("search.html", query="", fields="", last_searches=get_last_searches())
+    return render_template("search.html", 
                           query="", 
                           fields="", 
                           last_searches=get_last_searches(),
                           totals={})
 ##############
--- a/centillion_search.py
+++ b/centillion_search.py
@@ -14,6 +14,8 @@ import tempfile, subprocess
 import pypandoc
 import os.path
 import codecs
 from datetime import datetime
 from whoosh.qparser import MultifieldParser, QueryParser
 from whoosh.analysis import StemmingAnalyzer
@@ -57,6 +59,10 @@ Schema:
 """
 def clean_timestamp(dt):
    return dt.replace(microsecond=0).isoformat()
 class SearchResult:
    score = 1.0
    path = None
@@ -115,7 +121,7 @@ class Search:
        schema = Schema(
                id = ID(stored=True, unique=True),
-                kind = ID(),
+                kind = ID(stored=True),
                created_time = ID(stored=True),
                modified_time = ID(stored=True),
@@ -172,10 +178,11 @@ class Search:
                'document' : 'docx',
        }
        content = ""
        if(mimetype not in mimemap.keys()):
            # Not a document - 
            # Just a file
-            print("Indexing document %s of type %s"%(item['name'], mimetype))
+            print("Indexing document \"%s\" of type %s"%(item['name'], mimetype))
        else:
            # Document with text
            # Perform content extraction
@@ -187,7 +194,7 @@ class Search:
            # This is a file type we know how to convert
            # Construct the URL and download it
-            print("Extracting content from %s of type %s"%(item['name'], mimetype))
+            print("Extracting content from \"%s\" of type %s"%(item['name'], mimetype))
            # Create a URL and a destination filename
@@ -227,7 +234,7 @@ class Search:
                )
                assert output == ""
            except RuntimeError:
-                print("XXXXXX Failed to index document %s"%(item['name']))
+                print("XXXXXX Failed to index document \"%s\""%(item['name']))
            # If export was successful, read contents of markdown
@@ -240,7 +247,7 @@ class Search:
            # No matter what happens, clean up.
-            print("Cleaning up %s"%item['name'])
+            print("Cleaning up \"%s\""%item['name'])
            subprocess.call(['rm','-fr',fullpath_output])
            #print(" ".join(['rm','-fr',fullpath_output]))
@@ -259,16 +266,17 @@ class Search:
                kind = 'gdoc',
                created_time = item['createdTime'],
                modified_time = item['modifiedTime'],
                indexed_time = datetime.now().replace(microsecond=0).isoformat(),
                title = item['name'],
                url = item['webViewLink'],
                mimetype = mimetype,
                owner_email = item['owners'][0]['emailAddress'],
                owner_name = item['owners'][0]['displayName'],
-                repo_name=None,
+                repo_name='',
-                repo_url=None,
+                repo_url='',
-                github_user=None,
+                github_user='',
-                issue_title=None,
+                issue_title='',
-                issue_url=None,
+                issue_url='',
                content = content
        )
@@ -277,7 +285,7 @@ class Search:
        """
        Add a Github issue/comment to a search index.
        """
-        repo_name = repo.name
+        repo_name = repo.owner.login+"/"+repo.name
        repo_url = repo.html_url
        count = 0
@@ -285,39 +293,62 @@ class Search:
        # Handle the issue content
        print("Indexing issue %s"%(issue.html_url))
        created_time = clean_timestamp(issue.created_at)
        modified_time = clean_timestamp(issue.updated_at)
        indexed_time = clean_timestamp(datetime.now())
        writer.add_document(
                id = issue.html_url,
                kind = 'issue',
                created_time = created_time,
                modified_time = modified_time,
                indexed_time = indexed_time,
                title = issue.title,
                url = issue.html_url,
-                is_comment = False,
+                mimetype='',
-                timestamp = issue.created_at,
+                owner_email='',
                owner_name='',
                repo_name = repo_name,
                repo_url = repo_url,
                github_user = issue.user.login,
                issue_title = issue.title,
                issue_url = issue.html_url,
                user = issue.user.login,
                content = issue.body.rstrip()
        )
        count += 1
        # Handle the comments content
        if(issue.comments>0):
            comments = issue.get_comments()
            for comment in comments:
                print(" > Indexing comment %s"%(comment.html_url))
                created_time = clean_timestamp(comment.created_at)
                modified_time = clean_timestamp(comment.updated_at)
                indexed_time = clean_timestamp(datetime.now())
                writer.add_document(
                        id = comment.html_url,
                        kind = 'comment',
                        created_time = created_time,
                        modified_time = modified_time,
                        indexed_time = indexed_time,
                        title = "Comment on "+issue.title,
                        url = comment.html_url,
-                        is_comment = True,
+                        mimetype='',
-                        timestamp = comment.created_at,
+                        owner_email='',
                        owner_name='',
                        repo_name = repo_name,
                        repo_url = repo_url,
                        github_user = comment.user.login,
                        issue_title = issue.title,
                        issue_url = issue.html_url,
-                        user = comment.user.login,
+                        content = comment.body.rstrip()
                        content = comment.body.strip()
                )
        count += 1
@@ -354,25 +385,50 @@ class Search:
        drive = service.files()
        # We should do more here
        # to check if we should update
        # or not...
        # 
        # loop over existing documents in index:
        #
        #    p = QueryParser("kind", schema=self.ix.schema)
        #    q = p.parse("gdoc")
        #    with self.ix.searcher() as s:
        #        results = s.search(q,limit=None)
        #        counts[key] = len(results)
        # The trick is to set next page token to None 1st time thru (fencepost)
        nextPageToken = None
        # Use the pager to return all the things
        items = []
        while True:
            ps = 12
            results = drive.list(
-                    pageSize=100,
+                    pageSize=ps,
                    pageToken=nextPageToken,
-                    fields="files(id, kind, createdTime, modifiedTime, mimeType, name, owners, webViewLink)",
+                    fields="nextPageToken, files(id, kind, createdTime, modifiedTime, mimeType, name, owners, webViewLink)",
                    spaces="drive"
            ).execute()
            nextPageToken = results.get("nextPageToken")
            items += results.get("files", [])
-            if nextPageToken is None:
+            # Keep it short
            break
            #if nextPageToken is None:
            #    break
        # Here is where we update.
        # Grab indexed ids
        # Grab remote ids
        # Drop indexed ids not in remote ids
        # Index all remote ids
        # Change add_ to update_
        # Add a hash check in update_
        indexed_ids = set()
        for item in items:
            indexed_ids.add(item['id'])
@@ -386,9 +442,12 @@ class Search:
        count = 0
        for item in items:
-            self.add_item(writer, item, indexed_ids, temp_dir, config)
+            self.add_drive_file(writer, item, indexed_ids, temp_dir, config)
            count += 1
        print("Cleaning temporary directory: %s"%(temp_dir))
        subprocess.call(['rm','-fr',temp_dir])
        writer.commit()
        print("Done, updated %d documents in the index" % count)
@@ -414,14 +473,14 @@ class Search:
        writer = self.ix.writer()
        # Iterate over each repo
-        list_of_repos = config['repos']
+        list_of_repos = config['repositories']
        for r in list_of_repos:
            if '/' not in r:
                err = "Error: specify org/reponame or user/reponame in list of repos"
                raise Exception(err)
-            this_repo, this_org = re.split('/',r)
+            this_org, this_repo = re.split('/',r)
            org = g.get_organization(this_org)
            repo = org.get_repo(this_repo)
@@ -441,6 +500,7 @@ class Search:
                to_index.add(issue.html_url)
                writer.delete_by_term('url', issue.html_url)
                count -= 1
                comments = issue.get_comments()
                for comment in comments:
@@ -477,11 +537,6 @@ class Search:
            # contains a {% for e in entries %}
            # and then an {{e.score}}
            # ------------------
            # cheseburger
            # create search results
            sr = SearchResult()
            sr.score = r.score
@@ -495,37 +550,29 @@ class Search:
            sr.id = r['id']
            sr.kind = r['kind']
-            sr.url = r['url']
+
            sr.created_time = r['created_time']
            sr.modified_time = r['modified_time']
            sr.indexed_time = r['indexed_time']
            sr.title = r['title']
            sr.url = r['url']
            sr.mimetype = r['mimetype']
            sr.owner_email = r['owner_email']
            sr.owner_name = r['owner_name']
            sr.content = r['content']
            # -----------------
            # github isuses
            # create search results
            sr = SearchResult()
            sr.score = r.score
            sr.url = r['url']
            sr.title = r['issue_title']
            sr.repo_name = r['repo_name']
            sr.repo_url = r['repo_url']
            sr.issue_title = r['issue_title']
            sr.issue_url = r['issue_url']
-            sr.is_comment = r['is_comment']
+            sr.github_user = r['github_user']
            sr.content = r['content']
            # ------------------
            highlights = r.highlights('content')
            if not highlights:
                # just use the first 1,000 words of the document
@@ -558,27 +605,15 @@ class Search:
            elif len(fields) == 2:
                pass
            else:
-                fields = ['id',
+                # If the user does not specify a field,
-                          'kind',
+                # these are the fields that are actually searched
-                          'created_time',
+                fields = ['title',
                          'modified_time',
                          'indexed_time',
                          'title',
                          'url',
                          'mimetype',
                          'owner_email',
                          'owner_name',
                          'repo_name',
                          'repo_url',
                          'issue_title',
                          'issue_url',
                          'github_user',
                          'content']
            if not query:
                query = MultifieldParser(fields, schema=self.ix.schema).parse(query_string)
            parsed_query = "%s" % query
            print("query: %s" % parsed_query)
-            results = searcher.search(query, terms=False, scored=True, groupedby="url")
+            results = searcher.search(query, terms=False, scored=True, groupedby="kind")
            search_result = self.create_search_result(results)
        return parsed_query, search_result
@@ -589,7 +624,29 @@ class Search:
        return s if len(s) <= l else s[0:l - 3] + '...'
    def get_document_total_count(self):
-        return self.ix.searcher().doc_count_all()
+        p = QueryParser("kind", schema=self.ix.schema)
        kind_labels = {
                "documents" : "gdoc",
                "issues" :    "issue",
                "comments" :  "comment"
        }
        counts = {
                "documents" : None,
                "issues" : None,
                "comments" : None,
                "total" : None
        }
        for key in kind_labels:
            kind = kind_labels[key]
            q = p.parse(kind)
            with self.ix.searcher() as s:
                results = s.search(q,limit=None)
                counts[key] = len(results)
        counts['total'] = self.ix.searcher().doc_count_all()
        return counts
 if __name__ == "__main__":
    search = Search("search_index")
--- a/config_centillion.json
+++ b/config_centillion.json
@@ -1,7 +1,6 @@
 {
    "repositories" : [
        "dcppc/2018-june-workshop",
-        "dcppc/2018-july-workshop",
+        "dcppc/2018-july-workshop"
        "dcppc/data-stewards"
    ]
 }
--- a/config_flask.py
+++ b/config_flask.py
@@ -1,27 +1,9 @@
 # Path to markdown files
 MARKDOWN_FILES_DIR = "/Users/charles/codes/whoosh/markdown-search/fake-docs/"
 # Location of index file
 INDEX_DIR = "search_index"
 # Command to use when clicking on filepath in search results
 EDIT_COMMAND = "view"
 # Toggle to show Whoosh parsed query
 SHOW_PARSED_QUERY=True
 # Toogle to use tags
 USE_TAGS=True
 # Optional prefix in a markdown file, e.g. "tags: python search markdown tutorial"
 TAGS_PREFIX=""
 # List of tags that should be ignored
 TAGS_TO_IGNORE = "and are what how its not with the"
 # Regular expression to select tags, eg tag has to start with alphanumeric followed by at least two alphanumeric or "-" or "."
 TAGS_REGEX = r"\b([A-Za-z0-9][A-Za-z0-9-.]{2,})\b"
 # Flask settings
 DEBUG = True
 SECRET_KEY = '42c5a8eda356ca9d9c3ab2d149541e6b91d843fa'
--- a/docs/index.md
+++ b/docs/index.md
@@ -0,0 +1,54 @@
 # The Centillion
 **the centillion**: a pan-github-markdown-issues-google-docs search engine.
 **a centillion**: a very large number consisting of a 1 with 303 zeros after it.
 the centillion is 3.03 log-times better than the googol.
 ## what is it
 The centillion is a search engine built using [whoosh](https://whoosh.readthedocs.io/en/latest/intro.html),
 a Python library for building search engines.
 We define the types of documents the centillion should index,
 what info and how. The centillion then builds and
 updates a search index. That's all done in `centillion_search.py`.
 The centillion also provides a simple web frontend for running
 queries against the search index. That's done using a Flask server
 defined in `centillion.py`.
 The centillion keeps it simple.
 ## quickstart
 Run the centillion app with a github access token API key set via
 environment variable:
 ```
 GITHUB_TOKEN="XXXXXXXX" python centillion.py
 ```
 This will start a Flask server, and you can view the minimal search engine
 interface in your browser at <http://localhost:5000>.
 ## work that is done
 See [standalone.md](standalone.md) for the summary of
 the three standalone whoosh servers that were built:
 one for a folder of markdown files, one for github issues
 and comments, and one for google drive documents.
 ## work that is being done
 See [workinprogress.md](workinprogress.md) for details about
 work in progress.
 ## work that is planned
 See [plans.md](plans.md)
--- a/Workplanned.md
+++ b/Workplanned.md
@@ -31,3 +31,4 @@ Stateless
--- a/docs/standalone.md
+++ b/docs/standalone.md
@@ -1,4 +1,4 @@
-## work that is done
+## work that is done: standalone
 **Stage 1: index folder of markdown files** (done)
 * See [markdown-search](https://git.charlesreid1.com/charlesreid1/markdown-search.git)
@@ -13,7 +13,7 @@
 Needs work:
-* More appropriate schema
+* <s>More appropriate schema</s>
 * Using more features (weights) plus pandoc filters for schema
 * Sqlalchemy (and hey waddya know safari books has it covered)
@@ -25,15 +25,16 @@ Needs work:
 * Main win here is uncovering metadata/linking/presentation issues
 Needs work:
- treat comments and issues as separate objects, fill out separate schema fields
+- <s>treat comments and issues as separate objects, fill out separate schema fields
 - map out and organize how the schema is updated to make it more flexible
- configuration needs to enable user to specify organization+repos
+- configuration needs to enable user to specify organization+repos</s>
 ```plain
 {
-    "to_index" : {
+    "to_index" : [
-        "google" : "google-api-python-client",
+        "google/google-api-python-client",
-        "microsoft" : ["TypeCode","api-guidelines"]
+        "microsoft/TypeCode",
        "microsoft/api-guielines"
    }
 }
 ```
@@ -48,3 +49,4 @@ Needs work:
 * Use the google drive api (see simple-simon)
 * Main win is more uncovering of metadata issues, identifying
  big-picture issues for centillion
--- a/docs/workinprogress.md
+++ b/docs/workinprogress.md
@@ -0,0 +1,48 @@
 # Components
 The components of centillion are as follows:
 - Flask application, which creates a Search object and uses it to search index
 - Search object, which allows you to create/update/search an index
 ## Routes layout
 Centillion flask app routes:
 - `/home`
    - if not logged in, landing page
    - if logged in, redirect to search
 - `/search`
 - `/main_index_update`
    - update main index, all docs period
 ## Functions layout
 Centillion Search class functions:
 - `open_index()` creates the schema
 - `add_issue()`, `add_md()`, `add_document()` have three diff method sigs and add diff types
  of documents to the search index
 - `update_all_issues()` or `update_all_md()` or `update_all_documents()` iterates over items
  and determines whether each item needs to be updated in the search index
 - `update_main_index()` - update the entire search index
    - calls all three update_all methods
 - `create_search_results()` - package things up for jinja
 - `search()` - run the query, pass results to the jinja-packager
 Nice to have but focus on it later:
 - update diff search index (what's been added since last index time)
    - max index time
 ## Files layout
 Schema definition:
 * include a "kind" or "class" to group objects
 * can provide different searches of different collections
 * eventually can provide user with checkboxes
--- a/img/ss.png
+++ b/img/ss.png
--- a/1
+++ b/1
--- a/static/bootstrap.min.css
+++ b/static/bootstrap.min.css
--- a/static/centillion_black.png
+++ b/static/centillion_black.png
--- a/static/centillion_white.png
+++ b/static/centillion_white.png
--- a/static/centillion_xparent.png
+++ b/static/centillion_xparent.png
--- a/static/style.css
+++ b/static/style.css
@@ -1,3 +1,24 @@
 li.search-group-item {
    position: relative;
    display: block;
    padding: 0px;
    margin-bottom: -1px;
    background-color: #fff;
    border: 1px solid #ddd;
 }
 div.list-group {
    border: 1px solid rgba(86,61,124,.2);
 }
 div.url {
    background-color: rgba(86,61,124,.15);
    padding: 8px;
 }
 /***************************/
 body {
    font-family: sans-serif;
 }
@@ -56,7 +77,7 @@ table {
    overflow: hidden;
 }
-td.info, .last-searches {
+.info, .last-searches {
    color: gray;
    font-size: 12px;
    font-family: Arial, serif;
--- a/templates/layout.html
+++ b/templates/layout.html
@@ -1,7 +1,8 @@
 <!doctype html>
 <title>Markdown Search</title>
 <link rel="stylesheet" type="text/css" href="{{ url_for('static', filename='github-markdown.css') }}">
 <link rel="stylesheet" type="text/css" href="{{ url_for('static', filename='style.css') }}">
 <link rel="stylesheet" type="text/css" href="{{ url_for('static', filename='github-markdown.css') }}">
 <link rel="stylesheet" type="text/css" href="{{ url_for('static', filename='bootstrap.min.css') }}">
 <div>
    {% for message in get_flashed_messages() %}
        <div class="flash">{{ message }}</div>
--- a/templates/search.html
+++ b/templates/search.html
@@ -1,56 +1,140 @@
 {% extends "layout.html" %}
 {% block body %}
-<h1><a href="{{ url_for('search')}}?query=&fields=">Search directory: {{ config.MARKDOWN_FILES_DIR }}</a></h1>
+
 <div class="container">
    <div class="row">
        <div class="col12sm">
            <center>
                <a href="{{ url_for('search')}}?query=&fields=">
                <img src="{{ url_for('static', filename='centillion_white.png') }}">
                </a>
            </center>
        </div>
    </div>
    <div class="row">
        <div class="col12sm">
            <center>
                <h2>
                    <a href="{{ url_for('search')}}?query=&fields=">
                    Search the DCPPC
                    </a>
                </h2>
            </center>
        </div>
    </div>
    <div class="row">
        <div class="col-12">
            <center>
                <a class="index" href="{{ url_for('update_index')}}">[update index]</a>
                <a class="index" href="{{ url_for('update_index')}}?rebuild=True">[rebuild index]</a>
                <form action="{{ url_for('search') }}" name="search">
-    <input type="text" name="query" value="{{ query }}">
+                    <input type="text" name="query" value="{{ query }}"> <br />
-    <input type="submit" value="search">
+                    <button type="submit" style="font-size: 20px; padding: 10px; padding-left: 50px; padding-right: 50px;" 
-    <a href="{{ url_for('search')}}?query=&fields=">[clear]</a>
+                        value="search" class="btn btn-primary">Search</button>
                    <br />
                    <a href="{{ url_for('search')}}?query=&fields=">[clear all results]</a>
                </form>
-<table cellspacing="3">
+            </center>
        </div>
    </div>
 </div>
 <div class="container">
    <div class="row">
        {% if directories %}
-    <tr>
+        <div class="col-12 info directories-cloud">
-        <td class="directories-cloud">File directories:&nbsp
+            File directories:&nbsp
            {% for d in directories %}
                <a href="{{url_for('search')}}?query={{d|trim}}&fields=filename">{{d|trim}}</a>
            {% endfor %}
-        </td>
+        </div>
    </tr>
        {% endif %}
-    {% if config['SHOW_PARSED_QUERY']%}
+
-    <tr>
+        <ul class="list-group">
-        <td class="info">Parsed query: {{ parsed_query }}</td>
+
-    </tr>
+            {% if config['SHOW_PARSED_QUERY'] and parsed_query %}
                <li  class="list-group-item">
                    <div class="col-12 info">
                        <b>Parsed query:</b> {{ parsed_query }}
                    </div>
                </li>
            {% endif %}
-    <tr>
+
-        <td class="info">FOUND {{ entries | length }} results of {{total}} documents</td>
+            {% if parsed_query %}
-    </tr>
+                <li  class="list-group-item">
                    <div class="col-12 info">
                        <b>Found:</b> {{entries|length}} documents with results, out of {{totals["total"]}} total documents
                    </div>
                </li>
            {% endif %}
            <li  class="list-group-item">
                <div class="col-12 info">
                    <b>Indexing:</b> {{totals["documents"]}} Google Documents,
                    {{totals["issues"]}} Github issues, and 
                    {{totals["comments"]}} Github comments
                </div>
            </li>
        </ul>
    </div>
 </div>
 <div class="container">
    <div class="row">
        <ul class="list-group">
            {% for e in entries %}
-    <tr>
+                <li  class="search-group-item">
-        <td class="search-result">
+
            <!--
                <div class="path"><a href='{{ url_for("open_file")}}?path={{e.path|urlencode}}&query={{query}}&fields={{fields}}'>{{e.path}}</a>score: {{'%d'  % e.score}}</div>
            -->
                    <div class="url">
-                {% if e.is_comment %}
+                        {% if e.kind=="gdoc" %}
-                    <b>Comment</b> <a href='{{e.url}}'>(comment link)</a>
+                            <b>Google Drive File:</b>
-                    on issue <a href='{{e.issue_url}}'>{{e.issue_title}}</a>
+                            <a href='{{e.url}}'>{{e.title}}</a>
-                    in repo <a href='{{e.repo_url}}'>dcppc/{{e.repo_name}}</a>
+                            ({{e.owner_name}}, {{e.owner_email}})
-                    <br />
+                        {% elif e.kind=="comment" %}
-                {% else %}
+                            <b>Comment:</b>
-                    <b>Issue</b> <a href='{{e.issue_url}}'>{{e.issue_title}}</a>
+                            <a href='{{e.url}}'>Comment (link)</a>
-                    in repo <a href='{{e.repo_url}}'>dcppc/{{e.repo_name}}</a>
+                            {% if e.github_user %}
-                    <br />
+                            by <a href='https://github.com/{{e.github_user}}'>@{{e.github_user}}</a>
                            {% endif %}
                            on issue <a href='{{e.issue_url}}'>{{e.issue_title}}</a>
                            <br/>
                            <b>Repository:</b> <a href='{{e.repo_url}}'>{{e.repo_name}}</a>
                            {% if e.github_user %}
                            {% endif %}
                        {% elif e.kind=="issue" %}
                            <b>Issue:</b>
                            <a href='{{e.issue_url}}'>{{e.issue_title}}</a>
                            {% if e.github_user %}
                            by <a href='https://github.com/{{e.github_user}}'>@{{e.github_user}}</a>
                            {% endif %}
                            <br/>
                            <b>Repository:</b> <a href='{{e.repo_url}}'>{{e.repo_name}}</a>
                        {% else %}
                            <b>Item:</b> (<a href='{{e.url}}'>link</a>)
                        {% endif %}
                        <br />
                        score: {{'%d'  % e.score}}
                    </div>
                    <div class="markdown-body">{{ e.content_highlight|safe}}</div>
-        </td>
+
-    </tr>
+                </li>
            {% endfor %}
-</table>
+        </ul>
    </div>
 </div>
 <div class="container">
    <div class="row">
        <div class="col-12">
            <div class="last-searches">Last searches: <br/>
                {% for s in last_searches %}
                    <span><a href="{{url_for('search')}}?{{s}}">{{s}}</a></span>
@@ -59,4 +143,9 @@
            <p>
                More info can be found in the <a href="https://github.com/BernhardWenzel/markdown-search">README.md file</a>
            </p>
        </div>
    </div>
 </div>
 {% endblock %}
Author	SHA1	Message	Date
Charles Reid	69339abe24	fix the way repo name label is handled	2018-08-01 00:25:29 -07:00
Charles Reid	8d2718d783	update how we store totals	2018-07-31 23:58:19 -07:00
Charles Reid	8912b945fe	remove print statement	2018-07-31 23:17:16 -07:00
Charles Reid	ddceb16a2c	fix template rendering in update_index url endpoint	2018-07-31 23:16:45 -07:00
Charles Reid	f769d18b4e	clean up flask config file	2018-07-31 23:16:23 -07:00
Chaz Reid	34a889479a	Update config_flask.py	2018-07-31 23:12:57 -07:00
Charles Reid	a074e6c0e7	add image to readme	2018-07-31 23:07:32 -07:00
Charles Reid	918c9d583f	update search results template	2018-07-31 23:01:38 -07:00
Charles Reid	6cd505087b	package up the counts in get_document_total_count	2018-07-31 22:37:20 -07:00
Charles Reid	ee9b3bb811	pass a count dictionary instead of an integer to the jinja template	2018-07-31 22:36:43 -07:00
Charles Reid	8a4e20b71c	update template - gotta look good	2018-07-31 22:36:13 -07:00
Charles Reid	64d3ce4a9b	update search engine style to use centillion logo	2018-07-31 18:29:01 -07:00
Charles Reid	5e9b584d26	uncovered the mysterious missing google docs: they were just being labeled as issues by the search template.	2018-07-31 15:59:21 -07:00
Charles Reid	b03a42d261	start some troubleshooting	2018-07-31 05:21:58 -07:00
Charles Reid	bd4f4da8dc	more fixes - use "" not None	2018-07-31 05:15:22 -07:00
Charles Reid	23743773a6	add mkdocs-material submodule	2018-07-31 04:33:27 -07:00
Charles Reid	b7d2a8c960	rename some files, and move docs into docs/	2018-07-31 04:32:38 -07:00
Charles Reid	1f4b43163a	fix env var name	2018-07-31 03:16:28 -07:00
Charles Reid	f80ccc2520	successfully indexing, unsuccessfully searching	2018-07-31 03:06:25 -07:00
Charles Reid	c2eae4f521	improve handling of repo names, oweners, and document schema. improve timestamps.	2018-07-31 01:52:44 -07:00
Charles Reid	c758ca7a6c	add quickstart	2018-07-31 01:28:38 -07:00
Charles Reid	3cf142465a	updating readme with flask mention	2018-07-31 01:23:49 -07:00
Charles Reid	bfd351c990	Update 'Workdone.md'	2018-07-31 08:12:28 +00:00