[docs] Introduce "search" feature for website by jugglinmike · Pull Request #15916 · web-platform-tests/wpt

jugglinmike · 2019-03-18T21:14:36Z

I've been reviewing WPT's open "docs" issues and the responses to the 2018 WPT survey, and one of the strongest themes is difficulty locating existing content. We're hoping to improve that by restructuring the documents, and a dedicated text search is another way to help folks find what they're looking for.

This patch implements search on the client, integrating Jekyll output with the Lunr.js JavaScript library. Lunr.js is MIT-licensed, so I don't think there are any concerns about including it in WPT. For a small client-side library, Lunr.js is surprisingly good at natural language search (e.g. the query "equals" matches the term "equality").

Building the site locally can be a bit tricky, so here are a few screenshots:

I also experimented with a simpler solution which relies on DuckDuckGo. While that approach requires less code, it has some drawbacks:

- Search results include matches from pulls.web-platform-tests.org, but such matches are unlikely to be desired (we should probably publish a robots.txt file there).
- DuckDuckGo has to index new content before it becomes available via search.

While the visualization of results in this patch could use some work, I wanted to get some feedback before spending more time on it. @gsnedders @jgraham @foolip (+anyone else): does this look promising to you? Do you think DuckDuckGo would be better? Or is there another alternative that I should look into?

foolip · 2019-03-19T20:36:29Z

This looks very promising; I was hoping we could have a client-side search just like this! I'll review.

foolip · 2019-03-19T20:38:41Z

@@ -0,0 +1,6 @@
+/**
+ * lunr - http://lunrjs.com - A bit like Solr, but much smaller and not as bright - 2.3.6


Do we need anything in place to remember to update this occasionally?

foolip · 2019-03-19T20:42:28Z

+
+function getQuery() {
+  var query = window.location.search.substring(1);
+  var vars = query.split("&");


For our audience I think it's OK to depend on https://developer.mozilla.org/en-US/docs/Web/API/URLSearchParams/URLSearchParams to simplify this code a bit, i.e., getQuery won't be needed at all.
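As a rough illustration of that suggestion, the sketch below reads one query parameter with URLSearchParams instead of splitting the string by hand. The helper name getSearchTerm and the parameter name "q" are assumptions made for this example; the patch may use different names.

```javascript
// Hypothetical helper: extract one parameter from a query string such as
// window.location.search. The default parameter name "q" is an assumption.
function getSearchTerm(search, name = "q") {
  // URLSearchParams ignores a leading "?" and decodes "+" and %-escapes.
  const params = new URLSearchParams(search);
  // get() returns null when the parameter is absent; fall back to "".
  return params.get(name) || "";
}

// In a browser this would be called as:
//   getSearchTerm(window.location.search)
console.log(getSearchTerm("?q=equals+operator"));
```

Because the constructor does the splitting and decoding, the manual split on "&" is no longer needed.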

foolip · 2019-03-19T20:53:57Z

+
+  var appendString = "";
+
+  for (var i = 0; i < results.length; i++) {  // Iterate over the results


It'd be fine to use for (var result of results) or results.forEach, without supporting older browsers, if you'd find that nicer; that form also wouldn't require explanation.
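A minimal sketch of the two loop styles mentioned here, assuming each result is an object with a ref property (as in Lunr.js search results). The function names are hypothetical and only exist for the comparison.

```javascript
// Hypothetical: collect the `ref` of every result using for...of.
function collectRefsForOf(results) {
  const refs = [];
  for (const result of results) {
    refs.push(result.ref);
  }
  return refs;
}

// The same traversal written with Array.prototype.forEach.
function collectRefsForEach(results) {
  const refs = [];
  results.forEach(function (result) {
    refs.push(result.ref);
  });
  return refs;
}
```

Neither form needs an index variable, which is why the "Iterate over the results" comment becomes unnecessary.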

foolip · 2019-03-19T20:59:59Z

+      {% for page in collection.docs %}
+      "{{ page.url | slugify }}": {
+        "title": "{{ page.title | xml_escape }}",
+        "content": {{ page.content | strip_html | jsonify }},


Aha, xml_escape and strip_html may answer most of the questions I've had. Can you add comments in the suspicious parts of the code pointing out why it's totally safe?

What I'd already written elsewhere:

The text argument here comes from lunr to begin with. Is this text that should be shown verbatim to the user, or already escaped as HTML? In either case it looks like something is missing:

- escaping of each part of text as it's pieced together, if it's verbatim text that happens to include < and such (creating a DocumentFragment with Text and HTMLMarkElement children might be less work)
- parsing text as HTML and operating on the DOM, if text is actually markup (very tedious)
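To make the first option concrete, here is a sketch of escaping each piece of verbatim text as the highlighted string is assembled. It is not the code from the patch: escapeHtml and highlight are hypothetical names, and the input format (sorted, non-overlapping [start, length] pairs, similar to Lunr.js position metadata) is an assumption.

```javascript
// Escape the characters that are significant in HTML text and attributes.
function escapeHtml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#39;");
}

// Wrap each matched range in <mark>, escaping every piece separately.
// Assumption: `ranges` holds sorted, non-overlapping [start, length] pairs.
function highlight(text, ranges) {
  let html = "";
  let cursor = 0;
  for (const [start, length] of ranges) {
    html += escapeHtml(text.slice(cursor, start));
    html += "<mark>" + escapeHtml(text.slice(start, start + length)) + "</mark>";
    cursor = start + length;
  }
  // Append whatever follows the last match.
  return html + escapeHtml(text.slice(cursor));
}

console.log(highlight("a < b equals c", [[6, 6]]));
// prints "a &lt; b <mark>equals</mark> c"
```

Only the mark tags are emitted as markup; everything taken from the indexed content passes through escapeHtml first.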

jugglinmike · 2019-06-18T22:15:29Z

In gh-16458, we re-implemented the build process to use Sphinx. That includes a client-side search, so this patch is no longer necessary.

[docs] Introduce "search" feature for website

9fa6fd6

jugglinmike requested review from foolip, gsnedders and jgraham March 18, 2019 21:14

wpt-pr-bot added the docs label Mar 18, 2019

wpt-pr-bot assigned gsnedders Mar 18, 2019

wpt-pr-bot requested a review from sideshowbarker March 18, 2019 21:14

foolip reviewed Mar 19, 2019

jugglinmike closed this Jun 18, 2019

jugglinmike wants to merge 1 commit into web-platform-tests:master from bocoup:docs-search-client
