[query] Support SQLite's LIKE for non-fulltext text searching #747

ncalexan · 2018-06-18T17:47:09Z

While implementing a rough clone of Firefox for iOS's logins handling, I noticed that we don't support SQLite's LIKE operator for non-fulltext text searching. That's what we use in Firefox for iOS, and I don't want to make some of these fields :db/fulltext true, so we should grow support for it. I'm thinking that it'll be a special filtering function, like:

[:find ?e :where
 [?e :credential/name ?t]
 [(string-contains ?t "pattern")]]

although there are a few subtleties. First, "pattern" can be a binding (which is well supported by SQLite). Second, the pattern can contain _ and %, which have special meaning to SQLite. It's easy to escape a constant pattern, but not so easy to escape a binding (coming from elsewhere in the query engine, i.e., another column). We could make escaping the responsibility of the consumer, but that's likely to lead to surprises.

As a first cut, we could only accept literal patterns, which we can escape (or not) and transform concretely.

The text was updated successfully, but these errors were encountered:

rnewman · 2018-06-19T02:27:59Z

I concur re literal patterns, but perhaps consider string-starts-with and string-ends-with (which would omit the implied %string% stuff in SQLite).

I encourage you to require that any such ?t has a bound attribute, because otherwise you’re doing an unbounded full search of any string in the store, including vocabularies that calling code might not be aware of. You can use or successfully here.

ncalexan · 2018-06-19T04:03:01Z

I encourage you to require that any such ?t has a bound attribute, because otherwise you’re doing an unbounded full search of any string in the store, including vocabularies that calling code might not be aware of. You can use or successfully here.

It's not clear to me how to trace provenance of bindings so concretely in the code as currently expressed. This has been on my mind a bit as I idly ponder what our performance profile would be if we stored the indices as separate tables and then used the "best possible" table at query time. (That's the strategy Datascript and presumably Datomic take.)

But yes, bad things can happen with unbounded table walks. Datomic prevents them, which is irritating when you're trying to inspect the store :)

rnewman · 2018-06-21T00:53:15Z

We could hack in this kind of placement analysis into the initial pattern walk — we know at this point which variables are both string matches and the objects of patterns, and what the attributes of those patterns are — but in general this is constraint algebrizing, and it’s hard.

ncalexan added enhancement help wanted A-query Issues or requests for query capabilities. labels Jun 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[query] Support SQLite's LIKE for non-fulltext text searching #747

[query] Support SQLite's LIKE for non-fulltext text searching #747

ncalexan commented Jun 18, 2018

rnewman commented Jun 19, 2018

ncalexan commented Jun 19, 2018

rnewman commented Jun 21, 2018

[query] Support SQLite's LIKE for non-fulltext text searching #747

[query] Support SQLite's LIKE for non-fulltext text searching #747

Comments

ncalexan commented Jun 18, 2018

rnewman commented Jun 19, 2018

ncalexan commented Jun 19, 2018

rnewman commented Jun 21, 2018