Page MenuHomePhabricator

Show users how fulltext search queries are parsed and executed; don't query stopwords or short tokens
ClosedPublic

Authored by epriestley on Apr 13 2017, 12:04 AM.
Tags
None
Referenced Files
F19817244: D17672.diff
Thu, Mar 5, 11:54 AM
F19811553: D17672.diff
Wed, Mar 4, 1:37 AM
F19749784: D17672.id42497.diff
Sun, Feb 15, 8:57 PM
F19671033: D17672.id.diff
Sun, Feb 8, 12:10 AM
F19662270: D17672.id42503.diff
Feb 7 2026, 8:42 AM
F19617055: D17672.id.diff
Feb 4 2026, 4:36 AM
F19532227: D17672.diff
Jan 20 2026, 9:42 AM
F19528507: D17672.diff
Jan 19 2026, 9:42 AM
Subscribers
None

Details

Summary

Depends on D17670. Fixes T12137. Fixes T12003. Ref T2632.

This shows users a readout of which terms were actually searched for.

This also drops those terms from the query we submit to the backend, dodging the weird behaviors / search engine bugs in T12137.

This might need some design tweaking.

Test Plan

Screen Shot 2017-04-12 at 4.56.59 PM.png (493×851 px, 52 KB)

Diff Detail

Repository
rP Phabricator
Branch
search4
Lint
Lint Passed
Unit
Tests Passed
Build Status
Buildable 16429
Build 21862: Run Core Tests
Build 21861: arc lint + arc unit

Event Timeline

  • Use phutil_utf8_strlen() for identifying short tokens.

InnoDB list is pretty conservative:

https://dev.mysql.com/doc/refman/5.7/en/fulltext-stopwords.html

| a     |
| about |
| an    |
| are   |
| as    |
| at    |
| be    |
| by    |
| com   |
| de    |
| en    |
| for   |
| from  |
| how   |
| i     |
| in    |
| is    |
| it    |
| la    |
| of    |
| on    |
| or    |
| that  |
| the   |
| this  |
| to    |
| was   |
| what  |
| when  |
| where |
| who   |
| will  |
| with  |
| und   |
| the   |
| www   |
This revision is now accepted and ready to land.Apr 13 2017, 1:59 AM
This revision was automatically updated to reflect the committed changes.