Ref T6656 use elasticsearch to find duplicates
- Group Reviewers
- Maniphest Tasks
- T4828: Suggest/propose possible duplicates when creating a new task
use ./bin/search find_duplicates to try without any index changes.
Then do a ./bin/search find_duplicates --installmapping to change the mapping.
Then all tasks need to be reindexed (./bin/search index --type TASK) and then try
./bin/search find_duplicates --analyzed and compare with the initial result.
Language analyzed is hardcoded to english for now.
- rP Phabricator
Lint Warnings Severity Location Code Message Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:9 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:20 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:24 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:34 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:77 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:82 TXT3 Line Too Long Warning src/applications/search/management/PhabricatorSearchManagementFindDuplicatesWorkflow.php:90 TXT3 Line Too Long
- Build Status
Buildable 3296 Build 3303: [Placeholder Plan] Wait for 30 Seconds
This diff is not really meant to be merged. (And i'm not sure if can even create a diff on top of another not yet landed diff?)
But you're right that if we want the numbers to be correct you should apply the patch from D10955 and then this one on top
and then only run: ./bin/search find_duplicates
I'll just remove the mapping stuff from this diff.