Commit Graph

200 Commits

Author SHA1 Message Date
Andre Duffeck c0665975b3 Do not try to fulltext-index large files. (#6395)
* Do not try to extract the content of large files.

Both tika and ocis do not handle large files/archives very well.

* Apply suggestions from code review

Co-authored-by: Martin <github@diemattels.at>

---------

Co-authored-by: Martin <github@diemattels.at>
2023-05-27 19:03:58 +02:00
kobergj 924f0c408d Merge pull request #6370 from kobergj/AddFulltextsearchCapability
Add Fulltextsearch Capability
2023-05-25 15:50:43 +02:00
jkoberg 70391a0bf2 unify CACERT envvars
Signed-off-by: jkoberg <jkoberg@owncloud.com>
2023-05-25 13:32:37 +02:00
André Duffeck e1279da025 Do not mess with non-lowercased fields 2023-05-25 10:55:52 +02:00
kobergj 382a2800f8 Update services/search/README.md
Co-authored-by: Martin <github@diemattels.at>
2023-05-24 13:58:40 +02:00
jkoberg 50ef698fda add fulltextsearch capability
Signed-off-by: jkoberg <jkoberg@owncloud.com>
2023-05-24 13:58:37 +02:00
André Duffeck f033182e09 Log search durations 2023-05-24 13:12:41 +02:00
André Duffeck cfb0152372 Fix boolean queries 2023-05-24 12:01:35 +02:00
André Duffeck b1bfb93230 Fix test expectations 2023-05-24 12:01:35 +02:00
André Duffeck e8c9d16576 Do not choke on : searches
Fixes #4744
2023-05-24 12:01:35 +02:00
André Duffeck 9fe1225b53 Make field base search case insensitive
Fixes part of #6366
2023-05-24 12:01:35 +02:00
André Duffeck 6275b7b124 Do not include full-text and tag data in the composite field
This commit removes the full-text and tag data from the composite field
("_all") which can be used for default term queries which we don't do
anyways.
2023-05-19 16:13:30 +02:00
André Duffeck 55a6716d49 Trigger a space reindex when the space has been renamed
Fixes #6289 (in combination with https://github.com/cs3org/reva/pull/3889)
2023-05-15 16:21:53 +02:00
Martin da49a5cd71 Update services/search/README.md 2023-05-15 14:20:16 +02:00
Martin d3cd43861c [docs-only ]Update search README.md
Add a note that tika containers are only available for the amd64 platform.
2023-05-15 14:20:16 +02:00
Andre Duffeck 32014badd7 Merge pull request #6268 from aduffeck/improve-search
Improve search
2023-05-12 12:11:38 +02:00
André Duffeck e9d4e07a08 constify the "trashed" space state 2023-05-12 10:37:43 +02:00
André Duffeck 82590632b6 Always use UTC for doc timestamps 2023-05-12 10:37:43 +02:00
André Duffeck e01ef9b8c6 Search spaces concurrently 2023-05-12 10:37:43 +02:00
André Duffeck a9b139af97 Do not consider disabled spaces during search
Fixes #6201
2023-05-12 10:37:43 +02:00
André Duffeck 63b2ac2368 Do not use prefix queries which gets slow with big indexes
Co-authored-by: Florian Schade <f.schade@icloud.com>
2023-05-12 10:37:43 +02:00
mmattel 456387c69f [docs-only] Add Tika container link to search for dev purposes 2023-05-12 09:31:21 +02:00
André Duffeck ca637de017 Fix detecting resources that haven't changed when reindexing spaces
This is a regression introduced with the experimental search backport.
2023-04-19 17:44:16 +02:00
mmattel 8ed34926ef [docs-only] Correct the removal version according semver to 4.0.0 2023-04-19 10:16:06 +02:00
mmattel b2504035e0 fix readme service names 2023-04-12 18:40:41 +02:00
Daniel Swärd 46e50b3a70 Reapply name change, without adding files by mistake. 2023-04-05 13:26:29 +02:00
Daniel Swärd a693a18b99 Revert "Change new name to address PR comment."
This reverts commit 8911ca2e98.
2023-04-05 13:25:04 +02:00
Daniel Swärd 8911ca2e98 Change new name to address PR comment. 2023-04-05 12:26:56 +02:00
Daniel Swärd afbfd2d6b2 Deprecating and renaming many environment variables
Fixes #5767
Fixes #5905
Fixes #5532
2023-04-05 11:05:48 +02:00
Roman Perekhod fcfb2ab9fa Introduced global OCIS_EVENTS_ENDPOINT and OCIS_EVENTS_CLUSTER #5901 2023-04-04 15:43:03 +02:00
Dennis Sieben e28603e633 - fixed typos 2023-03-27 08:54:51 +02:00
David Christofas 924c2ca346 add missing comments 2023-03-06 19:01:16 +01:00
David Christofas 63fa35fa50 reduce duplication in configuration code 2023-03-06 16:52:44 +01:00
Jörn Friedrich Dreyer 9b00f02383 bump reva to ec27f5f8feb3
Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>
2023-02-15 15:26:07 +00:00
mmattel 8dda0c6728 create readme.md for seach 2023-02-07 09:51:37 +01:00
mmattel 210cd54817 fix some envvar descriptions 2023-02-03 09:40:35 +01:00
Daniël Franke d2076941f4 Add tests.
Also update mocks with mockery.
2023-01-23 14:38:12 +01:00
Florian Schade 1afd7c2a4b use event executant to index tags (#5405) 2023-01-13 17:31:34 +01:00
Jörn Friedrich Dreyer 3aa864aecc Remove Staticcheck warnings (#5394)
* remove deprecated ioutil imports

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

* remove duplicate imports

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

* remove unused values

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

* remove some unused types

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

* remove deprecated ioutil import

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>
2023-01-13 15:29:14 +01:00
jkoberg 837eff9017 adjust for tags pkg
Signed-off-by: jkoberg <jkoberg@owncloud.com>

tmp

Signed-off-by: jkoberg <jkoberg@owncloud.com>
2022-12-23 10:55:57 +01:00
jkoberg b29ffeb3a9 use stream pkg instead server pkg
Signed-off-by: jkoberg <jkoberg@owncloud.com>
2022-12-20 09:53:11 +01:00
Florian Schade 1db03dd512 [full-ci] experimental tags backport (#5227)
* add tags to search service resource
add tags getTags, AssignTags and UnassignTags endpoint to graph
use and prefer search event spaceOwner over executant
add tags to search report response
update libre graph api
update reva

Co-authored-by: David Christofas <dchristofas@owncloud.com>
2022-12-19 15:44:02 +01:00
Florian Schade 2404eff48e [full-ci] experimental search backport (#5221)
* experimental search backport
fix basic extractor resource name
move escapeQuery regex into global variable
minor pr review changes
rename DebounceDuration env variable
add document title and content when rebuilding bleve resource

Co-authored-by: David Christofas <dchristofas@owncloud.com>
2022-12-13 14:22:41 +01:00
Jörn Friedrich Dreyer b2b4f9893c align revactx package import (#5206)
Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>

Signed-off-by: Jörn Friedrich Dreyer <jfd@butonic.de>
2022-12-08 14:45:30 +01:00
Christian Richter 080a37784f Fix tests
Co-authored-by: André Duffeck <aduffeck@firondu.de>
Signed-off-by: Christian Richter <crichter@owncloud.com>
2022-11-24 15:11:38 +01:00
Christian Richter 724cf40c0e Add traces to search + indexer
Co-authored-by: André Duffeck <aduffeck@firondu.de?
Signed-off-by: Christian Richter <crichter@owncloud.com>
2022-11-24 12:38:09 +01:00
Andre Duffeck 60b979d1d4 Find spaces (#5044)
* Use the node name for the name field in the index instead of the path

* Add changelog

* Fix tests

* Adapt expected failures
2022-11-15 16:52:55 +01:00
Willy Kloucek d74b879d4f update tooling (#5038)
* update tooling

* update nodejs version
2022-11-11 15:01:45 +01:00
Andre Duffeck a0762e248e Filter hidden files (#5018)
* Filter hidden files from search results

* Add changelog

* Do not filter hidden files by default

* Set the hidden fields when converting fields to an entity

* Fix test
2022-11-10 17:05:43 +01:00
Andre Duffeck 925790d64b Dont reindex twice (#5001)
* Only reindex a space once at a time

* Add changelog
2022-11-08 15:20:38 +01:00