Commit Graph

14 Commits

Author SHA1 Message Date
Dominik Schmidt 7f83b5020d feat(location): add tika test 2023-12-06 15:13:09 +01:00
Dominik Schmidt 4c27b365fa feat(graph): add location facet to driveItems 2023-12-06 15:13:09 +01:00
Dominik Schmidt 07edd1e23a chore(search): move audio metadata extraction in tika to dedicated method 2023-12-06 15:13:09 +01:00
Dominik Schmidt db32fb46f9 Audio Metadata (#7490)
* Add audio facet to search protobuf message

* Add audio metadata to search index

* Return audio facet from search if available

* Store audio metadata in arbitrary metadata

* Add audio facet to driveItems listings

* Make tests coding style more consistent

* Fix tests

* Add changelog

* Make valueToString code more defensive

* Log status code as well
2023-11-06 08:56:46 +01:00
Florian Schade cdd2100b4b enhancement: improve content extraction stop word cleaning (#7553)
* enhancement: improve content extraction stop word cleaning

* fix: cleanup documentation

Co-authored-by: Martin <github@diemattels.at>

* fix: failing tika stop word unit tests

---------

Co-authored-by: Martin <github@diemattels.at>
2023-10-23 13:40:37 +02:00
Daniël Franke e43001fb96 Pass context for reva bump.
Also refresh the generated code.
2023-07-03 11:08:45 +02:00
Florian Schade 4f26424db6 [full-ci] enhancement: use reva client pool selectors (#6452)
* enhancement: use reva client pool selectors

register mock service to registry and pass tests

* enhancement: bump reva

* Fix a couple of linter issues

---------

Co-authored-by: Ralf Haferkamp <rhaferkamp@owncloud.com>
2023-06-08 12:41:04 +02:00
Andre Duffeck c0665975b3 Do not try to fulltext-index large files. (#6395)
* Do not try to extract the content of large files.

Both tika and ocis do not handle large files/archives very well.

* Apply suggestions from code review

Co-authored-by: Martin <github@diemattels.at>

---------

Co-authored-by: Martin <github@diemattels.at>
2023-05-27 19:03:58 +02:00
André Duffeck 82590632b6 Always use UTC for doc timestamps 2023-05-12 10:37:43 +02:00
André Duffeck ca637de017 Fix detecting resources that haven't changed when reindexing spaces
This is a regression introduced with the experimental search backport.
2023-04-19 17:44:16 +02:00
Daniël Franke d2076941f4 Add tests.
Also update mocks with mockery.
2023-01-23 14:38:12 +01:00
jkoberg 837eff9017 adjust for tags pkg
Signed-off-by: jkoberg <jkoberg@owncloud.com>

tmp

Signed-off-by: jkoberg <jkoberg@owncloud.com>
2022-12-23 10:55:57 +01:00
Florian Schade 1db03dd512 [full-ci] experimental tags backport (#5227)
* add tags to search service resource
add tags getTags, AssignTags and UnassignTags endpoint to graph
use and prefer search event spaceOwner over executant
add tags to search report response
update libre graph api
update reva

Co-authored-by: David Christofas <dchristofas@owncloud.com>
2022-12-19 15:44:02 +01:00
Florian Schade 2404eff48e [full-ci] experimental search backport (#5221)
* experimental search backport
fix basic extractor resource name
move escapeQuery regex into global variable
minor pr review changes
rename DebounceDuration env variable
add document title and content when rebuilding bleve resource

Co-authored-by: David Christofas <dchristofas@owncloud.com>
2022-12-13 14:22:41 +01:00