readur/src at 774efd1140f993f146e8680fdd612e6b13daa3f8 - readur - Gitea - ZeroTwo

mirror/readur

mirror of https://github.com/readur/readur.git synced 2026-01-04 05:20:11 -06:00


perf3ct 774efd1140 refactor(server): remove XML vs library comparison functionality

Remove all comparison-related code used to evaluate XML vs library-based
Office document extraction. The XML approach has proven superior, so the
comparison functionality is no longer needed.

Changes:
- Remove extraction_comparator.rs (entire comparison engine)
- Remove test_extraction_comparison.rs binary
- Remove comparison mode logic from enhanced.rs
- Simplify fallback_strategy.rs to use XML extraction only
- Update OCR service to use XML extraction as primary method
- Clean up database migration to remove comparison-specific settings
- Remove test_extraction binary from Cargo.toml
- Update integration tests to work with simplified extraction

The Office document extraction now flows directly to XML-based extraction
without any comparison checks, maintaining the superior extraction quality
while removing unnecessary complexity.
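
For readers unfamiliar with what "XML-based extraction" means for Office files, here is a minimal sketch, not the code from this repository. It assumes the `word/document.xml` part of a .docx has already been read out of the ZIP container into a string, and it collects the text of every `<w:t>` run, one line per `<w:p>` paragraph. The function names `extract_docx_text`, `paragraph_text`, and `decode_entities` are hypothetical and chosen only for illustration.

```rust
/// Decode the five predefined XML entities. `&amp;` is handled last so that
/// an escaped ampersand is not decoded twice.
fn decode_entities(s: &str) -> String {
    s.replace("&lt;", "<")
        .replace("&gt;", ">")
        .replace("&quot;", "\"")
        .replace("&apos;", "'")
        .replace("&amp;", "&")
}

/// Concatenate the contents of every `<w:t>` element found in one paragraph chunk.
fn paragraph_text(chunk: &str) -> String {
    let mut out = String::new();
    let mut rest = chunk;
    while let Some(start) = rest.find("<w:t") {
        let after = &rest[start + 4..];
        // Accept only `<w:t>` or `<w:t attr=...>`; skip `<w:tab/>`, `<w:tbl>`, etc.
        if !matches!(after.chars().next(), Some('>') | Some(' ')) {
            rest = after;
            continue;
        }
        let Some(gt) = after.find('>') else { break };
        // A self-closing `<w:t .../>` carries no text.
        if after[..gt].ends_with('/') {
            rest = &after[gt + 1..];
            continue;
        }
        let body = &after[gt + 1..];
        let Some(end) = body.find("</w:t>") else { break };
        out.push_str(&decode_entities(&body[..end]));
        rest = &body[end + "</w:t>".len()..];
    }
    out
}

/// Extract plain text from a WordprocessingML document body,
/// joining non-empty paragraphs with newlines.
pub fn extract_docx_text(document_xml: &str) -> String {
    document_xml
        .split("</w:p>")
        .map(paragraph_text)
        .filter(|p| !p.is_empty())
        .collect::<Vec<_>>()
        .join("\n")
}
```

A real extractor would use a proper ZIP reader and XML parser and would also handle tables, headers, footnotes, and the spreadsheet and presentation formats; string scanning is used here only to keep the sketch dependency-free.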

2025-09-02 01:22:19 +00:00


feat(storage): further support the s3 storage backend

2025-08-01 17:57:09 +00:00

feat(office): add library-based and xml-based parsing

2025-09-02 00:25:06 +00:00

fix(dev): remove unneeded docs

2025-08-13 20:51:13 +00:00

fix(dev): remove unneeded docs

2025-08-13 20:51:13 +00:00

metadata_extraction

feat(server): implement unit tests for source metadata extraction

2025-07-10 22:02:41 +00:00

feat(office): add library-based and xml-based parsing

2025-09-02 00:25:06 +00:00

fix(server): resolve import issues

2025-07-03 23:58:11 +00:00

refactor(server): remove XML vs library comparison functionality

2025-09-02 01:22:19 +00:00

feat(office): add library-based and xml-based parsing

2025-09-02 00:25:06 +00:00

feat(office): try to resolve docx/doc not working

2025-09-01 19:58:06 +00:00

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

feat(storage): further support the s3 storage backend

2025-08-01 17:57:09 +00:00

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

feat(security): this was just pain

2025-08-11 01:13:29 +00:00

auth.rs

feat(server): upgrade all versions and resolve breaking changes

2025-06-15 02:23:35 +00:00

config.rs

feat(storage): further support the s3 storage backend

2025-08-01 17:57:09 +00:00

db_guardrails_simple.rs

feat(server): create more DB guardrails, and lots of missing tests

2025-06-15 22:14:02 +00:00

db_guardrails.rs

fix(backend): lables handling

2025-06-19 19:47:49 +00:00

lib.rs

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

lib.rs.backup

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

main.rs

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

metadata_extraction.rs

feat(server): show source metadata EVEN better

2025-07-10 21:51:30 +00:00

mime_detection.rs

feat(server): do a *much* better job at determining file types thanks to infer rust package

2025-07-29 21:28:33 +00:00

oidc.rs

fix(tests): move oidc tests to correct folder

2025-06-27 19:33:58 +00:00

seed.rs

chore(server): remove unused system user

2025-06-19 00:41:01 +00:00

swagger.rs

feat(source): implement generic "SourceError" and then have it be propagated as "WebDAVerror", etc.

2025-08-17 22:05:58 +00:00

test_helpers.rs

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

test_utils.rs

feat(metrics): try to simplify webdav metrics some

2025-08-23 22:17:40 +00:00

webdav_xml_parser.rs

feat(server): do a *much* better job at determining file types thanks to infer rust package

2025-07-29 21:28:33 +00:00