Commit Graph

240 Commits

Author SHA1 Message Date
oysteikt 34a30b3a35 benchmark: create separate benchmarks for jp and en search
Build and test / build (push) Has been cancelled
2026-04-13 20:18:43 +09:00
oysteikt 0bb542840c test/search/radical_search: skip failing tests for now
Build and test / build (push) Successful in 8m0s
2026-04-13 19:51:18 +09:00
oysteikt 507f1a21d8 jmdict: store glossary type in different table
Build and test / build (push) Failing after 6m34s
2026-04-13 19:43:25 +09:00
oysteikt 556d07913d jmdict: don't store glossary language 2026-04-13 19:42:11 +09:00
oysteikt 6165045ea7 migrations: simplify JMdict_CombinedEntryScore 2026-04-13 19:29:04 +09:00
oysteikt 316dff3b46 migrations: comment out unused jmdict <-> kanjidic xref table 2026-04-13 19:27:16 +09:00
oysteikt 747e680a02 migrations: remove some excessive indices 2026-04-13 19:27:16 +09:00
oysteikt 4f73e07056 test/search/radical_search: init 2026-04-13 19:12:15 +09:00
oysteikt 15540514f6 jmdict: don't store kanji + reading for xrefs
Build and test / build (push) Successful in 7m0s
2026-04-13 18:33:06 +09:00
oysteikt 4faf543d6e jmdict: don't store empty entry scores
Build and test / build (push) Successful in 8m17s
2026-04-13 18:18:48 +09:00
oysteikt d1a6f39cca kanjidic: split grade/freq/jlpt into separate tables
Build and test / build (push) Successful in 8m1s
2026-04-09 14:10:51 +09:00
oysteikt a222b2d9b8 jmdict: elementId instead of reading for element restriction tables
Build and test / build (push) Successful in 8m0s
2026-04-08 19:57:07 +09:00
oysteikt 6364457d9e docs/database: add some notes about elementId embeddings
Build and test / build (push) Successful in 7m56s
2026-04-08 19:07:48 +09:00
oysteikt 5d26b41524 jmdict: embed element type (k/r) into elementId 2026-04-08 19:05:02 +09:00
oysteikt 114febbe02 docs/database: add some notes about JMdict_EntryScore
Build and test / build (push) Successful in 9m59s
2026-04-08 18:14:28 +09:00
oysteikt 20243dec09 docs: add docs about database schema choices
Build and test / build (push) Successful in 8m35s
2026-04-08 18:01:16 +09:00
oysteikt f6de8680ad jmdict: infer entryId from element ids 2026-04-08 18:01:16 +09:00
oysteikt 99218a6987 jmdict: embed orderNum in senseId for senses 2026-04-08 17:21:50 +09:00
oysteikt e8ee1ab944 data_ingestion/tanos-jlpt: remove redundant code import 2026-04-08 17:21:49 +09:00
oysteikt 4f320e4ea9 jmdict: embed orderNum in elementId for kanji and readings 2026-04-08 17:21:49 +09:00
oysteikt 9c9f5543c8 .gitea/workflows: fix 'update database inputs' step
Build and test / build (push) Successful in 8m59s
2026-04-08 14:05:28 +09:00
oysteikt be493a6150 .gitea/workflows: fix build-and-test job
Build and test / build (push) Successful in 8m59s
2026-04-08 13:36:42 +09:00
oysteikt 8d742b92be flake.nix: pull tanos jlpt data from datasources repo
Build and test / build (push) Has been cancelled
2026-04-08 13:34:25 +09:00
oysteikt 9b9c771eff flake.nix: pull datasources from datasources repo
Build and test / build (push) Failing after 8m19s
2026-04-07 17:26:47 +09:00
oysteikt eebeaba0e0 flake.nix: split off sqlite debugging tools into separate devshell
Build and test / build (push) Failing after 11m9s
2026-04-06 12:56:17 +09:00
oysteikt 61ac226fc3 word_search_result: add getter for unusual kanji flag
Build and test / build (push) Successful in 11m48s
2026-04-02 15:53:39 +09:00
oysteikt ede57a7a00 docs: init
Build and test / build (push) Successful in 11m51s
2026-04-01 16:48:40 +09:00
oysteikt 2ad1e038f1 tanos-jlpt: remove flatten from xml stream
Build and test / build (push) Successful in 13m41s
This was earlier used to compensate for a double nesting bug. This has
been fixed in the latest version of the xml package.
2026-04-01 16:04:44 +09:00
oysteikt f40825de65 jmdict: skip inserting duplicate xrefs 2026-04-01 16:03:56 +09:00
oysteikt 5aa068eaec flake.nix: add sqldiff to devshell
Build and test / build (push) Failing after 12m0s
2026-04-01 15:27:25 +09:00
oysteikt 170c3a853e flake.lock: bump, pubspec.lock: update inputs
Build and test / build (push) Failing after 10m4s
2026-03-26 22:18:10 +09:00
oysteikt c70838d1bf Add a basic benchmark
Build and test / build (push) Successful in 13m30s
2026-03-04 19:00:57 +09:00
oysteikt 0f7854a4fc migrations: add version tables for all data sources
Build and test / evals (push) Successful in 11m34s
2026-03-03 12:59:58 +09:00
oysteikt a86f857553 util/romaji_transliteration: add functions to generate transliteration spans
Build and test / evals (push) Successful in 18m58s
2026-03-02 18:23:36 +09:00
oysteikt d14e3909d4 search/filter_kanji: keep order when deduplicating
Build and test / evals (push) Successful in 13m33s
2026-03-02 17:37:45 +09:00
oysteikt bb44bf786a tests: move const_data tests to test/const_data
Build and test / evals (push) Successful in 11m38s
2026-03-02 17:16:14 +09:00
oysteikt ad3343a01e README: add link to coverage
Build and test / evals (push) Successful in 13m25s
2026-03-02 15:02:36 +09:00
oysteikt 16d72e94ba WIP: .gitea/workflows: generate coverage
Build and test / evals (push) Successful in 13m17s
2026-03-02 14:34:08 +09:00
oysteikt b070a1fd31 .gitea/workflows: merge build and test pipeline 2026-03-02 14:31:59 +09:00
oysteikt dcf5c8ebe7 lemmatizer: implement equality for AllomorphPattern/LemmatizationRule 2026-03-02 12:01:13 +09:00
oysteikt 1f8bc8bac5 lemmatizer: let LemmatizationRule.validChildClasses be a set 2026-03-02 12:01:13 +09:00
oysteikt ab28b5788b search/word_search: fix english queries without pageSize/offset 2026-03-02 12:01:13 +09:00
oysteikt dd7b2917dc flake.nix: add lcov to devshell 2026-03-02 12:01:13 +09:00
oysteikt 74798c77b5 flake.nix: add libsqlite to LD_LIBRARY_PATH in devshell 2026-03-02 12:01:12 +09:00
oysteikt 63a4caa626 lemmatizer/rules/ichidan: add informal conditionals 2026-03-02 12:01:12 +09:00
oysteikt 374be5ca6b lemmatizer: add some basic tests 2026-03-02 12:01:12 +09:00
oysteikt 4a6fd41f31 lemmatizer: misc small improvements 2026-03-02 12:01:12 +09:00
oysteikt c06fff9e5a lemmatizer/rules: name all rules as separate static variables 2026-03-02 12:01:12 +09:00
oysteikt 1d9928ade1 search/kanji: split queries into separate functions 2026-03-02 12:01:11 +09:00
oysteikt 1a3b04be00 word_search_result: add romanization getters 2026-03-02 12:01:11 +09:00