Commit Graph

  • 8df3db4c77 WIP: add kanjivg data (add-kanjivg-data) h7x4 2026-03-03 13:47:59 +09:00
  • d1a6f39cca kanjidic: split grade/freq/jlpt into separate tables (main) h7x4 2026-04-09 14:10:51 +09:00
  • a222b2d9b8 jmdict: elementId instead of reading for element restriction tables h7x4 2026-04-08 19:57:07 +09:00
  • 6364457d9e docs/database: add some notes about elementId embeddings h7x4 2026-04-08 19:07:48 +09:00
  • 5d26b41524 jmdict: embed element type (k/r) into elementId h7x4 2026-04-08 19:05:02 +09:00
  • 114febbe02 docs/database: add some notes about JMdict_EntryScore h7x4 2026-04-08 18:14:28 +09:00
  • 20243dec09 docs: add docs about database schema choices h7x4 2026-04-08 17:38:08 +09:00
  • f6de8680ad jmdict: infer entryId from element ids h7x4 2026-04-08 17:37:52 +09:00
  • 99218a6987 jmdict: embed orderNum in senseId for senses h7x4 2026-04-08 16:38:42 +09:00
  • e8ee1ab944 data_ingestion/tanos-jlpt: remove redundant code import h7x4 2026-04-08 16:28:36 +09:00
  • 4f320e4ea9 jmdict: embed orderNum in elementId for kanji and readings h7x4 2026-04-08 16:27:19 +09:00
  • 9c9f5543c8 .gitea/workflows: fix 'update database inputs' step h7x4 2026-04-08 14:05:28 +09:00
  • be493a6150 .gitea/workflows: fix build-and-test job h7x4 2026-04-08 13:36:42 +09:00
  • 8d742b92be flake.nix: pull tanos jlpt data from datasources repo h7x4 2026-04-08 13:34:25 +09:00
  • 9b9c771eff flake.nix: pull datasources from datasources repo h7x4 2026-04-07 17:24:59 +09:00
  • eebeaba0e0 flake.nix: split off sqlite debugging tools into separate devshell h7x4 2026-04-06 10:21:11 +09:00
  • 7bacfc39a8 WIP sqlite-icu h7x4 2026-04-06 12:55:40 +09:00
  • c74a5f5cb6 flake.nix: build sqlite with ICU extension h7x4 2026-04-06 10:21:59 +09:00
  • 4fbed59143 flake.nix: split off sqlite debugging tools into separate devshell h7x4 2026-04-06 10:21:11 +09:00
  • 61ac226fc3 word_search_result: add getter for unusual kanji flag h7x4 2026-04-02 15:47:22 +09:00
  • ede57a7a00 docs: init h7x4 2026-04-01 16:48:40 +09:00
  • 2ad1e038f1 tanos-jlpt: remove flatten from xml stream h7x4 2026-04-01 16:04:41 +09:00
  • f40825de65 jmdict: skip inserting duplicate xrefs h7x4 2026-04-01 16:03:56 +09:00
  • 5aa068eaec flake.nix: add sqldiff to devshell h7x4 2026-04-01 15:27:25 +09:00
  • 170c3a853e flake.lock: bump, pubspec.lock: update inputs h7x4 2026-03-26 22:18:10 +09:00
  • df8b204d3c WIP: search/word_search: do a bit of performance optimization on result regrouping (word-regrouping-optimization) h7x4 2026-03-04 17:49:49 +09:00
  • c70838d1bf Add a basic benchmark h7x4 2026-03-04 19:00:57 +09:00
  • 0f7854a4fc migrations: add version tables for all data sources h7x4 2026-03-03 12:59:58 +09:00
  • a86f857553 util/romaji_transliteration: add functions to generate transliteration spans h7x4 2026-03-02 18:21:06 +09:00
  • d14e3909d4 search/filter_kanji: keep order when deduplicating h7x4 2026-03-02 17:37:45 +09:00
  • bb44bf786a tests: move const_data tests to test/const_data h7x4 2026-03-02 17:16:14 +09:00
  • ad3343a01e README: add link to coverage h7x4 2026-03-02 15:02:36 +09:00
  • 16d72e94ba WIP: .gitea/workflows: generate coverage h7x4 2026-03-02 14:34:08 +09:00
  • b070a1fd31 .gitea/workflows: merge build and test pipeline h7x4 2026-03-02 14:31:59 +09:00
  • dcf5c8ebe7 lemmatizer: implement equality for AllomorphPattern/LemmatizationRule h7x4 2026-03-02 11:47:46 +09:00
  • 1f8bc8bac5 lemmatizer: let LemmatizationRule.validChildClasses be a set h7x4 2026-03-02 11:46:40 +09:00
  • ab28b5788b search/word_search: fix english queries without pageSize/offset h7x4 2026-03-02 11:24:49 +09:00
  • dd7b2917dc flake.nix: add lcov to devshell h7x4 2026-03-02 10:50:24 +09:00
  • 74798c77b5 flake.nix: add libsqlite to LD_LIBRARY_PATH in devshell h7x4 2026-03-02 10:44:37 +09:00
  • 63a4caa626 lemmatizer/rules/ichidan: add informal conditionals h7x4 2026-03-02 10:34:50 +09:00
  • 374be5ca6b lemmatizer: add some basic tests h7x4 2026-03-02 09:31:36 +09:00
  • 4a6fd41f31 lemmatizer: misc small improvements h7x4 2026-03-02 09:30:01 +09:00
  • c06fff9e5a lemmatizer/rules: name all rules as separate static variables h7x4 2026-03-01 11:34:53 +09:00
  • 1d9928ade1 search/kanji: split queries into separate functions h7x4 2026-02-28 18:57:57 +09:00
  • 1a3b04be00 word_search_result: add romanization getters h7x4 2026-02-28 18:46:13 +09:00
  • c0c6f97a01 search/word_search: fix casing of SearchMode variants h7x4 2026-02-28 18:27:13 +09:00
  • a954188d5d Fix a few lints h7x4 2026-02-28 18:25:37 +09:00
  • 5b86d6eb67 README: add textual overview of the word search procedure h7x4 2026-02-28 14:52:22 +09:00
  • 72f31e974b dart format h7x4 2026-02-28 14:37:17 +09:00
  • e824dc0a22 search/word_search: split data queries into functions h7x4 2026-02-28 14:36:14 +09:00
  • f5bca61839 flake.lock: bump (v0.6.0) h7x4 2026-02-25 16:28:18 +09:00
  • 056aaaa0ce tests/search_match_inference: add more cases h7x4 2026-02-25 12:42:38 +09:00
  • a696ed9733 Generate matchspans for word search results h7x4 2026-02-24 16:54:37 +09:00
  • 00b963bfed .gitea/workflows/test: init h7x4 2026-02-24 20:39:35 +09:00
  • 4376012f18 pubspec.lock: update deps h7x4 2026-02-24 18:44:20 +09:00
  • 8ae1d882a0 Add TODO for word matching h7x4 2026-02-24 15:21:03 +09:00
  • 81db60ccf7 Add some docstrings h7x4 2026-02-24 15:13:33 +09:00
  • f57cc68ef3 search/radicals: deduplicate input radicals before search h7x4 2026-02-24 15:08:19 +09:00
  • 48f50628a1 Create empty() factory for word search results h7x4 2026-02-23 13:01:57 +09:00
  • 1783338b2a nix/database_tool: fix building h7x4 2026-02-21 00:49:53 +09:00
  • e92e99922b {flake.lock,pubspec.*}: bump h7x4 2026-02-21 00:49:24 +09:00
  • 05b56466e7 tanos-jlpt: fix breaking changes for csv parser h7x4 2026-02-21 00:46:24 +09:00
  • 33016ca751 flake.nix: comment out sqlint, currently broken due to dep build failure h7x4 2026-02-09 14:45:19 +09:00
  • 98d92d370d {flake.lock,pubspec.lock}: bump, source libsqlite via hooks h7x4 2026-02-09 14:44:14 +09:00
  • 5252936bdc flake.nix: filter more files from src h7x4 2026-02-09 14:40:53 +09:00
  • ac0cb14bbe flake.lock: bump, pubspec.lock: update inputs (v0.5.0) h7x4 2025-12-19 08:34:58 +09:00
  • 49a86f60ea .gitea/workflows: upload db as artifact h7x4 2025-12-19 08:27:46 +09:00
  • 9472156feb .gitea/workflows: update actions/checkout: v3 -> v6 h7x4 2025-12-08 18:46:26 +09:00
  • 4fbdba604e .gitea/workflows: run on debian-latest h7x4 2025-12-08 18:46:21 +09:00
  • 0cdfa2015e .gitea/workflows: add workflow for building database h7x4 2025-11-13 16:33:51 +09:00
  • a9ca9b08a5 flake.lock: bump, pubspec.lock: update inputs h7x4 2025-11-13 16:13:51 +09:00
  • 45e8181041 search/kanji: don't transliterate onyomi to katakana h7x4 2025-07-30 01:37:26 +02:00
  • 0d3ebc97f5 flake.lock: bump (mugiten-v0.4.0) h7x4 2025-07-17 00:21:48 +02:00
  • bb68319527 treewide: add and apply a bunch of lints h7x4 2025-07-17 00:21:36 +02:00
  • 2803db9c12 bin/query-word: fix default pagination h7x4 2025-07-16 18:32:47 +02:00
  • 93b76ed660 word_search: include data for cross references h7x4 2025-07-16 18:32:28 +02:00
  • 29a3a6aafb treewide: dart format h7x4 2025-07-16 15:23:04 +02:00
  • 3a2adf0367 pubspec.{yaml,lock}: update deps h7x4 2025-07-15 21:32:42 +02:00
  • eae6e881a7 flake.lock: bump h7x4 2025-07-15 21:32:35 +02:00
  • 0a3387e77a search: add function for fetching multiple kanji at once h7x4 2025-07-15 00:58:16 +02:00
  • f30465a33c search: add function for fetching multiple word entries by id at once h7x4 2025-07-14 19:16:36 +02:00
  • d9006a0767 word_search: fix count query h7x4 2025-07-13 20:34:39 +02:00
  • 1e1761ab4d pubspec.{yaml,lock}: update deps h7x4 2025-07-13 20:15:13 +02:00
  • 37d29fc6ad cli/query_word: add flags for pagination h7x4 2025-07-13 20:12:22 +02:00
  • 60898fe9a2 word_search: fix pagination h7x4 2025-07-13 20:12:10 +02:00
  • 5049157b02 cli/query_word: add --json flag h7x4 2025-07-13 16:27:11 +02:00
  • 1868c6fb41 word_search: don't throw error on empty results (mugiten-v0.3.1) h7x4 2025-07-09 14:57:19 +02:00
  • 4ee21d98e2 flake.lock: bump (mugiten-v0.3.0) h7x4 2025-07-08 20:37:16 +02:00
  • 7247af19cb word_search: always order exact matches first h7x4 2025-07-07 13:27:50 +02:00
  • ac7deae608 word_search: remove duplicate results h7x4 2025-07-07 12:47:20 +02:00
  • 0ca55f065f WIP: kanjidic: split news ichi gai nf into separate tables (split-news-ichi-gai-tables) h7x4 2025-06-25 20:50:12 +02:00
  • 7978b74f8d lib/{_data_ingestion/search}: store kanjidic onyomi as hiragana h7x4 2025-06-25 20:18:28 +02:00
  • 50870f64a0 cli/query_kanji: remove -k flag, use arguments h7x4 2025-06-25 20:18:27 +02:00
  • 62d77749e6 cli/query_word: allow querying with jmdict id h7x4 2025-06-25 20:18:27 +02:00
  • 80b3610a72 Store type enum as CHAR(1) h7x4 2025-06-25 20:18:27 +02:00
  • 54705c3c10 word_search: add TODO h7x4 2025-06-24 23:04:47 +02:00
  • c7134f0d06 flake.nix: filter src h7x4 2025-06-24 19:32:42 +02:00
  • aac9bf69f6 cli/create_db: return an erroneous exit on on error h7x4 2025-06-24 19:32:42 +02:00
  • 189d4a95cf test/word_search: cover more functionality h7x4 2025-06-24 19:32:42 +02:00
  • c32775ce7a use ids for \{kanji,reading\}Element tables h7x4 2025-06-24 01:01:07 +02:00