Commit Graph

53 Commits

Author SHA1 Message Date
080638e7ef lib/search/word_search: assert no missing rubys 2025-05-16 21:30:33 +02:00
3f267b78d3 lib: format 2025-05-16 21:06:19 +02:00
84ae7eca9e lib/search/word_search: move some score calculation to virtual column 2025-05-16 21:03:37 +02:00
45c4c5f09a lib/cli/query-word: stringify 2025-05-16 18:47:26 +02:00
369fcdbd4b lib/search/word_search: fix rubification 2025-05-16 18:46:57 +02:00
33cf5028f4 lib/search/word_search: score and order results by several metrics 2025-05-16 18:46:39 +02:00
bb9550380c lib/search/word_search: limit entry id query to single column 2025-05-16 17:06:01 +02:00
3680827176 lib/search/word_search: use const table names 2025-05-16 17:06:01 +02:00
9d9ce840fa lib/search/word_search: split 2025-05-16 17:06:00 +02:00
6c580e95e2 lib/search/word_search: pagination 2025-05-16 17:06:00 +02:00
2db99e76f0 lib/_data_ingestion: perform VACUUM after data ingestion 2025-05-16 17:06:00 +02:00
42db69e57a lib/cli/create_db: make WAL mode optional 2025-05-16 17:05:59 +02:00
4407c06f12 lib/cli: add timing logs to query commands 2025-05-16 17:05:59 +02:00
fc0956d5c3 lib/text_filtering: add kanjiRegex 2025-05-16 17:05:57 +02:00
d7f7f9cd19 Filter data out into separate files 2025-05-16 09:22:28 +02:00
cf95f85caa Add kanji filtering utility 2025-05-16 09:17:18 +02:00
f278b34415 lib/_data_ingestion: enable WAL 2025-05-15 22:31:54 +02:00
b07fc8f4b3 lib/word_search: calculate isCommon 2025-05-14 20:59:57 +02:00
8299572225 migrations: add view for base/furigana queries 2025-05-14 17:12:31 +02:00
78ba1bae1a lib/search: query readings for xrefs 2025-05-14 17:12:31 +02:00
87383c8951 lib/search: query jlpt tags on word search 2025-05-14 17:12:31 +02:00
cd9b318956 lib/cli/create_db: await seeding data 2025-05-14 17:12:30 +02:00
96f52b5860 lib/_data_ingestion/tanos-jlpt: report sql errors 2025-05-14 17:12:30 +02:00
59e8db5add Query more detailed information about langauge source 2025-05-14 17:12:30 +02:00
9038119eb7 Consistently use senseId name everywhere 2025-05-14 17:12:29 +02:00
3290d5dc91 Consistently use entryId name everywhere 2025-05-14 17:12:29 +02:00
2980bcda06 lib/_data_ingestion/jmdict: format 2025-05-14 17:12:29 +02:00
581f9daf25 lib/_data_ingestion: add phrases for language source data 2025-05-14 17:12:28 +02:00
9898793bca Convert entryBy* tables into views 2025-05-14 17:12:28 +02:00
10e74e3f43 Add tanos jlpt tags to database 2025-05-13 21:32:22 +02:00
c43a29a398 lib/_data_ingestion: don't return results on batch db commits 2025-05-13 21:26:17 +02:00
c379bc8019 lib/_data_ingestion: await RADKFILE seeding 2025-05-13 11:51:56 +02:00
8ddba55e43 lib/util: move escapeStringValue to sqlite utils 2025-05-13 11:51:55 +02:00
39f5826eaa lib/_data_ingestion: improve openLocalDB 2025-04-30 10:00:22 +02:00
1fdcb7a477 bin/jadb: split 2025-04-30 08:48:00 +02:00
4357485870 lib/util/romaji_trx: add and 2025-04-29 13:58:25 +02:00
0214473120 lib/_data_ingestion: split parsing and seeding steps 2025-04-29 13:29:31 +02:00
38706c0532 lib/util: add several utils from jptools 2025-04-29 12:14:40 +02:00
929be590ee Move gitignored data sources to data/tmp 2025-04-29 10:33:11 +02:00
946adb90cc Add jouyou kanji as constant data 2025-04-28 21:57:53 +02:00
b6410c717f Implement word search 2025-04-28 21:53:09 +02:00
191594f0bc _data_ingestion: add order numbers for readings and kanji 2025-04-26 14:44:53 +02:00
19cdfb8434 lib/_data_ingestion/jmdict: mark ambiguity in xrefs 2025-04-23 13:26:29 +02:00
8496c3add0 lib: format 2025-04-23 11:20:06 +02:00
bf0d20dde8 lib/_data_ingestion: remove invalid use of findAllElements 2025-04-23 11:19:56 +02:00
bd0dc9aad9 lib/search: continue work on jmdict search 2025-04-22 20:04:52 +02:00
7544013ffd lib: improve jmdict ingestion, update constants 2025-04-22 19:10:53 +02:00
84e80fe876 lib: implement stub for word search 2025-04-19 12:12:07 +02:00
68eb46174f lib: implement basic kanji search 2025-04-19 01:21:18 +02:00
23f90c1127 data_ingestion: fix kanji grade, jlpt, newspaper rank ingestion 2025-04-19 01:18:10 +02:00