2025-05-13 - 2026-05-13

Overview

0 Active Pull Requests
80 Active Issues
Excluding merges, 1 author has pushed 192 commits to main and 202 commits to all branches. On main, 153 files have changed and there have been 12010 additions and 15998 deletions.

34 Issues closed from 1 user

Closed #10 Store version numbers for the individual sources 2026-04-13 14:00:52 +02:00

Closed #100 Don't store JMdict_Sense.language when building with translations from a single language 2026-04-13 12:44:31 +02:00

Closed #101 Externalize JMdict_Sense.type into a separate table 2026-04-13 12:44:31 +02:00

Closed #96 Use elementId for XREF__JMdict_KanjiElement__KANJIDIC_Character 2026-04-13 12:15:08 +02:00

Closed #94 Don't store kanji + reading in xref tables 2026-04-13 11:33:14 +02:00

Closed #65 Omit rows with basescore 0 in JMdict_EntryScore 2026-04-13 11:26:09 +02:00

Closed #67 Create separate tables for KANJIDIC_Character's grade/frequency/jlpt 2026-04-09 07:10:58 +02:00

Closed #85 Keep local copies of all datasources 2026-04-08 20:23:09 +02:00

Closed #95 Use elementId to refer to sense restrictions 2026-04-08 12:57:28 +02:00

Closed #98 Make kanji and reading elements unique from each other by embedding type 2026-04-08 12:08:25 +02:00

Closed #97 Make elementId into a composite of entryId and the ordering number 2026-04-08 10:38:38 +02:00

Closed #35 Word search: support globs 2026-03-02 09:39:51 +01:00

Closed #60 Create deconjugation/lemmatization algorithm 2026-03-02 02:18:46 +01:00

Closed #76 Generate "match spans", detailing where the search results matched the searchword 2026-02-24 13:27:52 +01:00

Closed #71 Always order exact matches first in word search, no matter commonness 2025-07-07 13:28:03 +02:00

Closed #70 Remove duplicates from word search join 2025-07-07 12:49:58 +02:00

Closed #21 Store KANJIDIC onyomi as hiragana 2025-06-25 20:19:34 +02:00

Closed #41 Create query cli tool for querying a single JMdict entryId 2025-06-25 20:19:29 +02:00

Closed #64 Reduce size of type enum by reducing to char(1) (r, k) 2025-06-25 20:19:22 +02:00

Closed #52 Create integer ids for Reading/Kanji elements to reduce space usage 2025-06-24 20:07:22 +02:00

Closed #57 Filter inputs for nix source 2025-06-24 20:06:33 +02:00

Closed #61 Export function for creating an empty database 2025-06-23 21:12:13 +02:00

Closed #55 Order english queries by score 2025-05-23 15:28:12 +02:00

Closed #48 Add function to filter kanji from a string by what's available in the dictionary 2025-05-22 19:45:31 +02:00

Closed #44 Precalculate search scores with a table and a bunch of triggers 2025-05-22 16:58:30 +02:00

Closed #50 List words by JLPT 2025-05-22 16:07:34 +02:00

Closed #49 List kanji by JLPT 2025-05-22 16:07:27 +02:00

Closed #29 Create radical search which retrieves remaining possible combinations 2025-05-21 14:28:57 +02:00

Closed #47 Add orderNum to KANJIDIC kunyomi, onyomi and meaning 2025-05-21 13:36:31 +02:00

Closed #42 Renormalize KANJIDIC radical data 2025-05-19 19:53:14 +02:00

Closed #39 Word search: pagination 2025-05-16 18:48:05 +02:00

Closed #31 Add JLPT tags to WordSearchResult 2025-05-14 17:14:16 +02:00

Closed #32 Extend language source data for WordSearchSense (upon search) 2025-05-14 17:14:16 +02:00

Closed #30 Add basic info (reading, kanji) to word search xrefs (antonyms, seealsos) 2025-05-14 17:14:16 +02:00

77 Issues created by 1 user

Opened #30 Add basic info (reading, kanji) to word search xrefs (antonyms, seealsos) 2025-05-13 23:29:08 +02:00

Opened #31 Add JLPT tags to WordSearchResult 2025-05-13 23:29:35 +02:00

Opened #32 Extend language source data for WordSearchSense (upon search) 2025-05-13 23:46:03 +02:00

Opened #33 Ensure consistent naming scheme for tables 2025-05-14 14:10:00 +02:00

Opened #34 Group kanji readings/meanings by rmgroup parent nodes 2025-05-14 23:13:11 +02:00

Opened #35 Word search: support globs 2025-05-14 23:13:47 +02:00

Opened #36 Word search: support a variety of tags 2025-05-14 23:14:14 +02:00

Opened #37 Word search: support mixed input (kanji, kana, romaji) 2025-05-14 23:14:37 +02:00

Opened #38 Add wanikani levels 2025-05-14 23:15:38 +02:00

Opened #39 Word search: pagination 2025-05-15 09:09:45 +02:00

Opened #40 Word search: optimize word regrouping 2025-05-16 14:36:34 +02:00

Opened #41 Create query cli tool for querying a single JMdict entryId 2025-05-16 17:57:07 +02:00

Opened #42 Renormalize KANJIDIC radical data 2025-05-19 10:13:42 +02:00

Opened #43 Add additional radical data from KanjiAlive 2025-05-19 10:13:44 +02:00

Opened #44 Precalculate search scores with a table and a bunch of triggers 2025-05-19 10:23:14 +02:00

Opened #45 Add nix package for dart doc 2025-05-19 15:03:40 +02:00

Opened #46 Create "furigana segmentation" (or "kanji/kana alignment") algorithm 2025-05-19 15:34:22 +02:00

Opened #47 Add orderNum to KANJIDIC kunyomi, onyomi and meaning 2025-05-19 19:55:09 +02:00

Opened #48 Add function to filter kanji from a string by what's available in the dictionary 2025-05-21 19:33:41 +02:00

Opened #49 List kanji by JLPT 2025-05-21 19:33:55 +02:00

Opened #50 List words by JLPT 2025-05-21 19:34:04 +02:00

Opened #51 Measure time taken during substeps of data ingestion 2025-05-22 15:29:00 +02:00

Opened #52 Create integer ids for Reading/Kanji elements to reduce space usage 2025-05-22 15:32:03 +02:00

Opened #53 Find a better way to manage/order migrations 2025-05-22 15:48:12 +02:00

Opened #54 Split out Kanji/Reading element's readingDoesNotMatchKanji/news/ichi/spec/gai/nf into separate tables 2025-05-22 16:06:36 +02:00

Opened #55 Order english queries by score 2025-05-22 16:18:37 +02:00

Opened #56 Validate input before using it in FTS5 queries 2025-05-22 22:00:04 +02:00

Opened #57 Filter inputs for nix source 2025-05-22 22:59:20 +02:00

Opened #58 Add an option to only ingest a subset of data for development speed 2025-05-23 11:09:02 +02:00

Opened #59 Add developer option to log EXPLAIN QUERY output 2025-05-23 14:41:46 +02:00

Opened #60 Create deconjugation/lemmatization algorithm 2025-05-23 21:58:05 +02:00

Opened #61 Export function for creating an empty database 2025-06-20 21:24:09 +02:00

Opened #62 Add pairs of lookalike kanjis to use for word search 2025-06-22 14:32:37 +02:00

Opened #63 Consider embedding vibrato morph analyzer 2025-06-23 13:00:25 +02:00

Opened #64 Reduce size of type enum by reducing to char(1) (r, k) 2025-06-25 08:53:22 +02:00

Opened #65 Omit rows with basescore 0 in JMdict_EntryScore 2025-06-25 09:15:53 +02:00

Opened #66 Use INTEGER for JMdict_JLPTTag.jlptLevel 2025-06-25 10:42:14 +02:00

Opened #67 Create separate tables for KANJIDIC_Character's grade/frequency/jlpt 2025-06-25 10:43:40 +02:00

Opened #68 Use integer enum for KANJIDIC_Codepoint.type 2025-06-25 10:44:25 +02:00

Opened #69 Figure out what's up with the scoring here 2025-07-03 16:52:40 +02:00

Opened #70 Remove duplicates from word search join 2025-07-07 00:19:28 +02:00

Opened #71 Always order exact matches first in word search, no matter commonness 2025-07-07 12:51:21 +02:00

Opened #72 Improve ambiguous crossreferences by choosing those that share kanji first 2025-07-19 02:34:51 +02:00

Opened #73 Add kanji variant usage percentage 2025-07-26 05:54:50 +02:00

Opened #74 Create performance benchmarks 2026-02-09 06:50:43 +01:00

Opened #75 Add audio samples 2026-02-22 09:37:41 +01:00

Opened #76 Generate "match spans", detailing where the search results matched the searchword 2026-02-24 09:21:52 +01:00

Opened #77 Generate diagram for the lemmatization transducer 2026-03-01 04:05:24 +01:00

Opened #78 Add common followup verbs to lemmatizer 2026-03-02 02:13:53 +01:00

Opened #79 Connect lemmatizer to word search 2026-03-02 02:16:49 +01:00

Opened #80 Cross reference dictionary to aid lemmatizer 2026-03-02 02:18:27 +01:00

Opened #81 Add 〜た/〜だ verb followups for lemmatizer 2026-03-02 02:23:10 +01:00

Opened #82 Add common volitional verb followups 2026-03-02 02:32:02 +01:00

Opened #83 Cache lemmatization results 2026-03-02 02:38:27 +01:00

Opened #84 Reenable test concurrency 2026-03-02 06:44:10 +01:00

Opened #85 Keep local copies of all datasources 2026-03-02 06:53:39 +01:00

Opened #86 Add "kanji example word" mode to word search 2026-03-02 07:06:58 +01:00

Opened #87 Add kanjivg data to sqlite db 2026-03-02 07:12:43 +01:00

Opened #88 Add 漢検 level ratings to kanji 2026-03-02 07:33:35 +01:00

Opened #89 Add wikipedia references for certain dictionary entries 2026-03-02 07:43:18 +01:00

Opened #90 Support wildcards in word search 2026-03-02 08:56:08 +01:00

Opened #91 Test json de/serialization roundtrip for all models 2026-03-02 10:27:05 +01:00

Opened #92 Find datasource for idioms and fixed phrases 2026-03-10 02:32:43 +01:00

Opened #93 Vendor custom SQLite build with ICU extension enabled 2026-04-06 02:04:37 +02:00

Opened #94 Don't store kanji + reading in xref tables 2026-04-06 05:35:20 +02:00

Opened #95 Use elementId to refer to sense restrictions 2026-04-06 05:38:17 +02:00

Opened #96 Use elementId for XREF__JMdict_KanjiElement__KANJIDIC_Character 2026-04-06 05:39:49 +02:00

Opened #97 Make elementId into a composite of entryId and the ordering number 2026-04-08 08:35:37 +02:00

Opened #98 Make kanji and reading elements unique from each other by embedding type 2026-04-08 11:11:56 +02:00

Opened #99 Disable constraints causing sqlite autoindex creation when compiling for production usage 2026-04-08 13:47:32 +02:00

Opened #100 Don't store JMdict_Sense.language when building with translations from a single language 2026-04-13 12:20:46 +02:00

Opened #101 Externalize JMdict_Sense.type into a separate table 2026-04-13 12:21:31 +02:00

Opened #102 Retrieve JMdict_SenseGlossaryType information 2026-04-13 12:37:42 +02:00

Opened #103 Get rid of JMdict_SenseGlossary_byPhrase index for production use 2026-04-13 13:23:20 +02:00

Opened #104 Add 放送禁止用語 tags 2026-04-22 22:51:12 +02:00

Opened #105 Register commit hash for 'datasources' repo 2026-04-22 22:52:58 +02:00

Opened #106 Switch to the XML-NG version of JMdict (once it releases) 2026-05-05 16:59:29 +02:00

18 Unresolved Conversations

Open #6 Create tool for diffing two instances of the database 2026-04-08 08:37:25 +02:00

Open #23 Word search: kana type independence 2026-04-06 02:06:50 +02:00

Open #13 Word search: automatically deconjugate words 2026-04-03 13:39:00 +02:00

Open #11 Add ENAMDICT/JMnedict 2026-03-02 07:39:46 +01:00

Open #16 Find source for word pitch data 2026-03-02 07:36:23 +01:00

Open #18 Generate conjugation tables 2026-03-02 02:33:26 +01:00

Open #22 Kanji search: query for example words 2025-06-23 10:52:20 +02:00

Open #25 Add system for normalizing kanji variants 2025-06-23 10:52:05 +02:00

Open #5 Add tatoeba sentences 2025-06-23 10:51:54 +02:00

Open #27 Deal with remaining characters from kana -> romaji transliteration 2025-06-23 10:51:50 +02:00

Open #28 Duplicate kana on , , , during K -> R transliteration 2025-06-23 10:51:38 +02:00

Open #26 Add source of grammar data 2025-06-23 10:51:32 +02:00

Open #20 Normalize numbers and other symbols during search 2025-06-23 10:45:01 +02:00

Open #19 Word search: add levenshtein thresholds 2025-06-23 10:44:59 +02:00

Open #17 Add kanji freq data from https://scriptin.github.io/kanji-frequency/ 2025-06-23 10:44:53 +02:00

Open #12 Add stroke count data from radkfile 2025-06-23 10:44:42 +02:00

Open #9 Add progress bars while creating the database 2025-06-23 10:44:29 +02:00

Open #7 Build with CI 2025-06-23 10:44:24 +02:00