2026-02-13 - 2026-05-13
Overview
14 Issues closed from 1 user
Closed
#10 Store version numbers for the individual sources
Closed
#101 Externalize JMdict_Sense.type into a separate table
Closed
#100 Don't store JMdict_Sense.language when building with translations from a single language
Closed
#96 Use elementId for XREF__JMdict_KanjiElement__KANJIDIC_Character
Closed
#94 Don't store kanji + reading in xref tables
Closed
#65 Omit rows with basescore 0 in JMdict_EntryScore
Closed
#67 Create separate tables for KANJIDIC_Character's grade/frequency/jlpt
Closed
#85 Keep local copies of all datasources
Closed
#95 Use elementId to refer to sense restrictions
Closed
#98 Make kanji and reading elements unique from each other by embedding type
Closed
#97 Make elementId into a composite of entryId and the ordering number
Closed
#35 Word search: support globs
Closed
#60 Create deconjugation/lemmatization algorithm
Closed
#76 Generate "match spans", detailing where the search results matched the searchword
32 Issues created by 1 user
Opened
#75 Add audio samples
Opened
#76 Generate "match spans", detailing where the search results matched the searchword
Opened
#77 Generate diagram for the lemmatization transducer
Opened
#78 Add common followup verbs to lemmatizer
Opened
#79 Connect lemmatizer to word search
Opened
#80 Cross reference dictionary to aid lemmatizer
Opened
#81 Add 〜た/〜だ verb followups for lemmatizer
Opened
#82 Add common volitional verb followups
Opened
#83 Cache lemmatization results
Opened
#84 Reenable test concurrency
Opened
#85 Keep local copies of all datasources
Opened
#86 Add "kanji example word" mode to word search
Opened
#87 Add kanjivg data to sqlite db
Opened
#88 Add 漢検 level ratings to kanji
Opened
#89 Add wikipedia references for certain dictionary entries
Opened
#90 Support wildcards in word search
Opened
#91 Test json de/serialization roundtrip for all models
Opened
#92 Find datasource for idioms and fixed phrases
Opened
#93 Vendor custom SQLite build with ICU extension enabled
Opened
#94 Don't store kanji + reading in xref tables
Opened
#95 Use elementId to refer to sense restrictions
Opened
#96 Use elementId for XREF__JMdict_KanjiElement__KANJIDIC_Character
Opened
#97 Make elementId into a composite of entryId and the ordering number
Opened
#98 Make kanji and reading elements unique from each other by embedding type
Opened
#99 Disable constraints causing sqlite autoindex creation when compiling for production usage
Opened
#100 Don't store JMdict_Sense.language when building with translations from a single language
Opened
#101 Externalize JMdict_Sense.type into a separate table
Opened
#102 Retrieve JMdict_SenseGlossaryType information
Opened
#103 Get rid of JMdict_SenseGlossary_byPhrase index for production use
Opened
#104 Add 放送禁止用語 tags
Opened
#105 Register commit hash for 'datasources' repo
Opened
#106 Switch to the XML-NG version of JMDict (once it releases)
13 Unresolved Conversations
Open
#6
Create tool for diffing two instances of the database
Open
#23
Word search: kana type independence
Open
#13
Word search: automatically deconjugate words
Open
#54
Split out Kanji/Reading element's readingDoesNotMatchKanji/news/ichi/spec/gai/nf into separate tables
Open
#62
Add pairs of lookalike kanjis to use for word search
Open
#74
Create performance benchmarks
Open
#40
Word search: optimize word regrouping
Open
#63
Consider embedding vibrato morph analyzer
Open
#11
Add ENAMDICT/JMnedict
Open
#16
Find source for word pitch data
Open
#46
Create "furigana segmentation" (or "kanji/kana alignment") algorithm
Open
#18
Generate conjugation tables
Open
#73
Add kanji variant usage percentage