Kango-enrichment

A simple script that generates readings and translations for a list of Japanese kango.

Examples and usage

By default, the script takes a CSV file with three columns as input. The file should have the headers kango, reading, and translation. The reading and/or translation values may be missing in some or all rows. The output is a CSV file in the same format, with the missing cells filled in where possible. If the -T option is given, the output is instead a JSON file of the following format:

[
{  // word 1
  'kango': 'some_japanese_hieroglyphs',
  'reading': 'hiragana_reading',
  'translation': 'english_translation_of_the_word',
  'examples': ['example 1 from tatoeba',
            // Some more examples
            'example n from tatoeba']
},
// some more words
// ...
]
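As a rough illustration of the layout above, the following sketch turns already enriched CSV text into that JSON structure. The function name `rows_to_json` and the `examples_for` callback are hypothetical and are not taken from the script; in the real tool the examples presumably come from tatoeba_links.db. Note that strict JSON uses double quotes and has no comments, so the actual file will differ from the annotated illustration in those respects.

```python
import csv
import io
import json


def rows_to_json(csv_text, examples_for=lambda kango: []):
    """Convert enriched CSV text into a JSON string of word records.

    `examples_for` is a hypothetical callback returning example
    sentences for a word; by default no examples are attached.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    words = []
    for row in reader:
        words.append({
            "kango": row["kango"],
            "reading": row.get("reading") or "",
            "translation": row.get("translation") or "",
            "examples": list(examples_for(row["kango"])),
        })
    # ensure_ascii=False keeps kanji and kana readable in the output.
    return json.dumps(words, ensure_ascii=False, indent=2)
```

Passing the examples lookup as a callback keeps the sketch independent of how the example sentences are actually stored.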

sample_input.csv

kango	reading	translation
先生

sample_output.csv

kango	reading	translation
先生	せんせい	Teacher, doctor, master
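The step from the sample input to the sample output can be sketched as follows. This is a minimal illustration, not the script's actual code: `fill_missing` and the two lookup callbacks are hypothetical names, and the sketch assumes comma-separated data. In the real script the reading lookup is presumably backed by pykakasi and the translation lookup by the DSL dictionary.

```python
import csv
import io

FIELDS = ["kango", "reading", "translation"]


def fill_missing(csv_text, lookup_reading, lookup_translation):
    """Return CSV text with empty reading/translation cells filled in.

    Both lookups are hypothetical callbacks that take a word and return
    a string, or None when nothing is found.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        kango = row["kango"]
        # Existing values are kept; only missing or empty cells are looked up.
        reading = row.get("reading") or lookup_reading(kango) or ""
        translation = row.get("translation") or lookup_translation(kango) or ""
        writer.writerow(
            {"kango": kango, "reading": reading, "translation": translation}
        )
    return out.getvalue()
```

Cells that cannot be resolved stay empty, which matches the "filled where possible" behaviour described earlier.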

Default usage would be:

python csv_modifier.py sample_input.csv sample_output.csv

N.B. If the output file exists, it will be overwritten.
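For orientation, a command line of this shape could be parsed as in the sketch below. It assumes argparse is used, which the README does not confirm; the names `build_parser`, `input_csv`, `output_file`, and `as_json` are hypothetical. Only the two positional paths and the -T option come from the text above.

```python
import argparse


def build_parser():
    """Build a parser for: script.py [-T] INPUT OUTPUT (hypothetical layout)."""
    parser = argparse.ArgumentParser(
        description="Fill in missing readings and translations for kango."
    )
    parser.add_argument(
        "input_csv", help="CSV with kango, reading, translation columns"
    )
    parser.add_argument(
        "output_file", help="destination file; overwritten if it exists"
    )
    parser.add_argument(
        "-T",
        action="store_true",
        dest="as_json",
        help="write JSON with example sentences instead of CSV",
    )
    return parser
```

With this layout, `build_parser().parse_args(["-T", "in.csv", "out.json"])` yields a namespace whose `as_json` attribute is true.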

dsl_search.py can be used to look up individual entries in the DSL file; however, programs such as GoldenDict are much better suited to exploring DSL dictionaries.
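A lookup of this kind can be sketched as below. This is not the code of dsl_search.py; `search_dsl` is a hypothetical name, and the sketch assumes the usual DSL layout: directive lines start with `#`, headwords start at the first column, and the body lines of an entry are indented. Whether japan2.dsl follows that layout exactly is an assumption.

```python
def search_dsl(lines, headword):
    """Return the body lines of the first entry matching `headword` exactly.

    `lines` is any iterable of text lines, for example an open file.
    """
    body = []
    in_entry = False
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip() or line.startswith("#"):
            # Blank lines and '#' directives carry no entry text.
            continue
        if line[0] in " \t":
            # Indented lines belong to the body of the current entry.
            if in_entry:
                body.append(line.strip())
        else:
            # An unindented line is a headword.
            if in_entry and body:
                break  # the matching entry is complete
            # Consecutive headwords share the body that follows them.
            in_entry = in_entry or line == headword
    return body
```

DSL files are often stored as UTF-16, so when reading from disk the file may need to be opened with a matching `encoding` argument. The returned lines still contain DSL markup tags, which a dictionary viewer would normally render.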

Dependencies

pykakasi
tatoeba_links.db provided by Tatoeba Project under the Creative Commons license

DrNightingales/Kango-enrichment
