Gukhanmun Hong Minhee
winget install --id=HongMinhee.Gukhanmun -e
Gukhanmun is a software tool that converts Korean text written in mixed script (hanja characters interleaved with hangul) into hangul-only text. The conversion preserves the meaning of the original text while making it more accessible and easier to read.
Key Features:
- Lattice segmentation for accurate word splitting, avoiding common pitfalls such as segmenting 行事場所 as 行事場 + 所 instead of the intended 行事 + 場所.
- Pluggable dictionary support, including in-memory maps, FST files, and CDB formats, with precompiled dictionaries for the Standard Korean Language Dictionary and Open Korean Dictionary.
- Format adapters for plain text, HTML, and Markdown, ensuring compatibility with various content types.
- Five rendering modes: hangul-only, hangul(hanja) parentheses, hanja(hangul) parentheses, ruby markup, and selective glossing of mixed script.
- Streaming-first design that minimizes buffering, making it suitable for large documents or real-time processing.
- Support for the initial sound law in South Korean orthography during fallback conversions.
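The lattice segmentation idea from the feature list can be sketched roughly as follows. This is a minimal Python illustration, not Gukhanmun's actual implementation or API: the dictionary entries, the per-character fallback table, the cost values, and the function names are all assumptions made for the example. It contrasts a greedy longest-match splitter with a shortest-path search over every candidate word, where single-character fallbacks are penalized.

```python
# Toy data (hypothetical; not Gukhanmun's shipped dictionaries).
TOY_DICT = {"行事": "행사", "行事場": "행사장", "場所": "장소"}
# Per-character readings used only when no dictionary word applies.
CHAR_READINGS = {"行": "행", "事": "사", "場": "장", "所": "소"}

WORD_COST = 1       # assumed cost of taking a dictionary word
FALLBACK_COST = 10  # assumed cost of a single-character fallback


def greedy_segment(text):
    """Longest-match-first segmentation, shown only for contrast."""
    tokens, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):
            if text[i:j] in TOY_DICT:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No dictionary word starts here: emit one character.
            tokens.append(text[i])
            i += 1
    return tokens


def lattice_segment(text):
    """Choose the cheapest path through the lattice of candidate words."""
    n = len(text)
    # best[i] holds (total cost, tokens) for the prefix text[:i].
    best = [(0, [])] + [None] * n
    for i in range(n):
        cost, tokens = best[i]
        # A one-character fallback edge always exists, so every
        # position is reachable.
        candidates = [(text[i], FALLBACK_COST)]
        candidates += [
            (text[i:j], WORD_COST)
            for j in range(i + 1, n + 1)
            if text[i:j] in TOY_DICT
        ]
        for word, step in candidates:
            j = i + len(word)
            if best[j] is None or cost + step < best[j][0]:
                best[j] = (cost + step, tokens + [word])
    return best[n][1]


def readings(tokens):
    """Pair each token with its hangul reading from the toy tables."""
    return [(t, TOY_DICT.get(t) or CHAR_READINGS.get(t, t)) for t in tokens]


if __name__ == "__main__":
    print(greedy_segment("行事場所"))
    print(lattice_segment("行事場所"))
    print(readings(lattice_segment("行事場所")))
```

With this toy data, the greedy splitter commits to 行事場 and is left with a lone 所, while the lattice search prefers 行事 + 場所 because both pieces are dictionary words. A real cost model would likely be more refined than the two constants assumed here.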
Audience & Benefit: Intended for linguists, publishers, content creators, and developers working with Korean text. Gukhanmun converts mixed-script Korean into hangul-only text accurately and efficiently, without requiring additional downloads or complex setup. Its format adapters and rendering modes suit use cases ranging from academic research to web development.
Installable via winget, Gukhanmun provides a robust solution for handling Korean text conversion with precision and flexibility.