Plugin for markdown-it that suppresses linebreaks between east asian characters.
Normally newlines in a markdown text get rendered as newlines in output html text. Then browsers will usually render those newlines as whitespace (more smart behavior is included in w3c drafts, but not actually implemented by vendors).
This plugin finds and removes newlines that cannot be converted to space, algorithm matches CSS Text Module Level 3:
- If the character immediately before or immediately after the segment break is the zero-width space character (U+200B), then the break is removed, leaving behind the zero-width space.
- Otherwise, if the East Asian Width property [UAX11] of both the character before and after the segment break is F, W, or H (not A), and neither side is Hangul, then the segment break is removed.
- Otherwise, the segment break is converted to a space (U+0020).
yarn add markdown-it-cjk-breaks
var md = require('markdown-it')();
var cjk_breaks = require('markdown-it-cjk-breaks');
md.use(cjk_breaks);
md.render(`
あおえ
うい
aoe
ui
`);
// returns:
//
//<p>あおえうい
//aoe
//ui</p>