Unicode

Question

Unicode

Accepted Answer

Prior to ES6, JavaScript strings are represented by 16 bit character encoding (UTF 16). Each character is represented by 16 bit sequence known as code unit. Since the character set is been expanded by Unicode, you will get unexpected results from UTF 16 encoded strings containing surrogate pairs(i.e, Since it is not sufficient to represent certain characters in just 16 bits, you need two 16 bit code units). ECMAScript 6 added full support for UTF 16 within strings and regular expressions. It introduces new Unicode literal form in strings and new RegExp u mode to handle code points, as well as new APIs(codePointAt, fromCodePoint) to process strings.