Patent title: Hybrid comparison for Unicode text strings consisting primarily of ASCII characters
Inventors: Thomas Neumann, Viktor Leis, Alfons Kemper
Application number: US15719479
Filing date: 2017-09-28
Publication number: US10089281B1
Publication date: 2018-10-02
Patent content provided by Intellectual Property Publishing House
Abstract: Comparing text strings with Unicode encoding includes receiving two text strings $S_1$ and $S_2$. The process computes, for the first text string $S_1$, a first weight according to a weight function $f$ that computes an ASCII prefix $f_A(S_1)$, computes a Unicode weight suffix $f_U(S_1)$, and concatenates the weights to form the first weight $f(S_1) = f_A(S_1) + f_U(S_1)$. Computing the ASCII prefix for the first string applies bitwise operations to n-byte contiguous blocks of the first string to determine whether each block contains only ASCII characters, and replaces accented Unicode characters with equivalent unaccented ASCII characters when comparison is designated as accent-insensitive. When there is a first block containing a non-replaceable non-ASCII character, the Unicode weight suffix is computed by performing a character-by-character Unicode weight lookup beginning with the first block. The same process is applied to the second string. The text strings are compared by comparing their computed weights.
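The process described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes UTF-8 input, an 8-byte block size, ASCII weights equal to the byte values, and a stand-in `unicode_weight` that returns code points instead of consulting a real collation table. The names `block_is_ascii`, `unaccent`, `unicode_weight`, `hybrid_weight`, and `compare` are hypothetical and chosen for illustration only.

```python
import unicodedata
from typing import Optional

BLOCK = 8  # assumed block size n (one 64-bit word)
HIGH_BITS = int.from_bytes(b"\x80" * BLOCK, "little")  # 0x8080808080808080


def block_is_ascii(block: bytes) -> bool:
    """Bitwise test: the block is pure ASCII if no byte has its high bit set."""
    return int.from_bytes(block, "little") & HIGH_BITS == 0


def unaccent(ch: str) -> Optional[str]:
    """Return the unaccented ASCII base letter of ch, or None if there is none."""
    d = unicodedata.normalize("NFD", ch)
    if len(d) > 1 and d[0].isascii() and all(unicodedata.combining(c) for c in d[1:]):
        return d[0]
    return None


def unicode_weight(ch: str, accent_insensitive: bool) -> int:
    """Stand-in for a collation-table lookup (assumption: code point order)."""
    if accent_insensitive:
        ch = unaccent(ch) or ch
    return ord(ch)


def _char_len(lead: int) -> int:
    """Length in bytes of the UTF-8 character that starts with this lead byte."""
    if lead < 0x80:
        return 1
    if lead < 0xE0:
        return 2
    if lead < 0xF0:
        return 3
    return 4


def hybrid_weight(s: str, accent_insensitive: bool = False) -> tuple:
    """Return the ASCII prefix weights followed by the Unicode suffix weights."""
    data = s.encode("utf-8")
    prefix = bytearray()
    pos = 0
    while pos < len(data):
        block = data[pos:pos + BLOCK]
        if block_is_ascii(block):  # fast path: whole block is ASCII
            prefix += block
            pos += len(block)
            continue
        # Mixed block: try to replace every character that starts in it.
        folded, end = bytearray(), pos
        limit = min(pos + BLOCK, len(data))
        while end < limit:
            n = _char_len(data[end])
            ch = data[end:end + n].decode("utf-8")
            rep = ch if n == 1 else (unaccent(ch) if accent_insensitive else None)
            if rep is None:
                folded = None
                break
            folded += rep.encode("ascii")
            end += n
        if folded is None:
            break  # non-replaceable character: suffix starts at this block
        prefix += folded
        pos = end
    # Slow path: character-by-character lookup from the first such block on.
    suffix = tuple(unicode_weight(c, accent_insensitive)
                   for c in data[pos:].decode("utf-8"))
    return tuple(prefix) + suffix


def compare(s1: str, s2: str, accent_insensitive: bool = False) -> int:
    """Three-way comparison (-1, 0, 1) of the two computed weights."""
    w1 = hybrid_weight(s1, accent_insensitive)
    w2 = hybrid_weight(s2, accent_insensitive)
    return (w1 > w2) - (w1 < w2)
```

For example, `compare("résumé", "resume", accent_insensitive=True)` stays entirely on the ASCII fast path after replacement, while the accent-sensitive call falls back to the per-character lookup starting at the first block. Because the stand-in weights are plain code points, the resulting order is only illustrative. A real system would substitute locale-aware collation weights for both the prefix and the suffix.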
Applicant: Tableau Software, Inc.
Address: Seattle WA US
Nationality: US
Agent: Morgan, Lewis & Bockius LLP