Skip to content

Fix Bug in Buffering UTF-8 Text Input

Compare
Choose a tag to compare
@prohippo prohippo released this 06 Dec 19:15
· 106 commits to master since this release

This release corrects a long-time bug in the buffering of UTF-8 text data input for analysis. The original C code for AW was ASCII only and had to be completely rewritten for its Java reimplementation. Processing errors in v2.3 become evident for text with emoticons, but problems could also arise with non-ASCII punctuation.