- Remove invalid UTF-8 character
- Remove delimeter
- Remove end_of_field (eof)
- Remove nulls
- No single start quote
- Update quote sanitizer, remove both start and end quotes
- Only remove \u0000, rather all possible null characters
- Fix the chinese character encode bug