Customer Portal

Reading special characters from input file

Comments 1

  • Avatar
    jraszyk
    0
    Comment actions Permalink
    I'm afraid if those special characters (byte values) are not valid characters under selected encoding, you will get such errors.

    For example, any byte value above 127 is not valid in USASCII encoding. In UTF-8 only certain combinations of >127 byte values are valid UTF-8 character. But for example ISO-8859-1 accepts all byte values successfully and should not produce the error you've shown.

    Another option would be to parse such strings as byte(array) in CloverETL and then further process it for example in Reformat component with some of the Conversion Functions (e.g. byte2str() on preprocessed byte array) or maybe String Functions (removeNonAscii() or removeNonPrintable(), etc.).

Please sign in to leave a comment.