UTF8 (Bouncy Castle Library 1.81 API Specification)

Overview

Package

Class

Tree

Deprecated

Index

Help

Bouncy Castle Cryptography Library 1.81

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

org.bouncycastle.util.encoders
Class UTF8

java.lang.Object
  org.bouncycastle.util.encoders.UTF8

public class UTF8
extends java.lang.Object

Utilities for working with UTF-8 encodings.

Decoding of UTF-8 is based on a presentation by Bob Steagall at CppCon2018 (see https://github.com/BobSteagall/CppCon2018). It uses a Deterministic Finite Automaton (DFA) to recognize and decode multi-byte code points.

Constructor Summary
`UTF8()`

Method Summary
`static int`	`transcodeToUTF16(byte[] utf8, char[] utf16)` Transcode a UTF-8 encoding into a UTF-16 representation.
`static int`	`transcodeToUTF16(byte[] utf8, int utf8Off, int utf8Length, char[] utf16)` Transcode a UTF-8 encoding into a UTF-16 representation.

Methods inherited from class java.lang.Object

clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail

UTF8

public UTF8()

Method Detail

transcodeToUTF16

public static int transcodeToUTF16(byte[] utf8,
                                   char[] utf16)

Transcode a UTF-8 encoding into a UTF-16 representation. In the general case the output array should be at least as long as the input one to handle arbitrary inputs. The number of output UTF-16 code units is returned, or -1 if any errors are encountered (in which case an arbitrary amount of data may have been written into the output array). Errors that will be detected are malformed UTF-8, including incomplete, truncated or "overlong" encodings, and unmappable code points. In particular, no unmatched surrogates will be produced. An error will also result if is found to be too small to store the complete output.

Parameters:: utf8 - A non-null array containing a well-formed UTF-8 encoding.; utf16 - A non-null array, at least as long as the array in order to ensure the output will fit.
Returns:: The number of UTF-16 code units written to (beginning from index 0), or else -1 if the input was either malformed or encoded any unmappable characters, or if the is too small.

transcodeToUTF16

public static int transcodeToUTF16(byte[] utf8,
                                   int utf8Off,
                                   int utf8Length,
                                   char[] utf16)

Transcode a UTF-8 encoding into a UTF-16 representation. In the general case the output array should be at least as long as the input length from to handle arbitrary inputs. The number of output UTF-16 code units is returned, or -1 if any errors are encountered (in which case an arbitrary amount of data may have been written into the output array). Errors that will be detected are malformed UTF-8, including incomplete, truncated or "overlong" encodings, and unmappable code points. In particular, no unmatched surrogates will be produced. An error will also result if is found to be too small to store the complete output.

Parameters:: utf8 - A non-null array containing a well-formed UTF-8 encoding.; utf8Off - start position in the array for the well-formed encoding.; utf8Length - length in bytes of the well-formed encoding.; utf16 - A non-null array, at least as long as the array in order to ensure the output will fit.
Returns:: The number of UTF-16 code units written to (beginning from index 0), or else -1 if the input was either malformed or encoded any unmappable characters, or if the is too small.

Overview

Package

Class

Tree

Deprecated

Index

Help

Bouncy Castle Cryptography Library 1.81

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

org.bouncycastle.util.encoders Class UTF8

UTF8

transcodeToUTF16

transcodeToUTF16

org.bouncycastle.util.encoders
Class UTF8