public final class BOMNewlineEncodingDetector extends Object
| Modifier and Type | Class and Description |
|---|---|
static class |
BOMNewlineEncodingDetector.NewlineType
Defines type friendly newline types.
|
| Modifier and Type | Field and Description |
|---|---|
static String |
BOCU_1
BOCU (Binary Ordered Compression for Unicode)
|
static String |
EBCDIC
Java friendly EBCDIC encoding name..
|
static String |
ISO_8859_1
Java friendly ISO-8859-1 encoding name.
|
static String |
SCSU
SCSU (Standard Compression Scheme for Unicode)
|
static String |
UTF_16
Java friendly UTF-16 encoding name.
|
static String |
UTF_16BE
Java friendly UTF-16 big endian encoding name.
|
static String |
UTF_16LE
Java friendly UTF-16 little endian encoding name.
|
static String |
UTF_32
Java friendly UTF-32 encoding name..
|
static String |
UTF_32BE
Java friendly UTF-32 big endian encoding name..
|
static String |
UTF_32LE
Java friendly UTF-32 little endian encoding name..
|
static String |
UTF_7
Java friendly UTF-7 encoding name..
|
static String |
UTF_8
Java friendly UTF-8 encoding name.
|
static String |
UTF_EBCDIC
Java friendly UTF-EBCDIC encoding name..
|
| Constructor and Description |
|---|
BOMNewlineEncodingDetector(InputStream inputStream)
Create a new BOMNewlineEncodingDetector from an
InputStream. |
BOMNewlineEncodingDetector(InputStream inputStream,
Charset defaultEncoding) |
BOMNewlineEncodingDetector(InputStream inputStream,
String defaultEncoding)
Create a new BOMNewlineEncodingDetector from an
InputStream and a user provided encoding. |
| Modifier and Type | Method and Description |
|---|---|
void |
detectAndRemoveBom() |
void |
detectBom() |
int |
getBomSize()
Gets the number of bytes used by the Byte-Order-mark in this document.
|
String |
getDefaultEncoding()
Get the defaultEncoding set by the user.
|
String |
getEncoding()
Get the guessed encoding or if encoding couldn't be guessed return the user supplied encoding.
|
String |
getEncodingSpecificationInfo()
Return a short description of the encoding.
|
InputStream |
getInputStream()
Get the input stream pased in to the constructor
|
BOMNewlineEncodingDetector.NewlineType |
getNewlineType()
Detects newline type using the inputStream itself.
|
static BOMNewlineEncodingDetector.NewlineType |
getNewlineType(CharSequence text)
Static helper method for detecting newline type used in a run of text.
|
boolean |
hasBom()
Does this document have a byte order mark?
|
boolean |
hasUtf7Bom()
Does this document have a UTF-7 byte order mark?
|
boolean |
hasUtf8Bom()
Indicates if the guessed encoding is UTF-8 and this file has a BOM.
|
boolean |
hasUtf8Encoding() |
boolean |
isAutodetected()
Indicates if the guessed encoding was auto-detected.
|
boolean |
isDefinitive()
Are we confident of the document encoding?
|
void |
setDefaultEncoding(String defaultEncoding)
Set the default encoding.
|
public static final String UTF_16
public static final String UTF_16BE
public static final String UTF_16LE
public static final String UTF_8
public static final String ISO_8859_1
public static final String EBCDIC
public static final String SCSU
public static final String UTF_7
public static final String UTF_EBCDIC
public static final String BOCU_1
public static final String UTF_32
public static final String UTF_32BE
public static final String UTF_32LE
public BOMNewlineEncodingDetector(InputStream inputStream)
InputStream. Cannot detect BOMNewlineEncodingDetector.NewlineType unless a
valid encoding is detected.inputStream - the input streampublic BOMNewlineEncodingDetector(InputStream inputStream, String defaultEncoding)
InputStream and a user provided encoding. This
BOMNewlineEncodingDetector can convert the input bytes to Unicode for detection of the BOMNewlineEncodingDetector.NewlineTypeinputStream - the input streamdefaultEncoding - the default encodingpublic BOMNewlineEncodingDetector(InputStream inputStream, Charset defaultEncoding)
public static BOMNewlineEncodingDetector.NewlineType getNewlineType(CharSequence text)
text - - text which includes newlines.BOMNewlineEncodingDetector.NewlineTypepublic BOMNewlineEncodingDetector.NewlineType getNewlineType()
BOMNewlineEncodingDetector.NewlineTypepublic InputStream getInputStream()
InputStreampublic String getEncoding()
public String getEncodingSpecificationInfo()
public boolean isDefinitive()
public void detectBom()
public void detectAndRemoveBom()
public String getDefaultEncoding()
public void setDefaultEncoding(String defaultEncoding)
defaultEncoding - default encodingpublic boolean hasBom()
public boolean hasUtf8Bom()
public boolean hasUtf7Bom()
public boolean isAutodetected()
public int getBomSize()
public boolean hasUtf8Encoding()
Copyright © 2021. All rights reserved.