2008-03-08 13:52:38 +00:00
|
|
|
/////////////////////////////////////////////////////////////////////////////
|
|
|
|
// Name: convauto.h
|
2008-03-10 15:24:38 +00:00
|
|
|
// Purpose: interface of wxConvAuto
|
2008-03-08 13:52:38 +00:00
|
|
|
// Author: wxWidgets team
|
|
|
|
// RCS-ID: $Id$
|
|
|
|
// Licence: wxWindows license
|
|
|
|
/////////////////////////////////////////////////////////////////////////////
|
|
|
|
|
|
|
|
/**
|
|
|
|
@class wxConvAuto
|
|
|
|
@wxheader{convauto.h}
|
2008-03-08 14:43:31 +00:00
|
|
|
|
2008-03-08 13:52:38 +00:00
|
|
|
This class implements a Unicode to/from multibyte converter capable of
|
|
|
|
automatically recognizing the encoding of the multibyte text on input. The
|
|
|
|
logic used is very simple: the class uses the BOM (byte order mark) if it's
|
|
|
|
present and tries to interpret the input as UTF-8 otherwise. If this fails, the
|
|
|
|
input is interpreted as being in the default multibyte encoding which can be
|
|
|
|
specified in the constructor of a wxConvAuto instance and, in turn, defaults to
|
|
|
|
the value of @ref wxConvAuto::getdefaultmbencoding GetFallbackEncoding if
|
|
|
|
not explicitly given.
|
2008-03-08 14:43:31 +00:00
|
|
|
|
2008-03-08 13:52:38 +00:00
|
|
|
For the conversion from Unicode to multibyte, the same encoding as was
|
|
|
|
previously used for multibyte to Unicode conversion is reused. If there had
|
|
|
|
been no previous multibyte to Unicode conversion, UTF-8 is used by default.
|
|
|
|
Notice that once the multibyte encoding is automatically detected, it doesn't
|
|
|
|
change any more, i.e. it is entirely determined by the first use of wxConvAuto
|
|
|
|
object in the multibyte-to-Unicode direction. However creating a copy of
|
|
|
|
wxConvAuto object, either via the usual copy constructor or assignment
|
|
|
|
operator, or using wxMBConv::Clone method, resets the
|
|
|
|
automatically detected encoding so that the new copy will try to detect the
|
|
|
|
encoding of the input on first use.
|
2008-03-08 14:43:31 +00:00
|
|
|
|
2008-03-08 13:52:38 +00:00
|
|
|
This class is used by default in wxWidgets classes and functions reading text
|
2008-03-08 14:43:31 +00:00
|
|
|
from files such as wxFile, wxFFile,
|
2008-03-08 13:52:38 +00:00
|
|
|
wxTextFile, wxFileConfig and
|
2008-03-08 14:43:31 +00:00
|
|
|
various stream classes so the encoding set with its
|
2008-03-08 13:52:38 +00:00
|
|
|
@ref wxConvAuto::setdefaultmbencoding SetFallbackEncoding method will
|
|
|
|
affect how these classes treat input files. In particular, use this method
|
|
|
|
to change the fall-back multibyte encoding used to interpret the contents of
|
|
|
|
the files whose contents isn't valid UTF-8 or to disallow it completely.
|
2008-03-08 14:43:31 +00:00
|
|
|
|
2008-03-08 13:52:38 +00:00
|
|
|
@library{wxbase}
|
|
|
|
@category{FIXME}
|
2008-03-08 14:43:31 +00:00
|
|
|
|
2008-03-10 15:24:38 +00:00
|
|
|
@see @ref overview_mbconvclasses "wxMBConv classes overview"
|
2008-03-08 13:52:38 +00:00
|
|
|
*/
|
|
|
|
class wxConvAuto : public wxMBConv
|
|
|
|
{
|
|
|
|
public:
|
|
|
|
/**
|
|
|
|
Constructs a new wxConvAuto instance. The object will try to detect the input
|
|
|
|
of the multibyte text given to its wxMBConv::ToWChar method
|
|
|
|
automatically but if the automatic detection of Unicode encodings fails, the
|
2008-03-09 12:33:59 +00:00
|
|
|
fall-back encoding @a enc will be used to interpret it as multibyte text.
|
2008-03-08 13:52:38 +00:00
|
|
|
The default value of this parameter, @c wxFONTENCODING_DEFAULT means
|
|
|
|
that the global default value which can be set using
|
|
|
|
@ref setdefaultmbencoding() SetFallbackEncoding method should be
|
|
|
|
used. As with that method, passing @c wxFONTENCODING_MAX inhibits using
|
|
|
|
this encoding completely so the input multibyte text will always be interpreted
|
|
|
|
as UTF-8 in the absence of BOM and the conversion will fail if the input
|
|
|
|
doesn't form valid UTF-8 sequence. Another special value is
|
|
|
|
@c wxFONTENCODING_SYSTEM which means to use the encoding currently used
|
|
|
|
on the user system, i.e. the encoding returned by
|
|
|
|
wxLocale::GetSystemEncoding. Any other
|
2008-03-08 14:43:31 +00:00
|
|
|
encoding will be used as is, e.g. passing @c wxFONTENCODING_ISO8859_1
|
2008-03-08 13:52:38 +00:00
|
|
|
ensures that non-UTF-8 input will be treated as latin1.
|
|
|
|
*/
|
|
|
|
wxConvAuto(wxFontEncoding enc = wxFONTENCODING_DEFAULT);
|
|
|
|
|
|
|
|
/**
|
|
|
|
Disable the use of the fall back encoding: if the input doesn't have a BOM and
|
|
|
|
is not valid UTF-8, the conversion will fail.
|
|
|
|
*/
|
|
|
|
static void DisableFallbackEncoding();
|
|
|
|
|
|
|
|
/**
|
|
|
|
Returns the encoding used by default by wxConvAuto if no other encoding is
|
2008-03-08 14:43:31 +00:00
|
|
|
explicitly specified in constructor. By default, returns
|
|
|
|
@c wxFONTENCODING_ISO8859_1 but can be changed using
|
2008-03-08 13:52:38 +00:00
|
|
|
@ref setdefaultmbencoding() SetFallbackEncoding method.
|
|
|
|
*/
|
|
|
|
static wxFontEncoding GetFallbackEncoding();
|
|
|
|
|
|
|
|
/**
|
|
|
|
Changes the encoding used by default by wxConvAuto if no other encoding is
|
|
|
|
explicitly specified in constructor. The default value, which can be retrieved
|
2008-03-08 14:43:31 +00:00
|
|
|
using @ref getdefaultmbencoding() GetFallbackEncoding, is
|
2008-03-08 13:52:38 +00:00
|
|
|
@c wxFONTENCODING_ISO8859_1.
|
2008-03-08 14:43:31 +00:00
|
|
|
Special values of @c wxFONTENCODING_SYSTEM or
|
2008-03-09 12:33:59 +00:00
|
|
|
@c wxFONTENCODING_MAX can be used for @a enc parameter to use the
|
2008-03-08 13:52:38 +00:00
|
|
|
encoding of the current user locale as fall back or not use any encoding for
|
|
|
|
fall back at all, respectively (just as with the similar constructor
|
|
|
|
parameter). However @c wxFONTENCODING_DEFAULT value cannot be used here.
|
|
|
|
*/
|
|
|
|
static void SetFallbackEncoding(wxFontEncoding enc);
|
|
|
|
};
|
2008-03-10 15:24:38 +00:00
|
|
|
|