std::swscanf
std::wscanf, std::fwscanf, std::swscanf
Defined in header | | |
---|---|---|
int wscanf( const wchar_t* format, ... | (1) | (since C++11) |
int fwscanf( std::FILE* stream, const wchar_t* format, ... | (2) | (since C++11) |
int swscanf( const wchar_t* buffer, const wchar_t* format, ... | (3) | (since C++11) |
Reads data from the a variety of sources, interprets it according to format
and stores the results into given locations.
1) Reads the data from stdin
.
2) Reads the data from file stream
stream
.
3) Reads the data from null-terminated wide string buffer
.
Parameters
stream | - | input file stream to read from |
---|---|---|
buffer | - | pointer to a null-terminated wide string to read from |
format | - | pointer to a null-terminated wide string specifying how to read the input. The format string consists of. non-whitespace wide characters except %: each such character in the format string consumes exactly one identical character from the input stream, or causes the function to fail if the next character on the stream does not compare equal. whitespace characters: any single whitespace character in the format string consumes all available consecutive whitespace characters from the input (determined as if by calling iswspace in a loop). Note that there is no difference between "\n", " ", "\t\t", or other whitespace in the format string. conversion specifications. Each conversion specification has the following format: introductory % character (optional) assignment-suppressing character *. If this option is present, the function does not assign the result of the conversion to any receiving argument. (optional) integer number (greater than zero) that specifies maximum field width, that is, the maximum number of characters that the function is allowed to consume when doing the conversion specified by the current conversion specification. Note that %s and %[ may lead to buffer overflow if the width is not provided. (optional) length modifier that specifies the size of the receiving argument, that is, the actual destination type. This affects the conversion accuracy and overflow rules. The default destination type is different for each conversion type (see table below). conversion format specifier The following format specifiers are available: Conversion specifier Explanation Argument type length modifier hh (C++11). h (none) l ll (C++11). j (C++11). z (C++11). t (C++11). L % matches literal % N/A N/A N/A N/A N/A N/A N/A N/A N/A c matches a character or a sequence of characters If a width specifier is used, matches exactly width wide characters (the argument must be a pointer to an array with sufficient room). Unlike %s and %[, does not append the null character to the array. N/A N/A char* wchar_t* N/A N/A N/A N/A N/A s matches a sequence of non-whitespace characters (a string) If width specifier is used, matches up to width or until the first whitespace character, whichever appears first. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). set matches a non-empty sequence of character from set of characters. If the first character of the set is ^, then all characters not in the set are matched. If the set begins with ] or ^] then the ] character is also included into the set. It is implementation-defined whether the character - in the non-initial position in the scanset may be indicating a range, as in 0-9. If width specifier is used, matches only up to width. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). d matches a decimal integer. The format of the number is the same as expected by wcstol() with the value 10 for the base argument. signed char* or unsigned char* signed short* or unsigned short* signed int* or unsigned int* signed long* or unsigned long* signed long long* or unsigned long long* intmax_t* or uintmax_t* size_t* ptrdiff_t* N/A i matches an integer. The format of the number is the same as expected by wcstol() with the value 0 for the base argument (base is determined by the first characters parsed). u matches an unsigned decimal integer. The format of the number is the same as expected by wcstoul() with the value 10 for the base argument. o matches an unsigned octal integer. The format of the number is the same as expected by wcstoul() with the value 8 for the base argument. x, X matches an unsigned hexadecimal integer. The format of the number is the same as expected by wcstoul() with the value 16 for the base argument. n returns the number of characters read so far. No input is consumed. Does not increment the assignment count. If the specifier has assignment-suppressing operator defined, the behavior is undefined. a, A(C++11) e, E f, F g, G matches a floating-point number. The format of the number is the same as expected by wcstof(). N/A N/A float* double* N/A N/A N/A N/A long double* p matches implementation defined character sequence defining a pointer. printf family of functions should produce the same sequence using %p format specifier. N/A N/A void** N/A N/A N/A N/A N/A N/A For every conversion specifier other than n, the longest sequence of input characters which does not exceed any specified field width and which either is exactly what the conversion specifier expects or is a prefix of a sequence it would expect, is what's consumed from the stream. The first character, if any, after this consumed sequence remains unread. If the consumed sequence has length zero or if the consumed sequence cannot be converted as specified above, the matching failure occurs unless end-of-file, an encoding error, or a read error prevented input from the stream, in which case it is an input failure. All conversion specifiers other than [, c, and n consume and discard all leading whitespace characters (determined as if by calling iswspace) before attempting to parse the input. These consumed characters do not count towards the specified maximum field width. If the length specifier l is not used, the conversion specifiers c, s, and [ perform wide-to-multibyte character conversion as if by calling wcrtomb() with an mbstate_t object initialized to zero before the first character is converted. The conversion specifiers s and [ always store the null terminator in addition to the matched characters. The size of the destination array must be at least one greater than the specified field width. The use of %s or %[, without specifying the destination array size, is as unsafe as std::gets. The correct conversion specifications for the fixed-width integer types (int8_t, etc) are defined in the header <cinttypes> (although SCNdMAX, SCNuMAX, etc is synonymous with %jd, %ju, etc). There is a sequence point after the action of each conversion specifier; this permits storing multiple fields in the same "sink" variable. When parsing an incomplete floating-point value that ends in the exponent with no digits, such as parsing "100er" with the conversion specifier %f, the sequence "100e" (the longest prefix of a possibly valid floating-point number) is consumed, resulting in a matching error (the consumed sequence cannot be converted to a floating-point number), with "r" remaining. Existing implementations do not follow this rule and roll back to consume only "100", leaving "er", e.g. glibc bug 1765. |
Conversion specifier | Explanation | Argument type |
length modifier | hh (C++11). | h |
% | matches literal % | N/A |
c | matches a character or a sequence of characters If a width specifier is used, matches exactly width wide characters (the argument must be a pointer to an array with sufficient room). Unlike %s and %[, does not append the null character to the array. | N/A |
s | matches a sequence of non-whitespace characters (a string) If width specifier is used, matches up to width or until the first whitespace character, whichever appears first. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | |
set | matches a non-empty sequence of character from set of characters. If the first character of the set is ^, then all characters not in the set are matched. If the set begins with ] or ^] then the ] character is also included into the set. It is implementation-defined whether the character - in the non-initial position in the scanset may be indicating a range, as in 0-9. If width specifier is used, matches only up to width. Always stores a null character in addition to the characters matched (so the argument array must have room for at least width+1 characters). | |
d | matches a decimal integer. The format of the number is the same as expected by wcstol() with the value 10 for the base argument. | signed char* or unsigned char* |
i | matches an integer. The format of the number is the same as expected by wcstol() with the value 0 for the base argument (base is determined by the first characters parsed). | |
u | matches an unsigned decimal integer. The format of the number is the same as expected by wcstoul() with the value 10 for the base argument. | |
o | matches an unsigned octal integer. The format of the number is the same as expected by wcstoul() with the value 8 for the base argument. | |
x, X | matches an unsigned hexadecimal integer. The format of the number is the same as expected by wcstoul() with the value 16 for the base argument. | |
n | returns the number of characters read so far. No input is consumed. Does not increment the assignment count. If the specifier has assignment-suppressing operator defined, the behavior is undefined. | |
a, A(C++11) e, E f, F g, G | matches a floating-point number. The format of the number is the same as expected by wcstof(). | N/A |
p | matches implementation defined character sequence defining a pointer. printf family of functions should produce the same sequence using %p format specifier. | N/A |
... | - | receiving arguments |
- non-whitespace wide characters except
%
: each such character in the format string consumes exactly one identical character from the input stream, or causes the function to fail if the next character on the stream does not compare equal.
The following format specifiers are available:
Conversion
specifier Explanation Argument type length modifier hh
(C++11).
h
(none) l
ll
(C++11).
j
(C++11).
z
(C++11).
t
(C++11).
L
%
matc
hes literal %
N/A N/A N/A N/A N/A N/A N/A N/A N/A c
matc
hes a character
or a sequenc
e of characters
If a width
spec
ifier is used, matc
hes exac
tly width
wide characters
(the argument must be a pointer to an array with suffic
ient room). Unlike %
s and %
[, does not append the null character
to the array.
N/A N/A char*
wchar_t*
N/A N/A N/A N/A N/A s
matches
a s
equence of non-whites
pace characters
(a string
) If width
s
pecifier is
us
ed, matches
up to width
or until the firs
t whites
pace character, whichever appears
firs
t. Always
s
tores
a null character in addition to the characters
matched (s
o the argument array mus
t have room for at leas
t width+1
characters
).
`[`set`]` matches a non-empty sequence of character from set of characters. If the first character of the set is `^`, then all characters not in the set are matched. If the set begins with `]` or `^]` then the `]` character is also included into the set. It is implementation-defined whether the character `-` in the non-initial position in the scanset may be indicating a range, as in `[0-9]`. If width specifier is used, matches only up to _width_. Always stores a null character in addition to the characters matched (so the argument array must have room for at least _width+1_ characters).
`d` matches a **decimal integer**. The format of the number is the same as expected by [`wcstol()`](../../string/wide/wcstol) with the value `10` for the `base` argument.
`signed char*` or `unsigned char*`
`signed short*` or `unsigned short*`
`signed int*` or `unsigned int*`
`signed long*` or `unsigned long*`
`signed long long*` or `unsigned long long*`
`intmax_t*` or `uintmax_t*`
size_t*
ptrdiff_t*
N/A i
matches an integer
. The format of the number i
s the same as expected by wcstol()
wi
th the value 0
for the base
argument (base
i
s determi
ned by the fi
rst characters parsed).
`u` matches an unsigned **decimal integer**. The format of the number is the same as expected by [`wcstoul()`](../../string/wide/wcstoul) with the value `10` for the `base` argument.
`o` matches an unsigned **octal integer**. The format of the number is the same as expected by [`wcstoul()`](../../string/wide/wcstoul) with the value `8` for the `base` argument.
`x`, `X` matches an unsigned **hexadecimal integer**. The format of the number is the same as expected by [`wcstoul()`](../../string/wide/wcstoul) with the value `16` for the `base` argument.
`n` returns the **number of characters read so far**. No input is consumed. Does not increment the assignment count. If the specifier has assignment-suppressing operator defined, the behavior is undefined.
`a`, `A`(C++11)
e
, E
f
, F
g
, G
matches a floating-point number
. The format of the number is the same as expected by wcstof()
.
N/A N/A float*
double*
N/A N/A N/A N/A long double*
`p` matches implementation defined character sequence defining a **pointer**. `printf` family of functions should produce the same sequence using `%p` format specifier.
N/A N/A void**
N/A N/A N/A N/A N/A N/A For every con
version
specifier other than
n
, the lon
gest sequen
ce of in
put characters which does n
ot exceed an
y specified field width an
d which either is exactly what the con
version
specifier expects or is a prefix of a sequen
ce it would expect, is what's con
sumed from the stream. The first character, if an
y, after this con
sumed sequen
ce remain
s un
read. If the con
sumed sequen
ce has len
gth zero or if the con
sumed sequen
ce can
n
ot be con
verted as specified above, the matchin
g failure occurs un
less en
d-of-file, an
en
codin
g error, or a read error preven
ted in
put from the stream, in
which case it is an
in
put failure.
All c
on
version
spec
ifiers other than
[
, c
, an
d n
c
on
sume an
d disc
ard all leadin
g whitespac
e c
harac
ters (determin
ed as if by c
allin
g iswspace
) before attemptin
g to parse the in
put. These c
on
sumed c
harac
ters do n
ot c
oun
t towards the spec
ified maximum field width.
If the l
ength s
pec
ifier l
is
not us
ed, the c
onvers
ion s
pec
ifiers
c
, s
, and [
perform wide-to-mul
tibyte c
harac
ter c
onvers
ion as
if by c
al
l
ing wcrtomb()
with an mbstate_t
objec
t initial
ized to zero before the firs
t c
harac
ter is
c
onverted.
The convers
ion s
pecifiers
s
and [
always
s
tore the null terminator in addition to the matched characters
. The s
ize of the des
tination array mus
t be at leas
t one greater than the s
pecified field width. The us
e of %s
or %[
, without s
pecifying the des
tination array s
ize, is
as
uns
afe as
std::gets
.
The correct conversion specifications for the fixed-width integer types (int8_t, etc) are defined in the header <cinttypes> (although SCNdMAX, SCNuMAX, etc is synonymous with %jd, %ju, etc).
There is a sequence point after the action of each conversion specifier; this permits storing multiple fields in the same "sink" variable.
When parsing an incomplete floating-point value that ends in the exponent with no digits, such as parsing "100er"
with the conversion specifier %f
, the sequence "100e"
(the longest prefix of a possibly valid floating-point number) is consumed, resulting in a matching error (the consumed sequence cannot be converted to a floating-point number), with "r"
remaining. Existing implementations do not follow this rule and roll back to consume only "100"
, leaving "er"
, e.g. glibc bug 1765.
... - receiving arguments
Return value
Number of arguments successfully read, or EOF
if failure occurs before the first receiving argument was assigned.
Example
See also
vwscanfvfwscanfvswscanf (C++11)(C++11)(C++11) | reads formatted wide character input from stdin, a file stream or a buffer using variable argument list (function) |
---|
| C documentation for wscanf, fwscanf, swscanf |
© cppreference.com
Licensed under the Creative Commons Attribution-ShareAlike Unported License v3.0.