Details about wide-character support

Could you please detail what parts of multi-byte characters are supported by CompCert? Or, inversely, if there are specific multi-byte-related features which are not supported?

In particular, some compilers accept this code (e.g. GCC, Clang), while CompCert emits a warning (`escape sequence is out of range (code 0x3B1)`) and then crashes:

```
#include <stdio.h>
int main() {
  printf("\u03B1-conversion\n");
}
```

Also, whenever a wide character literal is used in a function such as `wprintf`, a warning similar to `argument #1 of function call has type int * instead of the expected type wchar_t const *` is emitted. This seems to impact the result of this function, which differs from GCC/Clang in examples such as:

```
#include <stdio.h>
#include <wchar.h>
#include <locale.h>

int main() {
  setlocale(LC_ALL, "en_US.UTF-8");
  wchar_t c = L'\x3b1';
  wprintf(L"α %lc\n", c);
}
```

CompCert prints some nonsensical values such as `Î± ?` instead of `α α`.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Details about wide-character support #26

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Details about wide-character support #26

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions