std::filesystem::u8path
From cppreference.com
                    
                                        
                    < cpp | filesystem | path
                    
                                                            
                    | Defined in header  <filesystem> | ||
| template< class Source > path u8path( const Source& source ); | (1) | (since C++17) (deprecated in C++20) | 
| template< class InputIt > path u8path( InputIt first, InputIt last ); | (2) | (since C++17) (deprecated in C++20) | 
Constructs a path p from a UTF-8 encoded sequence of chars, supplied either as an std::string, or as std::string_view, or as a null-terminated multibyte string, or as a [first, last) iterator pair.
-  If path::value_typeis char and native encoding is UTF-8, constructs a path directly as if by path(source) or path(first, last). Note: this is the typical situation of a POSIX system that uses Unicode, such as Linux.
-  Otherwise, if path::value_typeis wchar_t and native encoding is UTF-16 (this is the situation on Windows), or ifpath::value_typeis char16_t (native encoding guaranteed UTF-16) or char32_t (native encoding guaranteed UTF-32), then first converts the UTF-8 character sequence to a temporary stringtmpof typepath::string_typeand then the new path is constructed as if by path(tmp)
-  Otherwise (for non-UTF-8 narrow character encodings and for non-UTF-16 wchar_t), first converts the UTF-8 character sequence to a temporary UTF-32-encoded string tmpof type std::u32string, and then the new path is constructed as if by path(tmp) (this path is taken on a POSIX system with a non-Unicode multibyte or single-byte encoded filesystem)
Parameters
| source | - | a UTF-8 encoded std::string, std::string_view, a pointer to a null-terminated multibyte string, or an input iterator with char value type that points to a null-terminated multibyte string | 
| first, last | - | pair of LegacyInputIterators that specify a UTF-8 encoded character sequence | 
| Type requirements | ||
| - InputItmust meet the requirements of LegacyInputIterator. | ||
| -The value type of InputItmust be char | ||
Return value
The path constructed from the input string after conversion from UTF-8 to the filesystem's native character encoding.
Exceptions
May throw std::bad_alloc if memory allocation fails.
Notes
On systems where native path format differs from the generic path format (neither Windows nor POSIX systems are examples of such OSes), if the argument to this function is using generic format, it will be converted to native.
Example
Run this code
#include <cstdio> #ifdef _MSC_VER #include <io.h> #include <fcntl.h> #else #include <locale> #include <clocale> #endif #include <fstream> #include <iostream> #include <filesystem> namespace fs = std::filesystem; int main() { #ifdef _MSC_VER _setmode(_fileno(stderr), _O_WTEXT); #else std::setlocale(LC_ALL, ""); std::locale::global(std::locale("")); std::cout.imbue(std::locale()); std::wcerr.imbue(std::locale()); #endif fs::path p = fs::u8path(u8"要らない.txt"); std::ofstream(p) << "File contents"; // Prior to LWG2676 uses operator string_type() // on MSVC, where string_type is wstring, only // works due to non-standard extension. // Post-LWG2676 uses new fstream constructors // native string representation can be used with OS APIs if (std::FILE* f = #ifdef _MSC_VER _wfopen(p.c_str(), L"r") #else std::fopen(p.c_str(), "r") #endif ) { int ch; while((ch=fgetc(f))!= EOF) putchar(ch); std::fclose(f); } // multibyte and wide representation can be used for output std::cout << "\nFile name in narrow multibyte encoding: " << p.string() << '\n'; std::wcerr << "File name in wide encoding: " << p.wstring() << '\n'; fs::remove(p); }
Output:
File contents File name in narrow multibyte encoding: 要らない.txt File name in wide encoding: 要らない.txt
See also
| (C++17) | represents a path (class) |