1. Revision History
1.1. Revision 2
Destroy 
1.2. Revision 1
Create future directions section, follow up on Library Evolution Working Group comments.
Change 
Add more code demonstrating the old way and motivating examples.
Incorporate LEWG feedback, particularly alignment requirements illuminated by Odin Holmes and Niall Douglass. Add a feature macro on top of having 
1.3. Revision 0
Initial release.
2. Motivation
I’m very keen on std::embed. I’ve been hand-embedding data in executables for NEARLY FORTY YEARS now. — Guy "Hatcat" Davidson, June 15, 2018
Every C and C++ programmer -- at some point -- attempts to 
- 
     Financial Development - 
       representing coefficients and numeric constants for performance-critical algorithms; 
 
- 
       
- 
     Game Development - 
       assets that do not change at runtime, such as icons, fixed textures and other data 
- 
       Shader and scripting code; 
 
- 
       
- 
     Embedded Development - 
       storing large chunks of binary, such as firmware, in a well-compressed format 
- 
       placing data in memory on chips and systems that do not have an operating system or file system; 
 
- 
       
- 
     Application Development - 
       compressed binary blobs representing data 
- 
       non-C++ script code that is not changed at runtime; and 
 
- 
       
- 
     Server Development - 
       configuration parameters which are known at build-time and are baked in to set limits and give compile-time information to tweak performance under certain loads 
- 
       SSL/TLS Certificates hard-coded into your executable (requiring a rebuild and potential authorization before deploying new certificates). 
 
- 
       
In the pursuit of this goal, these tools have proven to have inadequacies and contribute poorly to the C++ development cycle as it continues to scale up for larger and better low-end devices and high-performance machines, bogging developers down with menial build tasks and trying to cover-up disappointing differences between platforms.
MongoDB has been kind enough to share some of their code below. Other companies have had their example code anonymized or simply not included directly out of shame for the things they need to do to support their workflows. The author thanks MongoDB for their courage and their support for 
The request for some form of 
This paper proposes 
#include <embed>int main ( int , char * []) { constexpr std :: span < const std :: byte > fxaa_binary = std :: embed ( "fxaa.spirv" ); // assert this is a SPIRV file, compile-time static_assert ( fxaa_binary [ 0 ] == 0x03 && fxaa_binary [ 1 ] == 0x02 && fxaa_binary [ 2 ] == 0x23 && fxaa_binary [ 3 ] == 0x07 , "given wrong SPIRV data, check rebuild or check the binaries!" ) auto context = make_vulkan_context (); // data kept around and made available for binary // to use at runtime auto fxaa_shader = make_shader ( context , fxaa_binary ); for (;;) { // ... // and we’re off! // ... } return 0 ; } 
3. Scope and Impact
4. Design Decisions
4.1. Current Practice
Here, we examine current practice, their benefits, and their pitfalls. There are a few cross-platform (and not-so-cross-platform) paths for getting data into an executable.
4.1.1. Manual Work
Many developers also hand-wrap their files in (raw) string literals, or similar to massage their data -- binary or not -- into a conforming representation that can be parsed at source code:
- 
     Have a file data . json 
{ "Hello" : "World!" } 
- 
     Mangle that file with raw string literals, and save it as raw_include_data . h 
R" json({ "Hello": "World!" } )json" 
- 
     Include it into a variable, optionally made constexpr 
#include <iostream>#include <string_view>int main () { constexpr std :: string_view json_view = #include "raw_include_data.h"; // { "Hello": "World!" } std :: cout << json_view << std :: endl ; return 0 ; } 
This happens often in the case of people who have not yet taken the "add a build step" mantra to heart. The biggest problem is that the above C++-ready source file is no longer valid in as its original representation, meaning the file as-is cannot be passed to any validation tools, schema checkers, or otherwise. This hurts the portability and interop story of C++ with other tools and languages.
Furthermore, if the string literal is too big vendors such as VC++ will hard error the build (example from Nonius, benchmarking framework).
4.1.2. Processing Tools
Other developers use pre-processors for data that can’t be easily hacked into a C++ source-code appropriate state (e.g., binary). The most popular one is 
4.1.3. ld 
   Resource files and other "link time" or post-processing measures have one benefit over the previous method: they are fast to perform in terms of compilation time. A example can be seen in the §8.1.3 ld Sadness section.
4.1.4. The incbin 
   There is a tool called [incbin] which is a 3rd party attempt at pulling files in at "assembly time". Its approach is incredibly similar to 
4.2. Prior Art
There has been a lot of discussion over the years in many arenas, from Stack Overflow to mailing lists to meetings with the Committee itself. The latest advancements that had been brought to WG21’s attention was p0373r0 - File String Literals. It proposed the syntax 
4.2.1. Literal-Based, constexpr
A user could reasonably assign (or want to assign) the resulting array to a 
4.2.2. Literal-Based, Null Terminated (?)
It is unclear whether the resulting array of characters or bytes was to be null terminated. The usage and expression imply that it will be, due to its string-like appearance. However, is adding an additional null terminator fitting for desired usage? From the existing tools and practice (e.g., 
4.2.3. Encoding
Because the proposal used a string literal, several questions came up as to the actual encoding of the returned information. The author gave both 
4.3. Design Goals
Because of the aforementioned reasons, it seems more prudent to take a "compiler intrinsic"/"magic function" approach. The function takes the form:
constexpr span < const byte > embed ( string_view resource_identifier ); 
4.3.1. Implementation Defined
Calls such as 
There is precedent for specifying library features that are implemented only through compile-time compiler intrinsics (
As 
4.3.2. Binary Only
Creating two separate forms or options for loading data that is meant to be a "string" always fuels controversy and debate about what the resulting contents should be. The problem is sidestepped entirely by demanding that the resource loaded by 
4.3.3. Constexpr Compatibility
The entire implementation must be usable in a 
5. Help Requested
The author of this proposal is extremely new to writing standardese. While the author has read other papers and consumed the standard, there is a definite need for help and any guidance and direction is certainly welcome. The author expects that this paper may undergo several revisions and undertake quite a few moments of "bikeshedding".
5.1. Feeling Underrepresented?
The author has consulted dozens of C++ users in each of the Text Processing, Video Game, Financial, Server, Embedded and Desktop Application development subspaces. The author has also queried the opinions of Academia. The author feels this paper adequately covers many use cases, existing practice and prior art. If there is a use case or a problem not being adequately addressed by this proposal, the author encourages anyone and everyone to reach out to have their voice heard.
5.2. Bikeshedding
5.2.1. Alternative Names
Some people feel that 
- 
     embed 
- 
     embed_resource 
- 
     slurp 
- 
     constexpr_read 
- 
     static_read 
- 
     static_resource 
- 
     static_include 
- 
     cfsread 
5.2.2. Answered Questions
Is
instd :: vector < std :: byte > format better as a return value (i.e., do we depend on Louis Dionne, Roger Orr, and Daveed Vandevoorde’s work in [p0784r3], [p1002r0], [p1004r0] and [p1023r0])?constexpr 
No, but can be persuaded otherwise. Do not want to lock this feature down as a dependency to these other papers in-flight.
Is it desireable to make
templated so it can return other kinds of views / types at constexpr time, since the standard does not yet have a concept of a constexprstd :: embed orreinterpret_cast +std :: bless ?std :: launder 
Yes.
Should
(std::embed revision 0) just be replaced withstd :: embedded ?std :: span 
Yes. 
Should the return
be const-qualified, as instd :: span ?std :: span < const std :: byte > 
Yes. (LEWG confirmed.)
What is the lookup scheme for files and other resources?
Implementation defined. We expect compilers to expose an option similar to 
Pulling in large data files on every compile might be expensive?
We fully expect implementations to employ techniques already in use for compilation dependency tracking to ensure this is not a frequent problem past the first compilation.
Is this more suitable for WG14 (the standards C committee) rather than WG21 (the standards C++ committee)?
The author does not think that the C standards committee will be the best place for some feature of this variety at this time. All of the typical solutions that would work in C were rejected previously (preprocessor or special literal). Indication has shown that the C standards committee might be able to support this idea and come up with a better design than WG21 or provide more fine-grained mechanisms for it, but there is no evidence that they are either interested or have the time. (Raised in LEWG, confirmed in general post-LEWG discussion.)
Is a magic library function in a header the most desirable?
Other proposals for other avenues have fallen flat and not garnered enough support. Preprocessor directives are rountinely discouraged in multiple LEWG, EWG and discussion at conferences. Many individuals want this to work with 
Icecream’s team has been contacted about this. They have advised the use of 
This brings up the question: do we change std::embed to work before Phase 7 of compilation as some kind of preprocessing directive because 
6. Changes to the Standard
Wording changes are relative to [n4762].
6.1. Intent
The intent of the wording is to provide a function that has implementation-defined behavior but returns the specified constexpr span representing the bytes of the resource the passed-in string_view represents. The wording also explicitly disallows the usage of the function outside of a core constant expression, meaning it is ill-formed if it is attempted to be used at not-constexpr time (std::embed calls should not show up as a function in the final executable).
6.2. Proposed Feature Test Macro
The proposed feature test macro is 
6.3. Proposed Wording
Append to §16.3.1 General [support.limits.general]'s Table 35 one additional entry:
Macro name Value __cpp_lib_embed 201811L 
Append to §19.1 General [utilities.general]'s Table 38 one additional entry:
Subclause Header(s) 19.20 Compile-time Resources <embed> 
Add a new section §19.20 Compile-time Resources [const.res]:
19.20 Compile-time Resources [const.res]19.20.1 In general [const.res.general]
Compile-time resources allow the library implementation to retrieve data into a program from implementation-defined sources.
19.20.2 Header
synopsis [embed.syn]embed namespace std { constexpr span < const byte > embed ( string_view resource_identifier ) noexcept ; } 19.20.3 Function
[embed.embed]embed namespace std { constexpr span < const byte > embed ( string_view resource_identifier ) noexcept ; } 1 Returns: A contiguous sequence of bytes representing the implementation-defined resource.
2 Remarks: Accepts a
whose contents provide an implementation-defined description of a resource. If the implementation cannot find the resource specified, the implementation shall error. [Note— Implementations should provide a mechanism similar to e.g. include paths. — End Note]string_view 3 If this function is called and cannot be evaluated as a §7.7 core constant expression [expr.const], then the program is ill-formed.
7. Future Direction
Many developers in all spaces are concerned that 
The first is to word a 
template < typename T = std :: byte > constexpr span < const T > embed ( string_view resource_identifier ); 
This would supply the data and view it as a type 
A way to help with this would be to 
We do not propose any of these future directions at this time.
8. Appendix
8.1. Sadness
Other techniques used include pre-processing data, link-time based tooling, and assembly-time runtime loading. They are detailed below, for a complete picture of today’s sad landscape of options.
8.1.1. Pre-Processing Tools Sadness
- 
     Run the tool over the data ( xxd - i xxd_data . bin > xxd_data . h xxd_data . h 
unsigned char xxd_data_bin [] = { 0x48 , 0x65 , 0x6c , 0x6c , 0x6f , 0x2c , 0x20 , 0x57 , 0x6f , 0x72 , 0x6c , 0x64 , 0x0a }; unsigned int xxd_data_bin_len = 13 ; 
- 
     Compile main . cpp 
#include <iostream>#include <string_view>// prefix as constexpr, // even if it generates some warnings in g++/clang++ constexpr #include "xxd_data.h"; template < typename T , std :: size_t N > constexpr std :: size_t array_size ( const T ( & )[ N ]) { return N ; } int main () { static_assert ( xxd_data_bin [ 0 ] == 'H' ); static_assert ( array_size ( xxd_data_bin ) == 13 ); std :: string_view data_view ( reinterpret_cast < const char *> ( xxd_data_bin ), array_size ( xxd_data_bin )); std :: cout << data_view << std :: endl ; // Hello, World! return 0 ; } 
Others still use python or other small scripting languages as part of their build process, outputting data in the exact C++ format that they require.
There are problems with the 
Binary data as C(++) arrays provide the overhead of having to comma-delimit every single byte present, it also requires that the compiler verify every entry in that array is a valid literal or entry according to the C++ language.
This scales poorly with larger files, and build times suffer for any non-trivial binary file, especially when it scales into Megabytes in size (e.g., firmware and similar).
8.1.2. python 
   Other companies are forced to create their own ad-hoc tools to embed data and files into their C++ code. MongoDB uses a custom python script, just to get their data into C++:
import os import sys def jsToHeader ( target , source ) : outFile = target h = [ '#include "mongo/base/string_data.h" ', '#include "mongo/scripting/engine.h" ', 'namespace mongo { ', 'namespace JSFiles { ', ] def lineToChars ( s ) : return ',' . join ( str ( ord ( c )) for c in ( s . rstrip () + '\n' )) + ',' for s in source : filename = str ( s ) objname = os . path . split ( filename )[ 1 ]. split ( '.' )[ 0 ] stringname = '_jscode_raw_ '+ objname h . append ( 'constexpr char '+ stringname + "[] = {" ) with open ( filename , 'r' ) as f : for line in f : h . append ( lineToChars ( line )) h . append ( "0};" ) # symbols aren’t exported w/o this h . append ( 'extern const JSFile % s ; '% objname ) h . append ( 'const JSFile % s = { "%s" , StringData ( % s , sizeof ( % s ) - 1 ) }; '% ( objname , filename . replace ( '\\' , '/' ), stringname , stringname )) h . append ( "} // namespace JSFiles" ) h . append ( "} // namespace mongo" ) h . append ( "" ) text = '\n' . join ( h ) with open ( outFile , 'wb ') as out : try : out . write ( text ) finally : out . close () if __name__ == "__main__" : if len ( sys . argv ) < 3 : "Must specify [target] [source] " sys . exit ( 1 ) jsToHeader ( sys . argv [ 1 ], sys . argv [ 2 : ]) 
MongoDB were brave enough to share their code with me and make public the things they have to do: other companies have shared many similar concerns, but do not have the same bravery. We thank MongoDB for sharing.
8.1.3. ld 
   A full, compilable example (except on Visual C++):
- 
     Have a file ld_data.bin with the contents Hello , World ! 
- 
     Run ld - r binary - o ld_data . o ld_data . bin 
- 
     Compile the following main . cpp c ++ - std = c ++ 17 ld_data . o main . cpp 
#include <iostream>#include <string_view>#ifdef __APPLE__ #include <mach-o/getsect.h>#define DECLARE_LD(NAME) extern const unsigned char _section$__DATA__##NAME[]; #define LD_NAME(NAME) _section$__DATA__##NAME #define LD_SIZE(NAME) (getsectbyname("__DATA", "__" #NAME)->size) #elif (defined __MINGW32__) /* mingw */ #define DECLARE_LD(NAME) \ extern const unsigned char binary_##NAME##_start[]; \ extern const unsigned char binary_##NAME##_end[]; #define LD_NAME(NAME) binary_##NAME##_start #define LD_SIZE(NAME) ((binary_##NAME##_end) - (binary_##NAME##_start)) #else /* gnu/linux ld */ #define DECLARE_LD(NAME) \ extern const unsigned char _binary_##NAME##_start[]; \ extern const unsigned char _binary_##NAME##_end[]; #define LD_NAME(NAME) _binary_##NAME##_start #define LD_SIZE(NAME) ((_binary_##NAME##_end) - (_binary_##NAME##_start)) #endif DECLARE_LD ( ld_data_bin ); int main () { // impossible //static_assert(xxd_data_bin[0] == 'H'); std :: string_view data_view ( reinterpret_cast < const char *> ( LD_NAME ( ld_data_bin )), LD_SIZE ( ld_data_bin ) ); std :: cout << data_view << std :: endl ; // Hello, World! return 0 ; } 
This scales a little bit better in terms of raw compilation time but is shockingly OS, vendor and platform specific in ways that novice developers would not be able to handle fully. The macros are required to erase differences, lest subtle differences in name will destroy one’s ability to use these macros effectively. We ommitted the code for handling VC++ resource files because it is excessively verbose than what is present here.
N.B.: Because these declarations are 
9. Acknowledgements
A big thank you to Andrew Tomazos for replying to the author’s e-mails about the prior art. Thank you to Arthur O’Dwyer for providing the author with incredible insight into the Committee’s previous process for how they interpreted the Prior Art.
A special thank you to Agustín Bergé for encouraging the author to talk to the creator of the Prior Art and getting started on this. Thank you to Tom Honermann for direction and insight on how to write a paper and apply for a proposal.
Thank you to Arvid Gerstmann for helping the author understand and use the link-time tools.
Thank you to Tony Van Eerd for valuable advice in improving the main text of this paper.
Thank you to Lilly (Cpplang Slack, @lillypad) for the valuable bikeshed and hole-poking in original designs, alongside Ben Craig who very thoroughly explained his woes when trying to embed large firmware images into a C++ program for deployment into production.
For all this hard work, it is the author’s hope to carry this into C++. It would be the author’s distinct honor to make development cycles easier and better with the programming language we work in and love. ♥