# Minecraft data specification ## NBT specification NBT (named binary tag) is a tree data structure. Tags have a numeric type ID, name and a payload. The human-readable variant of this data is SNBT (stringified named binary tag) NBT files are a compressed `compound` tag. GZip is the compression used in most cases, in some rare cases it's stored uncompressed. A tag is an individual part of the data tree, where the first byte is the ID, followed by a `uint16_t` for the length of the name, then the name as `UTF-8`. (end tag does not contain name) note: UUID are stored as an integer array. ### tag types | ID | tag name | payload specification | |-----:|:-------------|:------------------------------------| | `0` | `end` | - | | `1` | `byte` | `int8_t` | | `2` | `short` | `int16_t` (BE) | | `3` | `int` | `int32_t` (BE) | | `4` | `long` | `int64_t` (BE) | | `5` | `float` | `float` (BE) | | `6` | `double` | `double` (BE) | | `7` | `byte array` | `int32_t` (len) -> `int8_t` | | `8` | `string` | `uint16_t` (len) -> `UTF-8` | | `9` | `list` | ID: `int32_t` (len) -> ID | | `10` | `compound` | list of tags delimited with end tag | | `11` | `int array` | `int32_t` (len) -> `int32_t` | | `12` | `long array` | `int32_t` (len) -> `int64_t` | ## world data There is a difference between \*.mca and \*.mcr files. Where \*.mca files are the newer variant. - mcr = MinecraftRegion - mca = Anvil ### file locations | dim | file path | |:----------|:---------------------| | overworld | `world/region` | | nether | `world/DIM-1/region` | | the end | `world/DIM1/region` | ### coordinate conversions ```c // block->chunk: return (x / 16) - sgn(x); return (x >> 4) - (x & 0x80000000); // chunk->region: return (x / 32) - sgn(x); return (x >> 5) - (x & 0x80000000); ``` ### chunk format specification ### MCR format specification #### header Region files have an 8KiB header, split in two 4KiB tables. The first containing the offsets of chunks in the region file itself, the second providing timestamps for the last updates of those chunks. The offset of a chunk (x,z) (in chunk coordinates) in the first table can be found by using this formula: `4 * ((x & 31) + (z & 31) * 32)`. The timestamp can be found 4096 bytes later in the file. | range | `0x00`—`0x0FFF` | `0x1000`—`0x1FFF` | `0x2000`— | |:------|:-------------------|:--------------------|:------------------------| | data | locations (4B) | timestamps (4B) | chunks and unused space | ##### chunk location Location info for a chunk is stored as a 32 bit big-endian integer, where the first three bytes are an offset in 4KiB sectors from the start of the file. The last byte gives the length of the chunk in 4KiB sectors. (rounded up, of course). Where chunks are always less than 1MiB in size. If a chunk isn't present in the region file (e.g. because it hasn't been generated or migrated yet), both fields are zero. ##### timestamps Timestamp data are 32 bit big-endian integers, representing the last modification time of an individual chunk in epoch seconds. #### payload Chunk data starts with a big-endian 32 bit signed integer which contains the exact length of the data in bytes. The first byte of this data indicates the compression scheme used for the data. The rest of the data is the actual compression data. (len-1 remaining now) The data has an alignment requirement of 4KiB. ##### compression | value | type | |------:|:------------------------------------| | `1` | GZip (RFC1952) (unused in practice) | | `2` | Zlib (RFC1950) | | `3` | uncompressed (<`1.15.1`) | | `4` | LZ4 (≥`24w04a`) | | `127` | custom algorithm\* (≥`24w04a`) | \*: A namespaced string must follow representing the algorithm used. The string is preceded by its length, encoded as an unsigned 16-bit integer. The uncompressed data is in NBT format and follows the chunk format. If the value of compression scheme increases by 128, the compressed data is saved in a file called `c.x.z.mcc`, where x and z are the chunk's coordinates, instead of the usual position. ### MCA format specification ## sources - https://minecraft.wiki/w/Region_file_format - https://minecraft.wiki/w/Anvil_file_format - https://minecraft.wiki/w/Chunk_format