How to compress or send less data through a remote event?

Maelstorm_1973 · July 10, 2023, 12:08am

I see that there is a lot of confusion here about integers and floats.

Integer

First of all, unlike decimal numbers, integers are radix 2 (power of 2) numbers because each position in an integers can only be 0 or 1. For decimal numbers (radix 10), each position in a number can be 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9. So if you have a number 10⁶ That is equivilent to 1,000,000. For an integer, 2¹⁶ is 65,536. 2³² = 4,294,967,296. These are unsigned numbers. For signed numbers, the formula is 2^{x - 1}-1. So for a signed 32-bit integer, the value range is -2,147,483,648 to 2,147,483,647. The maximum positive value for any unsigned integer is 2^x - 1 because you still have to represent 0.

Note that the exponent represents the hard limit as to the maximum values that a integer can hold. Unpredictable results can occur if the limit is exceeded.

As for the OP’s question, you can split and combine integers if you can guarantee that they will within 8, 16, or 32 bits. Roblox does not support 64-bit integers at this time. The way to do this is as follows:

-- Splits a 32-bit integer into two 16-bit integers.
local function split32to16(x)
	local low = bit32.band(0x0000FFFF, x)
	local high = bit32.band(0x0000FFFF, bit32.rshift(x, 16))
	return low, high
end

-- Combines two 16-bit integers into a 32-bit integer.
local function comb16to32(low, high)
	return bit32.bor(bit32.band(0x0000FFFF, low), bit32.band(0xFFFF0000, bit32.lshift(high, 16)))
end

-- Splits a 16-bit integer into two 8-bit integers.
local function split16to8(x)
	local low = bit32.band(0x000000FF, x)
	local high = bit32.band(0x000000FF, bit32.rshift(x, 8))
end

-- Combines two 8-bit integers into a single 16-bit integer.
local function comb8to16(low, high)
	return bit32.bor(bit32.band(0x000000FF, low), bit32.band(0x0000FF00, bit32.lshift(high, 8)))
end

Disclaimer: There’s a few things that you need to keep in mind when using these.

There might be some errors to this since I did this from memory. I wrote these routines and quite a few others some time ago in C/C++.
If you try to combine numbers greater than what it’s looking for, those extra bits will be masked off, so you may get a number you weren’t expecting.
No error checking is done.

Another thing to consider is endianness, or byte order. Although LUA insulates us from this, in other languages it can be a concern when dealing with CPUs that are not Intel/AMD/Cyrix (Little Endian). ARM CPUs (most, if not all mobile devices) have the ability to set the byte order to either 1234 (Big Endian) or 4321 (Little Endian). Other CPUs such as MIPS, Sparc, and IBM’s Z-Processor are big endian devices. Furthermore, byte order on the network is also big endian. Endian has to do with the order bytes are stored in memory for multi-byte integers in respect to increasing memory addresses. For instance, the 32-bit number 0x12345678 is stored as 0x12, 0x34, 0x56, 0x78 in memory for big endian machines. For little endian machines, it’s backwards: 0x78, 0x56, 0x34, 0x12. So make sure you get your byte order right.

Floating Point

Now the floating point specification is the IEEE-754 standard. It specifies the layout of floating point numbers in 16, 32, 64, 128, and 256 bit formats, also known as precision (someone did mention that). For all the formats, the basic layout is the same regardless of the width of the fields.

The sign bit. When it’s a 1, the number is negative.
The exponent. The exponent is encoded using offsets, so a 0 exponent is not 0 but another value. So for a double, its 0xb0111111111 (0x3ff). 0 and 0x7ff have special meanings which are mentioned in the double document on Wikipedia.
The mantissa or fraction. The leading 1 is always assumed, but the first bit of the mantissa is 1/2, the second is 1/4, the third is 1/8, and on down the line for however many bits the mantissa is.

A word of warning though. LUA does not support direct manipulation of floating point types at the bit level. I written code in C/C++ that does do this for a big number library (numbers that are so big they do not fit into a native CPU register). It can get quite complicated depending on what you are trying to do.

Another way you can shoot yourself in the foot with floats is comparison. It is not recommended to directly compare two floats using == or !=. In fact, C/C++ compilers will warn you of this. The best way to handle this is as follows:

local x = 0.33298575
local y = 0.33298243

if math.abs(x - y) < 0.0000000001 then
	-- Do something
else
	-- Do something else
end

Hopefully this helps people.

apictrain0 · July 10, 2023, 12:13am

I might of foreshadowed it, but I still forgot about overflowing of integers like about how I said with a signed 10 bit integer goes over 1023 it will go back to -1024, which may be something people could exploit to shoot someone across the map, But probably a way to defend this is by something like Checkimg if it goes over or using math.clamp

Maelstorm_1973 · July 10, 2023, 12:30am

@apictrain0 @C_Corpze

It’s not necessarily 64-bits. It usually is in the normal course of things, but I’ve ran into situations where I had numbers like 2¹⁰²⁴ be properly represented with complete precision. Variable in LUA and Roblox are variant types, so what I think it’s doing is setting the data type of the variable on the fly to meet the needs of the data. A variant type like in PHP and JavaScript is something like the following.

One or two bytes to denote the type.
One to four bytes to denote the size.
The data.

The value of field 2 depends on what the value of 1 is. Although on the command line, when I do print(2^1022), it prints 4.49423283715579e+307. But if I copy an inf value from a constraint in the workspace, I get this:

179769313486231570814527423731704356798070567525844996598917476803157260780028538760589558632766878171540458953514382464234321326889464182768467546703537516986049910576551282076245490090389328944075868508455133942304583236903222948165808559332123348274797826204144723168738177180919299881250404026184124858368

That number is 2¹⁰²⁴ as I confirmed it on a big number calculator. I have an open bug report about that because the constant values are missing from the documentation of the math library.

And math.huge = 2¹⁰²⁴ = inf

apictrain0 · July 10, 2023, 12:57am

Thanks for correcting me, but I realized that in the numbers document it says there are 3 types of numbers and missing a type where it could reach 2^1024

But also this 2^1024 type number doesn’t appear to be anywhere in the doc.

and In the doc It says it is a number is a double
I tried using type() and typeof() but both returns number.
I have always assume even in the doc the int and int64 would convert to a double during runtime but I appear to be wrong as a 2^1024 number could exist.

Maelstorm_1973 · July 10, 2023, 2:20am

But so far, full representation seems to only be in the workspace. So something about how it’s represented in the workspace is different than how it’s represented in the code. You can do constraint.force = math.huge and it will show as inf in the constraint when you view it in explorer. If you happen to copy the inf value (which is what I did) you get that big number in the previous post.

To fully represent 1024 bits requires 128 bytes, which is in the big number arena (and that is an old standard for RSA crypto back in the early 1990’s). So what’s represented in the constraint is for the physics engine to use and it may require the full 128 bytes. Either way, it’s not the same datatype as what’s used in the scripts.

I think math.huge is the full constant and is a big number datatype because if you type this in the console, you get this:

  19:18:48.517  > print(2^1024)  -  Studio
  19:18:48.518  inf  -  Edit
  19:19:25.834  > print(math.huge)  -  Studio
  19:19:25.835  inf  -  Edit
  19:19:35.788  > print(math.huge == 2^1024)  -  Studio
  19:19:35.789  true  -  Edit
  19:19:39.752  > print(math.huge == 2^1023)  -  Studio
  19:19:39.752  false  -  Edit

It’s definitely a unique situation.

EDIT: On a hunch, I did this:

print(179769313486231570814527423731704356798070567525844996598917476803157260780028538760589558632766878171540458953514382464234321326889464182768467546703537516986049910576551282076245490090389328944075868508455133942304583236903222948165808559332123348274797826204144723168738177180919299881250404026184124858368)  -  Studio
  19:21:52.387  1.7976931348623157e+308  -  Edit

Yes, I told it to print that big number and it came back with what appears to be the maximum value that you can have for a double before it codes to infinity.

Mystxry12 · July 10, 2023, 6:30am

Actually afaik a number takes up 9 bytes i.e 72 bits. And yes as @apictrain0 pointed out, they aren’t stored as integers but IEEE signed doubles, which should technically make them 8 bytes however in other posts, its stated as 9 bytes so guess we have to take their word.

For additional info, check out the wiki page.

Now coming back to your question, from the post I linked we know that a Vector3 takes up 13 bytes but if we were to send the components as numbers, it’d take up 9 * 3 bytes which is 27 bytes so that isn’t an option. However if you looked at the post, you will notice that a string of length 1 is 1 + 2 = 3 bytes which is pretty good. So what if we encode the components of Vector3 to a character?

C_Corpze · July 10, 2023, 12:00pm

So I should basically just encode vectors into strings to save up on bandwidth if I for some reason had to replicate a ridiculous amount of vectors in a array?

To be honest I haven’t done a lot of number <> character conversions before.

I know some string manipulation like filtering, formatting, splitting, etc for admin command systems and whatnot.
But my knowledge on using hex code or turning big numbers into small text is limited.

But I’d love to learn more about it because I know it’s a very powerful tool and strings in Lua seem pretty optimized so it might actually be a very viable way of compressing vectors and big numbers.

Knowing how to write the code is cool but I’d love knowing how it works as well so I’m not just copying over things like a parrot without knowing the meaning or logic behind it.

faze_paspro · July 10, 2023, 12:19pm

Try this

local bigIneger = v3.X + v3.Y + v3.Z

C_Corpze · July 10, 2023, 12:22pm

The problem with this method is that you can’t separate them.
This just adds numbers together, it doesn’t combine them in the way I intended where you can split them later without losing information.

faze_paspro · July 10, 2023, 12:26pm

Can you explain a litte bit more about separating them

C_Corpze · July 10, 2023, 12:32pm

Let’s say I have the numbers 10, 2850 and 565.
I want to combine them into a single value so it’s compressed.

Once it reaches the other side (the server), I want to turn that single value back into the numbers 10, 2850 and 565 without losing important information.

I plan to possibly also use this method for compressing data in datastores if it gets really big for some reason.

Maelstorm_1973 · July 13, 2023, 5:26am

If you are looking to compress the data, I would suggest looking at either LZW or PK compression methods. Both have their strengths and weaknesses. I’m sure there are LUA implementations out there you can use.

If you can read code, then 7-zip source code is available for download.

C_Corpze · July 13, 2023, 11:51am

I’ve actually looked into that but it had me wondering if this would actually result in smaller data.

Because to compress data, you also need a dictionary of keys and values to decompress it later but the dictionary of course also takes up space which might result in the compressed data becoming larger if it already is small.

A array of vectors and numbers already is relatively small.

The problem is that if we want to “zip” it, it might compress the data itself, but since we now also have to send a dictionary to the server for decompression, the dictionary might use more data, making the zipping inefficient and just slower.

Maelstorm_1973 · July 13, 2023, 1:59pm

Like I said, strengths and weaknesses. In some cases, it’s probably best to just leave things alone.

C_Corpze · July 13, 2023, 2:33pm

Updated title to be a bit more relevant and general because I might have to seek different solutions.

I did actually come across a library that uses remote events in a more optimized way but I don’t want to rely on 3rd party libraries since I might write my own library instead to use for multiple projects.

Trying to learn techniques instead of just copying what someone else already did.

Maelstorm_1973 · July 14, 2023, 4:55am

If you’re sending a large amount of data at one time, then compression makes sense. The LZW table is 256 integers which contains the counts for each character that appears in the data stream. You just send the table along with the compressed data and the receiving end recreates the tree.

GZIP style algorithms work by creating a dictionary. However, to keep the dictionary small and dynamic, a new dictionary is created for every 8KB to 16KB of data or so. This allows the algorithm to adapt to content changes in the data stream. For example, take an ELF executable which has a multitude of different types of data. The binary data won’t compress very well. However, text data does compress well.

With data compression, the more data you have, the more it makes sense to use it. My understanding is that Roblox converts data to JSON before sending it on the wire. They might even compress and encrypt it. It would make sense to do so. Have you studied any materials relating to information theory and information content? I have some books on data compression. They are outdated, but they cover the basics of what you need to know.

C_Corpze · July 14, 2023, 12:47pm

Roblox converting stuff to JSON before sending it through a remote did leave me with questions.
I saw a post earlier that showed how much data every data type in Roblox used.

If I recall it was something like…
A number was 9 bytes.
A string 2 bytes + it’s length.

A Vector3 was roughly 12 or 14 bytes which surprised me because that implies that sending a vector is cheaper than 3 separate numbers.
As 3 x 9 would be 27 bytes, which is way more than a Vector3 uses apparently.

What ESPECIALLY baffled me is that sending a boolean, a value that is normally only 1 bit and can only be 1 or 0, takes up…

4 bytes

Yes a boolean apparently is 4 bytes, which left me wondering if the JSON theory is true.
because the word “true” on itself already is 4 characters long which would make up for the 4 bytes.

But "false"is actually 5 characters but if I recall, a boolean uses 4 bytes regardless of it’s value which is really weird because if Roblox truly does put everything in a JSON table then this should be 5 bytes, right?

I did stumble upon an public library/resource called BridgeNet2.
And apparently this library is really good at optimizing networking and I’m just trying to figure out how it manages to use less data somehow.

I’ve looked through it’s code on GitHub but can’t exactly find in what script or which function does the “compression” or “optimizing” of data.
I don’t want to rely on 3rd party libraries because I want to develop my own at some point likely and not straight up copy what someone else wrote.
I seek to learn how things work so I can eventually pass on the knowledge.

Blank remote call: ~9 bytes

string (len 0): 2 bytes
string (len 1): 4 bytes
string (len 2): 8 bytes
string (len 3): 9 bytes
string (len 4): 10 bytes
string (len 5): 11 bytes
string (len 6): 12 bytes
string (len 8): 14 bytes
string (len 16): 22 bytes
string (len 32): 36 bytes

boolean: 2 bytes
number: 9 bytes

table (empty): 2 bytes
table (array with 4 numbers): 38 bytes

EnumItem: 4 bytes
Vector3: 13 bytes
CFrame (axis aligned): 14 bytes
CFrame (random rotation): 20 bytes

I did find this post, by @Tomarty.

Oh, here it apparently says a boolean is just 2 bytes, huh? Maybe I got 2 sources mixed up.
But that is still a lot of bytes for something that is basically only on or off.

A CFrame apparently is 20 bytes which also absolutely blows my mind because CFrames hold a position which is 3 values AND a rotation which in Euler form (I think) also has 3 or 4 values depending on if it uses quaternions.

6 numbers should be 6 x 9, right? ~~funny sum.~~
Wouldn’t that be 54 bytes? That’s more bytes than a string with 32 characters.
Howwwwwww?

Does this imply I could compress strings by putting characters inside CFrame components?
The more I learn, the less I seem to know about the subject.
Seems like I might not know as much as I initially knew.
Roblox engine under the hood surely has it’s mysteries.

Maelstorm_1973 · July 14, 2023, 3:15pm

Here’s the thing. I ran into a problem where some datatypes are fixed size. A Vector3 for instance, even though it says its a number isn’t big enough to store a UserId (yes, I tried it). So based on this and other things. I’m thinking that it’s a modified form a JSON or a proprietary data format. I’m leaning towards the latter.

struct dataframe
{
	int datatype;
	uint32_t size;
	char data[1];
};

Basically, the datatype determines the size of the data. Then it’s placed in a memory buffer and the entire buffer is sent. They might even be using a union to do this. In any case, this is a common hack in C/C++ when dealing with different types and lengths of data within the same memory buffer.


#define		TYPE_STRING	27

char *buffer = malloc(65536);
uint32_t index = 0;

void packData(char *buffer, uint32_t index, int datatype, void *data)
{
	/* Setup */
	uint32_t size;
	uint32_t i;
	dataframe *dfptr;
	char *charptr;
	uint32_t txa;

	/* Pack the data */

	/* Set the pointer */
	dfptr = buffer + index;

	/* Determine the size of the data */
	if (datatype == TYPE_STRING)
	{
		size = strlen(data);
	}
	else
	{
		size = getDatatypeSize(datatype);
	}

	/* Fill the structure */
	dfptr->datatype = datatype;
	dfptr->size = size;

	/* Copy the data over */
	charptr = &dfptr->data;
	for (i = 0; i < size; i++)
	{
		charptr[i] = (char *)data[i];
	}

	/* Compute sizes */
	txa = sizeof(int) + sizeof(uint32_t) + size;

	/* Return */
	return(txa);
}

Something like that. It’s in C but that’s how I would do it. The return value gets added to the index so its pointing to the byte after the structure in the buffer. That would be the most expeditious way to do it without compression. This works for both fixed and variable data types, although the only variable data type is string.

As for boolean values taking 4 bytes, there’s a reason for that. CPUs cannot access individual bytes in memory. They have to read an entire word from memory (It’s actually more complicated than that. It’s actually a block of 128 bytes, but that’s a topic for another conversation.). Because of it, they assign an entire machine word to it. There’s also issues with memory address alignment. In most CPUs since the 80286 or so, the lower order address bits (A0, A1, A2, A3) are missing, so memory is addressed on a 32-bit boundary. Because of this data structures must be aligned on a 4-byte boundary. It’s actually more efficient hardware wise because of the way memory is organized. Each bit in the RAM is on a separate chip, and is 32 or 64 bits wide. So it makes sense to do it this way because of parallelism.

Remember that 128-byte block? That’s the size of a cache line. When data is read from disk, it’s a full memory page (4096 bytes) that gets read into memory, and it’s aligned on a page boundary. Once in memory, the cache hardware reads in the data in 128 byte blocks depending on what the CPU is addressing. The data propagates from main memory to the L3 cache (if equipped), then the L2 cache, then the L1 cache. The L3 and L2 caches run at FSB (Front Side Bus) speed. The L1 cache runs at CPU core speed since the L1 cache is on the CPU chip itself. Once in the L1 cache, the CPU can access the data, but only 4 bytes at a time. When the data is in a CPU register, then it can access individual bytes using byte register operands on the instructions. Furthermore, there are separate caches for program and data.

C_Corpze · July 14, 2023, 3:43pm

This does have me wondering.
Since a boolean must use an X amount of bytes.

Wouldn’t it be possible to map 8 - 16 booleans to just a single set of bytes though? Why not do that?
I could see maybe memory address and whatnot becoming a problem.

But say you have an environment in which the same 8 - 16 booleans are always present and used.
Why not let them all share the same memory address and let them each use one bit of a byte?

I feel like you could already do this by allocating just enough memory for a 8-bit or 16-bit number and manipulating/reading individual bits in a C++ program?

Ilucere · July 14, 2023, 6:28pm

my post covers some bit manipulation, and how i put head rotation into a single 8 byte number and compressed it to 4 bytes, for maximum efficiency and sending it extremely small to the remote event

in addition i covered binary it in these messages

“mf” is cool kid lingo for “my friend”