Conclusion
First, be warned that there have been quite a few changes between 4.0.0 and 4.2.16 (which seems to be the latest version).
The scheme starts with a staggering overhead of 188 characters for 4.2 and about 244 for 4.0 (assuming I did not forget any newlines and such). So to be safe you will probably need on the order of 200 characters for 4.2 and 256 characters for 4.0, plus 1.8 times the plaintext size, if the characters in the plaintext are encoded as single bytes.
Analysis
I just looked into the source code of Laravel 4.0 and Laravel 4.2 with regard to this function. Let's get into the size first:
- the data is serialized, so the size of the encrypted data depends on the type of the value (which is probably a string);
- the serialized data is PKCS#7 padded for Rijndael 256 or AES, which means adding 1 to 32 bytes or 1 to 16 bytes, depending on whether 4.0 or 4.2 is used;
- this data is encrypted with the key and an IV;
- both the ciphertext and IV are separately converted to base64;
- an HMAC using SHA-256 is calculated over the base64 encoded ciphertext, returning a lowercase hex string of 64 characters;
- the final output then consists of
base64_encode(json_encode(compact('iv', 'value', 'mac')))
(where the value is the base 64 ciphertext and mac is the HMAC value, of course).
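To make the shape of that final value concrete, here is a minimal Python sketch of the steps above. It is not Laravel's code: random bytes stand in for the real ciphertext, build_envelope is a hypothetical helper name, and Python's json.dumps does not escape "/" the way PHP's json_encode does by default, so real output may be a few characters longer.

```python
import base64
import hashlib
import hmac
import json
import os


def build_envelope(padded_len: int, block_size: int, key: bytes) -> str:
    """Mimic the layout of the payload; no real encryption is performed."""
    # The IV is one cipher block; random bytes stand in for the ciphertext.
    iv = base64.b64encode(os.urandom(block_size)).decode("ascii")
    value = base64.b64encode(os.urandom(padded_len)).decode("ascii")
    # HMAC-SHA-256 over the base64 encoded ciphertext, as lowercase hex.
    mac = hmac.new(key, value.encode("ascii"), hashlib.sha256).hexdigest()
    # Compact JSON without whitespace (assumption: this matches json_encode).
    body = json.dumps({"iv": iv, "value": value, "mac": mac},
                      separators=(",", ":"))
    # The whole JSON document is base64 encoded once more.
    return base64.b64encode(body.encode("ascii")).decode("ascii")


if __name__ == "__main__":
    print(len(build_envelope(16, 16, b"k" * 32)))  # one padded AES block
```

Under these assumptions, one padded 16 byte block gives 188 characters and one padded 32 byte block gives 244.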
A string in PHP is serialized as s:<i>:"<s>"; where <i> is the size of the string and <s> is the string itself (I'm presuming PHP platform encoding here with regard to the size). Note that I'm not 100% sure that Laravel doesn't use any wrapping around the string value; maybe somebody could clear that up for me.
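As a rough illustration of that format, the following Python sketch reproduces the serialized form of a single string. The helper name php_serialize_str and the UTF-8 default are my assumptions; the point is that the length field counts bytes, so multi-byte characters increase it.

```python
def php_serialize_str(text: str, encoding: str = "utf-8") -> bytes:
    """Return the PHP serialize() form of one string: s:<i>:"<s>";"""
    raw = text.encode(encoding)
    # <i> is the length in bytes, not in characters.
    return b's:' + str(len(raw)).encode("ascii") + b':"' + raw + b'";'


if __name__ == "__main__":
    print(php_serialize_str("abc"))
    print(len(php_serialize_str("")))  # fixed overhead for an empty string
```

The empty string already costs 7 bytes, and every extra digit in the length adds one more.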
Calculation
All in all, everything depends quite a lot on character encoding, so it would be rather risky for me to give a precise estimate. Let's assume a 1:1 relation between byte and character for now (e.g. US-ASCII):
- serialization adds up to 9 characters for strings up to 999 characters
- padding adds up to 16 or 32 bytes, which we assume are characters too
- encryption keeps data the same size
- base64 in PHP creates ceil(len / 3) * 4 characters, but let's simplify that to (len * 4) / 3 + 4; the base64 encoded IV is 44 characters for 4.0 (32 byte IV) and 24 characters for 4.2 (16 byte IV)
- the full HMAC is 64 characters
- the JSON encoding adds 3*5 characters for quotes and colons, plus 4 characters for the braces and commas around them, totaling 19 characters, plus another 10 characters for the key names iv, value and mac (I'm presuming json_encode does not add any whitespace here); the outer base64 encoding again adds the same overhead
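The steps in this list can be added up mechanically. Below is a small Python sketch of that arithmetic, assuming a single-byte encoding, a plain serialized string without extra wrapping, and JSON without whitespace or escaped slashes; encrypted_size and b64_len are names I made up for the illustration.

```python
def b64_len(n: int) -> int:
    """Length of the base64 encoding of n bytes: ceil(n / 3) * 4."""
    return 4 * ((n + 2) // 3)


def encrypted_size(plain_bytes: int, block_size: int) -> int:
    """Estimate the output length for a string of plain_bytes bytes."""
    # s:<i>:"<s>";  adds the digits of <i> plus 6 fixed characters.
    serialized = len(f's:{plain_bytes}:"') + plain_bytes + len('";')
    # PKCS#7 always pads, adding 1 to block_size bytes.
    padded = (serialized // block_size + 1) * block_size
    value = b64_len(padded)      # base64 encoded ciphertext
    iv = b64_len(block_size)     # base64 encoded IV (one block)
    mac = 64                     # hex encoded HMAC-SHA-256
    # 19 characters of JSON punctuation plus 10 for the three key names.
    json_len = 29 + iv + value + mac
    return b64_len(json_len)


if __name__ == "__main__":
    print(encrypted_size(0, 16))  # 4.2 style, 16 byte blocks
    print(encrypted_size(0, 32))  # 4.0 style, 32 byte blocks
```

For an empty string this yields 188 with 16 byte blocks and 244 with 32 byte blocks, which agrees with the figures in the conclusion.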
OK, so I'm getting a bit tired here, but you can see that base64 encoding expands the plaintext at least twice. In the end it's a scheme that adds quite a lot of overhead; they could just have used base64(IV|ciphertext|mac) to seriously cut down on it.
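To get a feel for the difference, here is a hedged Python comparison of the two layouts. I'm assuming the compact variant would concatenate the raw IV, the raw ciphertext and a raw 32 byte MAC before a single base64 pass; the function names are hypothetical.

```python
def b64_len(n: int) -> int:
    """Length of the base64 encoding of n bytes."""
    return 4 * ((n + 2) // 3)


def nested_size(padded_len: int, block_size: int) -> int:
    """base64(json(base64(iv), base64(ciphertext), hex(mac)))"""
    inner = 29 + b64_len(block_size) + b64_len(padded_len) + 64
    return b64_len(inner)


def compact_size(padded_len: int, block_size: int, mac_len: int = 32) -> int:
    """base64(IV | ciphertext | mac) using raw bytes throughout."""
    return b64_len(block_size + padded_len + mac_len)


if __name__ == "__main__":
    for padded in (16, 1024):
        print(padded, nested_size(padded, 16), compact_size(padded, 16))
```

With one padded AES block this gives 188 versus 88 characters, and for 1024 padded bytes 1980 versus 1432.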
Notes
- if you're not on 4.2 now, I would seriously consider upgrading to the latest version because 4.2 fixes quite a lot of security issues
- the sample code uses a string as key, and it is unclear if it is easy to use bytes instead;
- the documentation does warn against key sizes other than the Rijndael defaults, but forgets to mention string encoding issues;
- padding is always performed, even if CTR mode is used, which kind of defeats the purpose;
- Laravel pads using PKCS#7 padding, but as the serialization always seems to end with a ";" character, that was not really necessary;
- it's a nice thing to see authenticated encryption being used for database encryption (the IV wasn't used, fixed in 4.2).