obfuscation Featured

Branch Encryption

Usage of branch encryption in various obfuscation contexts.

Pixelmelt

12 Mar 2024 • 3 min read

Photo by Andy Luo / Unsplash

Usage with virtualization based obfuscation

Sources:

Mesh design pattern: hash-and-decrypt - Archive
loski2619 on the Sneaker Development Discord

This technique prevents future states of a program from being known before they are reached.

In a proper implementation, an attacker wouldn't be able to infer previous states of a program it hasn't tracked, at the same time, the attacker wouldn't be able to jump into states which require non-constant values to be executed (unless it can alter the control flow to visit those nodes and has the proper key to them) - loski2619

Say you want to protect the following code that will reveal a secret only to those who know the password.

let usrinpt = input("what is the password?")
if(usrinpt == "42"){
	console.log("the cake is a lie")
} else {
	console.log("denied")
}

You put it through your virtual machine obfuscator and it outputs a VM and the following instructions in bytecode.

STRING "what is the password?"
LOAD 0
STRING "42"
EQUIVALENT
# If statement flow
JUMP_IF_FALSE IF_ELSEBLOCK
STRING "the cake is a lie"
PRINT # Logic to call console.log would be here but its not relevent
JUMP IF_END
LABEL IF_ELSEBLOCK
STRING "denied"
PRINT
LABEL IF_END
STOP_VM

Now do you see the issue here? Our secret is in plain text! Sure you might have a complex string decoding algorithm or something but the issue still exists, the attacker will always be able to recover our secret without knowing the password. (this applies to any constant data defined in your code, not just strings)

What if I told you that there is a solution to our problem that would lock everything inside the if statements true condition behind an impenetrable wall? We can use hashing and encryption to do this.

Currently, we are doing something like

if(input == "42") {
	run_code()
} else {
	run_code()
}

Which allows the attacker to see what the valid password is by looking at what we compare the user input to, by including "42" in the comparison there is no way to hide the password since the attacker must execute the comparison to run the program.

But we can change the code to not directly include the comparison

if(hash(input) == hashedPassword) {
	run_code()
} else {
	run_code()
}

Since "42" is a constant that is never rewritten and immediately gets discarded after usage, that means our compiler definitively knows what its value during runtime will be. The compiler can replace the constant "42" with the output of a hash function with the value as input. Then instead of comparing the user input to the valid key, we can compare the hash of the valid key to the hash of the user input.

This is all well and good but we have one more issue, the valid key may be hidden now, but the secret is still recoverable by the attacker if they read the source code.

We can fix that by encrypting the code inside the if statement with the correct password as a key, and then during runtime we can try to decrypt the code with the users input. (note that this is not the hash of the input)

if(hash(input) == hashedPassword) {
	decrypt_and_run_code(input)
} else {
	run_code()
}

Our bytecode now looks something like this

STRING "what is the password?"
LOAD 0
CALL_HASH_FUNCTION # Logic to call the hash function would be here but is not relevent
STRING "imtheoutputofahashfunction" # This used to be "42"
EQUIVALENT
LOAD 0
# If statement flow
JUMP_IF_FALSE IF_ELSEBLOCK
DECRYPT_BLOCK startIndex endIndex # Will decrypt the code between the JIF and label
ENCRYPTED_INSTRUCTION
ENCRYPTED_INSTRUCTION
ENCRYPTED_INSTRUCTION
LABEL IF_ELSEBLOCK
STRING "denied"
PRINT
LABEL IF_END
STOP_VM

Look at that, now the only way for our secret to be known is for the user to know the exact password used to decrypt the code.

Usage with plain Javascript

Here is an example of how this would look implemented with normal JS, given this implementation is weak because the code is stored as a string then eval'ed and it does not use a strong hash or encryption function.

function hash(input) {
	return input.split("").reduce((a, b) => a + b.charCodeAt(0), 0);
}

function xorcrypt(text, key) {
	let result = "";
	for (let i = 0; i < text.length; i++) {
		result += String.fromCharCode(text.charCodeAt(i) ^ key);
	}
	return result;
}

function checkPassword(userInput) {
	if (hash(userInput) === 1235) {
		// decrypted value: console.log('the cake is a lie')
		eval(xorcrypt("ҰҼҽҠҼҿҶӽҿҼҴӻӴҧһҶӳҰҲҸҶӳҺҠӳҲӳҿҺҶӴӺ", hash(userInput)));
	} else {
	    console.log("Access denied");
	}
}

checkPassword("wrongpass"); // Access denied

checkPassword("secretpass123"); // the cake is a lie

Where to use

This technique is most applicable when you need to protect both:

The comparison value itself (like the password "42")
The code/data that should only be accessible when the comparison succeeds