Print Page - Branchless Conditional Code

Title: Branchless Conditional Code
Post by: bozo on November 09, 2007, 10:47:48 PM

Can anyone here point me in the right direction of branchless code examples?
i think it would be useful for various purposes.
some good examples i wrote were lost in hard drive accident.

say conditional jumps, where the length of bytes to jump is stored in a register, rather than as code label

normal conditional test, pretty bad example, off top of my head.

Code Select

test_result:
    test eax,eax
    je false

true:
   mov eax,12345678h
   jmp end_routine
false:
  mov eax,0abcdefh
end_routine:
  ret

Code Select

test_result:
    call @get_eip
@get_eip:
    pop ebp

    shl eax,shift_byte      ; multiples of 2, depending on how many bytes required for true condition.
    lea ecx,[ebp + eax + (@get_eip - false)]
    jmp ecx
true:
   mov eax,12345678h
   jmp end_routine
   ; padded bytes            ; depends on number of bytes executed for true condition
false:
  mov eax,0abcdefh
end_routine:
  ret

hopefully might make some sense to someone...i'll try upload real working examples later.
but the only problem is calculating the correct offsets/padding out bytes.

the purpose? optimizations, anti-debug are 2 i can think of.

Title: Re: Branchless Conditional Code
Post by: drizz on November 10, 2007, 12:41:28 AM

Quote from: Kernel_Gaddafi on November 09, 2007, 10:47:48 PM
Code Select Expand
test_result: Â Â test eax,eax Â Â je false true: Â Â mov eax,12345678h Â Â jmp end_routine false: Â mov eax,0abcdefh end_routine: Â ret

Code Select

cmp eax,1
sbb eax,eax
and eax,0abcdefh-12345678h
add eax,12345678h

Title: Re: Branchless Conditional Code
Post by: bozo on November 12, 2007, 07:44:01 AM

you gave good answer, drizz - its just that i gave bad example.
still didn't get around to creating a good example yet, didn't have time..

Title: Re: Branchless Conditional Code
Post by: Rockoon on November 12, 2007, 08:27:34 AM

as far as the BRANCHING methodology, this form is superior:

test eax, eax
mov eax, truevalue
jne @@skip
mov eax, falsevalue
@@skip:

only a single branch (here, only branches when true), and with one less instruction it is also smaller ..

Title: Re: Branchless Conditional Code
Post by: Mark Jones on November 12, 2007, 04:08:10 PM

If the jump is normally taken, a small cache improvement can be made in loops by prefixing the conditional jump with a branch hint, i.e.:

Code Select


T MACRO _jump:VARARG                    ; Branch Hint: Taken
    DB 2Eh
    _jump
ENDM

NT MACRO _jump:VARARG                   ; Branch Hint: Not Taken
    DB 3Eh
    _jump
ENDM

.code
start:
    mov eax,input("Enter a number: ")
    cmp byte ptr [eax+1],10             ; break on pressing enter
NT  jz gb                               ; and exit
    cmp byte ptr [eax],"-"              ; do not allow negative numbers
NT  jz start                            ; 
...
T   jl sl                               ; loop if factor less than nVal
    test si,si                          ; were there any factors?
T   jnz dn                              ; if so, jump to done
    print "PRIME! ",0                   ; else show there were no factors
dn: print chr$(8,20h,10,10)             ; done factoring, newlines
    jmp start                           ; and back to beginning
    
gb: ret                                 ; exit gracefully

Although this obviously uses one additional byte.

Title: Re: Branchless Conditional Code
Post by: bozo on November 12, 2007, 08:04:57 PM

sorry still don't have good example yet, currently at work.
the aim is to eliminate all conditional jumps/calls..anything that addresses labels.
it is possible, but a little tricky..all adresses are created at runtime, rather than etched in memory.
i'll try do something tonight...

Title: Re: Branchless Conditional Code
Post by: Rockoon on November 12, 2007, 08:16:20 PM

you mean like jump tables?

Title: Re: Branchless Conditional Code
Post by: Alloy on November 13, 2007, 02:31:19 AM

The book "32/64-bit 80x86 Assembly Language Architecture" by James Leiterman has a chapter on branchless code. Along with the branch hints it discussed using bit manipulations to avoid branches. It looks like alot more code than using hints or jump tables.

Title: Re: Branchless Conditional Code
Post by: bozo on November 13, 2007, 04:35:40 AM

Code Select


WIN32_LEAN_AND_MEAN equ 1

.686
.xmm
.model flat,stdcall

include <windows.inc>
include <msvcrt.inc>
include <stdio.inc>

includelib <kernel32.lib>

do_padding macro code_size, flag
Â  Â local pad_size
Â  Â local count

Â  Â pad_size = 1
Â  Â count = 0Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; would hold how many bits to shift

Â  Â while pad_size lt code_size
Â  Â  Â  pad_size = (pad_size SHL 1)
Â  Â  Â  count = count + 1
Â  Â endm

Â  Â if flag eq 1
Â  Â  Â  db (1 shl count) - code_size dup (90h)
Â  Â endif

endm

pCreateFileAÂ  equ <esi+4*0>
pWriteFileÂ  Â  equ <esi+4*1>
pCloseHandleÂ  equ <esi+4*2>
pprintfÂ  Â  Â  Â equ <esi+4*3>
pExitProcessÂ  equ <esi+4*4>

Â  Â  .code

main:
Â  Â sub esp,4*5
Â  Â mov edi,esp
Â  Â push edi
Â  Â mov eax,[CreateFileA]Â  Â  Â  Â  Â  ; these api addresses could optionally be calculated
Â  Â stosdÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; at runtime
Â  Â mov eax,[WriteFile]
Â  Â stosd
Â  Â mov eax,[CloseHandle]
Â  Â stosd
Â  Â mov eax,[printf]
Â  Â stosd
Â  Â mov eax,[ExitProcess]
Â  Â stosd
Â  Â pop esi

Â  Â call @geipÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; only call once
@geip:
Â  Â pop ebp
Â  Â lea ebp,[ebp + (begin_call - @geip)]
Â  Â ; ebp = current eip
Â  Â lea ebp,[ebp + (end_call - begin_call)]
Â  Â ; ebp = end_call address
begin_call:
Â  Â push 0
Â  Â push FILE_ATTRIBUTE_NORMAL
Â  Â push OPEN_EXISTING
Â  Â push 0
Â  Â push FILE_SHARE_WRITE
Â  Â push GENERIC_WRITE
Â  Â push CStr(<'file.txt'>)
Â  Â push ebpÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; save return address on stack
Â  Â jmp dword ptr [pCreateFileA]
end_call:
Â  Â ; we end back here..

Â  Â test eax,eaxÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; test for INVALID_HANDLE_VALUE
Â  Â xchg eax,ebxÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; save handle in ebx

Â  Â sets alÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; js if not opened
Â  Â and eax,0ffhÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; clear upper 24-bits

Â  Â ; this is part i haven't been able to pre-calculate ..unless you know?
Â  Â ; set to 6, for 64 bytes

Â  Â shl eax, 6Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  ; multiply depending on pad_size
Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; do_padding macro calculates this value
Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; but no way to forward reference it!
Â  Â lea eax,[ebp + eax + (@opened - end_call)]
Â  Â jmp eaxÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; jump on condition
Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; between @opened and @not_opened
Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; we have space for 32 bytes of instructions
@opened:
Â  Â lea ebp,[eax + (@end_write - @opened)]
Â  Â push 0
Â  Â push esp
Â  Â push 14Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; write 14 bytes to file
Â  Â push CStr(<'Hello,World!',13,10>)
Â  Â push ebx
Â  Â push ebp
Â  Â jmp dword ptr [pWriteFile]
@end_write:
Â  Â pop ecxÂ  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â  Â ; length of bytes written

Â  Â lea ebp,[ebp + (@exit_code - @end_write)]Â  Â ; return to @not_opened address
Â  Â push ebx
Â  Â push ebp
Â  Â jmp dword ptr [pCloseHandle]
@end_opened:

Â  Â do_padding (@end_opened - @opened), 1

@not_opened:
Â  Â lea ebp,[eax + (@exit_code - @not_opened)]

Â  Â push CStr(<10,'file not opened'>)
Â  Â push ebp
Â  Â jmp dword ptr [pprintf]

@exit_code:
Â  Â push 0
Â  Â jmp dword ptr [pExitProcess]

Â  Â end main

Â i know what some of you are thinking... :P

Â what the hell is that?? well, i wanted to write some examples that were anti-decompile without using complex
Â polymorphic/metamorphic or compression code - i guess i would have to write my own assembler to make the job
Â easier..just wouldn't make sense to do all this manually.

Â if you process through decompiler, it won't give meaningful results, since they usually rebuild loop/conditional structures
Â based on address labels.its not possible to analyse code with flowchart unless there are address labels for each conditional test.

of course, it would probably be quite easy for someone to re-calculate the appropriate addresses, and insert JE/JNE/JS/JNS..etc where applicable, but that wouldn't be easy for everyone.

maybe it is still not the best of examples, but its best i could do for now..

Quote32/64-bit 80x86 Assembly Language Architecture

i must check this out!

Title: Re: Branchless Conditional Code
Post by: drizz on November 14, 2007, 02:44:07 AM

Hi Kernel_Gaddafi,

nice ideas, you could simplify it by using macros.

Code Select

_JS macro yes:req, no:req
	cdq
	and edx,offset yes-offset no
	lea eax,[edx+offset no]
	jmp eax
endm

_JZ macro yes:req, no:req
	cmp eax,1
	sbb eax,eax
	and eax,offset yes-offset no
	add eax,offset no
	jmp eax
endm

start:
		int 3

		mov eax,-1;;1;;result from createfile
		_JS notopened,opened;
opened: 	nop
		jmp @F
notopened:	nop
@@:
		mov eax,0
		_JZ _zero,not0
not0: 		nop
		jmp @F
_zero:		nop
@@:
end start

Title: Re: Branchless Conditional Code
Post by: bozo on November 14, 2007, 03:12:43 PM

nice one, drizz.
i'll test this out and see how well it works.
cheers!

Title: Re: Branchless Conditional Code
Post by: Shell on November 16, 2007, 04:58:22 AM

@Kernel_Gaddafi: Hello Momar, erm I mean Kernel :green2 Having seen your last few topics, it appears we are both after the same thing - code obfuscation. Now I'm not sure what you intend to do with these little gems (posts) but my current project is a garden variety "YET ANOTHER PE PROTECTOR" :eek albeit with a few twists - my favorite among which is total abuse of SEH/VEH for everything from function calling to anti-tricks, to ring0 switching, etc.

This topic has opened my eyes to new posibilties. I hope you don't mind if I "borrow" some of these ideas :bg Forget about the Leiterman book (unless you're looking for a few laughs). Your example is several degrees of magnitude more advanced than what the book's author achieved with the entire chapter (all three source listings worth ::) )

@drizz: Elegant and straight to the point as always :U

@Mark Jones: Thanks for the branch hint example usage. They will come in very handy.

Title: Re: Branchless Conditional Code
Post by: u on November 16, 2007, 02:34:12 PM

What about the CMOVXX instructions?

Title: Re: Branchless Conditional Code
Post by: bozo on November 17, 2007, 04:51:12 AM

@Shell

if you can use the code i write for anything, i've no problem with that.
as far as using the obfuscated code for anything, i was just curious about how to make it work - no use for it really.
i just think it would be interesting to write code that had nothing but direct jmps, with the addresses calculated dynamically, stored in a register.
it would be alot easier on x64 cpu, since you can access rip register directly.

The MASM Forum Archive 2004 to 2012

General Forums => The Laboratory => Topic started by: bozo on November 09, 2007, 10:47:48 PM