Typically, a TLS server uses a Certificate and associated Private Key to sign TLS session. From now on I'll call this Private Key a "traffic- private-key". Both certificate and traffic-private-key form an asymmetric cryptographic key-pair. Revealing the traffic-private-key makes it possible to perform men-in-the-middle type of attacks. Typically traffic-private-key is stored on the server's hard disk. Even if traffic-private-key is stored in encrypted form, at some point HTTPS server needs to have a possibility to decrypt it to use for signing. It means that at runtime the key in the plaintext will be available in memory of an HTTPS process. At this point attacker with access to the machine may be able to dump the memory of the process and learn the traffic-private-key.
Hence, server operators need to take special care to make sure traffic-private-keys are not revealed.
This situation gets more complicated in cases when a server operator and domain owner are 2 different entities. For example in the case of CDN, TLS offloading happens on the edge system - which often is a completely different machine than the actual application server. Also, it is often the case that servers (physical machines) of CDN provider are spread over the world and are located in remote data centres. Those data centres may be owned by multiple different entities.
In such situations, the problem of ensuring that the traffic-private-key is not copied and used by an attacker may be challenging and not obvious to solve. Clients of the CDN may also be concerned about the idea of spreading the traffic-private-key over the world.
For brevity, I'm assuming server uses only TLS 1.3 as specified in [RFC8446], but the solution can be adapted to any version of TLS.
The idea is to perform TLS session signing inside the Trusted Execution Environment. The traffic-private-key will be accessible only to TEE. Additionally, the solution ensures that the key is stored in encrypted form in trusted storage. The storage is bound to the physical machine and hence a copy of the storage can’t be used on some different machine.
The solution as implemented in the PoC and described below is based on ARM TrustZone and it uses open-sourced TEE called OP-TEE (see [OP-TEE]), sources of OP-TEE are stored on GitHub (see [OP-TEE-SRC]). OP-TEE was driven by the fact that the author is quite familiar with the environment nevertheless it can be implemented with other TEEs which provide device bound trusted storage. The author is convinced that Intel SGX with Asylo would be a better choice here. The solution makes also heavy use of BoringSSL for handling with TLS traffic.
Points below describe the implementation in more details:
-
The key provisioning server
It is assumed that machine is initially provisioned with a software which acts as a server for traffic-private-key provisioning.
To install traffic-private-key on a machine, the operator connects to the key provisioning server and sends the traffic-private-key to be installed on the machine. This operation is done over TLS connection which uses client authentication. Position of some form of TLS provisioning is required by the operator. The key provisioning server must be able to verify provisioning key, hence verification-provisioning-key is also preinstalled.
After sucessuful TLS authentication, operator sends a pair of traffic-private-key and domain name for which the key must be used. This pair is installed on secure storage which accessible from TEE only. TEE ensures traffic-provisioning-key can't be read from outside of TEE.
-
TLS session signing
Solution uses BoringSSL to offload TLS traffic. BoringSSL API gives a possibility to register a function which is called during TLS handshake, when server needs to sign a session with traffic-private-key.
It means that there are no modifications needed to BoringSSL in order to use it for signing TLS session with traffic-private-key stored in TEE.
The code which registers signing operation looks like this:
void signing_operation(message_to_sign, domain_name, *signature) { ... calls TEE for signing ... } SSL_PRIVATE_KEY_METHOD private_key_methods { .sign = signing_operation .decrypt = ... .complete = ... }; SSL_CTX_set_private_key_method(SSL_CTX, &private_key_methods)
TLS server calls
signing_operation
function when TLS session needs to be signed. This function passesmessage_to_sign
anddomain_name
to the TEE. While in the TEE, thedomain_name
is used as an index in order to retrieve right traffic-private-key (many domains can be handled by the server). TEE performs signing andsignature
is returned to the BoringSSL. BoringSSL continues TLS handshake as normal. -
Key can't be used on another machine.
Trusted storage in OP-TEE is bound to the physical device. It means even if the storage is coppied to another device, it won't be possible to decrypt stored data.
In more detail, OP-TEE implements GlobalPlatform Trusted Storage API. Device binding is one of the requirements for trusted storage. In order to make it possible each device needs to come with preinstalled Hardware Unique Key (HUK).
More details about trusted storage can be found on in OP-TEE documentation (see [OP-TEE-STORAGE]).
It must be mentioned, that in order to use trusted storage, SoC specific customization is needed (see comment in orange at the bottom of [OP-TEE-STORAGE]).
-
RPMB
TODO
As mentioned before, implementation uses OP-TEE as a base for TEE. Example was tested with OP-TEE running inside QEMU emulating ARMv8.
PoC is composed of:
-
admin_cli
: Client used for installing the private keys inside TEE. This component is used instead ofkey provisioning server
as such server was not implemented in PoC. -
server
: It is a TLS offloading server. Server listens on127.0.0.1:443
and uses BoringSSL to accept and handle TLS connection. Server implements function callback, which calls TEE when private key operation needs to be done. Only ECDSA/P256 sining is currently supported. -
ta
: Trusted application running inside TEE. The application is responsible for processing requests fromadmin_cli
, which is storing the keys on trusted storage and deleting them if requested. As well as processing signing requests from theserver
.
The section called "Example of usage" explains how to use the software in details.
Following steps need to be taken to install the software:
-
OP-TEE building. This step is explained in details (here)[https://optee.readthedocs.io/building/gits/build.html#get-and-build-the-solution]. It is required to perform steps 1 to 5. The
TARGET
(see the building instruction) used by this example is calledQEMUv8
. In case OP-TEE is started after step 5, it has to be stopped. -
Next steps assume that Linux operating system is used and OP-TEE has been cloned to the directory called
OPTEE_DIR
. -
Create directory
/tmp/tee_shared
-
Go to
OPTEE_DIR
directory. -
Clone
git clone https://git.amongbytes.com/kris/c3-tls-sign-delegator.git projects
-
Compile BoringSSL for aarch64 and native system:
cd OPTEE_DIR/projects/bssl; make
. Makefile is configured to use toolchain build in step 1. This step will also build BoringSSL for host machine, it requires all dependencies for building BoringSSL are installed (see [BORING-BUILD]). -
Compile solution:
cd OPTEE_DIR/projects/delegator; make
-
Starting OP-TEE:
At this point it is best to refer to video 1 on youtube as step is quite complicated.
<iframe width="560" height="315" src="https://www.youtube.com/embed/02klEwlsJIA" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>User needs to:
-
Enter build directory:
cd OPTEE_DIR/build
-
Start QEMU with OP-TEE emulation:
make QEMU_VIRTFS_ENABLE=y QEMU_USERNET_ENABLE=y QEMU_VIRTFS_HOST_DIR=/tmp/tee_share HOSTFWD=",hostfwd=tcp::1443-:1443" run-only
. -
Just after qemu starts it will pause with following prompt:
cd /home/hdc/repos/optee/qemuv8/build/../out/bin && /home/hdc/repos/optee/qemuv8/build/../qemu/aarch64-softmmu/qemu-system-aarch64 \ -nographic \ -serial tcp:localhost:54320 -serial tcp:localhost:54321 \ -smp 2 \ -s -S -machine virt,secure=on -cpu cortex-a57 \ -d unimp -semihosting-config enable,target=native \ -m 1057 \ -bios bl1.bin \ -initrd rootfs.cpio.gz \ -kernel Image -no-acpi \ -append 'console=ttyAMA0,38400 keep_bootcon root=/dev/vda2' \ -fsdev local,id=fsdev0,path=/tmp/tee_share,security_model=none -device virtio-9p-device,fsdev=fsdev0,mount_tag=host -netdev user,id=vmnic,hostfwd=tcp::1443-:1443 -device virtio-net-device,netdev=vmnic QEMU 3.0.93 monitor - type 'help' for more information (qemu)
User needs to continue the process by entering
c
(qemu) c
After a while 2 additional terminals should appear - one terminal labeld as "Normal", running linux and another terminal labeled as "Secure" with output from the TEE.
-
-
In the "Normal World" terminal user needs to mount file system to share data between guest and host machine. Following command needs to be used:
mount -t 9p -o trans=virtio host /mnt
See video 2:
<iframe width="560" height="315" src="https://www.youtube.com/embed/5psOtKtdlWI" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe> -
Install Trusted Application inside OP-TEE:
In the "Normal" terminal invoke:
sh /mnt/out/etc/tee_install
See video 3:
<iframe width="560" height="315" src="https://www.youtube.com/embed/aGbqgz_e9Ec" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
At this point installation and startup process is complated and solution can be used.
-
Installing a key on secure storage
First step is to install the key on secure storage. Ideally this step is done by "Key provisioning server". Nevertheless, this PoC doesn't implement such server. Instead
admin_cli
can be used to install the key.In the "Normal" terminal, go to
/mnt/out/
and invokecd /mnt/out # ./admin_cli/admin_cli put www.test.com etc/ecdsa_256.key
This command installs private key for
www.test.com
. In the "Secure" terminal you should see a messageE/TA: install_key:156 Storing a key
. After this stepetc/ecdsa_256.key
can be removed.See video 4:
<iframe width="560" height="315" src="https://www.youtube.com/embed/__7WKvx8XxM" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe> -
Start a TLS server and perform TLS handshake:
With private key installed TLS server can be started. In the "Normal" terminal invoke
> cd /mnt/out > ./server/server
Server will start listening on
127.0.0.1:1443
. In the host machine one can try to connect to the TLS server:> cd OPTEE_DIR > ./projects/bssl/src/build.native/tool/bssl client -connect 127.0.0.1:1443 -server-name "www.test.com" Connecting to 127.0.0.1:1443 Connected. Version: TLSv1.3 Resumed session: no Cipher: TLS_AES_128_GCM_SHA256 ECDHE curve: X25519 Signature algorithm: ecdsa_secp256r1_sha256 Secure renegotiation: yes Extended master secret: yes Next protocol negotiated: ALPN protocol: OCSP staple: no SCT list: no Early data: no Cert subject: CN = www.dmv.com Cert issuer: C = FR, ST = PACA, L = Cagnes sur Mer, OU = Domain Control Validated SARL, CN = Domain Control Validated SARL
See video 5
<iframe width="560" height="315" src="https://www.youtube.com/embed/kRRl2zhbUqc" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe> -
Trial to access different domain fails as traffic-private-key is not available.
See video 6
<iframe width="560" height="315" src="https://www.youtube.com/embed/LBhllWcn4RY" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>)
First of all - key storage is bound to the device. In order to use stolen key for MITM, attacker needs to steal whole machine, which is much more difficult and easier to control. In order to implement such solution user doesn’t need expensive HSM, but it can simply use Intel with SGX and Asylo. Also it’s easy to imagine some extensions to this idea. For example instead of calling TEE each time for session signing during TLS handshake, one could imagine that solution can use “Delegated Credentials for TLS” (see https://tools.ietf.org/html/draft-rescorla-tls-subcerts-02). In this case TEE would be responsible for generating short lived certificates and TLS server would request such certificate every fixed amount of time (every few minutes). This idea could be combined with another – instead of storing traffic-private-key in multiple machines, one could imagine storing a key in some central location with more restricted access (but still in TEE). Combining those two ideas improves security of traffic-private-key storage without degrading time needed to perform TLS handshake. It must be noticed that “Delegated Credentials for TLS” are already implemented in BoringSSL.
- [C3] https://inthecloud.withgoogle.com/computing-challenge/register.html
- [RFC8446] https://tools.ietf.org/html/rfc8446
- [OP-TEE] https://www.op-tee.org/
- [OP-TEE-SRC] https://github.com/OP-TEE
- [OP-TEE-STORAGE] https://optee.readthedocs.io/architecture/secure_storage.html
- [BORING-BUILD] https://github.com/google/boringssl/blob/master/BUILDING.md