TLS: Memory corruption #1217

Closed · avbelov23 opened this issue Mar 21, 2019 · 6 comments

avbelov23 commented Mar 21, 2019

Tempesta config:

server 127.0.0.1:80; #nginx
listen 443 proto=https;
tls_certificate /home/avb/Projects/tempesta/tfw-root.crt;
tls_certificate_key /home/avb/Projects/tempesta/tfw-root.key;

Shell log:

$ cat /var/www/html/index.html 
<!DOCTYPE html>
<html>
<body>
Test
</body>
</html>
$ curl -k https://localhost
<!DOCTYPE html>
<html>
<body>
Test
</body>
</html>
$ cat /var/www/html/index.html 
�\>�(���e�����m�#����O̬�����r��(���t�����Zl���7�>+ɩ�����4�k�!�/$QS��N���
$ echo 3 | sudo tee /proc/sys/vm/drop_caches
3
$ cat /var/www/html/index.html 
<!DOCTYPE html>
<html>
<body>
Test
</body>
</html>
@avbelov23

Blame sendfile on; in the backend nginx config.

@avbelov23

It turns out that, because of the in-place TLS implementation, we encrypt data pages without allocating additional memory or copying, but since sendfile links the socket directly to the file cache, we end up overwriting the file cache itself.

If several clients request the same file, we overwrite it in several places at once and each client receives a uniquely corrupted file.

The problem can also be reproduced without sendfile: if the file is served from our web cache, the result is the same, because when sending from our cache we reuse the cache's memory pages.
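
For illustration, this is roughly what the kernel's do_tcp_sendpages() does on the sendfile() path (a simplified sketch of upstream kernel code, not Tempesta code): the page-cache page itself becomes an skb fragment and no data is copied, so encrypting that fragment in place rewrites the cached file.

/* Simplified sketch of the kernel's do_tcp_sendpages() fast path. */
get_page(page);                                 /* only a page reference is taken, no copy */
skb_fill_page_desc(skb, i, page, offset, copy); /* skb frag points into the page cache */
skb_shinfo(skb)->tx_flags |= SKBTX_SHARED_FRAG; /* the frag is still shared with its owner */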

@krizhanovsky

@avb good catch - this is a fundamental problem of our TLS encryption!

The fix should be straightforward enough - we need to copy the skb data before encryption if the skb is formed from splice() (sendfile() for the file cache or splice() for a user-space buffer, but the core is the same) or from our web cache. We do something similar in ss_send(). It's easy to do for our cache, but I'm not sure whether we know the nature of the skb data in tcp_write_xmit() - probably some kernel patching is required. However, maybe the SKBTX_SHARED_FRAG flag will help us, see do_tcp_sendpages().
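
A minimal sketch of that copy-before-encrypt idea, assuming a hypothetical helper (tfw_tls_skb_unshare() is an illustrative name, not the actual fix): if the skb carries shared fragments (SKBTX_SHARED_FRAG, set by do_tcp_sendpages()) or is cloned, pull its data into memory owned exclusively by the skb before encrypting in place; skb_cow_data() is the helper IPsec uses for the same purpose.

#include <linux/skbuff.h>

/* Hypothetical sketch only: make skb data safe for in-place encryption. */
static int tfw_tls_skb_unshare(struct sk_buff *skb)
{
	struct sk_buff *trailer;
	int r;

	/* do_tcp_sendpages() marks fragments that still belong to someone
	 * else (e.g. the page cache) with SKBTX_SHARED_FRAG.
	 */
	if (!(skb_shinfo(skb)->tx_flags & SKBTX_SHARED_FRAG) && !skb_cloned(skb))
		return 0;

	/* skb_cow_data() copies cloned/paged data into memory owned
	 * exclusively by this skb, as IPsec does before in-place crypto.
	 */
	r = skb_cow_data(skb, 0, &trailer);
	return r < 0 ? r : 0;
}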

@vankoven

Also note that only the payload part of a response served from the cache reuses pages; the headers are created from scratch for every response. The same can happen to sendfile() responses. But at the same time the backend server may have its own cache and use zero-copy TCP transmission, so it can get very tricky.

Probably the copying of skb data should be a configurable option, for example (see the sketch below):

  • avoid - copy only in the necessary cases, such as serving from splice()/webcache;
  • always - always create a copy of the skb data to be encrypted.
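
A hypothetical configuration syntax for such an option could look like the following (the tls_skb_copy directive name and its values are purely illustrative; no such option exists yet):

# Hypothetical directive, name and values are illustrative only.
tls_skb_copy avoid;     # copy only data coming from splice()/web cache
# tls_skb_copy always;  # alternatively: always copy skb data before encryption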

@krizhanovsky

We have #634 to reuse HTTP response headers kept in the cache. In the current kernel there is only one zero-copy transmission mechanism - splice, which is used by both sendfile() and vmsplice(). However, newer kernels introduce MSG_ZEROCOPY, so the final solution must consider MSG_ZEROCOPY and at least contain a TODO comment on what we need to do to move to the newer kernels.
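
For that TODO, a sketch under the assumption of a 4.14+ kernel, where MSG_ZEROCOPY pins user pages into skb fragments and skb_zcopy() detects such skbs (the helper name below is illustrative only):

#include <linux/skbuff.h>

/* TODO sketch: besides splice()/sendfile(), data sent with MSG_ZEROCOPY
 * also keeps user pages referenced from skb frags, so the same
 * copy-before-encrypt rule must apply to it.
 */
static bool tfw_tls_skb_needs_copy(struct sk_buff *skb)
{
	return skb_zcopy(skb)
	       || (skb_shinfo(skb)->tx_flags & SKBTX_SHARED_FRAG);
}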

@krizhanovsky

Please also don’t forget to backport the fix to 0.6.
