Fix Github Actions #834

andarut · 2023-05-25T09:45:20Z

The problem

GitHub Actions fails randomly. On my fork I reran all jobs on the same code many times, and the outcome changed from run to run.

I think it is very important to fix the Actions workflow: a branch cannot be merged until all checks pass, and a PR with failing checks cannot be trusted. Take my previous PR, "Fix build C libraries inside kphp": @Danil42Russia reran the checks on it 3 times and none of the runs succeeded. So a branch with a useful fix cannot be merged purely because of randomness. The failures are not caused by the code in the PR; they come from randomly failing tests. You can check my fork for more evidence (I also tested another PR of mine, "Add mbstring functions to kphp"): my fork.

The path and fix

Linux build

There are 5 randomly failing Python tests (timeout error):

test_define_from_config
test_script_errors
test_headers_limit
test_post_limit
test_query_limit

The cause is that the latest version of requests does not support urllib3 2.0.
The failure is triggered by versions of requests-toolbelt and urllib3 that were both released in the past few weeks (May 1 and May 4, 2023), and it is already reported as an issue in requests-toolbelt. It comes down to a urllib3 2.0 bug: when a request fails, urllib3 keeps a reference to the exception so that it can include it in the MaxRetryError it will eventually raise. Because of that reference, the garbage collector does not close the socket on its own, so the connection stays open. When urllib3 retries, it cannot open a new connection, because the server is still waiting for the previous attempt to finish. That's a deadlock!

Note! I tried rewriting the send_http_request function in tests/python/lib/http_server.py with urllib requests. That is the wrong way: urllib works in a different style. Downgrading urllib3 is much safer.
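As a rough illustration of the downgrade approach, the test harness could check which urllib3 series is installed before the HTTP tests run. This is only a sketch: the helper names `urllib3_is_pre_2` and `installed_urllib3_is_pre_2` are hypothetical and are not part of this PR, which pins package versions rather than checking them at runtime.

```python
from typing import Optional


def major_version(version: str) -> int:
    """Return the leading numeric component of a version string such as '1.26.15'."""
    return int(version.split(".", 1)[0])


def urllib3_is_pre_2(version: str) -> bool:
    """True when the given urllib3 version is older than the 2.0 series."""
    return major_version(version) < 2


def installed_urllib3_is_pre_2() -> Optional[bool]:
    """Check the installed urllib3, or return None when it is not installed."""
    try:
        import urllib3  # third-party; may be absent in a bare environment
    except ImportError:
        return None
    return urllib3_is_pre_2(urllib3.__version__)


if __name__ == "__main__":
    # Hypothetical use: warn early instead of waiting for a test timeout.
    if installed_urllib3_is_pre_2() is False:
        print("urllib3 >= 2.0 is installed; the HTTP tests may hang on retries")
```

A check like this only makes the failure mode visible; pinning the versions in the requirements file is still what keeps the 2.0 series from being installed.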

One more test fails randomly with RuntimeError: Got bad stat line:

test_store_fetch_delete

To fix this, I changed tests/python/lib/stats_receiver.py so that an incomplete stats key is simply skipped when we receive it. This is correct because, after the test fails, the logs show that the stats file is complete and valid. So sometimes the engine does not manage to write the full line by the time the test reads it. The old code already has a timeout function for this case (wait_next_stats), so I think that when we are waiting for updated stats and get an incomplete key, we must not throw a RuntimeError. Likewise, a ValueError while unpacking a key is also a reason to wait for the engine to finish writing.
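The idea of skipping incomplete lines can be sketched as follows. This is not the code from stats_receiver.py: the `key:value|type` line format and the function names `parse_stat_line` and `parse_stats` are assumptions made for illustration only.

```python
from typing import Dict, Optional, Tuple


def parse_stat_line(line: str) -> Optional[Tuple[str, float]]:
    """Return (key, value), or None when the line is incomplete.

    Assumed (hypothetical) format: 'key:value|type'.
    """
    try:
        key, rest = line.split(":", 1)         # ValueError if ':' is missing
        raw_value, _kind = rest.split("|", 1)  # ValueError if '|' is missing
        return key, float(raw_value)           # ValueError if the value is cut off
    except ValueError:
        return None


def parse_stats(text: str) -> Dict[str, float]:
    """Collect every complete line and silently skip partial ones."""
    stats: Dict[str, float] = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        parsed = parse_stat_line(line)
        if parsed is None:
            # The writer has probably not finished this line yet; the caller
            # is expected to read again until its own timeout expires.
            continue
        key, value = parsed
        stats[key] = value
    return stats
```

With this shape, a truncated last line no longer aborts the test; the surrounding wait loop simply reads the file again on its next iteration.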

And one test fails randomly because of a timeout:

test_job_stack_overflow_error

The test was deleted because it does not work correctly with ASAN. ASAN randomly shuts down the worker when it hits the stack overflow error, so the test times out.

MacOS build

There are 3 failing C++ tests:

counter_test
parallel_limit_counter
parallel_counter

All of these tests deal with threads. I think that slow syscalls might cause other threads to be spawned to take over. The default GitHub Actions macOS runner is a 3-core machine, so maybe the efficiency cores are too slow to take over and kphp decides that the syscall is taking too long. So I disabled these tests, the same way as maximum-test.cpp (which also deals with threads).

Result


.github/workflows/Build.yml

Danil42Russia

I am approving this ONLY so that the tests can run.

This does not mean that this PR is ready to be merged into master.

Tests passed


DrDet · 2023-07-14T09:51:16Z

tests/python/tests/job_workers/test_job_errors.py

+    # def test_job_stack_overflow_error(self):
+        # error_code = self.JOB_STACK_OVERFLOW_ERROR
+        # data = [[1, 2, 3, 4], [7, 9, 12]]
+        # buffers = 4
+        # stats_before = self.kphp_server.get_stats()
+        # resp = self.kphp_server.http_post(


Maybe we should remove this completely?

andarut added 9 commits April 8, 2023 02:44

fix issue

cb60059

move kphp-timelib installation to the top and add explanatory comment

7e137cb

fix gh for mac os, try fix for linux without changing version of urllib

8ffd6b6

fix urllib and requests-toolbelt versions

3f12034

try different config

f5a7642

try fix bad stats line

65cd7e6

try fix bad stats line

7047c35

try fix bad stats line

7ab113e

fix stackoverflow error handle

3c439c1

andarut force-pushed the fix branch from 3c439c1 to 7e137cb Compare May 25, 2023 12:33

andarut added 2 commits May 25, 2023 15:59

Revert "move kphp-timelib installation to the top and add explanatory…

cd0d57f

… comment" This reverts commit 7e137cb.

debug stats file lines

b2320f2

troy4eg self-requested a review May 25, 2023 18:43

Danil42Russia added the refactoring Logic and code style improvements label May 25, 2023

andarut and others added 6 commits May 26, 2023 03:26

Revert "fix issue"

f8257e3

This reverts commit cb60059.

len of stats file always lower than older stats

4770c1d

install from requirements

6df0401

const python package versions

af5a260

fix pytest-mysql version

138435c

Delete test.txt

cd2ec49

DrDet self-requested a review June 1, 2023 16:29

Danil42Russia removed the refactoring Logic and code style improvements label Jun 2, 2023

Danil42Russia reviewed Jun 2, 2023


.github/workflows/Build.yml (outdated)

andarut and others added 3 commits June 2, 2023 14:12

try macos target with 12 cores, to check for slow syscalls

ee0e2a7

Merge branch 'fix' of https://github.com/andreylzmw/kphp into fix

2238261

Merge branch 'master' into fix

5211b1a

Danil42Russia previously approved these changes Jun 2, 2023


Danil42Russia self-requested a review June 2, 2023 11:36

test failed unit test on 3-core github actions runner, but now we are…

7e711f7

… on 12-cores

try with python3.7

38b643f

andarut force-pushed the fix branch 13 times, most recently from 225097d to 4aa15cd Compare July 12, 2023 20:15

run python tests with python 3.7

e5fcbc2

andarut force-pushed the fix branch from 4aa15cd to e5fcbc2 Compare July 12, 2023 21:15

try with python > 3.7

2b79ab8

andarut force-pushed the fix branch from 2755c8f to 2b79ab8 Compare July 13, 2023 13:07

return to python 3.7

99b2d39

andarut force-pushed the fix branch from 45e9cca to 99b2d39 Compare July 13, 2023 14:56

andarut added 2 commits July 13, 2023 21:57

remove test

a84c928

Merge branch 'master' of https://github.com/VKCOM/kphp

ac5fed1

DrDet reviewed Jul 14, 2023


andarut added 3 commits July 14, 2023 14:59

Merge branch 'master' into fix

8ed31d7

remove test

b1368f9

profiler back to 2

2dc908c

andarut requested a review from DrDet July 14, 2023 15:50

DrDet approved these changes Jul 14, 2023


andarut merged commit 60656e1 into VKCOM:master Jul 14, 2023

andarut added this to the next milestone Jul 22, 2023
