feat(test): test opcode programs in different scenarios #808

winsvega · 2024-09-16T10:28:23Z

🗒️ Description

Conversion of the "opcode diff places" tests by ori.
The test defines a series of scenarios that are run against each parametrized opcode sequence.
We then check that the sequence behaved as expected in each scenario.

I think this is a powerful way to template-test any new opcode.
We already have predefined scenarios, so we just add one more parameter with what we want to test, and it will be covered in all the cases automatically.

Cases can be like:
callcode -> staticcall -> [opcode]
create2 -> [opcode]
[opcode] -> revert

Check it out: we can define opcode programs and scenarios. The test then puts each opcode program in each scenario and verifies that its result is the same (the expected result may be complex, depending on context and fork).

The idea so far is to have a template test, so that we can simply add opcode programs and they will be run in all the crazy combinations,
like call, delegatecall, suicide, revert and so on.
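
The template idea can be sketched in plain Python as a cross product: every opcode program is paired with every scenario, so adding one program adds one case per scenario. The names below (`Program`, `Scenario`, `build_cases`) and the placeholder bytecode are illustrative assumptions, not the identifiers used in this PR.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Program:
    """Hypothetical opcode program: a name plus the bytecode under test."""

    name: str
    bytecode: bytes


@dataclass(frozen=True)
class Scenario:
    """Hypothetical scenario: names the call chain that wraps the program."""

    name: str


def build_cases(programs, scenarios):
    """Pair every program with every scenario and give each pair a test id."""
    return [(f"{s.name}__{p.name}", s, p) for s, p in product(scenarios, programs)]


# Placeholder data; the bytecode values are not real test programs.
programs = [Program("program_a", b"\x00"), Program("program_b", b"\x00")]
scenarios = [
    Scenario("callcode_staticcall"),
    Scenario("create2"),
    Scenario("revert"),
]

cases = build_cases(programs, scenarios)
for case_id, _, _ in cases:
    print(case_id)
```

Adding a third entry to `programs` would grow the case list from six to nine without touching the scenarios.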

This is still WIP.

🔗 Related Issues

#184

✅ Checklist

All: Set appropriate labels for the changes.
All: Considered squashing commits to improve commit history.
All: Added an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
Tests: All converted JSON/YML tests from ethereum/tests have been added to converted-ethereum-tests.txt.
Tests: A PR with removal of converted JSON/YML tests from ethereum/tests has been opened.
Tests: Included the type and version of evm t8n tool used to locally execute test cases: e.g., ref with commit hash or geth 1.13.1-stable-3f40e65.
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.

chfast

Can you give it a better name? The name "opcode diff places" has no meaning.

winsvega · 2024-09-17T10:57:50Z

It stands for "opcode in different (logical) places".

marioevz

A few suggestions after a quick review. I haven't checked the files in the ./scenarios/scenarios/ folder, but will do so on the re-review.

tests/frontier/scenarios/test_scenarios.py

tests/frontier/scenarios/common.py

tests/frontier/scenarios/programs/all_frontier_opcodes.py

marioevz

More comments.

I feel like the programs should be dynamic based on the fork, and the approach could be improved in the sense that the program could be a pytest.fixture that takes the fork as input and then resolves to the actual bytecode.

See the comment in the invalid_opcodes.py file regarding the list of invalid opcodes: since that list is going to change frequently depending on the fork, it is going to be a pain to maintain, but pytest gives us the tools to make the maintenance more automatic and seamless.
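
One way to read this suggestion, as a rough sketch: a program becomes a function from fork to bytecode instead of a fixed byte string. In pytest that resolution step could live in a fixture that requests `fork`; pytest is left out here so the sketch stays self-contained. The `Fork` class, its `supports_push0` flag, and the `PROGRAMS` registry are hypothetical stand-ins, not the framework's API.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class Fork:
    """Hypothetical stand-in for the framework's fork object."""

    name: str
    supports_push0: bool


# A "program" is a function from fork to bytecode rather than fixed bytes.
ProgramFactory = Callable[[Fork], bytes]


def push_zero_program(fork: Fork) -> bytes:
    """Push zero onto the stack using an encoding the fork supports."""
    if fork.supports_push0:
        return bytes([0x5F])  # PUSH0
    return bytes([0x60, 0x00])  # PUSH1 0x00


# Hypothetical registry of named program factories.
PROGRAMS: Dict[str, ProgramFactory] = {"push_zero": push_zero_program}


def resolve_program(name: str, fork: Fork) -> bytes:
    """Resolve a named program to concrete bytecode for the given fork."""
    return PROGRAMS[name](fork)


frontier = Fork("Frontier", supports_push0=False)
shanghai = Fork("Shanghai", supports_push0=True)
print(resolve_program("push_zero", frontier).hex())
print(resolve_program("push_zero", shanghai).hex())
```

The scenarios would still receive plain bytes; only the point at which those bytes are produced moves to a place where the fork is known.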

src/ethereum_test_forks/forks/forks.py

tests/frontier/scenarios/programs/invalid_opcodes.py

marioevz · 2025-01-10T22:16:58Z

tests/frontier/scenarios/test_scenarios.py

+    combinations_input: ScenarioGeneratorInput = ScenarioGeneratorInput(
+        fork=fork,
+        pre=pre,
+        operation_code=operation,
+        external_address=external_address,
+        external_balance=external_balance,
+    )
+
+    call_combinations = ScenariosCallCombinations(combinations_input).generate()
+    for combination in call_combinations:
+        if not debug.scenario_name or combination.name == debug.scenario_name:
+            scenarios_list.append(combination)
+
+    call_combinations = scenarios_create_combinations(combinations_input)
+    for combination in call_combinations:
+        if not debug.scenario_name or combination.name == debug.scenario_name:
+            scenarios_list.append(combination)
+
+    revert_combinations = scenarios_revert_combinations(combinations_input)
+    for combination in revert_combinations:
+        if not debug.scenario_name or combination.name == debug.scenario_name:
+            scenarios_list.append(combination)


I think having the inputs as a separate structure that gets used in three different places is not a great pattern.

It seems to me like we should have a single class that can generate all types of combinations:

    combinations_generator: ScenarioGenerator = ScenarioGenerator(
        fork=fork,
        pre=pre,
        operation_code=operation,
        external_address=external_address,
        external_balance=external_balance,
    )
    for combination in combinations_generator.generate_call_combinations():
        if not debug.scenario_name or combination.name == debug.scenario_name:
            scenarios_list.append(combination)
    for combination in combinations_generator.generate_create_combinations():
        if not debug.scenario_name or combination.name == debug.scenario_name:
            scenarios_list.append(combination)
    for combination in combinations_generator.generate_revert_combinations():
        if not debug.scenario_name or combination.name == debug.scenario_name:
            scenarios_list.append(combination)

Here generate_X is dynamic.
Ideally we would iterate over a list of functions and call each one with the environment conditions.
Adding a new scenario generator then just means adding a function to that list.
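
The "list of functions" idea could look roughly like the sketch below. The registry, the decorator, the `env` dictionary, and the scenario names are all hypothetical; the real generators in this PR take a structured input and return scenario objects rather than strings.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical generator signature: environment in, scenario names out.
ScenarioGeneratorFn = Callable[[Dict[str, object]], List[str]]

SCENARIO_GENERATORS: List[ScenarioGeneratorFn] = []


def scenario_generator(fn: ScenarioGeneratorFn) -> ScenarioGeneratorFn:
    """Register a generator so the collection loop picks it up automatically."""
    SCENARIO_GENERATORS.append(fn)
    return fn


@scenario_generator
def call_combinations(env: Dict[str, object]) -> List[str]:
    return ["scenario_CALL_CALL", "scenario_CALL_DELEGATECALL"]


@scenario_generator
def create_combinations(env: Dict[str, object]) -> List[str]:
    return ["scenario_CREATE2"]


@scenario_generator
def revert_combinations(env: Dict[str, object]) -> List[str]:
    return ["scenario_REVERT"]


def collect_scenarios(env: Dict[str, object], only: Optional[str] = None) -> List[str]:
    """Run every registered generator, optionally keeping a single scenario."""
    selected = []
    for generate in SCENARIO_GENERATORS:
        for name in generate(env):
            if not only or name == only:
                selected.append(name)
    return selected


env = {"fork": "Frontier"}  # placeholder environment
print(collect_scenarios(env))
print(collect_scenarios(env, only="scenario_REVERT"))
```

With this shape the debug filter is written once, and a new generator only needs the decorator to take part in the test.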

tests/frontier/scenarios/scenarios/call_combinations.py

marioevz · 2025-01-10T22:56:23Z

tests/frontier/scenarios/scenarios/call_combinations.py

+        root_contract_balance = 105
+        scenario_contract_balance = 107
+        sub_contract_balance = 111
+        program_selfbalance = 113


This specific property, program_selfbalance, might be better off as a property of the operation.

Here I manually calculate what the selfbalance is supposed to be in this context and pass it as the expected result to the verifier. If the program returns a different value when checking selfbalance, there will be an error.
This is equivalent to a post-state definition.
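
A minimal sketch of that expected-value check, assuming a hypothetical `ExpectedContext` container and a `verify` helper (neither name comes from the PR). The value 113 mirrors `program_selfbalance` in the snippet above; how the real verifier reads the observed value is not shown here.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ExpectedContext:
    """Hypothetical container for values the program should observe."""

    selfbalance: int


def verify(expected: ExpectedContext, observed: Dict[str, int]) -> List[str]:
    """Return a list of mismatches between expected and observed values."""
    errors = []
    actual = observed.get("selfbalance")
    if actual != expected.selfbalance:
        errors.append(f"selfbalance: expected {expected.selfbalance}, got {actual}")
    return errors


# The scenario computes the expected value up front ...
expected = ExpectedContext(selfbalance=113)

# ... and the value reported by the program is compared against it.
print(verify(expected, {"selfbalance": 113}))
print(verify(expected, {"selfbalance": 111}))
```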

tests/frontier/scenarios/scenarios/call_combinations.py

marioevz · 2025-01-10T23:31:08Z

tests/frontier/scenarios/test_scenarios.py

+        program_suicide,
+    ],
+)
+def test_scenarios(


Suggested change

-def test_scenarios(
+@pytest.mark.execute(pytest.mark.skip(reason="gas usage"))
+def test_scenarios(

This test has to be skipped in execute mode because it uses 500 million gas.

It should not be that much; I need to correct it.

I could make a test per scenario. Right now all scenarios are combined in one test for each program.
That would be a lot of test files though, but easier to debug.

marioevz · 2025-01-10T23:33:14Z

tests/frontier/scenarios/test_scenarios.py

+        gas_limit=500_000_000,
+        gas_price=tx_gasprice,
+        to=runner_contract,
+        data=b"0x11223344",


I think this is what you meant:

Suggested change

-        data=b"0x11223344",
+        data=b"\x11\x22\x33\x44",

The current one creates a byte array containing the ASCII encoding of the string "0x11223344".

Is there a handy constructor like BYTES("0x11223344")?

bytes.fromhex(), example here: #1067 (comment).

I see there is some confusion here:
bytes.fromhex("11223344")
does not allow the 0x prefix, so the value can be mistaken for a decimal number.
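
The difference between the three spellings discussed in this thread can be checked directly in Python. The `hex_to_bytes` helper at the end is a hypothetical convenience, not something provided by the framework.

```python
ascii_literal = b"0x11223344"  # ten ASCII characters, not four bytes
escaped_literal = b"\x11\x22\x33\x44"  # the four intended bytes
from_hex = bytes.fromhex("11223344")  # same four bytes, built from hex digits

print(len(ascii_literal), len(escaped_literal))
print(ascii_literal.hex())
print(from_hex == escaped_literal)

# bytes.fromhex() rejects the "0x" prefix with a ValueError.
try:
    bytes.fromhex("0x11223344")
    prefix_accepted = True
except ValueError:
    prefix_accepted = False
print(prefix_accepted)


def hex_to_bytes(value: str) -> bytes:
    """Hypothetical helper: accept hex strings with or without a 0x prefix."""
    if value[:2] in ("0x", "0X"):
        value = value[2:]
    return bytes.fromhex(value)


print(hex_to_bytes("0x11223344") == escaped_literal)
```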

marioevz · 2025-01-10T23:37:00Z

tests/frontier/scenarios/programs/invalid_opcodes.py

+invalid_opcode_ranges = [
+    range(0x0C, 0x10),
+    range(0x1E, 0x20),
+    range(0x21, 0x30),
+    range(0x4B, 0x50),
+    range(0xA5, 0xF0),
+    range(0xF6, 0xFA),
+    range(0xFB, 0xFD),
+    range(0xFE, 0xFF),
+]


I feel like this should not be a static list, and rather should be derived from the fork properties.

Yes, this can be built depending on the fork.

Ah, but the programs do not know about the fork. I would have to refactor all programs to take the fork as an argument.

Hm, I can make this a predeployed contract in the scenarios if you are that concerned. But making programs fixtures that accept a fork is a rather difficult design decision: programs are intended as byte sequences that are deployed in different scenarios and tested.
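
Deriving the list instead of hard-coding it could be as simple as taking the complement of the opcodes a fork defines. In the sketch below the static ranges are copied from the snippet above; `invalid_opcodes` and the "assumed valid" sets are illustrative assumptions, since the real source of valid opcodes would be the fork object.

```python
from itertools import chain

# The static ranges from the snippet above, flattened into a set.
STATIC_INVALID_RANGES = [
    range(0x0C, 0x10),
    range(0x1E, 0x20),
    range(0x21, 0x30),
    range(0x4B, 0x50),
    range(0xA5, 0xF0),
    range(0xF6, 0xFA),
    range(0xFB, 0xFD),
    range(0xFE, 0xFF),
]
static_invalid = set(chain.from_iterable(STATIC_INVALID_RANGES))


def invalid_opcodes(valid_opcodes):
    """Every single-byte value that is not in the given set of valid opcodes."""
    return set(range(0x100)) - set(valid_opcodes)


# Stand-in for a fork-provided set, reconstructed here from the static list.
assumed_valid = set(range(0x100)) - static_invalid

# A hypothetical later fork that assigns a meaning to byte 0x0C.
assumed_valid_later = assumed_valid | {0x0C}

print(len(invalid_opcodes(assumed_valid)))
print(0x0C in invalid_opcodes(assumed_valid_later))
```

The static list then never needs editing: when a fork defines a new opcode, that byte drops out of the derived set on its own.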

winsvega · 2025-01-12T11:04:01Z

pytest.fixture

Can you explain this part a little?
Currently the program is just bytecode that is deployed and executed at a dynamic address created by the scenario. Depending on the scenario, this address, or rather the execution context, is prepared and executed. This gives us the following model: define bytecodeX and execute it in different contextsY.

winsvega marked this pull request as draft September 16, 2024 10:28

winsvega force-pushed the dailytest branch from 1761a66 to dc4f466 Compare September 17, 2024 09:50

chfast reviewed Sep 17, 2024

winsvega closed this Sep 20, 2024

winsvega force-pushed the dailytest branch from dc4f466 to 2f2d356 Compare September 20, 2024 10:50

winsvega reopened this Sep 20, 2024

winsvega force-pushed the dailytest branch from b64a725 to 11adfb6 Compare September 20, 2024 12:06

winsvega changed the title ~~feat(test): opcode diff places test~~ feat(test): test opcode programs in different scenarios Sep 20, 2024

winsvega force-pushed the dailytest branch from 11adfb6 to 2729f06 Compare September 20, 2024 12:10

winsvega requested a review from marioevz September 20, 2024 13:37

marioevz reviewed Sep 20, 2024

winsvega force-pushed the dailytest branch 2 times, most recently from b1af590 to ff2eeff Compare September 24, 2024 13:13

winsvega force-pushed the dailytest branch 2 times, most recently from b28f2ef to f7de36c Compare October 17, 2024 10:02

winsvega force-pushed the dailytest branch from f7de36c to e2cf609 Compare October 28, 2024 11:24

winsvega added scope:pytest Scope: Pytest plugins type:feat type: Feature labels Oct 28, 2024

winsvega force-pushed the dailytest branch 11 times, most recently from 5d3dd84 to 940cb3e Compare October 31, 2024 14:12

winsvega force-pushed the dailytest branch 3 times, most recently from 57c4bd1 to 7b44f08 Compare November 12, 2024 11:03

danceratopz assigned winsvega Nov 19, 2024

danceratopz mentioned this pull request Nov 19, 2024

impose test format verification after fill #940

Open

danceratopz self-requested a review November 19, 2024 13:38

winsvega force-pushed the dailytest branch from 7b44f08 to 1e1ac77 Compare December 3, 2024 12:43

winsvega marked this pull request as ready for review December 12, 2024 09:59

winsvega mentioned this pull request Dec 17, 2024

EIP7702 Test Ideas #952

Open

22 tasks

test scenarios

c0284ac

winsvega force-pushed the dailytest branch from 1e1ac77 to c0284ac Compare January 10, 2025 11:04

format with ruff

12e6874

winsvega force-pushed the dailytest branch from 027bcd7 to 12e6874 Compare January 10, 2025 12:34

marioevz reviewed Jan 10, 2025

address some comments

cb02788

winsvega force-pushed the dailytest branch from d644de9 to cb02788 Compare January 15, 2025 08:54


