Rewrite assembler #5

gipsond · 2022-06-21T22:27:07Z

This is a rewrite of the assembler to use the chumsky parser combinator library (and its sister error presentation library, ariadne). The main feature it enables is that the parsing step implicitly records the span of each parse tree element as a Range<usize>, instead of storing &str slices. This removes the need for explicit lifetime management which I expect would have made the old assembler difficult to maintain.

The design of this rewrite differs in a few other key ways:

The top-level API is more specialized to the assembler binary and TUI use cases.
It uses traditional compiler modules; the input is passed, in order, through: lexer, parser, parse tree analysis, assembly, and linking.
- Notably, analyzing and generating errors happens primarily on the parse tree in one step, rather than during a series of steps which combine parsing, assembly, and error analysis.
- I believe this has greatly improved maintainability and extensibility.
Almost all errors are explicitly returned in Results, rather than panicking.
- As far as I can tell, the assembler will not panic unless a very unusual input is given.

This rewrite also includes automated tests for error cases, something the original lacked.

There is at least one significant regression: some errors do not provide as much specific information, particularly lexical errors. For example, if a source includes the invalid operand #OOPS, the current assembler would point to OOPS as an invalid decimal number, whereas this rewrite will simply indicate #OOPS is an invalid token, but not why. This rewrite does indicate all of the same errors, but not with as much specificity in all cases, though this can be improved with future work.

… using module name to disambiguate

… IR5

… NOT, STR, and TRAP

…tests individually

…oke 100! 😁😬)

.STRINGZ)

…very

gipsond added 30 commits April 23, 2020 19:58

assembler: start reorganizing

ce1b521

assembler: add LC-3 artifacts to gitignore

d6a5fc3

assembler: make types in each IR have same name for similar concepts,…

378ca71

… using module name to disambiguate

assembler: split expanded.rs into appropriate modules

5ecf289

assembler: rename IR modules and their respective parse functions

075f0e1

!BROKEN! assembler: rewrite memory placement, symbol table to work on…

ccc36d2

… IR5

trash

5b22b25

assembler: finish first pass at more complete CST

4b62c36

assembler: update error extraction

6c6436d

assembler: add validate analysis function

a8cbfc7

assembler: add assemble method to complete::Program

a3df64f

assembler: add messages, annotations for new errors (BUILD FIXED)

3d056a5

assembler: add some query methods to complete::Program CST

c175162

assembler: add single instruction integ tests for ADD, AND

5509870

assembler: add single-instruction tests for JMP, JSRR, RTI, RET, LDR,…

8347a2d

… NOT, STR, and TRAP

assembler: add named trap tests, add macro to run single-instruction …

5bae064

…tests individually

assembler: add BR, LD, LDI, ST, STI single-instruction tests (just br…

1a19e30

…oke 100! 😁😬)

assembler: add LEA, JSR tests; adjust reg/pcoffset9 macro

a3e4bc8

assembler: tweak reg/pcoffset9 macro to remove boilerplate from uses

928bd71

assembler: finish remaining single instruction tests (.FILL, .BLKW,

6c18893

.STRINGZ)

assembler: remove stale integ tests, add alternative style tests

ff3833d

misc: update UTP dependencies

c9d2925

assembler: try chumsky lexer (untested)

02d283c

assembler: add chumsky instruction parser (untested)

84fd53e

assembler: add full file chumsky parser (untested)

0ffcb8f

assembler: move new lexer, parser into separate files

35e9273

assembler: switch integration tests to new parser

9b389a9

assembler: remove original assembler

b21f196

assembler: allow assembling onto OS image again

aab715a

assembler: assemble all objects in given file (no overlap check)

418c328

gipsond added 11 commits June 15, 2022 23:50

assembler: rename test macros, add basic label reference tests

58c579f

assembler: add multiple error tests, improve instruction parsing reco…

ee7d7f0

…very

assembler: replace 'program' with 'region' in abstract syntax

9350087

assembler: move error data and functions to new modules

3779e57

assembler: rename modules after core functions

9c8396f

assembler: split link and layer steps

86153ad

assembler: remove unwrap calls from binary, unreachable calls from lib

8c8895c

assembler: remove unused code, move WithErrData to top level

a2eff30

assembler: present source path in error messages

48a59ad

assembler: remove unused dependencies

dc99108

assembler: add method to return String error report

e590dd7

gipsond requested a review from rrbutani June 21, 2022 22:27

Merge branch 'master' into chumsky

7064a4a

gipsond marked this pull request as ready for review June 21, 2022 22:40

rrbutani added 2 commits June 21, 2022 17:51

Merge branch 'master' into chumsky

524c854

misc: bump the MSRV to 1.56, use edition 2021

50c342f

rrbutani force-pushed the chumsky branch from 5c945c4 to 50c342f Compare June 22, 2022 03:42

gipsond added 13 commits June 22, 2022 02:15

assembler: deduplicate some assembly code

0b9b55c

assembler: document top level of library

262d192

assembler: rename link::LinkedRegion to link::Block

89b5bac

assembler: rename 'region' to 'block' or 'program block'

786c569

assembler: document the lex module

0479288

assembler: document the parse module

12cc003

assembler: allow multiple semantic analysis visitors in one pass

05adfd4

assembler: document the analyze module

f25a50d

assembler: complete docs for all public API

b77ca8a

assembler: document how to use analyze::Visit

a897466

assembler: if strict, require BR cond codes in nzp order

ef1fee1

assembler: disallow hanging labels in strict mode

b3dd742

assembler: add strict label restrictions, document all restrictions

29749ec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rewrite assembler #5

Rewrite assembler #5

Uh oh!

gipsond commented Jun 21, 2022

Uh oh!

Uh oh!

Rewrite assembler #5

Are you sure you want to change the base?

Rewrite assembler #5

Uh oh!

Conversation

gipsond commented Jun 21, 2022

Uh oh!

Uh oh!