Basic multiline highlighting #13

sungshik · 2024-08-20T13:29:13Z

This PR adds basic support for multiline highlighting:

The analysis stage of the conversion has remained roughly the same (i.e., most information was already available).
Most changes are in the transformation stage, which now cleanly separates the generation of "inner rules" (which detect the internal content of a production) and "outer rules" (which detect the external context of a production). See the comments in the new module ConversionUnit for more details.

All tests are updated accordingly as well (and some new ones were added).
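As general background on why multi-line support needs a different kind of rule, here is a minimal sketch of the two rule shapes that TextMate grammars offer: a `match` rule, which is applied to one line at a time, and a `begin`/`end` rule, which can span lines and carries nested `patterns`. The helper names and the example scopes are hypothetical and are not taken from this PR. How ConversionUnit's inner and outer rules map onto these shapes is described in that module's comments, not here.

```python
import json


def single_line_rule(name, regex):
    # Hypothetical helper: a `match` rule is tested against one line at a time.
    return {"name": name, "match": regex}


def multi_line_rule(name, begin, end, inner):
    # Hypothetical helper: a `begin`/`end` rule stays active across lines
    # until `end` matches; `patterns` are applied to the content in between.
    return {"name": name, "begin": begin, "end": end, "patterns": inner}


# Illustrative scopes only; not the categories generated by this PR.
escape = single_line_rule("constant.character.escape", r"\\.")
string = multi_line_rule("string.quoted.double", '"', '"', [escape])

print(json.dumps(string, indent=2))
```

The nested `patterns` list is what lets the content between `begin` and `end` be highlighted separately from the delimiters.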


sungshik

A few additional clarifying comments

rascal-textmate-core/src/main/rascal/VSCode.rsc

rascal-textmate-core/src/main/rascal/lang/textmate/Conversion.rsc

toinehartman

That's a lot of work again! I like the readability of the code and the usage of a lot of utility functions. To me, this helps a lot with readability.
Good choice to split out some of the logic to separate modules.

The tests are hard for me to review, but they at least look like they cover a lot of cases.

rascal-textmate-core/src/main/rascal/lang/rascal/grammar/Util.rsc

rascal-textmate-core/src/main/rascal/lang/textmate/Conversion.rsc

DavyLandman · 2024-08-22T12:54:36Z

vscode-extension/syntaxes/rascal.tmLanguage.json

+          "name": "constant.regexp"
+        },
+        "2": {
+          "name": ""


@sungshik is this a bug?

Nope, but it's not particularly elegant either 😉 Some generated regexes have groups that are used only internally for matching, without a category. These show up here as empty strings. This has no impact on the highlighting, but I agree it would be nicer to not have these degenerate entries in the captures map at all. It requires the addition of just a simple non-emptiness check. I've fixed that now.
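The non-emptiness check mentioned above can be sketched as follows. This is an illustration in Python under assumed names (`build_captures` and its list-of-categories input are hypothetical); the actual fix lives in the Rascal code of Conversion.rsc and may be structured differently.

```python
def build_captures(categories):
    """Map 1-based group numbers to {"name": category}.

    Groups without a category (empty string) are left out of the map,
    so no degenerate {"name": ""} entries are generated.
    """
    return {
        str(index): {"name": category}
        for index, category in enumerate(categories, start=1)
        if category  # the non-emptiness check
    }


# Group 2 is used only internally for matching and has no category.
captures = build_captures(["constant.regexp", ""])
print(captures)
```

Note that the remaining entries keep their original group numbers, so skipping a group does not shift the numbering of later groups.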

rascal-textmate-core/src/main/rascal/lang/textmate/Conversion.rsc


sungshik · 2024-08-23T10:03:29Z

Thank you, Toine, Davy, and Pieter for the comments 🙂

sungshik added 23 commits August 13, 2024 15:32

Add utility function to compute the terminals that occur in a production

cb96022

Add utility function to strip a symbol from outer ? and * operators

ad369c1

Add/update utility function(s) to process prefix literals

87bebae

Update category mapping for Rascal's own grammar to be consistent with the latest version of the semantic tokenizer

54d94b7

Refine the interface to generate regular expressions

19788e3

Add simple (for now) grammar preprocessor to the conversion pipeline

f30737d

Refactor sorting function for conversion units (and add special cases for synthetic names)

6498255

Update ConversionUnit type with kinds (single/multi-line), names, and inner/outer rules

400688b

Note: This changes the nature of conversion units from types to only read from, to types to also write to.

Revert simplification to computation of prodsDelimiters

74bf2ec

Update private functions to build TmRule values

4aaac7f

Rewrite transformation code to generate inner/outer rules and support multi-line productions

2fe29cc

Update tests

8f795b9

Update generated TextMate grammar for Rascal's own grammar

6f57aee

Merge branch 'main' into basic-multiline-highlighting

0665942

Add type annotations to function signature (toRegExp)

5e3e20b

Remove old comments from test code

7381584

Move ConversionUnit and conversion constants to new modules

f2a8538

Move addition of names to units a separate function

d275722

Refactor generation of inner rules and outer rules

2ec51c5

Fix documentation in a few places

784b20e

Fix tests

be7100d

Rename a variable and add comments to clarify its purpose

7146374

Update generated TextMate grammar for Rascal's own grammar

bcaa1f7

sungshik commented Aug 21, 2024


sungshik marked this pull request as ready for review August 21, 2024 11:21

Fix a few typos in comments

04aa8cf

toinehartman approved these changes Aug 21, 2024


DavyLandman reviewed Aug 22, 2024


PieterOlivier reviewed Aug 23, 2024


rascal-textmate-core/src/main/rascal/lang/textmate/Conversion.rsc

Rename filter* to retain* to avoid confusion

cf66e51

sungshik added 3 commits August 23, 2024 11:42

Simplify preprocessing code for grammars

a7080df

Add comment to explain function insertIn

8c54c6a

Avoid generation of degenerate entries (empty names) in the `captures` map

ab777ad

sungshik added 2 commits August 23, 2024 12:31

Merge branch 'main' into basic-multiline-highlighting

afc0467

Update generated TextMate grammar for Rascal's own grammar

46e8198

sungshik merged commit 1a3b465 into main Aug 23, 2024
2 checks passed

sungshik deleted the basic-multiline-highlighting branch August 23, 2024 10:44
