Example DSL #153

brucou · 2021-07-01T00:25:27Z

brucou
Jul 1, 2021

This is linked to #139

This thread aims at documenting as they occur the questions/answers/pain points faced while implementing a simple DSL. This is a Learning in public approach which is useful when designing tutorials to indentify the problems faced by users and how they solve or fail to solve them. Other methods include filming (or watching) a user as it goes through his tasks. It is simpler for me here to just note down all what comes to my mind (stream of consciousness).

The DSL is as follows:

Objectives

This DSL is created to illustrate the steps to create a DSL with Langium and its associated tooling. The DSL is chosen to be reasonably realistic and simple rather than to exercise many parts of the Langium ecosystem.

Language goals

The language aims at supporting individuals that are interested in following a diet. The DSL supports:

defining aliments and their nutrition profile
registering daily food intake
defining health objectives (weight loss, waist size improvement, etc.)
reporting physical exercise
recording health assessment for a given day (weight, waist size, etc.)
Allows some user configuration/customization of the language semantics (default values, types and categories, etc.)

In the first draft of this DSL, we will only pursue the first two goals and the last one.

Language definition

Aliments and their nutrition profile

Aliments may have a type and category (legumes, liquid). Aliments provide a given number of calories and have a nutritional profile.
The nutritional profile should logically have mandatory components (proteins, carbs, fats) but also optional user-defined components (fiber, etc.).
If the nutrient information is not available for a category, a ? can be used to indicate that unavailability. The list of available measuring units should be fixed (g, mg, ml, cl, dl, l). Unless otherwise indicated, 1l is converted to 1kg.

An example is as follows:

-- Aliments
100g of alpro almond vanilla (liquid, milk product) provides 29 kcal, with the following nutrients:
- proteines: 0.40 g
- glucides: 3.80g
- lipides: 1.1g
- fiber: 1.3  g
- sodium:35mg
- cholesterol: ?
- calcium: 120 mg
- fer: ?
- magnesium: -

100g of Dr Oetker Spinaci Ristorante Pizza (solid, ?) provides 220 kcal, with the following nutrients:
- proteines: 7.1 g
- glucides: 21g
- lipides: 11g
- fiber: 1.9  g
- sodium:0.27g
- cholesterol: 0
- calcium: ? mg
- fer: ?
- magnesium: ?

-- Portions
alpro almond vanilla:
- small: 250ml
- large: 500ml

Daily food intake

The daily food intake section of the DSL lets the user declare his food intake across the meals of a given day. Meals are a list of aliments whose portion/quantity can be specified. To make it more user-friendly and closer to natural language, we define two distinct formats to enter that list:

An example is as follows:

On the DATE_FORMAT:
- at breakfast:
  - alpro almond vanilla (small)
- at lunch:
  - ?
- at dinner:
  - x1 Dr Oetker Spinaci Ristorante Pizza
- at USER_DEFINED_MEAL:
  - ...

On the DATE_FORMAT, at breakfast:
- alpro almond vanilla (200ml)
- ...

On the DATE_FORMAT, at dinner:
- Dr Oetker Spinaci Ristorante Pizza
- ...

Note that the existence of two formats is creating edge cases that will have to be handle. Such is the case when on the same day, for the same meal a user defines distinct food intakes in distinct formats. The food intakes should be summed appropriately nonetheless.

User configuration

There should be defaults so that this is rather infrequent.

An example is as follows:

Meals:
- breakfast
- lunch
- dinner

Categories:
- Fruit
- Vegetable
- Legume
- Sweet
- Fish 
- Meat
- Milk product

Type:
- Solid
- Liquid (cream-like)
- Liquid (sauce-like)
- Liquid (oil-like)
- Liquid (water-like)

brucou · 2021-07-01T00:29:05Z

brucou
Jul 1, 2021
Author

A word about me:

telecom engineer with general programming experience in miscellaneous low- and high-level languages
has written hand-made parsers
Also used bison/yacc back in the days, EBNF has only few misteries for me
Good understanding of basic concepts: CST/AST, code generation, etc.
no experience with XText (not to say a bad experience) and Java
fluent in JS/TS

0 replies

brucou · 2021-07-01T00:49:09Z

brucou
Jul 1, 2021
Author

Let's go at it. From the current npm package documentation, I gather that the plan is to draft a grammar for the DSL. From that, I should have a parser, LSP, and types generated (Langium CLI). With some extra efforts, that means I should be able to write texts in my brand new language in a VS Code editor (that is the vscode language server thing). And with even more efforts, I should be able to customize the whole thing.

Immediate questions:

which parts of the LSP are implemented from the grammar? syntax-highlighting? completion? jump to declaration? etc. What will I have to customize/implement from scratch myself? How easy will that be? What am I getting myself into :-)

Alright, let's draft that grammar. So:

The core of Langium is a grammar declaration language in which you describe multiple aspects of your language:

Tokens (keywords and terminal rules)
Syntax (parser rules)
Abstract syntax tree (AST)

Fair enough. I wonder how I am going to handle line returns and spaces with this language. It should be ok because clauses all fit in one line. Spaces are generally meaningless but there is some indentation to handle in the daily intake section of the language:

On the DATE_FORMAT:
- at breakfast:
  - alpro almond vanilla (small)

Le'ts keep that in some part of the mind, and move on.

The grammar declaration language of Langium is very similar to Xtext. Please follow the Xtext documentation to learn how to use this language.

What does very similar means?? What is similar, what is different? How would I even know? Ok I see that there are examples. That may help. So let's look at Xtext documentation and the examples.

Xtext documentation

Well.... the whole thing is dense and absconse enough that this will be left for another day. Let's move to the examples. That might work better.

Examples

I maintain a state machine library so let's look at the state machine example:

What am I supposed to look at? Alright let's look at package.json:

"langium": {
        "languageId": "statemachine",
        "grammar": "src/language-server/statemachine.langium",
        "extensions": [".statemachine"],
        "out": "src/language-server/generated",
        "textMate": {
            "out": "./syntaxes/statemachine.tmLanguage.json"
        }

Looks like the grammar is here: src/language-server/statemachine.langium. Let's look.

grammar Statemachine
hidden(WS, SL_COMMENT, ML_COMMENT)

Statemachine :
	{Statemachine}
	('events' events+=Event+ 'end')?
	('resetEvents' resetEvents+=[Event]+ 'end')?
	('commands'	commands+=Command+ 'end')?
	states+=State*;

Event:
	name=ID code=ID;

Command:
	name=ID code=ID;

State:
	'state' name=ID
		('actions' '{' actions+=[Command]+ '}')?
		transitions+=Transition*
	'end';

Transition:
	event=[Event] '=>' state=[State];

terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Ok, I can recognize terminal symbols, some regexps, some parsing rules, albeit with a strange syntax which is probably the Xtext part of things. First impression is that this is not very readable vs eBNF. But then again, this may just be the result of unfamiliarity. Let's try to figure out the grammar from some examples of the DSL. Where am I going to find that...

The pakcage.json ("extensions": [".statemachine"],) tells me that the extension is .statemachine so let's look for that.

...

Well I could not find any. The next best course of action is go back to Xtext then.

TO BE CONTINUED

2 replies

msujew Jul 1, 2021
Maintainer

Hi @brucou, just to give a very brief overview of what's happening in the statemachine grammar (compared to usual EBNF grammars):

In order to map the parsed text to the AST model we use assignments which are in the form of assignment=Rule, which will parse the given rule and assign the returned value to the property. Event: name=ID code=ID; for example will create an Event object that contains a name and code property which are strings (terminal rules return string values by default).
In order to be able to reference other existing elements we use cross references in the form of assignment=[Rule] which will by default try to parse an ID. The linking mechanism will then look for an object of the referenced type with an assigned name property that equals the parsed ID. Transition: event=[Event] '=>' state=[State]; will actually parse the tokens [ID, '=>', ID] and then transform the parsed Ids into references to other objects.
In most grammars you will find constructs such as {Addition.left=current} or like in the states grammar just {Statemachine}, these are actions. Actions are usually necessary to tell the parser what type of object you want to create. Using actions is quite cumbersome but they are necessary since we are using a LL(*) parser (they are not necessary for a LR parser). They are incredibly useful for building stuff like expression chains, used in other programming languages (or in the arithmetics example).

Most other grammar features should be familiar from EBNF like optional (?) or repeating (* or +) rules. I hope that helps you a bit.

brucou Jul 3, 2021
Author

Thanks @msujew yes it is helpful. However the reason why I am documenting here all my steps is to expose all the steps taken by a beginner trying Xtext. That information will be key later when designing tutorials (in a way similar to user-centered design methods) and documentation. Most of the time, once we reach the level of understanding we need to produce some output, we forgot all the pain we had to reach that understanding. For example, once I undertood Haskell monads, I can't really remember why I did not understand them so I can't help people writing monad tutorial anymore. So documenting everything helps.

So this is (not yet) written with the goal of getting help, but more to identify the missing parts in the available documentation and what would be beneficial for quicker onboarding.

brucou · 2021-07-03T22:20:54Z

brucou
Jul 3, 2021
Author

This continues #153 (comment) after failing to find a .statemachine example for the statemachine language given as examples

Learning Xtext

The grammar language is the corner stone of Xtext. It is a domain-specific language, carefully designed for the description of textual languages.

👍

The main idea is to describe the concrete syntax and how it is mapped to an in-memory representation – the semantic model.

Excellent. That is what I am going to have to figure out.

This model will be created by the parser on-the-fly when it consumes an input file. Of course the Xtext grammar language itself is implemented with Xtext, so you will find parts of its syntax described with its own means in this documentation.

Not really useful at this point, but why not.

An example grammar is shown in the 15 Minutes Tutorial.

Let's have a look! https://www.eclipse.org/Xtext/documentation/102_domainmodelwalkthrough.html#write-your-own-grammar

Oh I don't know why I thought this was a video. Probably because of the mention of the 15mn. So this is a text tutorial whose time to completion is estimated to be 15mn. 15mn sounds appealing so let's do that.

The tutorial starts with an example of the target language. Good, that's exactly what I was missing previously with the statemachine language.

In summary:

install xtext, start eclipse, set up a workspace, create a new Xtext project

I don't have eclipse on my PC. It is unclear what exactly is involved in installing eclipse, xtext (dependencies, environment, etc.), how eclipse is started, and how you set up a workspace. I now seriously doubt that this will be a 15mn tutorial anymore. I could end up down a rabbit hole. In the end I just want to understand how to write a grammar so I am going to skip this till I don't have a choice.

write the grammar

grammar org.example.domainmodel.Domainmodel with
                                      org.eclipse.xtext.common.Terminals
 
generate domainmodel "http://www.example.org/domainmodel/Domainmodel"
 
Domainmodel:
    (elements+=Type)*;
 
Type:
    DataType | Entity;
 
DataType:
    'datatype' name=ID;
 
Entity:
    'entity' name=ID ('extends' superType=[Entity])? '{'
        (features+=Feature)*
    '}';
 
Feature:
    (many?='many')? name=ID ':' type=[Type];

Ok that's useful. There is some kind of header and then what looks parsing rules. The explanation given in the tutorial is clear. I gather that:

There is a eBNF like syntax (more visible in the Type rule expressed as DataType | Entity. Good. I know that part.
there is a org.eclipse.xtext.common.Terminals grammar that can be reused and imported in this grammar. Good that means there is some modularity mechanism.
the feature terminology is used to describe variables/objects (name=ID;, name is the feature, ID is part of the eBNF grammar)
the rules in the grammar entities are mapped to features by means of an intermediary language. The eBNF-like grammar is intermingled with the intermediary language that and then compiled to a target language.
you can define cross-references (nice)

Some thinking:

for a very first tutorial I would minimize what there is to be aware to the most important things (here the grammar)? Is it better to
mention the org.eclipse.xtext.common.Terminals or instead to directly include it in the example grammar? Well it is not a big deal if you are already used to grammars
Because of the intermingling of the eBNF part (DataType: 'datatype' ID;) with the intermediate language part (name=ID), both the language description, and the AST description part have to be done conjointly. That could make things harder for learners as doing two things at once is harder than focusing on one thing (e.g., get the grammar right, then get the AST). When a problem will occur, it could be harder to discriminate whether the issue lies in the grammar definition or in the intermingled AST description.

This entities grammar already uses the most important concepts of Xtext’s grammar language. You have learned that keywords are written as string literals and a simple assignment uses a plain equal sign (=), whereas a multi-value assignment uses a plus-equals (+=). We have also seen the boolean assignment operator (?=). Furthermore the example contains syntax elements with different cardinalities (? = optional, * = any number, + = at least once) and demonstrates how cross-references can be declared.

Good summary.

Please consult the Grammar Language Reference for more details.

Let's have a look. Epackage, Ecore, etc.... We'll skip that in the end.

that is however useful:

Basically parsing can be separated in the following phases:

Lexing
Parsing
Linking
Validation

as we encountered mention of linking previously without explanation of what the term meant.

The part about terminal rules is interesting and easy to follow. The part about parser rules and below become too technical (EObject, etc.). Too many new concepts and not a single illustration of what that looks like (that is to define the shape of the AST right?? so showing an AST would be nice. Or maybe it is about defining code, which would explain the presence of type information, classes and other artifacts. That is probably harder to illustrate). There again, having both eBNF and AST generation language coupled makes it harder to explain and illustrate separately.

Ok, that shed some light on the Xtext way of describing grammars and generating parsers. Let's see if that helps understand the statemachine language:

grammar Statemachine
hidden(WS, SL_COMMENT, ML_COMMENT)

Statemachine :
	{Statemachine}
	('events' events+=Event+ 'end')?
	('resetEvents' resetEvents+=[Event]+ 'end')?
	('commands'	commands+=Command+ 'end')?
	states+=State*;

Event:
	name=ID code=ID;

Command:
	name=ID code=ID;

State:
	'state' name=ID
		('actions' '{' actions+=[Command]+ '}')?
		transitions+=Transition*
	'end';

Transition:
	event=[Event] '=>' state=[State];

terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Well:

still missing an example of .statemachine file. Without that I can only guess
I gather that state machines (Statemachine) can be nested ({Statemachine}} and have an optional list of events, reset events, and commands. Curiously a state machine can be defined without any state (states+=State*;`). Reset events must be among previously defined events (cross-reference syntax [Event]).
The terminals are defined directly here which is great rather than importing them from outside

Conclusion:

the tutorial helped understand the statemachine grammar even with a missing example file.
while it is possible to understand an Xtext grammar, it is still not yet possible to generate a grammar by oneself with confidence (unless that grammar does not desviate too much from the example grammar).
A trouble shooting section would be nice. What happens when I get things wrong? What are the common mistakes made by beginners? How to understand error messages? When/where are they generated anyways?

Alright that'is it for today. Next step is to see how to write our diet grammar with the information that we have gathered and brace for going through the unavoidable error messages that will result as we make our way through.

0 replies

brucou · 2021-07-05T00:39:55Z

brucou
Jul 5, 2021
Author

This continues #153 (comment).

Here we are going to try implementing our diet grammar. We start with installing VSCode and following the instructions documented in the npm package:

That went out nicely: installing vscode is a breeze
I did not use however code hello-world?
f5 indeed opens a new VS Code window.

Remarks:

It is not clear what is this F5 window and why it has to be another window in the first place.
Coming back the day after it seems that the new window has to be opened a second time and the .diet file opened yet anew. Not very practical, but that is very much a VS Code issue?
- not really, there is a reload button in a floating toolbar that does the trick.
instructions in langium-quickstart.md are quite clear, that's a relief.
- they also explain how to reload, nice. Still this information should be in the README.md of the package rather than the npm distribution, so all information is gathered in one spot for discoverabilty and searchability purposes.
VS Code completion magically works!!!

This is exciting, we did not have much to do to get the minimal Hello grammar working. Let's try now to get our grammar working. We will implement the grammar iteratively, and at every step check that we have indeed properly implemented a larger portion of our target language. We will try to keep the steps small as all being new we have little confidence in our abilities to get it right the first time on.

Original grammar:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    (persons+=Person | greetings+=Greeting)*;

Person:
    'person' name=ID;

Greeting:
    'Hello' person=[Person|ID] '!';

terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

We add the 100g of part of our grammar. That should be small enough so that if there is an error, we know what caused it.

trying to define UNIT as terminal did not work. Apparently only regexps are possible there. The error message displayed by VS Code worked very nicely as trouble shooting tool.

So now we have the second attempt:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT unit=UNIT 'of' 

Person:
    'person' name=ID;

Greeting:
    'Hello' person=[Person|ID] '!';

UNIT returns string: 'g' | 'mg' | 'ml' | 'cl'  | 'dl';

terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

it would have been nice to keep the Person and Greeting clause around by commenting them but I am not sure that was possible.

OK let's see how lucky we are.

We get from VS Code:

This parser rule does not create an object. Add a primitive return type or an action to the start of the rule to force object instantiation.

So we have to create an object it seems. UNIT returns String did not work, UNIT returns string did.

why do we have to create an object?? How does this object relates to our grammar parsing??

We obediently run

npm run langium:generate
npm run build

with the following results:

PS C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta> npm run langium:generate

> langium generate

Writing generated files to 'src/language-server/generated'
Generating serialized grammar...
Generating parser...
Generating grammar access...
Generating AST...
Generating dependency injection module...
Generating textmate grammars
Writing textmate grammars to 'syntaxes/dieta.tmLanguage.json'
PS C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta> npm run build

> [email protected] build
> webpack

asset main.js 1020 KiB [compared for emit] (name: main) 1 related asset
asset grammar.json 1.72 KiB [compared for emit] [from: src/language-server/generated/grammar.json] [copied]
runtime modules 937 bytes 4 modules
modules by path ./node_modules/ 916 KiB
  modules by path ./node_modules/chevrotain/lib_esm/src/ 336 KiB 42 modules
  modules by path ./node_modules/langium/lib/ 194 KiB 39 modules
  modules by path ./node_modules/vscode-languageserver-protocol/ 79.5 KiB 22 modules
  modules by path ./node_modules/vscode-languageserver/ 80 KiB 16 modules
  modules by path ./node_modules/vscode-jsonrpc/ 118 KiB 16 modules
  3 modules
modules by path ./src/language-server/ 10.2 KiB
  modules by path ./src/language-server/generated/*.ts 7.44 KiB 4 modules
  modules by path ./src/language-server/*.ts 2.77 KiB
    ./src/language-server/main.ts 547 bytes [built] [code generated]
    ./src/language-server/dieta-module.ts 1.16 KiB [built] [code generated]
    ./src/language-server/dieta-validator.ts 1.07 KiB [built] [code generated]
8 modules
webpack 5.42.0 compiled successfully in 7366 ms

asset extension.js 668 KiB [compared for emit] (name: main) 1 related asset
runtime modules 670 bytes 3 modules
modules by path ./node_modules/ 648 KiB
  modules by path ./node_modules/semver/ 55.8 KiB 44 modules
  modules by path ./node_modules/vscode-languageclient/ 281 KiB 31 modules
  modules by path ./node_modules/vscode-languageserver-protocol/ 79.5 KiB 22 modules
  modules by path ./node_modules/vscode-jsonrpc/ 118 KiB 16 modules
  modules by path ./node_modules/yallist/*.js 8.42 KiB
    ./node_modules/yallist/yallist.js 8.21 KiB [built] [code generated]
    ./node_modules/yallist/iterator.js 207 bytes [built] [code generated]
9 modules
webpack 5.42.0 compiled successfully in 7328 ms

No errors!! yay.

F5 to open a new VS Code window. And... No code completion... In fact, we stil have the Hello completion taking place. Ok, so closing everything and back. Same issue. So what a man to do??

Looking at the generated folder, it seems that ast.ts has not changed? (export type DietaAstType = 'Greeting' | 'Model' | 'Person';)

Restarting VS Code. Running generate again. And this time ast.ts is changed? ... oh well. Who knows what I did wrong. Or maybe it was the restarting VS Code? To keep in mind for the next iteration.

F5 and no, still Hello in there.

Back at our main VS Code window it seems that the ast.ts code is wrong. Indeed we have errors shown in the PROBLEMS tab.

Cannot find name 'g'
Cannot find name 'mg'
Cannot find name 'ml'
Cannot find name 'cl'
Cannot find name 'dl'

and this line export type DietaAstReference = ; gives an error:

Type expected.

Ok, let's return to the grammar. Maybe we should try again having UNIT as a terminal? Copying terminal WS: (' '|'\t'|'\r'|'\n')+; from the documentation it seems like the Xtext grammar differs here from the langium grammar declaration language. Can't be sure but VS Code says that it wants a regex.

Ok found one mistake I made. VS Code requires you to save the file. I am using WebStorm which does that automatically... So of course the grammar wasn't updated if the file was not saved...

[Edit]: there is an option in VS Code that allows to auto save file. Don't know for the life of me why this is not the default but it is easy enough to activate.

maybe something to display as a warning for the user when he runs npm run generate on unchanged files?

Ok new version of the grammar:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT unit=UNIT 'of';

terminal UNIT: /(g|mg|ml|cl|dl)/;
terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Still does not work... Ok let's make it even simpler:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT 'of';

terminal UNIT: /(g|mg|ml|cl|dl)/;
terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Only one error this time:

Type expected.

(in ast.ts : export type DietaAstReference)

That is not very encouraging. We have the most minimal grammar we can think of, no errors reported from the grammar declaration language in VS Code, and not a clue on what could be going wrong. DietaAstReference refers to a reference but we have no reference here...

That is about it. We are blocked and we'll call it a day.

3 replies

brucou Jul 5, 2021
Author

To investigate: remove all objects reference and just go full ebnf.

Model:
    INT 'of';

msujew Jul 5, 2021
Maintainer

Quick info: You encountered a bug which was fixed by #132. For now, add manually never to the type declaration of DietaAstReference or create a dummy rule which looks like this: Test: value=[Test]. It is an issue related to missing cross references in the grammar.

brucou Jul 8, 2021
Author

That worked

brucou · 2021-07-08T12:39:28Z

brucou
Jul 8, 2021
Author

This continues #153 (comment)

After meeting a hurdle, we are now updating our strategy to first write the eBNF part of the grammar declaration language. The idea is to first test that we are the right language, before thinking about the AST we need to exploit the language programmatically:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    INT 'of';

terminal UNIT: /(g|mg|ml|cl|dl)/;
terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Test: value=[Test];

(the last line is a workaround for a bug. Cf. #153 (reply in thread)).

Unfortunately that doesn't work:

This parser rule does not create an object. Add a primitive return type or an action to the start of the rule to force object instantiation.

So we really have to do both together at the same time... Shame I would already be done with the eBNF part already and feel like I achieved something valuable.

mmm let's try to see if we can create ONE object and then just forget about the rest. No, actually that wouldn't work, we are already creating one value object (the Test rule) and we still got our error message.

At that point, a portion of prospective users have already given up on using the tool. What could be done to keep those users in?

Ok, let's add back the units (g, ml, cl, etc.):

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT UNIT 'of';

terminal UNIT: /g|mg|ml|cl|dl/;
terminal WS: /\s+/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Test: value=[Test];

And.... that doesn't work. The grammar simply does not seem to have been updated?

Expecting --> 'of' <-- but found --> 'g' <--

Adding a unit=UNIT who knows. Nope. Same message

Expecting --> 'of' <-- but found --> 'g' <--

So maybe the regexp syntax is not the JS syntax? Let's simplify /g|mg|ml|cl|dl/ to just /g/.

=> still no luck.

So now let's copy a regexp that we know is working ( quantity=INT unit=ID 'of';).

=> same.

Ok so the issue is likely not in the Xtext. mmm I probably forgot to run the build... let's do that.

[tsl] ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\dieta-validator.ts(18,13)
      TS2322: Type '{ Person: (person: any, accept: ValidationAcceptor) => void; }' is not assignable to type 'DietaChecks'.
  Object literal may only specify known properties, and 'Person' does not exist in type 'DietaChecks'.

?? What the hell is Person doing there? I am going crazy. For some reasons, dieta-validator.ts is importing Person from the ast:

import { DietaAstType, Person } from './generated/ast';

This is probably a Langium error? Ok deleting validator.ts file -> not regenerated by npm run langium:generate. This is maybe because the language (.langium) has not changed? so we change it. Nope, same story. dieta.validator.ts is not regenerated. mmm ok, so I guess I also have to understand validator files now... Go back to read me some Xtext stuff. Ok I get more or less what the hell this does and I remove all Person references while keeping the core classes empty so it does not complain. Navigating TypeScript errors (TS6133: 'ValidationAcceptor' is declared but its value is never read.), the build finally succeeds.

Holding my breath.... it works. Ok so now let's modify the grammar back to what we originally wanted (unit=UNIT) but with terminal UNIT: /g/; first. Ok, that seems to work? No autocompletion though:

Let's try mg: terminal UNIT: /g|mg/;

mmm so we definitely have an issue with regexp. This likely does not follow the JS syntax. Not sure what to do here, there is no example of the regexp syntax used by Langium. Or maybe this is a bug.

That's all for today.

4 replies

brucou Jul 8, 2021
Author

should be able to work around it by creating a terminal rule for each unit, and one composite rule gathering the terminal rules. to try later.

would be great to be able to debug a langium file in a user-friendly way. Immediate feedback is the key here.
- here instead of looking up parser.ts file, I should be able to inspect the outputs of the generator as it goes through a series of steps (lexing, parsing, generating etc.). So here I would have been able to recognize that the error was in the parsing phase. I may have even been able to write automated tests given that all the involved phases leverage pure functions. Those automated tests also serve as documentation!
- this is already possible but at the end of the generation phase (e.g., looking at generated files) which make bug diagnostic harder given the number of things that happened and could have gone wrong along the way.

msujew Jul 8, 2021
Maintainer

Quick info: We are currently unable to correctly distinguish between terminal rules that could parse the same input. In your example, the 'g' can be both parsed by 'ID' and 'UNIT', and it tries to match 'ID' first. Actually, the UNIT terminal should not be a terminal at all, but either a datatype rule Unit returns string: 'g' | 'mg' | 'ml' | 'cl' | 'dl' or inlined into the unit assignment like this unit=('g'|'mg'|'ml'|'cl'|'dl'). Both should work correctly.

brucou Jul 8, 2021
Author

I came to the same exact conclusion :-) after looking up parser.ts and chevrotain parsing rules:

https://chevrotain.io/docs/tutorial/step1_lexing.html#creating-the-lexer

The order of Token definitions passed to the Lexer is important. The first PATTERN to match will be chosen not the longest.

https://github.com/chevrotain/chevrotain/blob/master/examples/lexer/keywords_vs_identifiers/keywords_vs_identifiers.

const keywordsVsIdentifiersLexer = new Lexer([
  Whitespace, // Whitespace is very common in most languages so placing it first generally speeds up the lexing.

  While, // the actual keywords (While/For/Do) must appear BEFORE the Identifier Token as they are all a valid prefix of it's PATTERN.
  For, // However the edge case of an Identifier with a prefix which is a valid keyword must still be handled, for example:
  Do, // 'do' vs 'done' or 'for' vs 'forEach'. This is solved by defining 'Keyword.LONGER_ALT = Identifier'/
  // thus each time a Keyword is detected the Lexer will also try to match a LONGER Identifier..

  Identifier // As mentioned above, the Identifier Token must appear after ALL the Keyword Tokens
])

parser.ts

const ID = createToken({ name: 'ID', pattern: /[_a-zA-Z][\w_]*/ });
const INT = createToken({ name: 'INT', pattern: /[0-9]+/ });
const ML_COMMENT = createToken({ name: 'ML_COMMENT', pattern: /\/\*[\s\S]*?\*\//, group: Lexer.SKIPPED });
const SL_COMMENT = createToken({ name: 'SL_COMMENT', pattern: /\/\/[^\n\r]*/, group: Lexer.SKIPPED });
const STRING = createToken({ name: 'STRING', pattern: /"[^"]*"|'[^']*'/ });
const UNIT = createToken({ name: 'UNIT', pattern: /g|mg/ });
const WS = createToken({ name: 'WS', pattern: /\s+/, group: Lexer.SKIPPED });
const OfKeyword = createToken({ name: 'OfKeyword', pattern: /of/, longer_alt: ID });

Note how token rules are in alphabetical order, that is, do not follow the order given in the grammar (UNIT comes before ID).

If order matters, it should be kept like in the grammar definition? There can be a TON of absurdly hard to find bugs caused by changing lexing rule order.

msujew Jul 8, 2021
Maintainer

Right, that makes sense. We should probably keep them in the original order and add that information to the documentation.

brucou · 2021-07-08T18:11:01Z

brucou
Jul 8, 2021
Author

This continues #153 (comment)

As mentioned in related comments, the previously detected issue probably comes from chevrotain lexing algorithm, given that we have feeded it with ambiguous tokenization rules (an ID could be a UNIT). The ambiguity is probably limited to tokenization as grammars are written to be unambiguous. So here either we remove ID as a terminal rule, or we remove UNIT as a terminal rule, or we remove both as terminal rules, and leave terminal rules to be, well, mere tokens, i.e. something small, simple and unambiguous. Comments for instance would not qualify as simple tokens. ADR (architecture detail record) to think about.

In any case, it could be interesting to detect and warn the user about ambiguity in terminal rules.

Given that terminal rules are regexp, this comes down to the problem of given two regexp r1 and r2, determining whether the intersection of r1 and r2 is empty or not. There is a mathematic theorem that guarantees that there is a r such that r1 ∩ r2 parses the same strings than r (cf. https://sci-hub.se/10.1145/2071368.2071372). I am not however aware of any algorithm that is able to compute the cardinality of the set of strings parsed by r. An alternative approach is to construct r and run a ton of arbitrary strings through it and check that none matches the regexp. A simpler approach is to generate arbitrary strings (Cf. https://github.com/dubzzz/fast-check) in some smart ways that exercise the r1 and r2 branches. But well none of the two approaches are exactly trivial to do that well in the general case. Food for thoughts.

Moving on...

0 replies

brucou · 2021-07-08T18:25:16Z

brucou
Jul 8, 2021
Author

So the suggested changes to the grammar work like a charm:

Model:
    quantity=INT unit=Unit 'of';

Unit returns string: 'g' | 'mg' | 'ml' | 'cl' | 'dl';

Let's add the food item, type, and category.

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT unit=Unit 'of' Food_Item '(' Food_Type ',' Food_Category ')'  ;

Unit returns string: 'g' | 'mg' | 'ml' | 'cl' | 'dl';
Food_Item returns string: item+=ID* ;
Food_Type returns string: 'solid';
Food_Category returns string: 'fruit'; 

terminal WS: /\s+/;
terminal ID: /[^[0-9]][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Test: value=[Test];

We get the following error in VS:

Normal parser rules are not allowed to return a primitive value. Use a datatype rule for that.

could be good to put a link that can be referred to for more information (trouble shooting helper)

What the hell is a datatype rule?? => https://www.google.com/search?q=data+type+rule+xtext&oq=data+type+rule+xtext&aqs=chrome..69i57.2580j0j7&sourceid=chrome&ie=UTF-8

=> https://www.eclipse.org/Xtext/documentation/301_grammarlanguage.html#datatype-rules

Data type rules create instances of EDataType instead of EClass

??

They are quite similar to terminal rules, but they are actually parser rules and are therefore ...

Right, but it is not a normal parser rules, so still waiting for the difference.

So data types rules can use hidden tokens.

Note that rules that do not call other parser rules and do neither contain any actions nor assignments are considered to be data type rules, and the data type EString is implied if none has been explicitly declared.

Ok that is more clear as to what distinguishes data type rules: rules with no call to other parser rules, no actions, no assignments.

Oh wait, I have the previous errors back again:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT unit=Unit 'of' Food_Item '(' Food_Type ',' Food_Category ')'  ;

Unit returns string: 'g' | 'mg' | 'ml' | 'cl' | 'dl';
Food_Item: item+=ID* ;
Food_Type returns string: 'solid';
Food_Category returns string: 'fruit'; 

terminal WS: /\s+/;
terminal ID: /[^[0-9]][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Test: value=[Test];

(note how I remove the returns from the Food_Item rule. It could not return a string anyways, but an array of strings, and it is unclear what is the syntax in the grammar declaration language for a string of array...).

gives:

ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts
31:19-20
[tsl] ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts(31,20)
      TS2304: Cannot find name 'g'

I thought that was working before. What happened again?

ast.ts:

export type Unit = g | mg | ml | cl | dl

Let's walk it back. Turns out it was already giving webpack errors but the .diet autocompletion actually works as expected (?).

In other words, the webpack errors can (at least here) be ignored?? Let's add just the food item back:

Model:
    quantity=INT unit=Unit 'of' Food_Item;

Unit returns string: 'g' | 'mg' | 'ml' | 'cl' | 'dl';
Food_Item: item+=ID+ ;

No can do:

So the ast.ts type error is blocking. I am stuck again. This time I really call it a day.

4 replies

msujew Jul 8, 2021
Maintainer

Another quick note: You found another bug! Datatype rules which only contain keywords and no terminals, are currently generated as a special typescript type. E.g. your unit type should be export type Unit = 'g' | 'mg' | 'ml' | 'cl' | 'dl' but is missing the quotes. I filed #165 for this. For now, you can replace your unit datatype rule with an assignment like unit=('g' | 'mg' | 'ml' | 'cl' | 'dl').

brucou Jul 8, 2021
Author

That is the tester in me :-) Thanks for the reactivity though

msujew Jul 8, 2021
Maintainer

You're welcome, I have to thank you for being so thorough in describing all your issues and pain points :D
FYI, I'm not trying to push you in some 'happy path' direction, I just don't want you to become too frustrated with Langium for the lack of documentation and the bugs that slipped through. That's why I'm writing all those quick notes.

brucou Jul 8, 2021
Author

Sure. I don't want to be blocked for too long anyways :-)

brucou · 2021-07-09T14:43:26Z

brucou
Jul 9, 2021
Author

This continues #153 (comment)

Ok so a confusing thing here that I am just realizing is that npm run watch does not actually rerun when I modify the .langium file. I was under the understanding that it would:

Run npm run watch to have the TypeScript compiler run automatically after every change of the source files.

So the source files here do not include the .langium files. Meaning that these are the generated files?

anyways it would be nice (DX) to have a watcher that rerun everything that has to be rerun instead of running two scripts manually
actually what is even nicer is to have a code playground with grammar files (and related files) on one side, and the monaco editor on the other side. The monaco editor is using LSP so it should be the same as what you see in VS Code, only without having to reload and other inefficiencies.

So, now that I run generate after changing the grammar webpack does not return any error for this updated grammar:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item: item+=ID+ ;

terminal WS: /\s+/;
terminal ID: /[^[0-9]][\w_]*/;
terminal INT returns number: /[0-9]+/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

Test: value=[Test];

Note that this grammar as recommended here does inline the definition of unit.

Let's try this in VS Code. Completion of the unit now works:

Two interesting issues though:

the 100 gives an error (or warning?):
the item gives an error:

Alright, let's go by step. On the 100 number, VS says:

Expecting: expecting at least one iteration which starts with one of these possible Token sequences::
<[ID]>but found: ''

That does not really make sense, at least at this position. The funny thing is that if I add a line return at the end after milk, that warning disappears? This could possibly be a bug in the generated LSP?

Moving on to the next warning because I don't think there is a mistake in the grammar declaration language here:

unexpected character: ->m<- at offset: 10, skipped 4 characters.

not sure how the first warning affects the validity of the subsequent warnings (this often happens when writing incorrect code right, one mistake upstream provokes a ton of mistakes downstream). In any case, VS does not recognize milk as being a Food_Item.

At this point I am unable to discriminate where the error lies. It could be the LSP, it could be the parser, it could be the lexer, and well it could be the grammar but I doubt it.

Let's look at the parser (parser.ts):

/******************************************************************************
 * This file was generated by langium-cli 0.1.0.
 * DO NOT EDIT MANUALLY!
 ******************************************************************************/

/* eslint-disable */
// @ts-nocheck
import { createToken, Lexer } from 'chevrotain';
import { LangiumParser, LangiumServices, DatatypeSymbol } from 'langium';
import { DietaGrammarAccess } from './grammar-access';
import { Food_Item, Model, Test, } from './ast';

const ID = createToken({ name: 'ID', pattern: /[^[0-9]][\w_]*/ });
const INT = createToken({ name: 'INT', pattern: /[0-9]+/ });
const ML_COMMENT = createToken({ name: 'ML_COMMENT', pattern: /\/\*[\s\S]*?\*\//, group: Lexer.SKIPPED });
const SL_COMMENT = createToken({ name: 'SL_COMMENT', pattern: /\/\/[^\n\r]*/, group: Lexer.SKIPPED });
const STRING = createToken({ name: 'STRING', pattern: /"[^"]*"|'[^']*'/ });
const WS = createToken({ name: 'WS', pattern: /\s+/, group: Lexer.SKIPPED });
const ClKeyword = createToken({ name: 'ClKeyword', pattern: /cl/, longer_alt: ID });
const DlKeyword = createToken({ name: 'DlKeyword', pattern: /dl/, longer_alt: ID });
const MgKeyword = createToken({ name: 'MgKeyword', pattern: /mg/, longer_alt: ID });
const MlKeyword = createToken({ name: 'MlKeyword', pattern: /ml/, longer_alt: ID });
const OfKeyword = createToken({ name: 'OfKeyword', pattern: /of/, longer_alt: ID });
const GKeyword = createToken({ name: 'GKeyword', pattern: /g/, longer_alt: ID });

ClKeyword.LABEL = "'cl'";
DlKeyword.LABEL = "'dl'";
GKeyword.LABEL = "'g'";
MgKeyword.LABEL = "'mg'";
MlKeyword.LABEL = "'ml'";
OfKeyword.LABEL = "'of'";
const tokens = [ClKeyword, DlKeyword, MgKeyword, MlKeyword, OfKeyword, GKeyword, ID, INT, ML_COMMENT, SL_COMMENT, STRING, WS];

export class Parser extends LangiumParser {
    readonly grammarAccess: DietaGrammarAccess;

    constructor(services: LangiumServices) {
        super(tokens, services);
    }

    Model = this.MAIN_RULE("Model", Model, () => {
        this.initializeElement(this.grammarAccess.Model);
        this.consume(1, INT, this.grammarAccess.Model.quantityINTRuleCall);
        this.or(1, [
            () => {
                this.consume(2, GKeyword, this.grammarAccess.Model.GKeyword);
            },
            () => {
                this.consume(3, MgKeyword, this.grammarAccess.Model.MgKeyword);
            },
            () => {
                this.consume(4, MlKeyword, this.grammarAccess.Model.MlKeyword);
            },
            () => {
                this.consume(5, ClKeyword, this.grammarAccess.Model.ClKeyword);
            },
            () => {
                this.consume(6, DlKeyword, this.grammarAccess.Model.DlKeyword);
            },
        ]);
        this.consume(7, OfKeyword, this.grammarAccess.Model.OfKeyword);
        this.unassignedSubrule(1, this.Food_Item, this.grammarAccess.Model.Food_ItemRuleCall);
        return this.construct();
    });

    Food_Item = this.DEFINE_RULE("Food_Item", Food_Item, () => {
        this.initializeElement(this.grammarAccess.Food_Item);
        this.atLeastOne(1, () => {
            this.consume(1, ID, this.grammarAccess.Food_Item.itemIDRuleCall);
        });
        return this.construct();
    });

    Test = this.DEFINE_RULE("Test", Test, () => {
        this.initializeElement(this.grammarAccess.Test);
        this.consume(1, ID, this.grammarAccess.Test.valueTestCrossReference);
        return this.construct();
    });

}

The parser does not look obviously bad.

const ClKeyword = createToken({ name: 'ClKeyword', pattern: /cl/, longer_alt: ID }); longer_alt is correctly set to ... actually maybe not? longer_alt if I remember well means that the lexer will try to match an ID. But that should be only be if the match is longer. If there is not a longer match, it should match the short pattern. Well I think anyways.

The only way to know is to test it. The good news is that Chevrotain seems to have a playground: https://chevrotain.io/playground/. Let's try to put that code in the playground.

So no that's not going to work because LangiumParser is connected to Chevrotain.Parser in unknown ways and it is impossible to know what to put in the playground. So that's a dead avenue.

Blocked again!

We experiment with simplifying the grammar to isolate the error. Skipping failing steps, this works:

grammar Dieta
hidden(WS)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item returns number: INT ;

terminal WS: /\s+/;
terminal ID: /[^[0-9]][\w_]*/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

but this doesn't:

grammar Dieta
hidden(WS)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item returns string: ID ;

terminal WS: /\s+/;
terminal ID: /[^[0-9]][\w_]*/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

There is progress though, the first 100 is not showing a warning anymore here:

but it is here:

Then I get this when trying again:

The dieta server crashed 5 times in the last 3 minutes. The server will not be restarted.

A request has failed. See the output for more information.

Error: Errors detected in definition of Lexer:
Token Type: ->ID<- static 'PATTERN' must not match an empty string
    at c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:13708:23
    at Lexer.TRACE_INIT (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:14198:20)
    at new Lexer (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:13615:14)
    at new LangiumParser (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:21641:22)
    at new Parser (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:23686:9)
    at LangiumParser (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:23543:38)
    at _resolve (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:10932:53)
    at Object.get (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:10898:29)
    at new DefaultDocumentBuilder (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:9372:39)
    at DocumentBuilder (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:93:44)
[Error - 23:09:27] Connection to server got closed. Server will not be restarted.

So we do this:

grammar Dieta
hidden(WS)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item returns string: ID ;

terminal WS: /\s+/;
terminal ID: /[\w_]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

We geT:

Debugger listening on ws://127.0.0.1:6009/dd8c4e8c-bd4f-4fab-b730-4d591617d490
For help, see: https://nodejs.org/en/docs/inspector
Parser exception thrown! MismatchedTokenException: Expecting token of type --> INT <-- but found --> 'sd' <--
    at ChevrotainWrapper.consumeInternalError (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:20827:31)
    at ChevrotainWrapper.consumeInternal (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:20802:22)
    at ChevrotainWrapper.consume (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:20015:21)
    at ChevrotainWrapper.wrapConsume (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:21876:21)
    at Parser.consume (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:21700:36)
    at c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:23689:18
    at Parser.<anonymous> (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:21675:26)
    at ChevrotainWrapper.invokeRuleWithTry (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:20469:33)
    at ChevrotainWrapper.wrappedGrammarRule [as Model] (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:20483:38)
    at Parser.parse (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:21658:38) {
  token: {
    image: 'sd',
    startOffset: 0,
    endOffset: 1,
    startLine: 1,
    endLine: 1,
    startColumn: 1,
    endColumn: 2,
    tokenTypeIdx: 47,
    tokenType: {
      name: 'ID',
      PATTERN: /[\w_]+/,
      tokenTypeIdx: 47,
      CATEGORIES: [],
      categoryMatches: [],
      categoryMatchesMap: {},
      isParent: false
    }
  },
  resyncedTokens: [],
  previousToken: {
    image: '',
    startOffset: NaN,
    endOffset: NaN,
    startLine: NaN,
    endLine: NaN,
    startColumn: NaN,
    endColumn: NaN,
    tokenTypeIdx: 1,
    tokenType: {
      name: 'EOF',
      PATTERN: /NOT_APPLICABLE/,
      tokenTypeIdx: 1,
      CATEGORIES: [],
      categoryMatches: [],
      categoryMatchesMap: {},
      isParent: false
    }
  },
  context: { ruleStack: [ 'Model' ], ruleOccurrenceStack: [ 0 ] }
}

which is expected given the input

For this input, the parser does not seem to recognize the numbers:

That's all for today. My intuition here is this is probably a bug in langium more than a mistake in the grammar definition.

1 reply

brucou Jul 9, 2021
Author

As an aside, and thinking about testing (a rather difficult subject here), generating correct but random grammars, and testing the parser on it would be great to find bugs.

I found one library that generates random strings that match a regexp. It is java though: https://github.com/mifmif/Generex. It is developed from another library that understand the following regexp grammar: https://www.brics.dk/automaton/doc/index.html?dk/brics/automaton/RegExp.html

There is that too https://code.google.com/archive/p/xeger/ from the same base library and also in Java.

If all terminals are regexp, and we have a way to generate string that satisfy regexp, we also have left to generate language examples that exercise all the branches of the parsing rules. So Rule: a | b => generate a string that satisfies a, generate a string that satisfies b, possibly generate a string that satisfies a AND b if that exists, generate a string that satisfies neither a nor b (i.e. generate a string that satisfies the complement of and b). The asymptotic complexity of all that doesn't look good, for sure,

Then when having a way to generate random grammars, and a way to generate random strings that satisfy and do not satisfy the grammar, we have a way to test.

Metamorphic testing could also work. You take one grammar and a set of strings. You modify that grammar so that the grammar semantics are the exact same (through syntactic manipulations that preserve semantics) and you check that the grammar with the new syntax recognizes the same set of strings that the original grammar.

All of that is hard, I just leave the idea here for future thoughts. Also for reference an interview on how this was done for testing the WebAssembly parser: https://www.infoq.com/articles/webassembly-wasmtime-tooling-reference-types/

brucou · 2021-07-09T23:30:51Z

brucou
Jul 9, 2021
Author

Made some progress with the following grammar:

grammar Dieta
hidden(WS)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item: item=ID ;

terminal WS: /\s+/;
terminal ID: /[^0-9][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

So the issue before was probably from Chevrotain parser as was visible from the logs. The ID rule was likely conflicting with the INT rule.

it is too easy to fall into that trap. There should be some way to guard against it.

Now I get this:

Parser exception thrown! NoViableAltException: Expecting: one of these possible Token sequences:
  1. ['g']
  2. ['mg']
  3. ['ml']
  4. ['cl']
  5. ['dl']
but found: ' g'

This time the issue seems to be with the parsing of spaces? ' g' should be parsed as a WS terminal and a unit. For some reasons that does not happen.

I solved it!!!!

The key was to replace in parser.ts, this

const tokens = [ClKeyword, DlKeyword, MgKeyword, MlKeyword, OfKeyword, GKeyword, ID, INT, WS];

by

const tokens = [WS, ClKeyword, DlKeyword, MgKeyword, MlKeyword, OfKeyword, GKeyword, ID, INT];

This is the same issue as before in the end:

conflicting terminal rules

Let's now try something else: replacing with terminal ID: /[^0-9 ][\w]+/; that is excluding spaces from the ID terminal rule.

And all is swell:

Let's make a food item consist of several words.

grammar Dieta
hidden(WS)

Model:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item;

Food_Item: item+=ID+ ;

terminal WS: /\s+/;
terminal ID: /[^0-9 ][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

That works great:

Let's add food type and category slowly:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' Food_Type ',' Food_Category ')'  ;

Food_Item: item+=ID+ ;
Food_Type: type=[ID];
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 ][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

Error is Could not resolve reference to 'ID'.:

We try that:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')'  ;

Food_Item: item+=ID+;
Food_Type returns string: ID;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 ][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

Error is Type 'string' does not satisfy the constraint 'AstNode'.:

ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts
32:20-26
[tsl] ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts(32,21)
      TS2344: Type 'string' does not satisfy the constraint 'AstNode'.
ts-loader-default_e3b0c44298fc1c14

ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts
82:23-29
[tsl] ERROR in C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\src\language-server\generated\ast.ts(82,24)
      TS2693: 'string' only refers to a type, but is being used as a value here.
ts-loader-default_e3b0c44298fc1c14

webpack 5.42.0 compiled with 2 errors in 6104 ms
npm ERR! code 1
npm ERR! path C:\Users\Bruno Couriol\WebstormProjects\torrents\dieta
npm ERR! command failed
npm ERR! command C:\Windows\system32\cmd.exe /d /s /c webpack

Not sure why that is. Next try:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')'  ;

Food_Item: item+=ID+;
Food_Type: name+=ID+;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 ][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

That compiles!

However the comma is not correctly parsed Expecting --> '(' <-- but found --> ',' <--:

NExt try: remove the +:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')'  ;

Food_Item: item+=ID+;
Food_Type: name=ID;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 ][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

Same issue:

Let's try this just in case:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')'  ;

Food_Item: item+=ID+;
Food_Type: name=ID;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 \(\)\,][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

And yes that worked...

So the issue there again was that the (, ) and , were also conflicting with the terminal tokens!

Adding the food cateogry now:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' category=[Food_Category] ')' 
    'provides' cal=INT 'kcal,' 'with' 'the' 'following' 'nutrients:' ;

Food_Item: item+=ID+;
Food_Type: name+=ID+;
Food_Category: name+=ID+; 

terminal WS: /\s+/;
terminal ID: /[^0-9 \(\)\,][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

That works!:

Ok so we have done a small part of small part of our target grammar :-) Time to make a summary of findings.

0 replies

brucou · 2021-07-09T23:47:55Z

brucou
Jul 9, 2021
Author

Let's now add the kcal:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')' 
    'provides' cal=INT 'kcal, with the following nutrients' ;

Food_Item: item+=ID+;
Food_Type: name=ID;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 \(\)\,][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

Error is Keywords should not contain whitespace characters.. Here I am not really sure how Xtext let you declare full strings...

Let's do this for now:

grammar Dieta
hidden(WS)

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' Food_Category ')' 
    'provides' cal=INT 'kcal,' 'with' 'the' 'following' 'nutrients' ;

Food_Item: item+=ID+;
Food_Type: name=ID;
Food_Category: category='fruit'; 

terminal WS: /\s+/;
terminal ID: /[^0-9 \(\)\,][\w]+/;
terminal INT returns number: /[0-9]+/;

Test: value=[Test];

It is probably not the same semantics but let's test and see.

So this is allowed:

100g of soja milk (milk, fruit)provides 1 kcal,with the following nutrients

when it should not.

This also allowed:

100g of soja milk (milk, fruit) provides 1 kcal, with the following nutrients

that too:

100g of soja milk (milk, fruit) provides 1kcal, with the following nutrients

so in the end it is not too bad. We can fine tune later.

0 replies

brucou · 2021-07-10T00:55:58Z

brucou
Jul 10, 2021
Author

Note on the documentation effort:

Conceptual framework

Self-learning and discovery

In any sufficiently complex software, a significant portion of users will not go and check the documentation (because the complexity of that is in proportion to that of the software), no matter how good that is. That makes it important to provide facilities for the user to discover by himself what works and what does not. In that sense, these help:

immediate feedback (REPL loop)
troubleshooting:
- informative error messages that allow to diagnose and correct the issue
- suggesting fixes to commonly made mistakes
- give links to error descriptions, or forums (Stackoverflow, etc.)
working examples
tutorials

Information architecture

Information provided to the user must be organized so he discovers:

where to look for a piece of information
how concepts relate together (hierarchy, sequence)
- optimally from the way the information itself is organized

Conceptual guides

Users must master the key terminology (universal language for the language modeling domain) that will be useful to discuss with peers and support team:

grammar definition
- general grammar concepts (tokens, terminals, composite rules, etc.)
- Xtext concepts
grammar generation process
- lexing
- parsing
- linking
- generating
etc.

API reference

every detail necessary to use the exposed APIs

FAQ/tips

Common errors (and known workarounds)
Common questions (and answers)
Common modeling questions
- in my example how to model an arbitrary string like 'kcal, with the following nutrients'

A few ideas

There go a few ideas that I believe may help drive adoption. They are quoted without any consideration of the time and effort that they may require :-) The idea is to list them all so you can see what makes sense given the direction of the project and the available resources.

a REPL is a MUST have. There are obvious challenges to this: slow compilation times, etc. but the value for the user is immense. REPL have time and time again proven to drive adoption more than documentation and support. Cf. comment from one Svelte team member https://www.infoq.com/news/2020/10/svelte-simple-repl-summit-2020/:

About 18 months ago, I published [MDsveX (MDX for Svelte)], a library [that allows using Svelte component in Markdown documents]. Honestly, nobody was interested […] Then one day, a friend was talking to me about how they wished they could write some combination of Markdown and Svelte.
It sounded like they needed MDsveX but they had no idea what it was. The README wasn’t helpful and neither was my explanation. […] I decided to spend a few evenings putting together an example site and an online playground so they could try it out for themselves. I posted a link on Svelte discord, it got shared around on Twitter and all of a sudden there was interest.

FAST feedback loop, good user interface. Web users have come to expect good UX as a basic feature. So the expectations here may be higher than those from back-end developers. Once you go with a REPL, it has to be good, or the selling point becomes a cursing point.
better feedback. Every stage of the generator produces outputs. Make those outputs visual or at least textual but visible. They will serve for the documentation. They will serve also as a means to write tests.

A good example of the three previous points is the Nearley playground:

The user can enter rules in the grammar definition language on the left, write sample texts on the right, and see the results of the parser (CST). This shows two succesfully parsed strings and one string that fails to be parsed:

This shows a more complex example that includes not only the eBNF grammar but also the AST production rules. The resulting AST/transformed CST is also explorable:

The input (left)/ output (right) metaphor works well with the immediate feedback to learn autonomously the grammar definition language. The separation of grammar definition (eBNF syntax) and CST transformation (standard JS) also serves to decrease the cognitive load of users who have less to learn (little new syntax), can focus on each task separately (check that the language is defined correctly, then check that the AST is correctly produced), backed by the REPL.

This may or may not be possible to replicate exactly with Xtext as is but the usability characteristics that Nearley exhibits are definitely desirable to achieve. That + the specific advantages of Xtext (cross-references, LSP) would be a winning combo.

Personal opinion

Quick feedback as a user after toiling to implement what in my mind was a simple grammar that i would already have implemented in a few hours with other generators:

I was badly beaten by the terminal parsing rules conflict. On the bright side it was the same mistake most of the time. On the other hand, it is an invisible mistake and one that is very easy to do, especially for beginners.
I actually do not see why I am obliged to dclared things that I am not using (yet). Like name=ID. In the grammar definition phase, I don't really care yet about how I will refer to that ID. It is a bit confusing to have to think about it immediately. Of course, once this is acquired, the advantage of doing two things at once is clear. Still, I'd rather do it in two steps. That matches how I naturally approach declaring a language. I guess that might not be the case for seasoned Xtext users though.
I did not use types so far in the grammar but I wonder how useful they are. I understand why to convert string to numbers, but I am not sure I see the interest of complex types. I just want to write a language quickly. That feels kind of in my way. It has not been a big issue so far in any case. Are the types necessary for good LSP? Or was it more because Java is a typed language?
the ability to indicate cross-references is great, that is a very common need so it really comes in handy, especially in conjunction with the LSP
it was time consuming to get used to VS (I use Webstorm usually, I am a first time VS code user), to manually compile the language every time I modify something, and to refresh the VS extension window. It is not really painful but it takes time. For those of us front-end dev who are used to hot-module reloading and immediate feedback, the whole process feels slow.
at the beginning I found it hard to navigate the available information. The README package has some information, then the npm package some more, then Xtext doc page yet some more, etc. All of this I had to organize myself in some ways. It is better if all of that is in one place and already organized.
I am seriously amazed at the integration with VS Code. Code assistance, completion, etc. is pure magic.

That is very preliminary and I haven't given too much thought. Leaving this here anyways for future reference.

6 replies

msujew Jul 10, 2021
Maintainer

Also another thing I noticed was your ID terminal definition. terminal ID: /[^0-9 \,][\w]+/; looks quite curious to me, since you could just use terminal ID: /[a-zA-Z][\w]*/; which would make adding other symbols a lot easier.

brucou Jul 10, 2021
Author

Regarding playground support: That's on our internal (we should probably make it public soon) roadmap. Just yesterday I finished the draft for the new parser infrastructure that will enable playground support in #169. An actual playground will probably follow soon after.

Nice!! A playground will really enhance what you already have I think.

AST: So Langium's (like Xtext) main use case is to build either a code generator or an interpreter based on the given grammar. For that, it is absolutely vital to have a cleanly typed AST which can be implemented directly in the grammar without a lot of overhead, except for the ones you noticed, like assignments and so on. Also the AST is really helpful implementing a lot of LSP functionalities.

By cleanly typed, do you mean an object with a type property, or an actual TS type?
To take the previous Nearley example, AST generation ({% d => AST building %}) are still coupled to the grammar definition (rule: subrule1 subrule2) -- it has to be, but the AST part is separated from the grammar definition part. The convention there is that d[0] will have subrule1 AST, and d[1] will have subrule2. And AST building is the body of any JS function, for instance d => {type: rule, info: f(d[1], d[2])}
even without separating AST generation from grammar definition, it would still be nicer to make the name=ID side of things optional, so users can write just ID, check that the grammar is working, then add the name information. That could mean two modes (say strict mode, and loose mode)? Just thinking out loud.

Haven't created interpreters but I did implement code generators, arguably for simple languages. I haven't met the need for TS types. Now that I think about it, it may not have been obvious, but I was thinking about TS types in my comment because while reading the Xtext language docs, and I disconnected at some point around the unassigned rule calls (AbstractToken: TokenA | TokenB | TokenC;, etc.). I guess I still don't understand the Xtext grammar well enough.

unassigned rule calls, actions, and other advanced features deserve proper introduction and justification (what problem do they solve, when do I need them) in (advanced) tutorials

Also another thing I noticed was your ID terminal definition. terminal ID: /[^0-9 \,][\w]+/; looks quite curious to me, since you could just use terminal ID: /[a-zA-Z][\w]*/; which would make adding other symbols a lot easier.

yeah that is the result of making small modifications and checking stuff work (bottom up iterative approach) vs. thinking about a proper model and writing it down (top down). Top-down works better for me when I master the concepts, otherwise in presence of uncertainty bottom-up allows to back-track and come back to a working state in case of errors. Refactoring the language is always possible at a later point.

The second point is that what I actually want to parse is any string, not just a-zA-Z which is biased towards western alphabets. This diet language could be used by chinese, japanese or indian users, so the characters they use in their language should be legit. So excluding the minimal set of characters rather than including a specified set of characters works a bit better? The best anyways would be to do it properly (say, reuse the definition of identifiers in the JS grammar) but the focus here was to get the grammar to work.

msujew Jul 10, 2021
Maintainer

By cleanly typed, do you mean an object with a type property, or an actual TS type?

Actually both. I'm probably heavily biased towards strictly typed objects since my background is mainly C# and Java, but we felt the need to easily distinguish between objects parsed by different parser rules - that's why in conjunction with the actual AST types we also generate is<type> methods. That's the rationale behind the $type property on all AST nodes. On top of that, having interfaces for each parser rule allows for IDE-assisted access and validation.

I was thinking about TS types in my comment because while reading the Xtext language docs, and I disconnected at some point around the unassigned rule calls

Quite understandable, unassigned rule calls and actions are probably the most unintuitive part of the Xtext grammar - and sadly ours too, since they are necessary for LL(*) parsers. There is no different way that I know of to correctly parse left recursive grammars without them with an LL parser.

AbstractToken: TokenA | TokenB | TokenC;

Quick note: The parser code behind that is quite straight-forward, as it will just parse either ones of the Token? rules here. However, the AST it will create is a bit more complex, as the types build by TokenA, TokenB and TokenC will inherit from AbstractToken. In addition, the type collector will try to lift common properties between the Token? types into their common super type AbstractToken where possible.

msujew Jul 10, 2021
Maintainer

On another note: I just implemented a watch mode for the langium generate command :) 26c666d

brucou Jul 12, 2021
Author

nice stuff!

brucou · 2021-07-15T21:43:10Z

brucou
Jul 15, 2021
Author

This follows #153 (comment)

Here we try to correct one issue we encountered before. We had to use this rule:

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' category=[Food_Category] ')' 
    'provides' cal=INT 'kcal,' 'with' 'the' 'following' 'nutrients:';

instead of that one:

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' category=[Food_Category] ')' 
    'provides' cal=INT 'kcal with the following nutrients:';

because of the error Keywords should not contain whitespace characters.

We try to create a Kcal rule to hold the string, but the same error appears (makes sense, this is just another rule). So the only option here is to encode that string as a terminal rule, but then we will run again in the same errors due to ambiguous terminal rules...

So actually no real solution here.

3 replies

msujew Jul 16, 2021
Maintainer

What stops you from using 'kcal,' 'with' 'the' 'following' 'nutrients:' as keywords directly? Also, just wanted to say, that this isn't necessary an error, but a warning. Langium should still be able to handle the keyword with whitespaces, but it's generally not a good idea to make a language whitespace sensitive. (Looking at you Python, Yaml)

brucou Jul 16, 2021
Author

What stops you from using 'kcal,' 'with' 'the' 'following' 'nutrients:' as keywords directly?

It is better UX if the completion in VS is the full sentence rather than one word one after another.

Ok, let me try then with 'kcal with the following nutrients:':

Nice nudge to update versions:

Writing textmate grammars to 'syntaxes/dieta.tmLanguage.json'
npm notice
npm notice New minor version of npm available! 7.5.2 -> 7.20.0
npm notice Changelog: https://github.com/npm/cli/releases/tag/v7.20.0
npm notice Run npm install -g [email protected] to update!
npm notice

Updating version:

PS  npm install -g [email protected]
npm ERR! code EPERM
npm ERR! syscall rename
npm ERR! path C:\Users\...\AppData\Roaming\npm\node_modules\npm\node_modules\asynckit\lib
npm ERR! dest C:\Users\...\AppData\Roaming\npm\node_modules\.npm-PgA1VV8A\node_modules\asynckit\lib
npm ERR! errno -4048
npm ERR! Error: EPERM: operation not permitted, rename 'C:\Users\Bruno Couriol\AppData\Roaming\npm\node_modules\npm\node_modules\asynckit\lib' -> 
'C:\Users\...\AppData\Roaming\npm\node_modules\.npm-PgA1VV8A\node_modules\asynckit\lib'
npm ERR!  [Error: EPERM: operation not permitted, rename 'C:\Users\...\AppData\Roaming\npm\node_modules\npm\node_modules\asynckit\lib' -> 'C:\Users\...\AppData\Roaming\npm\node_modules\.npm-PgA1VV8A\node_modules\asynckit\lib'] {
npm ERR!   errno: -4048,
npm ERR!   code: 'EPERM',
npm ERR!   syscall: 'rename',
npm ERR!   path: 'C:\\Users\\...\\AppData\\Roaming\\npm\\node_modules\\npm\\node_modules\\asynckit\\lib',
npm ERR!   dest: 'C:\\Users\\...\\AppData\\Roaming\\npm\\node_modules\\.npm-PgA1VV8A\\node_modules\\asynckit\\lib'
npm ERR! }
npm ERR!
npm ERR! The operation was rejected by your operating system.
npm ERR! It's possible that the file was already in use (by a text editor or antivirus),
npm ERR! or that you lack permissions to access it.
npm ERR!
npm ERR! If you believe this might be a permissions issue, please double-check the
npm ERR! permissions of the file and its containing directories, or try running
npm ERR! the command again as root/Administrator.

npm ERR! A complete log of this run can be found in:
npm ERR!

Bad luck on this one. Just in case trying a second time:

npm ERR! code EPERM
npm ERR! syscall mkdir
npm ERR! path C:\Program Files\nodejs\node_modules\.npm-QXjMEw5N
npm ERR! errno -4048
npm ERR! Error: EPERM: operation not permitted, mkdir 'C:\Program Files\nodejs\node_modules\.npm-QXjMEw5N'
npm ERR!  [Error: EPERM: operation not permitted, mkdir 'C:\Program Files\nodejs\node_modules\.npm-QXjMEw5N'] {
npm ERR!   errno: -4048,
npm ERR!   code: 'EPERM',
npm ERR!   syscall: 'mkdir',
npm ERR!   path: 'C:\\Program Files\\nodejs\\node_modules\\.npm-QXjMEw5N'
npm ERR! }
npm ERR!
npm ERR! The operation was rejected by your operating system.
npm ERR! It's possible that the file was already in use (by a text editor or antivirus),
npm ERR! or that you lack permissions to access it.
npm ERR!
npm ERR! If you believe this might be a permissions issue, please double-check the
npm ERR! permissions of the file and its containing directories, or try running
npm ERR! the command again as root/Administrator.

npm ERR! A complete log of this run can be found in:
npm ERR!     C:\Users\...\AppData\Local\npm-cache\_logs\2021-07-16T13_32_30_452Z-debug.log

Thats better but still not working. Trying a third time :-) Same exact error. Ok will have a look later.

msujew Jul 16, 2021
Maintainer

It is better UX if the completion in VS is the full sentence rather than one word one after another.

Ah, I see where you're coming from now :) For that you would usually implement a custom completion provider, which would look like this:

export class CustomCompletionProvider extends DefaultCompletionProvider {
    protected completionForKeyword(keyword: ast.Keyword, context: AstNode | undefined, acceptor: CompletionAcceptor): void {
        if (keyword.value === 'kcal,') {
            acceptor('kcal, with the following nutrients:', { kind: CompletionItemKind.Keyword, detail: 'Keyword', sortText: 1 });
        } else {
            super.completionForKeyword(keyword, context, acceptor);
        }
    }
}

Langium will then automatically try to suggest the whole string, instead of just a single word, without compromising the parser.

Though, I guess this is a bit too much for now ;)

brucou · 2021-08-02T20:01:54Z

brucou
Aug 2, 2021
Author

This continues #153 (comment)

In the previous session, I tried to install npm 7.20, that failed three times. I did not try again and just continued as if nothing happened. In the terminal npm -v tells me that I am using 7.20 already anyways...

So where were we? We wanted to add the list of nutrients, for instance:

100g of Dr Oetker Spinaci Ristorante Pizza (solid, ?) provides 220 kcal, with the following nutrients:
- proteines: 7.1 g
- glucides: 21g
- lipides: 11g
- fiber: 1.9  g
- sodium:0.27g
- cholesterol: 0
- calcium: ? mg
- fer: ?
- magnesium: ?

Ok that should not be too hard, provided that we do not declare - as a token :-) That is about the right time to switch to the actual English spelling (protein, carbohydrate, fat, fibre). For some reason, on labels, this is not in the plural form?

A first issue we can think of is that it would be very nice to have the - (dash) at the beginning of a line. But well it is non-breaking for now. Some spaces before the - is fine. What IS important is that there be a line return after the previous nutrients: on the previous line...

So how is the man to do? We could change the WS token to include spaces except line returns. WS is defined in MDN as [ \f\n\r\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff], so we will define spaces as [ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]. Let's try that:

terminal WS: /[ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;

Compilation is fine:

npm run langium;generate
npm run build

and... not much changes. So now, spaces will not be used as relevant for parsing. But the end of line will. That means we need to add it to our parsing rules:

terminal EOL: /\n/;

and

Mode:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' Food_Item '(' type=[Food_Type] ',' category=[Food_Category] ')' 
    'provides' cal=INT 'kcal with the following nutrients:' EOL;

Well, we don't seem to have a problem here (and we should):

Maybe that's due to the '... ...' kind of illegal syntax? Let's try to insert a break somewhere else:

Oh but all hell broke loose:

[Error - 21:47:54] Request textDocument/documentHighlight failed.
  Message: Request textDocument/documentHighlight failed with message: Food_Item:category is not a valid reference id.
  Code: -32603

Let's ignore that and add a line break before provides:

Parser exception thrown! MismatchedTokenException: Expecting --> 'kcal with the following nutrients:' <-- but found --> 'kcal' <--
    at ChevrotainWrapper.consumeInternalError

Perfect. The line breaks are not detected in long strings '... ...' but otherwise we are good. Well, except:

Error: Food_Item:type is not a valid reference id.
    at DietaAstReflection.getReferenceType (c:\Users\Bruno Couriol\WebstormProjects\torrents\dieta\out\language-server\main.js:23619:23)

Not sure what's going here.

NOTE TO SELF: pay attention to what is in this OUTPUT tab. I never know if errors there just happened, or it is just that I was not watching it before.

In any case I don't have a clue of why anything of any kind is not a valid reference ID. Let's have a look at a valid grammar from the example, see if we don't get some inspiration from there:

Statemachine :
	{Statemachine}
	('events' events+=Event+ 'end')?
	('resetEvents' resetEvents+=[Event]+ 'end')?
	('commands'	commands+=Command+ 'end')?
	states+=State*;

Event:
	name=ID code=ID;

So having a look at [Event] here, we do not seem to do anything different, except that we don't need an array here. There is only one food category to declare. So yeah no clue.

@msujew help!

8 replies

brucou Aug 2, 2021
Author

I see. But this ... type=[Type] ... followed by Type: 'type' name=ID; is not really intuitive... Type as a parsing rule is understandable. But then [Type] as reference rule is confusing: the 'type' is magically not a part of it. I would have never have guessed that by myself through trial and errors. Is that because Type is also well, a type? I think this may be the difficult part once again, to have to think on two levels at the same time, and to have to get both right at the same time. In fact, I am not even sure if what I am saying makes any sense.

oooh actually there is some information there: https://www.eclipse.org/Xtext/documentation/301_grammarlanguage.html#cross-references

Ok, on the bright side, this is like a programming language right. Like when you try to use a variable before declaration for instance. There could maybe be a way to catch those mistakes. That is detect that in a parsing rule that includes a reference, there is no parent parsing rule that references the reference (or sth like that, I probably explained that wrong).

On the second bright side, given enough examples (both working and failing examples), a programmer can probably infer some rules that allows him to get things right. Theoretically there is an infinite number of ways to define a syntax for a language but in practice we use a fairly limited and repetitive set of patterns. The 80/20 pareto rules works well here.

On the third bright side :-) LSP for free is still worth the trouble.

Another source of confusion:

events+=Event+ (cf. state machine grammar) i.e. both have a + while types+=Type only types has a +
- I guess that the + does not have the same meaning on the left and on the right side?

That begs the question, what would that do:

Model:
    (items+=Item+ | types+=Type+ | categories+=Category)*;

like I have several items (Item+ regex-like expression) several times (the *). How does this accumulate in items? Is that an array of arrays? or a flattened array?

msujew Aug 3, 2021
Maintainer

I guess that the + does not have the same meaning on the left and on the right side?

Right. Basically the += operator on assignments is an indicator that the following element (in this case Event) will be assigned to an array, instead of directly assigning the property. This also influences the generated AST, as events will be of type Array<Event> instead of Event. On the other hand, the + on the right side only sets the parser cardinality of the abstract expression that precedes it. You can also just write events+=Event without the + at the end, but then the parser only parses a single event, without the ability to add multiple Event elements. However, it is still added to the array of events, even though it is only a single element. This starts making way more sense once you think of it in terms of a normal programming language. How do we expess function parameters for example?

FunctionSignature: 'function' name=ID '(' (parameters+=Parameter (',' parameters+=Parameter)*)? ')';

Here it's paramount to be able to add single a Parameter to a list of them first, and then afterwards have a repeating group of them.

like I have several items (Item+ regex-like expression) several times (the *). How does this accumulate in items? Is that an array of arrays? or a flattened array?

Great question! With the stuff I said above, the logical conclusion is that everything is flattened, since the * and + are only parser instructions and completely disconnected from the AST elements. Also, funnily enough, everything still parses exactly the same as without the +, as you are allowed to - for example - follow a Type with another Type in your document in both grammars.

msujew Aug 3, 2021
Maintainer

That is detect that in a parsing rule that includes a reference, there is no parent parsing rule that references the reference.

I guess that makes sense. As soon as you use a cross reference, that type has to instantiated somewhere. I played around with adding unused markers for terminal/parser rules that are never referenced anywhere (and therefore unparsable). In the same vein it would make sense to add another validation for cross reference that could never resolve.

brucou Aug 3, 2021
Author

Ok I think I get it now. One thing describes parsing information (CST?), the other thing the actions to perform when traversing the parse tree (with as output an AST?). I can already view some kind of animated playground that simulates the parser as it goes through an input text. Not talking about a CLI (too much work) but it should be possible to do it manually for a specific language, and a few carefully chosen inputs as a learning support.

Some Tab up:
Column 0: definition language
Column 1: CST
Column 2: AST

Another Tab down:
Enter input sample textarea
Run button

That should help support or create an accurate mental model faster.

msujew Aug 5, 2021
Maintainer

I can already view some kind of animated playground that simulates the parser as it goes through an input text

I like that idea. I think we can actually generalize this: Hovering over a token of the input text highlights the related parser rule and appropriate terminal/keyword/assignment/whatever in the grammar editor. That should be easily possible (from a framework perspective), because every parsed CST node contains its parsing feature - the exact grammar element that was used to parse it. And this grammar element in turn is also just a Langium AST node, and therefore also contains CST location info (an advantage of Langium bootstrapping itself).

brucou · 2021-08-07T22:24:55Z

brucou
Aug 7, 2021
Author

This continues #153 (comment)

I adjusted the grammar to be able to parse the end of lines correctly. The final grammar is as follows:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    (items+=Item | types+=Type | categories+=Category | EOL)*;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL?)*;

Type: 'type' name=ID;
Category: 'category' name=ID;
Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');

terminal WS: /[ \f\n\r\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\/\/[^\n\r]*/;

NOTE TO SELF: handling manually the end of lines requires some discipline. The rule here is to add the EOL token as high/early as possible in the hierarchy of rules.

0 replies

brucou · 2021-08-07T22:50:09Z

brucou
Aug 7, 2021
Author

This continues #153 (comment).

Let's add comments to the language (comments start with --). We modify SL_COMMENT:

terminal SL_COMMENT: /--[^\n\r]*/;

That of course does not work!

Wait, actually I don't know why it does not work. Let's investigate:

So we still have some issues to handle with the EOL. No, it is just the error message from VS Code that leads us to the wrong path:

NOTE TO SELF: what to do about these errors? They are obviously nice, but how to check them so they are not misleading?? Will we have to write custom processing for everything LSP? That decreases the advantage precisely of not having to write the LSP ourselves. To think about.

Anyways, we don't have our comments working. Let's go back to the double /. That works:

So what is the issue here? Why substituting the / for a - has this not working? Let's try another character, say *. That works:

So the problem really is with the character -? Is that linked to some possible confusion between -- and the - in the Item rule? Ok last attempt, we escape the -.

terminal SL_COMMENT: /\-\-[^\n\r]*/;

We get:

Still failing...

@msujew what do I do wrong here?

Ok I'll try to not use a terminal tokens but add a comment rule to the grammar:

grammar Dieta
hidden(WS, SL_COMMENT, ML_COMMENT)

Model:
    (items+=Item | types+=Type | categories+=Category | Comment | EOL)*;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Type: 'type' name=ID;
Category: 'category' name=ID;
Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' STRING*;

terminal WS: /[ \f\n\r\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;
terminal SL_COMMENT: /\-\-[^\n\r]*/;

We get:

?? Let's remove the SL_COMMENT terminal rule. Ah no maybe the issue was with STRING. So let's use ID instead.

Comment returns string: '--' ID*;

That works, except the EOL:

So let's do

Comment returns string: '--' ID* EOL;

Not sure I understand this. I should not have to add the EOL rule here, it should be picked by the top-level Model rule right?

Anyways, we make some progress, it almost works:

Once again remaining issue with EOL. So now I do:

Comment returns string: '--' ID* EOL+;

And that works!!

Final grammar:

grammar Dieta
hidden(WS, ML_COMMENT)

Model:
    (items+=Item | types+=Type | categories+=Category | Comment | EOL)*;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Type: 'type' name=ID;
Category: 'category' name=ID;
Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' ID* EOL+;

terminal WS: /[ \f\n\r\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;

1 reply

msujew Aug 8, 2021
Maintainer

Great, that looks really good! The reason why the comment terminal starting with -- did not work as expected, was because in the released 0.1.0 version, terminals cannot start with the same characters as keywords (except for ID). Basically, Chevrotain always takes the first fit for a token. However, there is a concept of a longer_alt token, which is basically just a longer, alternative token. It took me a while to figure out how to fix this (basically, we do a lot of regex manipulation), but on the current main, terminals are correctly put as the longer_alt of short keywords like -.

brucou · 2021-09-16T18:42:21Z

brucou
Sep 16, 2021
Author

This continues #153 (comment)

This time we want to add the ability for user to define food categories, food types, and different kind of meals. For instance:

Meals:
- breakfast
- lunch
- dinner

Categories:
- Fruit
- Vegetable
- Legume
- Sweet
- Fish 
- Meat
- Milk product

Types:
- Solid
- Liquid
- Liquid (cream-like)
- Liquid (sauce-like)
- Liquid (oil-like)
- Liquid (water-like)

As a reminder, the current state of our grammar is as follows:

grammar Dieta
hidden(WS, ML_COMMENT)

Model:
    (items+=Item | types+=Type | categories+=Category | Comment | EOL)*;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Type: 'type' name=ID;
Category: 'category' name=ID;
Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' ID* EOL+;

terminal WS: /[ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;

We are first going to add meals:

grammar Dieta
hidden(WS, ML_COMMENT)

Model:
    (items+=Item | Meals | types+=Type | categories+=Category | Comment | EOL)*;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Meals: 'Meals:' EOL meals+=Meal+;
Meal: '-' name=ID EOL;

Type: 'type' name=ID;
Category: 'category' name=ID;
Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' ID* EOL+;

terminal WS: /[ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;

Works like a charm:

After some reorganization, we try to add the food categories. It will interesting to see if we can correctly use the cross-reference capability of Langium:

grammar Dieta
hidden(WS, ML_COMMENT)

Model:
    (Meals | Categories | types+=Type | items+=Item | Comment | EOL)*;

Meals: 'Meals:' EOL meals+=Meal+;
Meal: '-' name=ID EOL;

Categories: 'Categories:' EOL categories+=Category+;
Category: '-' name=ID EOL;

Type: 'type' name=ID;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' ID* EOL+;

terminal WS: /[ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;

Again, works like a charm:

So let's move with food types:

grammar Dieta
hidden(WS, ML_COMMENT)

Model:
    (Meals | Categories | Types | items+=Item | Comment | EOL)*;

Meals: 'Meals:' EOL meals+=Meal+;
Meal: '-' name=ID EOL;

Categories: 'Categories:' EOL categories+=Category+;
Category: '-' name=ID EOL;

Types: 'Types:' EOL types+=Type+;
Type: '-' name=ID EOL;

Item:
    quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl') 'of' name=STRING '(' type=[Type] ',' category=[Category] ')' 
    'provides' cal=INT 'kcal' ',' 'with' 'the' 'following' 'nutrients:'  EOL (nutrients+=Nutrient EOL)*;

Nutrient: '-' name=ID ':' quantity=INT unit=('g' | 'mg' | 'ml' | 'cl' | 'dl');
Comment returns string: '--' ID* EOL+;

terminal WS: /[ \f\t\v\u00a0\u1680\u2000-\u200a\u2028\u2029\u202f\u205f\u3000\ufeff]+/;
terminal EOL: /\r?\n/;
terminal ID: /[_a-zA-Z][\w_]*/;
terminal INT returns number: /[0-9]+(\.[0-9]+)?/;
terminal STRING: /"[^"]*"|'[^']*'/;

terminal ML_COMMENT: /\/\*[\s\S]*?\*\//;

Well, the food type parsing itself work, the problem seems to be with parsing comments:

vs.

Leaving that for another day.

0 replies

Example DSL #153

brucou Jul 1, 2021

Objectives

Language goals

Language definition

Aliments and their nutrition profile

Daily food intake

User configuration

Replies: 16 comments · 32 replies

brucou Jul 1, 2021 Author

brucou Jul 1, 2021 Author

Xtext documentation

Examples

msujew Jul 1, 2021 Maintainer

brucou Jul 3, 2021 Author

brucou Jul 3, 2021 Author

Learning Xtext

brucou Jul 5, 2021 Author

brucou Jul 5, 2021 Author

msujew Jul 5, 2021 Maintainer

brucou Jul 8, 2021 Author

brucou Jul 8, 2021 Author

brucou Jul 8, 2021 Author

msujew Jul 8, 2021 Maintainer

brucou Jul 8, 2021 Author

msujew Jul 8, 2021 Maintainer

brucou Jul 8, 2021 Author

brucou Jul 8, 2021 Author

msujew Jul 8, 2021 Maintainer

brucou Jul 8, 2021 Author

msujew Jul 8, 2021 Maintainer

brucou Jul 8, 2021 Author

brucou Jul 9, 2021 Author

brucou Jul 9, 2021 Author

brucou Jul 9, 2021 Author

brucou Jul 9, 2021 Author

brucou Jul 10, 2021 Author

Conceptual framework

Self-learning and discovery

Information architecture

Conceptual guides

API reference

FAQ/tips

A few ideas

Personal opinion

msujew Jul 10, 2021 Maintainer

brucou Jul 10, 2021 Author

msujew Jul 10, 2021 Maintainer

msujew Jul 10, 2021 Maintainer

brucou Jul 12, 2021 Author

brucou Jul 15, 2021 Author

msujew Jul 16, 2021 Maintainer

brucou Jul 16, 2021 Author

msujew Jul 16, 2021 Maintainer

brucou Aug 2, 2021 Author

brucou Aug 2, 2021 Author

msujew Aug 3, 2021 Maintainer

msujew Aug 3, 2021 Maintainer

brucou Aug 3, 2021 Author

msujew Aug 5, 2021 Maintainer

brucou Aug 7, 2021 Author

brucou Aug 7, 2021 Author

msujew Aug 8, 2021 Maintainer

brucou Sep 16, 2021 Author

brucou
Jul 1, 2021

Replies: 16 comments 32 replies

brucou
Jul 1, 2021
Author

brucou
Jul 1, 2021
Author

msujew Jul 1, 2021
Maintainer

brucou Jul 3, 2021
Author

brucou
Jul 3, 2021
Author

brucou
Jul 5, 2021
Author

brucou Jul 5, 2021
Author

msujew Jul 5, 2021
Maintainer

brucou Jul 8, 2021
Author

brucou
Jul 8, 2021
Author

brucou Jul 8, 2021
Author

msujew Jul 8, 2021
Maintainer

brucou Jul 8, 2021
Author

msujew Jul 8, 2021
Maintainer

brucou
Jul 8, 2021
Author

brucou
Jul 8, 2021
Author

msujew Jul 8, 2021
Maintainer

brucou Jul 8, 2021
Author

msujew Jul 8, 2021
Maintainer

brucou Jul 8, 2021
Author

brucou
Jul 9, 2021
Author

brucou Jul 9, 2021
Author

brucou
Jul 9, 2021
Author

brucou
Jul 9, 2021
Author

brucou
Jul 10, 2021
Author

msujew Jul 10, 2021
Maintainer

brucou Jul 10, 2021
Author

msujew Jul 10, 2021
Maintainer

msujew Jul 10, 2021
Maintainer

brucou Jul 12, 2021
Author

brucou
Jul 15, 2021
Author

msujew Jul 16, 2021
Maintainer

brucou Jul 16, 2021
Author

msujew Jul 16, 2021
Maintainer

brucou
Aug 2, 2021
Author

brucou Aug 2, 2021
Author

msujew Aug 3, 2021
Maintainer

msujew Aug 3, 2021
Maintainer

brucou Aug 3, 2021
Author

msujew Aug 5, 2021
Maintainer

brucou
Aug 7, 2021
Author

brucou
Aug 7, 2021
Author

msujew Aug 8, 2021
Maintainer

brucou
Sep 16, 2021
Author