Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Semantic tokenizer improvements: tests, simplification, syntax-in-syntax bugfix #497

Merged
merged 39 commits into from
Nov 8, 2024

Conversation

sungshik
Copy link
Contributor

@sungshik sungshik commented Nov 1, 2024

sungshik and others added 28 commits October 30, 2024 09:55
…-refactor-collector

Simplify class `TokenCollector` to avoid two versions of maximal token logic
…nd make corresponding changes in the Java part)
@sungshik sungshik marked this pull request as ready for review November 6, 2024 15:43
@sungshik sungshik changed the title Semantic tokenizer 2024 (fall) Semantic tokenizer improvements: tests, simplification, syntax-in-syntax bugfix Nov 6, 2024
Copy link
Member

@DavyLandman DavyLandman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, a few small things I noticed.

Copy link

sonarqubecloud bot commented Nov 8, 2024

Quality Gate Failed Quality Gate failed

Failed conditions
4.6% Duplication on New Code (required ≤ 3%)

See analysis details on SonarCloud

Copy link
Member

@DavyLandman DavyLandman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, I also checked with @jurgenvinju and he also approves of the change to the parsing constructor.

@DavyLandman DavyLandman merged commit a23957c into main Nov 8, 2024
12 of 13 checks passed
@sungshik sungshik deleted the semantic-tokenizer-fall2024 branch November 8, 2024 09:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Semantic tokenizer makes mistakes when a syntax tree (with category) has syntax children
2 participants