-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Request programming language support here #1029
Comments
Adding support for HTML would be great, because the HTML judge allows for exercises with different solutions (e.g. add a title (doesn't matter which text), add at least 3 items to a list, ...). |
@BTWS2 there exists a parser for HTML, but it wouldn't work good for plagiarism detection because the parser ignores tag names, attribute names, the exact content itself, ... This is how tree-sitter converts an example HTML page: Especially if the underlying structure of the analyzed HTML files is expected to be very similar, using this parser would result in very high similarities. Using this parser, Dolos reports that the homepage of GitHub and the homepage of Dodona have a similarity of 88%. I would prefer to stick to languages that work good with Dolos. However if you want, you can try it out yourself by installing |
@rien No problem, thank you for the insight. |
@rien can there support simple text? Regular assignments (like essays) is the usecase I'm thinking for this. The tokenizer is as simple as splitting by space? Or sentences split by "." |
@anilgulecha Dolos is specifically made for plagiarism detection on source code. There are tools that should perform better on just text than Dolos. That said, we do indeed have a tokenizer that does split on spaces which you can use by passing |
Thanks for the char recommendation. will try it out. |
I need support for the language Modelica. It is not supported by tree-sitter, but there are two parsers on github: I managed to get them running using tree-sitter directly, but I had no luck adding them to dolos yet. Do you have any tips? I am on Linux if thats important. |
@yafuerst Dolos will try to find the parser with a fitting name ( If you've managed to get them working but if Dolos doesn't work, you can try "installing" the parser fro your user or globally with Let me know if it doesn't work and we'll figure it out. |
what about vue, react? |
@alexey-sh since those languages use multiple languages (template syntax, css, html, ...), tree-sitter does not handle those out-of-the box, so some additional work is required for them to work. In addition, since HTML and CSS often have a lot of common code fragments between submissions, Dolos isn't very good in detecting plagiarism with them (you get a lot of false positives). However, we do plan on changing Dolos under the hood to support these kind of languages in the future! |
Hi, I am running dolos with the following version -
|
@DhruvDh with the way we currently integrate tree-sitter languages, we will have to wait on As an alternative, you can try cloning this repository and updating the base tree-sitter version manually. However, that van be cumbersome. We already have some ideas how to avoid this problem with Dolos in the future (see #1028), however we've not started on the implementation of this solution yet. |
I was able to solve this by the following, I am not confident I understand it correctly, so I won't attempt an explanation.
|
Would you support Verilog? Thanks. |
Hi @nachiket, thanks for your suggestion! There is an official verilog parser available, so this is definitely possible. We'll put it in our schedule and will let you know when support for verilog has landed. |
Would you support Go and Rust? Thanks |
@rhz1949 working on that right now. If everything goes well, support for Go and Rust will be added in the new release I will make today. |
Is it posible to support Ocaml as well? |
@kudhru yes, there seems to be an official Ocaml parser (https://github.com/tree-sitter/tree-sitter-ocaml). It seems they provide three parsers: for implementations (.ml) interfaces (.mli) and types. Will it work for you if we only support the implementations? Because Dolos currently supports single-file submissions anyway. |
Yes, in the programming exam I am conducting, I will have only one But how much time would it take? Or could you tell me if I can do it locally on my machine? As I need to conduct an end-semester exam after 10 days. |
hello, |
@benhadid I cannot finds a specific parser for MIPS / RISC-V, but there seems to be a generic tree-sitter-asm parser, can you check whether this would fit your use-case? You can do so by checking their test cases: https://github.com/RubixDev/tree-sitter-asm/tree/main/test/corpus If not, is there any other parser from tree-sitter's list of parsers that would fit? |
@rien I checked the links you gave me and unfortunately this parser tree-sitter-asm is for the x86 architecture (intel/amd processors). I have googled a bit and found this tree-sitter-riscvasm parser for RISC-V. Can it be used with Dolos ? Thank you |
If you want to use a programming language that Dolos does not support yet, please ask here! This helps us with prioritizing which programming languages we should focus on first.
We currently ship Dolos with the following programming languages:
If your programming language is not in the list of languages supported out-of-the box, there is a high possibility that a tree-sitter parser already exists for that language. If that is the case, it should be easy to add support for your language.
In any case, let us know which languages you want to use with Dolos!
The text was updated successfully, but these errors were encountered: