-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
from where and how to start? #28
Comments
@kimgimkigi Dr. Dongsun Kim [email protected] maintains the data collection, I just forward it to him. |
@kimgimkigi "hash" is a commit hash. Please let me know if you have any further questions. |
@darkrsw Thanks! |
vanilla version is not something special.
This is a correct behavior. Since the commit
Again, a vanilla version is not something special. It is just a single snapshot of a project specified by a commit hash.
We have run our tool over all the commits in a project. Thus, no need to save any commit ID after our experiment. Literally, we archived all available versions of a project and applied FindBugs to all of them. |
Hi. Now I'm trying to execute UnfixedAlarmCollector.scala. My target project is "acceptance-test-harness", one of your dataset projects. When I start UnfixedAlarmCollector.scala, it seems like there is no problem on GitProxy and neo4j, I checked
I thought this is related to the below neo4j commands. UnfixedAlarmCollector.scala : Line 58~
Do I have to do something on neo4j or elsewhere before running this script? Thanks! |
Basically, the script assumes that alarm data are already inserted to the DB.
Again, setting up the DB is not enough. You need to put data upfront.
This depends on the summary file instead of the DB. Thus, its result is not
To get some results from the query, you need to run another script collecting alarms. |
Hi! @kimgimkigi I found that you have configured this project when I browsed this issue, and I'm trying to configure this project too. But when I used maven to pack the subproject "parsing-utils", I found that a dependency couldn't be found.
|
Hi @lxyeah I also struggled with that bug. https://mvnrepository.com/artifact/org.eclipse.jface/text/3.3.0-v20070606-0010 I roughly set up findbugs-violation but finally, I couldn't reuse it because of my less understanding of environments. |
Hi @kimgimkigi Thanks a lot! |
Hi! @darkrsw |
I'm not familiar with neo4j database, so I failed to set up from that point. |
@kimgimkigi |
Hi. I have simple question. |
I don't get your question. The CNN does not extract any specific feature but it embeds each patch, not the pattern. @Kui-Liu Can you explain how to build the training data for the CNN model for @kimgimkigi ? |
As I understand, I understand what is the input data of CNN model (Word2vec token vectors). Thanks. |
No. For a single patch, word2vec token vectors of pattern are fed into CNN model to get learned discriminating feature vectors. X-means clustering is used for identifying fix patterns from several common similar patches.
There is no label information for those input data. Basically, the CNN model is for autoencoding and no label information is necessary (i.e., input vector === output vector). |
Hi! @kimgimkigi |
Hi, Thanks for sharing your items.
I'm trying to use your tool with other static analysis tools.
Before using other tools, I wanna execute your tool under your environments and datasets.
I tried to "git clone {project-name}" all of the repositories in "repos.list" and I successfully got 684 projects in 730 projects you mentioned in the paper.
(46 projects are cannot find their repository from remote-server)
Then I guess the start points of your tools are "repo-iterator". So I tried to execute "archive.sh" first.
In "archive.sh" (AchiveCommand.java and GitCommands.scala), There are 3 arguments(gitDir, outputDir, hash) are needed.
I guess gitDir is the path of each project. But I cannot expect what is hash.
I briefly expect that hash means hash ID of project commits but I cannot be sure it is correct and If it is, then how many commit hashes will be used.
Could you please explain what is hash and the expected value of argument hash?
The text was updated successfully, but these errors were encountered: