Where does Shogun suck in benchmarks? #4097
Comments
I would say a good place to start is to check out the benchmarking system, run it on a small dataset, and then produce an SQL query on the results that gives the desired information. After that we can figure out the right way to put this together into something that can be merged.
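As a rough illustration of the kind of SQL query meant here, the sketch below lists the cases where one library is slower than the fastest library for the same method and dataset. The `results` table, its columns, and the rows are all hypothetical placeholders; the real schema of the benchmark database is not shown in this thread, so the query would need to be adapted.

```python
import sqlite3

# Hypothetical schema and made-up rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE results (library TEXT, method TEXT, dataset TEXT, runtime REAL)"
)
conn.executemany(
    "INSERT INTO results VALUES (?, ?, ?, ?)",
    [
        ("shogun", "KMEANS", "iris", 4.0),
        ("mlpack", "KMEANS", "iris", 1.0),
        ("scikit", "KMEANS", "iris", 2.0),
        ("shogun", "PCA", "iris", 0.5),
        ("mlpack", "PCA", "iris", 0.8),
    ],
)

# For each (method, dataset), compare the library's runtime to the best runtime.
QUERY = """
SELECT r.method, r.dataset, r.runtime, best.min_runtime,
       r.runtime / best.min_runtime AS slowdown
FROM results AS r
JOIN (
    SELECT method, dataset, MIN(runtime) AS min_runtime
    FROM results
    GROUP BY method, dataset
) AS best
  ON best.method = r.method AND best.dataset = r.dataset
WHERE r.library = ? AND r.runtime > best.min_runtime
ORDER BY slowdown DESC
"""


def slow_cases(connection, library):
    """Return rows (method, dataset, runtime, best_runtime, slowdown)
    for every case where `library` is not the fastest."""
    return connection.execute(QUERY, (library,)).fetchall()


for row in slow_cases(conn, "shogun"):
    print(row)
```

Sorting by the slowdown ratio puts the worst cases first, which is one possible way to pick candidates for further analysis.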
If somebody would like to work on the latest results, we can also provide an SQL dump.
Hi, I've figured out a way to get the results through the mysql_wrapper.php.
And
Returned JSON:
Nice solution, thanks for the input.
Hi @zoq, I'm a bit concerned about the API, though. With this interface, anyone could send malicious queries directly to the DB. I personally think we should find a better way to extract the results and also make the API more restrictive. One way might be to generate the benchmark results offline and write them out as JSON files. The frontend would then just read those JSON files.
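The offline export suggested above could look roughly like the following sketch. The function name `export_results`, the row layout, and the one-file-per-method naming are assumptions made for illustration; nothing in this thread specifies how the files should be organized.

```python
import json
import tempfile
from collections import defaultdict
from pathlib import Path


def export_results(rows, out_dir):
    """Write one JSON file per method and return the written paths.

    `rows` is an iterable of (library, method, dataset, runtime) tuples,
    a hypothetical layout chosen for this sketch.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # Group the flat rows by method so the frontend can load one file per plot.
    by_method = defaultdict(list)
    for library, method, dataset, runtime in rows:
        by_method[method].append(
            {"library": library, "dataset": dataset, "runtime": runtime}
        )

    written = []
    for method, entries in sorted(by_method.items()):
        path = out_dir / f"{method.lower()}.json"
        path.write_text(json.dumps(entries, indent=2, sort_keys=True))
        written.append(path)
    return written


# Usage with made-up placeholder rows.
with tempfile.TemporaryDirectory() as tmp:
    paths = export_results(
        [("shogun", "KMEANS", "iris", 4.0), ("mlpack", "KMEANS", "iris", 1.0)],
        tmp,
    )
    print([p.name for p in paths])
```

With static files like these, the frontend never talks to the database, so there is no query endpoint left to abuse.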
Actually, I was surprised you could send a query; only a specific IP should be able to do that. Do you think that could be sufficient?
I was surprised too! Can you elaborate a bit on your approach? If I understood correctly, you would allow only certain IPs to send the POST request to the server. If that's the case, the benchmark page will be functional only for those IPs, right?
Right, only the build/webserver would be able to send the POST. I think this is already the case, but since a user is able to run the PHP script, the request still comes from the correct IP, so I guess we could adapt the script.
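For reference, an IP allowlist check of the kind discussed here can be sketched as below. The address is a placeholder from the documentation range, and `is_allowed` is a hypothetical helper, not part of the existing wrapper. As the comment above points out, such a check does not help if the request is issued by a script that already runs on an allowed host.

```python
import ipaddress

# Placeholder address from the reserved documentation range (192.0.2.0/24);
# the real build/webserver address is not given in this thread.
ALLOWED_NETWORKS = [ipaddress.ip_network("192.0.2.10/32")]


def is_allowed(remote_addr, allowed=ALLOWED_NETWORKS):
    """Return True if `remote_addr` parses as an IP inside an allowed network."""
    try:
        addr = ipaddress.ip_address(remote_addr)
    except ValueError:
        # Malformed addresses are rejected rather than raising.
        return False
    return any(addr in network for network in allowed)


print(is_allowed("192.0.2.10"))
print(is_allowed("198.51.100.7"))
```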
Given the command I provided previously and current coverage in #4046 (comment), I think we basically have everything we need for this issue. What else should we do for this issue?
But some Shogun algorithms are not covered, so how can we know how they perform?
It would be interesting to see in which cases Shogun is leading and in which it is behind. The second case could be a good starting point for further analysis.
I've also created an issue at mlpack's benchmarks (mlpack/benchmarks#133). It seems Shogun has a DTC algorithm, but the benchmark report doesn't have the result. @zoq do you have any idea why that happens?
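Gaps like the missing DTC result could be spotted mechanically by comparing what is configured to run with what actually shows up in the report. The sketch below assumes both sides are available as (library, method) pairs; the names and sample pairs are hypothetical placeholders, not taken from the real benchmark configuration.

```python
def missing_results(configured, reported):
    """Return the sorted (library, method) pairs that are configured
    to run but have no entry in the reported results."""
    return sorted(set(configured) - set(reported))


# Made-up example pairs for illustration.
configured = [("shogun", "DTC"), ("shogun", "KMEANS"), ("mlpack", "KMEANS")]
reported = [("shogun", "KMEANS"), ("mlpack", "KMEANS")]

print(missing_results(configured, reported))
```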
This task is to find a systematic way to figure out the cases for which a library in the automated benchmarking system performs badly.
Contact @zoq and @rcurtin, who have ideas on how to do that.