-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove device_code_agx
#512
Open
christiangnrd
wants to merge
2
commits into
main
Choose a base branch
from
nopython
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Metal Benchmarks
Benchmark suite | Current: 095323f | Previous: 6a760a6 | Ratio |
---|---|---|---|
private array/construct |
26815.916666666664 ns |
27270.785714285714 ns |
0.98 |
private array/broadcast |
458709 ns |
460209 ns |
1.00 |
private array/random/randn/Float32 |
789312 ns |
804875 ns |
0.98 |
private array/random/randn!/Float32 |
646334 ns |
646333 ns |
1.00 |
private array/random/rand!/Int64 |
556167 ns |
548916 ns |
1.01 |
private array/random/rand!/Float32 |
583791 ns |
587375 ns |
0.99 |
private array/random/rand/Int64 |
773083.5 ns |
716083.5 ns |
1.08 |
private array/random/rand/Float32 |
602250 ns |
615791.5 ns |
0.98 |
private array/copyto!/gpu_to_gpu |
589417 ns |
677125 ns |
0.87 |
private array/copyto!/cpu_to_gpu |
828125 ns |
640688 ns |
1.29 |
private array/copyto!/gpu_to_cpu |
705958 ns |
817625 ns |
0.86 |
private array/accumulate/1d |
1341708 ns |
1329687.5 ns |
1.01 |
private array/accumulate/2d |
1391000 ns |
1382229 ns |
1.01 |
private array/iteration/findall/int |
2099084 ns |
2073708 ns |
1.01 |
private array/iteration/findall/bool |
1825958 ns |
1799041 ns |
1.01 |
private array/iteration/findfirst/int |
1700063 ns |
1688292 ns |
1.01 |
private array/iteration/findfirst/bool |
1665208 ns |
1650020.5 ns |
1.01 |
private array/iteration/scalar |
3870000 ns |
3252542 ns |
1.19 |
private array/iteration/logical |
3184437 ns |
3147375 ns |
1.01 |
private array/iteration/findmin/1d |
1760791.5 ns |
1736042 ns |
1.01 |
private array/iteration/findmin/2d |
1345416.5 ns |
1348917 ns |
1.00 |
private array/reductions/reduce/1d |
1028437.5 ns |
1029542 ns |
1.00 |
private array/reductions/reduce/2d |
654854 ns |
650292 ns |
1.01 |
private array/reductions/mapreduce/1d |
1035979 ns |
1025917 ns |
1.01 |
private array/reductions/mapreduce/2d |
654209 ns |
657229.5 ns |
1.00 |
private array/permutedims/4d |
2562375.5 ns |
2553708 ns |
1.00 |
private array/permutedims/2d |
1010750 ns |
1027750 ns |
0.98 |
private array/permutedims/3d |
1655417 ns |
1585916 ns |
1.04 |
private array/copy |
630749.5 ns |
580417 ns |
1.09 |
latency/precompile |
5743299312.5 ns |
5847134584 ns |
0.98 |
latency/ttfp |
3036520333 ns |
6545482667 ns |
0.46 |
latency/import |
1141152333 ns |
1169724375 ns |
0.98 |
integration/metaldevrt |
724667 ns |
713125 ns |
1.02 |
integration/byval/slices=1 |
1586083.5 ns |
1580770.5 ns |
1.00 |
integration/byval/slices=3 |
10086645.5 ns |
9774042 ns |
1.03 |
integration/byval/reference |
1622792 ns |
1598000 ns |
1.02 |
integration/byval/slices=2 |
2594541 ns |
2571895.5 ns |
1.01 |
kernel/indexing |
452375 ns |
457542 ns |
0.99 |
kernel/indexing_checked |
457084 ns |
458645.5 ns |
1.00 |
kernel/launch |
7875 ns |
8125 ns |
0.97 |
metal/synchronization/stream |
14833 ns |
14209 ns |
1.04 |
metal/synchronization/context |
15208 ns |
15000 ns |
1.01 |
shared array/construct |
25993 ns |
25166.714285714286 ns |
1.03 |
shared array/broadcast |
477416.5 ns |
469917 ns |
1.02 |
shared array/random/randn/Float32 |
816333 ns |
825666 ns |
0.99 |
shared array/random/randn!/Float32 |
651625 ns |
616625 ns |
1.06 |
shared array/random/rand!/Int64 |
555375 ns |
547708 ns |
1.01 |
shared array/random/rand!/Float32 |
586459 ns |
591458 ns |
0.99 |
shared array/random/rand/Int64 |
743917 ns |
734166.5 ns |
1.01 |
shared array/random/rand/Float32 |
603125 ns |
610312 ns |
0.99 |
shared array/copyto!/gpu_to_gpu |
86916 ns |
88000 ns |
0.99 |
shared array/copyto!/cpu_to_gpu |
88083 ns |
86291 ns |
1.02 |
shared array/copyto!/gpu_to_cpu |
77916 ns |
77959 ns |
1.00 |
shared array/accumulate/1d |
1345583 ns |
1336542 ns |
1.01 |
shared array/accumulate/2d |
1389291 ns |
1384250 ns |
1.00 |
shared array/iteration/findall/int |
1824125 ns |
1765020.5 ns |
1.03 |
shared array/iteration/findall/bool |
1594333 ns |
1558812 ns |
1.02 |
shared array/iteration/findfirst/int |
1379834 ns |
1396125.5 ns |
0.99 |
shared array/iteration/findfirst/bool |
1361979.5 ns |
1360145.5 ns |
1.00 |
shared array/iteration/scalar |
160458 ns |
153334 ns |
1.05 |
shared array/iteration/logical |
2975459 ns |
2940000 ns |
1.01 |
shared array/iteration/findmin/1d |
1459750 ns |
1442291 ns |
1.01 |
shared array/iteration/findmin/2d |
1363500 ns |
1358562.5 ns |
1.00 |
shared array/reductions/reduce/1d |
725770.5 ns |
723750 ns |
1.00 |
shared array/reductions/reduce/2d |
659417 ns |
654792 ns |
1.01 |
shared array/reductions/mapreduce/1d |
729854 ns |
728687 ns |
1.00 |
shared array/reductions/mapreduce/2d |
662375 ns |
660125 ns |
1.00 |
shared array/permutedims/4d |
2551187.5 ns |
2522521 ns |
1.01 |
shared array/permutedims/2d |
1025834 ns |
1019375 ns |
1.01 |
shared array/permutedims/3d |
1602749.5 ns |
1579917 ns |
1.01 |
shared array/copy |
237958 ns |
233417 ns |
1.02 |
This comment was automatically generated by workflow using github-action-benchmark.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.