Remove device_code_agx #512

Open
wants to merge 2 commits into base: main

Conversation

christiangnrd (Contributor)

No description provided.

github-actions bot (Contributor) left a comment

Metal Benchmarks

| Benchmark suite | Current: 095323f | Previous: 6a760a6 | Ratio |
|---|---|---|---|
| private array/construct | 26815.916666666664 ns | 27270.785714285714 ns | 0.98 |
| private array/broadcast | 458709 ns | 460209 ns | 1.00 |
| private array/random/randn/Float32 | 789312 ns | 804875 ns | 0.98 |
| private array/random/randn!/Float32 | 646334 ns | 646333 ns | 1.00 |
| private array/random/rand!/Int64 | 556167 ns | 548916 ns | 1.01 |
| private array/random/rand!/Float32 | 583791 ns | 587375 ns | 0.99 |
| private array/random/rand/Int64 | 773083.5 ns | 716083.5 ns | 1.08 |
| private array/random/rand/Float32 | 602250 ns | 615791.5 ns | 0.98 |
| private array/copyto!/gpu_to_gpu | 589417 ns | 677125 ns | 0.87 |
| private array/copyto!/cpu_to_gpu | 828125 ns | 640688 ns | 1.29 |
| private array/copyto!/gpu_to_cpu | 705958 ns | 817625 ns | 0.86 |
| private array/accumulate/1d | 1341708 ns | 1329687.5 ns | 1.01 |
| private array/accumulate/2d | 1391000 ns | 1382229 ns | 1.01 |
| private array/iteration/findall/int | 2099084 ns | 2073708 ns | 1.01 |
| private array/iteration/findall/bool | 1825958 ns | 1799041 ns | 1.01 |
| private array/iteration/findfirst/int | 1700063 ns | 1688292 ns | 1.01 |
| private array/iteration/findfirst/bool | 1665208 ns | 1650020.5 ns | 1.01 |
| private array/iteration/scalar | 3870000 ns | 3252542 ns | 1.19 |
| private array/iteration/logical | 3184437 ns | 3147375 ns | 1.01 |
| private array/iteration/findmin/1d | 1760791.5 ns | 1736042 ns | 1.01 |
| private array/iteration/findmin/2d | 1345416.5 ns | 1348917 ns | 1.00 |
| private array/reductions/reduce/1d | 1028437.5 ns | 1029542 ns | 1.00 |
| private array/reductions/reduce/2d | 654854 ns | 650292 ns | 1.01 |
| private array/reductions/mapreduce/1d | 1035979 ns | 1025917 ns | 1.01 |
| private array/reductions/mapreduce/2d | 654209 ns | 657229.5 ns | 1.00 |
| private array/permutedims/4d | 2562375.5 ns | 2553708 ns | 1.00 |
| private array/permutedims/2d | 1010750 ns | 1027750 ns | 0.98 |
| private array/permutedims/3d | 1655417 ns | 1585916 ns | 1.04 |
| private array/copy | 630749.5 ns | 580417 ns | 1.09 |
| latency/precompile | 5743299312.5 ns | 5847134584 ns | 0.98 |
| latency/ttfp | 3036520333 ns | 6545482667 ns | 0.46 |
| latency/import | 1141152333 ns | 1169724375 ns | 0.98 |
| integration/metaldevrt | 724667 ns | 713125 ns | 1.02 |
| integration/byval/slices=1 | 1586083.5 ns | 1580770.5 ns | 1.00 |
| integration/byval/slices=3 | 10086645.5 ns | 9774042 ns | 1.03 |
| integration/byval/reference | 1622792 ns | 1598000 ns | 1.02 |
| integration/byval/slices=2 | 2594541 ns | 2571895.5 ns | 1.01 |
| kernel/indexing | 452375 ns | 457542 ns | 0.99 |
| kernel/indexing_checked | 457084 ns | 458645.5 ns | 1.00 |
| kernel/launch | 7875 ns | 8125 ns | 0.97 |
| metal/synchronization/stream | 14833 ns | 14209 ns | 1.04 |
| metal/synchronization/context | 15208 ns | 15000 ns | 1.01 |
| shared array/construct | 25993 ns | 25166.714285714286 ns | 1.03 |
| shared array/broadcast | 477416.5 ns | 469917 ns | 1.02 |
| shared array/random/randn/Float32 | 816333 ns | 825666 ns | 0.99 |
| shared array/random/randn!/Float32 | 651625 ns | 616625 ns | 1.06 |
| shared array/random/rand!/Int64 | 555375 ns | 547708 ns | 1.01 |
| shared array/random/rand!/Float32 | 586459 ns | 591458 ns | 0.99 |
| shared array/random/rand/Int64 | 743917 ns | 734166.5 ns | 1.01 |
| shared array/random/rand/Float32 | 603125 ns | 610312 ns | 0.99 |
| shared array/copyto!/gpu_to_gpu | 86916 ns | 88000 ns | 0.99 |
| shared array/copyto!/cpu_to_gpu | 88083 ns | 86291 ns | 1.02 |
| shared array/copyto!/gpu_to_cpu | 77916 ns | 77959 ns | 1.00 |
| shared array/accumulate/1d | 1345583 ns | 1336542 ns | 1.01 |
| shared array/accumulate/2d | 1389291 ns | 1384250 ns | 1.00 |
| shared array/iteration/findall/int | 1824125 ns | 1765020.5 ns | 1.03 |
| shared array/iteration/findall/bool | 1594333 ns | 1558812 ns | 1.02 |
| shared array/iteration/findfirst/int | 1379834 ns | 1396125.5 ns | 0.99 |
| shared array/iteration/findfirst/bool | 1361979.5 ns | 1360145.5 ns | 1.00 |
| shared array/iteration/scalar | 160458 ns | 153334 ns | 1.05 |
| shared array/iteration/logical | 2975459 ns | 2940000 ns | 1.01 |
| shared array/iteration/findmin/1d | 1459750 ns | 1442291 ns | 1.01 |
| shared array/iteration/findmin/2d | 1363500 ns | 1358562.5 ns | 1.00 |
| shared array/reductions/reduce/1d | 725770.5 ns | 723750 ns | 1.00 |
| shared array/reductions/reduce/2d | 659417 ns | 654792 ns | 1.01 |
| shared array/reductions/mapreduce/1d | 729854 ns | 728687 ns | 1.00 |
| shared array/reductions/mapreduce/2d | 662375 ns | 660125 ns | 1.00 |
| shared array/permutedims/4d | 2551187.5 ns | 2522521 ns | 1.01 |
| shared array/permutedims/2d | 1025834 ns | 1019375 ns | 1.01 |
| shared array/permutedims/3d | 1602749.5 ns | 1579917 ns | 1.01 |
| shared array/copy | 237958 ns | 233417 ns | 1.02 |

This comment was automatically generated by workflow using github-action-benchmark.
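
For readers skimming the benchmark table: the Ratio column is consistent with Current divided by Previous, rounded to two decimals. Both values are times in nanoseconds, so a ratio below 1.00 means the current commit was faster. The sketch below recomputes a few rows from the table. The `ratio` and `flag` helpers and the 10% tolerance are hypothetical, chosen only for illustration; they are not part of github-action-benchmark or this repository.

```python
# Recompute the Ratio column for a few rows copied from the table.
# Assumption: Ratio = Current / Previous, rounded to two decimals.

def ratio(current_ns: float, previous_ns: float) -> float:
    """Return current/previous rounded to two decimals."""
    return round(current_ns / previous_ns, 2)

def flag(r: float, tolerance: float = 0.10) -> str:
    """Label a ratio using a hypothetical +/-10% tolerance band."""
    if r > 1.0 + tolerance:
        return "slower"
    if r < 1.0 - tolerance:
        return "faster"
    return "within tolerance"

# (current ns, previous ns) pairs taken verbatim from the table.
rows = {
    "latency/ttfp": (3036520333, 6545482667),
    "private array/copyto!/cpu_to_gpu": (828125, 640688),
    "private array/iteration/scalar": (3870000, 3252542),
    "kernel/launch": (7875, 8125),
}

for name, (cur, prev) in rows.items():
    r = ratio(cur, prev)
    print(f"{name}: {r:.2f} ({flag(r)})")
```

Single-run timings like these are noisy, so a flagged row is a prompt to rerun the benchmark rather than proof of a regression or speedup.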
