Conversation
Contributor
There was a problem hiding this comment.
CUDA.jl Benchmarks
Details
| Benchmark suite | Current: 4b2ba62 | Previous: e13541e | Ratio |
|---|---|---|---|
array/accumulate/Float32/1d |
99660 ns |
99639 ns |
1.00 |
array/accumulate/Float32/dims=1 |
75240 ns |
75729 ns |
0.99 |
array/accumulate/Float32/dims=1L |
1586097 ns |
1586365 ns |
1.00 |
array/accumulate/Float32/dims=2 |
140415 ns |
141018 ns |
1.00 |
array/accumulate/Float32/dims=2L |
652370 ns |
653104 ns |
1.00 |
array/accumulate/Int64/1d |
116925 ns |
116680 ns |
1.00 |
array/accumulate/Int64/dims=1 |
78507 ns |
79215 ns |
0.99 |
array/accumulate/Int64/dims=1L |
1699083 ns |
1699113 ns |
1.00 |
array/accumulate/Int64/dims=2 |
150763 ns |
151662 ns |
0.99 |
array/accumulate/Int64/dims=2L |
958820 ns |
958865 ns |
1.00 |
array/broadcast |
18359 ns |
18274 ns |
1.00 |
array/construct |
1208.8 ns |
1219.9 ns |
0.99 |
array/copy |
16591 ns |
16456 ns |
1.01 |
array/copyto!/cpu_to_gpu |
212464 ns |
211722 ns |
1.00 |
array/copyto!/gpu_to_cpu |
276870 ns |
279088 ns |
0.99 |
array/copyto!/gpu_to_gpu |
10413 ns |
10311 ns |
1.01 |
array/iteration/findall/bool |
130909 ns |
131360 ns |
1.00 |
array/iteration/findall/int |
145094 ns |
144792 ns |
1.00 |
array/iteration/findfirst/bool |
68693 ns |
69161 ns |
0.99 |
array/iteration/findfirst/int |
70567 ns |
70977 ns |
0.99 |
array/iteration/findmin/1d |
65284 ns |
65877 ns |
0.99 |
array/iteration/findmin/2d |
100890 ns |
101534 ns |
0.99 |
array/iteration/logical |
188155 ns |
189756 ns |
0.99 |
array/iteration/scalar |
65884 ns |
64275 ns |
1.03 |
array/permutedims/2d |
49565 ns |
49440 ns |
1.00 |
array/permutedims/3d |
50057 ns |
50394 ns |
0.99 |
array/permutedims/4d |
50563 ns |
50213 ns |
1.01 |
array/random/rand/Float32 |
11688 ns |
11532 ns |
1.01 |
array/random/rand/Int64 |
22066 ns |
23488 ns |
0.94 |
array/random/rand!/Float32 |
7976.333333333333 ns |
7911.666666666667 ns |
1.01 |
array/random/rand!/Int64 |
18040 ns |
20553 ns |
0.88 |
array/random/randn/Float32 |
35611 ns |
35800 ns |
0.99 |
array/random/randn!/Float32 |
24193 ns |
23702 ns |
1.02 |
array/reductions/mapreduce/Float32/1d |
33141 ns |
33010 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=1 |
37887 ns |
37873 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=1L |
50336 ns |
50240 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=2 |
55282 ns |
55474 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=2L |
67046 ns |
66984 ns |
1.00 |
array/reductions/mapreduce/Int64/1d |
39291 ns |
39254 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=1 |
41085 ns |
40660 ns |
1.01 |
array/reductions/mapreduce/Int64/dims=1L |
86444 ns |
86434 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=2 |
57607 ns |
57852 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=2L |
82863 ns |
83169 ns |
1.00 |
array/reductions/reduce/Float32/1d |
32977 ns |
33050 ns |
1.00 |
array/reductions/reduce/Float32/dims=1 |
38160 ns |
38122 ns |
1.00 |
array/reductions/reduce/Float32/dims=1L |
50309 ns |
50362 ns |
1.00 |
array/reductions/reduce/Float32/dims=2 |
55266 ns |
55220 ns |
1.00 |
array/reductions/reduce/Float32/dims=2L |
67435 ns |
67510 ns |
1.00 |
array/reductions/reduce/Int64/1d |
39384 ns |
39443 ns |
1.00 |
array/reductions/reduce/Int64/dims=1 |
40630 ns |
40416 ns |
1.01 |
array/reductions/reduce/Int64/dims=1L |
86233 ns |
86281 ns |
1.00 |
array/reductions/reduce/Int64/dims=2 |
57597 ns |
57610 ns |
1.00 |
array/reductions/reduce/Int64/dims=2L |
82676 ns |
82536 ns |
1.00 |
array/reverse/1d |
16785 ns |
16590 ns |
1.01 |
array/reverse/1dL |
67644 ns |
67515 ns |
1.00 |
array/reverse/1dL_inplace |
65272 ns |
65216 ns |
1.00 |
array/reverse/1d_inplace |
8298.333333333334 ns |
8850.666666666666 ns |
0.94 |
array/reverse/2d |
19832 ns |
19411 ns |
1.02 |
array/reverse/2dL |
71469 ns |
71262 ns |
1.00 |
array/reverse/2dL_inplace |
64909 ns |
65007 ns |
1.00 |
array/reverse/2d_inplace |
9581 ns |
10398 ns |
0.92 |
array/sorting/1d |
2723951 ns |
2714804 ns |
1.00 |
array/sorting/2d |
1062439 ns |
1062310 ns |
1.00 |
array/sorting/by |
3254748 ns |
3268184 ns |
1.00 |
cuda/synchronization/context/auto |
1142.5 ns |
1134.3 ns |
1.01 |
cuda/synchronization/context/blocking |
933.1379310344828 ns |
923.3333333333334 ns |
1.01 |
cuda/synchronization/context/nonblocking |
6006.6 ns |
5929 ns |
1.01 |
cuda/synchronization/stream/auto |
998 ns |
997.2857142857143 ns |
1.00 |
cuda/synchronization/stream/blocking |
856.5694444444445 ns |
807.1684210526316 ns |
1.06 |
cuda/synchronization/stream/nonblocking |
6055.4 ns |
5894.8 ns |
1.03 |
integration/byval/reference |
143262 ns |
143239 ns |
1.00 |
integration/byval/slices=1 |
145353 ns |
145256 ns |
1.00 |
integration/byval/slices=2 |
284195 ns |
283668 ns |
1.00 |
integration/byval/slices=3 |
422419 ns |
422144 ns |
1.00 |
integration/cudadevrt |
101763 ns |
101766 ns |
1.00 |
integration/volumerhs |
8879798 ns |
8884780 ns |
1.00 |
kernel/indexing |
12634 ns |
12552 ns |
1.01 |
kernel/indexing_checked |
13423 ns |
13362 ns |
1.00 |
kernel/launch |
2154.777777777778 ns |
2078.4444444444443 ns |
1.04 |
kernel/occupancy |
691.527027027027 ns |
701.984962406015 ns |
0.99 |
kernel/rand |
14256 ns |
14424 ns |
0.99 |
latency/import |
3869046372 ns |
3859927635 ns |
1.00 |
latency/precompile |
4632083292 ns |
4623280780 ns |
1.00 |
latency/ttfp |
4486859471 ns |
4484391359 ns |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #3156 +/- ##
=======================================
Coverage 16.31% 16.31%
=======================================
Files 124 124
Lines 9875 9875
=======================================
Hits 1611 1611
Misses 8264 8264 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #3156 +/- ##
=======================================
Coverage 16.31% 16.31%
=======================================
Files 124 124
Lines 9875 9875
=======================================
Hits 1611 1611
Misses 8264 8264 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.