[MLAS/NEON] Add dedicated kernel for depthwise convolution for ARM64 using NEON intrinsics #26688
Open
hariharans29 wants to merge 31 commits into main from hari/expt_conv
+441 −15
Changes from 23 commits
160c71e  Initial commit  (hariharans29)
a134ea0  More changes  (hariharans29)
a44f708  More changes  (hariharans29)
3a3ccf7  Fix builds  (hariharans29)
212dbf1  Fix builds 2  (hariharans29)
fceae09  Threaded  (hariharans29)
3793d70  Fix x64 builds  (hariharans29)
481a7f6  Experiment  (hariharans29)
8993a0a  Experiment revert  (hariharans29)
d765c1a  Refactor  (hariharans29)
a428d50  More changes  (hariharans29)
d53dd15  a  (hariharans29)
8f12c51  Try  (hariharans29)
67b6801  More changes  (hariharans29)
01b43fb  Relax padding  (hariharans29)
ea83394  Vanilla NEON Depthwise  (hariharans29)
dd94a3b  Fix indexing  (hariharans29)
ffd291a  Add benchmark  (hariharans29)
92fb604  Add lambda  (hariharans29)
d15bb93  Rework  (hariharans29)
119ec9a  Update onnxruntime/test/mlas/bench/bench_sconv.cpp  (hariharans29)
d0fc143  Fix  (hariharans29)
2820a84  Remove Winograd implementation  (hariharans29)
0ffb811  Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp  (hariharans29)
59e2b2d  Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp  (hariharans29)
e34c930  Update onnxruntime/core/mlas/lib/convolve.cpp  (hariharans29)
027e742  Update onnxruntime/core/mlas/inc/mlas.h  (hariharans29)
f93ed67  Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp  (hariharans29)
f5c1b81  Update onnxruntime/core/mlas/lib/convolve.cpp  (hariharans29)
f15e554  Benchmark updates  (hariharans29)
bb324b5  Merge remote-tracking branch 'origin/main' into hari/expt_conv  (hariharans29)