-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Draft][onert] Optimize FullyConnected #14335
Conversation
This commit adds ggml worker ONE-DCO-1.0-Signed-off-by: youngsik kim <[email protected]>
This commit is for optimizing bmm from sg5.lee ONE-DCO-1.0-Signed-off-by: youngsik kim <[email protected]>
This commit restores o2024 ggml ONE-DCO-1.0-Signed-off-by: youngsik kim <[email protected]>
* OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE | ||
* SOFTWARE. | ||
*/ | ||
|
||
#pragma once | ||
|
||
#define GGML_COMMON_DECL_C |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
o2024 ggml is restored,
the latest ggml has a performance degradation of 5% compared to the o2024 version
refer to glistening
|
||
ggml_setup_op_has_task_pass(); | ||
|
||
ggml_worker_init(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
create a thread in ggml_init, and the corresponding thread is waiting for work
#ifdef GGML_TRACE | ||
for(int i = 0; i < TRACE_THREAD_MAX_N; i++) | ||
{ | ||
chrome_trace_init(&trace_scope[i], i); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add trace for profiling ggml op
}; | ||
|
||
ggml_worker_submit(ggml_graph_compute_thread, &workers[j]); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- submit work with ggml_graph_computer_thread
- transmits the signal to the multi thread.
- the thread executes ggml_graph_computer_thread after wakeup.
I do not merge this PR. |
This is a test pr for fully connected optimization.
and there is no plan to merge this PR.
ONE-DCO-1.0-Signed-off-by: youngsik kim [email protected]