Add NPU Engine #31

szeyu · 2024-09-05T02:51:08Z

Add NPU Engine #28

Reference : Phi-3 Cookbook - Intel NPU acceleration library

Update in:

modelui.py
engine.py
setup.py
README.md

Add:

npu_egnine.py
requirements-npu.txt
npu_models.md

fix the logic error of if else

tjtanaa · 2024-09-25T21:37:33Z

@szeyu The NPU support looks good. I added some reviews. If those are resolved then it should be go to be merged.

Before merging this PR, can you first merged the OpenVINO vison model PR then only merge this NPU Engine?

merge npu merge npu merge npu merge npu merge npu merge npu merge npu merge npu

tjtanaa

I have added some comments, please take a look and let me know if you have any thoughts or questions.

tjtanaa · 2024-09-25T21:46:14Z

src/embeddedllm/engine.py

@@ -56,6 +56,16 @@ def __init__(self, model_path: str, vision: bool, device: str = "xpu", backend:

            self.engine = OnnxruntimeEngine(self.model_path, self.vision, self.device)
            logger.info(f"Initializing onnxruntime backend ({backend.upper()}): OnnxruntimeEngine")
+
+        elif self.backend == "npu":


Can you find a way to detect if this is Intel or AMD machine?

If user specify npu device, you have to check if it is Intel or AMD first.
> If the machine is Intel, then you continue to load model using intel_npu_engine.py.
> If the machine is AMD, then you throw error message saying that NPU support on AMD platform is not supported yet.

tjtanaa · 2024-09-25T21:47:29Z

src/embeddedllm/backend/npu_engine.py

@@ -0,0 +1,268 @@
+import contextlib


Can you rename npu_engine.py into intel_npu_engine.py as this is NPU code for Intel only?

Do DM me on Whatsapp to discuss about this if you think otherwise.

… processor

szeyu and others added 5 commits September 3, 2024 14:02

update npu models and engine setup

014a411

Update README.md

3e92ce5

Update README.md

736ea85

fix the typo of __init__

0bacaa7

Update modelui.py

2d730a3

fix the logic error of if else

szeyu added the type: enhancement / feature New feature or request label Sep 5, 2024

tjtanaa assigned szeyu Sep 25, 2024

tjtanaa linked an issue Sep 25, 2024 that may be closed by this pull request

[FEAT] Support OpenVINO NPU device #34

Open

[BUG FIXED] Update gradio version in requirements-webui.txt

5504f88

szeyu removed a link to an issue Sep 26, 2024

[FEAT] Support OpenVINO NPU device #34

Open

szeyu linked an issue Sep 26, 2024 that may be closed by this pull request

[FEAT] Add NPU Engine #28

Open

szeyu added 3 commits September 26, 2024 14:41

Merge branch 'szeyu-patch-2' into szeyu-npu-1

c5aee57

update gitignore

abfab05

Merge branch 'main' into szeyu-npu-1

e77cd9d

merge npu merge npu merge npu merge npu merge npu merge npu merge npu merge npu

tjtanaa requested changes Sep 27, 2024

View reviewed changes

szeyu added 2 commits October 4, 2024 11:26

Renamed to npu_engine to intel_npu_engine to specify that it is intel…

d7586d4

… processor

Add support for Intel NPU backend and handle unsupported processors

e0d320f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NPU Engine #31

Add NPU Engine #31

szeyu commented Sep 5, 2024

tjtanaa commented Sep 25, 2024 •

edited

Loading

tjtanaa left a comment

tjtanaa Sep 25, 2024

tjtanaa Sep 25, 2024

Add NPU Engine #31

Are you sure you want to change the base?

Add NPU Engine #31

Conversation

szeyu commented Sep 5, 2024

Add NPU Engine #28

tjtanaa commented Sep 25, 2024 • edited Loading

tjtanaa left a comment

Choose a reason for hiding this comment

tjtanaa Sep 25, 2024

Choose a reason for hiding this comment

tjtanaa Sep 25, 2024

Choose a reason for hiding this comment

tjtanaa commented Sep 25, 2024 •

edited

Loading