Add GPULlama3.java as model provider to run on GPUs #1793

mikepapadim · 2025-09-18T13:29:49Z

This is an initial effort to integrate GPULlama3.java in Quarkus.

There is an ongoing effort to be merged in upstream LangChain4j : langchain4j/langchain4j#3654

However, it can work as standalone engine from quarkus-langchain4j

…`GPULlama3ChatModel` and update dependency.

…ions.

geoand · 2025-09-18T13:33:42Z

Nice!!

geoand · 2025-09-25T06:42:27Z

How would you like to to move this forward?

mikepapadim · 2025-09-25T09:34:53Z

How would you like to to move this forward?

I was expecting to first merge the PR and backport the latest state here. Since that might take some time, I can start preparing the backport in parallel and make the PR final, so you can test it.

geoand · 2025-09-25T09:39:55Z

No rush on my end, completely up to you on how you want to proceed

…sistency.

…simplification.

…lBuildConfig`, and `GpuLlama3FixedRuntimeConfigBuildItem` classes.

…time config classes.

…ntegrate with updated config classes.

…g compiler plugin setup, and removing unused jar plugin execution.

…imeConfig`, simplify `GpuLlama3Processor`, and introduce `LangChain4jGPULlama3BuildTimeConfig`.

…3` for consistency and clarity. Update relevant properties and dependencies accordingly.

…rder` and `GPULlama3Processor` for naming consistency.

…o `RuntimeConfig` and `FixedRuntimeConfig`, add documentation for clarity, and update references accordingly.

ibriq and others added 4 commits September 16, 2025 15:30

[wip] gpullama3 support

c555174

Refactor GPU Llama3 integration to replace GpuLlama3ChatModel with …

6b35703

…`GPULlama3ChatModel` and update dependency.

Reorganize imports in GPULlama3ChatModel to follow standard convent…

5583bc4

…ions.

Add GPULlama3BaseModel class

c4ad740

orionpapadakis added 17 commits October 13, 2025 17:40

Update mvn dependency to GPULlama3.java

60079a6

Remove quarkus-langchain4j-gpu-llama3 module.

1ecdecb

Rename GPU Llama3 module and artifact from gpu-llama3 to gpullama3.

03e1139

[WIP] Add integration tests for GPULlama3 module

0c2e50b

[WIP] Refactor GPULlama3 model to adopt langchain4j logic.

6cc665c

Move GPULlama3 classes out of runtime package for consistency.

78011cf

Externalize GPULlama3 dependency version to parent POM.

10fdf47

Rename artifacts and modules from gpullama3 to gpu-llama3 for con…

f471170

…sistency.

Rename GpuLlama3ChatModelBuildConfig to ChatModelBuildConfig for …

5b64050

…simplification.

Remove deprecated GpuLlama3ConfigBuildItem, `GpuLlama3EmbeddingMode…

216cd3a

…lBuildConfig`, and `GpuLlama3FixedRuntimeConfigBuildItem` classes.

Refactor GPU Llama3 configuration into separate runtime and fixed run…

a2e9779

…time config classes.

Refactor GpuLlama3Recorder to simplify configuration handling and i…

6209b4e

…ntegrate with updated config classes.

Simplify pom.xml by standardizing deployment configuration, updatin…

af7c656

…g compiler plugin setup, and removing unused jar plugin execution.

Refactor GPU Llama3 deployment configuration: remove `GpuLlama3BuildT…

2e29393

…imeConfig`, simplify `GpuLlama3Processor`, and introduce `LangChain4jGPULlama3BuildTimeConfig`.

Rename GPULlama3 artifacts, modules, and configurations to `gpu-llama…

4aa2db6

…3` for consistency and clarity. Update relevant properties and dependencies accordingly.

Rename GpuLlama3Recorder and GpuLlama3Processor to `GPULlama3Reco…

d1888ba

…rder` and `GPULlama3Processor` for naming consistency.

Refactor GPU Llama3 configuration classes: rename GpuLlama3Config t…

5e6840a

…o `RuntimeConfig` and `FixedRuntimeConfig`, add documentation for clarity, and update references accordingly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add GPULlama3.java as model provider to run on GPUs #1793

Add GPULlama3.java as model provider to run on GPUs #1793

mikepapadim commented Sep 18, 2025

Uh oh!

geoand commented Sep 18, 2025

Uh oh!

geoand commented Sep 25, 2025

Uh oh!

mikepapadim commented Sep 25, 2025

Uh oh!

geoand commented Sep 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add GPULlama3.java as model provider to run on GPUs #1793

Are you sure you want to change the base?

Add GPULlama3.java as model provider to run on GPUs #1793

Conversation

mikepapadim commented Sep 18, 2025

Uh oh!

geoand commented Sep 18, 2025

Uh oh!

geoand commented Sep 25, 2025

Uh oh!

mikepapadim commented Sep 25, 2025

Uh oh!

geoand commented Sep 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants