In our ongoing exploration of Docker Model Runner, we've covered its OCI-based model management, performance architecture, OpenAI-compatible API, and Docker Compose integration. Now, we turn to a specific, yet highly impactful, synergy: how Docker Model Runner empowers Java developers using the Spring AI framework to seamlessly incorporate local Large Language Models (LLMs) into their applications.
For Java engineers invested in the Spring ecosystem, Spring AI offers a familiar and powerful abstraction layer for interacting with various AI models. Docker Model Runner's OpenAI-compatible API gives those developers a straightforward path to local models without stepping outside the conventional Spring development paradigm.
Spring AI: Simplifying AI for Java Applications
Before diving into the integration, it's worth briefly understanding Spring AI's mission. Spring AI aims to apply core Spring principles—such as autoconfiguration, dependency injection, and portable service abstractions—to the domain of artificial intelligence. It provides Java developers with:
- Consistent APIs: A unified API for interacting with different AI models (both local and remote), reducing the need to learn multiple vendor-specific SDKs.
- Abstraction Layers: Components like ChatClient, EmbeddingClient, and ImageClient abstract away the underlying model provider.
- Integration with Spring Boot: Easy setup and configuration within Spring Boot applications.
Docker Model Runner as a Local "Ollama" for Spring AI
Spring AI supports various AI model providers, including commercial cloud services (like OpenAI, Azure OpenAI) and self-hosted solutions (like Ollama). From Spring AI's perspective, Docker Model Runner, with its OpenAI-compatible API, effectively acts like a local, easily manageable Ollama-style endpoint.
When Docker Model Runner is active and serving a model (e.g., Llama 3, Gemma) with its API endpoint accessible (typically http://localhost:12434 or http://model-runner.docker.internal if accessed from another container), Spring AI can be configured to point to it.
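Before wiring up Spring AI, it can be handy to confirm that the endpoint is actually reachable. The sketch below uses the JDK's built-in HTTP client to fetch the model listing; it assumes Docker Model Runner is running locally and that, like other OpenAI-compatible servers, it exposes a models endpoint under /engines/v1/models (adjust the host, port, and path for your setup).

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ModelRunnerSmokeTest {

    public static void main(String[] args) throws Exception {
        // Assumed Docker Model Runner endpoint; adjust for your environment.
        String endpoint = "http://localhost:12434/engines/v1/models";

        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(endpoint))
                .GET()
                .build();

        // Print the raw JSON model listing returned by the OpenAI-compatible API.
        HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println("Status: " + response.statusCode());
        System.out.println(response.body());
    }
}
```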
Here's how a Java engineer benefits:
Simplified Configuration in Spring Boot
Spring AI's autoconfiguration can often detect and set up the necessary beans to interact with an OpenAI-compatible endpoint. For Docker Model Runner, this typically involves setting a few properties in your application.properties or application.yml file:
```properties
# For Spring AI 0.8.x (or similar versions)
spring.ai.openai.chat.base-url=http://localhost:12434/engines/v1
# Or your specific DMR endpoint
spring.ai.openai.chat.options.model=ai/llama3.2:1B-Q8_0
# The model you want to use
spring.ai.openai.api-key=YOUR_DUMMY_API_KEY_OR_EMPTY
# Potentially disable API key if DMR doesn't require it strictly for local use
```

(Note: The exact property names and structure might vary slightly based on the Spring AI version and whether you're configuring a generic OpenAI client or a more specific Ollama-like client type if Spring AI introduces more direct DMR support.)
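If you prefer explicit wiring over autoconfiguration, or the properties above don't match your Spring AI version, the client can also be defined as a bean by hand. The sketch below assumes Spring AI 0.8.x class names (OpenAiApi, OpenAiChatClient, OpenAiChatOptions); later milestones rename some of these, so treat it as a starting point rather than a drop-in.

```java
import org.springframework.ai.chat.ChatClient;
import org.springframework.ai.openai.OpenAiChatClient;
import org.springframework.ai.openai.OpenAiChatOptions;
import org.springframework.ai.openai.api.OpenAiApi;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class ModelRunnerConfig {

    @Bean
    public ChatClient chatClient() {
        // Point the OpenAI-style client at the local Docker Model Runner endpoint.
        // The API key is a placeholder; local DMR setups typically don't validate it.
        OpenAiApi openAiApi = new OpenAiApi("http://localhost:12434/engines/v1", "dummy-key");

        OpenAiChatOptions options = OpenAiChatOptions.builder()
                .withModel("ai/llama3.2:1B-Q8_0")
                .build();

        return new OpenAiChatClient(openAiApi, options);
    }
}
```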
Leveraging Spring AI's ChatClient and EmbeddingClient
Once configured, developers can inject and use Spring AI's standard clients without needing to know that the underlying provider is Docker Model Runner.
```java
import org.springframework.ai.chat.ChatClient;
import org.springframework.ai.chat.prompt.Prompt;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.stereotype.Service;

@Service
public class MyAiService {

    private final ChatClient chatClient;

    @Autowired
    public MyAiService(ChatClient chatClient) {
        this.chatClient = chatClient;
    }

    public String getJokeAbout(String topic) {
        Prompt prompt = new Prompt("Tell me a short joke about " + topic);
        return chatClient.call(prompt).getResult().getOutput().getContent();
    }
}
```
This code remains the same whether Spring AI is talking to OpenAI's cloud API, a self-hosted Ollama instance, or Docker Model Runner serving a local model. This portability is a huge win.
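The same portability applies to embeddings. The sketch below is hypothetical: it assumes an embedding-capable model is served through Docker Model Runner and that Spring AI's EmbeddingClient (0.8.x naming) is configured against the same endpoint.

```java
import java.util.List;

import org.springframework.ai.embedding.EmbeddingClient;
import org.springframework.stereotype.Service;

@Service
public class MyEmbeddingService {

    private final EmbeddingClient embeddingClient;

    public MyEmbeddingService(EmbeddingClient embeddingClient) {
        this.embeddingClient = embeddingClient;
    }

    public List<Double> embed(String text) {
        // Returns the embedding vector for the given text, regardless of
        // whether the provider behind the client is local or cloud-hosted.
        return embeddingClient.embed(text);
    }
}
```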
Seamless Local Development and Testing
Engineers can develop and test AI-driven features entirely locally using their preferred Java tools and the Spring framework. Docker Model Runner handles the model serving, and Spring AI provides the clean Java interface. This speeds up iteration cycles and reduces reliance on potentially costly cloud APIs during development.
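As a concrete example of that local loop, a plain Spring Boot test can exercise the service against the locally served model. This is a sketch: it assumes Docker Model Runner is already running with the configured model pulled, and it reuses the MyAiService class from above.

```java
import static org.junit.jupiter.api.Assertions.assertFalse;

import org.junit.jupiter.api.Test;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.boot.test.context.SpringBootTest;

@SpringBootTest
class MyAiServiceLocalIT {

    @Autowired
    private MyAiService myAiService;

    @Test
    void generatesAJokeAgainstTheLocalModel() {
        // Calls whatever model Docker Model Runner is serving on localhost.
        String joke = myAiService.getJokeAbout("containers");
        assertFalse(joke.isBlank(), "Expected a non-empty completion from the local model");
    }
}
```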
Consistency with Production (Potentially)
While Docker Model Runner is primarily for local development, the abstraction provided by Spring AI means that switching to a production-grade, potentially cloud-hosted model provider for deployment can be achieved mainly through configuration changes, without altering the core application logic.
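For example, one Spring profile per environment keeps the switch purely in configuration. The property names below mirror the earlier snippet and depend on your Spring AI version; the prod endpoint and model shown are illustrative assumptions, not a prescribed setup.

```properties
# application-local.properties: local development against Docker Model Runner
spring.ai.openai.chat.base-url=http://localhost:12434/engines/v1
spring.ai.openai.chat.options.model=ai/llama3.2:1B-Q8_0
spring.ai.openai.api-key=dummy-key

# application-prod.properties: cloud-hosted provider, same application code
spring.ai.openai.chat.base-url=https://api.openai.com
spring.ai.openai.chat.options.model=gpt-4o-mini
spring.ai.openai.api-key=${OPENAI_API_KEY}
```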
The Bigger Picture: Local AI in Enterprise Java
The integration with Spring AI is significant because it brings the ease of local LLM experimentation directly into the robust, enterprise-focused Java and Spring ecosystem. It allows Java teams to:
- Prototype AI features rapidly.
- Upskill on AI concepts using familiar tools.
- Conduct local, private testing of AI interactions with business data.
- Integrate AI into existing Spring Boot applications with minimal friction.
Docker's collaboration with Spring AI (as noted in some announcements) underscores a shared vision of making AI more accessible and developer-friendly across different programming environments. By ensuring Docker Model Runner presents an API that Spring AI can readily consume, both platforms contribute to lowering the barrier to entry for sophisticated AI development.
For Java engineers, this means Docker Model Runner isn't just another tool; it's a key enabler for leveraging the power of local LLMs within the comfort and productivity of the Spring framework.
Next, we'll delve into some practical, task-specific configurations and advanced use cases you can explore with Docker Model Runner, moving beyond basic chat completions.
This blog post is based on information about Docker Model Runner, a Beta feature. Features, commands, and APIs are subject to change. Configuration details for Spring AI may vary based on specific versions.