Introduce Mistral AI support #32

Open · wants to merge 1 commit into main
Conversation

ThomasVitale (Collaborator)

  • Configure MistralAiClient using the Spring HTTP infrastructure
  • Define auto-configuration for using Mistral AI chat and embedding models in Spring Boot
  • Create a Spring Boot starter to provide the integration with Mistral AI

Signed-off-by: Thomas Vitale <[email protected]>
@ThomasVitale ThomasVitale requested a review from langchain4j July 6, 2024 09:59
@ThomasVitale (Collaborator, Author)

The integration tests run only when a MISTRAL_AI_API_KEY environment variable is defined.

@ThomasVitale (Collaborator, Author)

This change in core LangChain4j would enhance the Spring support for Mistral AI, including the ability to use an auto-configured RestClient. langchain4j/langchain4j#1416

@langchain4j (Owner) left a comment

Hi @ThomasVitale, thanks a lot!

BTW, how will this work together with langchain4j/langchain4j#1103?
With the proposed approach, each new model provider will require its own langchain4j-spring-{model-provider} module, with quite a lot of duplication. (The aim of langchain4j/langchain4j#1103 is to have model-agnostic HTTP clients that can plug into any model, so client code does not have to be re-implemented over and over again.)
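To illustrate the model-agnostic idea behind langchain4j/langchain4j#1103: one shared HTTP client abstraction handles transport, and each provider module only supplies endpoint paths and payload mapping. The interface and class names below are illustrative sketches, not the actual #1103 API:

```java
import java.util.Map;

// Hypothetical sketch of a model-agnostic HTTP client abstraction.
// A single implementation (e.g. backed by Spring's RestClient) could be
// reused by every provider module instead of each one re-implementing it.
interface ModelHttpClient {
    // Sends a JSON request body to the provider endpoint and returns the raw JSON response.
    String post(String path, Map<String, String> headers, String jsonBody);
}

// A provider module (e.g. Mistral AI) would then only map its API surface,
// delegating all transport concerns to the shared client.
class FakeMistralChatApi {
    private final ModelHttpClient client;

    FakeMistralChatApi(ModelHttpClient client) {
        this.client = client;
    }

    String chatCompletion(String jsonRequest) {
        return client.post("/v1/chat/completions",
                Map.of("Content-Type", "application/json"), jsonRequest);
    }
}
```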

.toSingleValueMap()
.entrySet()
.stream()
.filter(e -> !e.getKey().equals(HttpHeaders.AUTHORIZATION))
@langchain4j (Owner) commented on Jul 10, 2024

There are also other common auth headers, such as X-Auth-Token and X-API-KEY (and header names can be written in different cases).
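A case-insensitive filter along the lines the reviewer suggests might look like this; the helper class and the exact set of header names are made up for illustration:

```java
import java.util.Map;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical sketch: strip common auth headers case-insensitively
// (e.g. before logging), rather than matching only "Authorization".
final class HeaderUtils {
    private static final Set<String> SENSITIVE_HEADERS =
            Set.of("authorization", "x-auth-token", "x-api-key");

    static Map<String, String> withoutAuthHeaders(Map<String, String> headers) {
        return headers.entrySet().stream()
                // Lower-case each name so AUTHORIZATION, Authorization, etc. all match.
                .filter(e -> !SENSITIVE_HEADERS.contains(e.getKey().toLowerCase()))
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue));
    }
}
```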

@Override
public MistralAiChatCompletionResponse chatCompletion(MistralAiChatCompletionRequest chatCompletionRequest) {
Assert.notNull(chatCompletionRequest, "chatCompletionRequest cannot be null");
Assert.isTrue(!chatCompletionRequest.getStream(), "stream mode must be disabled");
@langchain4j (Owner)

nit:

Suggested change:
- Assert.isTrue(!chatCompletionRequest.getStream(), "stream mode must be disabled");
+ Assert.isFalse(chatCompletionRequest.getStream(), "stream mode must be disabled");

Assert.notNull(chatCompletionRequest, "chatCompletionRequest cannot be null");
Assert.isTrue(!chatCompletionRequest.getStream(), "stream mode must be disabled");

logger.debug("Sending chat completion request: {}", chatCompletionRequest);
@langchain4j (Owner)

Is this required? It is already logged in HttpLoggingInterceptor
(same for other methods below)

* Adapted from MistralAiChatModelIT in the LangChain4j project.
*/
@EnabledIfEnvironmentVariable(named = "MISTRAL_AI_API_KEY", matches = ".*")
class MistralAiChatModelIT {
@langchain4j (Owner)

What about making MistralAiChatModelIT in the main repo abstract and inheriting from it here (instead of duplicating the tests), similar to EmbeddingStoreIT?
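The pattern the reviewer describes can be sketched as a base test class holding the shared logic, with a provider-specific subclass supplying the model under test. Class and method names below are illustrative, not the actual EmbeddingStoreIT API:

```java
// Hypothetical sketch: the abstract IT lives in the main repo and
// contains the shared test scenarios exactly once.
abstract class AbstractChatModelIT {
    // Each provider module implements this to supply its configured model id.
    protected abstract String modelUnderTest();

    // Shared test logic is written once in the base class.
    String describeScenario() {
        return "should_generate_answer [" + modelUnderTest() + "]";
    }
}

// The Spring module then only provides the provider-specific wiring.
class MistralAiChatModelITSketch extends AbstractChatModelIT {
    @Override
    protected String modelUnderTest() {
        return "mistral-small-latest";
    }
}
```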

@ConfigurationProperties(prefix = MistralAiChatProperties.CONFIG_PREFIX)
public class MistralAiChatProperties {

public static final String CONFIG_PREFIX = "langchain4j.mistralai.chat";
@langchain4j (Owner)

nit:

Suggested change:
- public static final String CONFIG_PREFIX = "langchain4j.mistralai.chat";
+ public static final String CONFIG_PREFIX = "langchain4j.mistral-ai.chat-model";

/**
* What sampling temperature to use, between 0.0 and 1.0. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. We generally recommend altering this or "top_p" but not both.
*/
private Double temperature = 0.7;
@langchain4j (Owner)

Do we need to set default values here, except for mandatory params such as the model? I would avoid setting defaults here.

@ConfigurationProperties(prefix = MistralAiEmbeddingProperties.CONFIG_PREFIX)
public class MistralAiEmbeddingProperties {

public static final String CONFIG_PREFIX = "langchain4j.mistralai.embedding";
@langchain4j (Owner)

Suggested change:
- public static final String CONFIG_PREFIX = "langchain4j.mistralai.embedding";
+ public static final String CONFIG_PREFIX = "langchain4j.mistral-ai.embedding-model";

@ConfigurationProperties(MistralAiProperties.CONFIG_PREFIX)
public class MistralAiProperties {

public static final String CONFIG_PREFIX = "langchain4j.mistralai";
@langchain4j (Owner)

Suggested change:
- public static final String CONFIG_PREFIX = "langchain4j.mistralai";
+ public static final String CONFIG_PREFIX = "langchain4j.mistral-ai";

class MistralAiAutoConfigurationIT {

private final ApplicationContextRunner contextRunner = new ApplicationContextRunner()
.withPropertyValues("langchain4j.mistralai.client.apiKey=" + System.getenv("MISTRAL_AI_API_KEY"))
@langchain4j (Owner)

Having apiKey and other client parameters configured once for all models simplifies configuration a bit, but it is not flexible enough: one cannot use different API keys for different models, enable response logging for the chat model only (embedding model responses are huge and make little sense to log), or set different timeouts per model type (chat models are much, much slower than embedding models).
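A per-model property layout along the lines the reviewer describes might look like this; the property names are hypothetical, not keys this PR defines:

```properties
# Hypothetical per-model configuration: each model type carries its own
# client settings instead of sharing one langchain4j.mistralai.client block.
langchain4j.mistral-ai.chat-model.api-key=${MISTRAL_AI_CHAT_API_KEY}
langchain4j.mistral-ai.chat-model.log-responses=true
langchain4j.mistral-ai.chat-model.timeout=60s

langchain4j.mistral-ai.embedding-model.api-key=${MISTRAL_AI_EMBEDDING_API_KEY}
langchain4j.mistral-ai.embedding-model.log-responses=false
langchain4j.mistral-ai.embedding-model.timeout=15s
```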

@ThomasVitale (Collaborator, Author)

@langchain4j thanks so much for the review and sorry for the late answer. I'm trying to find the time to look more into the new HTTP client abstractions in Core to rethink what I've done in this PR.

@langchain4j (Owner)

@ThomasVitale thanks a lot in advance! BTW langchain4j/langchain4j#1103 is not set in stone, so feel free to suggest changes if you see the need!
