google-genai [minor]: Support Gemini system messages #7235

Open · wants to merge 10 commits into base: main
Conversation

@chrisnathan20 commented Nov 20, 2024

Fixes #5069

Currently, for Google Gen AI, a system message (which can only be the first element in the list of messages) is prepended to the following human message in order to steer the model's behavior. This handling is the same for every model, even though some newer Gen AI models support passing system messages directly via the systemInstruction field, per https://ai.google.dev/gemini-api/docs/system-instructions?lang=node
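For context, direct system instructions with the underlying Node SDK look roughly like the following. This is a minimal sketch based on the linked docs; the API key handling, model name, and prompt text are placeholders and are not part of this PR:

```ts
import { GoogleGenerativeAI } from "@google/generative-ai";

// Placeholder API key handling; adjust to your environment.
const genAI = new GoogleGenerativeAI(process.env.GOOGLE_API_KEY ?? "");

async function main() {
  const model = genAI.getGenerativeModel({
    // Assumed to be a model that supports system instructions.
    model: "gemini-1.5-flash",
    // The system message is passed directly instead of being prepended
    // to the first human message.
    systemInstruction: "You are a helpful assistant that answers concisely.",
  });

  const result = await model.generateContent("What is LangChain?");
  console.log(result.response.text());
}

main();
```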

This PR adds support for models that allow direct system instructions by providing an alternative way of handling system messages, used when the model supports it or when explicitly set via convertSystemMessageToHumanContent. The changes in this PR are modeled after the changes in google-common that added system instruction support (#5089).

High Level Outline of Changes and Additions:

  1. Added an optional convertSystemMessageToHumanContent field, which overrides the model-based decision of whether to use the new system message logic. This also matches the Generative AI package for Python.
  2. Added a computeUseSystemInstruction method to determine whether the model supports system instructions. The list of supported and unsupported models is taken from google-common's, with deprecated (PaLM) models removed.
  3. Added a new optional boolean parameter to convertBaseMessagesToContent, defaulting to false. Setting it to true triggers the new handling of system messages, where a system message is treated as an independent message rather than prepended to the following user message, while the existing restrictions still apply.
  4. Prior to sending the prompt for response generation, if the prompt's first message is an independent system message, it is removed from the prompt to be sent and set as the model's system instruction instead. Note that an independent system message can only be the first message in the prompt and can only exist when the new system message handling in convertBaseMessagesToContent is triggered, which in turn only happens for models that support system instructions. A sketch of the intended usage follows this list.
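Taken together, the intended usage from the LangChain side would look roughly like this. It is a sketch under the assumption that the option and parameter names land exactly as described above; the model name and messages are placeholders:

```ts
import { ChatGoogleGenerativeAI } from "@langchain/google-genai";
import { HumanMessage, SystemMessage } from "@langchain/core/messages";

async function main() {
  const chat = new ChatGoogleGenerativeAI({
    // Assumed to be a model that supports native system instructions.
    model: "gemini-1.5-pro",
    // Optional override described in item 1 above: setting this to true would
    // force the legacy behavior of prepending the system message to the first
    // human message even when the model supports system instructions.
    // convertSystemMessageToHumanContent: false,
  });

  const response = await chat.invoke([
    new SystemMessage("Answer every question in exactly one sentence."),
    new HumanMessage("What is a system instruction?"),
  ]);
  console.log(response.content);
}

main();
```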

Test Cases created (see the sketch after this list for an example):

  1. Given: Input has single system message followed by one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: the system message is removed from the prompt and passed as the systemInstruction field instead, and actualPrompt would only have one user message under the role of user

  2. Given: Input has single system message followed by one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: the system message is not removed from the prompt; actualPrompt would only have the system message prepended to the user message under the role of user

  3. Given: Input has a system message that is not the first message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  4. Given: Input has a system message that is not the first message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  5. Given: Input has multiple system messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  6. Given: Input has multiple system messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  7. Given: Input has no system message and one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: actualPrompt would only have the user message under the role of user

  8. Given: Input has no system message and one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: actualPrompt would only have the user message under the role of user

  9. Given: Input has no system message and multiple user messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: actualPrompt would only have the first user message followed by the next user messages as separate messages

  10. Given: Input has no system message and multiple user messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: actualPrompt would only have the first user message followed by the next user messages as separate messages
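As an illustration, here is a Jest-style sketch of case 3. The import path, the parameter order of convertBaseMessagesToContent, and the error-matching style are assumptions based on the outline above, not code taken from this PR:

```ts
import { HumanMessage, SystemMessage } from "@langchain/core/messages";
// Assumed import path; in langchainjs the helper lives in the package's utils.
import { convertBaseMessagesToContent } from "../utils/common.js";

test("system message that is not the first message throws (case 3)", () => {
  const messages = [
    new HumanMessage("Hello"),
    new SystemMessage("You are a helpful assistant."),
  ];
  expect(() =>
    // Assumed parameter order: (messages, isMultimodalModel, useSystemInstruction).
    convertBaseMessagesToContent(messages, false, true)
  ).toThrow("System message should be the first one");
});
```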

Credits:
Issue selection - @shannon-ab
Implementation - @chrisnathan20 @garychen2002
Testing - @shannon-ab @martinl498

vercel bot commented Nov 20, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

langchainjs-docs: ✅ Ready (preview available), updated Nov 20, 2024 8:25pm
langchainjs-api-refs: ⬜️ Ignored (1 skipped deployment), updated Nov 20, 2024 8:25pm

@chrisnathan20 marked this pull request as ready for review on November 21, 2024 18:56
@dosubot added the size:L (This PR changes 100-499 lines, ignoring generated files) and auto:improvement (Medium size change to existing code to handle new use-cases) labels on Nov 21, 2024
@chrisnathan20 (Author) commented:

Hi @jacoblee93 and @afirstenberg - this is our PR to support system instructions for capable Generative AI models based on the implementation plan shared under the issue. If there are any points of concern or improvements, please do not hesitate to let us know. Thank you and have a great rest of your day!
