fix: check for latest OpenAI model names #6027

ArzelaAscoIi · 2023-10-11T12:09:27Z

Related Issues

TBD

Proposed Changes:

Check for the latest model names: https://platform.openai.com/docs/models/continuous-model-upgrades

How did you test it?

not tested yet

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

mathislucka · 2023-10-11T12:16:04Z

haystack/nodes/prompt/invocation_layer/open_ai.py

@@ -270,7 +272,7 @@ def _ensure_token_limit(self, prompt: Union[str, List[Dict[str, str]]]) -> Union

    @classmethod
    def supports(cls, model_name_or_path: str, **kwargs) -> bool:
-        valid_model = model_name_or_path in ["ada", "babbage", "davinci", "curie", "gpt-3.5-turbo-instruct"] or any(
+        valid_model = model_name_or_path in OPEN_AI_MODEL_NAMES or any(


Does this work for the snapshot models (e.g. gpt-3.5-turbo-0613)?

ArzelaAscoIi · 2023-10-11 (edited)

No. :/ I just added the one that was missing for us. Basically, we need to either come up with a hard-coded rule that matches all OpenAI models, or implement some kind of dynamic validation, e.g. by fetching all available models and checking whether the chosen one is among them.

Happy to add extra model names
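
For illustration, a minimal sketch of what a hard-coded rule covering snapshot models could look like. The family names come from the diff in this PR; the helper name `is_openai_model` and the regular expression are assumptions made for this example and are not part of Haystack.

```python
import re

# Family names as listed in this PR; everything else here is an assumption.
OPEN_AI_MODEL_FAMILIES = [
    "ada", "babbage", "davinci", "curie",
    "gpt-3.5-turbo-instruct", "gpt-3.5-turbo", "gpt-4",
]

# Accept a bare family name, or a family name followed by one or more
# "-<suffix>" parts such as "-0613" or "-32k-0613".
_FAMILY_PATTERN = re.compile(
    r"^(?:" + "|".join(re.escape(name) for name in OPEN_AI_MODEL_FAMILIES) + r")(?:-[\w.]+)*$"
)


def is_openai_model(model_name_or_path: str) -> bool:
    """Hypothetical helper: True for a known family or a snapshot of one."""
    return _FAMILY_PATTERN.match(model_name_or_path) is not None


if __name__ == "__main__":
    for name in ["gpt-3.5-turbo-0613", "gpt-4-32k-0613", "davinci", "llama-2-7b"]:
        print(name, is_openai_model(name))
```

Note that a prefix rule like this does not cover names such as text-davinci-003; those would still need the substring check that the existing supports method performs.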

anakin87 · 2023-10-11T12:47:40Z

Hello!

A general note.
Currently, in Haystack, OpenAI models are supported using two different invocation layers:

OpenAI for old GPT-3 models and gpt-3.5-turbo-instruct
ChatGPT for gpt-3.5-turbo and gpt-4 families of models

Check out the two different supports methods:

haystack/haystack/nodes/prompt/invocation_layer/open_ai.py

Lines 272 to 276 in 3803d23

    
    def supports(cls, model_name_or_path: str, **kwargs) -> bool:
        valid_model = model_name_or_path in ["ada", "babbage", "davinci", "curie", "gpt-3.5-turbo-instruct"] or any(
            m in model_name_or_path for m in ["-ada-", "-babbage-", "-davinci-", "-curie-"]
        )
        return valid_model and not has_azure_parameters(**kwargs)

haystack/haystack/nodes/prompt/invocation_layer/chatgpt.py

Lines 106 to 111 in 3803d23

    
    def supports(cls, model_name_or_path: str, **kwargs) -> bool:
        valid_model = (
            any(m for m in ["gpt-3.5-turbo", "gpt-4"] if m in model_name_or_path)
            and not "gpt-3.5-turbo-instruct" in model_name_or_path
        )
        return valid_model and not has_azure_parameters(**kwargs)

My impression is that the models added in this PR (gpt-3.5-turbo and gpt-4) are already supported by the ChatGPT layer.
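
To make the split between the two layers concrete, here is a small standalone sketch that copies the two rules quoted above into plain functions and prints which layer would accept a given name. The function names `openai_supports` and `chatgpt_supports` are made up for this example, and `has_azure_parameters` is a simplified stand-in whose key names are an assumption; the real helper may differ.

```python
def has_azure_parameters(**kwargs) -> bool:
    # Simplified stand-in for Haystack's helper; the key names are an assumption.
    return any(kwargs.get(key) for key in ("azure_base_url", "azure_deployment_name"))


def openai_supports(model_name_or_path: str, **kwargs) -> bool:
    # Same rule as the OpenAI invocation layer quoted above.
    valid_model = model_name_or_path in ["ada", "babbage", "davinci", "curie", "gpt-3.5-turbo-instruct"] or any(
        m in model_name_or_path for m in ["-ada-", "-babbage-", "-davinci-", "-curie-"]
    )
    return valid_model and not has_azure_parameters(**kwargs)


def chatgpt_supports(model_name_or_path: str, **kwargs) -> bool:
    # Same rule as the ChatGPT invocation layer quoted above.
    valid_model = (
        any(m in model_name_or_path for m in ["gpt-3.5-turbo", "gpt-4"])
        and "gpt-3.5-turbo-instruct" not in model_name_or_path
    )
    return valid_model and not has_azure_parameters(**kwargs)


if __name__ == "__main__":
    for name in ["gpt-3.5-turbo-0613", "gpt-4", "gpt-3.5-turbo-instruct", "text-davinci-003"]:
        print(f"{name}: OpenAI={openai_supports(name)}, ChatGPT={chatgpt_supports(name)}")
```

Under these rules the snapshot name gpt-3.5-turbo-0613 is rejected by the OpenAI rule and accepted by the ChatGPT rule, because the latter uses substring matching.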

wochinge · 2023-10-11T13:06:37Z

haystack/nodes/prompt/invocation_layer/open_ai.py

@@ -20,6 +20,8 @@

 logger = logging.getLogger(__name__)

+OPEN_AI_MODEL_NAMES = ["ada", "babbage", "davinci", "curie", "gpt-3.5-turbo-instruct", "gpt-3.5-turbo", "gpt-4"]


How about making this configurable via environment variables?
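
A rough sketch of how that could look, assuming a comma-separated list in a single variable. The variable name `HAYSTACK_OPEN_AI_MODEL_NAMES` and the helper `load_model_names` are hypothetical and do not exist in Haystack; the defaults are the list from this diff.

```python
import os
from typing import List, Mapping, Optional

# Defaults taken from the diff in this PR.
DEFAULT_OPEN_AI_MODEL_NAMES = [
    "ada", "babbage", "davinci", "curie",
    "gpt-3.5-turbo-instruct", "gpt-3.5-turbo", "gpt-4",
]

# Hypothetical variable name, chosen only for this example.
MODEL_NAMES_ENV_VAR = "HAYSTACK_OPEN_AI_MODEL_NAMES"


def load_model_names(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return model names from the environment, falling back to the defaults."""
    env = os.environ if env is None else env
    raw = env.get(MODEL_NAMES_ENV_VAR, "")
    # Split on commas and drop empty entries and surrounding whitespace.
    names = [name.strip() for name in raw.split(",") if name.strip()]
    return names or list(DEFAULT_OPEN_AI_MODEL_NAMES)


if __name__ == "__main__":
    print(load_model_names())
```

Passing the environment as an argument keeps the helper easy to test without touching the real process environment.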

ArzelaAscoIi · 2023-10-11T16:19:15Z

Okay! Makes sense. We will add an extra step to also validate the ChatGPT invocation layer. Thanks for your help!

fix: check for latest OpenAI model names

ec9665e

ArzelaAscoIi requested a review from a team as a code owner October 11, 2023 12:09

ArzelaAscoIi requested review from julian-risch and removed request for a team October 11, 2023 12:09

ArzelaAscoIi marked this pull request as draft October 11, 2023 12:09

mathislucka reviewed Oct 11, 2023

wochinge reviewed Oct 11, 2023

masci self-assigned this Oct 11, 2023

ArzelaAscoIi closed this Oct 11, 2023
