feat: implement slim openai api fallback #169
base: master
Conversation
@@ -20,6 +21,7 @@
def initialize():
    if not use_vllm:
        app.include_router(slim_router)
Do we also want to do it if there is a local llama? I'd only add it if use_oci is true.
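For illustration, a minimal sketch of that suggested gating, assuming a use_oci flag is exposed from the same config module as use_vllm (the import path below is hypothetical):

```python
# Sketch of the suggested gating, not the change in this PR.
# The use_oci flag and its import path are assumptions.
from skynet.env import use_oci  # hypothetical location, alongside use_vllm


def initialize():
    if use_oci:
        app.include_router(slim_router)
```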
Having it doesn't prevent you from trying to test the completions API exposed by ollama, but it comes with the advantage of being able to test locally what we're exposing on our /openai route.
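As a usage note, testing that locally could look roughly like this; the host, port, and /openai prefix are assumptions based on the route mentioned above, not confirmed by this PR:

```python
# Hypothetical local smoke test of the fallback route; host, port and
# the /openai prefix are assumptions, not verified against this PR.
import requests

resp = requests.post(
    'http://localhost:8000/openai/v1/chat/completions',
    json={'messages': [{'role': 'user', 'content': 'hello'}]},
)
print(resp.json())
```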
skynet/modules/ttt/openai_api/app.py (outdated)
__all__ = ['app', 'initialize', 'is_ready']
Weird indent
@router.post('/v1/chat/completions')
async def create_chat_completion(chat_request: ChatCompletionRequest, request: Request):
Does this work with streaming requests?
No, and I don't think our ainvoke does either, since langchain exposes a separate astream method; I'm planning that for a future change.
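Not part of this PR, but a rough sketch of what that future streaming path could look like using langchain's astream and FastAPI's StreamingResponse; the model wiring (llm), the request schema, and the OpenAI-style chunk payload below are illustrative assumptions, not skynet's actual code:

```python
# Hypothetical streaming variant; all names and wiring here are assumptions.
import json

from fastapi import APIRouter, Request
from fastapi.responses import StreamingResponse
from langchain_openai import ChatOpenAI
from pydantic import BaseModel

router = APIRouter()
llm = ChatOpenAI()  # assumed to be configured elsewhere in the app


class ChatCompletionRequest(BaseModel):
    messages: list[dict]
    stream: bool = False


@router.post('/v1/chat/completions')
async def create_chat_completion(chat_request: ChatCompletionRequest, request: Request):
    async def event_stream():
        # langchain's astream yields message chunks as they arrive
        async for chunk in llm.astream(chat_request.messages):
            payload = {'choices': [{'delta': {'content': chunk.content}}]}
            yield f'data: {json.dumps(payload)}\n\n'
        yield 'data: [DONE]\n\n'

    if chat_request.stream:
        return StreamingResponse(event_stream(), media_type='text/event-stream')

    # non-streaming path, roughly what the PR does today via ainvoke
    result = await llm.ainvoke(chat_request.messages)
    return {'choices': [{'message': {'role': 'assistant', 'content': result.content}}]}
```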
My worry is that if the consumers of this are already using it, this will break for them, won't it?
We only have one consumer and I already studied how they're using it, hence the small scope for now.