Python SDK for Laminar.
Laminar is an open-source platform for engineering LLM products. Trace, evaluate, annotate, and analyze LLM data. Bring LLM applications to production with confidence.
Check our open-source repo and don't forget to star it ⭐
First, install the package, specifying the instrumentations you want to use.
For example, to install the package with OpenAI and Anthropic instrumentations:
pip install 'lmnr[anthropic,openai]'
To install all possible instrumentations, use the following command:
pip install 'lmnr[all]'
Initialize Laminar in your code:
from lmnr import Laminar
Laminar.initialize(project_api_key="<PROJECT_API_KEY>")
You can also skip passing the project_api_key, in which case it will be looked up in the environment (or a local .env file) under the key LMNR_PROJECT_API_KEY.
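For example, assuming LMNR_PROJECT_API_KEY is already set in your environment (or .env file), initialization can be as simple as:

from lmnr import Laminar

# The project API key is read from the LMNR_PROJECT_API_KEY environment variable
Laminar.initialize()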
Note that you only need to initialize Laminar once in your application. Do this as early as possible, e.g. at server startup.
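As a minimal sketch, assuming a FastAPI server (FastAPI is only an illustration here, it is not required by Laminar), initialization at startup could look like:

import os
from contextlib import asynccontextmanager
from fastapi import FastAPI
from lmnr import Laminar

@asynccontextmanager
async def lifespan(app: FastAPI):
    # Initialize Laminar once, before the server starts handling requests
    Laminar.initialize(project_api_key=os.environ["LMNR_PROJECT_API_KEY"])
    yield

app = FastAPI(lifespan=lifespan)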
If you self-host a Laminar instance, the default connection settings to it are http://localhost:8000 for HTTP and http://localhost:8001 for gRPC. Initialize the SDK accordingly:
from lmnr import Laminar
Laminar.initialize(
    project_api_key="<PROJECT_API_KEY>",
    base_url="http://localhost",
    http_port=8000,
    grpc_port=8001,
)
To instrument any function in your code, we provide a simple @observe() decorator. This can be useful if you want to trace a request handler or a function that combines multiple LLM calls.
import os
from openai import OpenAI
from lmnr import Laminar, observe

Laminar.initialize(project_api_key=os.environ["LMNR_PROJECT_API_KEY"])

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

def poem_writer(topic: str):
    prompt = f"write a poem about {topic}"
    messages = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": prompt},
    ]
    # OpenAI calls are still automatically instrumented
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=messages,
    )
    poem = response.choices[0].message.content
    return poem

@observe()
def generate_poems():
    poem1 = poem_writer(topic="laminar flow")
    poem2 = poem_writer(topic="turbulence")
    poems = f"{poem1}\n\n---\n\n{poem2}"
    return poems
Also, you can use Laminar.start_as_current_span if you want to record a chunk of your code using a with statement.
def handle_user_request(topic: str):
    with Laminar.start_as_current_span(name="poem_writer", input=topic):
        poem = poem_writer(topic=topic)
        # Use set_span_output to record the output of the span
        Laminar.set_span_output(poem)
Laminar allows you to automatically instrument the majority of the most popular LLM, vector DB, database, requests, and other libraries.
If you want to automatically instrument a default set of libraries, then simply do NOT pass the instruments argument to .initialize().
See the full list of available instrumentations in the Instruments enum.
If you want to automatically instrument only specific LLM, Vector DB, or other calls with OpenTelemetry-compatible instrumentation, then pass the appropriate instruments to .initialize().
For example, if you want to only instrument OpenAI and Anthropic, then do the following:
import os
from lmnr import Laminar, Instruments

Laminar.initialize(
    project_api_key=os.environ["LMNR_PROJECT_API_KEY"],
    instruments={Instruments.OPENAI, Instruments.ANTHROPIC},
)
If you want to fully disable any kind of autoinstrumentation, pass an empty set as instruments=set() to .initialize().
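For example, a minimal sketch with every autoinstrumentation disabled:

import os
from lmnr import Laminar

# An empty set disables all automatic instrumentation;
# spans can still be created manually with @observe() or start_as_current_span
Laminar.initialize(
    project_api_key=os.environ["LMNR_PROJECT_API_KEY"],
    instruments=set(),
)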
Autoinstrumentations are provided by Traceloop's OpenLLMetry.
Install the package:
pip install lmnr
Create a file named my_first_eval.py with the following code:
from lmnr import evaluate

def write_poem(data):
    return f"This is a good poem about {data['topic']}"

def contains_poem(output, target):
    return 1 if output in target['poem'] else 0

# Evaluation data
data = [
    {"data": {"topic": "flowers"}, "target": {"poem": "This is a good poem about flowers"}},
    {"data": {"topic": "cars"}, "target": {"poem": "I like cars"}},
]

evaluate(
    data=data,
    executor=write_poem,
    evaluators={
        "containsPoem": contains_poem
    },
    group_id="my_first_feature"
)
Run the following commands:
export LMNR_PROJECT_API_KEY=<YOUR_PROJECT_API_KEY> # get from Laminar project settings
lmnr eval my_first_eval.py # run in the virtual environment where lmnr is installed
Visit the URL printed in the console to see the results.
Bring rigor to the development of your LLM applications with evaluations.
You can run evaluations locally by providing an executor (part of the logic used in your application) and evaluators (numeric scoring functions) to the evaluate function.
evaluate takes in the following parameters:
* data – an array of EvaluationDatapoint objects, where each EvaluationDatapoint has two keys: target and data, each containing a key-value object. Alternatively, you can pass in dictionaries, and we will instantiate EvaluationDatapoints with pydantic if possible.
* executor – the logic you want to evaluate. This function must take data as the first argument and produce any output. It can be either a regular function or an async function.
* evaluators – a dictionary that maps evaluator names to evaluators. Evaluators are functions that take the output of the executor as the first argument and target as the second argument, and produce numeric scores. Each function can produce either a single number or a dict[str, int|float] of scores (see the sketch after this list). Each evaluator can be either a regular function or an async function.
* name – optional name for the evaluation. Automatically generated if not provided.
* group_id – optional group name for the evaluation. Evaluations within the same group can be compared visually side-by-side.
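For instance, here is a sketch of an async evaluator that returns several named scores at once (the score names below are illustrative, not part of the SDK):

async def poem_scores(output, target):
    # An evaluator may be async and may return a dict of named numeric scores
    return {
        "exactMatch": 1 if output == target["poem"] else 0,
        "lengthRatio": len(output) / max(len(target["poem"]), 1),
    }

This could then be passed as evaluators={"poemScores": poem_scores}.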
* If you already have the outputs of executors you want to evaluate, you can specify the executor as an identity function that takes in data and returns only the needed value(s) from it.
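As a minimal sketch (the "output" key in the datapoints is illustrative), scoring pre-computed outputs could look like this:

from lmnr import evaluate

# Outputs were produced elsewhere and stored directly in the datapoints
data = [
    {
        "data": {"output": "This is a good poem about flowers"},
        "target": {"poem": "This is a good poem about flowers"},
    },
]

evaluate(
    data=data,
    # Identity executor: just return the pre-computed value from the datapoint
    executor=lambda data: data["output"],
    evaluators={
        "exactMatch": lambda output, target: 1 if output == target["poem"] else 0,
    },
)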
Read the docs to learn more about evaluations.
Various interactions with the Laminar API are available in LaminarClient and its asynchronous version AsyncLaminarClient.
To run the Laminar agent, you can invoke client.agent.run:
from lmnr import LaminarClient

client = LaminarClient(project_api_key="<YOUR_PROJECT_API_KEY>")

response = client.agent.run(
    prompt="What is the weather in London today?"
)

print(response.result.content)
Agent run supports streaming as well.
from lmnr import LaminarClient

client = LaminarClient(project_api_key="<YOUR_PROJECT_API_KEY>")

for chunk in client.agent.run(
    prompt="What is the weather in London today?",
    stream=True
):
    if chunk.chunkType == 'step':
        print(chunk.summary)
    elif chunk.chunkType == 'finalOutput':
        print(chunk.content.result.content)
With AsyncLaminarClient, the same operations are available asynchronously:

from lmnr import AsyncLaminarClient

client = AsyncLaminarClient(project_api_key="<YOUR_PROJECT_API_KEY>")

response = await client.agent.run(
    prompt="What is the weather in London today?"
)

print(response.result.content)
Streaming works with the async client as well:

from lmnr import AsyncLaminarClient

client = AsyncLaminarClient(project_api_key="<YOUR_PROJECT_API_KEY>")

# Note that you need to await the operation even though we use `async for` below
response = await client.agent.run(
    prompt="What is the weather in London today?",
    stream=True
)
async for chunk in response:
    if chunk.chunkType == 'step':
        print(chunk.summary)
    elif chunk.chunkType == 'finalOutput':
        print(chunk.content.result.content)