The AlloyDB for PostgreSQL for LlamaIndex package provides a first class experience for connecting to AlloyDB instances from the LlamaIndex ecosystem while providing the following benefits:
- Simplified & Secure Connections: easily and securely create shared connection pools to connect to Google Cloud databases utilizing IAM for authorization and database authentication without needing to manage SSL certificates, configure firewall rules, or enable authorized networks.
- Better integration with AlloyDB: built-in methods to take advantage of AlloyDB's advanced indexing and scalability capabilities.
- Improved metadata handling: store metadata in columns instead of JSON, resulting in significant performance improvements.
- Clear separation: clearly separate table and extension creation, allowing for distinct permissions and streamlined workflows.
In order to use this library, you first need to go through the following steps:
- Select or create a Cloud Platform project.
- Enable billing for your project.
- Enable the AlloyDB API.
- Setup Authentication.
Install this library in a virtualenv using pip. virtualenv is a tool to create isolated Python environments. The basic problem it addresses is one of dependencies and versions, and indirectly permissions.
With virtualenv, it's possible to install this library without needing system install permissions, and without clashing with the installed system dependencies.
Python >= 3.9
pip install virtualenv
virtualenv <your-env>
source <your-env>/bin/activate
<your-env>/bin/pip install llama-index-alloydb-pg
pip install virtualenv
virtualenv <your-env>
<your-env>\Scripts\activate
<your-env>\Scripts\pip.exe install llama-index-alloydb-pg
Code samples and snippets live in the samples/ folder.
Use a vector store to store embedded data and perform vector search.
import google.auth
from llama_index.core import Settings
from llama_index.embeddings.vertex import VertexTextEmbedding
from llama_index_alloydb_pg import AlloyDBEngine, AlloyDBVectorStore
credentials, project_id = google.auth.default()
engine = await AlloyDBEngine.afrom_instance(
"project-id", "region", "my-cluster", "my-instance", "my-database"
)
Settings.embed_model = VertexTextEmbedding(
model_name="textembedding-gecko@003",
project="project-id",
credentials=credentials,
)
vector_store = await AlloyDBVectorStore.create(
engine=engine, table_name="vector_store"
)
Use a document store to make storage and maintenance of data easier.
from llama_index_alloydb_pg import AlloyDBEngine, AlloyDBDocumentStore
engine = await AlloyDBEngine.afrom_instance(
"project-id", "region", "my-cluster", "my-instance", "my-database"
)
doc_store = await AlloyDBDocumentStore.create(
engine=engine, table_name="doc_store"
)
Use an index store to keep track of indexes built on documents.
from llama_index_alloydb_pg import AlloyDBIndexStore, AlloyDBEngine
engine = await AlloyDBEngine.from_instance(
"project-id", "region", "my-cluster", "my-instance", "my-database"
)
index_store = await AlloyDBIndexStore.create(
engine=engine, table_name="index_store"
)
Contributions to this library are always welcome and highly encouraged.
See CONTRIBUTING for more information how to get started.
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms. See Code of Conduct for more information.
Apache 2.0 - See LICENSE for more information.
This is not an officially supported Google product.