Generative AI has big impact on how enterprises conduct business. Oracle offers multiple products to build a Gen AI application such as retrieval augmented generation (RAG). Here is the sample notebook on langchain and a blog that explains how RAG solution can be implemented using Oracle's new database offering Oracle23ai to store embedding vectors.
If you are new to Oracle23ai, there are chances that you might encounter issues connecting to database using python. This documentation provides step-by-step procedure to implement the RAG solution using python3.9 to extract & store the contents of a webpage and create embeddings to store in oracle23ai.
-
Login to Oracle cloud using the credentials provided by the adminstrator
-
Provision an Click on menu on top left - Oracle Database - and select "Oracle Base Database"
2.1 In 'DB System Information' page, you'll be asked select shape, storage, to add/generate/paste SSH Keys and create/select VCN
2.2 In 'Database information' page, Click 'Database image' and select version '23ai'
2.3 Once Database is provisioned, click on the Database provisioned in 'Databases' secton (Overview Oracle Base Database -> DB Systems -> DB System Details)
2.4 Click on Pluggable Databases. Read more about difference between Container Database (CDB) and Pluggable Database here
2.5 Select 'DB Connection'
2.6 Copy the 'Connection String' from 'Easy Connect' Format to a notepad or text editor and keep it handy.
2.7 Navigate to Oracle Base Database -> DB Systems -> DB System Details -> Nodes, copy the 'Public IP address' to a notepad or text editor and keep it handy.
-
If you're using a mac, there are limitations to connect to Oracle23ai database. So, recommend you to provision a Linux compute instance
-
Create a python environment for python version 3.9 or later. Below command will create a python environment named py39 using the environment.yml file located here
conda env create -f environment.yml
-
Download Oracle InstantClient Basic for Linux x86-64 from https://download.oracle.com/otn_software/linux/instantclient/2340000/instantclient-basic-linux.x64-23.4.0.24.05.zip
curl https://download.oracle.com/otn_software/linux/instantclient/2340000/instantclient-basic-linux.x64-23.4.0.24.05.zip --output instantclient.zip
-
Unzip it in an opt directory using sudo
sudo mkdir -p /opt/oracle cd /opt/oracle sudo unzip /opt/oracle/instantclient-basic-linux.x64-23.4.0.24.05.zip
-
Install a linux package libaio1
sudo apt-get install libaio1
-
Add the path the external variable LD_LIBRARY_PATH by adding below line to .bashrc file
export LD_LIBRARY_PATH=/opt/oracle/instantclient_23_4:$LD_LIBRARY_PATH
-
After saving the .bashrc file, source it:
source ~/.bashrc
-
Follow steps in this link and install python-cli
-
Create a config file using
oci setup config
-
Update VCN to allow traffic through ports 1521 and 9200. This is required to connect to 23ai Database.
-
Update 'constants.py' file in 'code' folder with all required information
-
Run 'python code/create_user.py' to create non-admin user in the database
-
Run 'python code/create_embeddings.py' to create embeddings of the webpage that you want to index
-
Run 'python code/generate_answers.py' to generate answers to your question