Beta Testing #6
matinnuhamunada
started this conversation in
General
Replies: 2 comments 3 replies
-
|
Beta Was this translation helpful? Give feedback.
1 reply
-
Super impressed with the ease of set up! But I managed to break it with the first question ;) I asked how many streptomyces in the dataset and it ran this:
Which I think is correct, but because it using the GTDB taxonomy, I think the search should be 'g__Streptomyces'. What do you think? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
[UPDATE] - New Database Structure
We have finished the BGCFlow run for the G1034 dataset. Unfortunately, two genomes were dropped because of an issue with the annotation (inconsistent exon ordering for features):
NBC_0131000000000_213589.current.gb
)NBC_0108000000000_76298.current.gb
).Therefore, currently we only have result ready for the 1032 genomes (
~/net/o-drive/CFB-S-NewBioactiveCompounds/11_G1000/bgcflow/G1034_20240208/data/processed/G1032_20240208
).The new database structure now host two data schema, one for BGCFlow and another for antiSMASH. In the SQL query, this ca be fetched by explicitly mentioning the schema, for example
bgcflow.cdss
andantismash.cdss
are two different tables. See at the bottom on how it looks like in DBeaver.For lab members who want to try ChatBGC using the G1032 dataset, follow this step:
Get access to OpenAI API
To get the OpenAI API key, you can ask @matinnuhamunada for a key or use your own API.
Then, run this on your terminal
OPENAI_API_KEY="<change this to your API Key>"
Installing ChatBGC
Assuming you are using one of our shared machines and have python installed, follow this steps:
Running ChatBGC
Training (do this once!)
# Train ChatBGC (Do it once) chatbgc train --llm_type openai_chat --model gpt-4o G1032_dbt_bgcflow.duckdb
Running the chatbot
# Run ChatBGC chatbgc run --llm_type openai_chat --model gpt-4o G1032_dbt_bgcflow.duckdb
Run using streamlit (experimental)
# Run ChatBGC chatbgc run_streamlit --llm_type openai_chat --model gpt-4o G1032_dbt_bgcflow.duckdb
Manually exploring the database
If you want to explore the database generated manually, the easiest way is to install DBeaver
Beta Was this translation helpful? Give feedback.
All reactions