-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Setting up blast databases in TBro in Docker in Amazon AWS Lightsail #49
Comments
you should run phing from inside the TBro main container - if you installed everything according to the documentation, you can enter that container using if you are still missing it, you should be able to install it using inside that container, running in any case, the folder to run phing from is the folder where the |
I tried to follow your installation directions closely when I installed TBRo - and I have reinstalled repeatedly without success when I get to the phing command. Phing appears to be installed but a 'command not found' error is given when I try to run phing in the TBro directory that has the build.xml file - or in any directory in TBro. Following your directions above: ubuntu@ip-172-26-13-108:~$ docker exec -it TBro_official /bin/bash oot@9b953c8ae04e: /root@9b953c8ae04e:/# phing queue-install-db oot@9b953c8ae04e: /root@9b953c8ae04e:/# composer global require phing/phing oot@9b953c8ae04e: /root@9b953c8ae04e:/# cd home/tbro/ oot@9b953c8ae04e: /home/tbroroot@9b953c8ae04e:/home/tbro# ls oot@9b953c8ae04e: /home/tbroroot@9b953c8ae04e:/home/tbro# phing queue-install-db ...I just realized the file phing will generate is already in the directory - queue_config.example.sql ...it looks like queue_config.example.sql was generated when things were built by screen output: Buildfile: /home/tbro/build.xml tbro > queue-install-db:
BUILD FINISHED Total time: 0.4795 seconds |
Two new questions: 1) To move zipped blast database files into my docker container using
how can I determine what the values of the three variables $WORKERFTP_FTP_USER are for my Docker container? When I run set in TBro, nothing shows up for the three variables: oot@dddabdc84640: /root@dddabdc84640:/# set | grep WORKERFTP so I'm not sure where to find values for the them. I tried $WORKERFTP_ENV_FTP_USER in the curl command but it didn't seem to work:
curl: (67) Access denied: 530 when run from Ubuntu 2) How do I "run the queue_config.sql commands in your queue database." ? Thank-you! |
I think I'll ping @greatfireball or @iimog on this, this is getting too specialized with the setup for me now, as they created the docker containers. |
Hi @000generic, You are also right regarding the environment variables (I will update them in the docs). However, the curl command should work from TBro. Can you please try again this one:
To import the content of the
|
Getting closer.... I was able to run both the curl and PGPASSWORD commands successfully now - but nothing is showing up in TBro as a blast database to blast against. Specifically, I did the following: cd /sono/peptides # this is where I placed by zipped blast databases cd /home/tbro -- database files available. name is the name it will be referenced by, md5 is the zip file's sum, download_uri specifies where the file can be retreived ...etc -- contains information which program is available for which program. ...etc PGPASSWORD=worker psql -U worker -h 172.17.0.3 -p 5432 <queue_config.sql I then tried to blast in TBro but no databases were offered as an option. |
So sorry, another lack of documentation. Thank you very much for your endurance and for reporting all the problems. This helps a lot in improving the documentation. |
Great! Now the blast databases are showing up in TBro - Thank-you :) ....but I think I have to correct the uri I am giving TBro, which I had guessed at after curling my zipped Blast database files into Docker. curl --data-binary --ftp-pasv --user tbro:ftp -T blastdb-barnacle-AA.zip ftp://172.17.0.4/ When I then configure the queue.config.sql file with: ('barnacle-AA4', '50e7cb5a77f37641a648edc59abcc11a', 'ftp://172.17.0.4/blastdb-barnacle-AA.zip'), TBro throws an error: There has been an error processing your job. Please review your job. If this keeps happening, notify the administrator. These errors occured: and when I configure the queue.config.sql file with: ('barnacle-AA5', '50e7cb5a77f37641a648edc59abcc11a', 'http://172.17.0.4/blastdb-barnacle-AA.zip'), TBro seems to hang up: Blast Results Your job is currently being processed. Please wait a moment. The page does an initial refresh saying it is one of one in queue - and then doesn't seem to refresh any more - and remains stalled after many minutes. |
OK, now we are really closing in on this. The blastdb is visible in TBro and the download of the zip file seems to work as well. The ftp configuration is the correct one. The problem now is that after unpacking the zip file the blastdb files are not found. How are those named?
use with care as it will also remove all past and present blast jobs from the database. Then re-import an sql file with the two sections for |
Genius! That works great - now I am blasting against my blast databases :) ....however, while the blast hits show visual alignments with many good hits, they are are not showing any isoform information (instead under the Name column in the blast report the hits all say 'No') and the link in 'No' just goes to the TBro landing page. This is true even when blasting, for instance, a protein that was used in building the blast database. When I search the same protein in TBro based on its id, the protein is returned as an isoform with a link that takes me to its TBro webpage. I checked and identifiers used in 1) the imported fasta files, 2) imported identifiers, 3) imported .tbl files, and4) in fastas used to build the blast databases are all the same. For instance: barnacle-ee100-aa So, it seems like the blast job is successful but the hits generated are not linking back to the TBro databases. I'll try rebuilding TBro again from the ground up but most likely I need to modify something somewhere along the way. Once we have all this worked out, I can generalize the steps and provide them to you - or post in GitHub etc. I think the combination of free/cheap easy up/easy down Amazon cloud + TBro is really great. Rather than a long-term repository, often times its useful to make things available to collaborators (or myself) with many updates for just a few days to months, and I think the Amazon/Docker/TBro combo is going to be a great way to do this. There is already growing interest from others here at the Marine Biological Laboratory. |
Nice! Happy to hear that it is finally working. The issue with showing "No" in the Name column is indeed very strange. TBro tries to map the name of the blast hit to an internal ID but even if it fails it should still show the original ID of the hit. This ID is parsed out of the Blast result xml. Something seems to go wrong there. Would you mind sharing the xml result? You can get this by calling the webservice directly via:
How does a I hope to sort out this last problem as well. A step by step guide for TBro on AWS would be really cool. If you don't mind I'd suggest including it as a separate section in the official documentation. Your contribution in improving and disseminating TBro is very much appreciated. |
Sure - here is the xml file. It looks like 'No' is short for 'No definition line' - I'll try rebuilding things and see if I can get lucky and solve anything. { |
Thanks for sharing. I think I found the problem. When creating a blast database from fasta via So when you are rebuilding could you please re-generate the blast databases without the I think in general it is more appropriate to have blast databases that use |
Great detective work! I haven't tested it yet but I think that makes sense. I agree, its generally more appropriate/useful to have a blast database that is setup with I'll test a new blast database set up in TBro next... It works fantastic!!! Now on to Expression Search :) |
Hi! Thanks for your help on the last two issues :)
I'm now having trouble running phing to set up the blast databases.
From the documentation, I'm not sure if I should be running phing at the TBro command line - or at the Ubuntu command line. At the TBro command line, phing can not be run or installed in my hands. At the Ubuntu command line, phing can be installed but does not run successfully.
Also, I don't know how to locate TBro directories when I am not at the TBro command line. Is it possible to enter TBro directories when I am at the Ubuntu command line?
I'm also unsure what you mean by "main TBro directory" in the documentation. Is this the default directory when I start up TBro? Or the directory I created to store my data in?
Details follow:
In TBro:
First I should move to my "main TBro directory" - I am guessing this is the directory I created to store all my data in when I set up TBro...?
cd /squid
I then follow TBro documentation instructions but the phing command is not found when run at the TBro command line:
oot@9b953c8ae04e: /root@9b953c8ae04e:/# phing queue-install-db
bash: phing: command not found
When I try to install phing I get the following error:
sudo apt-get update
sudo apt-get install phing
Reading package lists... Done
Building dependency tree
Reading state information... Done
E: Unable to locate package phing
so I'm not sure how to install phing at the TBro command line.
Outside TBro
I am able to install phing at the Ubuntu command line but I don't know how to locate my main TBro directory inside of Docker - is it possible to enter TBro directories from outside Docker/TBro? When I run the required phing command in different folders I get the following error:
ubuntu@ip-172-26-13-108:/$ phing queue-install-db
Buildfile: build.xml does not exist!
I'm not sure what this error is in reference to but maybe its related to being in the wrong directory?
So I'm not sure 1) what directory I should be running phing in, 2) if I should run phing the Ubuntu or in TBro command line, and 3) if I should run phing in TBro, I'm not sure how to install it.
Any suggestions would be greatly appreciated.
Thank-you
The text was updated successfully, but these errors were encountered: