-
Notifications
You must be signed in to change notification settings - Fork 11
LatticeGrid Installation
LatticeGrid is written using Ruby on Rails as the MVC framework and PostgreSQL as the backend relational database. The basic installation supports the latest stable build of Rails (at the time of writing 2.3.2) but does not work (or at least is not tested) in pre 2.0 versions of Rails. Fortunately it is easy to have multiple versions of Rails on a single server.
The install process scripts are documented here LatticeGrid Scripts
Moving the LatticeGrid database from one server to another is documented here LatticeGrid server migration
LatticeGrid is getting easier to install, although creating the organizational units, importing investigators, and creating the membership lists is still a bit more 'fussy' than it should be. Here is the quick skinny. Please open and review all the files so you get an overview of the full installation and customization process.
I don't really describe how to run a rails app - if this is something new for you or your organization please read some of the excellent documentation available though the Ruby on Rails project at http://rubyonrails.org/. If you are familiar with Rails, go through the '/doc/rails_installation.txt' file.
Again, I don't cover actually installing Postgres - please read the excellent Postgres documentation. Once you have Postgres installed (LatticeGrid has been tested on 7.1 through 8.2 and the dependencies are only limited by the rails drivers), follow the instructions in the '/doc/database_initiation.txt' file. I list some various practices for Postgres configuration - for the public data available from LatticeGrid I don't believe it is necessary to password protect the Postgres account. However, be aware that as of Rails 2.2, running the test scripts requires your Postgres user run as the superadmin - this is the equivalent to root for the postgres account, so if you want to run tests you should be aware of this. In production you shouldn't be running the testing framework anyway, so not giving your Postgres user superadmin privileges should be fine.
See database details for more information on how the models as instantiated in the database and the 'Rails Way'.
There are only a handful of files you will need to edit, including the config/database.yml file where you configure how to access the Postgres database, lib/config.rb where you configure default information and messages from your organization and of course images and stylesheets to customize the look&feel of LatticeGrid. The files are described in the file '/doc/customization.txt'.
As I mentioned, this is the most 'tweaky' part of setting up LatticeGrid. You can either write your own migration scripts for directly inserting your organization structure, your investigators, and the membership or appointment relationships, or use the file importing rake tasks to populate these tables, or a hybrid. I will put some additional information about these structures and methods on the wiki shortly. In the meantime, please look at the sample migration scripts in /db/sample_migrations/ and at the files to import using rake tasks in /db/imports A list of the tasks for importing organizations and people is given in '/doc/orgs_and_people_setup.txt' file
If you have made it this far, it now becomes pretty easy. If you want to import 10 years worth of data, run the rake task:
rake insertAllAbstracts
That will query PubMed for all the investigators in LatticeGrid, optionally using the PubMed search string for each investigator in pubmed_search_name or a synthetic search string comprised of last_name first_name first letter and middle_name first letter. You can limit the search globally to your institution in the 'lib/pubmed_config.rb' file by setting the variable @limit_to_institution to true or false. You can override this for a specific investigator by setting Investigator.pubmed_limit_to_institution to true for that individual. Please note that currently the switch is conservative - that is, if either @limit_to_institution or Investigator.pubmed_limit_to_institution is true, the search will be limited to publications listing your institution. You need to set the PubMed institutional search string variable @institutional_limit_search_string in the '/lib/pubmed_config.rb' file. The Northwestern search string is shown below:
@institutional_limit_search_string = '( "Northwestern University"[affil] OR "Feinberg School"[affil] OR "Robert H. Lurie Comprehensive Cancer Center"[affil] OR "Northwestern Healthcare"[affil] OR "Children''s Memorial"[affil] OR "Northwestern Memorial"[affil] OR "Northwestern Medical"[affil])'
After populating the abstracts, you can run the nightly import tasks.
rake nightlyBuild
The first time you run LatticeGrid, you will also want to build the more complex MeSH information content relationships. These are encapsulated in the 'monthlyBuild' rake task, which as the name indicates you only need to run monthly. Depending on the number of investigators and publications, this may take more the 20 hours to run.
rake monthlyBuild
Once you have confirmed that the data is what you expect, you will want to run these tasks as a cron job. You can use the example shell file, '/update_publication.sh', for some of the example commands you will want to have in your cron shell script. The full description of the import and run tasks are in the file '/doc/publications_scripts.txt'.
Finally, as you move to production with LatticeGrid, you will want to build a full set of cached pages. There are rake tasks to do this. The cached pages provide two benefits: LatticeGrid is exceptionally responsive when the pages have been cached, and secondly it is still available during the extensive monthly rebuild process. After each nightly or monthly build, you will want to run the following tasks to rebuild the cache:
cd /*LatticeGrid home directory*
rake RAILS_ENV=production nightlyBuild >> rake_results.txt
#clean up the database to keep queries running smoothly
vacuumdb -fz cancerpublications_production -U cancerpublications
# monthly run this in another shell script
# rake RAILS_ENV=production monthlyBuild >> monthly_rake_results.txt
#rebuild the application cache
rake RAILS_ENV=production tmp:cache:clear
rake RAILS_ENV=production cache:clear
rake RAILS_ENV=production cache:populate taskname=abstracts > buildCache.txt
rake RAILS_ENV=production cache:populate taskname=investigators >> buildCache.txt
rake RAILS_ENV=production cache:populate taskname=orgs >> buildCache.txt
rake RAILS_ENV=production cache:populate taskname=investigator_graphs >> buildCache.txt
rake RAILS_ENV=production cache:populate taskname=org_graphs >> buildCache.txt
You can review the results of the cache process in the file 'buildCache.txt' in your LatticeGrid home directory. Run this manually the first time so you can review any errors
There are three ways to import investigators into LatticeGrid - insert them directly in a Rails migration, add them through an LDAP connection to a local LDAP repository, or bulk import them using a file.
The attributes available for the model are in the model file: LatticeGrid/db/migrate/20080910_create_investigators.rb
Examples of each are shown below.
First you need to create a new migration file, using the generate helper
- script/generate migration insert_investigators
You can then edit that file and add an investigator so the up method looks like this
class InsertInvestigators < ActiveRecord::Migration
def self.up
Investigator.create :username => "jjones",
:last_name => "Jones",
:first_name => "Jonathan",
:middle_name => "C",
:pubmed_search_name => "Jones J C",
:pubmed_limit_to_institution => true,
:email => "[email protected]"
end
def self.down
Investigator.delete_all
end
end
You will need to install the ruby ldap gem:
gem install ruby-net-ldap
You may need to register the gem source http://rubyforge.org/projects/net-ldap/ before the install will work.
The call to import is very simple once the configuration has been completed:
rake getLDAP uid_list=aqw,wakibbe
Since this is called as a rake task, the config is in LatticeGrid/lib/ldap_config.rb An example record is listed, and the connection parameters to connect to the EDUCAUSE-compatible LDAP repository at Northwestern University.
@ldap_connection = Net::LDAP.new :host => "directory.northwestern.edu"
@ldap_treebase = "ou=People, dc=northwestern,dc=edu"
Depending on the format of your institution's LDAP repository, you may need to change the adaptor in LatticeGrid/lib/ldap_utilities.rb
# may need to adapt the ldap attributes to the Investigator data model
def InsertPI (pi_data)
puts "InsertPI: this shouldn't happen - pi_data was nil" if pi_data.nil?
raise "InsertPI: this shouldn't happen - pi_data was nil" if pi_data.nil?
thePI = nil
thePI = Investigator.find_by_username(pi_data.uid)
begin
if thePI.nil? || thePI.id < 1 then
thePI = Investigator.create! (
:username => pi_data["uid"],
:last_name => pi_data["sn"],
:middle_name => pi_data["nuMiddleName"],
:first_name => pi_data["givenName"],
:title => pi_data["title"],
:business_phone => pi_data["telephoneNumber"],
:employee_id => pi_data["employeeNumber"],
:address1 => pi_data["postalAddress"],
:mailcode => pi_data.ou[0],
:email => pi_data["mail"]
)
thePI.save!
end
rescue ActiveRecord::RecordInvalid => error
puts "InsertPI: raised an error for an investigator with the id of '#{pi_data.uid} with an error of #{error.inspect}"
if thePI.nil? then # something bad happened
puts "InsertPI: unable to find or insert investigator with the id of '#{pi_data.uid}"
raise "InsertPI: unable to find or insert investigator with the id of '#{pi_data.uid}"
end
end
thePI
end
There is also a file import mechanism that parses either a CSV or a tab-delimited text file. To use this method you must install the ruby fastercsv gem:
gem install fastercsv
You can then run the following rake task to import a file. The file can be prefixed with a relative path. No path means the file is located in your root directory. For instance, if you have the file investigators.txt in the LatticeGrid directory, you can load it with the rake task:
rake importInvestigators file=investigators.txt
If you want to couple your investigators to your LDAP records, you should be sure to include a username column with the LDAP uid for each investigator in this column.
The file lib/file_utilities does most of the work, and the expected column headers for the CSV or tab-delimited file include:
employee_id => pi.employee_id
username or netid => pi.username
email => pi.email
name in the format first_name middle_name last_name suffix where suffix is jr, sr, I, II or III OR
last_name => pi.last_name
first_name => pi.first_name
mi => pi.middle_name
dept/div => somewhat odd and is split between pi.home_department, pi.division, and pi.secondary
category => pi.appointment_type
career_track => pi.appointment_track
pubmed_search_name => pi.pubmed_search_name
pubmed_limit_to_institution => pi.pubmed_limit_to_institution
campus => pi.campus
If a column with the name of program exists, the Program model will be checked for a match, and if no match is found an entry is made for that program, and an InvestigatorProgram entry is also made.