Make smartBag Support Table Joins #120

stevencox · 2018-03-15T21:43:50Z

smartBag can generate a smartAPI from a BDBag.

But it's very simple and does not support API endpoints that require joining tabular data from multiple files.

stevencox · 2018-04-11T19:37:08Z

@tubafrenzy, please assign a milestone date to this item and update status in the issue.

tubafrenzy · 2018-04-13T19:30:46Z

This will be finished by the end of next week.

tubafrenzy · 2018-04-16T16:26:49Z

@stevencox Do you have a sample join that I could use for testing and development purposes? Seems like the API will need to accept parameters that indicate which column on one data set to join to which column on another data set. For now I am playing around with a dummy metadata file keyed off of the bicluster "index_id" field.

stevencox · 2018-04-16T16:37:06Z

All you need to design the feature is any column shared by two input files.

CTD_chem_gene_ixns.csv header:

# Fields:
# ChemicalName,ChemicalID,CasRN,GeneSymbol,GeneID,GeneForms,Organism,OrganismID,Interaction,InteractionActions,PubMedIDs

CTD_chemicals.csv header:


# Fields:
# ChemicalName,ChemicalID,CasRN,Definition,ParentIDs,TreeNumbers,ParentTreeNumbers,Synonyms,DrugBankIDs

The generated service should allow a query by ChemicalID to return data joining CTD_chemicals and CTD_chem_gene_ixns data. Assume column names are the same.

tubafrenzy · 2018-04-18T20:15:43Z

Noticed that CTD_chem_gene_ixns.csv contains data of the form:

MESH:C533344

while CTD_chemicals.csv seems to have the prefix stripped off:

C025205

This discrepancy isn't completely germane to the development I am doing, but it would mean these tables don't join properly in a demo/example.

Also, as I've been going down this road, I assume the API shoule be able to represent both one-to-one and many-to-one relationships from the perspective of both table queries? Or should they be cleanly married into a single denormalized-type table result from the "many" perspective, with duplicated "one" rows per line?

stevencox · 2018-04-18T20:54:24Z

(a), the normalized, relational approach, not the denormalized.

stevencox assigned tubafrenzy Mar 15, 2018

stevencox mentioned this issue Mar 15, 2018

May 2018 Hackathon Goals #112

Closed

17 tasks

tubafrenzy modified the milestone: m2b.1 Apr 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make smartBag Support Table Joins #120

Make smartBag Support Table Joins #120

stevencox commented Mar 15, 2018

stevencox commented Apr 11, 2018

tubafrenzy commented Apr 13, 2018

tubafrenzy commented Apr 16, 2018

stevencox commented Apr 16, 2018

tubafrenzy commented Apr 18, 2018

stevencox commented Apr 18, 2018

Make smartBag Support Table Joins #120

Make smartBag Support Table Joins #120

Comments

stevencox commented Mar 15, 2018

stevencox commented Apr 11, 2018

tubafrenzy commented Apr 13, 2018

tubafrenzy commented Apr 16, 2018

stevencox commented Apr 16, 2018

tubafrenzy commented Apr 18, 2018

stevencox commented Apr 18, 2018