chado.load package¶

Module contents¶

Contains loader methods

class chado.load.LoadClient(engine, metadata, session, ci)¶

Bases: chado.client.Client

blast(analysis_id, organism_id, input, blastdb=None, blastdb_id=None, re_name=None, query_type='polypeptide', match_on_name=False, skip_missing=False)¶

Load a blast analysis, in the same way as does the tripal_analysis_blast module

Parameters:	analysis_id (int) – Analysis ID organism_id (int) – Organism ID input (str) – Path to the Blast XML file to load blastdb (str) – Name of the database blasted against (must be in the Chado db table) blastdb_id (int) – ID of the database blasted against (must be in the Chado db table) query_type (str) – The feature type (e.g. ‘gene’, ‘mRNA’, ‘polypeptide’, ‘contig’) of the query. It must be a valid Sequence Ontology term. match_on_name (bool) – Match features using their name instead of their uniquename re_name (str) – Regular expression to extract the feature name from the input file (first capturing group will be used). skip_missing (bool) – Skip lines with unknown features or GO id instead of aborting everything.
Return type:	dict
Returns:	Number of processed hits

go(input, organism_id, analysis_id, query_type='polypeptide', match_on_name=False, name_column=2, go_column=5, re_name=None, skip_missing=False)¶

Load GO annotation from a tabular file, in the same way as does the tripal_analysis_go module

Parameters:	input (str) – Path to the input tabular file to load organism_id (int) – Organism ID analysis_id (int) – Analysis ID query_type (str) – The feature type (e.g. ‘gene’, ‘mRNA’, ‘polypeptide’, ‘contig’) of the query. It must be a valid Sequence Ontology term. match_on_name (bool) – Match features using their name instead of their uniquename name_column (int) – Column containing the feature identifiers (2, 3, 10 or 11; default=2). go_column (int) – Column containing the GO id (default=5). re_name (str) – Regular expression to extract the feature name from the input file (first capturing group will be used). skip_missing (bool) – Skip lines with unknown features or GO id instead of aborting everything.
Return type:	dict
Returns:	Number of inserted GO terms

interpro(analysis_id, organism_id, input, parse_go=False, re_name=None, query_type='polypeptide', match_on_name=False, skip_missing=False)¶

Load an InterProScan analysis, in the same way as does the tripal_analysis_intepro module

Parameters:	analysis_id (int) – Analysis ID organism_id (int) – Organism ID input (str) – Path to the InterProScan file to load parse_go (bool) – Load GO annotation to the database query_type (str) – The feature type (e.g. ‘gene’, ‘mRNA’, ‘polypeptide’, ‘contig’) of the query. It must be a valid Sequence Ontology term. match_on_name (bool) – Match features using their name instead of their uniquename re_name (str) – Regular expression to extract the feature name from the input file (first capturing group will be used). skip_missing (bool) – Skip lines with unknown features or GO id instead of aborting everything.
Return type:	dict
Returns:	Number of processed hits