Skip to content

Fill data from uniprot

ohnoseq edited this page Mar 14, 2011 · 8 revisions

Completes protein data performing HTTP requests to Uniprot site. (this program will soon be replaced by a WS with Bio4j servlets as end-point) The jar file can be downloaded here

Arguments for the program:

  1. Name of the XML file with predicted genes
  2. Output XML filename with uniprot data incorporated

Sample of String used for the request:

http://www.uniprot.org/uniprot/?query=accession:Q8YZR1&format=tab&columns=protein names,organism,comment(FUNCTION),ec,interpro,go,pathway,families,keywords,length,subcellular locations,citation

Information retrieved from Uniprot and associated to predicted genes includes:

  1. Protein names
  2. Gene names
  3. Organism
  4. Comment (FUNCTION)
  5. EC numbers
  6. InterPro
  7. Gene Ontology
  8. Pathway
  9. Protein family
  10. Keywords
  11. Length
  12. Subcellular locations
  13. PubMed ID
  14. Comment (mass spectrometry)
Clone this wiki locally