Skip to content

Fill data from uniprot

rtobes edited this page Aug 29, 2011 · 8 revisions

Completes protein data performing HTTP requests to Uniprot site. (this program can be replaced by a WS with Bio4j servlets as end-point) The jar file can be downloaded here

Arguments for the program:

  1. Name of the XML file with predicted genes
  2. Output XML filename with uniprot data incorporated

Sample of String used for the request:

http://www.uniprot.org/uniprot/?query=accession:Q8YZR1&format=tab&columns=protein names,organism,comment(FUNCTION),ec,interpro,go,pathway,families,keywords,length,subcellular locations,citation

Information retrieved from Uniprot and associated to predicted genes includes:

  1. Protein names
  2. Gene names
  3. Organism
  4. Comment (FUNCTION)
  5. EC numbers
  6. InterPro
  7. Gene Ontology
  8. Comment (Pathway)
  9. Protein family
  10. Keywords
  11. Length
  12. Subcellular locations
  13. PubMed ID
  14. Comment (mass spectrometry)
  15. UniRef 100 entry
  16. UniRef 90 entry
  17. Uniref 50 entry
Clone this wiki locally