Retrieve entries from public databases

Description

This tool retrieves sequence files or other database entries from public databases. The available databases are:

  • UniProt
  • EMBL
  • Ensembl Gene
  • Ensembl Transcript
  • Ensembl Genomes Gene
  • Ensembl Genomes Transcript
  • InterPro
  • MEDLINE
  • PDB
  • RefSeq nucleotide
  • RefSeq protein
  • Taxonomy
  • Trace Archive

Parameters

  • Name or ID of the sequence or database entry: the entry name or accession number to retrieve. It may include the wildcard character (*).
  • Database: the database to retrieve the entry from.

Details

Note that this tool retrieves data based only on entry names or accession numbers. You can use the wildcard character (*) to retrieve several entries whose names share a common part. For example, with the UniProt database, the name ABC* retrieves all UniProt entries whose names start with ABC.

Examples:

  • UniProt: Q9HYL2, CASA1_HUMAN, cy550_*
  • EMBL: AF391113
  • InterPro: IPR009056
  • MEDLINE (PubMed ID): 12729583
  • PDB: 1crn
  • RefSeq protein: WP_010397534
  • Ensembl Gene: ENSDARG00000028099
  • Taxonomy: 6221, 941271
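
The wildcard behaviour described above can be illustrated with a minimal Python sketch. It is not the tool's own implementation: the function names, the list of entry names, and the case-insensitive comparison are all assumptions made for illustration. The sketch only shows the idea that * stands for any run of characters, so ABC* matches names that start with ABC.

```python
import re

def wildcard_to_regex(pattern):
    """Turn a name pattern into a regular expression in which * matches any run of characters."""
    # Escape every character first so that only * keeps a special meaning.
    escaped = re.escape(pattern)
    # Case-insensitive matching is an assumption, not documented tool behaviour.
    return re.compile(escaped.replace(r"\*", ".*"), re.IGNORECASE)

def match_entries(pattern, entry_names):
    """Return the entry names that match the whole pattern."""
    regex = wildcard_to_regex(pattern)
    return [name for name in entry_names if regex.fullmatch(name)]

# Hypothetical entry names, for illustration only.
names = ["ABC1_HUMAN", "ABCA2_MOUSE", "XABC_YEAST", "CY550_THEEB"]

print(match_entries("ABC*", names))     # ['ABC1_HUMAN', 'ABCA2_MOUSE']
print(match_entries("cy550_*", names))  # ['CY550_THEEB']
```

A pattern without a wildcard matches only the exact name, which corresponds to retrieving a single entry.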

Output

The output is a text file whose format depends on the selected database. For sequence databases, the tool retrieves the full database entry, including all annotation. This data can be converted to FASTA format with the Sequence format conversion tool.
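
To show what the difference between a full entry and FASTA format means in practice, here is a rough Python sketch that reduces one UniProt- or EMBL-style flat-file entry to a FASTA record. It is not how the Sequence format conversion tool works internally. The function name flatfile_to_fasta and the toy entry (including its name, accession and checksum) are made up for illustration, and the sketch assumes the usual flat-file layout: an ID line, an SQ line that opens the sequence block, and a // terminator.

```python
def flatfile_to_fasta(entry_text, width=60):
    """Convert one UniProt/EMBL-style flat-file entry to a FASTA record (illustrative only)."""
    entry_id = None
    residues = []
    in_sequence = False
    for line in entry_text.splitlines():
        if line.startswith("ID "):
            # The entry name is the first token after the ID line code.
            entry_id = line.split()[1].rstrip(";")
        elif line.startswith("SQ "):
            # Everything after the SQ line, up to //, is sequence data.
            in_sequence = True
        elif line.startswith("//"):
            break
        elif in_sequence:
            # Keep letters only; this drops spaces and any position counters.
            residues.append("".join(ch for ch in line if ch.isalpha()))
    if entry_id is None or not residues:
        raise ValueError("no ID line or sequence block found")
    sequence = "".join(residues)
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + entry_id + "\n" + "\n".join(chunks) + "\n"

# Made-up entry for illustration; not a real database record.
toy_entry = """\
ID   TOY1_HUMAN              Reviewed;          24 AA.
AC   X00001;
DE   Made-up example entry.
SQ   SEQUENCE   24 AA;  0 MW;  0000000000000000 CRC64;
     MKTAYIAKQR QISFVKSHFS RQLE
//
"""

print(flatfile_to_fasta(toy_entry), end="")
```

All annotation lines (AC, DE and so on) are discarded in the FASTA record, which is why the full entry is the better starting point when the annotation is needed.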