New job submissions

Using the tools provided on this page, you can create a profile or paste in a pre-created profile, then compare the profile with profiles of more than 350,000 proteins from 89 different genomes. You can create a phylogenetic profile by submitting an amino acid sequence (or its GenBank Identifier), which can then be compared with profiles that are stored in our database.

If you already have a profile, you can search for matching profiles by choosing BLAST Evalue scores for each position, and combining these values with operators, such as 'greater than' or 'equal to', which will present the value accordingly for comparison. Profiles can be searched for by specifying groups (for instance, you can choose presence in all Archaea, and any value for the rest of the genomes), or by specifying values for individual genomes. To avoid a certain numbness of the mind (if you don't want to select an operator and value for each genome), you can even paste in a preconstructed profile (see instructions for the format).

Please note that your results are stored on our system for a period of 24 hrs from the time a Job ID was assigned to you. This Job ID can be used to retrieve your results. Currently, you cannot retrieve profiles that you searched for by specifying Evalues and operators, but this feature will be available shortly. For help, please write to marcotte AT icmb dot utexas.edu

  • I have a Genbank Identifier
  • I have an amino acid sequence
  • I want to search for profiles, and I have a profile that I can paste in
  • I want to search for profiles, and define my own values and operators

  • I have a GenBank Identifier (GI number)

    GI   

    By default, BLAST results are formatted and only information relevant to profile analysis is displayed.
    Do not report BLAST results.
    Report unformatted BLAST results.
     
       

    I have an amino acid sequence

    Paste your amino acid query sequence in the text box below
    (in FASTA format only, one protein at a time, see sample sequence below!!)

    Sample sequence:
    >gi|1787315 flagellar biosynthesis, initiation of hook assembly
    MSIAVTTTDPTNTGVSTTSSSSLTGSNAADLQSSFLTLLVAQLKNQDPTNPMENNELTSQLAQISTVSGI
    EKLNTTLGSISGQIDNSQSLQASNLIGHGVMIPGTTVLAGTGSEEGAVTTTTPFGVELQQAADKVTATIT
    DKNGAVVRTIDIGELTAGVHSFTWDGTLTDGSTAPNGSYNVAISASNGGTQLVAQPLQFALVQGVIRGNS
    GNTLDLGTYGTTTLDEVRQII
    


    By default, BLAST results are formatted and only information relevant to profile analysis is displayed.
    Do not report BLAST results.
    Report unformatted BLAST results 
     
       

    I want to search for profiles, and I have a profile that I can paste in

    Required format:
    Single line with 89 space separated values (eg: 1e-1 1e-2 1e-3 ... 1e-89), where an Evalue >= 1.00, 'N' or missing values indicate absence from the designated genome.

    Note:
    • BLAST E-values become more significant as they decrease.
    • Please make sure that each value corresponds to the appropriate genome, based on this genome list.



    Enter mutual info. score cutoff:    

    Matches to return above selected cutoff: Top 20     Top 50     Top 100    

    Select a genome for comparison:      

      

    • Search for a profile by specifying groups:

    • Use operator and Evalue for all , and

      use operator and Evalue for the rest. Then, compare against

          Display maximum:     results.

        

    • Search for a profile by specifying operators and Evalues for individual genomes

    • Note: Admittedly, this is one of the most difficult to use, but also one of the most powerful tools in PLEX repertoire. Please make a careful note of what each of the operators signifies, and the all the possible combinations of the results that can be obtained. A few tips follow:

      • BLAST E-values become more significant as they decrease. Therefore, in most instances, using the '<' operator would capture the desired E-value, and all subsequent lower E-values for a given instance in a genome.
      • When constructing profiles directly from BLAST results, please note that PLEX uses the highest hit and its statistics in each genome to indicate presence. Therefore, only the topmost hit for any genome is relevant.
      • If the default '*' (any) operator is left unchanged, it will not exclude presence of your query in any genome, rather, it will select any observed E-value. This means that if the profile you desire is not very specific, the results will contain all observed combinations, along with the desired answer!


      #
      Kingdom
      Organism
      Operator
      Evalue
      0
      A
      Archaeoglobus fulgidus
      1
      A
      Aeropyrum pernix
      2
      A
      Halobacterium sp. NRC-1
      3
      A
      Methanosarcina acetivorans str. C2A
      4
      A
      Methanococcus jannaschii
      5
      A
      Methanopyrus kandleri AV19
      6
      A
      Methanosarcina mazei Goe1
      7
      A
      Methanothermobacter thermautotrophicus str. Delta H
      8
      A
      Pyrococcus abyssi
      9
      A
      Pyrobaculum aerophilum
      10
      A
      Pyrococcus furiosus DSM 3638
      11
      A
      Pyrococcus horikoshii
      12
      A
      Sulfolobus solfataricus
      13
      A
      Sulfolobus tokodaii
      14
      A
      Thermoplasma acidophilum
      15
      A
      Thermoplasma volcanium
      16
      B
      Aquifex aeolicus
      17
      B
      Agrobacterium tumefaciens str. C58 (Cereon)
      18
      B
      Agrobacterium tumefaciens str. C58 (UWash)
      19
      B
      Borrelia burgdorferi
      20
      B
      Bacillus halodurans
      21
      B
      Brucella melitensis
      22
      B
      Bacillus subtilis
      23
      B
      Buchnera sp. APS
      24
      B
      Clostridium acetobutylicum ATCC824
      25
      B
      Caulobacter crescentus CB15
      26
      B
      Corynebacterium glutamicum
      27
      B
      Campylobacter jejuni
      28
      B
      Chlamydia muridarum
      29
      B
      Clostridium perfringens
      30
      B
      Chlamydophila pneumoniae AR39
      31
      B
      Chlamydophila pneumoniae CWL029
      32
      B
      Chlamydophila pneumoniae J138
      33
      B
      Chlamydia trachomatis
      34
      B
      Deinococcus radiodurans R1
      35
      B
      Escherichia coli K12
      36
      B
      Escherichia coli O157:H7
      37
      B
      Escherichia coli O157:H7 EDL933
      38
      B
      Fusobacterium nucleatum subsp. nucleatum ATCC 25586
      39
      B
      Haemophilus influenzae Rd KW20
      40
      B
      Helicobacter pylori 26695
      41
      B
      Helicobacter pylori J99
      42
      B
      Listeria innocua
      43
      B
      Lactococcus lactis subsp. lactis
      44
      B
      Listeria monocytogenes EGD-e
      45
      B
      Mycoplasma genitalium
      46
      B
      Mycobacterium leprae
      47
      B
      Mesorhizobium loti
      48
      B
      Mycoplasma pneumoniae
      49
      B
      Mycoplasma pulmonis
      50
      B
      Mycobacterium tuberculosis CDC1551
      51
      B
      Mycobacterium tuberculosis H37Rv
      52
      B
      Neisseria meningitidis MC58
      53
      B
      Neisseria meningitidis Z2491
      54
      B
      Nostoc sp. PCC 7120
      55
      B
      Pseudomonas aeruginosa
      56
      B
      Pasteurella multocida
      57
      B
      Rickettsia conorii
      58
      B
      Rickettsia prowazekii
      59
      B
      Ralstonia solanacearum
      60
      B
      Staphylococcus aureus subsp. aureus MW2
      61
      B
      Staphylococcus aureus subsp. aureus Mu50 pVRSAp
      62
      B
      Staphylococcus aureus subsp. aureus N315
      63
      B
      Streptomyces coelicolor A3(2)
      64
      B
      Sinorhizobium meliloti
      65
      B
      Streptococcus pneumoniae R6
      66
      B
      Streptococcus pneumoniae TIGR4
      67
      B
      Streptococcus pyogenes M1 GAS
      68
      B
      Streptococcus pyogenes MGAS8232
      69
      B
      Salmonella typhi
      70
      B
      Salmonella typhimurium LT2
      71
      B
      Synechocystis sp. PCC 6803
      72
      B
      Thermotoga maritima
      73
      B
      Treponema pallidum
      74
      B
      Thermoanerobacter tengcogensis
      75
      B
      Ureaplasma urealyticum
      76
      B
      Vibrio cholerae
      77
      B
      Xanthomonas axonopodis pv. citri str. 306
      78
      B
      Xanthomonas campestris pv. campestris str. ATCC 33913
      79
      B
      Xylella fastidiosa 9a5c
      80
      B
      Yersinia pestis
      81
      E
      Arabidopsis thaliana
      82
      E
      Caenorhabditis elegans
      83
      E
      Drosophila melanogaster
      84
      E
      Encephalitozoon cuniculi
      85
      E
      Homo sapiens
      86
      E
      Neurospora crassa
      87
      E
      Saccharomyces cerevisiae
      88
      E
      Schizosaccharomyces pombe



      Display maximum:     results.

        

    Copyright © 2002 - 2004, Shailesh Date and Edward Marcotte. The protein link explorer data and server is the property of the University of Texas, and cannot be used for commercial purposes without written permission of Edward Marcotte and the University of Texas. It is forbidden to redistribute, derivatize, or encapsulate the protein link explorer data or server in another database without permission. Sale of information derived from it, whether directly or in revised form, is forbidden except by permission of UT and Edward Marcotte. All copies or mirrors of the protein link explorer must carry this notice.