CuGenDBv2

Sed0014275 (gene) of Chayote v1 genome

Gene ID: Sed0014275
Organism: Sechium edule (Chayote v1)
Description: transcription factor SPT20 homolog isoform X1
Genome location: LG05:32907630..32911971
RNA-Seq Expression: Sed0014275
Synteny: Sed0014275
Gene Ontology terms: NA
InterPro domains: IPR010820 - UBA-like domain DUF1421
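
The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch, the span it covers can be computed as below, assuming the coordinates are 1-based and inclusive (the page does not state its convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start is greater than end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:32907630..32911971")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4342 bp on LG05. This is the genomic span, so it may include introns and untranslated regions and need not match the CDS length.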


Homology
GenBank top hits (e value, % identity, alignment)
XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo] (e value: 3.3e-214; identity: 81%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNS+PK+F+F SDDIL SF+DY K D S+   +DPVSV N GKDFHKSRMST+F  A YGQ DD+ISQ+V+S VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRS+LARDHEEADSKLKSLE H+QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HS+SNEERASS A D KK E
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LP N+P QQQSYYI  +QLPGQPPHIQHAQ QYIS DS+H ASQPQDVSQM+NPQLSQ+ PQPFNQY Q
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ

Query:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE
        QW QPPSQ  QPPQQPSMQ QIRPPP SVYP  YPPNQP  MPETL SSMP+ QMSF SIPQPGSSR+D  PYGYA  SGGSAPQQPP VKNAYGP  GE
Subjt:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE

Query:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD
        GYMPPGQQ    SGGAYMMYDRESGRP H P Q  HFNQ GYP ANAP Q   QA  GPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRMEDSGQPVD
Subjt:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD

Query:  FNGVLDRLSSASPGPGPQRAW
        FN VLDRLSS S GPGPQRAW
Subjt:  FNGVLDRLSSASPGPGPQRAW

XP_022952329.1 class E vacuolar protein-sorting machinery protein hse1-like [Cucurbita moschata] (e value: 2.7e-216; identity: 81.36%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNSAPK+F+F SD+IL SF+DY K +    SH+DPVSV NS KDFHKSRMST+F GAAYGQPDDSI+QDV++AVEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRSDLARDHEEADSKLKS+E H+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHS++NEER S+   DPKKNE
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPIT-PPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSA LP NVP QQQSYYI S+QLPG QP HIQHAQ QYIS DS+H ASQPQDVSQMTNPQLSQ +PQPFNQY
Subjt:  NLSEIHNQQLALALPHQIVPQQNPIT-PPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY

Query:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPA
         QQW QPPSQ  QPPQQ SMQPQIRPPP+SVYP    PPNQP  MPETLSSSMP+ QMSFASIPQPGSSR D +PYGYAAASGGSAPQQPP VKNAYGPA
Subjt:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPA

Query:  AGEGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQ
         GEGYMPPGQQPALSSGGAYMMYDRESGRP        H PSQ  HF+Q GYPPANAP Q   QA TGPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQ
Subjt:  AGEGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQ

Query:  RMEDSGQPVDFNGVLDRLSSASPGPGPQRAW
        RMEDSGQ VDFN VLDRLS+ + GPGPQRAW
Subjt:  RMEDSGQPVDFNGVLDRLSSASPGPGPQRAW

XP_022969058.1 ataxin-2 homolog [Cucurbita maxima] (e value: 2.2e-218; identity: 81.66%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNSAPK+F+F SD+IL SF+DY K +    SH+DPVSV NS KDFHKSRMST+F GAAYGQPDDSI+QDV++ VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRSDLARDHEEADSKLKS+E H+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHS++NEER S+   DPKKNE
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQ-QSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP
        N SEIHNQQLALALPHQIVPQQNP+TPPSA LP NVPQQ QSYYI S+QLPG QP HIQHAQ QYIS DS H ASQPQDVSQMTNPQLSQ +PQPFNQY 
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQ-QSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP

Query:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG
        QQW QPPSQ  QPPQQ SMQPQIRPPP+SVY  PYPPNQP  MPETLSSSMP+ QMSFASIPQPGSSR D +PYGYAAASGGSAPQQPP VKNAYGPA G
Subjt:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG

Query:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM
        EGYMPPGQQPALSSGGAYMMYDRESGRP        H PSQ  HFNQ GYPPANAP Q   QA TGPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRM
Subjt:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM

Query:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW
        EDSGQ VDFN VLDRLS+ + GPGPQRAW
Subjt:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW

XP_023554446.1 trithorax group protein osa-like [Cucurbita pepo subsp. pepo] (e value: 2.7e-216; identity: 80.91%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNSAPK+F+F SD+IL SF+DY K +    SH+DPVSV NS KDFHKSRMST+F GAAYGQPDDSI+QDV++ VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRSDLARDHEEA+SKLKS+E H+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHS++NEER S+   DPKKNE
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP
        N SEIHNQQLALALPHQIVPQQNPITPPSA LP NVP QQQSYYI S+QLPG QP HIQHAQ QYIS DS+H ASQPQDVS MTNPQLSQ +PQPFNQY 
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP

Query:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG
        QQW QPPSQ  QPPQQ SMQPQIRPPP+SVY  PYPPNQP  MPETLSSSMP+ QMSFA IPQPGSSR D +PYGYAA+SGGSAPQQPP VKNAYGPA G
Subjt:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG

Query:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM
        EGYMPPGQQPALSSGGAYMMYDRESGRP        H PSQ  HFNQ GYPPANAP Q   QA  GPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRM
Subjt:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM

Query:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW
        EDSGQ VDFN VLDRLS+ + GPGPQRAW
Subjt:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW

XP_038888365.1 ataxin-2 homolog [Benincasa hispida] (e value: 3.5e-216; identity: 81.7%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYG--QPDDSISQDVMSAVENCMKKHSDNLFRFLE
        MASGS+GRPNS+PK+F+F SDDIL SF+DY K D    SH DPVS+ NS KDFHKSRMST+F  AAYG  Q DDSISQ+V+S VEN MKKHSDNL RFLE
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYG--QPDDSISQDVMSAVENCMKKHSDNLFRFLE

Query:  GMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKK
        G+SSRLSQLEL CYNLDKSV EMRSDLARDHEEADSKLKSLE H+QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHS+SNEERASS A DPKK
Subjt:  GMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKK

Query:  NENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY
        NEN SEIHNQQLALALPHQIVPQQN IT PSA LP N+P QQQSYYI S+QLPGQPPH+QHAQGQYISPDS   ASQPQDVSQM+NPQLSQ+ PQPFNQY
Subjt:  NENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY

Query:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAA
         QQW QPPSQ  QPPQQPSMQPQIRPPP SVYP  YPPNQP  MPETLSSSMP+  MSF SIPQPGSSRMD  PYGYAAASGGSAPQQPP VKNAYGPA 
Subjt:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAA

Query:  GEGYMPPGQQPALSSGGAYMMYDRESGR-PSHPPSQP------PHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQR
        GEGYMPPGQQ    SGGAYMMYDRESGR P HPP QP      PHFNQ GYPPAN   Q   QA TGPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQR
Subjt:  GEGYMPPGQQPALSSGGAYMMYDRESGR-PSHPPSQP------PHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQR

Query:  MEDSGQPVDFNGVLDRLSSASPGPGPQRAW
        MEDSGQPVDFN VLDRLS+ + GPGPQRAW
Subjt:  MEDSGQPVDFNGVLDRLSSASPGPGPQRAW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K720 DUF1421 domain-containing protein (e value: 3.9e-213; identity: 80.27%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNS+PK+F+F SDDIL SF+DY K D S+   +DPVSV N GKDFHK RMST+F  + YGQ DD+ISQ+V+S VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRS+LARDHEEADSKLKSLE H+QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++HS+SNEERASS A DPKK E
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LP N+P QQQSYYI  +QLPGQPPHIQHAQ QYI  DS+H ASQPQDVSQM+NPQLSQ+ PQPFNQY Q
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ

Query:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG
        QW QPPSQ  QPPQQPSMQ QIRPPP SVYP    PPNQP  MPETL SSMP+ QMSF SIPQPGSSR+D  PYGYAA SGGSAPQQPP VKNAYGP  G
Subjt:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG

Query:  EGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPV
        EGYMPPGQQ    SGGAYMMYDRESGRP H P Q  HFNQ GYP ANAP Q   QA  GPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRMEDSGQPV
Subjt:  EGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPV

Query:  DFNGVLDRLSSASPGPGPQRAW
        DFN VLDRLSS S GPGPQRAW
Subjt:  DFNGVLDRLSSASPGPGPQRAW

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like (e value: 1.6e-214; identity: 81%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNS+PK+F+F SDDIL SF+DY K D S+   +DPVSV N GKDFHKSRMST+F  A YGQ DD+ISQ+V+S VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRS+LARDHEEADSKLKSLE H+QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HS+SNEERASS A D KK E
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LP N+P QQQSYYI  +QLPGQPPHIQHAQ QYIS DS+H ASQPQDVSQM+NPQLSQ+ PQPFNQY Q
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ

Query:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE
        QW QPPSQ  QPPQQPSMQ QIRPPP SVYP  YPPNQP  MPETL SSMP+ QMSF SIPQPGSSR+D  PYGYA  SGGSAPQQPP VKNAYGP  GE
Subjt:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE

Query:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD
        GYMPPGQQ    SGGAYMMYDRESGRP H P Q  HFNQ GYP ANAP Q   QA  GPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRMEDSGQPVD
Subjt:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD

Query:  FNGVLDRLSSASPGPGPQRAW
        FN VLDRLSS S GPGPQRAW
Subjt:  FNGVLDRLSSASPGPGPQRAW

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like (e value: 1.0e-213; identity: 80.61%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNS+PK+F+F SDDIL SF+DY K D S+   +DPVS+ N GKDFHKSRMST+F  A Y Q DD+ISQ+V+S VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASH---NDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRS+LARDHEEADSKLKSLE H+QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HS+SNEERASS A D KK E
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LP N+P QQQSYYI  +QLPGQPPHIQHAQ QYIS DS+H ASQPQDVSQM+NPQLSQ+ PQPFNQY Q
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVP-QQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQ

Query:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE
        QW QPPSQ  QPPQQPSMQ QIRPPP SVYP  YPPNQP  MPETL SSMP+ QMSF SIPQPGSSR+D  PYGYA  SGGSAPQQPP VKNAYGP  GE
Subjt:  QWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAGE

Query:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD
        GYMPPGQQ    SGGAYMMYDRESGRP H P Q  HFNQ GYP ANAP Q   QA  GPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRMEDSGQPVD
Subjt:  GYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVD

Query:  FNGVLDRLSSASPGPGPQRAW
        FN VLDRLSS S GPGPQRAW
Subjt:  FNGVLDRLSSASPGPGPQRAW

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like (e value: 1.3e-216; identity: 81.36%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNSAPK+F+F SD+IL SF+DY K +    SH+DPVSV NS KDFHKSRMST+F GAAYGQPDDSI+QDV++AVEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRSDLARDHEEADSKLKS+E H+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHS++NEER S+   DPKKNE
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPIT-PPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSA LP NVP QQQSYYI S+QLPG QP HIQHAQ QYIS DS+H ASQPQDVSQMTNPQLSQ +PQPFNQY
Subjt:  NLSEIHNQQLALALPHQIVPQQNPIT-PPSATLPPNVP-QQQSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQY

Query:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPA
         QQW QPPSQ  QPPQQ SMQPQIRPPP+SVYP    PPNQP  MPETLSSSMP+ QMSFASIPQPGSSR D +PYGYAAASGGSAPQQPP VKNAYGPA
Subjt:  PQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPY---PPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPA

Query:  AGEGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQ
         GEGYMPPGQQPALSSGGAYMMYDRESGRP        H PSQ  HF+Q GYPPANAP Q   QA TGPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQ
Subjt:  AGEGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQ

Query:  RMEDSGQPVDFNGVLDRLSSASPGPGPQRAW
        RMEDSGQ VDFN VLDRLS+ + GPGPQRAW
Subjt:  RMEDSGQPVDFNGVLDRLSSASPGPGPQRAW

A0A6J1HZW1 ataxin-2 homolog (e value: 1.1e-218; identity: 81.66%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM
        MASGS+GRPNSAPK+F+F SD+IL SF+DY K +    SH+DPVSV NS KDFHKSRMST+F GAAYGQPDDSI+QDV++ VEN MKKHSDNL RFLEG+
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPD---ASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGM

Query:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE
        SSRLSQLEL CYNLDKSV EMRSDLARDHEEADSKLKS+E H+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHS++NEER S+   DPKKNE
Subjt:  SSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNE

Query:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQ-QSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP
        N SEIHNQQLALALPHQIVPQQNP+TPPSA LP NVPQQ QSYYI S+QLPG QP HIQHAQ QYIS DS H ASQPQDVSQMTNPQLSQ +PQPFNQY 
Subjt:  NLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQ-QSYYIPSTQLPG-QPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYP

Query:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG
        QQW QPPSQ  QPPQQ SMQPQIRPPP+SVY  PYPPNQP  MPETLSSSMP+ QMSFASIPQPGSSR D +PYGYAAASGGSAPQQPP VKNAYGPA G
Subjt:  QQWVQPPSQLTQPPQQPSMQPQIRPPPSSVY--PYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPP-VKNAYGPAAG

Query:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM
        EGYMPPGQQPALSSGGAYMMYDRESGRP        H PSQ  HFNQ GYPPANAP Q   QA TGPH SARNPSHSHLIEKLVGMGFRGDHVAS+IQRM
Subjt:  EGYMPPGQQPALSSGGAYMMYDRESGRP-------SHPPSQPPHFNQGGYPPANAPQQA--QAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRM

Query:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW
        EDSGQ VDFN VLDRLS+ + GPGPQRAW
Subjt:  EDSGQPVDFNGVLDRLSSASPGPGPQRAW

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G01560.1 Protein of unknown function (DUF1421) (e value: 7.4e-23; identity: 32.25%)
Query:  VENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSH
        ++  MKKH+D L   +EG+S+RLSQLE   +NL+  V +++  +   H   D K++ L+  L EV   VQ+++DKQE+ E Q  L+K QVS +   + +H
Subjt:  VENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSH

Query:  SKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQM
        S   +  A S AP P +   L+       + A P Q         PPS+ LPP +P Q S    S Q P  PP   H Q    +P        P    Q 
Subjt:  SKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQM

Query:  TNP-QLSQSSPQPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSA
          P Q S  SP    QYPQQ   PPS    P +QP  Q Q          YPPN P   P   + S P QQ  F + PQP  S  D         +GG +
Subjt:  TNP-QLSQSSPQPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPYPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSA

Query:  PQQPPVKNAYGPAAGEGY-MPPGQQPALSSGGAYMMYDRESGRPSHPPSQP-PHFNQGGYPPANAPQQAQAATGPHGSARNPSHSHLIEKLVGMGFRGDH
            P      P    G  M   + P +SS G        +G P    S+P PH      P  +A      ++ P   +R P    +I+++  MGF  D 
Subjt:  PQQPPVKNAYGPAAGEGY-MPPGQQPALSSGGAYMMYDRESGRPSHPPSQP-PHFNQGGYPPANAPQQAQAATGPHGSARNPSHSHLIEKLVGMGFRGDH

Query:  VASVIQRMEDSGQPVDFNGVLDRLSSASPGP
        V + ++++ ++GQ VD N VLD+L +    P
Subjt:  VASVIQRMEDSGQPVDFNGVLDRLSSASPGP

AT4G28300.1 Protein of unknown function (DUF1421) (e value: 5.6e-103; identity: 49.44%)
Query:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDAS---HNDP-VSVPNSGKDFHKSRM--STIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFL
        MASGSSGR NS  K F+F SDDIL S+DDY   D+S   H+DP ++  NS K+FHK+RM  S++F  ++Y  P+DS+SQD+   VE  MK ++DN+ RFL
Subjt:  MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDAS---HNDP-VSVPNSGKDFHKSRM--STIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFL

Query:  EGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPK
        EG+SSRLSQLEL CYNLDK++ EMRS+L   HE+AD KL+SL+ HLQEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SSSSHS+  E+R ++  P+PK
Subjt:  EGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPK

Query:  KNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYI--PSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSP----
        K+EN S+ HNQQLALALPHQI PQ           P   PQQ  YY+  P TQL   P  +  +     +P S+     P   SQ   P  + S P    
Subjt:  KNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYI--PSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSP----

Query:  -QPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP-YPPNQPNPMP--ETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVK
         Q F QY Q W         PP     QPQ RP  S  YP Y P  P   P  E+L SSM +Q       P  G  +     YGY AA    AP Q   K
Subjt:  -QPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP-YPPNQPNPMP--ETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVK

Query:  NAYGPAAGEGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQP------PHFNQGGYPPANAPQQAQAATGPHGS--ARNPSHSHLIEKLVGMGFRGDHV
         +Y P  G+GY+P G  P   SG A  MY  E GR  +PP QP       H+ QG      +PQ  QA  G  G+       +  LIEKLV MGFRGDHV
Subjt:  NAYGPAAGEGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQP------PHFNQGGYPPANAPQQAQAATGPHGS--ARNPSHSHLIEKLVGMGFRGDHV

Query:  ASVIQRMEDSGQPVDFNGVLDRLSSASPGPGPQRAW
         +VIQRME+SGQP+DFN +LDRLS  S G GP R W
Subjt:  ASVIQRMEDSGQPVDFNGVLDRLSSASPGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421) (e value: 9.5e-87; identity: 48.42%)
Query:  STIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQE
        S++F  ++Y  P+DS+SQD+   VE  MK ++DN+ RFLEG+SSRLSQLEL CYNLDK++ EMRS+L   HE+AD KL+SL+ HLQEVHRSVQI+RDKQE
Subjt:  STIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYI--PSTQLPGQPPHI
        LA+TQK+LAKLQ+ QKE SSSSHS+  E+R ++  P+PKK+EN S+ HNQQLALALPHQI PQ           P   PQQ  YY+  P TQL   P  +
Subjt:  LAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPNVPQQQSYYI--PSTQLPGQPPHI

Query:  QHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSP-----QPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP-YPPNQPNPMP--ETLSSSMP
          +     +P S+     P   SQ   P  + S P     Q F QY Q W         PP     QPQ RP  S  YP Y P  P   P  E+L SSM 
Subjt:  QHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSP-----QPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYP-YPPNQPNPMP--ETLSSSMP

Query:  LQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVKNAYGPAAGEGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQP------PHFNQGGYPPAN
        +Q       P  G  +     YGY AA    AP Q   K +Y P  G+GY+P G  P   SG A  MY  E GR  +PP QP       H+ QG      
Subjt:  LQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVKNAYGPAAGEGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQP------PHFNQGGYPPAN

Query:  APQQAQAATGPHGS--ARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVDFNGVLDRLSSASPGPGPQRAW
        +PQ  QA  G  G+       +  LIEKLV MGFRGDHV +VIQRME+SGQP+DFN +LDRLS  S G GP R W
Subjt:  APQQAQAATGPHGS--ARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVDFNGVLDRLSSASPGPGPQRAW

AT5G14540.1 Protein of unknown function (DUF1421) (e value: 9.7e-23; identity: 29.55%)
Query:  DASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPD-DSISQDVMSAVENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSK
        DAS   PVS  +S + +    M ++     + + D +S    ++SA++  MK H+D L   +EG+S+RL+QLE    +L+  V +++  +   H + D K
Subjt:  DASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPD-DSISQDVMSAVENCMKKHSDNLFRFLEGMSSRLSQLELVCYNLDKSVTEMRSDLARDHEEADSK

Query:  LKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPN
        L+ LE  + EV   VQ+++DKQE+ E Q  L+KLQ+S+      +HS   E  A                              P   P  P SA  PP+
Subjt:  LKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQNPITPPSATLPPN

Query:  VPQQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQL---SQSSPQPFNQYPQQWVQPPSQLTQPP-QQPSMQPQI--RPPPSSVY
        + QQ    +P  Q   QPP  QH     +SP S      P   S    P      QS P P  Q P Q   P   L QPP Q P  QPQ   +PPP   +
Subjt:  VPQQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQL---SQSSPQPFNQYPQQWVQPPSQLTQPP-QQPSMQPQI--RPPPSSVY

Query:  P--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVKNAYGPAAGEGYMPPGQQP-ALSSGGAYMMYDRESGRPSHP
        P  Y P +P P P+      P +Q    S P PGS+   P    Y A      P  P + +  G  +  G+ P G  P +    G    Y      PS  
Subjt:  P--YPPNQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVKNAYGPAAGEGYMPPGQQP-ALSSGGAYMMYDRESGRPSHP

Query:  PSQPPHFNQGGYP--------PANAPQQAQAATGPHGSARNPSHS-------HLIEKLVGMGFRGDHVASVIQRMEDSGQPVDFNGVLDRLSSASPG---
        P+       G YP        P   P  +  ++G  G   +   S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L +   G   
Subjt:  PSQPPHFNQGGYP--------PANAPQQAQAATGPHGSARNPSHS-------HLIEKLVGMGFRGDHVASVIQRMEDSGQPVDFNGVLDRLSSASPG---

Query:  ----PGPQRAW
              P R W
Subjt:  ----PGPQRAW


Sequences
CDS sequence
ATGGCGTCTGGTTCATCTGGTCGCCCTAATTCTGCCCCCAAAACATTCAATTTCGACTCCGACGACATTCTCCGCTCATTCGACGACTACGCCAAACCGGACGCTTCCCA
CAACGACCCCGTCTCCGTTCCCAACTCCGGCAAGGATTTTCACAAAAGTAGAATGTCTACCATATTCTCTGGTGCTGCTTATGGTCAACCAGATGATTCCATCAGTCAGG
ACGTGATGTCTGCCGTAGAGAACTGCATGAAAAAGCATTCAGATAACCTTTTTCGTTTTCTTGAGGGAATGAGTTCACGCCTATCACAACTTGAACTAGTTTGCTACAAC
CTTGATAAATCTGTTACAGAAATGCGTTCTGATCTAGCTCGTGATCATGAAGAGGCGGATTCAAAGCTTAAATCTCTTGAGATGCATCTACAAGAGGTCCACAGGTCTGT
ACAGATTATAAGAGACAAGCAAGAGCTTGCAGAGACTCAAAAAGATCTAGCCAAACTTCAGGTATCACAGAAAGAGCCATCTTCATCAAGCCATTCAAAGTCAAATGAGG
AGAGAGCTTCATCAGCCGCCCCGGATCCAAAAAAGAACGAAAATCTATCCGAGATTCACAACCAGCAATTAGCATTAGCCTTGCCGCACCAGATTGTTCCACAGCAAAAT
CCTATTACACCCCCTTCAGCAACATTGCCTCCGAATGTGCCTCAACAGCAATCGTACTACATTCCTTCAACCCAATTGCCTGGTCAACCACCCCACATCCAGCATGCTCA
GGGCCAATATATCTCACCCGATTCCCGACACTGTGCTTCGCAGCCTCAAGATGTTTCACAAATGACCAATCCCCAGCTAAGTCAGTCTTCGCCACAGCCATTCAATCAAT
ATCCGCAACAATGGGTGCAGCCGCCATCTCAGCTGACACAACCACCACAGCAGCCTTCTATGCAACCTCAGATCAGACCACCACCTAGTTCAGTCTACCCTTATCCACCA
AATCAACCTAATCCTATGCCAGAGACTCTGTCAAGCAGCATGCCTTTGCAGCAAATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAGCCGCATGGACCCGATGCCTTA
TGGGTATGCTGCTGCAAGCGGTGGTTCAGCTCCACAGCAACCTCCAGTAAAAAATGCGTATGGACCAGCAGCAGGTGAGGGATATATGCCTCCTGGACAACAACCTGCAC
TGTCCTCCGGTGGCGCATACATGATGTATGACAGGGAAAGTGGAAGACCATCGCACCCTCCGTCTCAACCACCACATTTCAATCAAGGCGGATATCCTCCAGCCAATGCA
CCTCAGCAGGCTCAGGCTGCAACAGGCCCCCATGGTTCAGCCAGGAATCCTAGTCATTCACATTTGATCGAGAAACTGGTTGGCATGGGGTTCAGAGGAGACCATGTTGC
CAGTGTAATTCAGAGAATGGAAGACAGTGGCCAGCCTGTTGACTTCAACGGAGTTCTAGACAGGTTGAGTTCTGCAAGTCCAGGTCCAGGTCCGCAGAGAGCATGGTGA
mRNA sequence
GTACATATAAAGAAAGTCTTGGATTTGAAGGAATCCGAAGAACATTTGGGTATCGGATAATCGGAAAGGTGAATCTCTGTTTCTCCGCAACCATTATCAGTGCGATCTAT
GGCGTCTGGTTCATCTGGTCGCCCTAATTCTGCCCCCAAAACATTCAATTTCGACTCCGACGACATTCTCCGCTCATTCGACGACTACGCCAAACCGGACGCTTCCCACA
ACGACCCCGTCTCCGTTCCCAACTCCGGCAAGGATTTTCACAAAAGTAGAATGTCTACCATATTCTCTGGTGCTGCTTATGGTCAACCAGATGATTCCATCAGTCAGGAC
GTGATGTCTGCCGTAGAGAACTGCATGAAAAAGCATTCAGATAACCTTTTTCGTTTTCTTGAGGGAATGAGTTCACGCCTATCACAACTTGAACTAGTTTGCTACAACCT
TGATAAATCTGTTACAGAAATGCGTTCTGATCTAGCTCGTGATCATGAAGAGGCGGATTCAAAGCTTAAATCTCTTGAGATGCATCTACAAGAGGTCCACAGGTCTGTAC
AGATTATAAGAGACAAGCAAGAGCTTGCAGAGACTCAAAAAGATCTAGCCAAACTTCAGGTATCACAGAAAGAGCCATCTTCATCAAGCCATTCAAAGTCAAATGAGGAG
AGAGCTTCATCAGCCGCCCCGGATCCAAAAAAGAACGAAAATCTATCCGAGATTCACAACCAGCAATTAGCATTAGCCTTGCCGCACCAGATTGTTCCACAGCAAAATCC
TATTACACCCCCTTCAGCAACATTGCCTCCGAATGTGCCTCAACAGCAATCGTACTACATTCCTTCAACCCAATTGCCTGGTCAACCACCCCACATCCAGCATGCTCAGG
GCCAATATATCTCACCCGATTCCCGACACTGTGCTTCGCAGCCTCAAGATGTTTCACAAATGACCAATCCCCAGCTAAGTCAGTCTTCGCCACAGCCATTCAATCAATAT
CCGCAACAATGGGTGCAGCCGCCATCTCAGCTGACACAACCACCACAGCAGCCTTCTATGCAACCTCAGATCAGACCACCACCTAGTTCAGTCTACCCTTATCCACCAAA
TCAACCTAATCCTATGCCAGAGACTCTGTCAAGCAGCATGCCTTTGCAGCAAATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAGCCGCATGGACCCGATGCCTTATG
GGTATGCTGCTGCAAGCGGTGGTTCAGCTCCACAGCAACCTCCAGTAAAAAATGCGTATGGACCAGCAGCAGGTGAGGGATATATGCCTCCTGGACAACAACCTGCACTG
TCCTCCGGTGGCGCATACATGATGTATGACAGGGAAAGTGGAAGACCATCGCACCCTCCGTCTCAACCACCACATTTCAATCAAGGCGGATATCCTCCAGCCAATGCACC
TCAGCAGGCTCAGGCTGCAACAGGCCCCCATGGTTCAGCCAGGAATCCTAGTCATTCACATTTGATCGAGAAACTGGTTGGCATGGGGTTCAGAGGAGACCATGTTGCCA
GTGTAATTCAGAGAATGGAAGACAGTGGCCAGCCTGTTGACTTCAACGGAGTTCTAGACAGGTTGAGTTCTGCAAGTCCAGGTCCAGGTCCGCAGAGAGCATGGTGATGG
TAATTTCATCTCTCCCTGTTTACGTTGATACCGAGCATGACCAGCCTCATACATTGCGACTTTTCAATGCATTGAATAAAACACTGGTTTATGTTTCAATTATGTCCTCG
TACATAAATTGTTATGGTTTGTGAGATTTAGAAACGTTGGCTGTATGATTTCGAAACTTTGTGTGAATATCTTCTACCTGCA
Protein sequence
MASGSSGRPNSAPKTFNFDSDDILRSFDDYAKPDASHNDPVSVPNSGKDFHKSRMSTIFSGAAYGQPDDSISQDVMSAVENCMKKHSDNLFRFLEGMSSRLSQLELVCYN
LDKSVTEMRSDLARDHEEADSKLKSLEMHLQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSKSNEERASSAAPDPKKNENLSEIHNQQLALALPHQIVPQQN
PITPPSATLPPNVPQQQSYYIPSTQLPGQPPHIQHAQGQYISPDSRHCASQPQDVSQMTNPQLSQSSPQPFNQYPQQWVQPPSQLTQPPQQPSMQPQIRPPPSSVYPYPP
NQPNPMPETLSSSMPLQQMSFASIPQPGSSRMDPMPYGYAAASGGSAPQQPPVKNAYGPAAGEGYMPPGQQPALSSGGAYMMYDRESGRPSHPPSQPPHFNQGGYPPANA
PQQAQAATGPHGSARNPSHSHLIEKLVGMGFRGDHVASVIQRMEDSGQPVDFNGVLDRLSSASPGPGPQRAW