; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC07G127780 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC07G127780
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptiontranscription factor SPT20 homolog isoform X1
Genome locationCiama_Chr07:3126235..3134378
RNA-Seq ExpressionCaUC07G127780
SyntenyCaUC07G127780
Gene Ontology termsNA
InterPro domainsIPR010820 - UBA-like domain DUF1421


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589219.1 hypothetical protein SDJN03_17784, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0073.53Show/hide
Query:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL
        +++  +  DFHKSRMSTVYPAAAY QPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDV+RDHEEEDLKLKSLEKHL
Subjt:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL

Query:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE
        QE                                                                    VHRSVQIIRDKQELAETQKDLAKLHL+QKE
Subjt:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE

Query:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQY-QQLPDVSRL
        S SSSHSHSN+ERASPVASDP KNENPSENH NQQLALALPHQ++ QQNP+TPPPPAALPQN+PQQQ+YYI S  +P+Q  +IQH Q QY QQL DVSRL
Subjt:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQY-QQLPDVSRL

Query:  PSQMTNPQLSQTPPPQQFNQYQQQWT-----QQQPPQQVQPP-QQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTV
              PQ SQTPPPQQFNQY QQWT     QQQPPQ VQPP QQQPSMQPQIRP P+SVY SYSMNQPTSMPETL NSMPMQ +FSP+PQPGSSR+DTV
Subjt:  PSQMTNPQLSQTPPPQQFNQYQQQWT-----QQQPPQQVQPP-QQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTV

Query:  PYGYVGSGVTVSQQLPQVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHV
        PYGY GSG TV QQ PQVKNAFGP AGEGYLPSGPQ  LSSGG+YMMYDRESGRP HH PQPQ         QQPHFNQ  YPPANASLQIPQ  SGPHV
Subjt:  PYGYVGSGVTVSQQLPQVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHV

Query:  IPRNPNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAFAHPKKILVSIIGN
        + RNP+H H MR  NQ+HPYGEIVEKLVGMGFRSDH+ASVIHRMEESGQP+DFNAVLDGLSN GGPQR+                               
Subjt:  IPRNPNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAFAHPKKILVSIIGN

Query:  LNHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFP-AGSYGQADDSI
               S +N  LHS SF +S+R MASGSAGR NS+PK+FDFGSDDILCS+EDYGKQD SNGSH+DPVSVTNS+KDFHK RMST FP A SYGQ DDSI
Subjt:  LNHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFP-AGSYGQADDSI

Query:  SQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAET
        +Q+VIS VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS++ARDHEE DSKLKSLEKH+QE            VHRSVQIIRDKQELAET
Subjt:  SQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAET

Query:  QKDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQG
        QKDLAKLQVSQKEPS SSHSQSNEERASSVASDPKKNEN SEIH QQLALALPHQIVPQQNPI P SA LP N+PQ QQ YYI  +QL GQPPHIQHA G
Subjt:  QKDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQG

Query:  QYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQP
        QYISPD QHRA QPQD+S  +NPQLSQ+PPQPFNQYQQQWAQ PSQQPQPPQQ SMQPQ RPPP S Y  PYPPNQP+S+PETLSS+    MSFASIP P
Subjt:  QYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQP

Query:  GSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPH
        GS+R D  PYGY AASGGS+PQQPPQVKN YGPATGEGY+PPGQ    AYMMYDRESGRPPHHPP       QQPHFNQSGYPPANAPHQ+ PQA T P 
Subjt:  GSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPH

Query:  VSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        VS+RNPSHSHLIEKLVGMGFRGDHVA+IIQRMED G+PVDFN VLDRLS+   PGPQRAW
Subjt:  VSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

KAG7011974.1 hypothetical protein SDJN02_26882, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0072.99Show/hide
Query:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL
        +++  +  DFHKSR+STVYPAAAYGQPEDS+KQDVISTVENSMKKYSDNILRFLEGISSRLSQLELN YNLDKSVGEMRSD++RDHEE DLKLKSLEKHL
Subjt:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL

Query:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE
        QE                                                                    VHRSVQIIRDKQELAETQKDLAKLHLLQKE
Subjt:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE

Query:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQYQQLPDVSRLP
        SS S HSHSN+ERASP A DPKKNE PS+NH NQQLALALPHQIVPQQ+   PPPPAALP+NVPQQQ YY            IQH Q+Q+          
Subjt:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQYQQLPDVSRLP

Query:  SQMTN--PQLSQT--PPPQQFNQYQQQWTQQQPPQQVQPPQQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTVPYG
         QMTN   QLSQT  PPPQQF+QYQQQW  QQPPQQ QPPQQ PSMQPQIR PPTSVYSSYSMNQPTSMPET     PMQ+SFSPIPQPGSSRMDTV YG
Subjt:  SQMTN--PQLSQT--PPPQQFNQYQQQWTQQQPPQQVQPPQQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTVPYG

Query:  YVGSGVTVSQQLPQVKNAF--GPQAGEGYLPSGPQSGLSSGGTYMMYDRESGR--------------PSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANA
        YVGS  T+ QQ PQVKNAF  GPQAGEGYLPSGPQSGLSSGG YM+YDRE+GR              P HHPPQPQQPPHH PQPQQPHFNQSGYPPAN 
Subjt:  YVGSGVTVSQQLPQVKNAF--GPQAGEGYLPSGPQSGLSSGGTYMMYDRESGR--------------PSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANA

Query:  SLQIPQHPSGPHVIPRN-PNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLA
         +QIPQHPSGPHV+ RN PN  HFMR  NQNHPYGEIV+KLVGMGFRSDH+ SVIHRMEESGQP+DFNAVLDGLSNSG                      
Subjt:  SLQIPQHPSGPHVIPRN-PNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLA

Query:  FAHPKKILVSIIGNLNHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTV
                                                     GRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGSH++PVSV NS+KDFHKSRMSTV
Subjt:  FAHPKKILVSIIGNLNHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTV

Query:  FPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRS
        FP  +YGQ DDSI+Q+VI+ VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS+LARDHEEA+SKLKS+EKHVQE            VHRS
Subjt:  FPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRS

Query:  VQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQHQQPYYIPSS
        VQIIRDKQELAETQKDLAKLQV QKEPSLSSHSQ+NEER   V++DPKKNENPSEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQ QQ YYI SS
Subjt:  VQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQHQQPYYIPSS

Query:  QLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETL
        QLPG QP HIQHAQ QYIS DSQHRASQPQD+SQM+NPQLSQT PQPFNQYQQQWAQPPSQ  QPPQQ SMQPQ RPPP SVYPSPY PPNQPTSMPETL
Subjt:  QLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETL

Query:  SSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQP-HHPSQQPHFNQ
        SSSMPMQMSFASIPQPGS+R DA PYGY AASGGSAPQQPPQVKNAYGPATGEGYMPPGQQ    SGGAYMMYDRESGRPPHH PQQP HHPSQQ HF+Q
Subjt:  SSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQP-HHPSQQPHFNQ

Query:  SGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        SGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV  IIQRMEDSGQ VDFNAVLDRLSTP GPGPQRAW
Subjt:  SGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

KAG7022919.1 hypothetical protein SDJN02_16655, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0071.59Show/hide
Query:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL
        +++  +  DFHKSRMSTVYPAAAY QPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDV+RDHEEEDLKLKSLEKHL
Subjt:  IAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL

Query:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE
        QE                                                                    VHRSVQIIRDKQELAETQKDLAKLHL+QKE
Subjt:  QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKE

Query:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQY-QQLPDVSRL
        S SSSHSHSN+ERASPVASDP KNENPSENH NQQLALALPHQ++ QQNP+TPPPPAALPQNVPQQQ+YYI S  +P+Q  HIQH Q QY QQL DVSRL
Subjt:  SSSSSHSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQY-QQLPDVSRL

Query:  PSQMTNPQLSQTPPPQQFNQYQQQWT----QQQPPQQVQPP-QQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTVP
              PQ SQTPPPQQFNQY QQWT    QQQPPQ VQPP QQQPSMQPQIRP P+SVY SYSMNQPTSMPETL NSMPMQ +FSP+PQPGSSR+DTVP
Subjt:  PSQMTNPQLSQTPPPQQFNQYQQQWT----QQQPPQQVQPP-QQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTVP

Query:  YGYVGSGVTVSQQLPQVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHVI
        YGY GSG TV QQ PQVKNAFGP AGEGYLPSGPQ  LSSGG+YMMYDRESGRP HH PQPQ         QQPHFNQ  YPPANASLQIPQ  SGPHV+
Subjt:  YGYVGSGVTVSQQLPQVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHVI

Query:  PRNPNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAFAHPKKILVSIIGNL
         RNP+H H MR  NQ+HPYGEIVEKLVGMGFRSDH+ASVIHRMEESGQP+DFNAVLDGLSN GGPQR                                 
Subjt:  PRNPNHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAFAHPKKILVSIIGNL

Query:  NHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFP-AGSYGQADDSIS
                               S  S   GR NS+PK+FDFGSDDILCS+EDYGKQD SNGSH+DPVSVTNS+KDFHK RMST FP A SYGQ DDSI+
Subjt:  NHLSLFSPKNLSLHSHSFRYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFP-AGSYGQADDSIS

Query:  QNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQ
        Q+VIS VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS++ARDHEE                          VHRSVQIIRDKQELAETQ
Subjt:  QNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQ

Query:  KDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQ
        KDLAKLQVSQKEPS SSHSQSNEERASSVASDPKKNEN SEIH QQLALALPHQIVPQQNPI P SA LP N+PQ QQ YYI  +QL GQPPHIQHA GQ
Subjt:  KDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQ

Query:  YISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPG
        YISPD QHRA QPQD+S  +NPQLSQ+PPQPFNQYQQQWAQ PSQQPQPPQQ SMQPQ RPPP S Y  PYPPNQP+S+PETLSS+    MSFASIP PG
Subjt:  YISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPG

Query:  SNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHV
        S+R D  PYGY AASGGS+PQQPPQVKN YGPATGEGY+PPGQ    AYMMYDRESGRPPHHPP       QQPHFNQSGYPPANAPHQ+ PQA T P V
Subjt:  SNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHV

Query:  SARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRA
        S+RNPSHSHLIEKLVGMGFRGDHVA+IIQRMED G+PVDFN VLDRLS+   PGPQRA
Subjt:  SARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRA

XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]1.6e-25690.07Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVSVTN  KDFHKSRMSTVFPA  YGQADD+ISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQE            VHRSVQIIRDKQELAETQKDLAKLQVSQKEPS S+HSQSNEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS
        ASSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQ QQ YYI  SQLPGQPPHIQHAQ QYIS DSQHRASQPQD+SQMSNPQLS
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS

Query:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ
        QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ Q RPPPPSVYPS YPPNQPTSMPETL SSMPMQMSF SIPQPGS+RVDAGPYGY   SGGSAPQQPPQ
Subjt:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ

Query:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA
        VKNAYGP TGEGYMPPGQQSGGAYMMYDRESGRPPHHPP       QQ HFNQSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVA
Subjt:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA

Query:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        +IIQRMEDSGQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]2.6e-26792.72Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYG--QADDSISQNVISTVENSMKKHSDNLLRFLE
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVS+TNSTKDFHKSRMSTVFPA +YG  QADDSISQNVISTVENSMKKHSDNLLRFLE
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYG--QADDSISQNVISTVENSMKKHSDNLLRFLE

Query:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNE
        GISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQE            VHRSVQIIRDKQELAETQKDLAKLQVSQKEPS SSHSQSNE
Subjt:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNE

Query:  ERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQ
        ERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQN IT PSAALPQNMPQ QQ YYI SSQLPGQPPH+QHAQGQYISPDS +RASQPQD+SQMSNPQ
Subjt:  ERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQ

Query:  LSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQP
        LSQTPPQPFNQY QQWAQPPSQQPQPPQQPSMQPQ RPPPPSVYPS YPPNQPTSMPETLSSSMPM MSF SIPQPGS+R+DAGPYGY AASGGSAPQQP
Subjt:  LSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQP

Query:  PQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDH
        PQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPAN  HQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDH
Subjt:  PQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDH

Query:  VATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        VA+IIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
Subjt:  VATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

TrEMBL top hitse value%identityAlignment
A0A0A0K720 DUF1421 domain-containing protein4.6e-25489.35Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVSV N  KDFHK RMSTVFPA  YGQADD+ISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQE            VHRSVQIIRDKQELAETQKDLAKLQVSQKEPS ++HSQSNEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS
        ASSVASDPKK EN SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQ QQ YYI  SQLPGQPPHIQHAQ QYI  DSQHRASQPQD+SQMSNPQLS
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS

Query:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPP
        QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ Q RPPPPSVYPS Y PPNQPTSMPETL SSMPMQMSF SIPQPGS+RVDAGPYGY A SGGSAPQQPP
Subjt:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPP

Query:  QVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV
        QVKNAYGP TGEGYMPPGQQSGGAYMMYDRESGRPPHHPP       QQ HFNQSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHV
Subjt:  QVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV

Query:  ATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        A+IIQRMEDSGQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  ATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like7.6e-25790.07Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVSVTN  KDFHKSRMSTVFPA  YGQADD+ISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQE            VHRSVQIIRDKQELAETQKDLAKLQVSQKEPS S+HSQSNEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS
        ASSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQ QQ YYI  SQLPGQPPHIQHAQ QYIS DSQHRASQPQD+SQMSNPQLS
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS

Query:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ
        QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ Q RPPPPSVYPS YPPNQPTSMPETL SSMPMQMSF SIPQPGS+RVDAGPYGY   SGGSAPQQPPQ
Subjt:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ

Query:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA
        VKNAYGP TGEGYMPPGQQSGGAYMMYDRESGRPPHHPP       QQ HFNQSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVA
Subjt:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA

Query:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        +IIQRMEDSGQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like4.9e-25689.7Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVS+TN  KDFHKSRMSTVFPA  Y QADD+ISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQE            VHRSVQIIRDKQELAETQKDLAKLQVSQKEPS S+HSQSNEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS
        ASSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQ QQ YYI  SQLPGQPPHIQHAQ QYIS DSQHRASQPQD+SQMSNPQLS
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLS

Query:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ
        QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ Q RPPPPSVYPS YPPNQPTSMPETL SSMPMQMSF SIPQPGS+RVDAGPYGY   SGGSAPQQPPQ
Subjt:  QTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQ

Query:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA
        VKNAYGP TGEGYMPPGQQSGGAYMMYDRESGRPPHHPP       QQ HFNQSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVA
Subjt:  VKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVA

Query:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        +IIQRMEDSGQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  TIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like1.7e-24887.43Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGSH+DPVSV NS+KDFHKSRMSTVFP  +YGQ DDSI+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQE            VHRSVQIIRDKQELAETQKDLAKLQV QKEPSLSSHSQ+NEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQHQQPYYIPSSQLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQ
           V++DPKKNENPSEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQ QQ YYI SSQLPG QP HIQHAQ QYIS DSQHRASQPQD+SQM+NPQ
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQHQQPYYIPSSQLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQ

Query:  LSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQ
        LSQT PQPFNQYQQQWAQPPSQ  QPPQQ SMQPQ RPPP SVYPSPY PPNQPTSMPETLSSSMPMQMSFASIPQPGS+R DA PYGY AASGGSAPQQ
Subjt:  LSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPY-PPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQ

Query:  PPQVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMG
        PPQVKNAYGPATGEGYMPPGQQ    SGGAYMMYDRESGRPPHH PQQPHHPSQQ HF+QSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMG
Subjt:  PPQVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMG

Query:  FRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        FRGDHVA+IIQRMEDSGQ VDFNAVLDRLSTP GPGPQRAW
Subjt:  FRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

A0A6J1HZW1 ataxin-2 homolog9.0e-25087.38Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGSH+DPVSV NS+KDFHKSRMSTVFP  +YGQ DDSI+Q+VI+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQE            VHRSVQIIRDKQELAETQKDLAKLQV QKEPSLSSHSQ+NEER
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEER

Query:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQL
           V++DPKKNENPSEIHNQQLALALPHQIVPQQNP+TPPSAALPQN+PQ  Q YYI SSQLPG QP HIQHAQ QYIS DS HRASQPQD+SQM+NPQL
Subjt:  ASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPG-QPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQL

Query:  SQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPP
        SQT PQPFNQYQQQWAQPPSQ  QPPQQ SMQPQ RPPP SVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGS+R DA PYGY AASGGSAPQQPP
Subjt:  SQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPP

Query:  QVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFR
        QVKNAYGPATGEGYMPPGQQ    SGGAYMMYDRESGRPPHH PQQPHHPSQQ HFNQSGYPPANAP QVPPQAPTGPHVSARNPSHSHLIEKLVGMGFR
Subjt:  QVKNAYGPATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFR

Query:  GDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        GDHVA+IIQRMEDSGQ VDFNAVLDRLSTP GPGPQRAW
Subjt:  GDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)9.5e-1828.38Show/hide
Query:  NGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADS
        + S   PVS T+   +F    + ++ P+        ++    I +  ++ +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H   D 
Subjt:  NGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADS

Query:  KLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEE-RASSVASDPKKNENPSEIHNQQLALALPHQIVPQ
        K++ L+  + E            V   VQ+++DKQE+ E Q              LS H  SN+  +  S+  DP   ++P+ +  QQ     P    PQ
Subjt:  KLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEE-RASSVASDPKKNENPSEIHNQQLALALPHQIVPQ

Query:  QNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQP----S
             PPS+    + P        PSSQLP Q P    +Q +   P   H    P +      PQ +QTP QP   YQ      P QQPQ PQQP     
Subjt:  QNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQP----S

Query:  MQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDR
          P+ +PP        YPPN P   P   + S P Q  F + PQP  +  D    G G  S    P                GY+       G+ M    
Subjt:  MQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDR

Query:  ESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHS----------HLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVL
         S +PPH               N +GYP  +    +P   P    VS+   S S           +I+++  MGF  D V   ++++ ++GQ VD N VL
Subjt:  ESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHS----------HLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVL

Query:  DRLSTPTGPGP
        D+L    G  P
Subjt:  DRLSTPTGPGP

AT4G28300.1 Protein of unknown function (DUF1421)5.8e-10849.72Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDP-VSVTNSTKDFHKSRM--STVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFL
        MASGS+GR NS  K FDFGSDDILCS++DY  QD SNG H+DP ++ +NS K+FHK+RM  S+VFP  SY   +DS+SQ++  TVE +MK ++DN++RFL
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDP-VSVTNSTKDFHKSRM--STVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFL

Query:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSN
        EG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QE            VHRSVQI+RDKQELA+TQK+LAKLQ+ QKE S SSHSQ  
Subjt:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSN

Query:  EERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYI--PSSQLPGQPP--HIQHAQGQYISPDSQHRASQPQDISQ
        E+R ++   +PKK+EN S+ HNQQLALALPHQI PQ           PQ  PQ  Q YY+  P +QL   P    +     Q  +P +Q +   P     
Subjt:  EERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYI--PSSQLPGQPP--HIQHAQGQYISPDSQHRASQPQDISQ

Query:  MSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGS
         S+P  +QT  Q F QYQQ W          P QP  +PQ+    P+  P+P P NQP    E+L SSM MQ  ++  PQ          YGYGAA    
Subjt:  MSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGS

Query:  APQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQ----SGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKL
        AP  P Q K +Y P TG+GY+P G      Y     E GR   +PP QP    QQ H+ Q     GY P   PHQ        P V      +  LIEKL
Subjt:  APQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQPHHPSQQPHFNQ----SGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKL

Query:  VGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        V MGFRGDHV  +IQRME+SGQP+DFN +LDRLS  +  GP R W
Subjt:  VGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421)2.3e-8042.12Show/hide
Query:  STVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETCTTFSNHVAILN
        S+V+P ++Y  PEDS+ QD+  TVE +MK Y+DN++RFLEG+SSRLSQLEL CYNLDK++GEMRS++   HE+ D+KL+SL+KHLQE             
Subjt:  STVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETCTTFSNHVAILN

Query:  EFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERAS
                                                               VHRSVQI+RDKQELA+TQK+LAKL L+QKESSSSSHS   ++R +
Subjt:  EFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERAS

Query:  PVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQYQQLPDVSRLPSQMTNP--QLSQTPP
            +PKK+EN S+ H NQQLALALPHQI PQ        P   PQ  PQQ  YY     MP  PT +Q+          VS  PSQ+  P  Q    PP
Subjt:  PVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQYQQLPDVSRLPSQMTNP--QLSQTPP

Query:  P-----------QQFNQYQQQWTQQQPPQQVQPPQQQPSMQPQIRPPPTSVYSSYSMNQPTSMP--ETLSNSMPMQLSFSPIPQPGSSRMDTVPYGYVGS
        P           Q F QYQQ W          PP      QPQ RP  +  Y +YS   P + P  E+L +SM MQ  +S  PQ          YGY   
Subjt:  P-----------QQFNQYQQQWTQQQPPQQVQPPQQQPSMQPQIRPPPTSVYSSYSMNQPTSMP--ETLSNSMPMQLSFSPIPQPGSSRMDTVPYGYVGS

Query:  GVTVSQQLP--QVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHVIPRNP
        G     Q P  Q K ++ PQ G+GYLPSGP     SG    MY  E GR  + PPQPQQ      Q QQ H+ Q    P         H +G      N 
Subjt:  GVTVSQQLP--QVKNAFGPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHVIPRNP

Query:  NHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLS--NSGGPQR
          P  +R++     YGE++EKLV MGFR DHV +VI RMEESGQP+DFN +LD LS  +SGGP R
Subjt:  NHPHFMRNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLS--NSGGPQR

AT5G14540.1 Protein of unknown function (DUF1421)1.4e-2129.8Show/hide
Query:  TDPVSVTNSTKDFHKSRMSTVFPAGSYGQAD-DSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL
        +DP  V+ S+   + S M ++ P+  + + D +S    +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ L
Subjt:  TDPVSVTNSTKDFHKSRMSTVFPAGSYGQAD-DSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL

Query:  EKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQ---VSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNP
        E  + E            V   VQ+++DKQE+ E Q  L+KLQ   V+Q+  + S+H +   +  +S+   P     P  +  Q L      Q    Q+ 
Subjt:  EKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQ---VSQKEPSLSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNP

Query:  ITPPSAALPQ---NMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ--
        ++PPS  LPQ        Q+PY+ PS Q   QPP         I P  Q                    PP P     Q   QPP QQPQ PQQP  Q  
Subjt:  ITPPSAALPQ---NMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQ--

Query:  ------PQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMP---PGQQSGG
              P+  P P   YP P PP QP S P     S P Q  + + P P S     G         G +P+  P      GP +  G  P   P  QSG 
Subjt:  ------PQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMP---PGQQSGG

Query:  AYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRL
                SG  P  P  +P            G P A+A         +    S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L
Subjt:  AYMMYDRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTAACTGTATTGCCATGTTACAAGCTTGTAATGATTTTCATAAAAGTAGAATGTCTACTGTGTATCCTGCTGCTGCCTATGGTCAACCTGAAGATTCC
ATCAAACAAGATGTGATTTCTACTGTTGAGAACAGCATGAAAAAGTATTCTGATAATATTTTGCGTTTTCTAGAGGGAATAAGTTCACGCCTATCACAACTTGAA
CTGAATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGATCTGATGTTCTTCGTGACCATGAAGAGGAAGATTTGAAGCTTAAATCTCTTGAGAAGCATCTA
CAAGAGACATGCACAACTTTTAGCAATCATGTAGCAATATTAAATGAGTTTCCAACTCAAACTCAACGCTTGTCTGTTGAATCTGGAAATCACGCACTTGATGCT
GATCCTGCCATGTTGTCATCCATGCTCTTATTTTGGATGCTTGAACTATTCAACTGTCACCATAATATTTTTGTGTTACTTTTACTTCATTGTACAAAAGACCAA
GTCCACAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTTGCTGAGACTCAGAAAGACCTAGCCAAACTTCATCTTTTGCAGAAGGAGTCATCTTCATCCAGC
CATTCACATTCAAATGATGAGAGAGCTTCACCAGTTGCCTCTGATCCTAAGAAAAATGAAAACCCGTCTGAGAATCACAATAATCAGCAATTAGCTCTCGCCCTG
CCGCACCAGATTGTCCCACAGCAAAATCCTATTACACCACCCCCTCCAGCAGCTTTGCCACAGAATGTGCCTCAACAGCAATCTTATTACATCCCTTCAAACCAA
ATGCCAAGTCAACCAACCCATATCCAGCATGCTCAGACCCAATATCAACAACTTCCAGATGTTTCTCGGTTGCCATCACAAATGACTAATCCCCAGCTAAGTCAA
ACTCCACCGCCACAACAATTCAATCAGTATCAACAGCAATGGACGCAGCAGCAGCCACCTCAACAGGTACAACCACCACAACAACAGCCTTCTATGCAACCTCAG
ATCAGGCCGCCACCGACTTCAGTCTACTCTTCTTATTCGATGAATCAACCAACTTCTATGCCAGAGACTCTGTCAAACAGCATGCCTATGCAATTGTCATTTTCA
CCTATTCCTCAGCCAGGCTCAAGCCGCATGGACACCGTGCCATATGGATATGTTGGAAGTGGTGTTACCGTGTCCCAGCAACTTCCTCAAGTCAAAAATGCTTTT
GGACCACAAGCTGGAGAAGGTTACTTACCTTCTGGACCGCAATCTGGGCTTTCCTCGGGAGGTACATATATGATGTATGATAGGGAAAGCGGAAGACCATCACAC
CATCCTCCTCAACCTCAACAACCTCCACACCATTCTCCTCAACCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCAGCCAATGCATCTCTTCAGATTCCT
CAGCATCCATCAGGCCCCCACGTTATCCCCAGGAATCCGAACCATCCGCATTTTATGCGCAACCAAAACCAAAACCACCCTTACGGTGAAATAGTTGAGAAACTG
GTTGGCATGGGTTTCAGGAGCGACCACGTCGCTAGTGTAATTCATAGGATGGAGGAGAGCGGCCAACCTGTCGACTTCAATGCTGTTCTAGATGGGTTGAGTAAT
TCCGGAGGTCCTCAGCGGAGTCGCCTGTTTGTTTGTATGTTGAAGCTCATATCGGCCATGACCAGCCTAGCCTTTGCGCATCCGAAGAAAATTTTGGTCTCGATA
ATCGGAAATCTCAATCATCTCTCTCTCTTCTCCCCAAAAAATCTCAGCCTTCATTCCCATTCTTTCCGTTACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGT
CGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACCCTTCAAACGGTAGCCACACCGAT
CCCGTTTCCGTTACAAATTCTACCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGGTTCCTATGGTCAAGCAGATGATTCCATTAGTCAAAAT
GTGATTTCTACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTACGTTTTCTTGAGGGAATAAGTTCTCGCCTATCACAACTTGAACTATATTGCTAC
AACCTTGATAAATCCGTTGGAGAAATGAGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTGCAGGAGGTGAGC
TACAAGTTTAGAACTTTGATGAGTTTTATAGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAG
GTCTCGCAGAAAGAGCCATCTTTGTCAAGCCATTCACAGTCAAATGAGGAGAGGGCTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATT
CACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAACCCTATCACACCCCCTTCAGCAGCTTTACCTCAGAATATGCCTCAACATCAG
CAACCTTACTACATCCCTTCATCCCAATTGCCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACCCGATTCCCAACACCGTGCATCACAA
CCGCAAGATATTTCACAAATGTCCAATCCCCAACTAAGTCAAACTCCACCACAACCATTCAATCAGTATCAACAACAGTGGGCGCAGCCACCATCTCAGCAGCCA
CAACCTCCTCAACAGCCTTCAATGCAACCTCAGACCAGACCACCACCCCCTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCAACTTCTATGCCCGAGACG
CTCTCAAGCAGCATGCCCATGCAAATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAACCGTGTGGATGCAGGGCCTTATGGGTATGGCGCTGCAAGTGGTGGT
TCTGCTCCACAGCAACCTCCTCAAGTGAAAAATGCTTATGGTCCGGCAACTGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTAT
GATAGGGAAAGTGGAAGACCGCCACACCATCCGCCTCAACAACCACACCATCCATCTCAACAACCACACTTCAATCAAAGTGGCTATCCTCCGGCCAACGCACCT
CATCAGGTTCCTCCTCAGGCTCCAACAGGCCCCCATGTTTCAGCCAGGAATCCAAGTCACTCACATCTAATTGAAAAACTGGTTGGCATGGGTTTCAGGGGTGAC
CATGTTGCAACTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTAGACAGGTTGAGTACTCCTACAGGTCCAGGTCCACAAAGA
GCGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTAACTGTATTGCCATGTTACAAGCTTGTAATGATTTTCATAAAAGTAGAATGTCTACTGTGTATCCTGCTGCTGCCTATGGTCAACCTGAAGATTCC
ATCAAACAAGATGTGATTTCTACTGTTGAGAACAGCATGAAAAAGTATTCTGATAATATTTTGCGTTTTCTAGAGGGAATAAGTTCACGCCTATCACAACTTGAA
CTGAATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGATCTGATGTTCTTCGTGACCATGAAGAGGAAGATTTGAAGCTTAAATCTCTTGAGAAGCATCTA
CAAGAGACATGCACAACTTTTAGCAATCATGTAGCAATATTAAATGAGTTTCCAACTCAAACTCAACGCTTGTCTGTTGAATCTGGAAATCACGCACTTGATGCT
GATCCTGCCATGTTGTCATCCATGCTCTTATTTTGGATGCTTGAACTATTCAACTGTCACCATAATATTTTTGTGTTACTTTTACTTCATTGTACAAAAGACCAA
GTCCACAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTTGCTGAGACTCAGAAAGACCTAGCCAAACTTCATCTTTTGCAGAAGGAGTCATCTTCATCCAGC
CATTCACATTCAAATGATGAGAGAGCTTCACCAGTTGCCTCTGATCCTAAGAAAAATGAAAACCCGTCTGAGAATCACAATAATCAGCAATTAGCTCTCGCCCTG
CCGCACCAGATTGTCCCACAGCAAAATCCTATTACACCACCCCCTCCAGCAGCTTTGCCACAGAATGTGCCTCAACAGCAATCTTATTACATCCCTTCAAACCAA
ATGCCAAGTCAACCAACCCATATCCAGCATGCTCAGACCCAATATCAACAACTTCCAGATGTTTCTCGGTTGCCATCACAAATGACTAATCCCCAGCTAAGTCAA
ACTCCACCGCCACAACAATTCAATCAGTATCAACAGCAATGGACGCAGCAGCAGCCACCTCAACAGGTACAACCACCACAACAACAGCCTTCTATGCAACCTCAG
ATCAGGCCGCCACCGACTTCAGTCTACTCTTCTTATTCGATGAATCAACCAACTTCTATGCCAGAGACTCTGTCAAACAGCATGCCTATGCAATTGTCATTTTCA
CCTATTCCTCAGCCAGGCTCAAGCCGCATGGACACCGTGCCATATGGATATGTTGGAAGTGGTGTTACCGTGTCCCAGCAACTTCCTCAAGTCAAAAATGCTTTT
GGACCACAAGCTGGAGAAGGTTACTTACCTTCTGGACCGCAATCTGGGCTTTCCTCGGGAGGTACATATATGATGTATGATAGGGAAAGCGGAAGACCATCACAC
CATCCTCCTCAACCTCAACAACCTCCACACCATTCTCCTCAACCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCAGCCAATGCATCTCTTCAGATTCCT
CAGCATCCATCAGGCCCCCACGTTATCCCCAGGAATCCGAACCATCCGCATTTTATGCGCAACCAAAACCAAAACCACCCTTACGGTGAAATAGTTGAGAAACTG
GTTGGCATGGGTTTCAGGAGCGACCACGTCGCTAGTGTAATTCATAGGATGGAGGAGAGCGGCCAACCTGTCGACTTCAATGCTGTTCTAGATGGGTTGAGTAAT
TCCGGAGGTCCTCAGCGGAGTCGCCTGTTTGTTTGTATGTTGAAGCTCATATCGGCCATGACCAGCCTAGCCTTTGCGCATCCGAAGAAAATTTTGGTCTCGATA
ATCGGAAATCTCAATCATCTCTCTCTCTTCTCCCCAAAAAATCTCAGCCTTCATTCCCATTCTTTCCGTTACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGT
CGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACCCTTCAAACGGTAGCCACACCGAT
CCCGTTTCCGTTACAAATTCTACCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGGTTCCTATGGTCAAGCAGATGATTCCATTAGTCAAAAT
GTGATTTCTACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTACGTTTTCTTGAGGGAATAAGTTCTCGCCTATCACAACTTGAACTATATTGCTAC
AACCTTGATAAATCCGTTGGAGAAATGAGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTGCAGGAGGTGAGC
TACAAGTTTAGAACTTTGATGAGTTTTATAGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAG
GTCTCGCAGAAAGAGCCATCTTTGTCAAGCCATTCACAGTCAAATGAGGAGAGGGCTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATT
CACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAACCCTATCACACCCCCTTCAGCAGCTTTACCTCAGAATATGCCTCAACATCAG
CAACCTTACTACATCCCTTCATCCCAATTGCCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACCCGATTCCCAACACCGTGCATCACAA
CCGCAAGATATTTCACAAATGTCCAATCCCCAACTAAGTCAAACTCCACCACAACCATTCAATCAGTATCAACAACAGTGGGCGCAGCCACCATCTCAGCAGCCA
CAACCTCCTCAACAGCCTTCAATGCAACCTCAGACCAGACCACCACCCCCTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCAACTTCTATGCCCGAGACG
CTCTCAAGCAGCATGCCCATGCAAATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAACCGTGTGGATGCAGGGCCTTATGGGTATGGCGCTGCAAGTGGTGGT
TCTGCTCCACAGCAACCTCCTCAAGTGAAAAATGCTTATGGTCCGGCAACTGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTAT
GATAGGGAAAGTGGAAGACCGCCACACCATCCGCCTCAACAACCACACCATCCATCTCAACAACCACACTTCAATCAAAGTGGCTATCCTCCGGCCAACGCACCT
CATCAGGTTCCTCCTCAGGCTCCAACAGGCCCCCATGTTTCAGCCAGGAATCCAAGTCACTCACATCTAATTGAAAAACTGGTTGGCATGGGTTTCAGGGGTGAC
CATGTTGCAACTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTAGACAGGTTGAGTACTCCTACAGGTCCAGGTCCACAAAGA
GCGTGGTGACAGTAAATTAATCATCCCTGTTTGCGGCCGATACTGGCCATGACCAGCCTCATACATTGCGTCTTTTTAATGCATTGAATAAAACACTGGTTTATG
ATTTAATTGTCCTCGTATATATTGTCATGGTTTGTGAGATTTAAAAACGTCGGCTGTATGATTTAAACTTGCGTGTGAATATCTTCTTCTTCCACATTCCGATCT
TTCCATATAATTCCCTTTGATGTTTATGTGCACCTTTTAATGATTAATCAGGGTTGTAATTCATTATCTGAATCTTTATTTTTGTGGGGGTAGATTGAGGTTTTG
GCTCTGTAAGCGAGGGTGAGGCCTTTCTCTAAGTGTGATAGGGTGTGTAACGATGCTGCCTATTGTCTCGCTCGGGCGGTGTCTAGTCACAGGGATTTTGTTTGT
GCTTTTGTGGGTGTTTCTTCTTTTTAGCTTGGGAGAAGCTGTTGTTGGGACTGTTTCTTCGGCTGTGTTTGAGGAAGTTGTTAGTTGATTCTTGCTGATTTTTCC
CTTGGAAAAGTTTAGTTTTGACAAAGATAAGATTGCTTGTTTTCCCCAAATTTTACTGCAAGTTTCCTACTTTTCATACCTGAATCTTTTTCGAAGCTCGAGCCA
TGCTCCCTTAGACAAGGAAGAAGAAAGAATTCATTCTTGCCGACACTCTATGTGTGGGTGTCCTTTGCTAGGAGTCAAGGACACAACAATATTGAAAATGTACAT
TGCTATTGCCTGTTGCTCTGATTCAATTTACATTAACTTACAGACAAACAAGTAGTTGAGCTGTCCCTTTCTTCTGGGTCGTAGGTCCCAGATATATCAAT
Protein sequenceShow/hide protein sequence
MDFNCIAMLQACNDFHKSRMSTVYPAAAYGQPEDSIKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHL
QETCTTFSNHVAILNEFPTQTQRLSVESGNHALDADPAMLSSMLLFWMLELFNCHHNIFVLLLLHCTKDQVHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSS
HSHSNDERASPVASDPKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQMPSQPTHIQHAQTQYQQLPDVSRLPSQMTNPQLSQ
TPPPQQFNQYQQQWTQQQPPQQVQPPQQQPSMQPQIRPPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRMDTVPYGYVGSGVTVSQQLPQVKNAF
GPQAGEGYLPSGPQSGLSSGGTYMMYDRESGRPSHHPPQPQQPPHHSPQPQQPHFNQSGYPPANASLQIPQHPSGPHVIPRNPNHPHFMRNQNQNHPYGEIVEKL
VGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAFAHPKKILVSIIGNLNHLSLFSPKNLSLHSHSFRYSLRSMASGSAG
RPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSTKDFHKSRMSTVFPAGSYGQADDSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCY
NLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVSYKFRTLMSFIVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSLSSHSQSNEERASSVASDPKKNENPSEI
HNQQLALALPHQIVPQQNPITPPSAALPQNMPQHQQPYYIPSSQLPGQPPHIQHAQGQYISPDSQHRASQPQDISQMSNPQLSQTPPQPFNQYQQQWAQPPSQQP
QPPQQPSMQPQTRPPPPSVYPSPYPPNQPTSMPETLSSSMPMQMSFASIPQPGSNRVDAGPYGYGAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMY
DRESGRPPHHPPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVATIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQR
AW