; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0017608 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0017608
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptiontranscription factor SPT20 homolog isoform X1
Genome locationchr01:2441991..2445129
RNA-Seq ExpressionIVF0017608
SyntenyIVF0017608
Gene Ontology termsNA
InterPro domainsIPR010820 - UBA-like domain DUF1421


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031573.1 arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa]0.0100Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY

Query:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
        MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
Subjt:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD

Query:  RLSSPSGPGPQRAW
        RLSSPSGPGPQRAW
Subjt:  RLSSPSGPGPQRAW

XP_004136824.1 trithorax group protein osa [Cucumis sativus]0.097.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFPA+GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP-NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP-NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]0.099.61Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+TNPGKDFHKSRMSTVFPAAGY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY

Query:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
        MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
Subjt:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD

Query:  RLSSPSGPGPQRAW
        RLSSPSGPGPQRAW
Subjt:  RLSSPSGPGPQRAW

XP_023554446.1 trithorax group protein osa-like [Cucurbita pepo subsp. pepo]1.15e-30386.72Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVS+ N  KDFHKSRMSTVFP A Y Q DD+I+Q+VI+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEA+SKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS S+HSQ+NEER   V++D KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQN+PQQQQSYYIS SQLPGQ P HIQHAQ+QYISSDSQHRASQPQDVS M+NPQLSQTP QPFNQYQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ

Query:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGE
        QQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPNQPTSMPETL SSMPMQMSF  IPQPGSSR DA PYGYA  SGGSAPQQPPQVKNAYGP TGE
Subjt:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGE

Query:  GYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME
        GYMPPGQQ    SGGAYMMYDRESGRPPHH       P QQ+HFNQSGYP ANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME
Subjt:  GYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME

Query:  DSGQPVDFNAVLDRLSSPSGPGPQRAW
        DSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  DSGQPVDFNAVLDRLSSPSGPGPQRAW

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]0.091.98Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGY--AQADDTISQNVISTVENSMKKHSDNLLRFLE
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVSITN  KDFHKSRMSTVFPAA Y  AQADD+ISQNVISTVENSMKKHSDNLLRFLE
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGY--AQADDTISQNVISTVENSMKKHSDNLLRFLE

Query:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKK
        GISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEERASSVASD KK
Subjt:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKK

Query:  KENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
         ENPSEIHNQQLALALPHQIVPQQN IT PSAALPQNMPQQQQSYYIS SQLPGQPPH+QHAQ QYIS DS +RASQPQDVSQMSNPQLSQTPPQPFNQY
Subjt:  KENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTG
        QQ WAQPPSQQPQPPQQPSMQ QIRPPPPSVYPSTYPPNQPTSMPETL SSMPM MSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYGP TG
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTG

Query:  EGYMPPGQQSGGAYMMYDRESGRPPHHPPQQAH-------FNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSG
        EGYMPPGQQSGGAYMMYDRESGRPPHHPPQQ H       FNQSGYP AN  HQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSG
Subjt:  EGYMPPGQQSGGAYMMYDRESGRPPHHPPQQAH-------FNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSG

Query:  QPVDFNAVLDRLSSPSGPGPQRAW
        QPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  QPVDFNAVLDRLSSPSGPGPQRAW

TrEMBL top hitse value%identityAlignment
A0A0A0K720 DUF1421 domain-containing protein4.2e-27697.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFPA+GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY-PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY-PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like1.0e-28299.61Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+TNPGKDFHKSRMSTVFPAAGY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY

Query:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
        MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
Subjt:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD

Query:  RLSSPSGPGPQRAW
        RLSSPSGPGPQRAW
Subjt:  RLSSPSGPGPQRAW

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like2.7e-283100Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGY

Query:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
        MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD
Subjt:  MPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLD

Query:  RLSSPSGPGPQRAW
        RLSSPSGPGPQRAW
Subjt:  RLSSPSGPGPQRAW

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like5.1e-23786.39Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVS+ N  KDFHKSRMSTVFP A Y Q DD+I+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS S+HSQ+NEER   V++D KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
        NPSEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQQQQSYYIS SQLPG QP HIQHAQ+QYISSDSQHRASQPQDVSQM+NPQLSQT PQPFNQY
Subjt:  NPSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTY-PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPT
        QQQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS Y PPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYA  SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTY-PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
        GEGYMPPGQQ    SGGAYMMYDRESGRP       PHHP QQ+HF+QSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR

Query:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW
        MEDSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW

A0A6J1HZW1 ataxin-2 homolog2.7e-23886.34Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVS+ N  KDFHKSRMSTVFP A Y Q DD+I+Q+VI+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS S+HSQ+NEER   V++D KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ
        NPSEIHNQQLALALPHQIVPQQNP+TPPSAALPQN+PQQ QSYYIS SQLPG QP HIQHAQ+QYISSDS HRASQPQDVSQM+NPQLSQT PQPFNQYQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ

Query:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGE
        QQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYA  SGGSAPQQPPQVKNAYGP TGE
Subjt:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGE

Query:  GYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME
        GYMPPGQQ    SGGAYMMYDRESGRP       PHHP QQ+HFNQSGYP ANAP QVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME
Subjt:  GYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRME

Query:  DSGQPVDFNAVLDRLSSPSGPGPQRAW
        DSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  DSGQPVDFNAVLDRLSSPSGPGPQRAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)5.2e-2431.11Show/hide
Query:  NGSLSD--PVSITNPGKDF---HKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEE
        N S SD  PVS T+P  +F        S + P  G    + TI   +I   + +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H  
Subjt:  NGSLSD--PVSITNPGKDF---HKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEE

Query:  ADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLAL-ALPHQIVPQQNPITPPSA
         D K++ L+  + EV   VQ+++DKQE+ E Q  L+K QV      S+ H++++       A      ++P+ +  QQ  L + P        P  PPS+
Subjt:  ADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLAL-ALPHQIVPQQNPITPPSA

Query:  ALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP
         LP  +P Q      S  Q P  PP               H    P +      PQ +QTP QP   YQ      P QQPQ PQQP      PP     P
Subjt:  ALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP

Query:  STYPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHP
           PP Q  S P   P   P   S PS      PQP  S  D        G+GG +    P            GY+       G+ M     S +PPH  
Subjt:  STYPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHP

Query:  PQQAHFNQSGYP-LANA---PHQVP---PQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGP
              N +GYP L+N+   PH +P     +  G   S R+ S +    +I+++  MGF  D V + ++++ ++GQ VD N VLD+L +  G  P
Subjt:  PQQAHFNQSGYP-LANA---PHQVP---PQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGP

AT4G28300.1 Protein of unknown function (DUF1421)5.8e-10849.72Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSITNPGKDFHKSRM--STVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFL
        MASGS+GR NS  K FDFGSDDILCS++DY  QD SNG  SDP ++ +N  K+FHK+RM  S+VFP + Y+  +D++SQ++  TVE +MK ++DN++RFL
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSITNPGKDFHKSRM--STVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFL

Query:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSK
        EG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SSS+HSQ  E+R ++   + K
Subjt:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSK

Query:  KKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQM-------SNPQLSQT
        K EN S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q+  +    S    +   P   SQ        S+P  +QT
Subjt:  KKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQM-------SNPQLSQT

Query:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMP--ETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPP-Q
          Q F QYQQ W         PPQ     Q RP     YP TY P  P + P  E+LPSSM MQ  +   PQ     + A  YG AP      PQ PP Q
Subjt:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMP--ETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPP-Q

Query:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQAHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV
         K +Y P TG+GY+P G      Y     E GR  + PP      QQAH+ Q   G   +  PHQ        P V      +  LIEKLV MGFRGDHV
Subjt:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQAHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV

Query:  ASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW
         ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  ASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421)1.9e-8748.1Show/hide
Query:  STVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE
        S+VFP + Y+  +D++SQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQE
Subjt:  STVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ
        LA+TQK+LAKLQ+ QKE SSS+HSQ  E+R ++   + KK EN S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q
Subjt:  LAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ

Query:  HAQSQYISSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMP--ETLPSSM
        +  +    S    +   P   SQ        S+P  +QT  Q F QYQQ W         PPQ     Q RP     YP TY P  P + P  E+LPSSM
Subjt:  HAQSQYISSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPNQPTSMP--ETLPSSM

Query:  PMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPP-QVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQAHFNQ--SGYPLAN
         MQ  +   PQ     + A  YG AP      PQ PP Q K +Y P TG+GY+P G      Y     E GR  + PP      QQAH+ Q   G   + 
Subjt:  PMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPP-QVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQAHFNQ--SGYPLAN

Query:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW
         PHQ        P V      +  LIEKLV MGFRGDHV ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW

AT5G14540.1 Protein of unknown function (DUF1421)3.0e-2430.95Show/hide
Query:  SDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL
        SDP  ++      + S M ++ P+  +A+ D    ++ +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ L
Subjt:  SDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL

Query:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQ---VSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNM
        E  + EV   VQ+++DKQE+ E Q  L+KLQ   V+Q+  + S H +   +  +S+         P  +  Q L      Q    Q+ ++PPS  LPQ +
Subjt:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQ---VSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNM

Query:  PQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-S
        P Q      S  Q P  PP  Q               SQP    Q         PP P     Q   QPP QQPQ PQQP  Q   P    P    YP  
Subjt:  PQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-S

Query:  TYPPNQPTSMP-ETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQ
        +YPPN P   P    P S P Q  + + P P S     G    +    G +P+  P      GPP+  G  P   P  QSG         SG  P  P  
Subjt:  TYPPNQPTSMP-ETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQ

Query:  QAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL
         A     G P+A+A         +    S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L
Subjt:  QAHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTCCCCTAAATCCTTTGATTTTGGTTCTGATGATATTCTTTGCTCTTTTGAAGACTACGGTAAACAGGACCCTTCAAA
CGGTAGCCTTAGCGATCCCGTTTCCATTACCAATCCTGGCAAGGATTTTCACAAGAGTAGAATGTCTACAGTATTCCCTGCTGCTGGCTATGCTCAAGCAGATGATACCA
TTAGTCAAAATGTGATTTCCACGGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTCTAT
TGCTACAACCTTGATAAATCCGTTGGAGAAATGCGGTCTGAATTAGCTCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTTCAAGAGGTCCA
CAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCAAACCATTCACAGT
CTAATGAGGAGAGGGCTTCTTCAGTTGCCTCTGATTCTAAAAAGAAGGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTACCACATCAGATCGTTCCA
CAGCAGAATCCTATAACACCCCCTTCAGCAGCTTTGCCTCAGAATATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAGTTGCCTGGTCAACCACCCCATAT
CCAGCATGCTCAGAGCCAATACATCTCATCTGATTCCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAGCTAAGTCAAACTCCACCACAGC
CATTCAATCAATATCAACAACAATGGGCGCAACCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCACCCCCTTCAGTCTACCCG
TCTACTTATCCACCTAATCAACCAACTTCTATGCCTGAGACGCTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCAGGTTCAAGTCGTGTGGA
TGCAGGGCCTTATGGGTACGCTCCTGGAAGTGGTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGAGGGATATATGCCTCCTG
GACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGTGGAAGACCGCCACACCATCCACCTCAACAAGCACACTTCAATCAAAGCGGATATCCTCTAGCCAAT
GCACCACATCAGGTTCCTCCTCAAGCTCCAGCGGGACCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTGGTTGGCATGGGTTTCAGGGGTGA
CCATGTTGCAAGTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTCGACAGGTTGAGTTCTCCTTCAGGTCCAGGTCCACAAAGAGCTT
GGTGA
mRNA sequenceShow/hide mRNA sequence
GTTTGTCTTCAATCTCGATAATCCAAAAGCTCAATCATCTCTCTCTCTTTTTCCCAAAAAAAAAATCTCAATCTTCATCTTCATCCCCATTCTCTCCATTTCACACTGCG
ATCTATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTCCCCTAAATCCTTTGATTTTGGTTCTGATGATATTCTTTGCTCTTTTGAAGACTACGGTAAACAGGACCCTT
CAAACGGTAGCCTTAGCGATCCCGTTTCCATTACCAATCCTGGCAAGGATTTTCACAAGAGTAGAATGTCTACAGTATTCCCTGCTGCTGGCTATGCTCAAGCAGATGAT
ACCATTAGTCAAAATGTGATTTCCACGGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACT
CTATTGCTACAACCTTGATAAATCCGTTGGAGAAATGCGGTCTGAATTAGCTCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTTCAAGAGG
TCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCAAACCATTCA
CAGTCTAATGAGGAGAGGGCTTCTTCAGTTGCCTCTGATTCTAAAAAGAAGGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTACCACATCAGATCGT
TCCACAGCAGAATCCTATAACACCCCCTTCAGCAGCTTTGCCTCAGAATATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAGTTGCCTGGTCAACCACCCC
ATATCCAGCATGCTCAGAGCCAATACATCTCATCTGATTCCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAGCTAAGTCAAACTCCACCA
CAGCCATTCAATCAATATCAACAACAATGGGCGCAACCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCACCCCCTTCAGTCTA
CCCGTCTACTTATCCACCTAATCAACCAACTTCTATGCCTGAGACGCTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCAGGTTCAAGTCGTG
TGGATGCAGGGCCTTATGGGTACGCTCCTGGAAGTGGTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGAGGGATATATGCCT
CCTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGTGGAAGACCGCCACACCATCCACCTCAACAAGCACACTTCAATCAAAGCGGATATCCTCTAGC
CAATGCACCACATCAGGTTCCTCCTCAAGCTCCAGCGGGACCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTGGTTGGCATGGGTTTCAGGG
GTGACCATGTTGCAAGTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTCGACAGGTTGAGTTCTCCTTCAGGTCCAGGTCCACAAAGA
GCTTGGTGAAGAGTAATTTAATCAACCCCCTGTTTGCGGCCGATACTGGCCATGACCAGCCTCATACATTGCGTCTTTTTAATGCATTGAATAAAATACTGGTTTATGAT
TTAATTGTCCTCGGTATATATTTTGTTATGGTTTGTGAGATTTTAAAACGTCGGCT
Protein sequenceShow/hide protein sequence
MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMSTVFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELY
CYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSNHSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVP
QQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP
STYPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQAHFNQSGYPLAN
APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW