; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy7G004230 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy7G004230
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptiontranscription factor SPT20 homolog isoform X1
Genome locationGy14Chr7:3142380..3146278
RNA-Seq ExpressionCsGy7G004230
SyntenyCsGy7G004230
Gene Ontology termsNA
InterPro domainsIPR010820 - UBA-like domain DUF1421


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031573.1 arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa]0.097.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFPA+GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

XP_004136824.1 trithorax group protein osa [Cucumis sativus]0.0100Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]0.098.06Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSV NPGKDFHK RMSTVFPA+GYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

XP_022952329.1 class E vacuolar protein-sorting machinery protein hse1-like [Cucurbita moschata]1.55e-30286.39Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFP + YGQ DD+I+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQQQQSYYIS SQLPGQ P HIQHAQ+QYI SDSQHRASQPQDVSQM+NPQLSQTP QPFNQY
Subjt:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
        QQQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
        GEGYMPPGQQ    SGGAYMMYDRESGRPPHH       P QQ+HF+QSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR

Query:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW
        MEDSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]0.090.86Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQA--DDTISQNVISTVENSMKKHSDNLLRFLE
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVS+ N  KDFHK RMSTVFPA+ YGQA  DD+ISQNVISTVENSMKKHSDNLLRFLE
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQA--DDTISQNVISTVENSMKKHSDNLLRFLE

Query:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKK
        GISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++HSQSNEERASSVASDPKK
Subjt:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKK

Query:  KENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
         EN SEIHNQQLALALPHQIVPQQN IT PSAALPQNMPQQQQSYYIS SQLPGQPPH+QHAQ QYI  DS +RASQPQDVSQMSNPQLSQTPPQPFNQY
Subjt:  KENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
        QQ WAQPPSQQPQPPQQPSMQ QIRPPPPSVYPSTYPP NQPTSMPETL SSMPM MSFPSIPQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQTH-------FNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS
        GEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQ H       FNQSGYP AN  HQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS
Subjt:  GEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQTH-------FNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS

Query:  GQPVDFNAVLDRLSSPSGPGPQRAW
        GQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  GQPVDFNAVLDRLSSPSGPGPQRAW

TrEMBL top hitse value%identityAlignment
A0A0A0K720 DUF1421 domain-containing protein0.0100Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like0.098.06Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSV NPGKDFHK RMSTVFPA+GYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like0.097.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFPA+GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPP NQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSPSGPGPQRAW
        DRLSSPSGPGPQRAW
Subjt:  DRLSSPSGPGPQRAW

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like7.52e-30386.39Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFP + YGQ DD+I+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQQQQSYYIS SQLPGQ P HIQHAQ+QYI SDSQHRASQPQDVSQM+NPQLSQTP QPFNQY
Subjt:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
        QQQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
        GEGYMPPGQQ    SGGAYMMYDRESGRPPHH       P QQ+HF+QSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR

Query:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW
        MEDSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  MEDSGQPVDFNAVLDRLSSPSGPGPQRAW

A0A6J1HZW1 ataxin-2 homolog3.84e-30085.98Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFP + YGQ DD+I+Q+VI+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ
        N SEIHNQQLALALPHQIVPQQNP+TPPSAALPQN+PQQ QSYYIS SQLPGQ P HIQHAQ+QYI SDS HRASQPQDVSQM+NPQLSQTP QPFNQYQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPP-HIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ

Query:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTG
        QQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPP NQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP TG
Subjt:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTG

Query:  EGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM
        EGYMPPGQQ    SGGAYMMYDRESGRPPHH       P QQ+HFNQSGYP ANAP QVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM
Subjt:  EGYMPPGQQ----SGGAYMMYDRESGRPPHH-------PPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM

Query:  EDSGQPVDFNAVLDRLSSPSGPGPQRAW
        EDSGQ VDFNAVLDRLS+P+GPGPQRAW
Subjt:  EDSGQPVDFNAVLDRLSSPSGPGPQRAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)3.4e-2331.08Show/hide
Query:  NGSLSD--PVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEA
        N S SD  PVS  +P  +F  G + ++ P+         +    I +  ++ +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H   
Subjt:  NGSLSD--PVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEA

Query:  DSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAAL
        D K++ L+  + EV   VQ+++DKQE+ E Q  L+K QVS +   +  HS   +  A S A  P ++   +       + A P Q         PPS+ L
Subjt:  DSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAAL

Query:  PQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPST
        P  +P Q      S  Q P  PP          PS  Q   S P        PQ +QTP QP   YQ      P QQPQ PQQ       PPP S Y   
Subjt:  PQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPST

Query:  YPPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP
          PP Q  S P   P   P   S PS      PQP  S  D        G+GG +    P            GY+       G+ M     S +PPH   
Subjt:  YPPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP

Query:  QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGP
          T + Q  +  PL +A   V   +  G   S R+ S +    +I+++  MGF  D V + ++++ ++GQ VD N VLD+L +  G  P
Subjt:  QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGP

AT4G28300.1 Protein of unknown function (DUF1421)7.5e-10849.53Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSVNNPGKDFHKGRM--STVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFL
        MASGS+GR NS  K FDFGSDDILCS++DY  QD SNG  SDP ++ +N  K+FHK RM  S+VFP S Y   +D++SQ++  TVE +MK ++DN++RFL
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSVNNPGKDFHKGRM--STVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFL

Query:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPK
        EG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SS++HSQ  E+R ++   +PK
Subjt:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPK

Query:  KKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQT
        K EN+S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q+  +    S    +   P   SQ        S+P  +QT
Subjt:  KKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQT

Query:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQ
          Q F QYQQ W         PPQ     Q RP     YP  S  PP NQP    E+LPSSM MQ  +   PQ          YGY A     AP  P Q
Subjt:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQ

Query:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV
         K +Y P TG+GY+P G      Y     E GR  + PP      QQ H+ Q   G   +  PHQ        P V      +  LIEKLV MGFRGDHV
Subjt:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV

Query:  ASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW
         ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  ASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421)8.7e-8847.89Show/hide
Query:  STVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE
        S+VFP S Y   +D++SQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQE
Subjt:  STVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ
        LA+TQK+LAKLQ+ QKE SS++HSQ  E+R ++   +PKK EN+S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q
Subjt:  LAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ

Query:  HAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSS
        +  +    S    +   P   SQ        S+P  +QT  Q F QYQQ W         PPQ     Q RP     YP  S  PP NQP    E+LPSS
Subjt:  HAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSS

Query:  MPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLAN
        M MQ  +   PQ          YGY A     AP  P Q K +Y P TG+GY+P G      Y     E GR  + PP      QQ H+ Q   G   + 
Subjt:  MPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLAN

Query:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW
         PHQ        P V      +  LIEKLV MGFRGDHV ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW

AT5G14540.1 Protein of unknown function (DUF1421)9.4e-2631.22Show/hide
Query:  SDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL
        SDP  V+      + G M ++ P+  + + D    ++ +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ L
Subjt:  SDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL

Query:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQ
        E  + EV   VQ+++DKQE+ E Q  L+KLQ+S+       HS   E  A   AS P+   +++         +L  Q +P Q  I PP++         
Subjt:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQ

Query:  QQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-STYP
        Q        QLP  P      Q  Y P   Q   SQP    Q         PP P     Q   QPP QQPQ PQQP  Q   P    P    YP  +YP
Subjt:  QQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-STYP

Query:  --PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQQ
          PP QP S P   P S P Q  + + P P S     G    +    G +P+  P      GPP+  G  P   P  QSG         SG  P  P  +
Subjt:  --PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQQ

Query:  THFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL
              G P+A+A         +    S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L
Subjt:  THFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTCCCCTAAATCCTTTGATTTTGGTTCTGATGATATCCTTTGCTCTTTTGAAGACTACGGTAAACAGGACCCTTCAAA
CGGTAGCCTCAGCGATCCCGTTTCCGTTAACAATCCTGGCAAGGATTTTCACAAGGGTAGAATGTCTACAGTATTCCCTGCTTCTGGCTATGGTCAAGCAGATGATACCA
TTAGTCAAAATGTGATTTCCACGGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGCTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTCTAT
TGCTACAACCTTGATAAATCCGTTGGAGAAATGCGGTCTGAATTAGCTCGTGACCATGAAGAAGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTTCAAGAGGTCCA
CAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCAACAAACCATTCACAGT
CTAATGAGGAGAGGGCTTCTTCAGTTGCCTCGGATCCTAAAAAGAAGGAAAATTCATCTGAGATTCACAACCAGCAATTAGCTTTGGCTTTGCCACATCAGATCGTCCCA
CAGCAAAATCCTATAACACCACCTTCAGCAGCTTTGCCTCAGAACATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAATTGCCCGGTCAACCACCCCATAT
CCAGCATGCTCAGAGCCAATATATCCCATCTGATTCCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCACAGCTAAGTCAAACTCCACCACAGC
CATTCAACCAATATCAACAACAATGGGCGCAACCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCTCCCCCTTCAGTGTACCCT
TCTACTTATCCACCACCAAATCAACCAACTTCTATGCCTGAGACACTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCTGGTTCAAGCCGTGT
GGATGCAGGGCCTTATGGGTATGCTGCCGGAAGTGGTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGAGGGATATATGCCTC
CTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATCCGCCTCAACAAACGCACTTCAATCAAAGTGGATATCCTCTGGCC
AATGCACCTCATCAGGTTCCTCCTCAAGCACCAGCAGGCCCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTAGTTGGCATGGGTTTCAGGGG
TGACCATGTTGCAAGTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCTGTTCTTGACAGGTTGAGTTCTCCTTCAGGTCCAGGTCCACAGAGAG
CTTGGTGA
mRNA sequenceShow/hide mRNA sequence
GTATATATATATAAATTTCTGATAACAATTTTTTTCTTATGGAGAAAATCCGATAACAAATTTTTAATTAATTTCCTTTTCCTAATGGAATAATATTAAACACATTATAT
GATATAATTTGTTAATTAAAAACCTTACACAAGTCTATATAAAAACAGTCTTGGAATTGAAGGCATCCGAATACACGTTTTAGTCTTCAATCTCGATAATCGAAAAGCTC
AATCATCTATTTTTTTCCCCAAAAAATCTCAATCGTCATCTTCATCCCCATTCTCTCCGTTACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTC
CCCTAAATCCTTTGATTTTGGTTCTGATGATATCCTTTGCTCTTTTGAAGACTACGGTAAACAGGACCCTTCAAACGGTAGCCTCAGCGATCCCGTTTCCGTTAACAATC
CTGGCAAGGATTTTCACAAGGGTAGAATGTCTACAGTATTCCCTGCTTCTGGCTATGGTCAAGCAGATGATACCATTAGTCAAAATGTGATTTCCACGGTTGAGAACAGC
ATGAAAAAGCATTCTGATAACCTTTTGCGCTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTCTATTGCTACAACCTTGATAAATCCGTTGGAGAAATGCG
GTCTGAATTAGCTCGTGACCATGAAGAAGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTTCAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAAC
TCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCAACAAACCATTCACAGTCTAATGAGGAGAGGGCTTCTTCAGTTGCCTCGGAT
CCTAAAAAGAAGGAAAATTCATCTGAGATTCACAACCAGCAATTAGCTTTGGCTTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACACCACCTTCAGCAGCTTT
GCCTCAGAACATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAATTGCCCGGTCAACCACCCCATATCCAGCATGCTCAGAGCCAATATATCCCATCTGATT
CCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCACAGCTAAGTCAAACTCCACCACAGCCATTCAACCAATATCAACAACAATGGGCGCAACCA
CCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCTCCCCCTTCAGTGTACCCTTCTACTTATCCACCACCAAATCAACCAACTTCTAT
GCCTGAGACACTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCTGGTTCAAGCCGTGTGGATGCAGGGCCTTATGGGTATGCTGCCGGAAGTG
GTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTAT
GATAGGGAAAGCGGAAGACCGCCACACCATCCGCCTCAACAAACGCACTTCAATCAAAGTGGATATCCTCTGGCCAATGCACCTCATCAGGTTCCTCCTCAAGCACCAGC
AGGCCCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTAGTTGGCATGGGTTTCAGGGGTGACCATGTTGCAAGTATAATCCAGAGAATGGAAG
ACAGTGGCCAACCTGTTGACTTCAACGCTGTTCTTGACAGGTTGAGTTCTCCTTCAGGTCCAGGTCCACAGAGAGCTTGGTGAAGAGTAATTTAATCATCCCGTCTGCGG
CCGATTGGCCATGACCAGCCTCATACATTGCGTCTTTTTAATGCATTGAATAAAATACTGGTTTATGATTTAATTGTCCTCCGTATATATTTTGTTATGGTTTGTGAGAT
TTTAAAACGTCGGCTGTATGATTTAAACTTCGTGTGAATATCATCTTCTACTTCCAAATTCCAATCTTTCTATATTCTCATTTATGTTTATCTGCACCTTTTGAATGATT
AATTGTGTTTGTAATTCGTTATCTGAATCTCTAGTTTTAGTGGAGAAGATGGAGGCTTTTGACTCCCCAAGCGATTATGTTAGGGTGTGTGATGATATGCTGCATATTGT
CTTACTCGAGCGACATCTAGTCATAGAAATAAAATAACATTGTGTACGCTTTTTGGGTGTTTTATTATTTTTAAGCTTGGAAGAGGGTGGTTACGGACGAGAAGTTTTTC
TGGATCTAGTCTTAATAATTTTGATACCCAAATCCCTTTAGAATCTCGTGCCGTTCTCCCATAGACGAGGAATAAGAAAGAATGCGTTGTTGCCGACACTTTATCTTGGG
ATGTCCTTTGCTTTGAAAATGTGCTGCCTACAATTCAATTTACATTAACTTACAGACAAACAAGTTGTTGAGCAGTACCTTTCTTATAGGTCATAGGCCCCATATATATG
AGTTGGATAATTTCATTGCCTTATTTTCCG
Protein sequenceShow/hide protein sequence
MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELY
CYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVP
QQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP
STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLA
NAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSPSGPGPQRAW