; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005305 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005305
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiontranscription factor SPT20 homolog isoform X1
Genome locationChr07:1439943..1442570
RNA-Seq ExpressionHG10005305
SyntenyHG10005305
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031573.1 arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa]2.6e-21890.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMSTVFPAA Y QADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEER SSVASD KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEG
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        YMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

XP_004136824.1 trithorax group protein osa [Cucumis sativus]9.4e-21689.83Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+ N  KDFHK RMSTVFPA+ YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++HSQSNEER SSVASDPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYI  DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS Y PPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYG  TGE
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE

Query:  GYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        GYMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  GYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]6.9e-21990.89Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMSTVFPAA YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEER SSVASD KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEG
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        YMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

XP_023554446.1 trithorax group protein osa-like [Cucurbita pepo subsp. pepo]1.0e-21488.2Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMSTVFP AAYGQ DDS++Q++I+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEA+SKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSA LPQN+PQQQQSYYIS+SQLPG QP HIQHAQ QYIS DSQHRASQPQDVS M+NPQLSQT PQPFNQYQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ

Query:  QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE
        QQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPYPPNQP+SM ETLSSSMPMQMSF  IPQPGSSR DA PYGYAA+SGGSAPQQPPQVKNAYG ATGE
Subjt:  QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE

Query:  GYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        GYMPPGQQ    SGGAYMMYDRESGRPPHH PQQPHHPSQQ HFNQSGYPPANAPHQVPPQAP+GP
Subjt:  GYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]9.6e-22993.74Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYG--QADDSLSQNLISTVENSMKKHSDNLLRFLE
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGSHTDPVS+TNS+KDFHKSRMSTVFPAAAYG  QADDS+SQN+ISTVENSMKKHSDNLLRFLE
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYG--QADDSLSQNLISTVENSMKKHSDNLLRFLE

Query:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKK
        GISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEER SSVASDPKK
Subjt:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKK

Query:  NENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY
        NENPSEIHNQQLALALPHQIVPQQN IT PSA LPQNMPQQQQSYYIS+SQLPGQPPH+QHAQGQYISPDS +RASQPQDVSQMSNPQLSQT PQPFNQY
Subjt:  NENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY

Query:  QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATG
         QQWAQPPSQQ QPPQQPSMQPQIRPPP SVYPS YPPNQP+SM ETLSSSMPM MSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYG ATG
Subjt:  QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATG

Query:  EGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        EGYMPPGQQSGGAYMMYDRESGRPPHH PQQPHHPSQQPHFNQSGYPPAN  HQVPPQAP+GP
Subjt:  EGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

TrEMBL top hitse value%identityAlignment
A0A0A0K720 DUF1421 domain-containing protein4.5e-21689.83Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+ N  KDFHK RMSTVFPA+ YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++HSQSNEER SSVASDPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYI  DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS Y PPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYG  TGE
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE

Query:  GYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        GYMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  GYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like3.4e-21990.89Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMSTVFPAA YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEER SSVASD KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEG
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        YMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like1.3e-21890.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMSTVFPAA Y QADD++SQN+ISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEER SSVASD KK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ
        NPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ

Query:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG
        QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEG
Subjt:  QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        YMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like3.6e-21388.25Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMSTVFP AAYGQ DDS++Q++I+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPIT-PPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY
        NPSEIHNQQLALALPHQIVPQQNPIT PPSA LPQN+PQQQQSYYIS+SQLPG QP HIQHAQ QYIS DSQHRASQPQDVSQM+NPQLSQT PQPFNQY
Subjt:  NPSEIHNQQLALALPHQIVPQQNPIT-PPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY

Query:  QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSAT
        QQQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPY PPNQP+SM ETLSSSMPMQMSF SIPQPGSSR DA PYGYAAASGGSAPQQPPQVKNAYG AT
Subjt:  QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSAT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        GEGYMPPGQQ    SGGAYMMYDRESGRPPHH PQQPHHPSQQ HF+QSGYPPANAPHQVPPQAP+GP
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

A0A6J1HZW1 ataxin-2 homolog1.9e-21488.2Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMSTVFP AAYGQ DDS++Q++I+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNE

Query:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ
        NPSEIHNQQLALALPHQIVPQQNP+TPPSA LPQN+PQQ QSYYIS+SQLPG QP HIQHAQ QYIS DS HRASQPQDVSQM+NPQLSQT PQPFNQYQ
Subjt:  NPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ

Query:  QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE
        QQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPYPPNQP+SM ETLSSSMPMQMSF SIPQPGSSR DA PYGYAAASGGSAPQQPPQVKNAYG ATGE
Subjt:  QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGE

Query:  GYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP
        GYMPPGQQ    SGGAYMMYDRESGRPPHH PQQPHHPSQQ HFNQSGYPPANAP QVPPQAP+GP
Subjt:  GYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)2.1e-1129.46Show/hide
Query:  NGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADS
        + S   PVS T+ + +F    + ++ P+        ++    I +  ++ +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H   D 
Subjt:  NGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLIST--VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADS

Query:  KLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQK---------EPSSSSHSQSNEER--TSSVASDPKKNENPSEIHNQQLALALPHQIVPQQN
        K++ L+  + EV   VQ+++DKQE+ E Q  L+K QVS +         +P++ S +    ++   +S    P     PS+  + QL   LP Q   QQ 
Subjt:  KLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQK---------EPSSSSHSQSNEER--TSSVASDPKKNENPSEIHNQQLALALPHQIVPQQN

Query:  PITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNP------QLSQTQPQPFNQYQQQWAQPPSQQQQPPQ-QP
        P  PP +  PQ  P     Y    +Q P QP         Y SP  Q +  Q    S   NP      Q+    P P  Q     + P  Q   PPQ QP
Subjt:  PITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNP------QLSQTQPQPFNQYQQQWAQPPSQQQQPPQ-QP

Query:  SMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSF--PSIPQPGSSR--MDAGPYGYAAASGGSA
        SM        +S +PS Y     +     +SS+ P  +S      PQ  +SR    A P   A +SGG +
Subjt:  SMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSF--PSIPQPGSSR--MDAGPYGYAAASGGSA

AT4G28300.1 Protein of unknown function (DUF1421)6.5e-9050.96Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDP-VSLTNSSKDFHKSRM--STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFL
        MASGS+GR NS  K FDFGSDDILCS++DY  QD+SNG H+DP ++ +NS+K+FHK+RM  S+VFP ++Y   +DSLSQ++  TVE +MK ++DN++RFL
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDP-VSLTNSSKDFHKSRM--STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFL

Query:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPK
        EG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SSSSHSQ  E+R ++   +PK
Subjt:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPK

Query:  KNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQP
        K+EN S+ HNQQLALALPHQI PQ         V PQ  PQQ Q Y      +P  P  +Q+  A     +P SQ +A   Q       P     S  Q 
Subjt:  KNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQP

Query:  QPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQV
        Q F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L SSM MQ  +   PQ          YGY AA    AP  P Q 
Subjt:  QPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQV

Query:  KNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQ
        K +Y   TG+GY+P G      Y     E GR   + P QP    QQ H+ Q     GY P   PHQ
Subjt:  KNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQ

AT4G28300.2 Protein of unknown function (DUF1421)5.3e-6849.01Show/hide
Query:  STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE
        S+VFP ++Y   +DSLSQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQE
Subjt:  STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQ
        LA+TQK+LAKLQ+ QKE SSSSHSQ  E+R ++   +PKK+EN S+ HNQQLALALPHQI PQ         V PQ  PQQ Q Y      +P  P  +Q
Subjt:  LAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQ

Query:  H--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQPQPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSM
        +  A     +P SQ +A   Q       P     S  Q Q F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L SSM
Subjt:  H--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQPQPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSM

Query:  PMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPP
         MQ  +   PQ          YGY AA    AP  P Q K +Y   TG+GY+P G      Y     E GR   + P QP    QQ H+ Q     GY P
Subjt:  PMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPP

Query:  ANAPHQ
           PHQ
Subjt:  ANAPHQ

AT5G14540.1 Protein of unknown function (DUF1421)1.7e-1328.41Show/hide
Query:  TDPVSLTNSSKDFHKSRMSTVFPAAAYGQAD-DSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL
        +DP  ++ SS   + S M ++ P+  + + D +S    +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ L
Subjt:  TDPVSLTNSSKDFHKSRMSTVFPAAAYGQAD-DSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL

Query:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQ
        E  + EV   VQ+++DKQE+ E Q  L+KLQ+S+      +HS   E      AS P+                      P  +   PPS +  Q +P Q
Subjt:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQ

Query:  QQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQL---SQTQPQPFNQYQQQWAQPPSQQQQPP-QQPSMQPQI--RPPPSSVYPSP
        Q   +I       QPP  QH     +SP S      P   S    P      Q+QP P  Q   Q   P     QPP Q P  QPQ   +PPP   +PS 
Subjt:  QQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQL---SQTQPQPFNQYQQQWAQPPSQQQQPP-QQPSMQPQI--RPPPSSVYPSP

Query:  YPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHP
        Y P +P    ++   + P Q   PS P PGS+          +    +AP  PP + +  G  +  G+  P   S  +Y      +G P  +       P
Subjt:  YPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHP

Query:  SQQPHFNQSGYPPANAPHQVPPQAPSGPMFQPG
        + Q       YP       +P   P       G
Subjt:  SQQPHFNQSGYPPANAPHQVPPQAPSGPMFQPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACGCTTCAAA
CGGTAGCCATACTGATCCCGTTTCCCTTACCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCC
TTAGTCAAAATTTGATTTCCACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTCCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATAT
TGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCA
CAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGT
CAAATGAGGAGAGGACTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCA
CAGCAAAATCCTATAACTCCCCCTTCAGCAGTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATAT
CCAGCATGCTCAGGGCCAATATATCTCACCTGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCAACCACAAC
CATTCAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCAGCAACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTAC
CCTTCTCCTTATCCACCAAATCAACCGTCTTCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCAT
GGATGCAGGGCCTTATGGGTATGCTGCTGCAAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTC
CTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAAT
CAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACGCTTCAAA
CGGTAGCCATACTGATCCCGTTTCCCTTACCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCC
TTAGTCAAAATTTGATTTCCACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTCCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATAT
TGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCA
CAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGT
CAAATGAGGAGAGGACTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCA
CAGCAAAATCCTATAACTCCCCCTTCAGCAGTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATAT
CCAGCATGCTCAGGGCCAATATATCTCACCTGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCAACCACAAC
CATTCAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCAGCAACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTAC
CCTTCTCCTTATCCACCAAATCAACCGTCTTCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCAT
GGATGCAGGGCCTTATGGGTATGCTGCTGCAAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTC
CTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAAT
CAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAA
Protein sequenceShow/hide protein sequence
MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELY
CYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVP
QQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVY
PSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFN
QSGYPPANAPHQVPPQAPSGPMFQPGIQAIHI