; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001596 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001596
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCyclin-dependent kinase 12 isoform X2
Genome locationChr09:18516800..18521337
RNA-Seq ExpressionHG10001596
SyntenyHG10001596
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035326.1 hypothetical protein SDJN02_02121 [Cucurbita argyrosperma subsp. argyrosperma]9.3e-19879.88Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRR QG  RPSGFEEVARNSRR T  VSNEDDD++DLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSA+RDSS+SSAYAARSASRV KPT SS+SPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA
        PAS QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN V QPEAPGRMREQG G G      GTG ATRVK ++RS+SDESQ KTENSSFQKGLGA
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA

Query:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLREL-----
        I SSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSG  M SQT PQ QTQMQTG+ELQLKASRDV   ++     ++  L     
Subjt:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLREL-----

Query:  ------KTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDL
                ++A F +       +   N +L+   E          VAMAMAAKAKLLLRELKTVKAD AFAKERCAQLEEE+KILRENRERGNNLEDDDL
Subjt:  ------KTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDL

Query:  IRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPT
        IRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPL  QP+SSSNR GVGLDMSSQMTHIVPT
Subjt:  IRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPT

Query:  CPLPPSDLRGIS
        CPLPPSDLRGIS
Subjt:  CPLPPSDLRGIS

XP_008456259.1 PREDICTED: uncharacterized protein LOC103496255 [Cucumis melo]5.3e-20983.93Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATR S FE VARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG
        P+SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPEAPGRMREQGFG G GAGA   AG G+ATR KYHNRSQSDESQPKTENSSFQKG
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG

Query:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT
        LGA+ASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSG+TMESQT P TQTQMQTGLELQLKASRD                  
Subjt:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT

Query:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
                                             VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
Subjt:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL

Query:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP
        LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK P AASNLNA+S LPPQPL  SSSNRPGVGLDMSSQMT I+PTCPLPP
Subjt:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP

Query:  SDLR
        SDLR
Subjt:  SDLR

XP_011656966.1 uncharacterized protein LOC101205186 [Cucumis sativus]5.5e-20683.53Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATR S FEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKP FSSS PS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAG--AGAGA-AGTGTATRVKYHNRSQSDESQPKTENSSFQKG
        P+SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPE PGRMREQGFG G  AG GA AG G+ATR KYHNR+QSDESQPKTENSSFQKG
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAG--AGAGA-AGTGTATRVKYHNRSQSDESQPKTENSSFQKG

Query:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT
        LGAIASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSGS MESQT P TQTQMQTGLELQLKASRD                  
Subjt:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT

Query:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
                                             VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
Subjt:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL

Query:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK-LPAASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP
        LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK   AASNLNA+S LPPQPL  SSSNRPGVGLDMSSQMT IVPTCPLPP
Subjt:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK-LPAASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP

Query:  SDLR
        SDLR
Subjt:  SDLR

XP_023533704.1 uncharacterized protein LOC111795485 [Cucurbita pepo subsp. pepo]5.5e-19880.44Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRR QG  RPSGFEEVARNSRR T  VSNEDDD++DLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSA+RDSS+SSAYAARSASRV KPT SS+SPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA
        PAS QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPEAPGRMREQG G G      GTGTATRVK ++RS+SDESQ KTENSSFQKGLGA
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA

Query:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKA
        I SSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSG  M SQT PQ QTQMQT +ELQLKASRD                     
Subjt:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKA

Query:  DFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAE
                                          VAMAMAAKAKLLLRELKTVKAD AFAKERCAQLEEE+KILRENRERGNNLEDDDLIRLQLETLLAE
Subjt:  DFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAE

Query:  KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLRGI
        KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPL  QP+SSSNR GVGLDMSSQMTHIVPTCPLPPSDLRGI
Subjt:  KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLRGI

Query:  S
        S
Subjt:  S

XP_038901686.1 uncharacterized protein LOC120088449 [Benincasa hispida]1.2e-20883.33Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATRPSGFEEVARNSRRP RFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPT+SSSSPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA
        P+SVQDSKA+EYTSMKSLNESKHGFWGALARKAKAILDDDNV++QPEAPGRMREQGFG        GTG+ATR KY+NRSQSDESQPK ENSSFQKGLGA
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA

Query:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLP--QTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTV
        IASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSGSTM+SQT P  QTQTQ QTGLELQLKASRD                   
Subjt:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLP--QTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTV

Query:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL
                                            VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL
Subjt:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL

Query:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPL-SSSNRPGVGLDMSSQMTHIVPTCPLPPSDL
        AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKL AASNLNAVSPLPPQPL SSSNRPG+GLDMSSQ+T IVPTCPLPPSDL
Subjt:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPL-SSSNRPGVGLDMSSQMTHIVPTCPLPPSDL

Query:  RGIS
        RGIS
Subjt:  RGIS

TrEMBL top hitse value%identityAlignment
A0A0A0KD42 Uncharacterized protein2.6e-20683.53Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATR S FEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKP FSSS PS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAG--AGAGA-AGTGTATRVKYHNRSQSDESQPKTENSSFQKG
        P+SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPE PGRMREQGFG G  AG GA AG G+ATR KYHNR+QSDESQPKTENSSFQKG
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAG--AGAGA-AGTGTATRVKYHNRSQSDESQPKTENSSFQKG

Query:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT
        LGAIASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSGS MESQT P TQTQMQTGLELQLKASRD                  
Subjt:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT

Query:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
                                             VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
Subjt:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL

Query:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK-LPAASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP
        LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK   AASNLNA+S LPPQPL  SSSNRPGVGLDMSSQMT IVPTCPLPP
Subjt:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK-LPAASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP

Query:  SDLR
        SDLR
Subjt:  SDLR

A0A1S3C2W6 uncharacterized protein LOC1034962552.6e-20983.93Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATR S FE VARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG
        P+SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPEAPGRMREQGFG G GAGA   AG G+ATR KYHNRSQSDESQPKTENSSFQKG
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG

Query:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT
        LGA+ASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSG+TMESQT P TQTQMQTGLELQLKASRD                  
Subjt:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT

Query:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
                                             VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
Subjt:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL

Query:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP
        LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK P AASNLNA+S LPPQPL  SSSNRPGVGLDMSSQMT I+PTCPLPP
Subjt:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP

Query:  SDLR
        SDLR
Subjt:  SDLR

A0A5D3B6N0 Cyclin-dependent kinase 12 isoform X22.6e-20983.93Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRRQQGATR S FE VARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG
        P+SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPEAPGRMREQGFG G GAGA   AG G+ATR KYHNRSQSDESQPKTENSSFQKG
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGA---AGTGTATRVKYHNRSQSDESQPKTENSSFQKG

Query:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT
        LGA+ASSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQYQTPNSG+TMESQT P TQTQMQTGLELQLKASRD                  
Subjt:  LGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKT

Query:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
                                             VAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL
Subjt:  VKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETL

Query:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP
        LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIK P AASNLNA+S LPPQPL  SSSNRPGVGLDMSSQMT I+PTCPLPP
Subjt:  LAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLP-AASNLNAVSPLPPQPL--SSSNRPGVGLDMSSQMTHIVPTCPLPP

Query:  SDLR
        SDLR
Subjt:  SDLR

A0A6J1D7J9 uncharacterized protein LOC1110180925.0e-19780.32Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        ME+ RRQQGATRPS FEEVARN+RR T  VSNEDDDVYDLSSDSGNDAVGSSRSSSSP+SLAAKAI+ASSAHRDSSLSSAYAARSASRVSKPTFSSSS S
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAP--GRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGL
         +SVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAP  GRMREQG          G+GTATRVKY N SQS++ Q KTENSSFQKGL
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAP--GRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGL

Query:  GAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTV
        GAI SSLNYIGNAFEE LTAVENKTVDIIQETRKHIKKKSGGSVAQYQT N G TM+SQT PQ QTQMQTG+ELQLKASRD                   
Subjt:  GAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTV

Query:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL
                                            VAMAMAAKAKLLLRELKTVKAD AFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL
Subjt:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL

Query:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLR
        AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNA SPLPPQP SSSNRPGVGLDMSSQMT IVPTCPLPPSDLR
Subjt:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLR

Query:  GIS
        GIS
Subjt:  GIS

A0A6J1KXX8 uncharacterized protein LOC1114997455.5e-19679.64Show/hide
Query:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS
        MEYKRR QG  RPSGFEEVARNSRR T  VSNEDDD++DLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSA+RDSS+SSAYAARSASRV KPT SS+SPS
Subjt:  MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPS

Query:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA
        PAS QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDN VQQPEAPGRMREQG          GTGTATRVK ++RS+SDESQ KTENSSFQKGLGA
Subjt:  PASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGA

Query:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKA
        I SSLNYIGNAFEEKLTAVENKT DIIQETRKHIKKKSGGSVAQY TPNSG  M SQT PQ QTQMQTG+ELQLKASRD                     
Subjt:  IASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKA

Query:  DFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAE
                                          VAMAMAAKAKLLLRELKTVKAD AFAKERCAQLEEE+KILRENRERGNNLEDDDLIRLQLETLLAE
Subjt:  DFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAE

Query:  KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLRGI
        KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPL  QP+SSSNR GVGLDMSSQMT I+PTCPLPPSDLRGI
Subjt:  KARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLRGI

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30050.1 unknown protein1.9e-5538.69Show/hide
Query:  RRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPSPASV
        RR+QG TR S F+E          +    D D  DL   S   +  SS+S SS  SLAA+AI+AS                                   
Subjt:  RRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPSPASV

Query:  QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASS
          S+A  +T+ +  +ES+ GFWG LA+KAK+IL+D+   QQ                                 + Q ++   +  N + +K +  I +S
Subjt:  QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASS

Query:  LNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKADFAF
        LN+IG++FE+     E +T+   Q  RK        +   +Q+  S S  +  T P  +       E QLKASRDVAMA AAKAKLLLRELKTVKAD AF
Subjt:  LNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKADFAF

Query:  AKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARL
        AKERC+QLEEENK LR+NR++GN                                                      NN  DDDLIRLQLETLLAEKARL
Subjt:  AKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARL

Query:  AHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYP-------IKLPAASNLNAVSPLPPQPLSSS
        AHENS+YARENRFLREIVEYHQLTMQDV+Y+DEG EEV EV P       +   +AS L ++SP P  P S S
Subjt:  AHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYP-------IKLPAASNLNAVSPLPPQPLSSS

AT2G30530.1 unknown protein2.9e-8050Show/hide
Query:  LSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPSPASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDD
        +S  S +    SS SSS+  SLAAKAI+ASSAHRDSSLSSAY++ S++ V  P        P  V  +KAYEYTSMKSLNE K GFWG+LA KAKA LD+
Subjt:  LSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPSPASVQDSKAYEYTSMKSLNESKHGFWGALARKAKAILDD

Query:  DNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASSLNYIGNAF----EEKLTAVENKTVDIIQETRKHIK
        D+  Q P++P RM EQ   +   +G    G   R              K+EN S Q+ L AI SSLNYIG       EE +TAVEN+T  IIQETRK IK
Subjt:  DNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASSLNYIGNAF----EEKLTAVENKTVDIIQETRKHIK

Query:  KKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLV
        KK                  S T  Q   ++Q  LE+QLKASRDVAMAMAAKAKLLLRELK VK+D AFAK+RCAQLEEENK+LRENR  G++  D    
Subjt:  KKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLV

Query:  AMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYL
                                                          DDDL+RLQLETLLAEKARLAHENS+Y REN +LR +VEYHQLTMQDV+Y 
Subjt:  AMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYL

Query:  DEGTEEVTEVYPIKLP--AASNLNAVSPLP
        DE TEEVTEVYPI +   ++S+ N+ +P P
Subjt:  DEGTEEVTEVYPIKLP--AASNLNAVSPLP

AT4G02800.1 unknown protein2.6e-2033.21Show/hide
Query:  TRVKYHNRSQSDESQPKTENSSFQKGLGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLE
        TR+   + +  D S P    SS ++    + + ++ +     +    V N T+ I  E  K  K  S   +  + + +   ++ S  L      +Q   E
Subjt:  TRVKYHNRSQSDESQPKTENSSFQKGLGAIASSLNYIGNAFEEKLTAVENKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLE

Query:  LQLKASRDVAMAMAAKAKLLLRELKTVKADFAFAK------ERCAQLEEENKILRENRERGNN-LEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERC
        L    S    +    KA  + R+ K    +   +K      E   +  E++    E R +    ++    +A++MAAKA  L RELKT+K+D +F +ERC
Subjt:  LQLKASRDVAMAMAAKAKLLLRELKTVKADFAFAK------ERCAQLEEENKILRENRERGNN-LEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERC

Query:  AQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDV
          LEEENK LR+   +G   E+DDL+RLQLE LLAEKARLA+EN+   REN+ L ++VEYHQ+T QD+
Subjt:  AQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDV

AT5G01970.1 unknown protein2.4e-5844.7Show/hide
Query:  QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASS
        + S A++    K+      G WG +A+KAK++++DD                      +  + T + +R  Y     SDE   K +N   ++GL  + SS
Subjt:  QDSKAYEYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASS

Query:  LNYIGN----AFEEKLTAVENKTVDIIQETRK-HIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQM-QTGLELQLKASRDVAMAMAAKAKLLLRELKTV
        LN IG+    AFE+  T VENKT DIIQETRK   +++  G   + Q  + G +   +  P+   Q+     E QLKASRDVAMA AAKAKLLLRELKTV
Subjt:  LNYIGN----AFEEKLTAVENKTVDIIQETRK-HIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQM-QTGLELQLKASRDVAMAMAAKAKLLLRELKTV

Query:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL
        KAD AFAKERCAQLEEENK LRE+                                                      +RE+G+N  D+DLIRLQLE+LL
Subjt:  KADFAFAKERCAQLEEENKILRENRERGNNLEDDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLL

Query:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYP
        AEKARLAHENSVYARENRFLREIVEYHQLTMQDV+Y+DEG+EEVT+V P
Subjt:  AEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTEEVTEVYP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTACAAGAGGCGGCAGCAAGGGGCGACAAGACCTTCGGGTTTTGAAGAAGTAGCTCGGAATTCTCGGCGTCCTACTCGTTTTGTTTCTAATGAAGATGATGATGT
TTATGATCTTAGTAGCGATAGTGGCAATGATGCTGTGGGTAGTTCGCGGTCATCATCTTCCCCACAATCTCTTGCTGCTAAAGCCATAAAAGCCTCATCGGCACATAGGG
ATTCTTCACTCTCATCTGCTTATGCTGCGCGCTCTGCCTCTCGGGTCTCGAAACCCACTTTTTCTTCATCTTCTCCTTCCCCCGCATCCGTTCAGGATTCAAAGGCATAT
GAATATACATCAATGAAAAGTTTAAATGAATCAAAACATGGATTCTGGGGTGCGCTGGCCAGAAAGGCCAAAGCAATTCTTGATGATGATAATGTCGTCCAGCAACCTGA
AGCACCTGGAAGAATGAGGGAGCAAGGATTTGGTGCAGGTGCAGGTGCAGGTGCAGCAGGTACAGGTACAGCTACAAGGGTCAAGTATCATAACAGAAGTCAGTCAGATG
AAAGCCAACCGAAAACAGAAAATTCTTCTTTCCAGAAGGGATTGGGCGCCATAGCATCTTCCCTTAACTATATTGGCAATGCTTTTGAGGAAAAGCTTACGGCTGTGGAA
AACAAAACTGTTGATATCATTCAAGAAACTCGCAAGCATATTAAGAAAAAGTCTGGTGGTTCGGTAGCACAATATCAGACACCAAATTCTGGGTCTACAATGGAATCTCA
GACGCTGCCTCAAACACAAACCCAAATGCAAACTGGCCTTGAACTTCAGCTGAAGGCATCTCGCGACGTTGCAATGGCAATGGCTGCCAAAGCAAAGCTGCTTCTTAGGG
AGTTGAAGACTGTCAAAGCAGATTTTGCTTTTGCAAAGGAACGATGTGCTCAGCTGGAAGAAGAAAATAAAATTCTTCGGGAGAATCGTGAGAGGGGAAACAATCTAGAA
GATGATGATTTGGTTGCAATGGCAATGGCTGCCAAAGCAAAGCTGCTTCTTAGGGAGTTGAAGACTGTCAAAGCAGATTTTGCTTTTGCAAAGGAACGATGTGCTCAGCT
GGAAGAAGAAAATAAAATTCTTCGGGAGAATCGTGAGAGGGGAAACAATCTAGAAGATGATGATTTGATACGACTACAACTCGAAACACTTCTGGCAGAGAAGGCTCGTT
TGGCTCACGAAAATTCAGTCTATGCGCGCGAGAATCGGTTTCTAAGGGAGATTGTTGAATACCACCAGCTGACCATGCAGGATGTTATATACCTGGATGAGGGCACAGAA
GAAGTCACTGAAGTTTACCCCATTAAGCTTCCTGCAGCATCCAATCTCAATGCTGTTTCTCCGTTACCGCCTCAGCCATTGTCGTCTTCAAACCGACCCGGGGTAGGACT
TGACATGAGTTCACAAATGACTCACATTGTTCCTACTTGCCCTTTGCCCCCATCAGATCTGAGGGGAATCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTACAAGAGGCGGCAGCAAGGGGCGACAAGACCTTCGGGTTTTGAAGAAGTAGCTCGGAATTCTCGGCGTCCTACTCGTTTTGTTTCTAATGAAGATGATGATGT
TTATGATCTTAGTAGCGATAGTGGCAATGATGCTGTGGGTAGTTCGCGGTCATCATCTTCCCCACAATCTCTTGCTGCTAAAGCCATAAAAGCCTCATCGGCACATAGGG
ATTCTTCACTCTCATCTGCTTATGCTGCGCGCTCTGCCTCTCGGGTCTCGAAACCCACTTTTTCTTCATCTTCTCCTTCCCCCGCATCCGTTCAGGATTCAAAGGCATAT
GAATATACATCAATGAAAAGTTTAAATGAATCAAAACATGGATTCTGGGGTGCGCTGGCCAGAAAGGCCAAAGCAATTCTTGATGATGATAATGTCGTCCAGCAACCTGA
AGCACCTGGAAGAATGAGGGAGCAAGGATTTGGTGCAGGTGCAGGTGCAGGTGCAGCAGGTACAGGTACAGCTACAAGGGTCAAGTATCATAACAGAAGTCAGTCAGATG
AAAGCCAACCGAAAACAGAAAATTCTTCTTTCCAGAAGGGATTGGGCGCCATAGCATCTTCCCTTAACTATATTGGCAATGCTTTTGAGGAAAAGCTTACGGCTGTGGAA
AACAAAACTGTTGATATCATTCAAGAAACTCGCAAGCATATTAAGAAAAAGTCTGGTGGTTCGGTAGCACAATATCAGACACCAAATTCTGGGTCTACAATGGAATCTCA
GACGCTGCCTCAAACACAAACCCAAATGCAAACTGGCCTTGAACTTCAGCTGAAGGCATCTCGCGACGTTGCAATGGCAATGGCTGCCAAAGCAAAGCTGCTTCTTAGGG
AGTTGAAGACTGTCAAAGCAGATTTTGCTTTTGCAAAGGAACGATGTGCTCAGCTGGAAGAAGAAAATAAAATTCTTCGGGAGAATCGTGAGAGGGGAAACAATCTAGAA
GATGATGATTTGGTTGCAATGGCAATGGCTGCCAAAGCAAAGCTGCTTCTTAGGGAGTTGAAGACTGTCAAAGCAGATTTTGCTTTTGCAAAGGAACGATGTGCTCAGCT
GGAAGAAGAAAATAAAATTCTTCGGGAGAATCGTGAGAGGGGAAACAATCTAGAAGATGATGATTTGATACGACTACAACTCGAAACACTTCTGGCAGAGAAGGCTCGTT
TGGCTCACGAAAATTCAGTCTATGCGCGCGAGAATCGGTTTCTAAGGGAGATTGTTGAATACCACCAGCTGACCATGCAGGATGTTATATACCTGGATGAGGGCACAGAA
GAAGTCACTGAAGTTTACCCCATTAAGCTTCCTGCAGCATCCAATCTCAATGCTGTTTCTCCGTTACCGCCTCAGCCATTGTCGTCTTCAAACCGACCCGGGGTAGGACT
TGACATGAGTTCACAAATGACTCACATTGTTCCTACTTGCCCTTTGCCCCCATCAGATCTGAGGGGAATCTCCTAA
Protein sequenceShow/hide protein sequence
MEYKRRQQGATRPSGFEEVARNSRRPTRFVSNEDDDVYDLSSDSGNDAVGSSRSSSSPQSLAAKAIKASSAHRDSSLSSAYAARSASRVSKPTFSSSSPSPASVQDSKAY
EYTSMKSLNESKHGFWGALARKAKAILDDDNVVQQPEAPGRMREQGFGAGAGAGAAGTGTATRVKYHNRSQSDESQPKTENSSFQKGLGAIASSLNYIGNAFEEKLTAVE
NKTVDIIQETRKHIKKKSGGSVAQYQTPNSGSTMESQTLPQTQTQMQTGLELQLKASRDVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLE
DDDLVAMAMAAKAKLLLRELKTVKADFAFAKERCAQLEEENKILRENRERGNNLEDDDLIRLQLETLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVIYLDEGTE
EVTEVYPIKLPAASNLNAVSPLPPQPLSSSNRPGVGLDMSSQMTHIVPTCPLPPSDLRGIS