; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007475 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007475
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionATP-dependent DNA helicase
Genome locationscaffold2:1567082..1574837
RNA-Seq ExpressionSpg007475
SyntenySpg007475
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589756.1 hypothetical protein SDJN03_15179, partial [Cucurbita argyrosperma subsp. sororia]2.1e-22476.82Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVKK
        MTLYPEKLA KREC D FH+EH HLDKR KPD H+S+ GPD L  +SS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQ   ST +G SNSETF+FVVKK
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVKK

Query:  ESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETN
        ++++ ++PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKANCPDDGPA LNVVLARRPLFFRETN
Subjt:  ESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETN

Query:  PQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDV
        PQPRKHTLWQA +DFT GEAS+QRQHFLQCP G+LNKHFEKLIQCDSRLNFLSRQPE+VL SPYFE  S+ FTT EQ        A+NDNQS+L+TFQDV
Subjt:  PQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDV

Query:  ASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GHQ
         SSSV SSL++EQ+SPQMVF+P T+E PSPSSVMDAHEIEENRS T+V+ KPRNWEQ+KV GLHPSMS+ DLV+HIG+HITEQM STK+PF DD S  +Q
Subjt:  ASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GHQ

Query:  DMLNEIALYLLSDNQ-SSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA
         MLNEIA YLLSD+Q SSAA  EVS+MSRV+SLCCLLQKEPAA+QS QT+ E+C +  +++E+ RLKDA+E RDGR+T ++I IHPEV KDV GSIQ SA
Subjt:  DMLNEIALYLLSDNQ-SSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA

Query:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        MSRKDSF DLL  LPRIASLPKF F+IS+GDE QD
Subjt:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

XP_008443832.1 PREDICTED: uncharacterized protein LOC103487331 isoform X3 [Cucumis melo]6.6e-22677.43Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV
        +TL+ +K  VKRECDDSFH EH   DKRFKPD H  QS+ G D L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG +ST +G SN+ETFDFVV
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV

Query:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE
        K ES+D ++PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGPA LNVVLARRPLFFRE
Subjt:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE

Query:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ
        TNPQPRKHTLWQAT+DFT GEAS+ RQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP +VL SPYFEPR+S FTT EQASI GLE+A N NQS+LS FQ
Subjt:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ

Query:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-G
        DV SS+  +SL +EQASPQMVFEP T+EAPSPSSVMDAHEIEEN S  KVTSKPRNWE IKVPGLHPSMS+SDLVNHIG+HITEQM STK+PF DD S  
Subjt:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-G

Query:  HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPS
        +Q ML++IA YLLSDNQ SA SDEVSLMSRV+SLCCLLQKEP  +QS QTN E+  EG +NK+D++LK  +ELRDG+N E +I I P      SGSIQ S
Subjt:  HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPS

Query:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        +MSRKDS+ +LLL LPRIASLPKFLF+ISDGDE QD
Subjt:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

XP_022135007.1 uncharacterized protein LOC111007116 isoform X1 [Momordica charantia]1.2e-22477.8Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKE
        M L PEK AVKR+CDDSF + H  LDKRFKPD H              QNNP DEPSPLGLTLRKSPSLLDLIQMKLS G  ST+G SNSE FDF+VKKE
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKE

Query:  SKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNP
         KD +MPGT EKLKASNFPAS LKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGP TLNVVL+R PLFFRETNP
Subjt:  SKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNP

Query:  QPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVA
        QPRKHTLWQAT+DFT GEASM+RQHFLQCPQGLLNKHFEKLIQCDS LNFLS+QPE+VL SPYFEP++S FTT EQAS  GLERAE+  +  L TFQ VA
Subjt:  QPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVA

Query:  SSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDM
        S SV SSLK+EQ SPQMV EPCTLE PSPSSVMD HEIEENRS TKVT +PRNWEQIKVPGLHPSMS+SDLVNHIG+HITEQM ST  PF D  S +Q+M
Subjt:  SSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDM

Query:  LNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIH---PEVTKDVSGSIQ-PS
        L+EIALYLL+DNQSSAASDEV L SRVDSLCCLL KEPAA+QSFQT+S+SC  G D+KED +LK+A+ELRD +N   +IK+H    E TKDVS S Q P+
Subjt:  LNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIH---PEVTKDVSGSIQ-PS

Query:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        A+SRKDSF DLLL LPRI SLPKFLFNISDGDE QD
Subjt:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

XP_038879515.1 uncharacterized protein LOC120071357 isoform X1 [Benincasa hispida]2.3e-23478.28Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--SGDSNSETFDFVVK
        MTLYPEKL VKRE DDSFH EH  LDKRFKPD H+S+SGPD L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG  ST  +G SN+ETF+FVVK
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--SGDSNSETFDFVVK

Query:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET
         ES+D ++PGT EKLKASNFPAS L+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDDGPA LNVVLARRPLFFRET
Subjt:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET

Query:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD
        NPQPRKHTLWQAT+DFT GEAS+QRQHFLQCP GLLNKHFEKLIQCDSRLNFLSRQPE+VL SPYFEP +S FTT +QASI GLE+AENDNQS+LSTFQD
Subjt:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD

Query:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTS
        V SSS  SSLK+E+ASPQMVFEP T+EAPSPSS             VMDAHEIEENRS TKVT KPRNWEQIKVPG+HPS+S+SDLVNHIG+HITEQM S
Subjt:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTS

Query:  TKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPE
        TK+PF D+ S  +Q ML++IA YLLSDNQ SAASDEVSLMSRVDSLCCLLQKEP  +QS QTN E+C EG + ++     D + LRDG+N E++IKIHPE
Subjt:  TKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPE

Query:  VTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
         T++VSGSIQ SAMSRKDS+ DLLL LPRIASLPK LF+ISDGDE QD
Subjt:  VTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

XP_038879516.1 uncharacterized protein LOC120071357 isoform X2 [Benincasa hispida]3.7e-23780.19Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--SGDSNSETFDFVVK
        MTLYPEKL VKRE DDSFH EH  LDKRFKPD H+S+SGPD L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG  ST  +G SN+ETF+FVVK
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST--SGDSNSETFDFVVK

Query:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET
         ES+D ++PGT EKLKASNFPAS L+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDDGPA LNVVLARRPLFFRET
Subjt:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET

Query:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD
        NPQPRKHTLWQAT+DFT GEAS+QRQHFLQCP GLLNKHFEKLIQCDSRLNFLSRQPE+VL SPYFEP +S FTT +QASI GLE+AENDNQS+LSTFQD
Subjt:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD

Query:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GH
        V SSS  SSLK+E+ASPQMVFEP T+EAPSPSSVMDAHEIEENRS TKVT KPRNWEQIKVPG+HPS+S+SDLVNHIG+HITEQM STK+PF D+ S  +
Subjt:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GH

Query:  QDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA
        Q ML++IA YLLSDNQ SAASDEVSLMSRVDSLCCLLQKEP  +QS QTN E+C EG + ++     D + LRDG+N E++IKIHPE T++VSGSIQ SA
Subjt:  QDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA

Query:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        MSRKDS+ DLLL LPRIASLPK LF+ISDGDE QD
Subjt:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

TrEMBL top hitse value%identityAlignment
A0A0A0LTD1 Uncharacterized protein9.6e-22377.2Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH-QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVK
        +TL+P+K  VKRECDDSFH EH   DKRFKPD H QS+   D L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG ++T +G SN+ETFDFVVK
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH-QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVK

Query:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET
         ES+D ++PGT EKLKASNFPASHLKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGPA LNVVLAR+PLFFRET
Subjt:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET

Query:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD
        NPQPRKHTLWQAT+DFT GEAS+QR+H LQCP GLLNKHFEKLIQCDSRLNFLSRQPE+VL SPYFEPR+S F T EQASI G E+A N NQS+LS FQD
Subjt:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD

Query:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GH
        V SS+  +SL +EQASPQMVFEP T++APSPSSVMDA EIEENRS TKVT KP NWEQIKVPGLHPSMS+SDLVNHIG+HITEQM STK+PF DD S  +
Subjt:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-GH

Query:  QDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA
        Q ML++IA YLLSDNQ SA SDEVSLMSRV+SLCCLLQKEP  +QS QTN E+  +G +NK+D++LKD +ELRDG+N E    I P      SGSIQ SA
Subjt:  QDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSA

Query:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        MSRKDS+ +LLL LPRIASLPKFLF+ISDGDE QD
Subjt:  MSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

A0A1S3B8H0 uncharacterized protein LOC103487331 isoform X11.9e-22375.59Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV
        +TL+ +K  VKRECDDSFH EH   DKRFKPD H  QS+ G D L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG +ST +G SN+ETFDFVV
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV

Query:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE
        K ES+D ++PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGPA LNVVLARRPLFFRE
Subjt:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE

Query:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ
        TNPQPRKHTLWQAT+DFT GEAS+ RQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP +VL SPYFEPR+S FTT EQASI GLE+A N NQS+LS FQ
Subjt:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ

Query:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMT
        DV SS+  +SL +EQASPQMVFEP T+EAPSPSS             VMDAHEIEEN S  KVTSKPRNWE IKVPGLHPSMS+SDLVNHIG+HITEQM 
Subjt:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMT

Query:  STKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHP
        STK+PF DD S  +Q ML++IA YLLSDNQ SA SDEVSLMSRV+SLCCLLQKEP  +QS QTN E+  EG +NK+D++LK  +ELRDG+N E +I I P
Subjt:  STKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHP

Query:  EVTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
              SGSIQ S+MSRKDS+ +LLL LPRIASLPKFLF+ISDGDE QD
Subjt:  EVTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

A0A1S3B8X8 uncharacterized protein LOC103487331 isoform X21.5e-22375.73Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH-QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVK
        +TL+ +K  VKRECDDSFH EH   DKRFKPD H QS+ G D L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG +ST +G SN+ETFDFVVK
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH-QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVVK

Query:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET
         ES+D ++PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGPA LNVVLARRPLFFRET
Subjt:  KESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRET

Query:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD
        NPQPRKHTLWQAT+DFT GEAS+ RQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP +VL SPYFEPR+S FTT EQASI GLE+A N NQS+LS FQD
Subjt:  NPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQD

Query:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTS
        V SS+  +SL +EQASPQMVFEP T+EAPSPSS             VMDAHEIEEN S  KVTSKPRNWE IKVPGLHPSMS+SDLVNHIG+HITEQM S
Subjt:  VASSSVGSSLKMEQASPQMVFEPCTLEAPSPSS-------------VMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTS

Query:  TKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPE
        TK+PF DD S  +Q ML++IA YLLSDNQ SA SDEVSLMSRV+SLCCLLQKEP  +QS QTN E+  EG +NK+D++LK  +ELRDG+N E +I I P 
Subjt:  TKSPFPDDIS-GHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPE

Query:  VTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
             SGSIQ S+MSRKDS+ +LLL LPRIASLPKFLF+ISDGDE QD
Subjt:  VTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

A0A1S3B908 uncharacterized protein LOC103487331 isoform X33.2e-22677.43Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV
        +TL+ +K  VKRECDDSFH EH   DKRFKPD H  QS+ G D L  TSS NNPLDEPSPLGL LRKSPSLLDLIQMKLSQG +ST +G SN+ETFDFVV
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLH--QSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITST-SGDSNSETFDFVV

Query:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE
        K ES+D ++PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDDGPA LNVVLARRPLFFRE
Subjt:  KKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRE

Query:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ
        TNPQPRKHTLWQAT+DFT GEAS+ RQHF+QCP GLLNKHFEKL+QCDSRLNFLSRQP +VL SPYFEPR+S FTT EQASI GLE+A N NQS+LS FQ
Subjt:  TNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQ

Query:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-G
        DV SS+  +SL +EQASPQMVFEP T+EAPSPSSVMDAHEIEEN S  KVTSKPRNWE IKVPGLHPSMS+SDLVNHIG+HITEQM STK+PF DD S  
Subjt:  DVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDIS-G

Query:  HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPS
        +Q ML++IA YLLSDNQ SA SDEVSLMSRV+SLCCLLQKEP  +QS QTN E+  EG +NK+D++LK  +ELRDG+N E +I I P      SGSIQ S
Subjt:  HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPS

Query:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        +MSRKDS+ +LLL LPRIASLPKFLF+ISDGDE QD
Subjt:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

A0A6J1C1F5 uncharacterized protein LOC111007116 isoform X16.0e-22577.8Show/hide
Query:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKE
        M L PEK AVKR+CDDSF + H  LDKRFKPD H              QNNP DEPSPLGLTLRKSPSLLDLIQMKLS G  ST+G SNSE FDF+VKKE
Subjt:  MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKE

Query:  SKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNP
         KD +MPGT EKLKASNFPAS LKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGP TLNVVL+R PLFFRETNP
Subjt:  SKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNP

Query:  QPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVA
        QPRKHTLWQAT+DFT GEASM+RQHFLQCPQGLLNKHFEKLIQCDS LNFLS+QPE+VL SPYFEP++S FTT EQAS  GLERAE+  +  L TFQ VA
Subjt:  QPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVA

Query:  SSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDM
        S SV SSLK+EQ SPQMV EPCTLE PSPSSVMD HEIEENRS TKVT +PRNWEQIKVPGLHPSMS+SDLVNHIG+HITEQM ST  PF D  S +Q+M
Subjt:  SSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDM

Query:  LNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIH---PEVTKDVSGSIQ-PS
        L+EIALYLL+DNQSSAASDEV L SRVDSLCCLL KEPAA+QSFQT+S+SC  G D+KED +LK+A+ELRD +N   +IK+H    E TKDVS S Q P+
Subjt:  LNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIH---PEVTKDVSGSIQ-PS

Query:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD
        A+SRKDSF DLLL LPRI SLPKFLFNISDGDE QD
Subjt:  AMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein2.0e-2643.9Show/hide
Query:  NFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPATLNVVLARRPLFFRETNPQPRKHTLW
        NFP S ++IG W   ++   D+VAK Y+AK KL+WE L G        LK KIEIQW+D+ + + +    D    L + L +RP FF ETNPQ  KHT W
Subjt:  NFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPATLNVVLARRPLFFRETNPQPRKHTLW

Query:  -QATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGF
         Q   DFT   AS  R+H L  P G+L K+ EKL+  DS  + L   P  V  S YF+   SGF
Subjt:  -QATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGF

AT2G24100.1 unknown protein1.2e-11349.24Show/hide
Query:  YPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKESKD
        +P KL +    +DS  +EH  L+KR K   + +         + S+ + L+EPSPLGL+L+KSPS  +LI+MKLSQ    +  DSNS      VKKES  
Subjt:  YPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKESKD

Query:  VSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNPQPR
            GT EKLKASNFPA+ L+IG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKAN P+D P TL +VLARRPLFFRETNPQPR
Subjt:  VSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNPQPR

Query:  KHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVASSS
        KHTLWQATSDFT G+ASM RQHFLQCP G++NKHFEKL+QCD RL  LSRQPE+ LA+P+F+ R S    FE  S+ G                   +S 
Subjt:  KHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVASSS

Query:  VGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRN-WEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDMLN
        VG+    E  S        + +A SPSSVMDA  IE    +  + S+  N W QIK+PGLH S+S++D +  +                D    +     
Subjt:  VGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPRN-WEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDMLN

Query:  EIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSAMSRKD
        E+   LLSDN  +  SDE S+MS+V+S C LLQ              S A  Q N E +  +    + + R+     K    V    S S     MSRKD
Subjt:  EIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAAIQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSAMSRKD

Query:  SFADLLLGLPRIASLPKFLFNISDGD
        SF+DLL+ LPRI SLPKFLFNIS+ D
Subjt:  SFADLLLGLPRIASLPKFLFNISDGD

AT3G05770.1 unknown protein7.0e-3239.65Show/hide
Query:  SQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKH
        + N  +DE   L L L K+P L++ I+  L    T     +         +  SK  ++P + EKLKA NFP S +KIG   + ++   D+VAK Y+AK 
Subjt:  SQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKESKDVSMPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKH

Query:  KLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPATLNVVLARRPLFFRETNPQPRKHTLW-QATSDFTSGEASMQRQHFLQCPQGLLNKHF
        KL+WE L G        LKSKIEIQW+D+ + + +    D    L + L +RP FF ETNPQ  KHT W Q   DFT  +AS  R+H L  P G+L K+ 
Subjt:  KLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DGPATLNVVLARRPLFFRETNPQPRKHTLW-QATSDFTSGEASMQRQHFLQCPQGLLNKHF

Query:  EKLIQCDSRLNFLSRQPEMVLASPYFE
        EKL+  DS  + L + P  V  S YF+
Subjt:  EKLIQCDSRLNFLSRQPEMVLASPYFE

AT4G30780.1 unknown protein3.9e-11546.27Show/hide
Query:  EKLAVKRE-CDDSFHDEHVHLDKR---FKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQ-GITSTSGDSNSETFDFVVKKE
        ++L VK E  +D   +EH  L+KR   + P    S   P       ++ NPLDEPSPLGL+L+KSPSLL+LIQMK++  G    +    +      +K+E
Subjt:  EKLAVKRE-CDDSFHDEHVHLDKR---FKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQ-GITSTSGDSNSETFDFVVKKE

Query:  SKDVS---------MPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARR
        SK ++          PG+ EKLKASNFPAS LKIG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKANCP+DGP TL +VLAR+
Subjt:  SKDVS---------MPGTAEKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARR

Query:  PLFFRETNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQS
        PLFFRETNPQPRKHTLWQATSDFT G+ASM RQHFLQC QG++NKHFEKL+QCD RL  LSRQPE+ + SPYF+ R S    FE  S        N N  
Subjt:  PLFFRETNPQPRKHTLWQATSDFTSGEASMQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQS

Query:  ILSTFQDVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPR-NWEQIKVP------------------------------
         LST   ++ +   +S    Q+S + ++   + EAPSPSSV+DA   E    +  V S+ R +  QI+ P                              
Subjt:  ILSTFQDVASSSVGSSLKMEQASPQMVFEPCTLEAPSPSSVMDAHEIEENRSTTKVTSKPR-NWEQIKVP------------------------------

Query:  -GLHPSMSVSDLVNHI---------------GYHITEQMTSTKSPFPDDISG----HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAA
         GLH SMSVSD V ++               G H +  ++       D   G    H +    +   LLSDN    A DE SLM RV+SL  LL K+P  
Subjt:  -GLHPSMSVSDLVNHI---------------GYHITEQMTSTKSPFPDDISG----HQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAA

Query:  IQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDED
          + Q N+E  + G  ++    + D     +  N  NN ++      D + S +P  M RKDSF+DLLL LPRI SLPKFL NIS+ D D
Subjt:  IQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTTGTATCCCGAGAAATTGGCGGTGAAGCGGGAGTGTGATGATTCTTTTCACGACGAGCATGTTCACTTGGACAAGCGGTTCAAGCCCGATCTTCATCAGTCGAT
TTCAGGACCTGACGCATTACCTGTAACCTCTTCACAGAATAATCCACTTGATGAGCCTAGTCCTCTTGGTTTGACTCTAAGGAAAAGCCCGTCCCTGTTGGATCTAATTC
AAATGAAACTCTCACAGGGGATTACGTCTACATCTGGAGATTCAAACAGTGAAACTTTTGACTTCGTGGTTAAAAAAGAAAGCAAGGACGTTAGCATGCCGGGTACCGCT
GAGAAACTAAAGGCTTCAAACTTCCCAGCTTCACATTTAAAGATTGGACGTTGGGAGTATAAATCAAGATATGAAGGTGATCTCGTGGCCAAGTGTTACTATGCTAAGCA
TAAGCTTGTCTGGGAAATCCTTGAGGGTGGACTCAAGAGTAAGATAGAGATACAATGGTCAGATATTATGGCTTTGAAAGCGAATTGTCCCGATGATGGACCTGCTACAC
TAAACGTTGTGCTGGCTAGACGGCCTCTATTCTTCAGGGAGACCAACCCACAACCAAGGAAGCACACTTTGTGGCAGGCAACATCTGACTTTACTAGTGGTGAGGCCAGC
ATGCAGAGGCAGCATTTTCTGCAATGTCCTCAAGGGCTTCTAAACAAGCATTTCGAAAAGCTCATCCAATGTGACTCACGCCTAAACTTCTTAAGCCGACAACCAGAGAT
GGTGTTGGCTTCTCCATATTTTGAACCAAGATCCTCCGGTTTTACAACCTTTGAGCAAGCCAGCATCTGTGGTTTGGAGCGAGCAGAGAATGACAATCAATCTATACTTT
CTACTTTCCAGGATGTAGCTTCATCATCTGTTGGGTCATCGTTGAAGATGGAACAGGCTTCTCCTCAGATGGTGTTCGAACCTTGCACCCTGGAGGCGCCTTCCCCCAGT
TCAGTGATGGATGCTCACGAGATTGAAGAAAATAGAAGCACTACTAAAGTTACCAGCAAGCCAAGAAACTGGGAGCAGATAAAGGTTCCGGGGCTTCATCCGTCCATGTC
AGTGAGTGACCTCGTAAATCACATTGGATATCATATTACAGAACAAATGACTTCTACAAAATCACCCTTTCCTGATGATATTTCGGGACACCAAGATATGTTGAATGAAA
TTGCACTGTACTTGCTGAGTGACAACCAATCTTCAGCAGCTTCTGATGAGGTATCACTCATGTCGAGAGTCGATTCTCTCTGTTGCCTGTTGCAGAAAGAACCTGCAGCA
ATTCAAAGTTTTCAAACCAATAGTGAAAGCTGTGCTGAAGGACAAGATAACAAAGAGGATTCTCGACTCAAGGATGCCTCGGAATTGAGGGATGGCAGGAATACGGAAAA
CAACATTAAGATTCATCCAGAGGTGACAAAGGATGTTTCTGGAAGCATACAACCCTCAGCCATGTCAAGGAAAGATTCATTTGCGGATTTGCTGTTGGGTCTTCCTCGAA
TTGCGTCACTCCCTAAGTTCTTGTTTAACATTTCAGATGGTGATGAAGATCAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGTTGTATCCCGAGAAATTGGCGGTGAAGCGGGAGTGTGATGATTCTTTTCACGACGAGCATGTTCACTTGGACAAGCGGTTCAAGCCCGATCTTCATCAGTCGAT
TTCAGGACCTGACGCATTACCTGTAACCTCTTCACAGAATAATCCACTTGATGAGCCTAGTCCTCTTGGTTTGACTCTAAGGAAAAGCCCGTCCCTGTTGGATCTAATTC
AAATGAAACTCTCACAGGGGATTACGTCTACATCTGGAGATTCAAACAGTGAAACTTTTGACTTCGTGGTTAAAAAAGAAAGCAAGGACGTTAGCATGCCGGGTACCGCT
GAGAAACTAAAGGCTTCAAACTTCCCAGCTTCACATTTAAAGATTGGACGTTGGGAGTATAAATCAAGATATGAAGGTGATCTCGTGGCCAAGTGTTACTATGCTAAGCA
TAAGCTTGTCTGGGAAATCCTTGAGGGTGGACTCAAGAGTAAGATAGAGATACAATGGTCAGATATTATGGCTTTGAAAGCGAATTGTCCCGATGATGGACCTGCTACAC
TAAACGTTGTGCTGGCTAGACGGCCTCTATTCTTCAGGGAGACCAACCCACAACCAAGGAAGCACACTTTGTGGCAGGCAACATCTGACTTTACTAGTGGTGAGGCCAGC
ATGCAGAGGCAGCATTTTCTGCAATGTCCTCAAGGGCTTCTAAACAAGCATTTCGAAAAGCTCATCCAATGTGACTCACGCCTAAACTTCTTAAGCCGACAACCAGAGAT
GGTGTTGGCTTCTCCATATTTTGAACCAAGATCCTCCGGTTTTACAACCTTTGAGCAAGCCAGCATCTGTGGTTTGGAGCGAGCAGAGAATGACAATCAATCTATACTTT
CTACTTTCCAGGATGTAGCTTCATCATCTGTTGGGTCATCGTTGAAGATGGAACAGGCTTCTCCTCAGATGGTGTTCGAACCTTGCACCCTGGAGGCGCCTTCCCCCAGT
TCAGTGATGGATGCTCACGAGATTGAAGAAAATAGAAGCACTACTAAAGTTACCAGCAAGCCAAGAAACTGGGAGCAGATAAAGGTTCCGGGGCTTCATCCGTCCATGTC
AGTGAGTGACCTCGTAAATCACATTGGATATCATATTACAGAACAAATGACTTCTACAAAATCACCCTTTCCTGATGATATTTCGGGACACCAAGATATGTTGAATGAAA
TTGCACTGTACTTGCTGAGTGACAACCAATCTTCAGCAGCTTCTGATGAGGTATCACTCATGTCGAGAGTCGATTCTCTCTGTTGCCTGTTGCAGAAAGAACCTGCAGCA
ATTCAAAGTTTTCAAACCAATAGTGAAAGCTGTGCTGAAGGACAAGATAACAAAGAGGATTCTCGACTCAAGGATGCCTCGGAATTGAGGGATGGCAGGAATACGGAAAA
CAACATTAAGATTCATCCAGAGGTGACAAAGGATGTTTCTGGAAGCATACAACCCTCAGCCATGTCAAGGAAAGATTCATTTGCGGATTTGCTGTTGGGTCTTCCTCGAA
TTGCGTCACTCCCTAAGTTCTTGTTTAACATTTCAGATGGTGATGAAGATCAAGATTAA
Protein sequenceShow/hide protein sequence
MTLYPEKLAVKRECDDSFHDEHVHLDKRFKPDLHQSISGPDALPVTSSQNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQGITSTSGDSNSETFDFVVKKESKDVSMPGTA
EKLKASNFPASHLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDGPATLNVVLARRPLFFRETNPQPRKHTLWQATSDFTSGEAS
MQRQHFLQCPQGLLNKHFEKLIQCDSRLNFLSRQPEMVLASPYFEPRSSGFTTFEQASICGLERAENDNQSILSTFQDVASSSVGSSLKMEQASPQMVFEPCTLEAPSPS
SVMDAHEIEENRSTTKVTSKPRNWEQIKVPGLHPSMSVSDLVNHIGYHITEQMTSTKSPFPDDISGHQDMLNEIALYLLSDNQSSAASDEVSLMSRVDSLCCLLQKEPAA
IQSFQTNSESCAEGQDNKEDSRLKDASELRDGRNTENNIKIHPEVTKDVSGSIQPSAMSRKDSFADLLLGLPRIASLPKFLFNISDGDEDQD