; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001310 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001310
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSerine/threonine-protein kinase SRPK
Genome locationChr09:15960898..15965616
RNA-Seq ExpressionHG10001310
SyntenyHG10001310
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145826.1 uncharacterized protein LOC101215373 [Cucumis sativus]3.3e-22880.52Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSD DPI+AEL+ADLEPV+D NGPAHHPSAP DE+FDISTTVDPSYIISLIRKLLPLNA NT NS GNG    DTSV  MDE      GDQ+ SSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T SKCLGIEI D S KLA+K+GEDEGACP+SEQLISSSE+KVWEE GCILWDLSAS+S AELMVQNLVLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLLN GLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQE+VHVLL CLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFSFEMKILTNERS ERHSILDV+LRAVEALSG E++S+E+CSNKELFQL+RDLVKLPDAFEVSS CISAVVLIANILSDVPDLAF+ SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGLLD+FSF GDD EAR AVWSIIARIL+RVQEN +SRP+LFEYVSLL                                       LRRIISILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRDEY +EDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

XP_008458652.1 PREDICTED: uncharacterized protein LOC103497988 isoform X1 [Cucumis melo]1.9e-22881.09Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AELEAD+EPVED NGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNA NT NS  NG    DTSV  MDE      GDQ+LSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T SKCLG+EIADGS KLA+K+GEDEGAC +SEQLISS E+KVWEE GCILWDLSAS+S AELMVQNLVLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLLN GLQSSECVIWAEALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFSFEMKILTNERS ERHSILDV+LRAVE LSGIE++S E+CSNKELFQL+RDLVKLPDAFEVSS CISAVVLIANILSDVPDLAF+ SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGL D FSFAGDDLEAR AVWSIIARIL+RVQEN +SRP+L EYVSLL                                       LRRIISILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRDEY VEDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

XP_022944140.1 uncharacterized protein LOC111448685 isoform X1 [Cucurbita moschata]2.4e-22378.65Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AEL+ +LE VE G GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP +A N  NSYG  D DRD SVTNMDE      GDQVLSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T ++C GIEIADGSDKLA+++GEDEGACP SEQ ISSSE+ VWEE GCILWDLSASKSHAELMVQNLVLEVLSANL+VSQSVRVMEI LGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLIT IVNQLFLDDAQCLCEVCRLL+AGL SSEC IWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSS LFNLFSFEMKILTNERS ER+SILD +LRAVEALSGIE++SQE CSNK+LFQL+ +LVKLPDAFEVSS C+SAV+LIANILSDVPDLAFD SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGLLD+FSFAGDDLEAR AVWSIIARIL+ V+E  +SRPR+FEYVSLL                                       LRRII+ILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WT SKDEGTDVRDEY  ED+DVNRLL+CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

XP_038901475.1 uncharacterized protein LOC120088329 isoform X1 [Benincasa hispida]1.7e-22979.08Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDE-----------LFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE----
        MEVGSDSDPI+AELEADLEPVEDGNGPAHHPSAP DE           LFDISTTVDPSYIISLIRKLLP NA N  +SYGNGDGDRDTS+T MDE    
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDE-----------LFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE----

Query:  --GDQVLSSSGTASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISL
          GDQVLS SG+ SKCLGIEIADGSDKLA+K GEDEGAC +SEQL+SSSE+KVWEE GCILWDLSASKSHAELMVQN VLEVLSANL+VSQSVRVMEISL
Subjt:  --GDQVLSSSGTASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISL

Query:  GIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEV
        GIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSR+LW+SENTLNPQLIEKSVGLLSTIIESQQEV
Subjt:  GIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEV

Query:  VHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDV
        VH+LLPCLMKLGLSSVLFNLFS EMKILTNERS ERHSILDV+LR  EALSG+E++SQEICSNKELF+L+ DLVKLPDAFEV S CIS+VVLIANILSDV
Subjt:  VHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDV

Query:  PDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL--------------------------------------
        PD AFD SQDL+FLQGLLD+FSF G+D EAR A+WSI ARIL+ VQEN++SR RLFEYVSLL                                      
Subjt:  PDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL--------------------------------------

Query:  -LRRIISILNRWTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
         LRRIISIL+ WTASKDEGTDVRD YHVEDVD+NRLLNCC KHSE
Subjt:  -LRRIISILNRWTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

XP_038901476.1 uncharacterized protein LOC120088329 isoform X2 [Benincasa hispida]4.9e-23280.71Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AELEADLEPVEDGNGPAHHPSAP DELFDISTTVDPSYIISLIRKLLP NA N  +SYGNGDGDRDTS+T MDE      GDQVLS SG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        + SKCLGIEIADGSDKLA+K GEDEGAC +SEQL+SSSE+KVWEE GCILWDLSASKSHAELMVQN VLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSR+LW+SENTLNPQLIEKSVGLLSTIIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFS EMKILTNERS ERHSILDV+LR  EALSG+E++SQEICSNKELF+L+ DLVKLPDAFEV S CIS+VVLIANILSDVPD AFD SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        +FLQGLLD+FSF G+D EAR A+WSI ARIL+ VQEN++SR RLFEYVSLL                                       LRRIISIL+ 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRD YHVEDVD+NRLLNCC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

TrEMBL top hitse value%identityAlignment
A0A0A0KDI1 Uncharacterized protein1.6e-22880.52Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSD DPI+AEL+ADLEPV+D NGPAHHPSAP DE+FDISTTVDPSYIISLIRKLLPLNA NT NS GNG    DTSV  MDE      GDQ+ SSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T SKCLGIEI D S KLA+K+GEDEGACP+SEQLISSSE+KVWEE GCILWDLSAS+S AELMVQNLVLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLLN GLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQE+VHVLL CLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFSFEMKILTNERS ERHSILDV+LRAVEALSG E++S+E+CSNKELFQL+RDLVKLPDAFEVSS CISAVVLIANILSDVPDLAF+ SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGLLD+FSF GDD EAR AVWSIIARIL+RVQEN +SRP+LFEYVSLL                                       LRRIISILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRDEY +EDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

A0A1S3C8G6 uncharacterized protein LOC103497988 isoform X19.3e-22981.09Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AELEAD+EPVED NGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNA NT NS  NG    DTSV  MDE      GDQ+LSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T SKCLG+EIADGS KLA+K+GEDEGAC +SEQLISS E+KVWEE GCILWDLSAS+S AELMVQNLVLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLLN GLQSSECVIWAEALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFSFEMKILTNERS ERHSILDV+LRAVE LSGIE++S E+CSNKELFQL+RDLVKLPDAFEVSS CISAVVLIANILSDVPDLAF+ SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGL D FSFAGDDLEAR AVWSIIARIL+RVQEN +SRP+L EYVSLL                                       LRRIISILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRDEY VEDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

A0A1S3C8Y1 uncharacterized protein LOC103497988 isoform X32.0e-22379.96Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AELEAD+EPVED NGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNA NT NS  NG    DTSV  MDE      GDQ+LSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T SKCLG+EIADGS KLA+K+GEDEGAC +SEQLISS E+KVWEE GCILWDLSAS+S AELMVQNLVLEVLSANL+VSQSVRVMEISLGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIV+QLFLDDAQCLCEVCRLLN GLQSSECVIWAEALN EHVLSRILWVSENTLNPQLIEKSVGLLSTIIES QEVVH LLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSSVLFNLFSFEMKILTNERS ERHSILDV+LRAVE LSGIE++S E+CSNKELFQL+RDLVKLPDAFEVSS CISAVVLIANILSDVPDLAF+ S   
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
           QGL D FSFAGDDLEAR AVWSIIARIL+RVQEN +SRP+L EYVSLL                                       LRRIISILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WTASKDEGTDVRDEY VEDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

A0A6J1FYH6 uncharacterized protein LOC111448685 isoform X11.2e-22378.65Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AEL+ +LE VE G GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP +A N  NSYG  D DRD SVTNMDE      GDQVLSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T ++C GIEIADGSDKLA+++GEDEGACP SEQ ISSSE+ VWEE GCILWDLSASKSHAELMVQNLVLEVLSANL+VSQSVRVMEI LGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLIT IVNQLFLDDAQCLCEVCRLL+AGL SSEC IWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSS LFNLFSFEMKILTNERS ER+SILD +LRAVEALSGIE++SQE CSNK+LFQL+ +LVKLPDAFEVSS C+SAV+LIANILSDVPDLAFD SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGLLD+FSFAGDDLEAR AVWSIIARIL+ V+E  +SRPR+FEYVSLL                                       LRRII+ILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        WT SKDEGTDVRDEY  ED+DVNRLL+CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

A0A6J1J751 uncharacterized protein LOC1114840772.4e-22178.28Show/hide
Query:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG
        MEVGSDSDPI+AEL+ +LE VE G GPAHHPSAP DELFDISTTVDPSYIISLIRKLLP NA N  NSYG  D D + SVTNMDE      GDQVLSSSG
Subjt:  MEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDE------GDQVLSSSG

Query:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV
        T ++C GIEIADGSDKLA+++GEDEGACP SEQ ISSSE+ VWEE GCILWDLSASKSHAELMVQNLVLEVLSANL+VSQSVRVMEI LGIIGNLACHEV
Subjt:  TASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEV

Query:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
        PMKHIV KSGLITTIVNQLFLDDAQCLCEVCRLL+AGLQSSEC IWA ALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL
Subjt:  PMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKL

Query:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL
        GLSS LFNLFSFEMKILTNERS ER+SILD +LRAVEALSGIE++SQE CSNK+LFQL+ +LVKLPDAFEVSS CISAV+LIANILSD+PDLAFD SQDL
Subjt:  GLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDL

Query:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR
        SFLQGLLD+FSFAGDDLEAR AVWSIIARIL+ V+E  +SRPR+FE VSLL                                       L RII+ILN 
Subjt:  SFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLL---------------------------------------LRRIISILNR

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE
        W ASKDEGTDVRDEY  ED+DVNRLL+CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE

SwissProt top hitse value%identityAlignment
Q6DCP5 Protein saal16.2e-0425.15Show/hide
Query:  GEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFL
        G +  A  E+E  +   E+   E C   +WD+S ++  A  + +    E+L   ++ S+  R+ EI +GI+GN++C + P   I     L    +  L  
Subjt:  GEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFL

Query:  DDAQCLCEVCRLLNAGLQSSECV-IWAEALNSE-HVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVV
         D   L E  RLL   L  +E    WAE       V   + ++  ++ N  L+ K   LL  + +  ++++
Subjt:  DDAQCLCEVCRLLNAGLQSSECV-IWAEALNSE-HVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVV

Q803M5 Protein saal11.5e-0523.08Show/hide
Query:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL
        C  S+     ++D   EE  C +WD++  K  A  + +    ++L   +  S + R+ EI +GI+GN+AC       +   S L   ++  L  +D   L
Subjt:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL

Query:  CEVCRLLNAGL-QSSECVIWAEALNSEH-VLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPC------LMKLGLSSVLFNLFSFEMKILTN
         E CRLL   L Q+    +W E +  +  V S + ++  ++ N  L+ K   LL  + +  +E++   +        L       +L +L          
Subjt:  CEVCRLLNAGL-QSSECVIWAEALNSEH-VLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPC------LMKLGLSSVLFNLFSFEMKILTN

Query:  ERSTERHSILDVVLRAVEALSGIEDYSQEICSNK
        +  +E    L+V L +++ L+ +E+  Q + S++
Subjt:  ERSTERHSILDVVLRAVEALSGIEDYSQEICSNK

Q96ER3 Protein SAAL15.6e-0521.32Show/hide
Query:  EDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLD
        E+  +  + E+ ++  ++++  E  C +WD+S  +  A  + +    ++    L  S+  R+ EI +GI+GN+AC +     I +   L   +++ L+  
Subjt:  EDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLD

Query:  DAQCLCEVCRLLNAGLQSSECV-IWAEALNSEH--VLSRILWVSENTLNPQLIEKSVGLLSTII------------------------ESQQEVVHVLLP
        D   L E  RLL   L  +E   +W E +  EH  +   I ++  ++ N  L+ K   ++  +                         ES+++ V  L+P
Subjt:  DAQCLCEVCRLLNAGLQSSECV-IWAEALNSEH--VLSRILWVSENTLNPQLIEKSVGLLSTII------------------------ESQQEVVHVLLP

Query:  CLMKLGLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEIC----SNKELFQLLRDLV
        C+++                    +  +E    LDV +  ++ L+ ++D  Q I     + K+++ LL DLV
Subjt:  CLMKLGLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEIC----SNKELFQLLRDLV

Arabidopsis top hitse value%identityAlignment
AT5G22820.1 ARM repeat superfamily protein2.2e-11352.24Show/hide
Query:  PAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRD-TSVTNMDEGDQVLSSSGTASKCLG----IEIADGSDKLANKKGEDEGA
        P+HHP  P DELFDISTTVDPSY+ISLIRKLLP+++       G+ +   D  +  N+ +G   +S +G      G    ++I D  D+   + GE   +
Subjt:  PAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRD-TSVTNMDEGDQVLSSSGTASKCLG----IEIADGSDKLANKKGEDEGA

Query:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL
        CP       SS    WE+ GC+LWDL+AS++HAELMVQNL+LEVL ANL+VS+S R+ EI LGII NLACHE  +KHI + +G++ T+V QLFLDD QCL
Subjt:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL

Query:  CEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHS
         EVCR+L  GL  + C  WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  IIE Q EV  +L+P LM LGL+S+L NL SFEM  LT ER  ER+ 
Subjt:  CEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHS

Query:  ILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSII
        +L+++LRA+EALS  + YS+EICS+KELFQL+ DL+KL D  EV++ C++  VLIAN+LS+  D   +  +D SFL+GL     FA DD+EAR A+W++I
Subjt:  ILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSII

Query:  ARILIRVQENTISRPRLFEYVSLLL
        AR+L RV E+ I+   L +Y+ +LL
Subjt:  ARILIRVQENTISRPRLFEYVSLLL

AT5G22820.2 ARM repeat superfamily protein8.0e-11646.84Show/hide
Query:  PAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRD-TSVTNMDEGDQVLSSSGTASKCLG----IEIADGSDKLANKKGEDEGA
        P+HHP  P DELFDISTTVDPSY+ISLIRKLLP+++       G+ +   D  +  N+ +G   +S +G      G    ++I D  D+   + GE   +
Subjt:  PAHHPSAPLDELFDISTTVDPSYIISLIRKLLPLNAINTSNSYGNGDGDRD-TSVTNMDEGDQVLSSSGTASKCLG----IEIADGSDKLANKKGEDEGA

Query:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL
        CP       SS    WE+ GC+LWDL+AS++HAELMVQNL+LEVL ANL+VS+S R+ EI LGII NLACHE  +KHI + +G++ T+V QLFLDD QCL
Subjt:  CPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQNLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCL

Query:  CEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHS
         EVCR+L  GL  + C  WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  IIE Q EV  +L+P LM LGL+S+L NL SFEM  LT ER  ER+ 
Subjt:  CEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKSVGLLSTIIESQQEVVHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHS

Query:  ILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSII
        +L+++LRA+EALS  + YS+EICS+KELFQL+ DL+KL D  EV++ C++  VLIAN+LS+  D   +  +D SFL+GL     FA DD+EAR A+W++I
Subjt:  ILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANILSDVPDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSII

Query:  ARILIRVQENTISRPRLFEYVSLLL--------------------------------------RRIISILNRWTASKD--EGTDVRDEYHVEDVDVNRLL
        AR+L RV E+ I+   L +Y+ +LL                                      ++I SILN W A K+  +   V     +   DV RL 
Subjt:  ARILIRVQENTISRPRLFEYVSLLL--------------------------------------RRIISILNRWTASKD--EGTDVRDEYHVEDVDVNRLL

Query:  NCCRKH
        +CC ++
Subjt:  NCCRKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGAAGATACCCGGCCCGATTGAAGAAACTGGTCGAGTGCCGGAGAGACCGAGTAGCTGGGAAATTTCAGTCTTTACGAATACGACGCTTCTCTGTAGAGAATTT
TGGAGGGAAGCCAAGGTTGAAGACGACTGCGAGTCTGAAAGCTAACCGTGTTCTTGAACCTTCAAACCTTGCTATGGAGGTGGGCTCAGATTCAGACCCTATAGATGCGG
AATTGGAGGCGGACCTTGAACCTGTAGAAGACGGCAATGGACCTGCTCATCACCCTTCTGCTCCATTGGATGAGTTATTTGACATCTCAACGACGGTTGATCCTAGCTAT
ATTATCTCTCTAATACGGAAACTTCTGCCACTGAACGCAATTAACACGTCCAATTCTTATGGAAATGGAGATGGCGACCGTGACACCTCAGTAACCAACATGGATGAAGG
TGACCAAGTCTTAAGTTCTTCAGGAACAGCGAGTAAATGCCTGGGCATTGAAATTGCGGATGGTTCTGATAAACTTGCTAATAAAAAAGGCGAGGATGAAGGTGCATGTC
CTGAATCGGAGCAACTTATTTCATCCTCAGAAGATAAGGTCTGGGAAGAGTGTGGTTGCATTCTGTGGGATCTTTCTGCGAGTAAATCTCATGCAGAACTTATGGTTCAG
AACCTTGTCCTTGAAGTTCTTTCTGCAAACCTTTTGGTCTCACAATCTGTGCGTGTTATGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCAT
GAAACATATAGTCACCAAAAGTGGATTGATTACAACCATTGTGAACCAGCTGTTTCTAGATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAAATGCCGGACTTC
AAAGTAGCGAATGTGTTATATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTATCTCGTATTCTATGGGTTTCTGAGAATACCTTAAATCCACAACTTATAGAAAAGAGT
GTTGGGCTATTATCAACCATTATTGAAAGTCAGCAAGAAGTTGTTCATGTTCTTCTCCCATGTTTGATGAAGCTGGGTTTGTCGAGTGTTTTGTTCAACCTTTTTTCTTT
TGAGATGAAAATATTAACAAATGAAAGATCAACTGAAAGGCATTCAATTTTGGACGTGGTTCTTCGGGCAGTTGAAGCACTCTCTGGAATTGAAGACTATTCTCAGGAAA
TTTGTTCAAATAAAGAACTTTTTCAGCTTCTTCGTGACCTAGTCAAATTGCCAGATGCATTTGAGGTTTCCAGCTGTTGTATCAGTGCTGTAGTTTTGATCGCAAATATT
TTGTCAGATGTACCTGATCTAGCCTTTGACACTTCTCAGGATTTGTCTTTCCTACAAGGTCTACTTGATGTATTCTCTTTCGCTGGGGATGACTTAGAGGCACGTGGTGC
TGTTTGGAGCATCATTGCCAGGATATTGATTCGTGTTCAAGAAAATACGATTAGCAGACCAAGGCTGTTTGAGTATGTGTCATTACTACTAAGAAGGATAATTTCTATTT
TAAATCGTTGGACTGCTTCTAAGGATGAAGGGACAGATGTAAGAGACGAATATCATGTAGAAGATGTTGATGTCAATAGATTGTTGAATTGTTGCCGTAAACATTCTGAG
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGAAGATACCCGGCCCGATTGAAGAAACTGGTCGAGTGCCGGAGAGACCGAGTAGCTGGGAAATTTCAGTCTTTACGAATACGACGCTTCTCTGTAGAGAATTT
TGGAGGGAAGCCAAGGTTGAAGACGACTGCGAGTCTGAAAGCTAACCGTGTTCTTGAACCTTCAAACCTTGCTATGGAGGTGGGCTCAGATTCAGACCCTATAGATGCGG
AATTGGAGGCGGACCTTGAACCTGTAGAAGACGGCAATGGACCTGCTCATCACCCTTCTGCTCCATTGGATGAGTTATTTGACATCTCAACGACGGTTGATCCTAGCTAT
ATTATCTCTCTAATACGGAAACTTCTGCCACTGAACGCAATTAACACGTCCAATTCTTATGGAAATGGAGATGGCGACCGTGACACCTCAGTAACCAACATGGATGAAGG
TGACCAAGTCTTAAGTTCTTCAGGAACAGCGAGTAAATGCCTGGGCATTGAAATTGCGGATGGTTCTGATAAACTTGCTAATAAAAAAGGCGAGGATGAAGGTGCATGTC
CTGAATCGGAGCAACTTATTTCATCCTCAGAAGATAAGGTCTGGGAAGAGTGTGGTTGCATTCTGTGGGATCTTTCTGCGAGTAAATCTCATGCAGAACTTATGGTTCAG
AACCTTGTCCTTGAAGTTCTTTCTGCAAACCTTTTGGTCTCACAATCTGTGCGTGTTATGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCAT
GAAACATATAGTCACCAAAAGTGGATTGATTACAACCATTGTGAACCAGCTGTTTCTAGATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAAATGCCGGACTTC
AAAGTAGCGAATGTGTTATATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTATCTCGTATTCTATGGGTTTCTGAGAATACCTTAAATCCACAACTTATAGAAAAGAGT
GTTGGGCTATTATCAACCATTATTGAAAGTCAGCAAGAAGTTGTTCATGTTCTTCTCCCATGTTTGATGAAGCTGGGTTTGTCGAGTGTTTTGTTCAACCTTTTTTCTTT
TGAGATGAAAATATTAACAAATGAAAGATCAACTGAAAGGCATTCAATTTTGGACGTGGTTCTTCGGGCAGTTGAAGCACTCTCTGGAATTGAAGACTATTCTCAGGAAA
TTTGTTCAAATAAAGAACTTTTTCAGCTTCTTCGTGACCTAGTCAAATTGCCAGATGCATTTGAGGTTTCCAGCTGTTGTATCAGTGCTGTAGTTTTGATCGCAAATATT
TTGTCAGATGTACCTGATCTAGCCTTTGACACTTCTCAGGATTTGTCTTTCCTACAAGGTCTACTTGATGTATTCTCTTTCGCTGGGGATGACTTAGAGGCACGTGGTGC
TGTTTGGAGCATCATTGCCAGGATATTGATTCGTGTTCAAGAAAATACGATTAGCAGACCAAGGCTGTTTGAGTATGTGTCATTACTACTAAGAAGGATAATTTCTATTT
TAAATCGTTGGACTGCTTCTAAGGATGAAGGGACAGATGTAAGAGACGAATATCATGTAGAAGATGTTGATGTCAATAGATTGTTGAATTGTTGCCGTAAACATTCTGAG
TAA
Protein sequenceShow/hide protein sequence
MAGRYPARLKKLVECRRDRVAGKFQSLRIRRFSVENFGGKPRLKTTASLKANRVLEPSNLAMEVGSDSDPIDAELEADLEPVEDGNGPAHHPSAPLDELFDISTTVDPSY
IISLIRKLLPLNAINTSNSYGNGDGDRDTSVTNMDEGDQVLSSSGTASKCLGIEIADGSDKLANKKGEDEGACPESEQLISSSEDKVWEECGCILWDLSASKSHAELMVQ
NLVLEVLSANLLVSQSVRVMEISLGIIGNLACHEVPMKHIVTKSGLITTIVNQLFLDDAQCLCEVCRLLNAGLQSSECVIWAEALNSEHVLSRILWVSENTLNPQLIEKS
VGLLSTIIESQQEVVHVLLPCLMKLGLSSVLFNLFSFEMKILTNERSTERHSILDVVLRAVEALSGIEDYSQEICSNKELFQLLRDLVKLPDAFEVSSCCISAVVLIANI
LSDVPDLAFDTSQDLSFLQGLLDVFSFAGDDLEARGAVWSIIARILIRVQENTISRPRLFEYVSLLLRRIISILNRWTASKDEGTDVRDEYHVEDVDVNRLLNCCRKHSE