; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019468 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019468
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr04:22132059..22134612
RNA-Seq ExpressionHG10019468
SyntenyHG10019468
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606848.1 hypothetical protein SDJN03_00190, partial [Cucurbita argyrosperma subsp. sororia]1.9e-24685.31Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-
        MVQLMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERW SEANAF  SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLAT++K 
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-

Query:  -----KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQ
             KDQKGAN  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQ
Subjt:  -----KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQ

Query:  PLFFREINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGM
        PLFFREINPQPKKHTLWQATADFTGGEASR+R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NES EGI LK  EGPTFFSLGM
Subjt:  PLFFREINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGM

Query:  VSPSGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEEN
        VS SGTQSPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRL  NKWDQV+VPGIRPSMS+SDFV+HIEHCL        PMFSE+N
Subjt:  VSPSGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEEN

Query:  QQSREALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRND
        QQSRE LEGITQYLFGDSQHASD+DEQTIMSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+A E EI N E    RNGFESSKHIAMSRND
Subjt:  QQSREALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRND

Query:  SVGELLLNLPRIASLPQFLFNLFDDSDDRAR
        SVGELLLNLPRIASLP+F FNLFDDSDDRAR
Subjt:  SVGELLLNLPRIASLPQFLFNLFDDSDDRAR

XP_004137649.1 uncharacterized protein LOC101216149 [Cucumis sativus]1.2e-27593.86Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEANAFP SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKL +LSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        DQKG NAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES EG+DLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNG MFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQ+ SDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKA +NSLDV   NTYP  ASE+E M+REGL A +GF+SSKHIAMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDR+R
Subjt:  RIASLPQFLFNLFDDSDDRAR

XP_008462951.1 PREDICTED: uncharacterized protein LOC103501212 [Cucumis melo]5.2e-27694.05Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEANAFP  SSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKL TLSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        DQKG NAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGEASRYR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES EGIDLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNG MFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQ+ SDSDEQTIM+RVNSLCCLLQKDSCMAKTLQTKAG+NSLDV   NTYP  ASE+EIM+REGL A +GF+SSKHIAMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDR+R
Subjt:  RIASLPQFLFNLFDDSDDRAR

XP_023524787.1 uncharacterized protein LOC111788618 [Cucurbita pepo subsp. pepo]1.9e-24685.61Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-
        MVQLMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERW SEANAF  SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLAT++K 
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-

Query:  --KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLF
          KDQKGAN  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLF
Subjt:  --KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLF

Query:  FREINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSP
        FREINPQPKKHTLWQATADFTGGEASR+R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NES EGI LK  EGPTFFSLGMVS 
Subjt:  FREINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSP

Query:  SGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQS
        SGTQSPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRL  NKWDQV+VPGIRPSMS+SDFV+HIEHCL        PMFSE+NQQS
Subjt:  SGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQS

Query:  REALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVG
        RE LEGITQYLFGDSQHASD+DEQTIMSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+  E EI N E    RNGFESSKHIAMSRNDSVG
Subjt:  REALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVG

Query:  ELLLNLPRIASLPQFLFNLFDDSDDRAR
        ELLLNLPRIASLP+F FNLFDDSDDRAR
Subjt:  ELLLNLPRIASLPQFLFNLFDDSDDRAR

XP_038904139.1 uncharacterized protein LOC120090500 [Benincasa hispida]3.2e-28194.82Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNS TEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEA AFP SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        D KG+NAFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGE+SRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNG +ES EGIDLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHEC AGASEEYSE+SPSPNSGLEAQ TTEELRND SE SRL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQ KAG+NSLDVVGG+T+PIAASE+EI+NREGL ARNGFESSKH+AMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDRAR
Subjt:  RIASLPQFLFNLFDDSDDRAR

TrEMBL top hitse value%identityAlignment
A0A0A0LCI0 Uncharacterized protein5.6e-27693.86Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEANAFP SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKL +LSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        DQKG NAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES EG+DLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNG MFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQ+ SDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKA +NSLDV   NTYP  ASE+E M+REGL A +GF+SSKHIAMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDR+R
Subjt:  RIASLPQFLFNLFDDSDDRAR

A0A1S3CI35 uncharacterized protein LOC1035012122.5e-27694.05Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEANAFP  SSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKL TLSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        DQKG NAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGEASRYR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES EGIDLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNG MFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQ+ SDSDEQTIM+RVNSLCCLLQKDSCMAKTLQTKAG+NSLDV   NTYP  ASE+EIM+REGL A +GF+SSKHIAMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDR+R
Subjt:  RIASLPQFLFNLFDDSDDRAR

A0A5D3DDG2 Uncharacterized protein2.5e-27694.05Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERW SEANAFP  SSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKL TLSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        DQKG NAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS
        PQPKKHTLWQATADFTGGEASRYR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES EGIDLKEGPTFFSLGMVSPSGTQSPS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI
        SVKEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL  NKWDQVMVPGIRPSMS+SDFVNHIEHCLSQQMTPNG MFSEENQQSREALEGI
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGI

Query:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP
        TQYLFGDSQ+ SDSDEQTIM+RVNSLCCLLQKDSCMAKTLQTKAG+NSLDV   NTYP  ASE+EIM+REGL A +GF+SSKHIAMSRNDSVGELLLNLP
Subjt:  TQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLP

Query:  RIASLPQFLFNLFDDSDDRAR
        RIASLPQFLFNLFDDSDDR+R
Subjt:  RIASLPQFLFNLFDDSDDRAR

A0A6J1DGS5 uncharacterized protein LOC1110209272.1e-24684.15Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK
        MVQLMNSGTEK P TKRLK+EVEDSLEDLLDQFHKRSK  FSSE+  S+AN F   S+P NPLDEPSPLGL+LKKSPSLLDLIQAKLSQETAKLA LSKK
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKK

Query:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN
        D KGA AFS ADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATY EDGLGTLDVVLARQPLFFREIN
Subjt:  DQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREIN

Query:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSPSGTQS
        PQPKKHTLWQATADFTGGEASRYR+HFLQCSQGLLNKHFEKL+RCDPRLNFLSQQPDIVLECPYFKTN  NES EGIDLK  EGPTFFSLGMVSPSG QS
Subjt:  PQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSPSGTQS

Query:  PSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALE
        PSS+KEH+CLAGASEEYSEQSPSPNSG+E   TTEE+RNDGSE+ RL  NKWD+V+VPGIRPSMS+SDFV+HI HCLSQQMTPNG MFSEE QQSR+ALE
Subjt:  PSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALE

Query:  GITQYLFGDSQHASDSD-------EQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDS
        GITQYLFGDSQHA DSD       EQTIM+RVNSLCCLLQKD CMAK L           VGGN  P++A  +EI  +EG  ARNG+ES KHIAMSRNDS
Subjt:  GITQYLFGDSQHASDSD-------EQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDS

Query:  VGELLLNLPRIASLPQFLFNLFDDSDDRAR
        VGELLLNLPRIASLPQFLFNLFDDSDDRAR
Subjt:  VGELLLNLPRIASLPQFLFNLFDDSDDRAR

A0A6J1KGQ9 uncharacterized protein LOC1114930641.5e-24485.01Show/hide
Query:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-
        MVQLMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERW SEANAF  SSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLAT++K 
Subjt:  MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSK-

Query:  -KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFF
         KDQ+GAN  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLFF
Subjt:  -KDQKGAN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFF

Query:  REINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSPS
        REINPQPKKHTLWQATADFTGGEASR+R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NES EGI LK  EGPTFFSLGMVS S
Subjt:  REINPQPKKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLK--EGPTFFSLGMVSPS

Query:  GTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSR
        GTQSPSS+KEHEC AG SEEYSEQSPSPNSG+EA    EEL NDGSESSRL  NKWDQV+VPGIRPSMS+SDFV+HIEHCL        PMFSE+NQQSR
Subjt:  GTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSR

Query:  EALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGE
        E LEGITQYLFGDSQHASD+DEQTIMSRVNSLC LLQKDSCMAK  Q KAG NSL+V  GN   I+  E EI N E     NGFESSKHIAMSRNDSVGE
Subjt:  EALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGE

Query:  LLLNLPRIASLPQFLFNLFDDSDDRAR
        LLLNLPRIASLP+F FNLFDDSDDRAR
Subjt:  LLLNLPRIASLPQFLFNLFDDSDDRAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein4.1e-2941.58Show/hide
Query:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW
        NFP   ++IG W   ++   D+VAK YFAK KL+WE L G        LK KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W
Subjt:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW

Query:  -QATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES-------TEGIDLKEGPT-FFSLGM
         Q   DFTG  AS YRRH L    G+L K+ EKLV  D   + L + P  V E  YF +   N S       T G ++  GP   FS G+
Subjt:  -QATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNES-------TEGIDLKEGPT-FFSLGM

AT2G24100.1 unknown protein3.4e-10045.97Show/hide
Query:  VEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKKDQKGANAFSTADKLKASNFPA
        +EDSLE+     +KRSK       W +       S S ++ L+EPSPLGLSLKKSPS  +LI+ KLSQ +   +   KK+  G     T +KLKASNFPA
Subjt:  VEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKKDQKGANAFSTADKLKASNFPA

Query:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS
         IL+IG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PED  GTL +VLAR+PLFFRE NPQP+KHTLWQAT+DFT G+AS
Subjt:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS

Query:  RYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFF-SLGMVSPSGTQSPSSVKEHECLAGASEEYSEQSP
          R+HFLQC  G++NKHFEKLV+CD RL  LS+QP+I L  P+F +         + + E P+   S  + SP G QS S   EH  L       S  + 
Subjt:  RYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFF-SLGMVSPSGTQSPSSVKEHECLAGASEEYSEQSP

Query:  SPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGITQYLFGDSQHASDSDEQTIM
        SP+S ++A+A    +   G      ++N W Q+ +PG+  S+S++DF+      LS Q   N   F E           + Q L  D+     SDE+++M
Subjt:  SPNSGLEAQATTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGITQYLFGDSQHASDSDEQTIM

Query:  SRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD
        S+VNS C LLQ  +     ++T A +  +  V  N +         M   G    +   SSK +  MSR DS  +LL++LPRI SLP+FLFN+ ++
Subjt:  SRVNSLCCLLQKDSCMAKTLQTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD

AT3G05770.1 unknown protein4.4e-3138.14Show/hide
Query:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKKDQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----
        +DE   L L L K+P L++ I++ L          ++   K +    + +KLKA NFP   +KIG   + ++   D+VAK YFAK KL+WE L G     
Subjt:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKKDQKGANAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----

Query:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLS
           LK+KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W Q   DFTG +AS YRRH L    G+L K+ EKL+  D   + L 
Subjt:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLS

Query:  QQPDIVLECPYF-----KTNGSN--ESTEGIDLKEG
        + P  V E  YF       N SN    T G ++  G
Subjt:  QQPDIVLECPYF-----KTNGSN--ESTEGIDLKEG

AT4G30780.1 unknown protein3.0e-9642.21Show/hide
Query:  EDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETAKLATLS---KKDQKGANAFSTA-
        ED LE+     +KRS+       W    ++   + + YNPLDEPSPLGLSLKKSPSLL+LIQ K++        ET K   L    K++ K   A ++  
Subjt:  EDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETAKLATLS---KKDQKGANAFSTA-

Query:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP
                +KLKASNFPA +LKIG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PEDG GTL +VLARQPLFFRE NPQP
Subjt:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP

Query:  KKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NEST----EGIDLKEGPTFF-SLGMVSP
        +KHTLWQAT+DFT G+AS  R+HFLQC+QG++NKHFEKLV+CD RL  LS+QP+I ++ PYF    S     +ES       ++L  GP+   +  + SP
Subjt:  KKHTLWQATADFTGGEASRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NEST----EGIDLKEGPTFF-SLGMVSP

Query:  SGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWD--QVMVPGIRPSMSISDF------------------VNHIEH
         G QS S   EH  L       S ++PSP+S ++A+A        G   +  S N+ D  Q+  PGI  SMS+SDF                  V+ +  
Subjt:  SGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLSSNKWD--QVMVPGIRPSMSISDF------------------VNHIEH

Query:  CLS--------------------------QQMTPN------GPMFSEENQQSREALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQ
         +S                          Q M+ +             + +  E  E + Q L  D+      DE+++M RVNSL  LL KD  +A   Q
Subjt:  CLS--------------------------QQMTPN------GPMFSEENQQSREALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTLQ

Query:  TKAGSNSLDVVGGNTYPIA-ASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD
             N+   VG  + P    S+    N       +   SSK   M R DS  +LLL+LPRI SLP+FL N+ ++  D
Subjt:  TKAGSNSLDVVGGNTYPIA-ASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAGCTGATGAATTCCGGGACGGAGAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGATTCTTTGGAAGACCTACTCGATCAGTTTCACAAGCGATC
CAAATCCGACTTCTCTTCCGAGAGATGGATATCAGAAGCCAATGCGTTTCCTGACTCATCGAGTCCGTACAATCCTCTAGATGAGCCAAGCCCGTTGGGTTTGAGTCTAA
AAAAGAGTCCGTCCTTGTTGGATTTGATACAAGCAAAACTGTCTCAAGAAACTGCTAAATTAGCGACTTTGAGCAAGAAAGACCAGAAGGGAGCCAATGCTTTTAGTACT
GCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTAATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAGATACGAGGGAGATTTAGTGGCGAAGTGTTATTTTGCAAA
GCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATCGTCGCCCTCAAGGCGACTTATCCTGAGGACGGTCTCGGGA
CTTTGGATGTAGTTCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACTTTATGGCAAGCAACAGCTGACTTTACCGGTGGAGAAGCA
AGCAGATACAGAAGGCATTTCCTGCAGTGTTCACAAGGCTTGTTGAACAAGCATTTTGAGAAGCTTGTACGTTGCGATCCTCGTCTCAACTTTCTAAGCCAACAACCAGA
TATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCACAGAAGGAATTGATTTGAAGGAGGGACCTACTTTCTTTAGTTTAGGTATGGTGTCACCAT
CTGGAACTCAGTCACCTTCCTCAGTTAAAGAACATGAGTGTCTTGCTGGTGCTTCCGAAGAATATTCTGAGCAATCTCCATCACCTAACTCAGGGCTGGAAGCTCAAGCG
ACGACGGAAGAATTGAGAAACGATGGATCCGAAAGTTCAAGACTGTCCTCCAATAAATGGGATCAAGTCATGGTTCCTGGAATTCGGCCATCGATGTCCATAAGTGATTT
TGTCAACCATATTGAACACTGCCTATCACAGCAGATGACACCCAACGGCCCCATGTTTTCTGAAGAAAATCAACAGAGCAGAGAGGCCCTAGAGGGAATTACACAATATC
TTTTTGGTGATTCTCAACATGCATCCGACTCCGATGAGCAGACCATCATGTCCAGGGTGAATTCCCTATGCTGTCTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTG
CAAACCAAAGCTGGTAGCAATAGCCTCGATGTTGTAGGTGGCAACACCTACCCCATTGCTGCATCAGAATTCGAAATCATGAATCGAGAAGGCTTAACCGCGCGCAATGG
CTTTGAATCAAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAATAGCTTCTCTACCTCAGTTTTTGTTCAACTTGTTCG
ATGATTCTGATGATCGTGCTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAGCTGATGAATTCCGGGACGGAGAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGATTCTTTGGAAGACCTACTCGATCAGTTTCACAAGCGATC
CAAATCCGACTTCTCTTCCGAGAGATGGATATCAGAAGCCAATGCGTTTCCTGACTCATCGAGTCCGTACAATCCTCTAGATGAGCCAAGCCCGTTGGGTTTGAGTCTAA
AAAAGAGTCCGTCCTTGTTGGATTTGATACAAGCAAAACTGTCTCAAGAAACTGCTAAATTAGCGACTTTGAGCAAGAAAGACCAGAAGGGAGCCAATGCTTTTAGTACT
GCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTAATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAGATACGAGGGAGATTTAGTGGCGAAGTGTTATTTTGCAAA
GCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATCGTCGCCCTCAAGGCGACTTATCCTGAGGACGGTCTCGGGA
CTTTGGATGTAGTTCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACTTTATGGCAAGCAACAGCTGACTTTACCGGTGGAGAAGCA
AGCAGATACAGAAGGCATTTCCTGCAGTGTTCACAAGGCTTGTTGAACAAGCATTTTGAGAAGCTTGTACGTTGCGATCCTCGTCTCAACTTTCTAAGCCAACAACCAGA
TATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCACAGAAGGAATTGATTTGAAGGAGGGACCTACTTTCTTTAGTTTAGGTATGGTGTCACCAT
CTGGAACTCAGTCACCTTCCTCAGTTAAAGAACATGAGTGTCTTGCTGGTGCTTCCGAAGAATATTCTGAGCAATCTCCATCACCTAACTCAGGGCTGGAAGCTCAAGCG
ACGACGGAAGAATTGAGAAACGATGGATCCGAAAGTTCAAGACTGTCCTCCAATAAATGGGATCAAGTCATGGTTCCTGGAATTCGGCCATCGATGTCCATAAGTGATTT
TGTCAACCATATTGAACACTGCCTATCACAGCAGATGACACCCAACGGCCCCATGTTTTCTGAAGAAAATCAACAGAGCAGAGAGGCCCTAGAGGGAATTACACAATATC
TTTTTGGTGATTCTCAACATGCATCCGACTCCGATGAGCAGACCATCATGTCCAGGGTGAATTCCCTATGCTGTCTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTG
CAAACCAAAGCTGGTAGCAATAGCCTCGATGTTGTAGGTGGCAACACCTACCCCATTGCTGCATCAGAATTCGAAATCATGAATCGAGAAGGCTTAACCGCGCGCAATGG
CTTTGAATCAAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAATAGCTTCTCTACCTCAGTTTTTGTTCAACTTGTTCG
ATGATTCTGATGATCGTGCTAGATAA
Protein sequenceShow/hide protein sequence
MVQLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSDFSSERWISEANAFPDSSSPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETAKLATLSKKDQKGANAFST
ADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEA
SRYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESTEGIDLKEGPTFFSLGMVSPSGTQSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQA
TTEELRNDGSESSRLSSNKWDQVMVPGIRPSMSISDFVNHIEHCLSQQMTPNGPMFSEENQQSREALEGITQYLFGDSQHASDSDEQTIMSRVNSLCCLLQKDSCMAKTL
QTKAGSNSLDVVGGNTYPIAASEFEIMNREGLTARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDDRAR