; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G194150 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G194150
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionUnknown protein
Genome locationCmU531Chr10:25338424..25341336
RNA-Seq ExpressionCmUC10G194150
SyntenyCmUC10G194150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606848.1 hypothetical protein SDJN03_00190, partial [Cucurbita argyrosperma subsp. sororia]4.0e-24784.66Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K   
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---

Query:  ---KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPL
           KDQKG N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPL
Subjt:  ---KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPL

Query:  FFREINPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVS
        FFREINPQPKKHTLWQATADFTGGEAS++R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS
Subjt:  FFREINPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVS

Query:  PSGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQS
         SGT+SPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRL NKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQS
Subjt:  PSGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQS

Query:  REALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVG
        RE LEGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+A E+EIKN+E F  RNGFESSKHIAMSRNDSVG
Subjt:  REALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVG

Query:  ELLLNLPRIASLPQFLFNLFDDSDDRAR
        ELLLNLPRIASLP+F FNLFDDSDDRAR
Subjt:  ELLLNLPRIASLPQFLFNLFDDSDDRAR

XP_004137649.1 uncharacterized protein LOC101216149 [Cucumis sativus]2.4e-27693.24Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFP+SS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL +LSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEG+DLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRLLNKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+MSRVNSLCCLLQKDSCMAKTLQTKA NNSLDV   NTYP  ASEYE  ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

XP_008462951.1 PREDICTED: uncharacterized protein LOC103501212 [Cucumis melo]1.1e-27693.44Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL+NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

XP_023524787.1 uncharacterized protein LOC111788618 [Cucurbita pepo subsp. pepo]4.0e-24784.95Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K   
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---

Query:  KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFR
        KDQKG N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLFFR
Subjt:  KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFR

Query:  EINPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSG
        EINPQPKKHTLWQATADFTGGEAS++R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS SG
Subjt:  EINPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSG

Query:  TRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREA
        T+SPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRL NKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQSRE 
Subjt:  TRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREA

Query:  LEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELL
        LEGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+  E+EIKN+E F  RNGFESSKHIAMSRNDSVGELL
Subjt:  LEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELL

Query:  LNLPRIASLPQFLFNLFDDSDDRAR
        LNLPRIASLP+F FNLFDDSDDRAR
Subjt:  LNLPRIASLPQFLFNLFDDSDDRAR

XP_038904139.1 uncharacterized protein LOC120090500 [Benincasa hispida]3.0e-27993.82Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNS TEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEA AFPVSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLATLSKKD 
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KG NAFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGE+S+YRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNG +ESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHEC AGASEEYSE+SPSPNSGLEAQ TTEELRND SE SRLLNKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNGP+FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQ+ASDSDEQT+MSRVNSLCCLLQKDSCMAKTLQ KAGNNSLDV GG+T+PIAASEYEI N+EG  ARNGFESSKH+AMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDRAR
Subjt:  SLPQFLFNLFDDSDDRAR

TrEMBL top hitse value%identityAlignment
A0A0A0LCI0 Uncharacterized protein1.2e-27693.24Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFP+SS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL +LSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEG+DLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRLLNKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+MSRVNSLCCLLQKDSCMAKTLQTKA NNSLDV   NTYP  ASEYE  ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A1S3CI35 uncharacterized protein LOC1035012125.2e-27793.44Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL+NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A5D3DDG2 Uncharacterized protein5.2e-27793.44Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR HFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL+NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A6J1DGS5 uncharacterized protein LOC1110209271.9e-24784.25Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEK P TKRLK+EVEDSLEDLLDQFHKRSK  FSSE+ TS+AN F V S P NPLDEPSPLGL+LKKSPSLLDLIQAKLSQET KLA LSKKD 
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KG  AFS ADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATY EDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGTRSPS
        PKKHTLWQATADFTGGEAS+YR+HFLQCSQGLLNKHFEKL+RCDPRLNFLSQQPDIVLECPYFKTN  NESKEGIDLK  EGPTFFSLGMVSPSG +SPS
Subjt:  PKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGTRSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGIT
        S+KEH+CLAGASEEYSEQSPSPNSG+E   TTEE+RNDGSE+ RL NKWD+V+VPGIRPSMSVSDFV+HI  CLSQQMTPNG +FSEE QQSR+ALEGIT
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGIT

Query:  QYLFGDSQNASDSD-------EQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGE
        QYLFGDSQ+A DSD       EQT+M+RVNSLCCLLQKD CMAK         +LDV GGN  P++A  YEIK QEGF ARNG+ES KHIAMSRNDSVGE
Subjt:  QYLFGDSQNASDSD-------EQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGE

Query:  LLLNLPRIASLPQFLFNLFDDSDDRAR
        LLLNLPRIASLPQFLFNLFDDSDDRAR
Subjt:  LLLNLPRIASLPQFLFNLFDDSDDRAR

A0A6J1KGQ9 uncharacterized protein LOC1114930643.1e-24584.35Show/hide
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK--K
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K  K
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK--K

Query:  DQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFRE
        DQ+G N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLFFRE
Subjt:  DQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFRE

Query:  INPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGT
        INPQPKKHTLWQATADFTGGEAS++R+HFLQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS SGT
Subjt:  INPQPKKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGT

Query:  RSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREAL
        +SPSS+KEHEC AG SEEYSEQSPSPNSG+EA    EEL NDGSESSRL NKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQSRE L
Subjt:  RSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREAL

Query:  EGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLL
        EGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V  GN   I+  E+EIKN+E F   NGFESSKHIAMSRNDSVGELLL
Subjt:  EGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLL

Query:  NLPRIASLPQFLFNLFDDSDDRAR
        NLPRIASLP+F FNLFDDSDDRAR
Subjt:  NLPRIASLPQFLFNLFDDSDDRAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein9.4e-2941.05Show/hide
Query:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW
        NFP   ++IG W   ++   D+VAK YFAK KL+WE L G        LK KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W
Subjt:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW

Query:  -QATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKE-------GIDLKEGPT-FFSLGM
         Q   DFTG  AS YRRH L    G+L K+ EKLV  D   + L + P  V E  YF +   N S         G ++  GP   FS G+
Subjt:  -QATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKE-------GIDLKEGPT-FFSLGM

AT2G24100.1 unknown protein1.6e-10045.56Show/hide
Query:  VEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPA
        +EDSLE+     +KRSK       W++       S + ++ L+EPSPLGLSLKKSPS  +LI+ KLSQ      ++ KK+  G     T +KLKASNFPA
Subjt:  VEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPA

Query:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS
         IL+IG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PED  GTL +VLAR+PLFFRE NPQP+KHTLWQAT+DFT G+AS
Subjt:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS

Query:  KYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFF-SLGMVSPSGTRSPSSVKEHECLAGASEEYSEQSP
          R+HFLQC  G++NKHFEKLV+CD RL  LS+QP+I L  P+F +         + + E P+   S  + SP G +S S   EH  L       S  + 
Subjt:  KYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFF-SLGMVSPSGTRSPSSVKEHECLAGASEEYSEQSP

Query:  SPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMS
        SP+S ++A+A        GS  SR  N W Q+ +PG+  S+S++DF+  +               S++  ++ +  E + Q L  D+     SDE++VMS
Subjt:  SPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMS

Query:  RVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQE-GFYARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD
        +VNS C LLQ            A N+ L++   +T  +   +      E G    +   SSK +  MSR DS  +LL++LPRI SLP+FLFN+ ++
Subjt:  RVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQE-GFYARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD

AT3G05770.1 unknown protein4.5e-3138.64Show/hide
Query:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----
        +DE   L L L K+P L++ I++ L          ++   K      + +KLKA NFP   +KIG   + ++   D+VAK YFAK KL+WE L G     
Subjt:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----

Query:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLS
           LK+KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W Q   DFTG +AS YRRH L    G+L K+ EKL+  D   + L 
Subjt:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLS

Query:  QQPDIVLECPYFKTNGSNES
        + P  V E  YF     N +
Subjt:  QQPDIVLECPYFKTNGSNES

AT4G30780.1 unknown protein6.0e-10042.33Show/hide
Query:  EDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETVKLATLSKKDQKGGNAFSTA----
        ED LE+     +KRS+       W+   ++  ++   YNPLDEPSPLGLSLKKSPSLL+LIQ K++        ET+K   L    ++     + A    
Subjt:  EDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETVKLATLSKKDQKGGNAFSTA----

Query:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP
                +KLKASNFPA +LKIG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PEDG GTL +VLARQPLFFRE NPQP
Subjt:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP

Query:  KKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NESK----EGIDLKEGPTFF-SLGMVSP
        +KHTLWQAT+DFT G+AS  R+HFLQC+QG++NKHFEKLV+CD RL  LS+QP+I ++ PYF    S     +ESK      ++L  GP+   +  + SP
Subjt:  KKHTLWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NESK----EGIDLKEGPTFF-SLGMVSP

Query:  SGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDF------------------VNHIEQCLS
         G +S S   EH  L       S ++PSP+S ++A+A  E +    + +SR      Q+  PGI  SMS+SDF                  V+ + Q +S
Subjt:  SGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDF------------------VNHIEQCLS

Query:  --------------------------QQMTPNG--PLFSE----ENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKA
                                  Q M+ +    L S+     + +  E  E + Q L  D+      DE+++M RVNSL  LL KD  +A   Q   
Subjt:  --------------------------QQMTPNG--PLFSE----ENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKA

Query:  GNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD
         N  + VG  +      S+    N       +   SSK   M R DS  +LLL+LPRI SLP+FL N+ ++  D
Subjt:  GNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTGGGGAGTCCGAATGGACTGGAATGGAGGCTGACGTGGAAGCTGATGAATTCAGGGACGGAAAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGA
TTCTTTGGAGGACCTACTTGATCAGTTTCACAAGCGATCCAAATCCGAGTTCTCTTCCGAGAGATGGACATCAGAAGCCAACGCGTTTCCTGTATCATCCACTCCGTACA
ATCCTTTAGATGAGCCGAGTCCGTTGGGTTTGAGCCTCAAAAAGAGTCCGTCCTTATTGGATTTGATTCAAGCAAAACTCTCTCAAGAAACCGTTAAATTAGCGACTTTG
AGCAAAAAAGACCAGAAGGGAGGCAATGCTTTTAGTACTGCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTGATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAG
ATATGAGGGAGATTTAGTGGCGAAGTGTTACTTTGCAAAGCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATAG
TCGCCCTGAAGGCGACTTATCCCGAGGATGGACTCGGGACTTTGGATGTAGTGCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACT
TTATGGCAAGCAACAGCTGACTTTACAGGTGGAGAAGCAAGCAAATACAGAAGGCATTTCCTGCAGTGTTCACAAGGCTTGTTAAACAAGCATTTTGAGAAGCTTGTACG
TTGCGATCCACGTCTCAACTTTCTAAGCCAACAACCAGATATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCAAAGAAGGAATTGATTTGAAGG
AGGGGCCTACTTTCTTTAGTTTAGGTATGGTGTCACCATCTGGAACTCGATCACCATCCTCTGTTAAAGAACATGAGTGTCTTGCTGGGGCTTCTGAAGAATATTCCGAG
CAATCTCCATCACCTAACTCAGGGCTAGAAGCTCAAGCGACGACGGAAGAATTGAGGAACGATGGATCTGAAAGTTCAAGACTGTTGAATAAATGGGATCAAGTCATGGT
TCCTGGAATTCGGCCATCAATGTCCGTAAGTGATTTTGTCAACCATATTGAACAATGCCTATCACAGCAGATGACACCCAACGGCCCTCTGTTTTCTGAAGAGAACCAAC
AAAGCAGAGAGGCTCTAGAGGGAATTACACAGTATCTTTTTGGTGATTCTCAAAATGCATCCGACTCCGATGAACAAACCGTCATGTCCAGGGTGAATTCGCTATGCTGT
CTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTACAAACCAAAGCTGGCAATAATAGCCTCGACGTTGGAGGTGGCAACACCTACCCTATCGCTGCATCAGAATACGA
AATCAAGAATCAAGAAGGCTTCTATGCACGCAATGGCTTTGAATCGAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAA
TAGCTTCTCTCCCTCAGTTTTTGTTCAACTTGTTCGATGATTCTGATGATCGTGCTCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTGGGGAGTCCGAATGGACTGGAATGGAGGCTGACGTGGAAGCTGATGAATTCAGGGACGGAAAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGA
TTCTTTGGAGGACCTACTTGATCAGTTTCACAAGCGATCCAAATCCGAGTTCTCTTCCGAGAGATGGACATCAGAAGCCAACGCGTTTCCTGTATCATCCACTCCGTACA
ATCCTTTAGATGAGCCGAGTCCGTTGGGTTTGAGCCTCAAAAAGAGTCCGTCCTTATTGGATTTGATTCAAGCAAAACTCTCTCAAGAAACCGTTAAATTAGCGACTTTG
AGCAAAAAAGACCAGAAGGGAGGCAATGCTTTTAGTACTGCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTGATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAG
ATATGAGGGAGATTTAGTGGCGAAGTGTTACTTTGCAAAGCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATAG
TCGCCCTGAAGGCGACTTATCCCGAGGATGGACTCGGGACTTTGGATGTAGTGCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACT
TTATGGCAAGCAACAGCTGACTTTACAGGTGGAGAAGCAAGCAAATACAGAAGGCATTTCCTGCAGTGTTCACAAGGCTTGTTAAACAAGCATTTTGAGAAGCTTGTACG
TTGCGATCCACGTCTCAACTTTCTAAGCCAACAACCAGATATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCAAAGAAGGAATTGATTTGAAGG
AGGGGCCTACTTTCTTTAGTTTAGGTATGGTGTCACCATCTGGAACTCGATCACCATCCTCTGTTAAAGAACATGAGTGTCTTGCTGGGGCTTCTGAAGAATATTCCGAG
CAATCTCCATCACCTAACTCAGGGCTAGAAGCTCAAGCGACGACGGAAGAATTGAGGAACGATGGATCTGAAAGTTCAAGACTGTTGAATAAATGGGATCAAGTCATGGT
TCCTGGAATTCGGCCATCAATGTCCGTAAGTGATTTTGTCAACCATATTGAACAATGCCTATCACAGCAGATGACACCCAACGGCCCTCTGTTTTCTGAAGAGAACCAAC
AAAGCAGAGAGGCTCTAGAGGGAATTACACAGTATCTTTTTGGTGATTCTCAAAATGCATCCGACTCCGATGAACAAACCGTCATGTCCAGGGTGAATTCGCTATGCTGT
CTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTACAAACCAAAGCTGGCAATAATAGCCTCGACGTTGGAGGTGGCAACACCTACCCTATCGCTGCATCAGAATACGA
AATCAAGAATCAAGAAGGCTTCTATGCACGCAATGGCTTTGAATCGAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAA
TAGCTTCTCTCCCTCAGTTTTTGTTCAACTTGTTCGATGATTCTGATGATCGTGCTCGATAA
Protein sequenceShow/hide protein sequence
MGVGSPNGLEWRLTWKLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATL
SKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHT
LWQATADFTGGEASKYRRHFLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSVKEHECLAGASEEYSE
QSPSPNSGLEAQATTEELRNDGSESSRLLNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCC
LLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDDRAR