CuGenDBv2

CaUC10G191900 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC10G191900
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Unknown protein
Genome location: Ciama_Chr10:26398175..26401111
RNA-Seq Expression: CaUC10G191900
Synteny: CaUC10G191900
Gene Ontology terms: NA
InterPro domains: NA
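
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the locus span could be computed as below. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Ciama_Chr10:26398175..26401111")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # Ciama_Chr10 26398175 26401111 2937
```

Under that assumption the locus covers 2937 positions on Ciama_Chr10.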


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606848.1 hypothetical protein SDJN03_00190, partial [Cucurbita argyrosperma subsp. sororia]    e value: 1.8e-247    % identity: 84.66
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K   
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---

Query:  ---KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPL
           KDQKG N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPL
Subjt:  ---KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPL

Query:  FFREINPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVS
        FFREINPQPKKHTLWQATADFTGGEAS++R+H+LQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS
Subjt:  FFREINPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVS

Query:  PSGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQS
         SGT+SPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRLFNKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQS
Subjt:  PSGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQS

Query:  REALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVG
        RE LEGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+A E+EIKN+E F  RNGFESSKHIAMSRNDSVG
Subjt:  REALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVG

Query:  ELLLNLPRIASLPQFLFNLFDDSDDRAR
        ELLLNLPRIASLP+F FNLFDDSDDRAR
Subjt:  ELLLNLPRIASLPQFLFNLFDDSDDRAR

XP_004137649.1 uncharacterized protein LOC101216149 [Cucumis sativus]    e value: 1.6e-275    % identity: 92.86
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFP+SS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL +LSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YRRH+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEG+DLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+MSRVNSLCCLLQKDSCMAKTLQTKA NNSLDV   NTYP  ASEYE  ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

XP_008462951.1 PREDICTED: uncharacterized protein LOC103501212 [Cucumis melo]    e value: 4.1e-276    % identity: 93.24
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR H+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

XP_023524787.1 uncharacterized protein LOC111788618 [Cucurbita pepo subsp. pepo]    e value: 1.8e-247    % identity: 84.95
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K   
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK---

Query:  KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFR
        KDQKG N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLFFR
Subjt:  KDQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFR

Query:  EINPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSG
        EINPQPKKHTLWQATADFTGGEAS++R+H+LQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS SG
Subjt:  EINPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSG

Query:  TRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREA
        T+SPSS+KEHEC AGASEEYSEQSPSPNSG+EA    EEL NDGSESSRLFNKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQSRE 
Subjt:  TRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREA

Query:  LEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELL
        LEGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V GGN   I+  E+EIKN+E F  RNGFESSKHIAMSRNDSVGELL
Subjt:  LEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELL

Query:  LNLPRIASLPQFLFNLFDDSDDRAR
        LNLPRIASLP+F FNLFDDSDDRAR
Subjt:  LNLPRIASLPQFLFNLFDDSDDRAR

XP_038904139.1 uncharacterized protein LOC120090500 [Benincasa hispida]    e value: 2.0e-278    % identity: 93.44
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNS TEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEA AFPVSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLATLSKKD 
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KG NAFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGE+S+YRRH+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNG +ESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHEC AGASEEYSE+SPSPNSGLEAQ TTEELRND SE SRL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNGP+FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQ+ASDSDEQT+MSRVNSLCCLLQKDSCMAKTLQ KAGNNSLDV GG+T+PIAASEYEI N+EG  ARNGFESSKH+AMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDRAR
Subjt:  SLPQFLFNLFDDSDDRAR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCI0 Uncharacterized protein    e value: 7.5e-276    % identity: 92.86
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPT KRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFP+SS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL +LSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YRRH+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEG+DLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ESSRL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+MSRVNSLCCLLQKDSCMAKTLQTKA NNSLDV   NTYP  ASEYE  ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A1S3CI35 uncharacterized protein LOC103501212    e value: 2.0e-276    % identity: 93.24
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR H+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A5D3DDG2 Uncharacterized protein    e value: 2.0e-276    % identity: 93.24
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKS+FSSERWTSEANAFPV S+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KL TLSKKDQ
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KGGNAF+TADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV
        PKKHTLWQATADFTGGEAS+YR H+LQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGT+SPSSV
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSV

Query:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY
        KEHECLAGASEEYSEQSPSPNSGLEAQA TEELRNDG ES RL NKWDQVMVPGIRPSMSVSDFVNHIE CLSQQMTPNG +FSEENQQSREALEGITQY
Subjt:  KEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQY

Query:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA
        LFGDSQN SDSDEQT+M+RVNSLCCLLQKDSCMAKTLQTKAGNNSLDV   NTYP  ASEYEI ++EG  A +GF+SSKHIAMSRNDSVGELLLNLPRIA
Subjt:  LFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIA

Query:  SLPQFLFNLFDDSDDRAR
        SLPQFLFNLFDDSDDR+R
Subjt:  SLPQFLFNLFDDSDDRAR

A0A6J1DGS5 uncharacterized protein LOC111020927    e value: 8.7e-248    % identity: 84.25
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ
        +LMNSGTEK P TKRLK+EVEDSLEDLLDQFHKRSK  FSSE+ TS+AN F V S P NPLDEPSPLGL+LKKSPSLLDLIQAKLSQET KLA LSKKD 
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQ

Query:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ
        KG  AFS ADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATY EDGLGTLDVVLARQPLFFREINPQ
Subjt:  KGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQ

Query:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGTRSPS
        PKKHTLWQATADFTGGEAS+YR+H+LQCSQGLLNKHFEKL+RCDPRLNFLSQQPDIVLECPYFKTN  NESKEGIDLK  EGPTFFSLGMVSPSG +SPS
Subjt:  PKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGTRSPS

Query:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGIT
        S+KEH+CLAGASEEYSEQSPSPNSG+E   TTEE+RNDGSE+ RLFNKWD+V+VPGIRPSMSVSDFV+HI  CLSQQMTPNG +FSEE QQSR+ALEGIT
Subjt:  SVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGIT

Query:  QYLFGDSQNASDSD-------EQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGE
        QYLFGDSQ+A DSD       EQT+M+RVNSLCCLLQKD CMAK         +LDV GGN  P++A  YEIK QEGF ARNG+ES KHIAMSRNDSVGE
Subjt:  QYLFGDSQNASDSD-------EQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGE

Query:  LLLNLPRIASLPQFLFNLFDDSDDRAR
        LLLNLPRIASLPQFLFNLFDDSDDRAR
Subjt:  LLLNLPRIASLPQFLFNLFDDSDDRAR

A0A6J1KGQ9 uncharacterized protein LOC111493064    e value: 1.4e-245    % identity: 84.35
Query:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK--K
        +LMNSGT+KIP TKRLKKEVEDSLEDLLDQFHKRSK    SERWTSEANAF VSS+PYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQET KLAT++K  K
Subjt:  KLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSK--K

Query:  DQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFRE
        DQ+G N  AFSTADKLKASNFPALILKIG+WEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKA Y EDGLGTLDVVLARQPLFFRE
Subjt:  DQKGGN--AFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFRE

Query:  INPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGT
        INPQPKKHTLWQATADFTGGEAS++R+H+LQCSQG LNKHFEKLVRCDPRLNFLSQQP+IVLECPYFKTN  NESKEGI LK  EGPTFFSLGMVS SGT
Subjt:  INPQPKKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLK--EGPTFFSLGMVSPSGT

Query:  RSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREAL
        +SPSS+KEHEC AG SEEYSEQSPSPNSG+EA    EEL NDGSESSRLFNKWDQV+VPGIRPSMSVSDFV+HIE CL        P+FSE+NQQSRE L
Subjt:  RSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREAL

Query:  EGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLL
        EGITQYLFGDSQ+ASD+DEQT+MSRVNSLC LLQKDSCMAK  Q KAG NSL+V  GN   I+  E+EIKN+E F   NGFESSKHIAMSRNDSVGELLL
Subjt:  EGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLL

Query:  NLPRIASLPQFLFNLFDDSDDRAR
        NLPRIASLP+F FNLFDDSDDRAR
Subjt:  NLPRIASLPQFLFNLFDDSDDRAR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G54300.1 unknown protein    e value: 9.4e-29    % identity: 41.05
Query:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW
        NFP   ++IG W   ++   D+VAK YFAK KL+WE L G        LK KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W
Subjt:  NFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-------NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW

Query:  -QATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKE-------GIDLKEGPT-FFSLGM
         Q   DFTG  AS YRRH L    G+L K+ EKLV  D   + L + P  V E  YF +   N S         G ++  GP   FS G+
Subjt:  -QATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKE-------GIDLKEGPT-FFSLGM

AT2G24100.1 unknown protein    e value: 4.6e-100    % identity: 45.36
Query:  VEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPA
        +EDSLE+     +KRSK       W++       S + ++ L+EPSPLGLSLKKSPS  +LI+ KLSQ      ++ KK+  G     T +KLKASNFPA
Subjt:  VEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPA

Query:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS
         IL+IG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PED  GTL +VLAR+PLFFRE NPQP+KHTLWQAT+DFT G+AS
Subjt:  LILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHTLWQATADFTGGEAS

Query:  KYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFF-SLGMVSPSGTRSPSSVKEHECLAGASEEYSEQSP
          R+H+LQC  G++NKHFEKLV+CD RL  LS+QP+I L  P+F +         + + E P+   S  + SP G +S S   EH  L       S  + 
Subjt:  KYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFF-SLGMVSPSGTRSPSSVKEHECLAGASEEYSEQSP

Query:  SPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMS
        SP+S ++A+A        GS  SR  N W Q+ +PG+  S+S++DF+  +               S++  ++ +  E + Q L  D+     SDE++VMS
Subjt:  SPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMS

Query:  RVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQE-GFYARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD
        +VNS C LLQ            A N+ L++   +T  +   +      E G    +   SSK +  MSR DS  +LL++LPRI SLP+FLFN+ ++
Subjt:  RVNSLCCLLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQE-GFYARNGFESSKHI-AMSRNDSVGELLLNLPRIASLPQFLFNLFDD

AT3G05770.1 unknown protein    e value: 5.9e-31    % identity: 38.64
Query:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----
        +DE   L L L K+P L++ I++ L          ++   K      + +KLKA NFP   +KIG   + ++   D+VAK YFAK KL+WE L G     
Subjt:  LDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATLSKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDG-----

Query:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLS
           LK+KIEIQW+D+ + + +    D  G L + L ++P FF E NPQ  KHT W Q   DFTG +AS YRRH L    G+L K+ EKL+  D   + L 
Subjt:  --NLKNKIEIQWSDIVALKATY-PEDGLGTLDVVLARQPLFFREINPQPKKHTLW-QATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLS

Query:  QQPDIVLECPYFKTNGSNES
        + P  V E  YF     N +
Subjt:  QQPDIVLECPYFKTNGSNES

AT4G30780.1 unknown protein    e value: 1.3e-99    % identity: 42.16
Query:  EDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETVKLATLSKKDQKGGNAFSTA----
        ED LE+     +KRS+       W+   ++  ++   YNPLDEPSPLGLSLKKSPSLL+LIQ K++        ET+K   L    ++     + A    
Subjt:  EDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQ-------ETVKLATLSKKDQKGGNAFSTA----

Query:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP
                +KLKASNFPA +LKIG WEYKSRYEGDLVAKCYFAKHKLVWE+L+  LK+KIEIQWSDI+ALKA  PEDG GTL +VLARQPLFFRE NPQP
Subjt:  --------DKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQP

Query:  KKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NESK----EGIDLKEGPTFF-SLGMVSP
        +KHTLWQAT+DFT G+AS  R+H+LQC+QG++NKHFEKLV+CD RL  LS+QP+I ++ PYF    S     +ESK      ++L  GP+   +  + SP
Subjt:  KKHTLWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGS-----NESK----EGIDLKEGPTFF-SLGMVSP

Query:  SGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDF------------------VNHIEQCLS
         G +S S   EH  L       S ++PSP+S ++A+A  E +    + +SR      Q+  PGI  SMS+SDF                  V+ + Q +S
Subjt:  SGTRSPSSVKEHECLAGASEEYSEQSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDF------------------VNHIEQCLS

Query:  --------------------------QQMTPNG--PLFSE----ENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKA
                                  Q M+ +    L S+     + +  E  E + Q L  D+      DE+++M RVNSL  LL KD  +A   Q   
Subjt:  --------------------------QQMTPNG--PLFSE----ENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCCLLQKDSCMAKTLQTKA

Query:  GNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD
         N  + VG  +      S+    N       +   SSK   M R DS  +LLL+LPRI SLP+FL N+ ++  D
Subjt:  GNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDD
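
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the identity of a single block could be tallied as below. The page does not state how its % identity column is computed, so this per-block figure is only illustrative, and the function name `block_identity` is hypothetical.

```python
def block_identity(query: str, midline: str) -> float:
    """Percent of columns in one alignment block whose midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)


# Final block of the first GenBank hit (KAG6606848.1), copied from the alignment.
query = "ELLLNLPRIASLPQFLFNLFDDSDDRAR"
midline = "ELLLNLPRIASLP+F FNLFDDSDDRAR"
print(round(block_identity(query, midline), 2))  # 92.86
```

This covers one 28-column block only (26 identical columns); a whole-alignment figure would sum identical columns over every block, including gap columns, before dividing.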


Sequences
CDS sequence
ATGGGAGTGGGGAGTCCGAATGGACTGGAATGGAGGCTGACGTGGAAGCTGATGAATTCAGGGACGGAGAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGA
TTCTTTGGAGGACCTACTTGATCAGTTTCACAAGCGATCCAAATCCGAGTTCTCTTCCGAGAGATGGACATCAGAAGCCAATGCGTTTCCTGTATCATCCACTCCGTACA
ATCCTTTAGATGAGCCGAGTCCGTTGGGTTTGAGCCTCAAAAAGAGTCCGTCCTTATTGGATTTGATTCAAGCAAAACTCTCTCAAGAAACCGTTAAATTAGCGACTTTG
AGCAAAAAGGACCAGAAGGGAGGCAATGCTTTTAGTACTGCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTGATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAG
ATATGAGGGAGATTTAGTGGCGAAGTGTTACTTTGCAAAGCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATAG
TCGCCCTGAAGGCGACTTATCCCGAGGATGGACTCGGGACTTTGGATGTAGTGCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACT
TTATGGCAAGCAACAGCTGACTTTACAGGTGGAGAAGCAAGCAAATATAGAAGGCATTACCTGCAGTGTTCACAAGGCTTGTTAAACAAGCATTTTGAGAAGCTTGTACG
TTGCGATCCACGTCTCAACTTTCTAAGCCAACAACCAGATATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCAAAGAAGGAATTGATTTGAAGG
AGGGGCCTACTTTCTTTAGTTTAGGTATGGTGTCACCATCTGGAACTCGATCACCTTCCTCTGTTAAAGAACATGAGTGTCTTGCTGGGGCTTCCGAAGAATATTCCGAG
CAATCTCCATCACCTAACTCAGGGCTAGAAGCTCAAGCGACGACTGAAGAATTGAGGAACGATGGATCTGAAAGTTCAAGACTGTTCAATAAATGGGATCAAGTCATGGT
TCCTGGAATTCGGCCATCAATGTCCGTAAGTGATTTTGTCAACCATATTGAACAATGCCTATCACAGCAGATGACACCCAACGGCCCTCTGTTTTCTGAAGAAAACCAAC
AAAGCAGAGAGGCTCTAGAGGGAATTACACAGTATCTTTTTGGTGATTCTCAAAATGCATCTGACTCCGATGAACAAACCGTCATGTCCAGGGTGAATTCACTATGCTGT
CTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTACAAACCAAAGCTGGCAATAATAGCCTCGACGTTGGAGGTGGCAACACCTACCCTATCGCTGCATCAGAATACGA
AATCAAGAATCAAGAAGGCTTCTATGCACGCAATGGCTTTGAATCGAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAA
TAGCTTCTCTCCCTCAGTTTTTGTTCAACTTGTTCGATGATTCTGATGATCGTGCTCGATAA
mRNA sequence
ATGGGAGTGGGGAGTCCGAATGGACTGGAATGGAGGCTGACGTGGAAGCTGATGAATTCAGGGACGGAGAAGATTCCGACGACGAAGAGGCTGAAGAAAGAGGTTGAGGA
TTCTTTGGAGGACCTACTTGATCAGTTTCACAAGCGATCCAAATCCGAGTTCTCTTCCGAGAGATGGACATCAGAAGCCAATGCGTTTCCTGTATCATCCACTCCGTACA
ATCCTTTAGATGAGCCGAGTCCGTTGGGTTTGAGCCTCAAAAAGAGTCCGTCCTTATTGGATTTGATTCAAGCAAAACTCTCTCAAGAAACCGTTAAATTAGCGACTTTG
AGCAAAAAGGACCAGAAGGGAGGCAATGCTTTTAGTACTGCAGACAAACTCAAGGCTTCTAACTTCCCAGCGTTGATTCTTAAGATTGGTACTTGGGAGTACAAGTCAAG
ATATGAGGGAGATTTAGTGGCGAAGTGTTACTTTGCAAAGCATAAGTTGGTTTGGGAACTTCTAGATGGGAATCTTAAGAACAAGATAGAAATTCAATGGTCGGATATAG
TCGCCCTGAAGGCGACTTATCCCGAGGATGGACTCGGGACTTTGGATGTAGTGCTGGCGAGACAGCCCCTTTTCTTTAGGGAAATAAATCCACAGCCTAAGAAGCACACT
TTATGGCAAGCAACAGCTGACTTTACAGGTGGAGAAGCAAGCAAATATAGAAGGCATTACCTGCAGTGTTCACAAGGCTTGTTAAACAAGCATTTTGAGAAGCTTGTACG
TTGCGATCCACGTCTCAACTTTCTAAGCCAACAACCAGATATTGTGTTGGAATGTCCATATTTTAAAACCAATGGTTCAAATGAATCCAAAGAAGGAATTGATTTGAAGG
AGGGGCCTACTTTCTTTAGTTTAGGTATGGTGTCACCATCTGGAACTCGATCACCTTCCTCTGTTAAAGAACATGAGTGTCTTGCTGGGGCTTCCGAAGAATATTCCGAG
CAATCTCCATCACCTAACTCAGGGCTAGAAGCTCAAGCGACGACTGAAGAATTGAGGAACGATGGATCTGAAAGTTCAAGACTGTTCAATAAATGGGATCAAGTCATGGT
TCCTGGAATTCGGCCATCAATGTCCGTAAGTGATTTTGTCAACCATATTGAACAATGCCTATCACAGCAGATGACACCCAACGGCCCTCTGTTTTCTGAAGAAAACCAAC
AAAGCAGAGAGGCTCTAGAGGGAATTACACAGTATCTTTTTGGTGATTCTCAAAATGCATCTGACTCCGATGAACAAACCGTCATGTCCAGGGTGAATTCACTATGCTGT
CTTCTGCAGAAGGATTCTTGTATGGCTAAGACCTTACAAACCAAAGCTGGCAATAATAGCCTCGACGTTGGAGGTGGCAACACCTACCCTATCGCTGCATCAGAATACGA
AATCAAGAATCAAGAAGGCTTCTATGCACGCAATGGCTTTGAATCGAGCAAGCACATAGCAATGTCGAGGAACGATTCAGTTGGTGAGCTGCTGCTTAATCTTCCAAGAA
TAGCTTCTCTCCCTCAGTTTTTGTTCAACTTGTTCGATGATTCTGATGATCGTGCTCGATAA
Protein sequence
MGVGSPNGLEWRLTWKLMNSGTEKIPTTKRLKKEVEDSLEDLLDQFHKRSKSEFSSERWTSEANAFPVSSTPYNPLDEPSPLGLSLKKSPSLLDLIQAKLSQETVKLATL
SKKDQKGGNAFSTADKLKASNFPALILKIGTWEYKSRYEGDLVAKCYFAKHKLVWELLDGNLKNKIEIQWSDIVALKATYPEDGLGTLDVVLARQPLFFREINPQPKKHT
LWQATADFTGGEASKYRRHYLQCSQGLLNKHFEKLVRCDPRLNFLSQQPDIVLECPYFKTNGSNESKEGIDLKEGPTFFSLGMVSPSGTRSPSSVKEHECLAGASEEYSE
QSPSPNSGLEAQATTEELRNDGSESSRLFNKWDQVMVPGIRPSMSVSDFVNHIEQCLSQQMTPNGPLFSEENQQSREALEGITQYLFGDSQNASDSDEQTVMSRVNSLCC
LLQKDSCMAKTLQTKAGNNSLDVGGGNTYPIAASEYEIKNQEGFYARNGFESSKHIAMSRNDSVGELLLNLPRIASLPQFLFNLFDDSDDRAR
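
The CDS and protein sequences listed above should correspond under the standard genetic code. As a minimal sketch, assuming the standard table (NCBI translation table 1), the first and last codons of the CDS can be translated and compared with the ends of the protein sequence. The function name `translate` is hypothetical, and unknown codons are reported as `X` by choice.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (TTT, TTC, ..., GGG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for any unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGGGAGTGGGGAGTCCGAATGGA"))  # MGVGSPNG
print(translate("GATCGTGCTCGATAA"))           # DRAR
```

Both fragments agree with the protein sequence, which starts with MGVGSPNG and ends with DRAR; passing the full CDS to `translate` would allow the same check over the whole record.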