; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G224880 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G224880
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUnknown protein
Genome locationCicolChr11:29237228..29239896
RNA-Seq ExpressionCcUC11G224880
SyntenyCcUC11G224880
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]7.9e-23874.19Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPK  HNH STVVAAAA NRNPSNKRP DPK+R NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP IEPV
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA  PVSSEERAN AAL+LQYKGS+ACRGFFARNADSGSDEE EEEEA   +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K  K+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN
        VV +D  +QKLEEE+ AE PTSN+KD+ SG+NDDACK NDVKLQAEN  NS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSDNDVSENN
Subjt:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN

Query:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV
        LI+G+ VEE +EFKFFLKLF ENESLRRYYE+ YDDGEFFCLAC GAGKKML SFKTCGRL+QH TSLGK+K+ K+PVQKPHIAKMLKMK++AHRA S V
Subjt:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LG SLTK+D +K  DESVGN++DNT E                    +D  VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------G

Query:  VKETDSMKVDGNGEVT
        VKETDSMKVD NGE T
Subjt:  VKETDSMKVDGNGEVT

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]2.5e-23974.43Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPK  HNH STVVAAAA NRNPSNKRP DPK+R NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP IEPV
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA  PVSSEERAN AAL+LQYKGS+ACRGFFARNADSGSDEE EEEEA   +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K  K+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN
        VV +D  +QKLEEE+ AE PTSN+KD+ SG+NDDACK NDVKLQAEN  NS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSDNDVSENN
Subjt:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN

Query:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV
        LI+G+ VEE +EFKFFLKLF ENESLRRYYE+ YDDGEFFCLAC GAGKKML SFKTCGRL+QH TSLGK+K+ K+PVQKPHIAKMLKMK++AHRA S V
Subjt:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSKDESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVK
        ICKVLGWDIEKLPAVVLKG+ LG SLTK+D +KDESVGN++DNT E                    +D  VEDD                       GVK
Subjt:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSKDESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVK

Query:  ETDSMKVDGNGEVT
        ETDSMKVD NGE T
Subjt:  ETDSMKVDGNGEVT

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]2.8e-23573.86Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPK  HNH STVVAAAA NRNPSNKRP DPK+R NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP IEPV
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA  PVSSEERAN AAL+LQYKGS+ACRGFFARNADSGSDEE EEEEA   +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K  K+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LK  PEENHVAKEH             DDI+KKNE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN
        VV +D  +QKLEEE+ AE PTSN+KD+ SG+NDDACK NDVKLQAEN  NS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSDNDVSENN
Subjt:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDNDVSENN

Query:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV
        LI+G+ VEE +EFKFFLKLF ENESLRRYYE+ YDDGEFFCLAC GAGKKML SFKTCGRL+QH TSLGK+K+ K+PVQKPHIAKMLKMK++AHRA S V
Subjt:  LIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LG SLTK+D +K  DESVGN++DNT E                    +D  VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------G

Query:  VKETDSMKVDGNGEVT
        VKETDSMKVD NGE T
Subjt:  VKETDSMKVDGNGEVT

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]1.1e-23974.8Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPK  HNH STVVAAAA NRNPSNKRP DPK+R NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP IEPV
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA  PVSSEERAN AAL+LQYKGS+ACRGFFARNADSGSDEE EEEEA   +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K  K+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGD
        VV +D  +QKLEEE+ AE PTSN+KD+ SG+NDDACK NDVKLQAEN  NS+ GM ESN EMDNLPV ESILKACKEF AAFFTSMSDNDVSENNLI+G+
Subjt:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGD

Query:  RVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYE+ YDDGEFFCLAC GAGKKML SFKTCGRL+QH TSLGK+K+ K+PVQKPHIAKMLKMK++AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LG SLTK+D +K  DESVGN++DNT E                    +D  VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVKETD

Query:  SMKVDGNGEVT
        SMKVD NGE T
Subjt:  SMKVDGNGEVT

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]8.2e-21970.21Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPK  HNH STVVAAAA NRNPSNKRP DPK+R NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP IEPV
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA  PVSSEERAN AAL+LQYKGS+ACRGFFARNADSGSDEE EEEEA   +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K  K+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGD
        VV +D  +QKLEEE+ AE PTSN+KD+ SG+                                   V ESILKACKEF AAFFTSMSDNDVSENNLI+G+
Subjt:  VVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGD

Query:  RVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYE+ YDDGEFFCLAC GAGKKML SFKTCGRL+QH TSLGK+K+ K+PVQKPHIAKMLKMK++AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LG SLTK+D +K  DESVGN++DNT E                    +D  VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGCSLTKSDVSK--DESVGNAIDNTTE--------------------VDVPVEDDY----------------------GVKETD

Query:  SMKVDGNGEVT
        SMKVD NGE T
Subjt:  SMKVDGNGEVT

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X12.0e-19471.35Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PYS++ LT+EVLYLHSLW RGPPRNPK TH+H ST VA    + NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP I
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        +PVATPA Q VSSEER N AAL+LQYKGS+ACR FFARNADSGSDEEEEEEE +  DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK  K+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF
        QKLEE K AE PTSN+KD+ SGENDDA KD DVKLQ ENA NSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+D+DVSE    +G   EE +EF
Subjt:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF

Query:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYE+ Y DGEF CLACE AG+K +  FKTC RL+QH+T LGK+ + K+  QKP   K+LKM +LAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++  D  VEDD
Subjt:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD

A0A1S3CJZ1 uncharacterized protein LOC103501816 isoform X37.1e-19270.98Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PYS++ LT+EVLYLHSLW RGPPRNPK TH+H ST VA    + NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP I
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        +PVATPA Q VSSEER N AAL+LQYKGS+ACR FFARNADSGSDEEEEEEE +  DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK  K+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLK  PEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF
        QKLEE K AE PTSN+KD+ SGENDDA KD DVKLQ ENA NSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+D+DVSE    +G   EE +EF
Subjt:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF

Query:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYE+ Y DGEF CLACE AG+K +  FKTC RL+QH+T LGK+ + K+  QKP   K+LKM +LAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++  D  VEDD
Subjt:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X21.2e-19471.72Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PYS++ LT+EVLYLHSLW RGPPRNPK TH+H ST VA    + NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP I
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        +PVATPA Q VSSEER N AAL+LQYKGS+ACR FFARNADSGSDEEEEEEE +  DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK  K+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF
        QKLEE K AE PTSN+KD+ SGENDDA KD DVKLQ ENA NSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+D+DVSE    +G   EE +EF
Subjt:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF

Query:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYE+ Y DGEF CLACE AG+K +  FKTC RL+QH+T LGK+ + K+  QKP   K+LKM +LAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD
        A+VL G+ALG SLTKSDVSKD+S  +    ++  D  VEDD
Subjt:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD

A0A5D3DXE1 Uncharacterized protein9.9e-19473.27Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PYS++ LT+EVLYLHSLW RGPPRNPK TH+H ST VA    + NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP I
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        +PVATPA Q VSSEER N AAL+LQYKGS+ACR FFARNADSGSDEEEEEEE +  DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  EPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK  K+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF
        QKLEE K AE PTSN+KD+ SGENDDA KD DVKLQ ENA NSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+D+DVSE    +G   EE +EF
Subjt:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF

Query:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYE+ Y DGEF CLACE AG+K +  FKTC RL+QH+T LGK+ + K+  QKP   K+LKM +LAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGCSLTKSDVSK
        A+VL G+ALG SLTKSDVSK
Subjt:  AVVLKGKALGCSLTKSDVSK

A0A6J1IMA4 uncharacterized protein LOC111476868 isoform X11.3e-18567.84Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV
        MNPYSE+ LTEEVLYLHSLWRRGPPR PK T  +LST VAAA      +NKRPRDPK+R+ KKKK RPEP Q +GPEWP PEPVQ QP TSSGWP + P 
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPV

Query:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        ATPA + VSSEERAN  AL+LQY G EACR F  RNADSGSDEE EEE  EGNDGE+MESEEYKFFL +F+ENDELRGYYEKNSE GLFCCLVCGGMG+K
Subjt:  ATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDD-IDKKNEVVSVDE----NE
        KS KRFKNC+GLV HS SISRTKKK AHRAFGQ +CRVFGWDI+RLPTIVL GEPL RSLA+SGD K QPEE+ VA+EHD  +  +N  +S D+    NE
Subjt:  KSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDD-IDKKNEVVSVDE----NE

Query:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF
        QK EEEK AE                                SISG            V ESI++AC+EFFAAF TSM+D+DVSENN I     EEC+EF
Subjt:  QKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEF

Query:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP
        KFFLKLFIENESLRRYY++KYDDGEF CL C+GAGKK L SFKTC RL++H T  GK+K G + V KPHIAKMLK+K+LAHRAYSLVIC+VLGWDIEKLP
Subjt:  KFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD
        A+VLKG+  GCSLTK DV KD+ VGNA DNT EVD PV+DD
Subjt:  AVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein5.6e-5633.02Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRN---PKRTHNHLSTVVAAAATNRNPS-----------------NKRPRDPKSRKNKKKKPRPEPPQHSGPEWPC
        MN Y +++L +EV+YLHSLW +GPP     P    N +   +     N  P                  ++ P +P++  N  K+PRP+    SG EWP 
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRN---PKRTHNHLSTVVAAAATNRNPS-----------------NKRPRDPKSRKNKKKKPRPEPPQHSGPEWPC

Query:  PEPVQYQPSTSSGWPSIEPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNA---DSGSDEEEEEEEAEGNDGEMME------SEEYKFFLKMFV
         + V   PST SGWP   P      +P+S+EE+   AA  LQ      CR FF R +   DS     +E E  EG++ + +E      S+E++F  ++F 
Subjt:  PEPVQYQPSTSSGWPSIEPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNA---DSGSDEEEEEEEAEGNDGEMME------SEEYKFFLKMFV

Query:  ENDELRGYYEKNSESGLFCCLVCGGMGRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPE
        EN +L+ YYEKN+ +G F CLVCGG+G K  RK FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                           
Subjt:  ENDELRGYYEKNSESGLFCCLVCGGMGRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPE

Query:  ENHVAKEHDDIDKKNEVVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFT
                      N VVS  ++ Q + E  GA  P S++K                  Q +  V S+    E + +   L + ++  +A K+ F    T
Subjt:  ENHVAKEHDDIDKKNEVVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFT

Query:  SMSDNDVSENNLIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGA-GKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLK
          +D   +E N   GD     +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T                  K+ K
Subjt:  SMSDNDVSENNLIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGA-GKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLK

Query:  MKILAHRAYSLVICKVLGWDIEKLPAVVLKGKA
        MKI AH+ ++  +C++LGWD E LP  V+KG A
Subjt:  MKILAHRAYSLVICKVLGWDIEKLPAVVLKGKA

AT1G78810.2 unknown protein5.6e-5633.02Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRN---PKRTHNHLSTVVAAAATNRNPS-----------------NKRPRDPKSRKNKKKKPRPEPPQHSGPEWPC
        MN Y +++L +EV+YLHSLW +GPP     P    N +   +     N  P                  ++ P +P++  N  K+PRP+    SG EWP 
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRN---PKRTHNHLSTVVAAAATNRNPS-----------------NKRPRDPKSRKNKKKKPRPEPPQHSGPEWPC

Query:  PEPVQYQPSTSSGWPSIEPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNA---DSGSDEEEEEEEAEGNDGEMME------SEEYKFFLKMFV
         + V   PST SGWP   P      +P+S+EE+   AA  LQ      CR FF R +   DS     +E E  EG++ + +E      S+E++F  ++F 
Subjt:  PEPVQYQPSTSSGWPSIEPVATPAPQPVSSEERANFAALRLQYKGSEACRGFFARNA---DSGSDEEEEEEEAEGNDGEMME------SEEYKFFLKMFV

Query:  ENDELRGYYEKNSESGLFCCLVCGGMGRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPE
        EN +L+ YYEKN+ +G F CLVCGG+G K  RK FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                           
Subjt:  ENDELRGYYEKNSESGLFCCLVCGGMGRKKSRKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPE

Query:  ENHVAKEHDDIDKKNEVVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFT
                      N VVS  ++ Q + E  GA  P S++K                  Q +  V S+    E + +   L + ++  +A K+ F    T
Subjt:  ENHVAKEHDDIDKKNEVVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKLQAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFT

Query:  SMSDNDVSENNLIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGA-GKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLK
          +D   +E N   GD     +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T                  K+ K
Subjt:  SMSDNDVSENNLIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGA-GKKMLNSFKTCGRLIQHATSLGKSKMGKRPVQKPHIAKMLK

Query:  MKILAHRAYSLVICKVLGWDIEKLPAVVLKGKA
        MKI AH+ ++  +C++LGWD E LP  V+KG A
Subjt:  MKILAHRAYSLVICKVLGWDIEKLPAVVLKGKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTACTCCGAGAAAACACTCACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCGAGGCCCGCCGAGGAACCCTAAACGCACACACAACCATTTATCCAC
CGTCGTCGCTGCTGCCGCGACGAATCGGAACCCCTCCAACAAGAGACCTAGAGATCCAAAGAGTCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACT
CCGGCCCCGAGTGGCCCTGTCCGGAGCCGGTTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTGAGCCTGTTGCCACTCCGGCGCCTCAGCCTGTGTCTTCT
GAAGAGCGAGCAAATTTTGCGGCGTTGCGATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGA
GGAGGAAGCTGAGGGTAATGATGGGGAAATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAACT
CTGAAAGTGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAGAAAGAAATCTAGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCG
AGGACGAAGAAGAAGCAGGCTCATAGGGCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCCCTTGG
TCGATCATTAGCCAATTCTGGAGACTTGAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGACATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGA
ATGAACAGAAATTGGAGGAAGAAAAGGGAGCTGAAGGTCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTG
CAAGCAGAAAATGCAGTTAATTCAATATCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTTTTTGC
AGCCTTCTTTACATCTATGAGCGACAATGATGTTAGTGAAAACAACTTAATCGAAGGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCG
AGAACGAAAGCTTGAGAAGATATTACGAGGACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGAGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGT
GGCCGCCTTATCCAGCATGCAACTTCTCTAGGGAAGAGCAAAATGGGGAAAAGACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATACTGGCTCATAG
GGCATATAGTTTAGTTATATGTAAGGTTCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTTGTTCCTTAACAAAGTCAGACGTGT
CAAAGGACGAATCTGTTGGCAATGCAATTGATAATACGACGGAAGTGGATGTTCCTGTAGAAGACGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATGGCAAT
GGTGAAGTTACTTTGAAGGATGATGCCGTGGATGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATAATTAAACCAGTACCGAACTGGTTGAAATTTAAATTAAAAGTATAAATTCTACAACTAAATTACGAAGAAAATACGTATTTTTCGGGGAAAAAAAACTCGTGAGAGTT
GAGTTGGGACTGGAAGTTGGGAACGGAGACGATGATACCAAACCTCCGTTACATTCCCTCTTGATTCCGCCGTTTTTCCACCAATGAATCCCTACTCCGAGAAAACACTC
ACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCGAGGCCCGCCGAGGAACCCTAAACGCACACACAACCATTTATCCACCGTCGTCGCTGCTGCCGCGACGAATCG
GAACCCCTCCAACAAGAGACCTAGAGATCCAAAGAGTCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACTCCGGCCCCGAGTGGCCCTGTCCGGAGC
CGGTTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTGAGCCTGTTGCCACTCCGGCGCCTCAGCCTGTGTCTTCTGAAGAGCGAGCAAATTTTGCGGCGTTG
CGATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGAGGAGGAAGCTGAGGGTAATGATGGGGA
AATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAACTCTGAAAGTGGGTTGTTTTGTTGCTTGG
TTTGTGGTGGAATGGGGAGAAAGAAATCTAGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCGAGGACGAAGAAGAAGCAGGCTCATAGG
GCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCCCTTGGTCGATCATTAGCCAATTCTGGAGACTT
GAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGACATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGAATGAACAGAAATTGGAGGAAGAAAAGG
GAGCTGAAGGTCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTGCAAGCAGAAAATGCAGTTAATTCAATA
TCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTTTTTGCAGCCTTCTTTACATCTATGAGCGACAA
TGATGTTAGTGAAAACAACTTAATCGAAGGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCGAGAACGAAAGCTTGAGAAGATATTACG
AGGACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGAGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGTGGCCGCCTTATCCAGCATGCAACTTCT
CTAGGGAAGAGCAAAATGGGGAAAAGACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATACTGGCTCATAGGGCATATAGTTTAGTTATATGTAAGGT
TCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTTGTTCCTTAACAAAGTCAGACGTGTCAAAGGACGAATCTGTTGGCAATGCAA
TTGATAATACGACGGAAGTGGATGTTCCTGTAGAAGACGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATGGCAATGGTGAAGTTACTTTGAAGGATGATGCC
GTGGATGTGTAA
Protein sequenceShow/hide protein sequence
MNPYSEKTLTEEVLYLHSLWRRGPPRNPKRTHNHLSTVVAAAATNRNPSNKRPRDPKSRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIEPVATPAPQPVSS
EERANFAALRLQYKGSEACRGFFARNADSGSDEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSRKRFKNCVGLVQHSISIS
RTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKGAEGPTSNAKDVSSGENDDACKDNDVKL
QAENAVNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDNDVSENNLIEGDRVEECKEFKFFLKLFIENESLRRYYEDKYDDGEFFCLACEGAGKKMLNSFKTC
GRLIQHATSLGKSKMGKRPVQKPHIAKMLKMKILAHRAYSLVICKVLGWDIEKLPAVVLKGKALGCSLTKSDVSKDESVGNAIDNTTEVDVPVEDDYGVKETDSMKVDGN
GEVTLKDDAVDV