; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G017430 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G017430
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionUnknown protein
Genome locationCG_Chr11:30653003..30655666
RNA-Seq ExpressionClCG11G017430
SyntenyClCG11G017430
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]2.8e-24375.65Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        TPA  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPT VLKGEPL RSLA+SG+LKVQPEENHVAKEHD              I+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+D SENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G

Query:  VKETDSMKVDSNGEVT
        VKETDSMKVDSNGE T
Subjt:  VKETDSMKVDSNGEVT

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]8.8e-24575.9Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        TPA  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPT VLKGEPL RSLA+SG+LKVQPEENHVAKEHD              I+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+D SENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVP--------------------VEDDY----------------------GVK
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +KDESVGN++DNT E D                      VEDD                       GVK
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVP--------------------VEDDY----------------------GVK

Query:  ETDSMKVDSNGEVT
        ETDSMKVDSNGE T
Subjt:  ETDSMKVDSNGEVT

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]1.0e-24075.32Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        TPA  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPT VLKGEPL RSLA+SG+LK  PEENHVAKEHD              I+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+D SENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDASENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G

Query:  VKETDSMKVDSNGEVT
        VKETDSMKVDSNGE T
Subjt:  VKETDSMKVDSNGEVT

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]3.9e-24576.27Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        TPA  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPT VLKGEPL RSLA+SG+LKVQPEENHVAKEHD              I+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRD
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLPV ESILKACKEF AAFFTSMSD+D SENNLID +
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRD

Query:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD

Query:  SMKVDSNGEVT
        SMKVDSNGE T
Subjt:  SMKVDSNGEVT

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]3.2e-22371.36Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        TPA  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPT VLKGEPL RSLA+SG+LKVQPEENHVAKEHD              I+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDG-------------IDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRD
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+                                   V ESILKACKEF AAFFTSMSD+D SENNLID +
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRD

Query:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD

Query:  SMKVDSNGEVT
        SMKVDSNGE T
Subjt:  SMKVDSNGEVT

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X11.0e-19873.2Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVATPA Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPT VLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDD SE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A1S3CJZ1 uncharacterized protein LOC103501816 isoform X33.6e-19672.83Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVATPA Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPT VLKGEPL RSLANSGDLK  PEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDD SE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X24.6e-19973.57Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVATPA Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPT VLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDD SE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSKD+S  +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A5D3DXE1 Uncharacterized protein1.1e-19775Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVATPA Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPT VLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDD SE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSK
        A+VL G+ALG SLTKSDVSK
Subjt:  AVVLKGKALGRSLTKSDVSK

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X24.3e-18966.97Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPK--NRKNKKKKPRPEP--PQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PY E+ LTEEVL+LHSLWRRGPP+N K   NHS+  VA  ANR PSNKRP  P+    K KKKKPRP P  PQ SGPEWPCPEPVQ QPSTSSGWP+I
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPK--NRKNKKKKPRPEP--PQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  QPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGG
        QP ATPA QPVSSEERA  +ALQLQYK  +ACRGFFARNADSGS+ EEEEEEE E+NDG + + EEYKFFLKMFVEN EL  YYEKN E G FCCLVCGG
Subjt:  QPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHD-GIDKKNEVVSVDENE
        MG+KKSGKRFK+CVGLVQHSISISRTKKK+AHRAFG V+CRV GWD++RLP  VLKGEPL RSLA+SG+ +VQPE+NHVAKE   G+  +N+     +NE
Subjt:  MGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHD-GIDKKNEVVSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF
        +KLEE+KAAEDP SNAK+ SSGEN + CK+NDV +Q ENTDNSI GMG    EM NLPV + I KACKEFFA F  S SD+      L D D +EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF EN+ LR YYE+ Y+DGEF CLACEGAGKK    FKTCGRLLQH+TSL K+++G+        AKMLKMK LAHRAYS  +CKVLGWD+E+LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDS
        +VVLKG+ LGRSLTK  VSKDE +GN   N + +  P+E+  G  E   ++ D+
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein8.9e-5431.96Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG
        MN Y ++ L +EV+YLHSLW +GPP R P P+ N +                           V     +RNP+N     P+N  N  K+PRP+    SG
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG

Query:  PEWPCPEPVQYQPSTSSGWPSIQPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEDNDGEMMESEEYKFF
         EWP  + V   PST SGWP  +P      +P+S+EE+   AA  LQ      CR FF R +       +G DE E  E +E+      E   S+E++F 
Subjt:  PEWPCPEPVQYQPSTSSGWPSIQPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEDNDGEMMESEEYKFF

Query:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDL
         ++F EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                      
Subjt:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDL

Query:  KVQPEENHVAKEHDGIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF
                           N VVS  ++ Q + E   A +P S++K                K Q  +         E + +   L + ++  +A K+ F
Subjt:  KVQPEENHVAKEHDGIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF

Query:  AAFFTSMSD--DDASENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP
            T  +D  ++  + NL         +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T               
Subjt:  AAFFTSMSD--DDASENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP

Query:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKAL--------GRSLTKSDVSK---DESVGNAIDNTTEADVPVE
           K+ KMK+ AH+ ++  +C++LGWD E LP  V+KG A             T S V +   ++  GN  DN  EA+  VE
Subjt:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKAL--------GRSLTKSDVSK---DESVGNAIDNTTEADVPVE

AT1G78810.2 unknown protein3.1e-5432.41Show/hide
Query:  MNPYSEKILTEEVLYLHSLWRRGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG
        MN Y ++ L +EV+YLHSLW +GPP R P P+ N +                           V     +RNP+N     P+N  N  K+PRP+    SG
Subjt:  MNPYSEKILTEEVLYLHSLWRRGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG

Query:  PEWPCPEPVQYQPSTSSGWPSIQPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEDNDGEMMESEEYKFF
         EWP  + V   PST SGWP  +P      +P+S+EE+   AA  LQ      CR FF R +       +G DE E  E +E+      E   S+E++F 
Subjt:  PEWPCPEPVQYQPSTSSGWPSIQPVATPAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEDNDGEMMESEEYKFF

Query:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDL
         ++F EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                      
Subjt:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDL

Query:  KVQPEENHVAKEHDGIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF
                           N VVS  ++ Q + E   A +P S++K                K Q  +         E + +   L + ++  +A K+ F
Subjt:  KVQPEENHVAKEHDGIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF

Query:  AAFFTSMSD--DDASENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP
            T  +D  ++  + NL         +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T               
Subjt:  AAFFTSMSD--DDASENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP

Query:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKA
           K+ KMK+ AH+ ++  +C++LGWD E LP  V+KG A
Subjt:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTACTCCGAGAAAATACTCACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGTCGTCGCCGCTGCCGCGAATCGGAACCCCTCCAACAAGAGACCTAGAGATCCAAAGAATCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACTCCG
GCCCCGAGTGGCCCTGTCCGGAGCCGGTTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTCAGCCTGTTGCCACTCCGGCGCCTCAGCCTGTGTCTTCTGAA
GAGCGAGCAAATTTTGCGGCGTTGCAATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGAGGA
GGAGGAAGCTGAGGATAATGATGGGGAAATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAATT
CTGAAAGTGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAGAAAGAAATCTGGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCG
AGGACGAAGAAGAAGCAGGCTCATAGGGCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGACTGTGTTGAAGGGCGAGCCCCTTGG
TCGATCATTAGCCAATTCTGGAGACTTGAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGGCATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGA
ATGAACAGAAATTGGAGGAAGAAAAGGCAGCTGAAGATCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTG
CAAGCAGAAAATACAGATAATTCAATATCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTCTTTGC
GGCCTTCTTTACATCTATGAGCGACGATGATGCTAGTGAAAACAACTTAATCGACAGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCG
AGAACGAAAGCTTGAGAAGATATTACGAGAACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGTGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGT
GGCCGCCTTCTCCAGCATACAACTTCTCTAGGGAAGAGCAAAATGGGGAAAAAACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATGCTGGCTCATAG
GGCATATAGTTTAGTTATATGTAAGGTTCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTCGTTCCTTAACGAAGTCAGACGTGT
CAAAGGACGAATCTGTTGGCAATGCAATTGATAATACGACAGAAGCGGATGTTCCTGTAGAAGATGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATAGCAAT
GGTGAAGTTACTTTGAAGGATGATGCCGTGGATGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATAATTAAACCAGTACCGAACTGGTTGAAATTTAAATTAGAAGTATAAATCCTACAACTAAATTACGAAGAAAATACGTAATTTTCGGGGAAAAAAAACTCGTGAGAGTT
GAGTTGGGACTGGAAGTTGGGAACGGAGACGATGATACCAAACCTCCATTACATTCTCTCTTGATTCCGCCATTTTTCCACCAATGAATCCCTACTCCGAGAAAATACTC
ACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCACCGTCGTCGCCGCTGCCGCGAATCGGAA
CCCCTCCAACAAGAGACCTAGAGATCCAAAGAATCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACTCCGGCCCCGAGTGGCCCTGTCCGGAGCCGG
TTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTCAGCCTGTTGCCACTCCGGCGCCTCAGCCTGTGTCTTCTGAAGAGCGAGCAAATTTTGCGGCGTTGCAA
TTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGAGGAGGAGGAAGCTGAGGATAATGATGGGGA
AATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAATTCTGAAAGTGGGTTGTTTTGTTGCTTGG
TTTGTGGTGGAATGGGGAGAAAGAAATCTGGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCGAGGACGAAGAAGAAGCAGGCTCATAGG
GCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGACTGTGTTGAAGGGCGAGCCCCTTGGTCGATCATTAGCCAATTCTGGAGACTT
GAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGGCATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGAATGAACAGAAATTGGAGGAAGAAAAGG
CAGCTGAAGATCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTGCAAGCAGAAAATACAGATAATTCAATA
TCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTCTTTGCGGCCTTCTTTACATCTATGAGCGACGA
TGATGCTAGTGAAAACAACTTAATCGACAGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCGAGAACGAAAGCTTGAGAAGATATTACG
AGAACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGTGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACTTCT
CTAGGGAAGAGCAAAATGGGGAAAAAACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATGCTGGCTCATAGGGCATATAGTTTAGTTATATGTAAGGT
TCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTCGTTCCTTAACGAAGTCAGACGTGTCAAAGGACGAATCTGTTGGCAATGCAA
TTGATAATACGACAGAAGCGGATGTTCCTGTAGAAGATGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATAGCAATGGTGAAGTTACTTTGAAGGATGATGCC
GTGGATGTGTAA
Protein sequenceShow/hide protein sequence
MNPYSEKILTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVATPAPQPVSSE
ERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEDNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISIS
RTKKKQAHRAFGQVVCRVFGWDINRLPTTVLKGEPLGRSLANSGDLKVQPEENHVAKEHDGIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKL
QAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDASENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTC
GRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDSN
GEVTLKDDAVDV