; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G018290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G018290
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionChlorophyllase
Genome locationCmo_Chr02:10361418..10364323
RNA-Seq ExpressionCmoCh02G018290
SyntenyCmoCh02G018290
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606410.1 Ankyrin repeat-containing protein ITN1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.74Show/hide
Query:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM
        MSSHG NTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETV+KLVESVKNDKNNYK ILEIRNERGNNPLHLAAMMGSMKM
Subjt:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM

Query:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL
        CHAIA AHG+LV  RN+LDETPLYLAAGCGNKD+FFYLYF CGEDSSKIETNCRVN+TGDTVLHHALRNDQFDLA +LIY+H+EA NW NEAG TP+ VL
Subjt:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL

Query:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNT
        ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKC N NDTNN GDEKK+NT
Subjt:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNT

Query:  KNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRF
        KNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGT+EIKRIRKEKEKHTWAVQVMNQL +L K Y YG+DGKSPMDFRF
Subjt:  KNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRF

Query:  QNQKEEEKDTVPYDFVDDVVGFRHLF-EEPKQLEVPKQSGKP------KDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQ
        QNQ+E+EKDTVPYDFVDDV+GFRHL  EEPKQLE PKQS KP      KDVEATESAMLLAAKNGV+EIVK MS R PLAIYDTNKDKKNVVLLAAEYRQ
Subjt:  QNQKEEEKDTVPYDFVDDVVGFRHLF-EEPKQLEVPKQSGKP------KDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQ

Query:  PEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYD
        PEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDP LWRITGDAL+LQWEVKWY YVKESVPLHVFPHWNKEG+TANALFQETH++LA KGG+WLYD
Subjt:  PEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYD

Query:  TSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVS
        TS+SFT+VATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVS
Subjt:  TSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVS

Query:  FCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDLMHSSKANEEHDD
        FCSGHYFLI NRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDL+Q LLQTVPTRSAEVVLRDLMH SKANEE DD
Subjt:  FCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDLMHSSKANEEHDD

KAG6606411.1 hypothetical protein SDJN03_03728, partial [Cucurbita argyrosperma subsp. sororia]1.8e-30170.9Show/hide
Query:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM
        MSSHG NTP AR REN +TRNWK VIE YE  REAQKLKLNQEGDTALH AV+DNQEETVKKLVESVK D++N KHILEIRNE GNNPLHLAAMMGS+KM
Subjt:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM

Query:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL
        C AIA AHGELVH RN+LDETPLYLAA CGNKD FF LY  CGEDS+ I TNCRV   GDTVLH ALRNDQFDLA +LIY+H+EA NW NEAG TP+ VL
Subjt:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL

Query:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQP------------------------------RSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRG
        ASKPTSFKSGS I+G + I+YH   VKPLQP                              RSIEK KQESERS EKAKETAPPSPFPVNYQTCI F RG
Subjt:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQP------------------------------RSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRG

Query:  MKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHT
        MKD+ LRVWHFKC N NDTNN GDEKK+NTKNDGDKRTGD EEQKPLNTKSA +PPTT FP + +T   FF IAFSAIMII GCGTNEIKRIRK+KEKHT
Subjt:  MKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHT

Query:  WAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLF-EEPKQLEVPKQSGKPK------DVEATESAMLLAAKNGVVEIV
        WAVQVMNQLLELVKDY+YG+DGKSPMDFRFQNQ+E+EKDTVPYDFVDDVVGFRHLF EEPKQLE PKQS KPK      +VEATESAMLLAAKNGVVEIV
Subjt:  WAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLF-EEPKQLEVPKQSGKPK------DVEATESAMLLAAKNGVVEIV

Query:  KGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFP
        KGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKE K IE+LFRAVD  GNSALHLAAT     +WRI+G AL+LQWE+KWYK+VKES+PL  F 
Subjt:  KGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFP

Query:  HWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGG-NDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSR
         +NKEGK A  +F+ETH  L K  G WL  TS S  +V TLI TVAF +A +IPGG N  +G   L  +  F IF+  SLIAL LSS SV MFLAI++ R
Subjt:  HWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGG-NDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSR

Query:  FNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTLLQTVPTRSA
        F+  DF   LP KLLIGL  L+ SII+MLVSFC+GHYFL+   +R+   LLY+   +PVA IF + +  LY D++ T  + VP+RSA
Subjt:  FNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTLLQTVPTRSA

XP_022931017.1 uncharacterized protein LOC111437339 [Cucurbita moschata]0.0e+0083.97Show/hide
Query:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM
        MSSHG NTP AR REN  TRNWK VIE YE  REAQKLKLNQEGDTALH AV+DNQEETVKKLVESVK D++N KHILE RNERGNNPLHLAAMMGS+KM
Subjt:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM

Query:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL
        CHAIAHAHGELV ERNELDETPLYLAA CGNK+ FF LY  CG+DS+ I TNCRV   GDTVLH ALRNDQFDLA QLI +H+EA NW NEAG TP+ VL
Subjt:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL

Query:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKL------------------------------------------------------------KQE
        ASKPTSFKSGS I+G + I+YH   VKPL+PRSIEKL                                                            KQE
Subjt:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKL------------------------------------------------------------KQE

Query:  SERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF
        SERS EKAKETAPPSPFPVNYQTCIHF+RGMKD+ LRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF
Subjt:  SERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF

Query:  FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK
        FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK
Subjt:  FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK

Query:  PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA
        PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA
Subjt:  PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA

Query:  LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS
        LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS
Subjt:  LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS

Query:  SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL
        SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYF      
Subjt:  SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL

Query:  LQTVPTRSAEVVLRDLMHSSKANEEHDDDDENGSTDMSETDD
                      DLMHSSKANEEHDDDDENGSTDMSETDD
Subjt:  LQTVPTRSAEVVLRDLMHSSKANEEHDDDDENGSTDMSETDD

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]1.4e-30173.98Show/hide
Query:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP
        K V+E Y++  E    KLN+ GDT LHLAVIDNQE  V++LV+ V+        +LEI NE GNN LHLAAMMGS++MC AIA  H  LV  RN  D+TP
Subjt:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP

Query:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH
        L+LAA  GNKD FF LY  C +D+S+I  NCRV + GDTVLH+AL ++ FDLAFQLI MHKEAI+WVNEAGLTPIHVLA KPTSFKSGSHIKG Q I+YH
Subjt:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH

Query:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA
         SFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKD+ L   HFKC N NDTNN GD KKN  KNDGDKR GDLEEQKPLNTKSA
Subjt:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA

Query:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF
         +PPTT+FPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDG SPMD +F+N  EE   T+PY+FVD+ V F
Subjt:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF

Query:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL
         + F+EPK+LE      KP+DVEA ESAMLLAA+NGV+EIVKG   R PLAI DT KDKKNVVLLAAE+RQP+VYR LLK+ K I++LFRAVDH+GNSAL
Subjt:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL

Query:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE
        HLAATA DPKLWRITG AL++QWEVKWYKYVK SVPLH FP+WN +GKTA+A+F+ETH++LA KGG WLY TS+S +LVATLIATVAFATA TIPGGNDE
Subjt:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE

Query:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA
         GGA LG E GF IFSYSSLIAL LSSTSVIMFLAIMTSRF+IKDFGL LPWKLLIGLCCLYFSIIAMLVSFCSGHYFLI  RL NEAILLY+LTF PVA
Subjt:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA

Query:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL
        +IF +VQLPLYFDL+Q L++ VP RSAEVVL DL
Subjt:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]1.6e-29773.57Show/hide
Query:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP
        K V+E Y++  E    KLN+ GDT LHLAVIDNQE  V++LV+ V+        +LEI NE GNN LHLAAMMGS++MC AIA  H  LV  RN  D+TP
Subjt:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP

Query:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH
        L+LAA  GNKD FF LY  C +D+S+I  NCRV + GDTVLH+AL ++ FDLAFQLI MHKEAI+WVNEAGLTPIHVLA KPTSFKSGSHIKG Q I+YH
Subjt:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH

Query:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA
         SFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKD+ L   HFKC N NDTNN GD KKN  KNDGDKR GDLEEQKPLNTKSA
Subjt:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA

Query:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF
             T+FPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDG SPMD +F+N  EE   T+PY+FVD+ V F
Subjt:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF

Query:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL
         + F+EPK+LE      KP+DVEA ESAMLLAA+NGV+EIVKG   R PLAI DT KDKKNVVLLAAE+RQP+VYR LLK+ K I++LFRAVDH+GNSAL
Subjt:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL

Query:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE
        HLAATA DPKLWRITG AL++QWEVKWYKYVK SVPLH FP+WN +GKTA+A+F+ETH++LA KGG WLY TS+S +LVATLIATVAFATA TIPGGNDE
Subjt:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE

Query:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA
         GGA LG E GF IFSYSSLIAL LSSTSVIMFLAIMTSRF+IKDFGL LPWKLLIGLCCLYFSIIAMLVSFCSGHYFLI  RL NEAILLY+LTF PVA
Subjt:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA

Query:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL
        +IF +VQLPLYFDL+Q L++ VP RSAEVVL DL
Subjt:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL

TrEMBL top hitse value%identityAlignment
A0A6J1ESB1 uncharacterized protein LOC1114373390.0e+0083.97Show/hide
Query:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM
        MSSHG NTP AR REN  TRNWK VIE YE  REAQKLKLNQEGDTALH AV+DNQEETVKKLVESVK D++N KHILE RNERGNNPLHLAAMMGS+KM
Subjt:  MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKM

Query:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL
        CHAIAHAHGELV ERNELDETPLYLAA CGNK+ FF LY  CG+DS+ I TNCRV   GDTVLH ALRNDQFDLA QLI +H+EA NW NEAG TP+ VL
Subjt:  CHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVL

Query:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKL------------------------------------------------------------KQE
        ASKPTSFKSGS I+G + I+YH   VKPL+PRSIEKL                                                            KQE
Subjt:  ASKPTSFKSGSHIKGLQRIIYHRSFVKPLQPRSIEKL------------------------------------------------------------KQE

Query:  SERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF
        SERS EKAKETAPPSPFPVNYQTCIHF+RGMKD+ LRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF
Subjt:  SERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNFPVNYTTCIDF

Query:  FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK
        FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK
Subjt:  FHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK

Query:  PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA
        PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA
Subjt:  PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDA

Query:  LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS
        LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS
Subjt:  LRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYS

Query:  SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL
        SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYF      
Subjt:  SLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL

Query:  LQTVPTRSAEVVLRDLMHSSKANEEHDDDDENGSTDMSETDD
                      DLMHSSKANEEHDDDDENGSTDMSETDD
Subjt:  LQTVPTRSAEVVLRDLMHSSKANEEHDDDDENGSTDMSETDD

A0A6J1ET50 uncharacterized protein LOC111437341 isoform X27.8e-29873.57Show/hide
Query:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP
        K V+E Y++  E    KLN+ GDT LHLAVIDNQE  V++LV+ V+        +LEI NE GNN LHLAAMMGS++MC AIA  H  LV  RN  D+TP
Subjt:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP

Query:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH
        L+LAA  GNKD FF LY  C +D+S+I  NCRV + GDTVLH+AL ++ FDLAFQLI MHKEAI+WVNEAGLTPIHVLA KPTSFKSGSHIKG Q I+YH
Subjt:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH

Query:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA
         SFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKD+ L   HFKC N NDTNN GD KKN  KNDGDKR GDLEEQKPLNTKSA
Subjt:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA

Query:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF
             T+FPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDG SPMD +F+N  EE   T+PY+FVD+ V F
Subjt:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF

Query:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL
         + F+EPK+LE      KP+DVEA ESAMLLAA+NGV+EIVKG   R PLAI DT KDKKNVVLLAAE+RQP+VYR LLK+ K I++LFRAVDH+GNSAL
Subjt:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL

Query:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE
        HLAATA DPKLWRITG AL++QWEVKWYKYVK SVPLH FP+WN +GKTA+A+F+ETH++LA KGG WLY TS+S +LVATLIATVAFATA TIPGGNDE
Subjt:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE

Query:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA
         GGA LG E GF IFSYSSLIAL LSSTSVIMFLAIMTSRF+IKDFGL LPWKLLIGLCCLYFSIIAMLVSFCSGHYFLI  RL NEAILLY+LTF PVA
Subjt:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA

Query:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL
        +IF +VQLPLYFDL+Q L++ VP RSAEVVL DL
Subjt:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X16.9e-30273.98Show/hide
Query:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP
        K V+E Y++  E    KLN+ GDT LHLAVIDNQE  V++LV+ V+        +LEI NE GNN LHLAAMMGS++MC AIA  H  LV  RN  D+TP
Subjt:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP

Query:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH
        L+LAA  GNKD FF LY  C +D+S+I  NCRV + GDTVLH+AL ++ FDLAFQLI MHKEAI+WVNEAGLTPIHVLA KPTSFKSGSHIKG Q I+YH
Subjt:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH

Query:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA
         SFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKD+ L   HFKC N NDTNN GD KKN  KNDGDKR GDLEEQKPLNTKSA
Subjt:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA

Query:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF
         +PPTT+FPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDG SPMD +F+N  EE   T+PY+FVD+ V F
Subjt:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF

Query:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL
         + F+EPK+LE      KP+DVEA ESAMLLAA+NGV+EIVKG   R PLAI DT KDKKNVVLLAAE+RQP+VYR LLK+ K I++LFRAVDH+GNSAL
Subjt:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL

Query:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE
        HLAATA DPKLWRITG AL++QWEVKWYKYVK SVPLH FP+WN +GKTA+A+F+ETH++LA KGG WLY TS+S +LVATLIATVAFATA TIPGGNDE
Subjt:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDE

Query:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA
         GGA LG E GF IFSYSSLIAL LSSTSVIMFLAIMTSRF+IKDFGL LPWKLLIGLCCLYFSIIAMLVSFCSGHYFLI  RL NEAILLY+LTF PVA
Subjt:  NGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA

Query:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL
        +IF +VQLPLYFDL+Q L++ VP RSAEVVL DL
Subjt:  VIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDL

A0A6J1EYD3 uncharacterized protein LOC111437341 isoform X31.3e-26367.17Show/hide
Query:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP
        K V+E Y++  E    KLN+ GDT LHLAVIDNQE  V++LV+ V+        +LEI NE GNN LHLAAMMGS++MC AIA  H  LV  RN  D+TP
Subjt:  KPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETP

Query:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH
        L+LAA  GNKD FF LY  C +D+S+I  NCRV + GDTVLH+AL ++ FDLAFQLI MHKEAI+WVNEAGLTPIHVLA KPTSFKSGSHIKG Q I+YH
Subjt:  LYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRIIYH

Query:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA
         SFVKPLQPRS EKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKD+ L   HFKC N NDTNN GD KKN  KNDGDKR GDLEEQKPLNTKSA
Subjt:  RSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSA

Query:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF
         +PPTT+FPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDG SPMD +F+N  EE   T+PY+FVD+ V F
Subjt:  RDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGF

Query:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL
         + F+EPK+LE      KP+DVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEV+RFLLKE K IESLFRAVD  GNSAL
Subjt:  RHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSAL

Query:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGG-ND
        HLAAT     +WRI+G AL+LQWE+KWYK+VKES+PL  F  +NKEGK A  +F+ETH  L K+ G WL  TS S  +V TLI TVAF +A +IPGG N 
Subjt:  HLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGG-ND

Query:  ENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPV
         +G   L  +  F IF+  SLIAL LSSTSV MFLAI++ RF+  DF   LP KLLIGL  L+ SII+MLVSFC+GHYFL+   +R+   LLY+   +PV
Subjt:  ENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPV

Query:  AVIFAVVQLPLYFDLVQTLLQTVPTRSA
        A I+ + +  LY D++ T  + VP+RSA
Subjt:  AVIFAVVQLPLYFDLVQTLLQTVPTRSA

A0A6J1K7T7 uncharacterized protein LOC1114914291.3e-28175.53Show/hide
Query:  WKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVK---NDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNEL
        W+ VI+ Y+K + AQ LKLN+ GDTALHLAVIDNQEETV+ LV+ +     D    K +LE  +ERGNNPLHLAAMMGS++MC AIA  H  LV  RN +
Subjt:  WKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVK---NDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNEL

Query:  DETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQR
        DETPL+LAA  GNKD  F LY  C +++S+I  NCRV+  GDTVLHHALRN+ FDLAFQLI MHKEAI+WVNEAGLTPIHVLASKPTSFKSGSHIKG Q 
Subjt:  DETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQR

Query:  IIYHRSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLN
        I+YH SFVKPLQPRSIE LKQE +RS +KA+E A PSPFPVNY+TCI F+RGMKD+ L VW FKC NKNDT N G+EKKNNTKNDGDKR  DLEEQKPL+
Subjt:  IIYHRSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLN

Query:  TKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDD
        T+SA DPPTTNFPVNYTTCIDFFHIAFSAIMII G G+ EIK+IRK+KEKHTW+VQVM +LLEL +  KY EDGKSPMD +FQN  E    T+PY+FVD+
Subjt:  TKSARDPPTTNFPVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDD

Query:  VVGFRHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKG
         V F H  EEPK+LE      KP+DVEATE+AMLLAAKNGV+EIVKG+  R PLAI DT KDKKNVVLLAAE+RQP+VYRFLLK+R  I++LFRAVDHKG
Subjt:  VVGFRHLFEEPKQLEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKG

Query:  NSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPG
        NSALHLAAT+ DPKLWRITG AL++QWEVKWY YVK SVPL VFPHWNKEGKTA+A+FQETH++LA KGG+WLY+TS+SFTLVATLIATVAFATAVTIPG
Subjt:  NSALHLAATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPG

Query:  GNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGL
        GNDENGGAI GQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRF+IKDFGLVLPWKLLIGL
Subjt:  GNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGL

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR42.5e-0623.6Show/hide
Query:  LEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLA------
        +E+ K +GK        +++  AA+ G VEIVK +  + P      +K  +  + +A +    +V R L+     I  L    D  GN+ALH+A      
Subjt:  LEVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLA------

Query:  ---------------ATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKE----GKTANALFQETH-----------------EKLAKKGGK
                       A  RD K      +AL L  E    K +            N+      KT   + ++ H                 ++L K   +
Subjt:  ---------------ATARDPKLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKE----GKTANALFQETH-----------------EKLAKKGGK

Query:  WLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIA
         + + ++S T+VA L ATVAFA   T+PGGN  NG A++ Q   F IF   + IALF S   V++ + ++      +   + +  KL      ++ + + 
Subjt:  WLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIA

Query:  MLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVV
          +SF +  Y ++    +  A+L+  +  I +A +   +
Subjt:  MLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVV

Q04861 Nuclear factor NF-kappa-B p105 subunit1.1e-0625.91Show/hide
Query:  NQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYF
        +  GD  LHL++I    E VK L+E + +   NY +I+ +RN+    PLHLA +    ++   +  A G  V+  +    + L+LAA  G+  +   L  
Subjt:  NQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLYF

Query:  H----------CGEDSSKI--------------------ETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK
        H           GE  S I                    + N +  K+G T LH A+  +   LA  L+      ++     G TP+H+ A +
Subjt:  H----------CGEDSSKI--------------------ETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK

Q8BZ25 Ankyrin repeat and protein kinase domain-containing protein 15.9e-0829.88Show/hide
Query:  LNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLY
        L+  G + LH+A    ++   K L+         Y   LE+R ++G  PLHLA   G +++ H +A +H +L      +  TPL+LAA  G + V   L 
Subjt:  LNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLY

Query:  FHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK
          CG + +  E      ++G T LH A+    F L    +  +   I+  N+ G TP H+ A K
Subjt:  FHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK

Q8NFD2 Ankyrin repeat and protein kinase domain-containing protein 11.7e-0730.49Show/hide
Query:  LNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLY
        L+Q G   LH A    +    K L+         Y   LE+   +G  PLHLAA  G +++ H +A +H  +      ++ TPL+LAA  G + V   L 
Subjt:  LNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGELVHERNELDETPLYLAAGCGNKDVFFYLY

Query:  FHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK
          CG D +  E      ++G T LH A++   F     L+  H   ++  N+ G TP H+ A K
Subjt:  FHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASK

Q91974 NF-kappa-B inhibitor alpha2.1e-0529.81Show/hide
Query:  NQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMC-----HAIAHAHGELVHERNELDETPLYLAAGCGNKDVF
        N    T LHLAVI +Q E  + L+++  +        L++R+ RGN PLH+A   GS++       H   H    ++   N    T L+LA+  G   V 
Subjt:  NQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMC-----HAIAHAHGELVHERNELDETPLYLAAGCGNKDVF

Query:  FYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTP
         YL       S   + N +    G T LH A+     DL   L+  H   +N V   G +P
Subjt:  FYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTP

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein7.1e-3332.35Show/hide
Query:  EVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAA-TARDP
        E+PK     +       A+  A +NG+VE ++ M    P  ++  N    N+   A   RQ +++  +     +   L    D   N+ LH AA  A   
Subjt:  EVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAA-TARDP

Query:  KLWRITGDALRLQWEVKWYKYVKESV-PLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQ
        +L  I G AL++Q E++W+K V++ V P H      K+ KT  ALF + H+ L ++G KW+ +T+ S T+VA LI T+ F++A T+PGG   +G  +   
Subjt:  KLWRITGDALRLQWEVKWYKYVKESV-PLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQ

Query:  ELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQL
        +  F IF  S  I+LF S  S++MFL I+ SR+  +DF   LP KL++GL  L+ S+  M+V+F      L+  ++   +     L  IP+  +F V+Q 
Subjt:  ELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQL

Query:  PLYFDL
        P+  ++
Subjt:  PLYFDL

AT3G54070.1 Ankyrin repeat family protein3.0e-3135.02Show/hide
Query:  AAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRI---TGDALRLQWEVKWY
        AA+ G VEI+  +       ++  + + + +  +AA YR   ++  L+ E   I+ L  +   K +    L   AR P + R    +G AL +Q E+ W+
Subjt:  AAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRI---TGDALRLQWEVKWY

Query:  KYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENG--GAILG-----QELGFSIFSYSSLI
        K VKE VP       N +G+ A+ +F E HE L K+G +W+ +T+ +  L ATLIATV FA A+TIPGGND++G     LG     + L F IF+ S  +
Subjt:  KYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENG--GAILG-----QELGFSIFSYSSLI

Query:  ALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL
        ALF S  S+++FL+I TSR+  +DF   LP KL+ GL  L+ SII+M+++F +    LI     + +++L S      A+ FA +   L+F+ ++++
Subjt:  ALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTL

AT5G04700.1 Ankyrin repeat family protein9.2e-3332.53Show/hide
Query:  VDDVVGFRHLFEEPKQL-----EVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVV-LLAAEYRQPEVYRFLLKERKRIES
        +D+V   + +  + K+L     E     G  +  E  + A+L A + G V+ +  M       ++ T     + + LLA E+RQ +V+  L     R   
Subjt:  VDDVVGFRHLFEEPKQL-----EVPKQSGKPKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVV-LLAAEYRQPEVYRFLLKERKRIES

Query:  LFRAVDHKGNSALHLAATARDP-KLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATV
        L    D  GN  LHLA     P KL  + G  L+LQ E++W+K V+   P       N E +T   +F + H+ L ++  KW+ DT+ S +LVA LI TV
Subjt:  LFRAVDHKGNSALHLAATARDP-KLWRITGDALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATV

Query:  AFATAVTIPGGNDEN--GGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRL
         FA   T+PGG D+N  G     ++  F IF  S LI+ F S TSV++FL I+T+R++  DF + LP K++ GL  L+ SI AML++F S  + ++    
Subjt:  AFATAVTIPGGNDEN--GGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRL

Query:  RNEAILLYSLTFIPV-AVIFAVVQLPLYFDLV
          + I+  ++ F  + A++F ++Q PL  +++
Subjt:  RNEAILLYSLTFIPV-AVIFAVVQLPLYFDLV

AT5G04730.1 Ankyrin-repeat containing protein5.4e-3332.79Show/hide
Query:  EATESAMLLAAKNG----VVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDP-KLWRITGD
        E    A+L AAK+G     +EI+K  +S++   +  T+   +N+  LA E+++ +++  +     R  +L R+ D   N+ LH+A     P +L +I+G 
Subjt:  EATESAMLLAAKNG----VVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDP-KLWRITGD

Query:  ALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGND-ENGGAILGQELGFSIFS
        AL++Q E +W+K V+  V        NK+ KT   +F+  HE L K+G +W+  T+ + + VA LIATV F    T+PGG D  +G  ++  +L F  F 
Subjt:  ALRLQWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGND-ENGGAILGQELGFSIFS

Query:  YSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA----VIFAVVQLPLYF
        ++  +A F S  SV++FL+I+TSR++  DF + LP K+++G   L+ SI +MLV+F +     +S  +R++  L+Y L   P+A    ++F ++Q PL  
Subjt:  YSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVA----VIFAVVQLPLYF

Query:  DLVQT
        +++ +
Subjt:  DLVQT

AT5G35810.1 Ankyrin repeat family protein9.2e-3334.44Show/hide
Query:  SAMLL--AAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYR--FLLKERKRIESLFRAVDHKGNSALHLAATARDP-KLWRITGDALRL
        S MLL  AA++G +E++  +    P  I+  +   +++  +AA  R  +++   + L   K + ++++  +   N  LHL A    P +L  ++G AL++
Subjt:  SAMLL--AAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYR--FLLKERKRIESLFRAVDHKGNSALHLAATARDP-KLWRITGDALRL

Query:  QWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGG-AILG-----QELGFSIF
        Q E+ WYK VKE VP       NK+ + A+ LF + H+ L K+G KW+ +T+ +  LV+TLIATV FA A T+PGGND +G    LG     +E  F +F
Subjt:  QWEVKWYKYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGG-AILG-----QELGFSIF

Query:  SYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLV
          S  +AL  S TS+++FL+I+TSR+    F   LP KL++GL  L+ SII+M+++F +    LI ++    +++L        A+ F V+   L+FD +
Subjt:  SYSSLIALFLSSTSVIMFLAIMTSRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLV

Query:  QT
        ++
Subjt:  QT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCTCATGGTGCCAATACCCCATTGGCAAGGCTTCGTGAAAACGCGATGACGCGGAATTGGAAGCCAGTGATTGAGGCGTATGAGAAAACCCGAGAGGCTCAGAA
GCTGAAGCTGAACCAAGAGGGCGACACGGCGCTGCATTTGGCTGTGATTGACAATCAAGAAGAAACAGTGAAAAAACTTGTGGAAAGCGTTAAGAACGACAAAAATAATT
ACAAGCATATTCTAGAGATTAGAAATGAGAGGGGAAACAATCCTCTCCACCTTGCCGCAATGATGGGGAGCATGAAAATGTGCCACGCCATTGCTCATGCCCATGGGGAG
TTGGTGCACGAGAGAAACGAATTGGACGAAACGCCTCTGTACTTGGCGGCTGGGTGCGGCAACAAGGACGTCTTTTTTTACCTTTACTTCCATTGCGGAGAGGACTCATC
TAAAATCGAAACCAACTGCAGAGTGAACAAAACGGGAGACACCGTGCTACATCACGCCCTCAGAAACGACCAGTTTGATTTGGCGTTCCAATTAATTTACATGCACAAGG
AGGCTATAAACTGGGTGAATGAGGCAGGCCTAACTCCTATCCATGTTCTAGCCAGTAAGCCAACTTCCTTCAAAAGTGGAAGCCACATCAAAGGATTGCAGCGCATCATC
TATCACCGCTCATTTGTGAAACCACTACAGCCTCGATCGATTGAGAAGCTGAAGCAAGAGTCGGAGCGAAGTACGGAGAAAGCAAAAGAGACAGCTCCTCCTTCCCCTTT
TCCGGTTAACTACCAGACATGCATCCACTTTTTTAGGGGGATGAAGGATGTATTTTTGAGAGTCTGGCATTTCAAATGCGACAACAAGAATGACACGAACAATGTTGGAG
ACGAGAAGAAGAACAATACCAAAAATGATGGAGACAAGAGGACCGGAGACTTGGAAGAACAGAAGCCTCTGAACACTAAATCAGCCAGAGATCCCCCAACTACAAACTTC
CCCGTAAACTACACCACATGCATCGATTTTTTTCATATTGCTTTCTCGGCCATCATGATCATTTTTGGATGCGGAACCAATGAAATCAAAAGGATACGGAAGGAAAAAGA
GAAGCACACTTGGGCGGTTCAAGTGATGAACCAACTTCTTGAGTTGGTTAAAGACTATAAATATGGAGAAGATGGAAAAAGTCCCATGGATTTCAGATTTCAGAACCAGA
AAGAAGAAGAGAAAGATACAGTTCCTTACGACTTTGTAGATGATGTGGTCGGATTCCGTCATTTGTTTGAGGAACCAAAACAATTAGAGGTGCCCAAACAATCGGGGAAG
CCCAAAGATGTGGAAGCAACCGAGTCGGCGATGTTATTAGCAGCAAAGAATGGAGTGGTTGAGATTGTGAAGGGAATGTCCAGTCGTATTCCGCTTGCTATCTATGATAC
TAACAAAGATAAGAAGAATGTGGTGCTGTTGGCTGCGGAGTACAGGCAGCCGGAGGTGTACAGGTTTTTACTCAAGGAAAGAAAAAGGATAGAAAGCCTGTTTCGAGCGG
TGGATCATAAGGGCAACAGCGCCTTACATCTCGCAGCAACCGCCAGAGATCCAAAGCTTTGGCGCATCACTGGAGATGCATTACGGTTGCAGTGGGAAGTTAAGTGGTAT
AAATACGTGAAGGAGTCTGTGCCACTCCATGTGTTTCCCCACTGGAACAAGGAAGGAAAAACTGCAAATGCACTCTTCCAGGAAACCCACGAGAAACTTGCGAAAAAGGG
TGGAAAGTGGCTTTACGATACCTCGGATTCATTCACTTTGGTGGCGACTCTGATTGCAACAGTAGCTTTTGCAACCGCAGTAACCATCCCAGGCGGCAACGACGAGAATG
GTGGTGCAATACTTGGACAGGAGCTAGGGTTTTCAATCTTCTCCTACTCTTCCCTAATTGCTCTCTTCCTCTCTTCAACCTCAGTCATCATGTTTCTAGCCATCATGACC
TCCAGGTTCAATATCAAAGACTTCGGGTTAGTGTTGCCCTGGAAATTGCTGATCGGCTTGTGCTGCCTCTACTTTTCCATCATCGCCATGTTGGTTTCCTTCTGTTCGGG
GCACTACTTTCTGATCAGCAACCGTCTTCGCAACGAAGCCATTCTGCTGTATTCACTTACTTTCATCCCAGTGGCAGTGATCTTCGCTGTTGTGCAGCTTCCTCTCTACT
TCGATTTGGTGCAGACTCTTCTCCAAACTGTGCCTACAAGAAGCGCCGAAGTGGTGCTACGCGATTTGATGCACAGTAGTAAAGCAAACGAAGAACATGATGATGATGAT
GAAAATGGGTCAACAGATATGTCGGAAACAGACGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCTCATGGTGCCAATACCCCATTGGCAAGGCTTCGTGAAAACGCGATGACGCGGAATTGGAAGCCAGTGATTGAGGCGTATGAGAAAACCCGAGAGGCTCAGAA
GCTGAAGCTGAACCAAGAGGGCGACACGGCGCTGCATTTGGCTGTGATTGACAATCAAGAAGAAACAGTGAAAAAACTTGTGGAAAGCGTTAAGAACGACAAAAATAATT
ACAAGCATATTCTAGAGATTAGAAATGAGAGGGGAAACAATCCTCTCCACCTTGCCGCAATGATGGGGAGCATGAAAATGTGCCACGCCATTGCTCATGCCCATGGGGAG
TTGGTGCACGAGAGAAACGAATTGGACGAAACGCCTCTGTACTTGGCGGCTGGGTGCGGCAACAAGGACGTCTTTTTTTACCTTTACTTCCATTGCGGAGAGGACTCATC
TAAAATCGAAACCAACTGCAGAGTGAACAAAACGGGAGACACCGTGCTACATCACGCCCTCAGAAACGACCAGTTTGATTTGGCGTTCCAATTAATTTACATGCACAAGG
AGGCTATAAACTGGGTGAATGAGGCAGGCCTAACTCCTATCCATGTTCTAGCCAGTAAGCCAACTTCCTTCAAAAGTGGAAGCCACATCAAAGGATTGCAGCGCATCATC
TATCACCGCTCATTTGTGAAACCACTACAGCCTCGATCGATTGAGAAGCTGAAGCAAGAGTCGGAGCGAAGTACGGAGAAAGCAAAAGAGACAGCTCCTCCTTCCCCTTT
TCCGGTTAACTACCAGACATGCATCCACTTTTTTAGGGGGATGAAGGATGTATTTTTGAGAGTCTGGCATTTCAAATGCGACAACAAGAATGACACGAACAATGTTGGAG
ACGAGAAGAAGAACAATACCAAAAATGATGGAGACAAGAGGACCGGAGACTTGGAAGAACAGAAGCCTCTGAACACTAAATCAGCCAGAGATCCCCCAACTACAAACTTC
CCCGTAAACTACACCACATGCATCGATTTTTTTCATATTGCTTTCTCGGCCATCATGATCATTTTTGGATGCGGAACCAATGAAATCAAAAGGATACGGAAGGAAAAAGA
GAAGCACACTTGGGCGGTTCAAGTGATGAACCAACTTCTTGAGTTGGTTAAAGACTATAAATATGGAGAAGATGGAAAAAGTCCCATGGATTTCAGATTTCAGAACCAGA
AAGAAGAAGAGAAAGATACAGTTCCTTACGACTTTGTAGATGATGTGGTCGGATTCCGTCATTTGTTTGAGGAACCAAAACAATTAGAGGTGCCCAAACAATCGGGGAAG
CCCAAAGATGTGGAAGCAACCGAGTCGGCGATGTTATTAGCAGCAAAGAATGGAGTGGTTGAGATTGTGAAGGGAATGTCCAGTCGTATTCCGCTTGCTATCTATGATAC
TAACAAAGATAAGAAGAATGTGGTGCTGTTGGCTGCGGAGTACAGGCAGCCGGAGGTGTACAGGTTTTTACTCAAGGAAAGAAAAAGGATAGAAAGCCTGTTTCGAGCGG
TGGATCATAAGGGCAACAGCGCCTTACATCTCGCAGCAACCGCCAGAGATCCAAAGCTTTGGCGCATCACTGGAGATGCATTACGGTTGCAGTGGGAAGTTAAGTGGTAT
AAATACGTGAAGGAGTCTGTGCCACTCCATGTGTTTCCCCACTGGAACAAGGAAGGAAAAACTGCAAATGCACTCTTCCAGGAAACCCACGAGAAACTTGCGAAAAAGGG
TGGAAAGTGGCTTTACGATACCTCGGATTCATTCACTTTGGTGGCGACTCTGATTGCAACAGTAGCTTTTGCAACCGCAGTAACCATCCCAGGCGGCAACGACGAGAATG
GTGGTGCAATACTTGGACAGGAGCTAGGGTTTTCAATCTTCTCCTACTCTTCCCTAATTGCTCTCTTCCTCTCTTCAACCTCAGTCATCATGTTTCTAGCCATCATGACC
TCCAGGTTCAATATCAAAGACTTCGGGTTAGTGTTGCCCTGGAAATTGCTGATCGGCTTGTGCTGCCTCTACTTTTCCATCATCGCCATGTTGGTTTCCTTCTGTTCGGG
GCACTACTTTCTGATCAGCAACCGTCTTCGCAACGAAGCCATTCTGCTGTATTCACTTACTTTCATCCCAGTGGCAGTGATCTTCGCTGTTGTGCAGCTTCCTCTCTACT
TCGATTTGGTGCAGACTCTTCTCCAAACTGTGCCTACAAGAAGCGCCGAAGTGGTGCTACGCGATTTGATGCACAGTAGTAAAGCAAACGAAGAACATGATGATGATGAT
GAAAATGGGTCAACAGATATGTCGGAAACAGACGACTAA
Protein sequenceShow/hide protein sequence
MSSHGANTPLARLRENAMTRNWKPVIEAYEKTREAQKLKLNQEGDTALHLAVIDNQEETVKKLVESVKNDKNNYKHILEIRNERGNNPLHLAAMMGSMKMCHAIAHAHGE
LVHERNELDETPLYLAAGCGNKDVFFYLYFHCGEDSSKIETNCRVNKTGDTVLHHALRNDQFDLAFQLIYMHKEAINWVNEAGLTPIHVLASKPTSFKSGSHIKGLQRII
YHRSFVKPLQPRSIEKLKQESERSTEKAKETAPPSPFPVNYQTCIHFFRGMKDVFLRVWHFKCDNKNDTNNVGDEKKNNTKNDGDKRTGDLEEQKPLNTKSARDPPTTNF
PVNYTTCIDFFHIAFSAIMIIFGCGTNEIKRIRKEKEKHTWAVQVMNQLLELVKDYKYGEDGKSPMDFRFQNQKEEEKDTVPYDFVDDVVGFRHLFEEPKQLEVPKQSGK
PKDVEATESAMLLAAKNGVVEIVKGMSSRIPLAIYDTNKDKKNVVLLAAEYRQPEVYRFLLKERKRIESLFRAVDHKGNSALHLAATARDPKLWRITGDALRLQWEVKWY
KYVKESVPLHVFPHWNKEGKTANALFQETHEKLAKKGGKWLYDTSDSFTLVATLIATVAFATAVTIPGGNDENGGAILGQELGFSIFSYSSLIALFLSSTSVIMFLAIMT
SRFNIKDFGLVLPWKLLIGLCCLYFSIIAMLVSFCSGHYFLISNRLRNEAILLYSLTFIPVAVIFAVVQLPLYFDLVQTLLQTVPTRSAEVVLRDLMHSSKANEEHDDDD
ENGSTDMSETDD