; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021019 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021019
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationChr05:4593236..4614875
RNA-Seq ExpressionHG10021019
SyntenyHG10021019
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055385.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa]2.6e-30189.51Show/hide
Query:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM
        + SNSFH HLS DT+SSI+      P STA DDGQ+EA+A+ G+G RG + +TIDIAAVLE+QQ RNS+     +APSPREKTFLYRSALKGEWRRVES+
Subjt:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM

Query:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM
        I+KYPHYAR A+TKN+ETVLHVAAGAKQTGFVKELV+RMSP DMTMQNKYGNTALCFAATSGIVRIAQLIV KNEDLPLIRGFSNLTPLFMAVSYKRKLM
Subjt:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM

Query:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA
         TYLF+VTD++QLT EDQIELLIA+IHSDFFDISLQI  MNPNLATMKCTKNNNESALHVMARKPLA+GSATKQLSIWRK IKFGFNGK YDKNMMNIFA
Subjt:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA

Query:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
        REVVKFLWEYIVEEFEEKEMLEFIK+PTRLLHD TRAENVEFLI+LINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
Subjt:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY

Query:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
        SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQ+SKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
Subjt:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA

Query:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL
        AFTVPGG D+NTG+PIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLL+GLTSLFVSIVCMVVTFTA FFLLYQN  LWVPL
Subjt:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL

Query:  TVAVMTILPL
        TVAVMTILP+
Subjt:  TVAVMTILPL

KAA0055387.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa]0.0e+0058.06Show/hide
Query:  KIFLYQNALKGEWEYVELLLEESPHY----VLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHL
        K  L+Q AL G W  VE LL+    +    V+ SITR+KE +LHI+AG+K + FVEKL+++M+ D++ L+NK+GNTALCFAA SGVVR A++MV+KN  L
Subjt:  KIFLYQNALKGEWEYVELLLEESPHY----VLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHL

Query:  PLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAMDVTKELSS
        PLI GF N  TPLF+A+S K  EM SYLL +TD+++ N QEQ+ELLIATIHS+FYD+S  I +    LAI++D   N E AL VLARK SA+      + 
Subjt:  PLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAMDVTKELSS

Query:  WELYISS----WIYRKPVTKTLAHELVILLWTNVL-RNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENV
        W+  I++     IY K + K LA +LV  LW     +N P      FIK  TRLL+DAA  GNVEF+++LI   PDI+WE+DDDGK+IFH+AVENRLENV
Subjt:  WELYISS----WIYRKPVTKTLAHELVILLWTNVL-RNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENV

Query:  FNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFT
        FNLI+ I  + +F+ KY+T KG NYNILHLA  LAA NHLN+VSGAALQMQRELLWFKEVEKIVLPSQLEAK            D      KLTPR+LFT
Subjt:  FNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFT

Query:  QEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL
        +EH  LR+ GEEWMKNTANSCMLVATLI+TVVFAAAFTVPGG +N+ GTP+ ++++WFTVFVMSDA  L +SS+SILMF+S+LTSRYAEDDFLHSLP RL
Subjt:  QEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL

Query:  LFGLATLFISIVCMVVAFSATFFILYHKANICIPTIVSAMAILPEHAS--------NSFH-VHLSDTL---------------------------SSIVE
        LFGL  LF SIVCMVVAF+ATFF+LYH+A I +P  ++AMAI+P            ++FH   LS  L                             +VE
Subjt:  LFGLATLFISIVCMVVAFSATFFILYHKANICIPTIVSAMAILPEHAS--------NSFH-VHLSDTL---------------------------SSIVE

Query:  DDPVPPPS-TADDGQVEADASG--VGCRGLE-SRTIDIAAVLEDQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETV
         + + PPS TADD + E  ++     C   E SRTI       D + +S+   K    + R +  LY+SALKG+W+R E ++  YPHY R AIT+N+ETV
Subjt:  DDPVPPPS-TADDGQVEADASG--VGCRGLE-SRTIDIAAVLEDQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETV

Query:  LHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQI
        LHVAAGAKQ+ FV+ELV+RM+  DM +++KYGNTALCFAATS IV+IA+L+VEKN +LPLIR F   TPL +AVSYK + M++YL SVTDL QLT +++I
Subjt:  LHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQI

Query:  ELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKF--GFNGKY---------------------DKNMMN
        ELLIATIHSDF D+SL IL + P LA MK TKNNNE+ALHV+ARKP A+ S TKQL  W+  I     FN K                      +K++  
Subjt:  ELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKF--GFNGKY---------------------DKNMMN

Query:  IFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVW---EEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHR
          A ++V+FLW Y+V E  +KEMLEFI+HPT LL+D   A NVEFLI+LI  YPDI+W   +++D+ K+IF VA+ENRLENVFNLI+EIG LNEF+ K+R
Subjt:  IFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVW---EEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHR

Query:  LTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALIT
             YS+LH   NLA PN+LNRV+GAA QMQRE+LWFKEVEKIVLPSQL  KSNDP+P + KLTPR+LFTE HK LRK GEEWMKNTANSCM+VA LI+
Subjt:  LTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALIT

Query:  TVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNV
        TVVFAAAFTVPGG+D+NTG PIFQ+K WF +FV+SDA AL SSSTSILMF+SILTSRYAE+DFLHSLPS+LL GL SLF+SIV M + F++ FFL+Y N 
Subjt:  TVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNV

Query:  NLWVPLTVAVMTILPL
        N+ +P  V  M I+P+
Subjt:  NLWVPLTVAVMTILPL

KGN64473.1 hypothetical protein Csa_013828 [Cucumis sativus]1.2e-27482.76Show/hide
Query:  MKNRMRKSRSFSS-FPN-NNPFEEKMKMKKDEDEDDTT---------------LKRQGINVNAPLD----EINAATKIFLYQNALKGEWEYVELLLEESP
        MK RMRKS SF S  PN N+ FEE    + D+DE   T               L+    + + P D    +I AAT+IFLYQNALKGEWEYVELLL+ESP
Subjt:  MKNRMRKSRSFSS-FPN-NNPFEEKMKMKKDEDEDDTT---------------LKRQGINVNAPLD----EINAATKIFLYQNALKGEWEYVELLLEESP

Query:  HYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFNNAVTPLFIAVSYKCTEMVSYL
        + V  +ITRN+ETILHIAAGAKQ EFV KL++RM+ DDMILQN++GNTALCFAAASGVVRIA++MVEKN +LPLIRGFNNAVTPLFIAVSYKCTEMVSYL
Subjt:  HYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFNNAVTPLFIAVSYKCTEMVSYL

Query:  LSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKDMNEETALHVLARKPSAMDVTKELSSWELYISSWIYRKPVTKTLAHELVILLWTNV
        LS+TDL+QL  QEQIELLIATI SDFYDIS WI+QRYPYLAIM+D NEETALHV+ARKPSAMDVTK+LSSW L+++S IYRKPVTKTLAHELV+LL TNV
Subjt:  LSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKDMNEETALHVLARKPSAMDVTKELSSWELYISSWIYRKPVTKTLAHELVILLWTNV

Query:  LRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA
        LR LPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWED DDGKSIFHVA+ENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA
Subjt:  LRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA

Query:  APNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAA
         PNHLNRVSGAALQMQRE+LWFKEVEKIVLPSQLEAKSNV SSQ+L+ K NYPN PKLTPRQLFTQEH+DLRK GEEWMK+TANSCMLVATLISTVVFAA
Subjt:  APNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAA

Query:  AFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT
        AFTVPGGSNNN+GTPVFQQK WFTVF MSDAVALFSSSTSILMFMSILTSRYAEDDF+HSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT
Subjt:  AFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT

Query:  IVSAMAILP
        IVSAMAILP
Subjt:  IVSAMAILP

KGN64482.2 hypothetical protein Csa_013492 [Cucumis sativus]0.0e+0052.77Show/hide
Query:  INAATKIFLYQNALKGEWEYVELLLEESPHYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQH
        + +A K  L  +AL G+W+  E + ++        ITRN ETILHI+A  +  +FV+KL+  M+ D++ L+NK G+TALCFAA  G   IA+++VE N+ 
Subjt:  INAATKIFLYQNALKGEWEYVELLLEESPHYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQH

Query:  LPLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAM-DVTKEL
        LPLIRG +  +TPL+IAVSY+  +M SYLLS+TDL+QLN QE+  LLIA IHSDFY IS  I++  P LA M++   N+ETALHVLARK S +    +E+
Subjt:  LPLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAM-DVTKEL

Query:  SSWELYISSWIYRKPVTKTLAHELVILLWTNVLRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNL
        + W+  I      K   KTLA +LV  LW +VLR+L EK+ L FIKHPT LL+DAA  GNVEFLI+LIR YPDI+WE+DDD KSIFH+AVENRLENVFNL
Subjt:  SSWELYISSWIYRKPVTKTLAHELVILLWTNVLRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNL

Query:  INEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH
        I+EI    +F+ +Y+T    NYN+LHLA  L A NHLNRVSG ALQMQRELLW+KEVEKIVLP Q EAK N  S Q             LTPR+LFT+EH
Subjt:  INEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH

Query:  RDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFG
         +LRK GEEWMKNTANSCMLVATLI+TVVFAAAFTVPGG+++  G P  Q K WF VFV+SDAVAL SSSTSILMF+SILTSRY EDDFL SLPS+LL G
Subjt:  RDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFG

Query:  LATLFISIVCMVVAFSATFFILY-HKANICIPTIVSAMAILP---------EHASNSFH-----------------------------------------
        L  LFISIVCMVV FSATFF+LY H +NI +P  ++AMAI+P         + + ++FH                                         
Subjt:  LATLFISIVCMVVAFSATFFILY-HKANICIPTIVSAMAILP---------EHASNSFH-----------------------------------------

Query:  -------------------VHLSDTLSS----------IVEDDP-VPPPST-ADDGQVEADASG--VGCRGLE--SRTIDIAAVLEDQQRNSNAKIKIEA
                           + L+ T ++          +VE  P +PPPST ADD Q+   A+     C   E  S+TID A          ++ +K + 
Subjt:  -------------------VHLSDTLSS----------IVEDDP-VPPPST-ADDGQVEADASG--VGCRGLE--SRTIDIAAVLEDQQRNSNAKIKIEA

Query:  PS-------PREKTFLYRSALKGEWRRVESMIKKYPHY----ARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVR
        PS          ++ L+++AL G W  VE ++ K   +       +IT++EE +LH++AG+K +G V++L+ +MS  ++ ++NK+ NTALCFAA SG VR
Subjt:  PS-------PREKTFLYRSALKGEWRRVESMIKKYPHY----ARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVR

Query:  IAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKP
         A+LIV+KN +LPLI GF N TPLFMA+S KR+ M +YL  VTD+ +   ++Q ELLIA+IHS+F+D+S++I   N  LA ++   NNNE AL V+ARK 
Subjt:  IAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKP

Query:  LAVGSATKQLSIWRKYI-KFGFNGKYDKNMMNIFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIF
         A+G   K+ + W+K I    F G Y K+MM  FAR++VK LW    + F   +   FIK  TRLLHD  +A NVEFLI+LI   PDIVWEEDD+GKTIF
Subjt:  LAVGSATKQLSIWRKYI-KFGFNGKYDKNMMNIFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIF

Query:  DVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFT
         +A+ENRLENVFNLI    G+ +F+ K++    NY++LH  A LA  N+LN+V+GAA QMQRELLWFKEVEKIVLPSQL AK +  +  + KLTPRELFT
Subjt:  DVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFT

Query:  ENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRL
        + H  LR+ GEEWMKNTANSCM+VA LI TVVFAAAFT+PGG + +TG PI + + WFTVFV+SDAA L++SS+SIL+F+S+LTSRYAEDDFLHSLP RL
Subjt:  ENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRL

Query:  LVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPLTVAVMTILPL
        L GLT LF SIVCMV+ FTA FFL+Y   N+ VP+T+A M I+P+
Subjt:  LVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPLTVAVMTILPL

XP_011652459.1 ankyrin repeat-containing protein ITN1 isoform X1 [Cucumis sativus]4.1e-30289.18Show/hide
Query:  EHASNSFHVHLS-DTLSSIVEDDPVPPPSTADDGQVEADA-SGVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM
        ++ASNSFH HLS DT+SSI     +PPPSTADDGQ+E DA  G+G RG + +TIDIAAVLE+QQ RNS+     +APSPREKTFLYRSALKGEWRRVES+
Subjt:  EHASNSFHVHLS-DTLSSIVEDDPVPPPSTADDGQVEADA-SGVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM

Query:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM
        I++YPHYAR AITKN+ETVLHVAAGAKQTGFVKELV+RMSP DMTM NKYGNTALCFAATSGIVRIAQLIV KNEDLPL+RGFSNLTPLFMAVSYKRKLM
Subjt:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM

Query:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA
         TYLF VTD+YQLT EDQIELLIA+IHSDFFDISLQI+ MNPNLATMKC KNNNESALHVMARKPLA+GSATKQLSIWRK I FGFNGK YDKNMMNIFA
Subjt:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA

Query:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
        REVVK+LWEYIVEEFEEKEMLEFIKHPTRLLH  TRAENVEFLI+LINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
Subjt:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY

Query:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
        SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLM KSNDPNPQ+SKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
Subjt:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA

Query:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL
        AFTVPGG D+NTG+PIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLL+GLTSLFVSIVCMVVTFTA FFLLYQN  LWVPL
Subjt:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL

Query:  TVAVMTILPL
        TVAVMTILP+
Subjt:  TVAVMTILPL

TrEMBL top hitse value%identityAlignment
A0A0A0LRT6 ANK_REP_REGION domain-containing protein6.0e-27582.76Show/hide
Query:  MKNRMRKSRSFSS-FPN-NNPFEEKMKMKKDEDEDDTT---------------LKRQGINVNAPLD----EINAATKIFLYQNALKGEWEYVELLLEESP
        MK RMRKS SF S  PN N+ FEE    + D+DE   T               L+    + + P D    +I AAT+IFLYQNALKGEWEYVELLL+ESP
Subjt:  MKNRMRKSRSFSS-FPN-NNPFEEKMKMKKDEDEDDTT---------------LKRQGINVNAPLD----EINAATKIFLYQNALKGEWEYVELLLEESP

Query:  HYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFNNAVTPLFIAVSYKCTEMVSYL
        + V  +ITRN+ETILHIAAGAKQ EFV KL++RM+ DDMILQN++GNTALCFAAASGVVRIA++MVEKN +LPLIRGFNNAVTPLFIAVSYKCTEMVSYL
Subjt:  HYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFNNAVTPLFIAVSYKCTEMVSYL

Query:  LSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKDMNEETALHVLARKPSAMDVTKELSSWELYISSWIYRKPVTKTLAHELVILLWTNV
        LS+TDL+QL  QEQIELLIATI SDFYDIS WI+QRYPYLAIM+D NEETALHV+ARKPSAMDVTK+LSSW L+++S IYRKPVTKTLAHELV+LL TNV
Subjt:  LSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKDMNEETALHVLARKPSAMDVTKELSSWELYISSWIYRKPVTKTLAHELVILLWTNV

Query:  LRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA
        LR LPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWED DDGKSIFHVA+ENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA
Subjt:  LRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLA

Query:  APNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAA
         PNHLNRVSGAALQMQRE+LWFKEVEKIVLPSQLEAKSNV SSQ+L+ K NYPN PKLTPRQLFTQEH+DLRK GEEWMK+TANSCMLVATLISTVVFAA
Subjt:  APNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAA

Query:  AFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT
        AFTVPGGSNNN+GTPVFQQK WFTVF MSDAVALFSSSTSILMFMSILTSRYAEDDF+HSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT
Subjt:  AFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKANICIPT

Query:  IVSAMAILP
        IVSAMAILP
Subjt:  IVSAMAILP

A0A0A0LUL7 ANK_REP_REGION domain-containing protein2.0e-30289.18Show/hide
Query:  EHASNSFHVHLS-DTLSSIVEDDPVPPPSTADDGQVEADA-SGVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM
        ++ASNSFH HLS DT+SSI     +PPPSTADDGQ+E DA  G+G RG + +TIDIAAVLE+QQ RNS+     +APSPREKTFLYRSALKGEWRRVES+
Subjt:  EHASNSFHVHLS-DTLSSIVEDDPVPPPSTADDGQVEADA-SGVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM

Query:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM
        I++YPHYAR AITKN+ETVLHVAAGAKQTGFVKELV+RMSP DMTM NKYGNTALCFAATSGIVRIAQLIV KNEDLPL+RGFSNLTPLFMAVSYKRKLM
Subjt:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM

Query:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA
         TYLF VTD+YQLT EDQIELLIA+IHSDFFDISLQI+ MNPNLATMKC KNNNESALHVMARKPLA+GSATKQLSIWRK I FGFNGK YDKNMMNIFA
Subjt:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA

Query:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
        REVVK+LWEYIVEEFEEKEMLEFIKHPTRLLH  TRAENVEFLI+LINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
Subjt:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY

Query:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
        SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLM KSNDPNPQ+SKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
Subjt:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA

Query:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL
        AFTVPGG D+NTG+PIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLL+GLTSLFVSIVCMVVTFTA FFLLYQN  LWVPL
Subjt:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL

Query:  TVAVMTILPL
        TVAVMTILP+
Subjt:  TVAVMTILPL

A0A1S3B243 uncharacterized protein LOC103484975 isoform X15.6e-23388.33Show/hide
Query:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM
        + SNSFH HLS DT+SSI+      P STA DDGQ+EA+A+ G+G RG + +TIDIAAVLE+QQ RNS+     +APSPREKTFLYRSALKGEWRRVES+
Subjt:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM

Query:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM
        I+KYPHYAR A+TKN+ETVLHVAAGAKQTGFVKELV+RMSP DMTMQNKYGNTALCFAATSGIVRIAQLIV KNEDLPLIRGFSNLTPLFMAVSYKRKLM
Subjt:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM

Query:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA
         TYLF+VTD++QLT EDQIELLIA+IHSDFFDISLQI  MNPNLATMKCTKNNNESALHVMARKPLA+GSATKQLSIWRK IKFGFNGK YDKNMMNIFA
Subjt:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA

Query:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
        REVVKFLWEYIVEEFEEKEMLEFIK+PTRLLHD TRAENVEFLI+LINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
Subjt:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY

Query:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMK
        SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQ+SKLTPRELFTENHKDLRKAGEEWMK
Subjt:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMK

A0A5A7ULD5 Ankyrin repeat-containing protein0.0e+0058.06Show/hide
Query:  KIFLYQNALKGEWEYVELLLEESPHY----VLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHL
        K  L+Q AL G W  VE LL+    +    V+ SITR+KE +LHI+AG+K + FVEKL+++M+ D++ L+NK+GNTALCFAA SGVVR A++MV+KN  L
Subjt:  KIFLYQNALKGEWEYVELLLEESPHY----VLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHL

Query:  PLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAMDVTKELSS
        PLI GF N  TPLF+A+S K  EM SYLL +TD+++ N QEQ+ELLIATIHS+FYD+S  I +    LAI++D   N E AL VLARK SA+      + 
Subjt:  PLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLAIMKD--MNEETALHVLARKPSAMDVTKELSS

Query:  WELYISS----WIYRKPVTKTLAHELVILLWTNVL-RNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENV
        W+  I++     IY K + K LA +LV  LW     +N P      FIK  TRLL+DAA  GNVEF+++LI   PDI+WE+DDDGK+IFH+AVENRLENV
Subjt:  WELYISS----WIYRKPVTKTLAHELVILLWTNVL-RNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENV

Query:  FNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFT
        FNLI+ I  + +F+ KY+T KG NYNILHLA  LAA NHLN+VSGAALQMQRELLWFKEVEKIVLPSQLEAK            D      KLTPR+LFT
Subjt:  FNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFT

Query:  QEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL
        +EH  LR+ GEEWMKNTANSCMLVATLI+TVVFAAAFTVPGG +N+ GTP+ ++++WFTVFVMSDA  L +SS+SILMF+S+LTSRYAEDDFLHSLP RL
Subjt:  QEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL

Query:  LFGLATLFISIVCMVVAFSATFFILYHKANICIPTIVSAMAILPEHAS--------NSFH-VHLSDTL---------------------------SSIVE
        LFGL  LF SIVCMVVAF+ATFF+LYH+A I +P  ++AMAI+P            ++FH   LS  L                             +VE
Subjt:  LFGLATLFISIVCMVVAFSATFFILYHKANICIPTIVSAMAILPEHAS--------NSFH-VHLSDTL---------------------------SSIVE

Query:  DDPVPPPS-TADDGQVEADASG--VGCRGLE-SRTIDIAAVLEDQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETV
         + + PPS TADD + E  ++     C   E SRTI       D + +S+   K    + R +  LY+SALKG+W+R E ++  YPHY R AIT+N+ETV
Subjt:  DDPVPPPS-TADDGQVEADASG--VGCRGLE-SRTIDIAAVLEDQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETV

Query:  LHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQI
        LHVAAGAKQ+ FV+ELV+RM+  DM +++KYGNTALCFAATS IV+IA+L+VEKN +LPLIR F   TPL +AVSYK + M++YL SVTDL QLT +++I
Subjt:  LHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQI

Query:  ELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKF--GFNGKY---------------------DKNMMN
        ELLIATIHSDF D+SL IL + P LA MK TKNNNE+ALHV+ARKP A+ S TKQL  W+  I     FN K                      +K++  
Subjt:  ELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKF--GFNGKY---------------------DKNMMN

Query:  IFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVW---EEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHR
          A ++V+FLW Y+V E  +KEMLEFI+HPT LL+D   A NVEFLI+LI  YPDI+W   +++D+ K+IF VA+ENRLENVFNLI+EIG LNEF+ K+R
Subjt:  IFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVW---EEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHR

Query:  LTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALIT
             YS+LH   NLA PN+LNRV+GAA QMQRE+LWFKEVEKIVLPSQL  KSNDP+P + KLTPR+LFTE HK LRK GEEWMKNTANSCM+VA LI+
Subjt:  LTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALIT

Query:  TVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNV
        TVVFAAAFTVPGG+D+NTG PIFQ+K WF +FV+SDA AL SSSTSILMF+SILTSRYAE+DFLHSLPS+LL GL SLF+SIV M + F++ FFL+Y N 
Subjt:  TVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNV

Query:  NLWVPLTVAVMTILPL
        N+ +P  V  M I+P+
Subjt:  NLWVPLTVAVMTILPL

A0A5A7UP23 Ankyrin repeat-containing protein1.3e-30189.51Show/hide
Query:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM
        + SNSFH HLS DT+SSI+      P STA DDGQ+EA+A+ G+G RG + +TIDIAAVLE+QQ RNS+     +APSPREKTFLYRSALKGEWRRVES+
Subjt:  HASNSFHVHLS-DTLSSIVEDDPVPPPSTA-DDGQVEADAS-GVGCRGLESRTIDIAAVLEDQQ-RNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESM

Query:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM
        I+KYPHYAR A+TKN+ETVLHVAAGAKQTGFVKELV+RMSP DMTMQNKYGNTALCFAATSGIVRIAQLIV KNEDLPLIRGFSNLTPLFMAVSYKRKLM
Subjt:  IKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLM

Query:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA
         TYLF+VTD++QLT EDQIELLIA+IHSDFFDISLQI  MNPNLATMKCTKNNNESALHVMARKPLA+GSATKQLSIWRK IKFGFNGK YDKNMMNIFA
Subjt:  VTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGK-YDKNMMNIFA

Query:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
        REVVKFLWEYIVEEFEEKEMLEFIK+PTRLLHD TRAENVEFLI+LINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY
Subjt:  REVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNY

Query:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
        SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQ+SKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA
Subjt:  SMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAA

Query:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL
        AFTVPGG D+NTG+PIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLL+GLTSLFVSIVCMVVTFTA FFLLYQN  LWVPL
Subjt:  AFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPL

Query:  TVAVMTILPL
        TVAVMTILP+
Subjt:  TVAVMTILPL

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR41.8e-1025.09Show/hide
Query:  LNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINE-------IGRLNEFTAKYRTFKGRNYNILHLAGHL--AAPNHLNRVSG
        L+ AA  G+VE +  L+ K P +   +D  G++  H+AV+    +V   + +       +   N  TA +   + +   I+ +   L     N L R   
Subjt:  LNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINE-------IGRLNEFTAKYRTFKGRNYNILHLAGHL--AAPNHLNRVSG

Query:  AALQMQRELLWFKEVEKI--VLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH---------RDLRKAGEEWMKNTANSCMLVATLISTVVFA
         A  +   L   +E  +I  +L      +S  ++  R +++     I K    QL              ++LRK   E + N  NS  +VA L +TV FA
Subjt:  AALQMQRELLWFKEVEKI--VLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH---------RDLRKAGEEWMKNTANSCMLVATLISTVVFA

Query:  AAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFIL
        A FTVPGG N N+G  V  Q   F +F + +A+ALF+S   +++ ++++      +  +  + ++L      ++++ VC  ++F A+ +I+
Subjt:  AAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFIL

Q1RI31 Putative ankyrin repeat protein RBE_09022.5e-0427.33Show/hide
Query:  TFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPL--I
        T L  +A  G  +  E++I K      + +T N +TVL +AA        + L+ +M+   +   NK GNTAL  AA+S + +I + ++ K  D  +  I
Subjt:  TFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPL--I

Query:  RGFSNLTPLFMAVSYKRKLMVTYLFSVTD--LYQLTHEDQIELLIATIHS
          + N   +  A S   K+  T +  +T+  + Q  H+    L+ A  +S
Subjt:  RGFSNLTPLFMAVSYKRKLMVTYLFSVTD--LYQLTHEDQIELLIATIHS

Q9C7A2 Ankyrin repeat-containing protein ITN16.2e-1125.68Show/hide
Query:  LNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEF-------TAKYRTFKGRNYNILHLAGHL--AAPNHLNRVSG
        L+ AA  G+VE +  L+ K P +    D  G++  H+AV+ +   V  L+ +              TA +   + +   I+ L   L     N L R   
Subjt:  LNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINEIGRLNEF-------TAKYRTFKGRNYNILHLAGHL--AAPNHLNRVSG

Query:  AALQMQRELLWFKEVE--KIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH---------RDLRKAGEEWMKNTANSCMLVATLISTVVFA
         AL +   L   +E    K  L      ++N ++  R +++     I      QL   +          ++LRK   E + N  NS  +VA L +TV FA
Subjt:  AALQMQRELLWFKEVE--KIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEH---------RDLRKAGEEWMKNTANSCMLVATLISTVVFA

Query:  AAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKAN
        A FTVPGG +NNDG+ V   +  F +F + +A+ALF+S   +++ ++++      +  +  + ++L      ++++ +C  VAF A+ +I+  + N
Subjt:  AAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLATLFISIVCMVVAFSATFFILYHKAN

Arabidopsis top hitse value%identityAlignment
AT1G31130.1 unknown protein2.7e-8669.32Show/hide
Query:  QTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFKRLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNYFLFFFS
        ++RH WT LL FQF YLIFLFAFSLLSTAAVVFTVASLYT KPVS+SSTLSAIPKVFKRLF+TFLWV+LLM  YN +F  FLV+L++A+D  +  L   +
Subjt:  QTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFKRLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNYFLFFFS

Query:  VIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAYLAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVCVLVIVN
         ++I +L+  VH+Y TALWHL SV+SVLEP+YG AAM+K+YELLKGKT+ A  L+F YL +C  I V+FGA VVHGG  YG F R +VGG LV VLV+VN
Subjt:  VIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAYLAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVCVLVIVN

Query:  LVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD
        LVGLLVQSVFYYVCKS+HHQ IDK AL+D LGGYLG+YVPLKS+IQ+++LD
Subjt:  LVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD

AT3G18670.1 Ankyrin repeat family protein5.8e-7334.62Show/hide
Query:  ESRTIDIAAVLE-DQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPID--MTMQ
        +S  ID AA L+ D+ R  N+   +          L+++   GE    +  + + P    + +T N +T +H A  +     V+E++ R+   +  + ++
Subjt:  ESRTIDIAAVLE-DQQRNSNAKIKIEAPSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPID--MTMQ

Query:  NKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQIE---------LLIATIHSDFFDISLQIL
        N  G TAL +AAT GIVRIA+ +V K   L  +R      P+ +A  Y  K +V YL+S T L  L   D  +         L+   I    + I+L ++
Subjt:  NKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSNLTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQIE---------LLIATIHSDFFDISLQIL

Query:  TMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGKYDKNMMNIFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAEN
           P LA  +   ++N++A+  +A+ P A  S  +   I R+  K        K +++   +E+ KF      +  ++K          + L        
Subjt:  TMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGKYDKNMMNIFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAEN

Query:  VEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGG-LNEFAMKHRLTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKI
        VE++  ++  YPDIVW ++  G  IF  A+  R E +F+LI  IG   N  A    + + N  MLH  A  A  + LN + GAA QMQREL WFKEVEK+
Subjt:  VEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGG-LNEFAMKHRLTNRNYSMLHTVANLATPNNLNRVTGAAFQMQRELLWFKEVEKI

Query:  VLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSS
        V P     K         K TP+ LFT+ HKDL + GE+WMK TA SC +VAALITT++F++AFTVPGG   + G P++ H+  F +F+ISDA +L +S 
Subjt:  VLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAAAFTVPGGSDNNTGHPIFQHKPWFTVFVISDAAALVSSS

Query:  TSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPLTVAVMTILPL
         S+LMF+ IL SRY E+DFL SLP++L+VGL +LF+S+  M+VTF      L      WV      + ++PL
Subjt:  TSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPLTVAVMTILPL

AT3G54070.1 Ankyrin repeat family protein4.0e-9037.77Show/hide
Query:  LYQNALKGEWEYVELLLEESPHYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFN
        +Y+  L G+W+    L+      V+  IT N E  LHIA  AK  +FV  L+  M+  D+ L+NK GNT L FAAA G +  A++++   + LP I    
Subjt:  LYQNALKGEWEYVELLLEESPHYVLCSITRNKETILHIAAGAKQTEFVEKLVDRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFN

Query:  NAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFY----DISWWIVQRYPY----LAIMKDMNEETALHVLARKPSAMDVTKELSSW
          +TP+ IA  Y   EMV YL S T +  LN Q+ + L    I +D Y    D+  W+++R       LA+  + N+  ALH+LARK SA+    +L+ +
Subjt:  NAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFY----DISWWIVQRYPY----LAIMKDMNEETALHVLARKPSAMDVTKELSSW

Query:  ELYISSWIYRKPVTKTLAHELVILLWTNVLRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINE
        +   SSW                                        LL DAA  GNVE L++LIR + D++W  D++ +++FHVA   R EN+F+LI E
Subjt:  ELYISSWIYRKPVTKTLAHELVILLWTNVLRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAVENRLENVFNLINE

Query:  IGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDL
        +G + +  A Y+  + ++  +LHL   L   N     SGAAL MQ+ELLWFK V++IV  S +E K                N        +FT++H +L
Subjt:  IGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPRQLFTQEHRDL

Query:  RKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSN------NNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL
        RK GE WMK TA +CML ATLI+TVVFAAA T+PGG++      N  G P F+++L F +F +SD+VALFSS  SI++F+SI TSRYAE+DF + LP++L
Subjt:  RKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSN------NNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRL

Query:  LFGLATLFISIVCMVVAFSATFFIL-YHKANICIPTIVSAMAILPEHASNSFHVHL-SDTLSSI
        +FGL+ LFISI+ M++AF+ +  ++   KA++ +  ++S +A L        + HL  +TL S+
Subjt:  LFGLATLFISIVCMVVAFSATFFIL-YHKANICIPTIVSAMAILPEHASNSFHVHL-SDTLSSI

AT4G19950.1 unknown protein1.9e-9568.6Show/hide
Query:  VVTFTAIF---FLLYQNVNLWVPLTVAVMTILPLQNPFADPIQTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFK
        ++T T IF   F +  +     P+   + T      P AD  Q +H WT LL FQFCY+IFLFAFSLLSTAAVVFTVASLYT KPVS+SST+SAIP V K
Subjt:  VVTFTAIF---FLLYQNVNLWVPLTVAVMTILPLQNPFADPIQTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFK

Query:  RLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNYFLFFFSVIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAY
        RLF+TFLWVSLLM+ YN +FL FLV L++A+D QN  L  FS++VIF+LFLVVH+Y+TALWHLASVVSVLEPIYG AAMKKSYELLKGKT  A  +VF Y
Subjt:  RLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNYFLFFFSVIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAY

Query:  LAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVCVLVIVNLVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD
        L  C  I+ +FGA VV GGD YG+F RI+ GGFLV VLVIVNL+GLLVQSVFYYVCKSFHHQ IDK ALHDHLGGYLGEYVPLKS+IQM+N +
Subjt:  LAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVCVLVIVNLVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD

AT5G44860.1 unknown protein1.3e-9372.37Show/hide
Query:  PFADPIQTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFKRLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNY
        P +D  +T H WT LL +QF Y+IFLFAFSLLSTAAVVFTVASLYT KPVS+SST+SAIP V KRLF+TFLWVSL+M++YN +FL FLV+L++AID Q+ 
Subjt:  PFADPIQTRHRWTELLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFKRLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNY

Query:  FLFFFSVIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAYLAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVC
         L  FS++VIF+LFL VH+Y+TA WHLASVVSVLEPIYG AAMKKSYELL G+T  A  +VF YLA+C   + +FG  VVHGGD +G+F +I+VGGFLV 
Subjt:  FLFFFSVIVIFILFLVVHLYITALWHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAYLAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVC

Query:  VLVIVNLVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD
        +LVIVNLVGLLVQSVFYYVCKSFHHQ IDK ALHDHLGGYLG+YVPLKSSIQM+N D
Subjt:  VLVIVNLVGLLVQSVFYYVCKSFHHQGIDKLALHDHLGGYLGEYVPLKSSIQMDNLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAATCGTATGAGAAAATCAAGGAGTTTTTCTTCATTTCCAAATAATAATCCATTTGAGGAAAAGATGAAGATGAAGAAAGATGAAGATGAAGATGACACCACTCT
GAAGAGGCAGGGCATAAACGTCAATGCACCCTTAGATGAGATTAATGCTGCTACAAAAATATTTCTGTATCAAAATGCACTGAAGGGTGAGTGGGAATATGTGGAATTAC
TGCTTGAGGAGAGCCCACATTATGTTCTTTGTTCAATAACAAGAAACAAAGAGACTATTCTTCACATTGCTGCAGGAGCCAAACAAACTGAATTCGTAGAGAAACTGGTG
GATAGAATGAATTATGATGACATGATTTTGCAGAACAAATATGGAAACACAGCCCTTTGTTTTGCTGCTGCATCAGGAGTTGTAAGAATTGCTCAAGTAATGGTTGAAAA
GAACCAACATCTTCCACTTATTCGTGGCTTCAACAATGCTGTCACCCCACTTTTCATAGCTGTCTCATATAAATGCACAGAAATGGTTTCTTATCTCTTGTCTATCACTG
ATCTTGATCAACTAAACCACCAAGAACAGATTGAGCTTCTCATTGCCACCATACACAGCGACTTTTATGATATAAGTTGGTGGATTGTGCAAAGATATCCATATTTAGCA
ATTATGAAGGACATGAATGAAGAGACAGCATTGCATGTGTTGGCCAGAAAACCGTCTGCCATGGACGTCACCAAGGAGCTAAGCAGTTGGGAATTGTACATCAGCTCTTG
GATCTATCGTAAACCTGTGACAAAGACACTAGCTCATGAACTAGTTATTTTGCTATGGACCAATGTTCTACGGAATTTGCCAGAAAAGAAGATGCTTCAGTTCATTAAAC
ATCCCACAAGATTACTAAATGATGCTGCATGTACAGGAAATGTTGAGTTTTTAATTGTCCTCATTCGTAAGTATCCAGATATAATATGGGAAGACGATGATGATGGTAAA
AGCATTTTTCATGTCGCCGTCGAAAATCGACTTGAGAATGTCTTCAATCTAATAAATGAGATTGGTAGGCTCAATGAGTTCACCGCAAAATATCGAACATTCAAGGGACG
AAATTATAACATTCTCCATTTGGCTGGACATCTAGCTGCTCCAAACCATCTTAATAGGGTCTCTGGAGCTGCCCTTCAAATGCAACGTGAATTGTTATGGTTCAAGGAAG
TAGAGAAGATAGTTTTGCCTTCTCAACTTGAAGCAAAATCCAATGTTATGTCTTCCCAACGACTCAAGGTGAAAGACAATTATCCAAACATTCCAAAGTTGACACCACGC
CAATTATTCACCCAAGAGCACAGAGATCTTCGCAAAGCTGGTGAGGAGTGGATGAAAAACACAGCAAACTCTTGCATGCTCGTCGCAACTTTGATCTCCACTGTAGTTTT
TGCAGCAGCTTTCACAGTTCCGGGCGGGAGTAATAACAATGACGGCACTCCTGTTTTTCAACAGAAGCTTTGGTTCACTGTGTTTGTAATGTCGGATGCAGTCGCTTTGT
TTTCGTCCTCAACTTCTATCCTAATGTTCATGTCGATCCTGACTTCGCGCTATGCCGAAGATGATTTTCTGCATTCATTGCCATCAAGATTGCTCTTTGGACTGGCCACA
CTATTCATATCCATTGTGTGTATGGTTGTGGCCTTTAGTGCTACCTTCTTTATCCTCTACCATAAGGCTAATATTTGTATTCCCACTATAGTCTCTGCCATGGCGATTCT
TCCAGAACATGCTTCTAATTCCTTCCATGTCCACTTGTCGGATACCCTTTCTTCCATTGTCGAAGATGATCCTGTTCCTCCGCCGTCTACGGCTGACGACGGCCAAGTGG
AGGCGGATGCCAGTGGGGTCGGTTGCAGGGGATTGGAGAGCCGGACAATAGACATTGCAGCAGTTTTGGAAGATCAACAACGGAATTCTAATGCTAAGATCAAGATCGAA
GCTCCATCACCTCGGGAGAAAACGTTTCTGTATCGATCTGCACTTAAGGGTGAGTGGAGAAGGGTGGAATCAATGATTAAGAAGTATCCACATTATGCTCGTTCTGCAAT
TACAAAAAACGAAGAGACTGTTCTTCATGTTGCTGCTGGAGCCAAGCAAACCGGGTTTGTAAAGGAGCTGGTTAACAGAATGAGCCCCATTGACATGACTATGCAAAATA
AATATGGAAATACTGCCCTTTGCTTTGCTGCTACATCTGGAATCGTAAGAATTGCACAGTTAATAGTGGAGAAAAATGAAGATCTTCCACTCATTCGTGGCTTCAGTAAC
CTCACTCCACTTTTCATGGCGGTATCTTATAAACGCAAATTAATGGTTACTTACCTCTTCTCTGTTACTGATCTTTATCAACTCACGCACGAAGACCAGATTGAGCTTCT
TATTGCCACCATACATAGCGACTTTTTTGATATAAGTTTGCAGATTCTAACAATGAATCCAAATTTAGCAACTATGAAATGCACAAAGAACAACAATGAAAGTGCATTGC
ATGTGATGGCTAGAAAGCCACTAGCAGTCGGTAGCGCCACAAAGCAGTTGAGCATTTGGAGAAAGTACATCAAGTTTGGTTTCAATGGGAAGTACGATAAAAATATGATG
AATATATTTGCTCGTGAAGTAGTTAAATTTCTATGGGAATATATTGTTGAGGAGTTTGAAGAAAAGGAAATGTTGGAATTCATTAAACATCCGACAAGATTACTACATGA
TGGTACAAGAGCAGAAAATGTTGAGTTCTTGATTTTGCTCATTAACTTATACCCAGATATAGTGTGGGAAGAAGATGATGAAGGTAAGACCATATTTGATGTGGCTATCG
AAAATCGACTTGAAAATGTGTTCAATCTCATAGATGAAATTGGTGGGCTTAATGAGTTTGCAATGAAACATAGATTAACTAATAGAAACTACAGCATGTTACATACTGTT
GCAAATCTTGCTACTCCAAACAATCTCAATAGGGTCACCGGAGCTGCCTTTCAAATGCAACGTGAATTGCTTTGGTTCAAGGAGGTGGAGAAGATAGTTCTACCTTCTCA
ACTCATGGCCAAATCCAATGATCCAAATCCACAAGTATCCAAATTAACCCCACGTGAGTTATTCACCGAAAATCACAAAGATCTTCGCAAAGCTGGTGAGGAATGGATGA
AAAACACAGCAAACTCATGCATGATCGTTGCAGCTCTAATCACCACCGTCGTTTTCGCCGCAGCCTTCACCGTTCCTGGCGGATCTGATAATAATACAGGCCATCCTATT
TTTCAGCACAAGCCTTGGTTCACTGTGTTCGTGATATCAGATGCAGCTGCTTTGGTTTCATCTTCAACTTCTATTTTAATGTTCATGTCAATCCTAACTTCGCGTTACGC
AGAAGACGATTTCTTGCACTCGCTGCCTTCCCGATTACTCGTCGGCCTCACGTCCTTATTTGTGTCTATAGTGTGCATGGTTGTGACCTTCACTGCAATCTTCTTTTTGC
TCTACCAAAATGTCAACTTATGGGTTCCACTCACTGTTGCTGTAATGACAATTCTTCCACTTCAAAACCCTTTCGCCGACCCCATTCAAACCCGCCATAGATGGACCGAG
CTCCTCTCCTTTCAATTCTGCTACCTCATCTTCCTCTTCGCTTTCTCTCTTCTCTCCACCGCGGCCGTCGTCTTCACCGTCGCCTCCCTCTACACTTCCAAACCCGTCTC
TTATTCCTCCACTCTCTCCGCTATTCCCAAAGTCTTCAAACGCCTGTTCGTCACTTTCCTCTGGGTTTCTCTCCTCATGATTATCTATAACTTCATCTTTCTCGCCTTCT
TGGTACTCCTTGTCCTTGCAATCGATACCCAGAATTACTTCCTGTTCTTCTTCTCTGTTATTGTAATCTTCATCCTGTTTCTAGTTGTACATCTTTACATCACCGCCTTG
TGGCATTTGGCTAGCGTCGTGTCGGTGCTTGAACCGATTTATGGATTTGCTGCCATGAAAAAGAGCTACGAATTGCTCAAGGGGAAAACCCGATTTGCGGGTGTGCTTGT
ATTTGCTTATTTAGCCATTTGTGCTACAATTAGTGTCATATTTGGGGCTGCAGTCGTGCACGGTGGGGATGGATATGGGGTTTTTGTGAGGATTATTGTTGGTGGCTTCT
TGGTTTGCGTTTTGGTGATTGTGAACTTGGTGGGATTGCTTGTTCAGAGTGTGTTTTACTATGTCTGCAAGAGCTTCCATCATCAAGGAATTGATAAGCTGGCTTTACAT
GATCATTTGGGTGGATATCTTGGTGAGTATGTACCTCTCAAAAGTAGTATTCAGATGGATAATTTAGATGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAATCGTATGAGAAAATCAAGGAGTTTTTCTTCATTTCCAAATAATAATCCATTTGAGGAAAAGATGAAGATGAAGAAAGATGAAGATGAAGATGACACCACTCT
GAAGAGGCAGGGCATAAACGTCAATGCACCCTTAGATGAGATTAATGCTGCTACAAAAATATTTCTGTATCAAAATGCACTGAAGGGTGAGTGGGAATATGTGGAATTAC
TGCTTGAGGAGAGCCCACATTATGTTCTTTGTTCAATAACAAGAAACAAAGAGACTATTCTTCACATTGCTGCAGGAGCCAAACAAACTGAATTCGTAGAGAAACTGGTG
GATAGAATGAATTATGATGACATGATTTTGCAGAACAAATATGGAAACACAGCCCTTTGTTTTGCTGCTGCATCAGGAGTTGTAAGAATTGCTCAAGTAATGGTTGAAAA
GAACCAACATCTTCCACTTATTCGTGGCTTCAACAATGCTGTCACCCCACTTTTCATAGCTGTCTCATATAAATGCACAGAAATGGTTTCTTATCTCTTGTCTATCACTG
ATCTTGATCAACTAAACCACCAAGAACAGATTGAGCTTCTCATTGCCACCATACACAGCGACTTTTATGATATAAGTTGGTGGATTGTGCAAAGATATCCATATTTAGCA
ATTATGAAGGACATGAATGAAGAGACAGCATTGCATGTGTTGGCCAGAAAACCGTCTGCCATGGACGTCACCAAGGAGCTAAGCAGTTGGGAATTGTACATCAGCTCTTG
GATCTATCGTAAACCTGTGACAAAGACACTAGCTCATGAACTAGTTATTTTGCTATGGACCAATGTTCTACGGAATTTGCCAGAAAAGAAGATGCTTCAGTTCATTAAAC
ATCCCACAAGATTACTAAATGATGCTGCATGTACAGGAAATGTTGAGTTTTTAATTGTCCTCATTCGTAAGTATCCAGATATAATATGGGAAGACGATGATGATGGTAAA
AGCATTTTTCATGTCGCCGTCGAAAATCGACTTGAGAATGTCTTCAATCTAATAAATGAGATTGGTAGGCTCAATGAGTTCACCGCAAAATATCGAACATTCAAGGGACG
AAATTATAACATTCTCCATTTGGCTGGACATCTAGCTGCTCCAAACCATCTTAATAGGGTCTCTGGAGCTGCCCTTCAAATGCAACGTGAATTGTTATGGTTCAAGGAAG
TAGAGAAGATAGTTTTGCCTTCTCAACTTGAAGCAAAATCCAATGTTATGTCTTCCCAACGACTCAAGGTGAAAGACAATTATCCAAACATTCCAAAGTTGACACCACGC
CAATTATTCACCCAAGAGCACAGAGATCTTCGCAAAGCTGGTGAGGAGTGGATGAAAAACACAGCAAACTCTTGCATGCTCGTCGCAACTTTGATCTCCACTGTAGTTTT
TGCAGCAGCTTTCACAGTTCCGGGCGGGAGTAATAACAATGACGGCACTCCTGTTTTTCAACAGAAGCTTTGGTTCACTGTGTTTGTAATGTCGGATGCAGTCGCTTTGT
TTTCGTCCTCAACTTCTATCCTAATGTTCATGTCGATCCTGACTTCGCGCTATGCCGAAGATGATTTTCTGCATTCATTGCCATCAAGATTGCTCTTTGGACTGGCCACA
CTATTCATATCCATTGTGTGTATGGTTGTGGCCTTTAGTGCTACCTTCTTTATCCTCTACCATAAGGCTAATATTTGTATTCCCACTATAGTCTCTGCCATGGCGATTCT
TCCAGAACATGCTTCTAATTCCTTCCATGTCCACTTGTCGGATACCCTTTCTTCCATTGTCGAAGATGATCCTGTTCCTCCGCCGTCTACGGCTGACGACGGCCAAGTGG
AGGCGGATGCCAGTGGGGTCGGTTGCAGGGGATTGGAGAGCCGGACAATAGACATTGCAGCAGTTTTGGAAGATCAACAACGGAATTCTAATGCTAAGATCAAGATCGAA
GCTCCATCACCTCGGGAGAAAACGTTTCTGTATCGATCTGCACTTAAGGGTGAGTGGAGAAGGGTGGAATCAATGATTAAGAAGTATCCACATTATGCTCGTTCTGCAAT
TACAAAAAACGAAGAGACTGTTCTTCATGTTGCTGCTGGAGCCAAGCAAACCGGGTTTGTAAAGGAGCTGGTTAACAGAATGAGCCCCATTGACATGACTATGCAAAATA
AATATGGAAATACTGCCCTTTGCTTTGCTGCTACATCTGGAATCGTAAGAATTGCACAGTTAATAGTGGAGAAAAATGAAGATCTTCCACTCATTCGTGGCTTCAGTAAC
CTCACTCCACTTTTCATGGCGGTATCTTATAAACGCAAATTAATGGTTACTTACCTCTTCTCTGTTACTGATCTTTATCAACTCACGCACGAAGACCAGATTGAGCTTCT
TATTGCCACCATACATAGCGACTTTTTTGATATAAGTTTGCAGATTCTAACAATGAATCCAAATTTAGCAACTATGAAATGCACAAAGAACAACAATGAAAGTGCATTGC
ATGTGATGGCTAGAAAGCCACTAGCAGTCGGTAGCGCCACAAAGCAGTTGAGCATTTGGAGAAAGTACATCAAGTTTGGTTTCAATGGGAAGTACGATAAAAATATGATG
AATATATTTGCTCGTGAAGTAGTTAAATTTCTATGGGAATATATTGTTGAGGAGTTTGAAGAAAAGGAAATGTTGGAATTCATTAAACATCCGACAAGATTACTACATGA
TGGTACAAGAGCAGAAAATGTTGAGTTCTTGATTTTGCTCATTAACTTATACCCAGATATAGTGTGGGAAGAAGATGATGAAGGTAAGACCATATTTGATGTGGCTATCG
AAAATCGACTTGAAAATGTGTTCAATCTCATAGATGAAATTGGTGGGCTTAATGAGTTTGCAATGAAACATAGATTAACTAATAGAAACTACAGCATGTTACATACTGTT
GCAAATCTTGCTACTCCAAACAATCTCAATAGGGTCACCGGAGCTGCCTTTCAAATGCAACGTGAATTGCTTTGGTTCAAGGAGGTGGAGAAGATAGTTCTACCTTCTCA
ACTCATGGCCAAATCCAATGATCCAAATCCACAAGTATCCAAATTAACCCCACGTGAGTTATTCACCGAAAATCACAAAGATCTTCGCAAAGCTGGTGAGGAATGGATGA
AAAACACAGCAAACTCATGCATGATCGTTGCAGCTCTAATCACCACCGTCGTTTTCGCCGCAGCCTTCACCGTTCCTGGCGGATCTGATAATAATACAGGCCATCCTATT
TTTCAGCACAAGCCTTGGTTCACTGTGTTCGTGATATCAGATGCAGCTGCTTTGGTTTCATCTTCAACTTCTATTTTAATGTTCATGTCAATCCTAACTTCGCGTTACGC
AGAAGACGATTTCTTGCACTCGCTGCCTTCCCGATTACTCGTCGGCCTCACGTCCTTATTTGTGTCTATAGTGTGCATGGTTGTGACCTTCACTGCAATCTTCTTTTTGC
TCTACCAAAATGTCAACTTATGGGTTCCACTCACTGTTGCTGTAATGACAATTCTTCCACTTCAAAACCCTTTCGCCGACCCCATTCAAACCCGCCATAGATGGACCGAG
CTCCTCTCCTTTCAATTCTGCTACCTCATCTTCCTCTTCGCTTTCTCTCTTCTCTCCACCGCGGCCGTCGTCTTCACCGTCGCCTCCCTCTACACTTCCAAACCCGTCTC
TTATTCCTCCACTCTCTCCGCTATTCCCAAAGTCTTCAAACGCCTGTTCGTCACTTTCCTCTGGGTTTCTCTCCTCATGATTATCTATAACTTCATCTTTCTCGCCTTCT
TGGTACTCCTTGTCCTTGCAATCGATACCCAGAATTACTTCCTGTTCTTCTTCTCTGTTATTGTAATCTTCATCCTGTTTCTAGTTGTACATCTTTACATCACCGCCTTG
TGGCATTTGGCTAGCGTCGTGTCGGTGCTTGAACCGATTTATGGATTTGCTGCCATGAAAAAGAGCTACGAATTGCTCAAGGGGAAAACCCGATTTGCGGGTGTGCTTGT
ATTTGCTTATTTAGCCATTTGTGCTACAATTAGTGTCATATTTGGGGCTGCAGTCGTGCACGGTGGGGATGGATATGGGGTTTTTGTGAGGATTATTGTTGGTGGCTTCT
TGGTTTGCGTTTTGGTGATTGTGAACTTGGTGGGATTGCTTGTTCAGAGTGTGTTTTACTATGTCTGCAAGAGCTTCCATCATCAAGGAATTGATAAGCTGGCTTTACAT
GATCATTTGGGTGGATATCTTGGTGAGTATGTACCTCTCAAAAGTAGTATTCAGATGGATAATTTAGATGCATGA
Protein sequenceShow/hide protein sequence
MKNRMRKSRSFSSFPNNNPFEEKMKMKKDEDEDDTTLKRQGINVNAPLDEINAATKIFLYQNALKGEWEYVELLLEESPHYVLCSITRNKETILHIAAGAKQTEFVEKLV
DRMNYDDMILQNKYGNTALCFAAASGVVRIAQVMVEKNQHLPLIRGFNNAVTPLFIAVSYKCTEMVSYLLSITDLDQLNHQEQIELLIATIHSDFYDISWWIVQRYPYLA
IMKDMNEETALHVLARKPSAMDVTKELSSWELYISSWIYRKPVTKTLAHELVILLWTNVLRNLPEKKMLQFIKHPTRLLNDAACTGNVEFLIVLIRKYPDIIWEDDDDGK
SIFHVAVENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGHLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVMSSQRLKVKDNYPNIPKLTPR
QLFTQEHRDLRKAGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSNNNDGTPVFQQKLWFTVFVMSDAVALFSSSTSILMFMSILTSRYAEDDFLHSLPSRLLFGLAT
LFISIVCMVVAFSATFFILYHKANICIPTIVSAMAILPEHASNSFHVHLSDTLSSIVEDDPVPPPSTADDGQVEADASGVGCRGLESRTIDIAAVLEDQQRNSNAKIKIE
APSPREKTFLYRSALKGEWRRVESMIKKYPHYARSAITKNEETVLHVAAGAKQTGFVKELVNRMSPIDMTMQNKYGNTALCFAATSGIVRIAQLIVEKNEDLPLIRGFSN
LTPLFMAVSYKRKLMVTYLFSVTDLYQLTHEDQIELLIATIHSDFFDISLQILTMNPNLATMKCTKNNNESALHVMARKPLAVGSATKQLSIWRKYIKFGFNGKYDKNMM
NIFAREVVKFLWEYIVEEFEEKEMLEFIKHPTRLLHDGTRAENVEFLILLINLYPDIVWEEDDEGKTIFDVAIENRLENVFNLIDEIGGLNEFAMKHRLTNRNYSMLHTV
ANLATPNNLNRVTGAAFQMQRELLWFKEVEKIVLPSQLMAKSNDPNPQVSKLTPRELFTENHKDLRKAGEEWMKNTANSCMIVAALITTVVFAAAFTVPGGSDNNTGHPI
FQHKPWFTVFVISDAAALVSSSTSILMFMSILTSRYAEDDFLHSLPSRLLVGLTSLFVSIVCMVVTFTAIFFLLYQNVNLWVPLTVAVMTILPLQNPFADPIQTRHRWTE
LLSFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSYSSTLSAIPKVFKRLFVTFLWVSLLMIIYNFIFLAFLVLLVLAIDTQNYFLFFFSVIVIFILFLVVHLYITAL
WHLASVVSVLEPIYGFAAMKKSYELLKGKTRFAGVLVFAYLAICATISVIFGAAVVHGGDGYGVFVRIIVGGFLVCVLVIVNLVGLLVQSVFYYVCKSFHHQGIDKLALH
DHLGGYLGEYVPLKSSIQMDNLDA