; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025781 (gene) of Chayote v1 genome

Gene IDSed0025781
OrganismSechium edule (Chayote v1)
DescriptionChlorophyllase
Genome locationLG01:71499535..71502355
RNA-Seq ExpressionSed0025781
SyntenySed0025781
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606410.1 Ankyrin repeat-containing protein ITN1, partial [Cucurbita argyrosperma subsp. sororia]1.1e-22357.26Show/hide
Query:  LKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGA
        L+       W  V+  YE+  E AQ +KLN  GDTALHLAVIDN+EE VEK+V  +   +   K +LE   NERGN+PLHLAA+MGS++MC AIA A+G 
Subjt:  LKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGA

Query:  LVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTG
        LV+ RN +DETPL+LAA  GNKD FF LY+ CG+D++ I  NCRV++ GDTVLH AL N+QFDLA +LI+I++++ NW N+ GFTPL VLASKPTSFK+G
Subjt:  LVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTG

Query:  SHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEE
        SHIKG + I+Y+ +FV  L+P+  + + +  ERS  KA  T+P S FP NYQTC HFF  +   FL V        +      DE++ + + D D    +
Subjt:  SHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEE

Query:  EEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGT
         EE++     +  +  +T FP+NY+TCIDFF    S IMII G G+ EI++IRK+KEKHTWA+QVM++L +      Y D G  PM  ++Q +EEKE+ T
Subjt:  EEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGT

Query:  VPYIVKDGRVHF-------PDTLNTNNNN----EHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKK
        VPY   D  + F       P  L     +    +    KDV+ T++ ML+AA+ GVI+IV+ +  RFPLAI +T +DKKNVVLLAAE+RQPEVY+FLLK+
Subjt:  VPYIVKDGRVHF-------PDTLNTNNNN----EHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKK

Query:  RKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVAT
        RK IE+LFRAVD  GNSALHLAA A DP LW ITG A Q+QWEVKWY YVKESVPLH F  +NK+GETA  +F E+H +L  KGG+WLY TSES ++VAT
Subjt:  RKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVAT

Query:  LIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLII
        LIATVAFATA T+PGGN E G A L  E GF IFS SSLIAL LSSTS+IMFLAI+TSRF  K FG VLPWKL IGL CLYFSIIAMLVSFCSGHYFLII
Subjt:  LIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLII

Query:  SRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVVHR
        +RL N+A+L Y+L F+PVA IF +VQLPLYFDLLQA+L++VP RS+ VV R
Subjt:  SRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVVHR

KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]4.6e-22558.12Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQ--SVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVR
        K   EV +    K    K+ E   ++E+  EV +  + KLN RGDT LHLAVIDN+E  VE++V        E  ++LE + NER N+PLHLAA+MGSVR
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQ--SVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVR

Query:  MCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHV
        MC AIA  +  LV++RN  DETPLFLAA YGNKDAFFCLY FC    + I+ NCRV + GDTVLH AL NE FDLAFQLI+++K++++WVN+ G TP+HV
Subjt:  MCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHV

Query:  LASKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKANTSPRSR---------FPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE---
        LASKPTSFK+GSHIKGW++IVY+ +FV  L+P+ IK + +  +RS+ KA      +         FP NYQTC  F   +    L V   K  ++ +   
Subjt:  LASKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKANTSPRSR---------FPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE---

Query:  ----EKDLEKDNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDH
            +K++ K++ D++  + EE++     +  E  +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM+KL+E A   KY + 
Subjt:  ----EKDLEKDNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDH

Query:  GGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPE
        G  PM  K+   +E    T+PY   D  V F   +      E   P+DV+ +++ ML+AA+ GVI+IV+   ERFPLAIR+T++DKKNVVLLAAE RQP+
Subjt:  GGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPE

Query:  VYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTS
        VY+FLLKK   I+NLFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWY YVK SVPLH F  +N +G+TA  IF E+H +L  KGG WLY TS
Subjt:  VYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTS

Query:  ESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFC
        ESCSLVATLIATVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL CLYFSIIAMLVSFC
Subjt:  ESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFC

Query:  SGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        SGHYFLII RLHN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  SGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]3.3e-23159.84Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC
        K V E  D  +   K  +  EV+  Y++ LE   + KLN RGDT LHLAVIDN+E IVE++V+ +        ++LE + NE GN+ LHLAA+MGSVRMC
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC

Query:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         AIA  +  LV++RN  D+TPLFLAA YGNKDAFFCLY FC DDA+ I+ NCRV + GDTVLH AL++E FDLAFQLI+++K++++WVN+ G TP+HVLA
Subjt:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

Query:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK
         KPTSFK+GSHIKGW++IVY+ +FV  L+P+  + + +  ERS  KA  T+P S FP NYQTC HFF  +    L     K  ++ +       +K++ K
Subjt:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK

Query:  DNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQ
        ++ D++  + EE++     +  E  +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM KL+E A   KY + G  PM SK++
Subjt:  DNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQ

Query:  TKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRK
          EE    T+PY   D  V F +        E   P+DV+  ++ ML+AA+ GVI+IV+  LERFPLAIR+T++DKKNVVLLAAE RQP+VY+ LLKK K
Subjt:  TKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRK

Query:  GIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLI
         I+NLFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWYKYVK SVPLH F  +N +G+TA  IF E+H +L  KGG WLY TSESCSLVATLI
Subjt:  GIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLI

Query:  ATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISR
        ATVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL CLYFSIIAMLVSFCSGHYFLII R
Subjt:  ATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISR

Query:  LHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        LHN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  LHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]5.6e-23159.79Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC
        K V E  D  +   K  +  EV+  Y++ LE   + KLN RGDT LHLAVIDN+E IVE++V+ +        ++LE + NE GN+ LHLAA+MGSVRMC
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC

Query:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         AIA  +  LV++RN  D+TPLFLAA YGNKDAFFCLY FC DDA+ I+ NCRV + GDTVLH AL++E FDLAFQLI+++K++++WVN+ G TP+HVLA
Subjt:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

Query:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK
         KPTSFK+GSHIKGW++IVY+ +FV  L+P+  + + +  ERS  KA  T+P S FP NYQTC HFF  +    L     K  ++ +       +K++ K
Subjt:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK

Query:  DNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQT
        ++ D++  + EE++     ++   +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM KL+E A   KY + G  PM SK++ 
Subjt:  DNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQT

Query:  KEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKG
         EE    T+PY   D  V F +        E   P+DV+  ++ ML+AA+ GVI+IV+  LERFPLAIR+T++DKKNVVLLAAE RQP+VY+ LLKK K 
Subjt:  KEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKG

Query:  IENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIA
        I+NLFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWYKYVK SVPLH F  +N +G+TA  IF E+H +L  KGG WLY TSESCSLVATLIA
Subjt:  IENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIA

Query:  TVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRL
        TVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL CLYFSIIAMLVSFCSGHYFLII RL
Subjt:  TVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRL

Query:  HNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        HN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  HNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]4.4e-22858.44Show/hide
Query:  LKAKTKRGKWSEVVTIYE------------------QNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRT---------EEEKKKLLE-----
        L A  KRGKW EV+  Y+                  QN   A++ KLN RGDT LHLAVIDN+E  VE++VR I            EE +K++ E     
Subjt:  LKAKTKRGKWSEVVTIYE------------------QNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRT---------EEEKKKLLE-----

Query:  ----DSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDL
            ++ NERGN+PLHLAAVMGSVRMC AI   +  LV+ RN  DETPLFLAA YGNKDAFFCLY FC DDA+ I+ NCRV + GDTVLH+AL NE  DL
Subjt:  ----DSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDL

Query:  AFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA--NTSPRSRFPPNYQTCTHFFTNLYH
        AFQLI+++K++++WVN+ G TP+H+LASKPTSFK+GSHIKGW++IVY+ +FV  LK        +GL  S+ K     SP S FP NY+TC  F+  +  
Subjt:  AFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA--NTSPRSRFPPNYQTCTHFFTNLYH

Query:  FFLIVSGVKK--KSDEEEKDLEKDNDDEKEEEEEEEE-EEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDK
          L V   K+  K+D +    +K ND + + ++   + EE   ++   +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM+K
Subjt:  FFLIVSGVKK--KSDEEEKDLEKDNDDEKEEEEEEEE-EEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDK

Query:  LIESATMQKYNDHGGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKK
        L+E A   KY + G  PM SK+Q  +E    T+PY   D  V F   +      E   P+DV+  ++ ML+AA+ GVI+IV+  LERFP AIR+T +DKK
Subjt:  LIESATMQKYNDHGGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKK

Query:  NVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHID
        NVVLLAAE RQP+VY+F LK+R  I+ LFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWY YVK SVPLH F  +N +G+TA  IF E+H +
Subjt:  NVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHID

Query:  LVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSC
        L  KGG WLY TSESCSLVATLIATVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL C
Subjt:  LVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSC

Query:  LYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        LYFSIIAMLVSFCSGHYFLII RLHN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  LYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

TrEMBL top hitse value%identityAlignment
A0A6J1ESB1 uncharacterized protein LOC1114373392.3e-21453.72Show/hide
Query:  WSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVD
        W  V+  YE N E AQ +KLN  GDTALH AV+DN+EE V+K+V  + + E+  K +LE + NERGN+PLHLAA+MGSV+MC AIA+A+G LV++RN +D
Subjt:  WSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVD

Query:  ETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSHIKGWRNI
        ETPL+LAA  GNK+AFFCLY FCGDD+  I  NCRV   GDTVLH+AL N+QFDLA QLI+I++++ NW N+ GFTPL VLASKPTSFK+GS I+GWR+I
Subjt:  ETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSHIKGWRNI

Query:  VYYATFVNQLKPQKIKTIFKGL------------------------------------------------------------ERSIGKA-NTSPRSRFPP
        VY+   V  LKP+ I+ + +                                                              ERS+ KA  T+P S FP 
Subjt:  VYYATFVNQLKPQKIKTIFKGL------------------------------------------------------------ERSIGKA-NTSPRSRFPP

Query:  NYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEI
        NYQTC HF+  +    L V        +      DE++ + + D D    + EE++     +  +  +T FP+NY+TCIDFF    S IMII G G++EI
Subjt:  NYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEI

Query:  RKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNN----NEHGTPKDVKPTDTVMLIAAQTGV
        ++IRK+KEKHTWA+QVM++L+E     KY + G  PM  ++Q ++E+E+ TVPY   D  V F              + G PKDV+ T++ ML+AA+ GV
Subjt:  RKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNN----NEHGTPKDVKPTDTVMLIAAQTGV

Query:  IDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPL
        ++IV+ +  R PLAI +T +DKKNVVLLAAE+RQPEVY+FLLK+RK IE+LFRAVD  GNSALHLAA A DPKLW ITG A ++QWEVKWYKYVKESVPL
Subjt:  IDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPL

Query:  HTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAIL
        H F  +NK+G+TA  +F E+H  L  KGG WLY TS+S +LVATLIATVAFATA T+PGGN E G A L  E GF IFS SSLIAL LSSTS+IMFLAI+
Subjt:  HTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAIL

Query:  TSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA
        TSRF  K FG VLPWKL IGL CLYFSIIAMLVSFCSGHYFLI +RL N+A+L Y+L F+PVA IF +VQLPLYFDL+ +
Subjt:  TSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA

A0A6J1ET50 uncharacterized protein LOC111437341 isoform X22.7e-23159.79Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC
        K V E  D  +   K  +  EV+  Y++ LE   + KLN RGDT LHLAVIDN+E IVE++V+ +        ++LE + NE GN+ LHLAA+MGSVRMC
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC

Query:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         AIA  +  LV++RN  D+TPLFLAA YGNKDAFFCLY FC DDA+ I+ NCRV + GDTVLH AL++E FDLAFQLI+++K++++WVN+ G TP+HVLA
Subjt:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

Query:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK
         KPTSFK+GSHIKGW++IVY+ +FV  L+P+  + + +  ERS  KA  T+P S FP NYQTC HFF  +    L     K  ++ +       +K++ K
Subjt:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK

Query:  DNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQT
        ++ D++  + EE++     ++   +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM KL+E A   KY + G  PM SK++ 
Subjt:  DNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQT

Query:  KEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKG
         EE    T+PY   D  V F +        E   P+DV+  ++ ML+AA+ GVI+IV+  LERFPLAIR+T++DKKNVVLLAAE RQP+VY+ LLKK K 
Subjt:  KEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKG

Query:  IENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIA
        I+NLFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWYKYVK SVPLH F  +N +G+TA  IF E+H +L  KGG WLY TSESCSLVATLIA
Subjt:  IENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIA

Query:  TVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRL
        TVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL CLYFSIIAMLVSFCSGHYFLII RL
Subjt:  TVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRL

Query:  HNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        HN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  HNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X11.6e-23159.84Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC
        K V E  D  +   K  +  EV+  Y++ LE   + KLN RGDT LHLAVIDN+E IVE++V+ +        ++LE + NE GN+ LHLAA+MGSVRMC
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC

Query:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         AIA  +  LV++RN  D+TPLFLAA YGNKDAFFCLY FC DDA+ I+ NCRV + GDTVLH AL++E FDLAFQLI+++K++++WVN+ G TP+HVLA
Subjt:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

Query:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK
         KPTSFK+GSHIKGW++IVY+ +FV  L+P+  + + +  ERS  KA  T+P S FP NYQTC HFF  +    L     K  ++ +       +K++ K
Subjt:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA-NTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEE-------EKDLEK

Query:  DNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQ
        ++ D++  + EE++     +  E  +T FP+NY+TCIDFF    S IMIILGFGS+EI+KIRKKKEKHTW++QVM KL+E A   KY + G  PM SK++
Subjt:  DNDDEKEEE-EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQ

Query:  TKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRK
          EE    T+PY   D  V F +        E   P+DV+  ++ ML+AA+ GVI+IV+  LERFPLAIR+T++DKKNVVLLAAE RQP+VY+ LLKK K
Subjt:  TKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRK

Query:  GIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLI
         I+NLFRAVD  GNSALHLAA A+DPKLW ITGAA QMQWEVKWYKYVK SVPLH F  +N +G+TA  IF E+H +L  KGG WLY TSESCSLVATLI
Subjt:  GIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLI

Query:  ATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISR
        ATVAFATA T+PGGN E G A L  EQGF IFS SSLIALCLSSTS+IMFLAI+TSRF  K FG  LPWKL IGL CLYFSIIAMLVSFCSGHYFLII R
Subjt:  ATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISR

Query:  LHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        LHN+A+L YTL F PVA IFGIVQLPLYFDLLQA++K VP+RS+ VV
Subjt:  LHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X19.7e-20553.17Show/hide
Query:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC
        K  + + D L   TKRGKW EV+  YE+  E AQ +KL   GDTALHLAV+DNREE+V+K+V +I  T +  + L   + N+R  +PLHLAA MGS  MC
Subjt:  KNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMC

Query:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         AIA A+  LV+ RN VDETPL+LAA  GN+DAFFCLY+FC ++A+ I  NCR++  GDTVLH AL N+ FDLAFQ++H+N ++++WV + G TPLHVLA
Subjt:  CAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

Query:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKANTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKK-----KSDEEEKDL-----
        SKPT+FK+GS I+GWRNI YY T V+QLKPQ I ++ +     +   NTS    FP NY TC  FFT ++  FL  SG+K+     ++DE +KD      
Subjt:  SKPTSFKTGSHIKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKANTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKK-----KSDEEEKDL-----

Query:  ------EKDNDDEKEEEEEEEEEEAMNIDEDES-TMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGG
              E    DE  +  + E  +A  + ++ S T  P NY+TCI FFQ V S IMI LG GS+E +KIR+KKEKHTW++QVM+KL+E A   +Y  +GG
Subjt:  ------EKDNDDEKEEEEEEEEEEAMNIDEDES-TMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGG

Query:  PPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVY
         PM S  QT ++ E  T+PY +  G V   +++  +   E   PK+V+  +T ML+AA+ GVI+IV+ +  RFPL+I + ++DKKNVVLLAAE+ QP+VY
Subjt:  PPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVY

Query:  KFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSES
        +FLL  +   ENLFRAVD+NGNSALHLAA A    +W ITGAA QMQWE+KWYK+V+ESVPL+ F  YNK+G+ A  IFHE+H+DLV K G+WL KTS+S
Subjt:  KFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSES

Query:  CSLVATLIATVAFATATTVPGG-NGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCS
        CS+V TLI TVAF +  ++PGG N   G   LK  + FF F++ SLIALCLSSTS+ MFLAILT RF    F + LPWKLFIG S L+ SII+MLVSFC+
Subjt:  CSLVATLIATVAFATATTVPGG-NGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCS

Query:  GHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV
        GHYFL+   + + A L YT++ +PVA IF I +LPLY D++QAI K VP RS+ VV
Subjt:  GHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVV

A0A6J1K7T7 uncharacterized protein LOC1114914297.4e-20558.95Show/hide
Query:  KRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEE---KKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALV
        +RG+W EV+  Y++N + AQ++KLN RGDTALHLAVIDN+EE VE +V+ I    ++   KKK+LE + +ERGN+PLHLAA+MGSVRMC AIA  +  LV
Subjt:  KRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEE---KKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALV

Query:  NKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSH
        + RN VDETPLFLAA YGNKDA FCLY FC D+A+ I+ NCRVS  GDTVLH AL NE FDLAFQLI+++K++++WVN+ G TP+HVLASKPTSFK+GSH
Subjt:  NKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSH

Query:  IKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA--NTSPRSRFPPNYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEEE
        IKGW++IVY+ +FV  L+P+ I+++ + L+RS+ KA    SP S FP NY+TC  F+  +    L V        +  K   +E++ + + D D    + 
Subjt:  IKGWRNIVYYATFVNQLKPQKIKTIFKGLERSIGKA--NTSPRSRFPPNYQTCTHFFTNLYHFFLIV--------SGVKKKSDEEEKDLEKDNDDEKEEE

Query:  EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGTV
        EE++  +  +  +  +T FP+NY+TCIDFF    S IMIILGFGS EI+KIRKKKEKHTW++QVM+KL+E A   KY + G  PM SK+Q  +E    T+
Subjt:  EEEEEEEAMNIDEDESTMFPINYSTCIDFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGTV

Query:  PYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVD
        PY   D  V F   +      E   P+DV+ T+T ML+AA+ GVI+IV+ +LERFPLAIR+T++DKKNVVLLAAE RQP+VY+FLLKKR  I+NLFRAVD
Subjt:  PYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVD

Query:  ENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATT
          GNSALHLAA ++DPKLW ITGAA QMQWEVKWY YVK SVPL  F  +NK+G+TA  IF E+H +L  KGG+WLY TSES +LVATLIATVAFATA T
Subjt:  ENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATT

Query:  VPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGL
        +PGGN E G A    E GF IFS SSLIAL LSSTS+IMFLAI+TSRF  K FG VLPWKL IGL
Subjt:  VPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGL

SwissProt top hitse value%identityAlignment
G5E8K5 Ankyrin-36.8e-0630.67Show/hide
Query:  NLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKD-AFFCL
        N+RG+TALH+A    + E+V  +V+   + E + K            +PLH++A +G   +   +    GA  N       TPL LAA  G++D A F L
Subjt:  NLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKD-AFFCL

Query:  YYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA
         +  G   +I       +K G T LH A    + ++A  L+     S +   + G TPLHV A
Subjt:  YYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLA

P16157 Ankyrin-18.9e-0625.81Show/hide
Query:  DALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYAN
        + L   +K G    VV +  + + +  + K   +G+TALH+A +  ++E+V ++V        + +K         G +PL++AA    + +       N
Subjt:  DALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYAN

Query:  GALVNKRNAVDETPLFLAAMYGNKDAFFCLYYF---------------CGDD---ATIIAQNCR----VSKGGDTVLHRALNNEQFDLAFQLIHINKDSV
        GA  N       TPL +A   G+++    L  +                 DD   A ++ QN      +SK G T LH A + E  ++A QL+     SV
Subjt:  GALVNKRNAVDETPLFLAAMYGNKDAFFCLYYF---------------CGDD---ATIIAQNCR----VSKGGDTVLHRALNNEQFDLAFQLIHINKDSV

Query:  NWVNQQGFTPLHVLASK
        N+  Q G TPLH+ + +
Subjt:  NWVNQQGFTPLHVLASK

Q02357 Ankyrin-18.9e-0625.81Show/hide
Query:  DALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYAN
        + L   +K G    VV +  + + +  + K   +G+TALH+A +  ++E+V ++V        + +K         G +PL++AA    + +       N
Subjt:  DALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYAN

Query:  GALVNKRNAVDETPLFLAAMYGNKDAFFCLYYF---------------CGDD---ATIIAQNCR----VSKGGDTVLHRALNNEQFDLAFQLIHINKDSV
        GA  N       TPL +A   G+++    L  +                 DD   A ++ QN      +SK G T LH A + E  ++A QL+     SV
Subjt:  GALVNKRNAVDETPLFLAAMYGNKDAFFCLYYF---------------CGDD---ATIIAQNCR----VSKGGDTVLHRALNNEQFDLAFQLIHINKDSV

Query:  NWVNQQGFTPLHVLASK
        N+  Q G TPLH+ + +
Subjt:  NWVNQQGFTPLHVLASK

Q5GIG6 Serine/threonine-protein kinase TNNI3K1.2e-0528.57Show/hide
Query:  KLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFC
        +L   G  ALHLAV  +  E++  ++      ++             G + LH+AA+ G       +   +GA VN ++AV  TPL +AA YG++     
Subjt:  KLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANGALVNKRNAVDETPLFLAAMYGNKDAFFC

Query:  LYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLH
        L  F G D  +  +       GD  LH A     F++   L+  NK  VN  + +   PLH
Subjt:  LYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLH

Q9C7A2 Ankyrin repeat-containing protein ITN14.0e-0625.3Show/hide
Query:  IAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLA-----ANALDPKLWLITGAAFQMQWEV
        +AA+ G +++++ALL + P   R   +  +  + +A + +  EV K LL     I       D++ N+ALH+A     A  ++  L L    A  +  + 
Subjt:  IAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLA-----ANALDPKLWLITGAAFQMQWEV

Query:  KWYKYVKESVPLHTFTRYNKK--------------------GETAREIFHESHIDL------------VAKGGDWLYK-----TSESCSLVATLIATVAF
        K    + E +PL   + Y K+                      T  +I ++ HI L            ++K    L++      + S ++VA L ATVAF
Subjt:  KWYKYVKESVPLHTFTRYNKK--------------------GETAREIFHESHIDL------------VAKGGDWLYK-----TSESCSLVATLIATVAF

Query:  ATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAIL
        A   TVPGG+   G A + G   F IF I + +AL  S   +++ + ++
Subjt:  ATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAIL

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein4.5e-2931.94Show/hide
Query:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAA-NALDPKLWLITGAAFQMQWEVKWYKY
        A + G+++ +  ++  +P  +        N+   A   RQ +++  +         L    D   N+ LH AA  A   +L LI GAA QMQ E++W+K 
Subjt:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAA-NALDPKLWLITGAAFQMQWEVKWYKY

Query:  VKESV-PLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTS
        V++ V P H      K+ +T + +F + H DLV +G  W+ +T+ SC++VA LI T+ F++A TVPGG    G      +  F IF IS  I+L  S  S
Subjt:  VKESV-PLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTS

Query:  MIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA
        ++MFL IL SR+  + F   LP KL +GL  L+ S+  M+V+F      L+  ++   +  F  L  +P+  +F ++Q P+  ++ +A
Subjt:  MIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA

AT3G54070.1 Ankyrin repeat family protein4.8e-3133.77Show/hide
Query:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDE--NGNSALHLAANALDP--KLWLITGAAFQMQWEVKW
        AA+ G ++I+  L+      +     + + +  +AA +R   ++  L+ +  GI++L  +  E  + ++ LHL A  L P  +  + +GAA  MQ E+ W
Subjt:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDE--NGNSALHLAANALDP--KLWLITGAAFQMQWEVKW

Query:  YKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGE-------ANLKGEQGFFIFSISSL
        +K VKE VP       N KGE A +IF E H +L  +G  W+ +T+ +C L ATLIATV FA A T+PGGN + G+        N +    F IF++S  
Subjt:  YKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGE-------ANLKGEQGFFIFSISSL

Query:  IALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILK
        +AL  S  S+++FL+I TSR+  + F   LP KL  GLS L+ SII+M+++F      + + +     VL   L  L  A  F  +   L+F+ L+++  
Subjt:  IALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILK

Query:  SV
        S+
Subjt:  SV

AT5G04700.1 Ankyrin repeat family protein1.8e-3034.85Show/hide
Query:  GTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVV-LLAAEFRQPEVYKFL--LKKRKGIENLFRAVDENGNSALHLAANALDP-KLW
        G  +  +  D  +L A + G +D +  ++      +  T+    + + LLA EFRQ +V+  L  L  RK +  L    D +GN  LHLA     P KL 
Subjt:  GTPKDVKPTDTVMLIAAQTGVIDIVRALLERFPLAIRETQRDKKNVV-LLAAEFRQPEVYKFL--LKKRKGIENLFRAVDENGNSALHLAANALDP-KLW

Query:  LITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGG--NGEKGEANLKGEQ
         + GA  Q+Q E++W+K V+   P     R N + +T  EIF + H  L  +   W+  T+ SCSLVA LI TV FA   TVPGG  +  KG+     ++
Subjt:  LITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGG--NGEKGEANLKGEQ

Query:  GFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIF--LPVAFIFGIVQL
         F IF +S LI+   S TS+++FL ILT+R+    F   LP K+  GLS L+ SI AML++F S  + ++        ++  T++F  LP A +F ++Q 
Subjt:  GFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIF--LPVAFIFGIVQL

Query:  PLYFDLL
        PL  +++
Subjt:  PLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein1.1e-2734.59Show/hide
Query:  KNVVLLAAEFRQPEVYKFL--LKKRKGIENLFRAVDENGNSALHLAANALDP-KLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHE
        +N+  LA EF++ +++  +  L  RK    L R+ D+  N+ LH+A     P +L  I+GAA +MQ E +W+K V+  V      + NK  +T R+IF  
Subjt:  KNVVLLAAEFRQPEVYKFL--LKKRKGIENLFRAVDENGNSALHLAANALDP-KLWLITGAAFQMQWEVKWYKYVKESVPLHTFTRYNKKGETAREIFHE

Query:  SHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGG-NGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLF
         H  L  +G +W+  T+ +CS VA LIATV F    TVPGG +G  G   +  +  F  F  +  +A   S  S+++FL+ILTSR+    F   LP K+ 
Subjt:  SHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGG-NGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLF

Query:  IGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKS
        +G S L+ SI +MLV+F +    L  S  H  A+++      P+A    ++ L L + LL+ ++ S
Subjt:  IGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKS

AT5G35810.1 Ankyrin repeat family protein5.5e-3534.92Show/hide
Query:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYK--FLLKKRKGIENLFRAVDENGNSALHLAANALDP-KLWLITGAAFQMQWEVKWY
        AAQ+G ++++  L+  +P  I       +++  +AA  R  +++   + L   K +  +++  + N N  LHL A    P +L +++GAA QMQ E+ WY
Subjt:  AAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYK--FLLKKRKGIENLFRAVDENGNSALHLAANALDP-KLWLITGAAFQMQWEVKWY

Query:  KYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKG------EQGFFIFSISSLIA
        K VKE VP       NKK E A ++F + H +L  +G  W+ +T+ +C LV+TLIATV FA A T+PGGN   G+    G      E  F +F IS  +A
Subjt:  KYVKESVPLHTFTRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKG------EQGFFIFSISSLIA

Query:  LCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA
        L  S TS+++FL+ILTSR+    F + LP KL +GL  L+ SII+M+++F +    LI  +    +++    +    A  F ++   L+FD L++
Subjt:  LCLSSTSMIMFLAILTSRFVPKHFGSVLPWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGAACGTGGATGAAGTAGGAGATGCTCTTAAGGCAAAAACGAAGAGAGGGAAATGGTCGGAAGTGGTGACAATATACGAGCAAAACCTTGAGGTGGCTCAGAG
TGTGAAGCTAAATCTACGGGGAGACACGGCGCTGCATTTGGCTGTGATCGACAACCGAGAAGAAATAGTGGAAAAGATGGTGAGGAAGATCGGGAGGACAGAGGAGGAGA
AGAAGAAGCTTCTAGAGGACAGTATAAACGAGAGGGGAAACAGCCCTCTGCACCTGGCGGCAGTGATGGGGAGCGTGAGAATGTGCTGCGCCATTGCGTACGCCAACGGC
GCGTTGGTGAATAAGAGAAACGCGGTGGACGAGACCCCTCTGTTCTTGGCAGCCATGTACGGGAACAAGGACGCCTTCTTTTGCCTTTACTACTTCTGTGGAGATGATGC
CACCATAATTGCCCAAAACTGCAGAGTGAGCAAGGGGGGTGACACCGTCCTACATCGCGCCCTTAACAACGAGCAGTTTGATTTGGCGTTTCAGTTAATTCACATCAACA
AGGATTCTGTGAACTGGGTGAATCAGCAGGGCTTTACTCCTCTCCACGTTCTAGCAAGTAAGCCAACTTCCTTCAAAACTGGATCCCACATCAAGGGATGGCGGAACATC
GTGTACTACGCCACATTTGTGAATCAACTAAAGCCTCAAAAAATCAAAACCATTTTCAAAGGGTTGGAGCGAAGTATTGGCAAAGCAAACACAAGTCCACGATCCCGTTT
TCCACCGAACTACCAGACATGCACCCACTTCTTCACCAACCTCTACCATTTCTTTCTCATTGTGAGTGGTGTGAAGAAGAAAAGTGATGAAGAAGAGAAAGATTTGGAAA
AAGACAATGATGATGAGAAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGATGAATATTGATGAGGATGAGAGTACAATGTTCCCAATCAACTATAGTACCTGCATC
GACTTTTTTCAGTTTGTTTTGTCGGGCATCATGATCATCCTTGGGTTCGGGTCTAGCGAGATCAGGAAGATACGGAAGAAGAAAGAGAAACACACATGGGCAATCCAAGT
CATGGACAAGCTTATTGAATCTGCTACAATGCAAAAATATAACGACCACGGAGGCCCTCCCATGAGTTCAAAATATCAGACCAAAGAAGAAAAAGAACAAGGCACAGTTC
CCTACATCGTTAAAGATGGTAGAGTCCACTTCCCTGACACTTTAAATACTAATAATAATAATGAACACGGCACGCCCAAAGATGTGAAACCAACAGACACAGTGATGTTA
ATTGCGGCACAAACTGGCGTGATCGACATAGTGAGAGCACTCTTGGAGCGTTTTCCTTTGGCAATCCGTGAAACTCAGAGGGATAAGAAGAACGTGGTGCTTTTAGCAGC
CGAGTTCAGGCAGCCGGAAGTATACAAGTTCTTGTTGAAGAAAAGAAAGGGCATAGAAAATCTGTTTCGAGCGGTGGATGAGAATGGCAACAGTGCACTGCATCTGGCGG
CGAATGCCTTAGATCCTAAGCTTTGGCTCATCACTGGGGCGGCGTTCCAGATGCAATGGGAAGTCAAGTGGTATAAGTACGTGAAGGAGTCTGTGCCACTCCATACATTC
ACGCGGTACAACAAGAAAGGAGAGACTGCGCGTGAAATATTCCACGAAAGCCACATCGATTTAGTGGCGAAGGGCGGGGATTGGCTGTACAAAACCTCGGAGTCGTGCTC
GTTGGTGGCGACTCTGATCGCAACGGTGGCATTTGCAACTGCAACGACGGTGCCGGGGGGTAACGGGGAGAAGGGGGAAGCCAATCTTAAAGGGGAGCAAGGGTTTTTCA
TCTTCTCCATCTCCTCCCTGATTGCCCTCTGCCTGTCCTCCACCTCCATGATCATGTTTCTGGCCATTCTGACCTCCAGGTTCGTCCCCAAACACTTCGGCTCGGTGCTC
CCCTGGAAGTTGTTCATCGGACTATCCTGCCTTTACTTCTCCATCATCGCCATGCTGGTCAGCTTCTGCTCCGGCCACTACTTCCTCATCATCAGCCGCCTCCACAACGA
CGCTGTTCTGTTCTACACCCTTATATTTCTCCCAGTCGCATTCATCTTCGGAATTGTGCAGCTTCCTCTCTACTTCGATTTGCTGCAGGCTATTCTCAAAAGTGTCCCTA
AAAGGAGTTCCGCCGTCGTACATCGTTAA
mRNA sequenceShow/hide mRNA sequence
AAGTAAAGATAGTTTGTGCATGAAATTAAAGATATTAAATTAGAATATGAGTAAGAACGTGGATGAAGTAGGAGATGCTCTTAAGGCAAAAACGAAGAGAGGGAAATGGT
CGGAAGTGGTGACAATATACGAGCAAAACCTTGAGGTGGCTCAGAGTGTGAAGCTAAATCTACGGGGAGACACGGCGCTGCATTTGGCTGTGATCGACAACCGAGAAGAA
ATAGTGGAAAAGATGGTGAGGAAGATCGGGAGGACAGAGGAGGAGAAGAAGAAGCTTCTAGAGGACAGTATAAACGAGAGGGGAAACAGCCCTCTGCACCTGGCGGCAGT
GATGGGGAGCGTGAGAATGTGCTGCGCCATTGCGTACGCCAACGGCGCGTTGGTGAATAAGAGAAACGCGGTGGACGAGACCCCTCTGTTCTTGGCAGCCATGTACGGGA
ACAAGGACGCCTTCTTTTGCCTTTACTACTTCTGTGGAGATGATGCCACCATAATTGCCCAAAACTGCAGAGTGAGCAAGGGGGGTGACACCGTCCTACATCGCGCCCTT
AACAACGAGCAGTTTGATTTGGCGTTTCAGTTAATTCACATCAACAAGGATTCTGTGAACTGGGTGAATCAGCAGGGCTTTACTCCTCTCCACGTTCTAGCAAGTAAGCC
AACTTCCTTCAAAACTGGATCCCACATCAAGGGATGGCGGAACATCGTGTACTACGCCACATTTGTGAATCAACTAAAGCCTCAAAAAATCAAAACCATTTTCAAAGGGT
TGGAGCGAAGTATTGGCAAAGCAAACACAAGTCCACGATCCCGTTTTCCACCGAACTACCAGACATGCACCCACTTCTTCACCAACCTCTACCATTTCTTTCTCATTGTG
AGTGGTGTGAAGAAGAAAAGTGATGAAGAAGAGAAAGATTTGGAAAAAGACAATGATGATGAGAAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGATGAATATTGA
TGAGGATGAGAGTACAATGTTCCCAATCAACTATAGTACCTGCATCGACTTTTTTCAGTTTGTTTTGTCGGGCATCATGATCATCCTTGGGTTCGGGTCTAGCGAGATCA
GGAAGATACGGAAGAAGAAAGAGAAACACACATGGGCAATCCAAGTCATGGACAAGCTTATTGAATCTGCTACAATGCAAAAATATAACGACCACGGAGGCCCTCCCATG
AGTTCAAAATATCAGACCAAAGAAGAAAAAGAACAAGGCACAGTTCCCTACATCGTTAAAGATGGTAGAGTCCACTTCCCTGACACTTTAAATACTAATAATAATAATGA
ACACGGCACGCCCAAAGATGTGAAACCAACAGACACAGTGATGTTAATTGCGGCACAAACTGGCGTGATCGACATAGTGAGAGCACTCTTGGAGCGTTTTCCTTTGGCAA
TCCGTGAAACTCAGAGGGATAAGAAGAACGTGGTGCTTTTAGCAGCCGAGTTCAGGCAGCCGGAAGTATACAAGTTCTTGTTGAAGAAAAGAAAGGGCATAGAAAATCTG
TTTCGAGCGGTGGATGAGAATGGCAACAGTGCACTGCATCTGGCGGCGAATGCCTTAGATCCTAAGCTTTGGCTCATCACTGGGGCGGCGTTCCAGATGCAATGGGAAGT
CAAGTGGTATAAGTACGTGAAGGAGTCTGTGCCACTCCATACATTCACGCGGTACAACAAGAAAGGAGAGACTGCGCGTGAAATATTCCACGAAAGCCACATCGATTTAG
TGGCGAAGGGCGGGGATTGGCTGTACAAAACCTCGGAGTCGTGCTCGTTGGTGGCGACTCTGATCGCAACGGTGGCATTTGCAACTGCAACGACGGTGCCGGGGGGTAAC
GGGGAGAAGGGGGAAGCCAATCTTAAAGGGGAGCAAGGGTTTTTCATCTTCTCCATCTCCTCCCTGATTGCCCTCTGCCTGTCCTCCACCTCCATGATCATGTTTCTGGC
CATTCTGACCTCCAGGTTCGTCCCCAAACACTTCGGCTCGGTGCTCCCCTGGAAGTTGTTCATCGGACTATCCTGCCTTTACTTCTCCATCATCGCCATGCTGGTCAGCT
TCTGCTCCGGCCACTACTTCCTCATCATCAGCCGCCTCCACAACGACGCTGTTCTGTTCTACACCCTTATATTTCTCCCAGTCGCATTCATCTTCGGAATTGTGCAGCTT
CCTCTCTACTTCGATTTGCTGCAGGCTATTCTCAAAAGTGTCCCTAAAAGGAGTTCCGCCGTCGTACATCGTTAACATTTTTTCCTCTCTAAGTTCGCATATTTATTACC
TTATGTGCTCACTCACTCTCTTCCTTTCTCAACTAAATTCTTCCACTCTTTCATCGTATTCTTCCAATATCTCATATTTATTATCATTGCAAACTGTATAACTTAATTAC
GTGACTGATTGCATATTCAAATCCCATGAATATCAAACAAAGACATAATGTTAGAGATATCAAACATTTTTTTTTTAAAACATAAAACTAAAATCAATTATCAAGGACGA
AAAACTAGAAACACAAAACTAAAAATAAAATGGTTATCAAACAAAGACATAATGTTA
Protein sequenceShow/hide protein sequence
MSKNVDEVGDALKAKTKRGKWSEVVTIYEQNLEVAQSVKLNLRGDTALHLAVIDNREEIVEKMVRKIGRTEEEKKKLLEDSINERGNSPLHLAAVMGSVRMCCAIAYANG
ALVNKRNAVDETPLFLAAMYGNKDAFFCLYYFCGDDATIIAQNCRVSKGGDTVLHRALNNEQFDLAFQLIHINKDSVNWVNQQGFTPLHVLASKPTSFKTGSHIKGWRNI
VYYATFVNQLKPQKIKTIFKGLERSIGKANTSPRSRFPPNYQTCTHFFTNLYHFFLIVSGVKKKSDEEEKDLEKDNDDEKEEEEEEEEEEAMNIDEDESTMFPINYSTCI
DFFQFVLSGIMIILGFGSSEIRKIRKKKEKHTWAIQVMDKLIESATMQKYNDHGGPPMSSKYQTKEEKEQGTVPYIVKDGRVHFPDTLNTNNNNEHGTPKDVKPTDTVML
IAAQTGVIDIVRALLERFPLAIRETQRDKKNVVLLAAEFRQPEVYKFLLKKRKGIENLFRAVDENGNSALHLAANALDPKLWLITGAAFQMQWEVKWYKYVKESVPLHTF
TRYNKKGETAREIFHESHIDLVAKGGDWLYKTSESCSLVATLIATVAFATATTVPGGNGEKGEANLKGEQGFFIFSISSLIALCLSSTSMIMFLAILTSRFVPKHFGSVL
PWKLFIGLSCLYFSIIAMLVSFCSGHYFLIISRLHNDAVLFYTLIFLPVAFIFGIVQLPLYFDLLQAILKSVPKRSSAVVHR