; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025456 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025456
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr10:13290635..13298333
RNA-Seq ExpressionLag0025456
SyntenyLag0025456
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR018289 - MULE transposase domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022940456.1 uncharacterized protein LOC111446049 [Cucurbita moschata]5.7e-15858.33Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        V LKL +FD+SLLEM D EGKTALQVLAN+P AFRSG + TF+ES IY  LPS+ I+                    + K  + ++   + S  +KNKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAG--SNRP-------NCCSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG  SNRP         C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+ G+  +++           + 
Subjt:  EDLEAG--SNRP-------NCCSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDI--VKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTV
        +DI  ++ K   +  +I   E+ D HETPLLLAAANGI E+V++I +A+PQA+DY+T Q RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT 
Subjt:  VDI--VKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTV

Query:  LHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGG
        LHHVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     K TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGG
Subjt:  LHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGG

Query:  LNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVAT
        LNS TGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F+M  FL+ LPLKLS+GFQLLF+SVA+TMM FALTIVLTVK+E+MKWTV LLY+AT
Subjt:  LNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVAT

Query:  FFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        FFPVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  FFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

XP_022974704.1 uncharacterized protein LOC111473443 [Cucurbita maxima]3.0e-15958.36Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        VVLKL +FDKSLLEM D EGKTALQVLAN+P AF SG ++TF+ES IY  LP++ I+                    N K  + ++   + S +++NKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG+N    PN       C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+TG+  +++   K+D   +  +
Subjt:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH
        + I  + E      I   E+ D HETPLLLAAANGI E+V++I +A+P A+DY+T   RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT LH
Subjt:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH

Query:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN
        HVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     + TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGGLN
Subjt:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN

Query:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF
        SKTGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F++ DFLH LPLKLS+GFQLLF+SVA+TMM FALTIVLT+K+E+MKWTV LLY+ATFF
Subjt:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF

Query:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        PVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

XP_022975228.1 uncharacterized protein LOC111474320 [Cucurbita maxima]7.2e-16158.91Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        VVLKL +FDKSLLEM D EGKTALQVLAN+P AF SG ++TF+ES IY  LP++ I+                    N K  + ++   + S +++NKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG+N    PN       C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+TG+  +++   K+D   +  +
Subjt:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH
        + I  + E      I   E+ D HETPLLLAAANGI E+V++I +A+PQA+DY+T   RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT LH
Subjt:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH

Query:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN
        HVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     K TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGGLN
Subjt:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN

Query:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF
        SKTGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F+M DFLH LPLKLS+GFQLLF+SVA+TMM FALTIVLT+K+E+MKWTV LLY+ATFF
Subjt:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF

Query:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        PVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

XP_031739511.1 uncharacterized protein LOC101203890 [Cucumis sativus]2.0e-16360.32Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        VVLKL +FDKSLLEMT+ EGKTAL VLAN+P AF+SGY M F+ESIIY+ LP++ I+   +S  GS + D                 NGY   ++   KN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAG---SNRPNCCSHSNCW------------------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTE-PEPENTKVDTTGLDSTNKGTNKVD
        EDLEAG   S  PN   HSNCW                  WPQ +E++E K+QHKLALT+T MLA ID SW +T+   PE T+VD+ GL    +G   VD
Subjt:  EDLEAG---SNRPNCCSHSNCW------------------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTE-PEPENTKVDTTGLDSTNKGTNKVD

Query:  TTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDA
              ++       + +  +I  IEF + HETPLLLAAANGI E+V++IVE  PQA+DYVT  QRN+LHVAI HRQK +FKWI+   L+M+RLV RID 
Subjt:  TTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDA

Query:  LGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAY
        LGFT LHHVG+TKF R  THGPAL+LQ EL WY+ V+  IP LY +HH      K T RE F +TH+ M + AKEW+KKTSESCSAVAVLVATVVFAAAY
Subjt:  LGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAY

Query:  TVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVL
        TVPGGLNSKTGSP+LLTEPI++VFTVMD++ALA+AL+SVVLFLSILTS F+M DFLH LP+KLSIGFQLLF SVASTMM FALTIVLTVKSE+MKWTV L
Subjt:  TVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVL

Query:  LYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        LY+ATFFPVTMFIIIQLPLY++LV+N+W Y H +  FLPMGF     K P K L+RK
Subjt:  LYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

XP_038894049.1 uncharacterized protein LOC120082801 isoform X1 [Benincasa hispida]3.8e-17061.4Show/hide
Query:  KVVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKK
        +V LKL++FDKSLLEMTD EGKTAL VLAN+P AFRSGY+M F+ESIIY  LPS+ I+K ++S   S  K                 DNG +S  + N K
Subjt:  KVVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKK

Query:  NEDLEAGSNR--PNCCSHSNCW------------------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTE-PEPENTKVDTTGLDSTNKGTNKVD
        N+DLEAG+N   P+   HSNCW                  WPQ + ++E K+QH+LALT+TKMLA +D SW +T+   PENT+VD+ G+   N+G     
Subjt:  NEDLEAGSNR--PNCCSHSNCW------------------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTE-PEPENTKVDTTGLDSTNKGTNKVD

Query:  TTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDA
          G+  N+   +   + +  ++  IE+ D HETPLLLAAANGI E+V++IVE +PQA+DY+T  QRN+LHVAI +RQK++F WI+   L+M+RLV RIDA
Subjt:  TTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDA

Query:  LGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAY
        LGFT LHHVGITKF R  THGPAL+LQHELKWY+ V+  IP LY +HH +    K TARE F +TH+ M E+AKEW+KKTSESCSAVAVLVATVVFAAAY
Subjt:  LGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAY

Query:  TVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVL
        TVPGGLNS+TGSPVLLTEPI+IVFT+MD++ALA+ALTSVVLFLSILTS F+M DFLH LPLKLSIGFQLLF SVASTMM FALTIVLTVKSE+MKWTV L
Subjt:  TVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVL

Query:  LYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        LY+ATFFPVTMFIIIQLPLY++LV+N+W+YRH +  FLPMGF    WK P K  +RK
Subjt:  LYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

TrEMBL top hitse value%identityAlignment
A0A5A7TMX3 Protein ACCELERATED CELL DEATH 6-like3.0e-12070.91Show/hide
Query:  IEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGITKFSRSETHGPAL
        +E+ D HETPLLLAAANGI E+V++IV  +PQA+DYVT  QRN+LHVAI HRQK +FKWI+   L+M+RLV RIDALGFT LHHVGITKF     HGPAL
Subjt:  IEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGITKFSRSETHGPAL

Query:  ELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPVLLTEPIFIVF
        +LQHELKWY+ V+  IP LY +HH      K TAR+ F +T + M ++AK+W+KKTSESCS VAVLVATVVFAAAYTVPGGLN+KTGSPVLLTEPI+IVF
Subjt:  ELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPVLLTEPIFIVF

Query:  TVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKLV
        TVMD+IALA+ALTSVVLFLSILTS F+M +FLH LP+KLSIGFQLLF SV ST M FALTIVLT+KSE+MKWT+ LLY+ATFFPVTMFIIIQLPLY++LV
Subjt:  TVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKLV

Query:  QNMWNYRHILFSFLPMGFFTRSWKAPPKFL
        +N+W+YR  +F FLP GF    WK P K L
Subjt:  QNMWNYRHILFSFLPMGFFTRSWKAPPKFL

A0A6J1FII8 uncharacterized protein LOC1114460492.8e-15858.33Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        V LKL +FD+SLLEM D EGKTALQVLAN+P AFRSG + TF+ES IY  LPS+ I+                    + K  + ++   + S  +KNKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAG--SNRP-------NCCSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG  SNRP         C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+ G+  +++           + 
Subjt:  EDLEAG--SNRP-------NCCSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDI--VKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTV
        +DI  ++ K   +  +I   E+ D HETPLLLAAANGI E+V++I +A+PQA+DY+T Q RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT 
Subjt:  VDI--VKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTV

Query:  LHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGG
        LHHVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     K TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGG
Subjt:  LHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGG

Query:  LNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVAT
        LNS TGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F+M  FL+ LPLKLS+GFQLLF+SVA+TMM FALTIVLTVK+E+MKWTV LLY+AT
Subjt:  LNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVAT

Query:  FFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        FFPVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  FFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

A0A6J1IIE1 uncharacterized protein LOC1114734431.5e-15958.36Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        VVLKL +FDKSLLEM D EGKTALQVLAN+P AF SG ++TF+ES IY  LP++ I+                    N K  + ++   + S +++NKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG+N    PN       C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+TG+  +++   K+D   +  +
Subjt:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH
        + I  + E      I   E+ D HETPLLLAAANGI E+V++I +A+P A+DY+T   RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT LH
Subjt:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH

Query:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN
        HVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     + TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGGLN
Subjt:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN

Query:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF
        SKTGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F++ DFLH LPLKLS+GFQLLF+SVA+TMM FALTIVLT+K+E+MKWTV LLY+ATFF
Subjt:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF

Query:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        PVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

A0A6J1IIM4 uncharacterized protein LOC1114743203.5e-16158.91Show/hide
Query:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN
        VVLKL +FDKSLLEM D EGKTALQVLAN+P AF SG ++TF+ES IY  LP++ I+                    N K  + ++   + S +++NKKN
Subjt:  VVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKN

Query:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN
         DLEAG+N    PN       C     W      WPQ +EM++ KR HKL +T+T  LAQIDNSW +T+  P+ T++D+TG+  +++   K+D   +  +
Subjt:  EDLEAGSN---RPNC------CSHSNCW------WPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDN

Query:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH
        + I  + E      I   E+ D HETPLLLAAANGI E+V++I +A+PQA+DY+T   RNILHVAI HR+K+IF WI KR L+M+RLV RID +GFT LH
Subjt:  VDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLH

Query:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN
        HVGITKF    THGPAL+LQ ELKWY+ V+  IP LYT+HH+     K TARE FE+TH  + E+ KEW+KKTSESCSAVAVL++TVVFAAAYTVPGGLN
Subjt:  HVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN

Query:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF
        SKTGSPVLLTEPI+IVFT+MD+I LA+ALTS+VLFLS+LTS F+M DFLH LPLKLS+GFQLLF+SVA+TMM FALTIVLT+K+E+MKWTV LLY+ATFF
Subjt:  SKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFF

Query:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK
        PVTMFIIIQ+PLY++LV+N+W YRH L  F PMGF    W AP K LTRK
Subjt:  PVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRK

A0A6J1JXD5 uncharacterized protein LOC1114905434.3e-12748.66Show/hide
Query:  SAVGTPSAVRPPSATTTPKVVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSC-DKDFESFNMKNKK
        +A GTP  +     +   ++V+ LVDFDKSLLEM D   KTAL VLAN+PH F SGY   FW  +IY++LP    +  NF   G   + D  S + K+KK
Subjt:  SAVGTPSAVRPPSATTTPKVVLKLVDFDKSLLEMTDEEGKTALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSC-DKDFESFNMKNKK

Query:  KEDLQADNGYESLNMKN---KKNEDLEAGSNRPNCCSHSNCWWPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKV
         EDL+A  G++S    N   K ++ L   S R          W + +E++  K++HKL L +T+ML  ID S Y+ + +P++T +D +            
Subjt:  KEDLQADNGYESLNMKN---KKNEDLEAGSNRPNCCSHSNCWWPQLREMHENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKV

Query:  DTTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRID
                     EK    IVD   IE+KD HETPLLLAAA GI E+VE+I+EAHP+A+DYVTE+ RNILHV I HRQ  IF+WI+++ L+M RL +RID
Subjt:  DTTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRID

Query:  ALGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAA
         LG+TVLHHVGITKF    T GPA++LQHEL+W+  V   +P LY + ++  + N    RE F+ETHKDM + AKEW+KKTSESCSAVAVL+ATV FAAA
Subjt:  ALGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAA

Query:  YTVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVV
        +TVPGGLNSKTGSP+LL++PI++ F  +D+ +L S+L+S+VLFL ILTSPF++  F H L ++LS+GF LLF+SVASTM+ FA+ IVLT+KS  M W   
Subjt:  YTVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVV

Query:  LLYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRKA
        LL++ T  P+ +F++++LPL L+L +++W     L+  LPMGF T  ++ P KFL+ K+
Subjt:  LLYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein1.0e-3232.99Show/hide
Query:  LLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGI-TKFSR-SETHGPALELQHELKW
        L  A  NGI E +E+++  +P  +        NI   A+  RQ+ IF  I   G   + L    D     +LHH       SR +   G AL++Q EL+W
Subjt:  LLLAAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGI-TKFSR-SETHGPALELQHELKW

Query:  YQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPVLLTEPIFIVFTVMDVIAL
        ++ V K +   +      K+  K+T + LF + HKD+ E  ++WMK+T+ SC+ VA L+ T++F++A+TVPGG  S  G P+ + +  F +F + D I+L
Subjt:  YQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPVLLTEPIFIVFTVMDVIAL

Query:  ASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKL
         ++  S+++FL IL S ++  DFL  LP KL +G   LF+S+A+ ++ F +T+ +T+  E++ W           P+ MF+++Q P+ L++
Subjt:  ASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKL

AT3G54070.1 Ankyrin repeat family protein8.5e-2730.33Show/hide
Query:  AAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALG-FTVLHHVG-ITKFSRSET-HGPALELQHELKWYQ
        AA  G  E++  ++ +H   +  V    R + HVA L+R ++IF  I + G +   +    +     T+LH V  +   +R +   G AL +Q EL W++
Subjt:  AAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALG-FTVLHHVG-ITKFSRSET-HGPALELQHELKWYQ

Query:  LVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN------SKTGSPVLLTEPIFIVFTVMD
         V++ +P  Y      K T    A ++F E H+++ +  + WMK+T+ +C   A L+ATVVFAAA T+PGG +      +  G P      +F +FT+ D
Subjt:  LVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLN------SKTGSPVLLTEPIFIVFTVMD

Query:  VIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKLVQNMW
         +AL S++ S+V+FLSI TS +   DF + LP KL  G   LF+S+ S ++ F  +++L ++ E+   ++VL+          F  +   L+   +++++
Subjt:  VIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKLVQNMW

AT5G04700.1 Ankyrin repeat family protein4.1e-2932.43Show/hide
Query:  ANGIKEMVEKIVEAHPQA-----IDYVTEQQRN---------------ILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGITK--FSRS
        A G+KE  E + EA   A     +D++ E  RN               +  +A+  RQ+ +F  +         L+   D  G  VLH  G        S
Subjt:  ANGIKEMVEKIVEAHPQA-----IDYVTEQQRN---------------ILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGITK--FSRS

Query:  ETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGL--NSKTGSPVL
           G  L+LQ EL+W++ V +  PE   +      T ++T  E+F + H+ + + A++WMK T+ SCS VA L+ TV FAA +TVPGG   NSK G P  
Subjt:  ETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGL--NSKTGSPVL

Query:  LTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIII
        L +  FI+F V D+I+  ++ TSV++FL ILT+ +   DFL  LP K+  G  +LFVS+A+ ++ F+ + + T+  ++ KW V    +    P  +F+++
Subjt:  LTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIII

Query:  QLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWK
        Q PL           + ++FS    G F R+ K
Subjt:  QLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWK

AT5G04730.1 Ankyrin-repeat containing protein6.7e-3231.25Show/hide
Query:  DIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQ-RNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVG--ITKF
        DI+  N+I +K+     LL AA +G ++   +I++ + Q +  +     RN+  +A+  +++ IF  I         L++  D     +LH  G   T  
Subjt:  DIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAHPQAIDYVTEQQ-RNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVG--ITKF

Query:  SRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPV
          S+  G AL++Q E +W++ V   + E   +    K  + +T R++FE  H+ + +  +EWMK T+ +CS VA L+ATV F A +TVPGG++  +GSP+
Subjt:  SRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPV

Query:  LLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLY---VATFFPVTM
        +L +  F  F   D +A  ++  SV++FLSILTS +   DF+  LP K+ +G  +LF+S+AS ++ F     +T  S  M+    L+Y       FP  +
Subjt:  LLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLY---VATFFPVTM

Query:  FIIIQLPLYLKLVQNMWNYR
        F+++Q PL  +++ + +  R
Subjt:  FIIIQLPLYLKLVQNMWNYR

AT5G35810.1 Ankyrin repeat family protein2.0e-3131.05Show/hide
Query:  TPLLL--AAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRL-VKRIDALGFTVLHHVG-ITKFSRSE-THGPALELQ
        +P+LL  AA +G  E++  ++ ++P  I  V  + +++ H+A ++R + IF  I + G +   + + +       +LH V  +   +R +   G AL++Q
Subjt:  TPLLL--AAANGIKEMVEKIVEAHPQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRL-VKRIDALGFTVLHHVG-ITKFSRSE-THGPALELQ

Query:  HELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSK-----TGSPVLLTEPIFI
         E+ WY+ V++ +P +Y      KE     A +LF + H ++ +  ++WMK+T+ +C  V+ L+ATVVFAAA+T+PGG ++       G P    E  F 
Subjt:  HELKWYQLVRKHIPELYTLHHTNKETNKRTARELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSK-----TGSPVLLTEPIFI

Query:  VFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLK
        VF + D +AL S++TS+++FLSILTS +    F   LP KL +G   LFVS+ S ++ F  T++L ++ ++ KW+++LL          F+++   L+  
Subjt:  VFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLSIGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLK

Query:  LVQNMW
         +++ +
Subjt:  LVQNMW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAGGTGTTTCATATCAATTTTCATTATAATGGTCGTATGGTATTCCTCCCTAGACTGAACTACATTGGTGGAAGGCAAGCCGTTATTCAAGGGATAGAT
TGTGATAGGGTCTCGTTGATAGAGTTACGCATGATGTTGACTGATCTTAATGTATTCAATGTTGAGGAGTTTTACTATGTTAGGCTTGAGTATACACTGTCTACT
GAGTTCAAGGTCTTGTTAACTGATTCTGATGTCATACAAACGGTTAGTATAGTTCCTGAAAGGTCTGACGAAGAAGAGGATGTTAGGGGTCCTGAATTTAGGGCT
GAGACGAATATGAATCAGGTTGAGTTTAGGTTAGGGATGAGGTTTACCAGTCCGAAGTGTAAATGGAGGTTGCATGCTAGTGTTATTGAAAACGATAACACTATT
CAAATTAAGACATACATGGGTGATCATACATGCAGTAGGGAATTTGCAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACAGTTTAGGAGT
CAACCTAATTGGTCTCTTGCTGAAATTGTTGAGGAGGTGAAGAAGGACTATCAGGTAAGAATTTCTCGAAGTCAAGCCTACAGGGCAAGGCGTAAGGCAAACAGA
AGGATCAATGGTAGTCTTAATGAACAATATAAGAAATTGTGGGATTATGCTGCAAAGATTAGACGGTCCAATGTTGGATCAACGGTGAAAATTCAATGTGAGAGG
GTTACTGAAGATTCTCCCCCTGTTTTTAGGAGATGTTATGTTTGTTTTAAGGCTTGCAAGGACGGGTTTTTGGCGGGTTGCAGGCCATTTATTGCTTTAGATGCC
TGCCATCTAAAAGGTCCTTGTCAAGGACAATTAATGGCTGCAATTGGTGTTGATGGTGATGACAGTTTGTACCCTATTGCGTGGGCCATAGTTGAGGCAGAAACG
AAGGATAGCTGGACATGGTTCCTCGTTTTGCTACAGACAGACATTGGATGTTTCACATCTAAAGGATACACATTCATCTCAGATCAACAAAAGGGTTTAGTTCCG
ACTTTTGAGGAGTTTTTTCCAAATGTGGCTCATCGTTTCTGTGTGCGACATTTGTATGGAAATTTTCAAAAAGAATTTAAAGGGTTGGCATTAAAGACTTTGTTT
TGGAGGGTTGCTAGGTGTAACAACATGAAGGATTTTAATGATGTTATGACTGAGATTGGTAGGTTAAGTGTGGGGGCTTACCAATATTTGAGTGTGATTCCTAGC
AAGTATTGGTCTAGGCATGCCTTTCAATGTTATCCGAAGTGTGATTTCTTGCTTAATAATTTATCGGAGTCATTTAATGCACTTATACTGCTAGCAAGAAACAAG
CCCATAATTCAACTATTTGAAACCATTAGGAAGTTGGTAATGGTTAGGATAACCAAGAAGCATGAAAAAGGTAATAGGATGGAAAAGGAAATTGGAAATAGAATT
CTGAAGAAACTTGAGAAGAATATAAGAAGCTCTAGGAGTGCAATACCTTTGTGGGCTGGGAATGGGAAATTTGAAGTGGAGTCAGGTCCAGTGAAGAAGAGATCG
AGGAAAGCATCAAGTTCTACATTCATGGGGTTTAGTATTCCATGTGAACATATTGGAACACAAGAGTCTGTTGCTGGAACTTCATCTACTCCACAAAGGACTCCA
TCTGCTTCTGGGACTGCTTATACTGCAACTGCATATGCTGCTGGGACTCCTTATGCTGCTGGCCATCCGTCTGCTGCTTCAACTCCAAGTGCTGTCGGGACTCCA
TCTGCCGTTAGACCTCCATCTGCTACTACAACTCCAAAGGTGGTTTTAAAGCTGGTGGATTTTGACAAGTCACTGTTGGAGATGACGGATGAAGAAGGCAAAACA
GCTCTCCAAGTGCTGGCCAACATTCCACATGCTTTCCGAAGTGGATACACCATGACATTCTGGGAATCAATTATCTATCATTTCCTTCCAAGCAAGGGCATTTTC
AAATGTAATTTTTCCTATCATGGGTCTTGCGACAAGGATTTTGAAAGTTTCAACATGAAAAACAAGAAGAAAGAAGATTTACAAGCAGACAACGGCTATGAAAGT
TTGAATATGAAAAACAAGAAGAATGAAGATTTAGAAGCAGGATCTAACCGGCCCAATTGTTGTTCTCATTCAAATTGTTGGTGGCCGCAATTGAGAGAAATGCAC
GAGAACAAGCGACAACATAAATTAGCTCTAACTCTCACCAAGATGCTTGCCCAAATTGATAATTCATGGTATCGAACCGAACCAGAACCCGAAAACACCAAAGTC
GACACCACTGGACTCGACTCGACTAACAAAGGCACCAATAAAGTCGACACCACTGGACTCCCCGATAACGTCGATATTGTCAAAGAAAAAGAAAATGTCGATATT
GTGGACATTAACAAAATTGAATTTAAGGATCAGCACGAAACCCCTCTCCTTCTGGCGGCCGCCAATGGAATCAAGGAGATGGTGGAGAAGATTGTGGAAGCGCAT
CCACAAGCCATCGATTACGTGACGGAGCAGCAGAGGAACATCTTACACGTGGCGATCCTGCACCGGCAGAAGAGCATCTTCAAATGGATTGAAAAGCGAGGGTTG
TTAATGTCCAGACTGGTTAAGAGGATTGACGCGTTGGGATTCACGGTGCTGCACCATGTCGGAATCACGAAATTCTCCCGCAGTGAAACCCACGGCCCTGCACTG
GAGCTTCAACATGAGCTCAAATGGTATCAGCTCGTTCGGAAACATATCCCGGAACTCTACACACTACACCATACCAACAAGGAGACCAATAAACGTACGGCTCGA
GAGTTGTTCGAGGAAACTCACAAAGACATGTCGGAAAATGCAAAAGAGTGGATGAAGAAAACCTCCGAGTCCTGCTCTGCCGTCGCCGTCCTGGTCGCCACCGTT
GTCTTTGCTGCAGCCTACACCGTCCCTGGCGGCCTCAACAGCAAGACCGGCTCTCCGGTCCTCCTCACCGAACCCATCTTCATCGTCTTCACGGTCATGGACGTC
ATCGCTCTCGCCTCCGCGCTCACCTCCGTGGTTTTGTTTCTCTCAATTCTCACATCGCCATTTCAGATGGGTGATTTCCTGCATGTGCTGCCGCTGAAATTGTCG
ATTGGATTTCAGTTACTCTTCGTGTCGGTTGCCAGCACCATGATGGTCTTTGCGCTCACCATCGTGCTCACCGTCAAGTCGGAGCAGATGAAGTGGACGGTGGTT
TTGCTGTACGTGGCCACGTTTTTTCCGGTGACCATGTTTATAATAATTCAGCTGCCGTTGTATTTGAAGCTCGTCCAGAATATGTGGAATTATCGCCACATTCTT
TTTAGTTTCCTTCCCATGGGTTTCTTCACCAGATCTTGGAAGGCTCCTCCTAAATTTTTGACGAGGAAAGCTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGAGGTGTTTCATATCAATTTTCATTATAATGGTCGTATGGTATTCCTCCCTAGACTGAACTACATTGGTGGAAGGCAAGCCGTTATTCAAGGGATAGAT
TGTGATAGGGTCTCGTTGATAGAGTTACGCATGATGTTGACTGATCTTAATGTATTCAATGTTGAGGAGTTTTACTATGTTAGGCTTGAGTATACACTGTCTACT
GAGTTCAAGGTCTTGTTAACTGATTCTGATGTCATACAAACGGTTAGTATAGTTCCTGAAAGGTCTGACGAAGAAGAGGATGTTAGGGGTCCTGAATTTAGGGCT
GAGACGAATATGAATCAGGTTGAGTTTAGGTTAGGGATGAGGTTTACCAGTCCGAAGTGTAAATGGAGGTTGCATGCTAGTGTTATTGAAAACGATAACACTATT
CAAATTAAGACATACATGGGTGATCATACATGCAGTAGGGAATTTGCAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACAGTTTAGGAGT
CAACCTAATTGGTCTCTTGCTGAAATTGTTGAGGAGGTGAAGAAGGACTATCAGGTAAGAATTTCTCGAAGTCAAGCCTACAGGGCAAGGCGTAAGGCAAACAGA
AGGATCAATGGTAGTCTTAATGAACAATATAAGAAATTGTGGGATTATGCTGCAAAGATTAGACGGTCCAATGTTGGATCAACGGTGAAAATTCAATGTGAGAGG
GTTACTGAAGATTCTCCCCCTGTTTTTAGGAGATGTTATGTTTGTTTTAAGGCTTGCAAGGACGGGTTTTTGGCGGGTTGCAGGCCATTTATTGCTTTAGATGCC
TGCCATCTAAAAGGTCCTTGTCAAGGACAATTAATGGCTGCAATTGGTGTTGATGGTGATGACAGTTTGTACCCTATTGCGTGGGCCATAGTTGAGGCAGAAACG
AAGGATAGCTGGACATGGTTCCTCGTTTTGCTACAGACAGACATTGGATGTTTCACATCTAAAGGATACACATTCATCTCAGATCAACAAAAGGGTTTAGTTCCG
ACTTTTGAGGAGTTTTTTCCAAATGTGGCTCATCGTTTCTGTGTGCGACATTTGTATGGAAATTTTCAAAAAGAATTTAAAGGGTTGGCATTAAAGACTTTGTTT
TGGAGGGTTGCTAGGTGTAACAACATGAAGGATTTTAATGATGTTATGACTGAGATTGGTAGGTTAAGTGTGGGGGCTTACCAATATTTGAGTGTGATTCCTAGC
AAGTATTGGTCTAGGCATGCCTTTCAATGTTATCCGAAGTGTGATTTCTTGCTTAATAATTTATCGGAGTCATTTAATGCACTTATACTGCTAGCAAGAAACAAG
CCCATAATTCAACTATTTGAAACCATTAGGAAGTTGGTAATGGTTAGGATAACCAAGAAGCATGAAAAAGGTAATAGGATGGAAAAGGAAATTGGAAATAGAATT
CTGAAGAAACTTGAGAAGAATATAAGAAGCTCTAGGAGTGCAATACCTTTGTGGGCTGGGAATGGGAAATTTGAAGTGGAGTCAGGTCCAGTGAAGAAGAGATCG
AGGAAAGCATCAAGTTCTACATTCATGGGGTTTAGTATTCCATGTGAACATATTGGAACACAAGAGTCTGTTGCTGGAACTTCATCTACTCCACAAAGGACTCCA
TCTGCTTCTGGGACTGCTTATACTGCAACTGCATATGCTGCTGGGACTCCTTATGCTGCTGGCCATCCGTCTGCTGCTTCAACTCCAAGTGCTGTCGGGACTCCA
TCTGCCGTTAGACCTCCATCTGCTACTACAACTCCAAAGGTGGTTTTAAAGCTGGTGGATTTTGACAAGTCACTGTTGGAGATGACGGATGAAGAAGGCAAAACA
GCTCTCCAAGTGCTGGCCAACATTCCACATGCTTTCCGAAGTGGATACACCATGACATTCTGGGAATCAATTATCTATCATTTCCTTCCAAGCAAGGGCATTTTC
AAATGTAATTTTTCCTATCATGGGTCTTGCGACAAGGATTTTGAAAGTTTCAACATGAAAAACAAGAAGAAAGAAGATTTACAAGCAGACAACGGCTATGAAAGT
TTGAATATGAAAAACAAGAAGAATGAAGATTTAGAAGCAGGATCTAACCGGCCCAATTGTTGTTCTCATTCAAATTGTTGGTGGCCGCAATTGAGAGAAATGCAC
GAGAACAAGCGACAACATAAATTAGCTCTAACTCTCACCAAGATGCTTGCCCAAATTGATAATTCATGGTATCGAACCGAACCAGAACCCGAAAACACCAAAGTC
GACACCACTGGACTCGACTCGACTAACAAAGGCACCAATAAAGTCGACACCACTGGACTCCCCGATAACGTCGATATTGTCAAAGAAAAAGAAAATGTCGATATT
GTGGACATTAACAAAATTGAATTTAAGGATCAGCACGAAACCCCTCTCCTTCTGGCGGCCGCCAATGGAATCAAGGAGATGGTGGAGAAGATTGTGGAAGCGCAT
CCACAAGCCATCGATTACGTGACGGAGCAGCAGAGGAACATCTTACACGTGGCGATCCTGCACCGGCAGAAGAGCATCTTCAAATGGATTGAAAAGCGAGGGTTG
TTAATGTCCAGACTGGTTAAGAGGATTGACGCGTTGGGATTCACGGTGCTGCACCATGTCGGAATCACGAAATTCTCCCGCAGTGAAACCCACGGCCCTGCACTG
GAGCTTCAACATGAGCTCAAATGGTATCAGCTCGTTCGGAAACATATCCCGGAACTCTACACACTACACCATACCAACAAGGAGACCAATAAACGTACGGCTCGA
GAGTTGTTCGAGGAAACTCACAAAGACATGTCGGAAAATGCAAAAGAGTGGATGAAGAAAACCTCCGAGTCCTGCTCTGCCGTCGCCGTCCTGGTCGCCACCGTT
GTCTTTGCTGCAGCCTACACCGTCCCTGGCGGCCTCAACAGCAAGACCGGCTCTCCGGTCCTCCTCACCGAACCCATCTTCATCGTCTTCACGGTCATGGACGTC
ATCGCTCTCGCCTCCGCGCTCACCTCCGTGGTTTTGTTTCTCTCAATTCTCACATCGCCATTTCAGATGGGTGATTTCCTGCATGTGCTGCCGCTGAAATTGTCG
ATTGGATTTCAGTTACTCTTCGTGTCGGTTGCCAGCACCATGATGGTCTTTGCGCTCACCATCGTGCTCACCGTCAAGTCGGAGCAGATGAAGTGGACGGTGGTT
TTGCTGTACGTGGCCACGTTTTTTCCGGTGACCATGTTTATAATAATTCAGCTGCCGTTGTATTTGAAGCTCGTCCAGAATATGTGGAATTATCGCCACATTCTT
TTTAGTTTCCTTCCCATGGGTTTCTTCACCAGATCTTGGAAGGCTCCTCCTAAATTTTTGACGAGGAAAGCTGCTTGA
Protein sequenceShow/hide protein sequence
MDEVFHINFHYNGRMVFLPRLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRLEYTLSTEFKVLLTDSDVIQTVSIVPERSDEEEDVRGPEFRA
ETNMNQVEFRLGMRFTSPKCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLEQFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRARRKANR
RINGSLNEQYKKLWDYAAKIRRSNVGSTVKIQCERVTEDSPPVFRRCYVCFKACKDGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAET
KDSWTWFLVLLQTDIGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVRHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSVIPS
KYWSRHAFQCYPKCDFLLNNLSESFNALILLARNKPIIQLFETIRKLVMVRITKKHEKGNRMEKEIGNRILKKLEKNIRSSRSAIPLWAGNGKFEVESGPVKKRS
RKASSSTFMGFSIPCEHIGTQESVAGTSSTPQRTPSASGTAYTATAYAAGTPYAAGHPSAASTPSAVGTPSAVRPPSATTTPKVVLKLVDFDKSLLEMTDEEGKT
ALQVLANIPHAFRSGYTMTFWESIIYHFLPSKGIFKCNFSYHGSCDKDFESFNMKNKKKEDLQADNGYESLNMKNKKNEDLEAGSNRPNCCSHSNCWWPQLREMH
ENKRQHKLALTLTKMLAQIDNSWYRTEPEPENTKVDTTGLDSTNKGTNKVDTTGLPDNVDIVKEKENVDIVDINKIEFKDQHETPLLLAAANGIKEMVEKIVEAH
PQAIDYVTEQQRNILHVAILHRQKSIFKWIEKRGLLMSRLVKRIDALGFTVLHHVGITKFSRSETHGPALELQHELKWYQLVRKHIPELYTLHHTNKETNKRTAR
ELFEETHKDMSENAKEWMKKTSESCSAVAVLVATVVFAAAYTVPGGLNSKTGSPVLLTEPIFIVFTVMDVIALASALTSVVLFLSILTSPFQMGDFLHVLPLKLS
IGFQLLFVSVASTMMVFALTIVLTVKSEQMKWTVVLLYVATFFPVTMFIIIQLPLYLKLVQNMWNYRHILFSFLPMGFFTRSWKAPPKFLTRKAA