; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026392 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026392
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAnkyrin repeat family protein, putative
Genome locationscaffold402:267091..280628
RNA-Seq ExpressionMS026392
SyntenyMS026392
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142589.1 uncharacterized protein LOC111012667, partial [Momordica charantia]1.5e-14769.35Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQE-LIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYN
        MDF + M+  Y+T I+ +WKNL QE LI K +I+FPMTT GDTA+HLAVYS +EEPT++LLA  SE++ EFW N A NTPLHEAAAVGNLAAVKLLVEY 
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQE-LIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYN

Query:  KTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRS
        K DLLA NIYGETPLFRA RFGHLHIVEY LEDCED+FSRSPINWT  K+ PIIHAAIQSEKFDVVLRLVDF+KSLLEMKD + KTAL+VLANMPL FRS
Subjt:  KTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRS

Query:  GSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDH
        GS +  +E IIY VLPS+DIYKYNF  FGSS+KD+SE S A  NKKNEDLEAQ + D GL+FC SNCW                  WPQWKQLY KK+ H
Subjt:  GSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDH

Query:  KLAVTITKMLAQIDYSWLNTKPTPENNEVD---STGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVD
        KLAVTITKMLA+IDYSW NT  TPE NE+D     G   QI+      +  Q  EDDKS++KDI+YDD HETPLLLAAANGIIEIVEAIVDAHP A+D
Subjt:  KLAVTITKMLAQIDYSWLNTKPTPENNEVD---STGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVD

XP_022142681.1 uncharacterized protein LOC111012737, partial [Momordica charantia]2.5e-20894.34Show/hide
Query:  RERCNRGESMDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAA
        RERCNRGESMDFF+TMQIPYKTVI+EEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRA NTPLHEAAAVGNLAA
Subjt:  RERCNRGESMDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAA

Query:  VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA
        VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA
Subjt:  VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA

Query:  NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQ
        NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGL+FCHSNCW                  WPQWKQ
Subjt:  NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQ

Query:  LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE
        LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE
Subjt:  LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE

XP_023539599.1 uncharacterized protein LOC111800231 [Cucurbita pepo subsp. pepo]6.1e-12259.34Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        M  F+ M+  YK VI+EEWK++ +EL ++ +IVFPMTTS DTALHLAVYSGEE+P ++LLA ++ + ++FW+N A NTPLHEAA +GNLAAVKLLV+YNK
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DLL  NIYGETPLFRAAR GHL IVE+ LEDC+D+ SRSP NWTT    PIIHA IQS+KFDVVL+L +F++SLLEMKD E KTAL +LANMPLAF SG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDR-------------GLHFCHSNCWWPQWKQLYLKKRDHKLAVTI
        S L F ES IY +LP++DIY Y  SN   S+  ++  S   KNKKN DLEA  +S+R             GL +      WPQWK++Y KKR HKL VTI
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDR-------------GLHFCHSNCWWPQWKQLYLKKRDHKLAVTI

Query:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        TK LAQID SW  TK TP+  E+DSTG+ L  + +     +   +P +++ E  D EY+DHHETPLLLAAANGIIEIV+ I  A+PQAVDY
Subjt:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

XP_031739511.1 uncharacterized protein LOC101203890 [Cucumis sativus]9.5e-12362Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        MDF  TM  PY+ VI+EEWK + +EL D+T+IVFPMT S DTALHLAVYSG EEP R  L  + E+DE FW+N A NTPLHEAA VGNLAAVKLLVEY K
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DL+A NIYGETPLFRAAR GHL IV Y LEDCEDFFSR   +WT  K  PIIHAAIQS+KFDVVL+L +F+KSLLEM + E KTALHVLANMP AF+SG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGS-SATKNKKNEDLEAQRS-SDRGLHFCHSNCW------------------WPQWKQLYLKKRD
         P+KF ESIIY +LP+ DIY Y +SNFGSS+ D +  S S+    KNEDLEA  S S+     CHSNCW                  WPQWK+LY KK+ 
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGS-SATKNKKNEDLEAQRS-SDRGLHFCHSNCW------------------WPQWKQLYLKKRD

Query:  HKLAVTITKMLAQIDYSWLNTKPT-PENNEVDSTGVHLQIDGK-DFPTDTNQRPED-DKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        HKLA+TIT MLA ID+SW  T+ T PE  EVDS G+ L  +G  D    T     D ++ E  DIE+ +HHETPLLLAAANGIIEIV+ IV+  PQAVDY
Subjt:  HKLAVTITKMLAQIDYSWLNTKPT-PENNEVDSTGVHLQIDGK-DFPTDTNQRPED-DKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

XP_038894049.1 uncharacterized protein LOC120082801 isoform X1 [Benincasa hispida]7.7e-13362.91Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        MD    M++PY+ VI+EEWK + +EL DKT+IVFPMT SGDTA+HLAVYSGEEEP RALL  +SE+DE FW+N A NTPLHEAA VGNLAAVKLLVEY+K
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DL+  NIYGETPLFRAAR GHLHIV Y LEDCED FSRS  NWTT K  P+IHAAIQS+KF+V L+L++F+KSLLEM DPE KTALHVLANMP AFRSG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDHK
          +KF ESIIY +LPS+DIYK+++S F SS KD    SS   N KN+DLEA  +S    +F HSNCW                  WPQWK LY KK+ H+
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDHK

Query:  LAVTITKMLAQIDYSWLNTKPT-PENNEVDSTGVHLQIDGKDFPTD---TNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        LA+TITKMLA +D+SW  T+ T PEN EVDS G+H   +G     +   T    + ++ E +DIEY DHHETPLLLAAANGIIEIV+ IV+ +PQAVDY
Subjt:  LAVTITKMLAQIDYSWLNTKPT-PENNEVDSTGVHLQIDGKDFPTD---TNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

TrEMBL top hitse value%identityAlignment
A0A6J1CLX1 uncharacterized protein LOC1110126677.0e-14869.35Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQE-LIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYN
        MDF + M+  Y+T I+ +WKNL QE LI K +I+FPMTT GDTA+HLAVYS +EEPT++LLA  SE++ EFW N A NTPLHEAAAVGNLAAVKLLVEY 
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQE-LIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYN

Query:  KTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRS
        K DLLA NIYGETPLFRA RFGHLHIVEY LEDCED+FSRSPINWT  K+ PIIHAAIQSEKFDVVLRLVDF+KSLLEMKD + KTAL+VLANMPL FRS
Subjt:  KTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRS

Query:  GSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDH
        GS +  +E IIY VLPS+DIYKYNF  FGSS+KD+SE S A  NKKNEDLEAQ + D GL+FC SNCW                  WPQWKQLY KK+ H
Subjt:  GSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQLYLKKRDH

Query:  KLAVTITKMLAQIDYSWLNTKPTPENNEVD---STGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVD
        KLAVTITKMLA+IDYSW NT  TPE NE+D     G   QI+      +  Q  EDDKS++KDI+YDD HETPLLLAAANGIIEIVEAIVDAHP A+D
Subjt:  KLAVTITKMLAQIDYSWLNTKPTPENNEVD---STGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVD

A0A6J1CMX0 uncharacterized protein LOC1110127371.2e-20894.34Show/hide
Query:  RERCNRGESMDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAA
        RERCNRGESMDFF+TMQIPYKTVI+EEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRA NTPLHEAAAVGNLAA
Subjt:  RERCNRGESMDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAA

Query:  VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA
        VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA
Subjt:  VKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLA

Query:  NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQ
        NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGL+FCHSNCW                  WPQWKQ
Subjt:  NMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCW------------------WPQWKQ

Query:  LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE
        LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE
Subjt:  LYLKKRDHKLAVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIE

A0A6J1FII8 uncharacterized protein LOC1114460494.8e-12059.08Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        M  F+ M+  YK VI+EEWK++ +EL ++ +IVFPMTTS DTALHLAVYSGEE+P ++LLA ++ + ++FW+N A NTPLHEAA +GNLAAVKLLV+YNK
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DLLA NIYGETPLFRAAR GHL IV++ LEDC+D+ SRSP NWTT    PIIHA IQS+KFDV L+L +F++SLLEMKD E KTAL VLANMPLAFRSG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDR-------------GLHFCHSNCWWPQWKQLYLKKRDHKLAVTI
        S   F ES IY +LPS+DIY Y  SN   S+  ++  S   KNKKN DLEA  +S+R             GL +      WPQWK++Y KKR HKL VTI
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDR-------------GLHFCHSNCWWPQWKQLYLKKRDHKLAVTI

Query:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        T  LAQID SW  TK TP+  E+DS G+ L  + +     +   +P +++ E  D EY+DHHETPLLLAAANGIIEIV+ I  A+PQAVDY
Subjt:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

A0A6J1IIE1 uncharacterized protein LOC1114734437.3e-12159.59Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        M  F+ M+  YK VI+EEWKN+ +EL ++ +IVFPMTTS DTALHLAVYSGEE+P ++LLA +S + ++FW+N A NTPLHEAA +GNLAAVKLLV+YNK
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DLLA NIYGETP+FRAAR GHL IVE+ LEDC+D+ SRSP NWTT    PIIHA IQS+KFDVVL+L +F+KSLLEMKD E KTAL VLANMPLAF SG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSD-------------RGLHFCHSNCWWPQWKQLYLKKRDHKLAVTI
        S L F ES IY +LP++DIY Y  SNF SS   ++  S + +NKKN DLEA  +S+              GL +      WPQWK++Y KKR HKL VTI
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSD-------------RGLHFCHSNCWWPQWKQLYLKKRDHKLAVTI

Query:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        T  LAQID SW  TK TP+  E+DSTG+    + +          P +++ E  D EY+DHHETPLLLAAANGIIEIV+ I  A+P AVDY
Subjt:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

A0A6J1IIM4 uncharacterized protein LOC1114743201.1e-12159.85Show/hide
Query:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK
        M  F+ M+  YK VI+EEWKN+ +EL ++ +IVFPMTTS DTALHLAVYSGEE+P ++LLA +S + ++FW+N A NTPLHEAA +GNLAAVKLLV+YNK
Subjt:  MDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNK

Query:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG
         DLLA NIYGETP+FRAAR GHL IVE+ LEDC+D+ SRSP NWTT    PIIHA IQS+KFDVVL+L +F+KSLLEMKD E KTAL VLANMPLAF SG
Subjt:  TDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLANMPLAFRSG

Query:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSD-------------RGLHFCHSNCWWPQWKQLYLKKRDHKLAVTI
        S L F ES IY +LP++DIY Y  SNF SS   ++  S + +NKKN DLEA  +S+              GL +      WPQWK++Y KKR HKL VTI
Subjt:  SPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSD-------------RGLHFCHSNCWWPQWKQLYLKKRDHKLAVTI

Query:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY
        T  LAQID SW  TK TP+  E+DSTG+    + +          P +++ E  D EY+DHHETPLLLAAANGIIEIV+ I  A+PQAVDY
Subjt:  TKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFP-TDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDY

SwissProt top hitse value%identityAlignment
B1AK53 Espin3.3e-0930.17Show/hide
Query:  LQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALL-------AAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPL
        LQ L+S     +     S  T LHLA   G  E    LL        A ++M  +         P+H AAA G+  +++LLVE+    + A+   G TPL
Subjt:  LQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALL-------AAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPL

Query:  FRAARFGHLHIVEYFLEDC-EDFFSRSPINWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN
        + A + GHL + +Y +++C  D  +R+    T       +HAA Q     V++ LV   +  L  +D+ G TA+H  A+
Subjt:  FRAARFGHLHIVEYFLEDC-EDFFSRSPINWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN

P16157 Ankyrin-15.1e-1023.48Show/hide
Query:  KEEWKNLLQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPL
        KE    ++ EL+  K  I+   T   +TALH+A  +G++E  R L+   + ++    +++   TPL+ AA   +L  VK L+E      +A    G TPL
Subjt:  KEEWKNLLQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPL

Query:  FRAARFGHLHIVEYFLEDCEDFFSRSPINW-TTGK-QMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL
          A + GH ++V +             IN+ T GK ++P +H A +++       L+  D +  ++  + G T LH+ A   N+ VA      G+S+ + 
Subjt:  FRAARFGHLHIVEYFLEDCEDFFSRSPINW-TTGK-QMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL

Query:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTL-KNQELEAQCRPNPPLNFCHSNCSWPQWKELYLK-----KRKHKLALTIAQMLARIDYSWLQK
                      +   T     ++  + I     L +  ++E + +       C +     +  E+ L      + K K  L+   M A+ D+    +
Subjt:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTL-KNQELEAQCRPNPPLNFCHSNCSWPQWKELYLK-----KRKHKLALTIAQMLARIDYSWLQK

Query:  ATPENIQVDATGVHFRRIEDETIQNLS---LEGHIPEN--TDIEYDDHYE---------TPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFF
             +Q DA       I+D T+ +L+   +  H   +    +  D   +         TPL +A  +  V ++E +++              G S+D  
Subjt:  ATPENIQVDATGVHFRRIEDETIQNLS---LEGHIPEN--TDIEYDDHYE---------TPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFF

Query:  NTMQI-PYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNKTDL
            + P           + + L+ +           +T LH+A  +G  E  + LL   ++++    K + + TPLH AA +G+   VKLL+E N    
Subjt:  NTMQI-PYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNKTDL

Query:  LAMNIYGETPLFRAARFGHLHIVEYFLE
        LA    G TPL  AAR GH+  V   LE
Subjt:  LAMNIYGETPLFRAARFGHLHIVEYFLE

Q01484 Ankyrin-23.4e-1423.8Show/hide
Query:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLED-------CED
        ALHLA   G     + LL   S +D      +  NT LH A+  G    VK+LV+    ++ A++  G TPL+ AA+  H+ +V+Y LE+        ED
Subjt:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLED-------CED

Query:  FFSRSPI---------------NWTTGK-QMPIIHAAIQSEKFEVVLKLVDFDNS-------LLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL
         F+   +               N T GK ++P +H A + +  +    L+  D++       ++    E+G T LH+ A   N+ VA      G+++ + 
Subjt:  FFSRSPI---------------NWTTGK-QMPIIHAAIQSEKFEVVLKLVDFDNS-------LLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL

Query:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTLKNQELEAQCRPNPPLNFCHSNCSWPQWKELYLKK-----RKHKLALTIAQMLARIDY-----S
         +    + P     +   TN      D          +  +++A+ R       C +     Q  EL L++      + K  L+   M A+ D+      
Subjt:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTLKNQELEAQCRPNPPLNFCHSNCSWPQWKELYLKK-----RKHKLALTIAQMLARIDY-----S

Query:  WLQ-KATPENIQVD-ATGVHF-RRIEDETIQNLSLEGHIPENTDIEYDDHYETPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFFNTMQIPY
         LQ KA  +++ +D  T +H         +  L L+     N       +  TPL +A  +  ++++E +V+              G S+       +  
Subjt:  WLQ-KATPENIQVD-ATGVHF-RRIEDETIQNLSLEGHIPENTDIEYDDHYETPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFFNTMQIPY

Query:  KTVIQEEWKNLNQELIDKTRIVFPMTTS--GDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEY-NKTDLLAMNI
           +     +LN  L+       P  T+  G+TALH+A  +G+ E  R LL   + +D    + R E TPLH A+ +G    V+LL+++    D    N 
Subjt:  KTVIQEEWKNLNQELIDKTRIVFPMTTS--GDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEY-NKTDLLAMNI

Query:  YGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLAN
         G TPL  +AR G + +    LE      + +  +  T K    +H A +    DV   L+   ++  +       T LHV A+
Subjt:  YGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLAN

Q8C8R3 Ankyrin-21.2e-1423.97Show/hide
Query:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLED-------CED
        ALHLA   G     + LL   S +D      +  NT LH A+  G    VK+LV+    ++ A++  G TPL+ AA+  H+ +V+Y LE+        ED
Subjt:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLED-------CED

Query:  FFSRSPI---------------NWTTGK-QMPIIHAAIQSEKFEVVLKLVDFDNS-------LLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL
         F+   +               N T GK ++P +H A + +  +    L+  D++       ++    E+G T LH+ A   N+ VA      G+++ + 
Subjt:  FFSRSPI---------------NWTTGK-QMPIIHAAIQSEKFEVVLKLVDFDNS-------LLEMKDEAGNTALHVLA---NMPVA---FQSGSSMKYL

Query:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTLKNQELEAQCRPNPPLNFCHSNCSWPQWKELYLKKRKHKLA-----LTIAQMLARIDY-----S
         +    + P     +   TN      D          +  +++A+ R       C +     Q  EL L+++   LA     L+   M A+ D+      
Subjt:  DSIIYALLPSEDVYEYNFTNFESFNKDDSEIQNSTTLKNQELEAQCRPNPPLNFCHSNCSWPQWKELYLKKRKHKLA-----LTIAQMLARIDY-----S

Query:  WLQ-KATPENIQVD-ATGVHF-RRIEDETIQNLSLEGHIPENTDIEYDDHYETPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFFNTMQIPY
         LQ KA  +++ +D  T +H         +  L L+     N       +  TPL +A  +  ++++E +V+              G S+       +  
Subjt:  WLQ-KATPENIQVD-ATGVHF-RRIEDETIQNLSLEGHIPENTDIEYDDHYETPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFFNTMQIPY

Query:  KTVIQEEWKNLNQELIDKTRIVFPMTTS--GDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEY-NKTDLLAMNI
           +     +LN  L+       P  T+  G+TALH+A  +G+ E  R LL   + +D    + R E TPLH A+ +G    V+LL+++    D    N 
Subjt:  KTVIQEEWKNLNQELIDKTRIVFPMTTS--GDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEY-NKTDLLAMNI

Query:  YGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLAN
         G TPL  +AR G + +    LE      + +  +  T K    +H A +    DV   L+   ++  +       T LHV A+
Subjt:  YGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEKSLLEMKDPEDKTALHVLAN

Q9ET47 Espin8.6e-1031.79Show/hide
Query:  TALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSP
        T LHLA   G  +  + LL        +     A   P+H AAA G+L ++KLLV +    + A+   G TPL+ A + GHL + +Y +++C    S  P
Subjt:  TALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSP

Query:  INWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN
         +      M  +HAA Q     V++ LV F +     +D  G TA+H  A+
Subjt:  INWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN

Arabidopsis top hitse value%identityAlignment
AT1G07710.1 Ankyrin repeat family protein1.5e-0928.86Show/hide
Query:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPI
        A H+A   G+ +  + L  A SE+      + +  T LH AA  G+   V  L+E   +        G+T L  A+R GH+ +++  L        R   
Subjt:  ALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPI

Query:  NWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLA
             K    +H A++    EVV +L+  D S + + D  GNTALH+ A
Subjt:  NWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLA

AT2G31820.1 Ankyrin repeat family protein1.2e-0928.38Show/hide
Query:  HLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINW
        H+A   G  E  + LL     +      + +  T LH AA  G++  V LL+E +          G+T L  AAR GH+ +V+  +        R     
Subjt:  HLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINW

Query:  TTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN
        T  K    +H A++ +   +V++LV  D ++L ++D  GNT LH+  N
Subjt:  TTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLAN

AT3G54070.1 Ankyrin repeat family protein2.1e-1127.1Show/hide
Query:  NTMQIPYKTVIQEEWKNLNQELIDK-TRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNKTDL
        ++  + YK V+  +WK  +  +  K   +V  +T + + ALH+AV +  ++  R LL  M   D    KN+  NTPL  AAA+G++   ++L+   +   
Subjt:  NTMQIPYKTVIQEEWKNLNQELIDK-TRIVFPMTTSGDTALHLAVYSGEEEPTRALLATMSEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNKTDL

Query:  LAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQ-MPIIHAAIQSEKFDV-------VLRLVDFEKSLLEMKDPEDKTALHVLANMPL
           N    TP+  AA +GH  +V+Y        FS++ I     +Q + + H  I ++ + V       +L  VD  +  L +  P    ALH+LA    
Subjt:  LAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQ-MPIIHAAIQSEKFDV-------VLRLVDFEKSLLEMKDPEDKTALHVLANMPL

Query:  AFRSGSPLKFVESI
        A    S L   + +
Subjt:  AFRSGSPLKFVESI

AT5G35830.1 Ankyrin repeat family protein1.3e-0832.52Show/hide
Query:  YKTVIKEEWKNLLQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIY
        Y+  +K +WK     ++  K  I   +T+  +T LH+AV +  E   R LL ++ E +++  RN   NT L  AAA G +   K+L+E NK   +     
Subjt:  YKTVIKEEWKNLLQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIY

Query:  GETPLFRAARFGHLHIVEYFLED
          TP+  AA FGH  +V+Y  ++
Subjt:  GETPLFRAARFGHLHIVEYFLED

AT5G60070.1 ankyrin repeat family protein4.7e-1132.88Show/hide
Query:  HLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLA-ENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPIN
        H+A   GE +  R L+    E+      + +  T LH AAA G++  V+ L+E   + L A     G+T L  AAR GH  +V+  +    D  +R    
Subjt:  HLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLA-ENIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPIN

Query:  WTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHV
         T  K    +H A++ +  +VV++L+    S L M D  GNTALHV
Subjt:  WTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATCCCTTACAAAACGGTGATAAAAGAGGAATGGAAGAATCTGCTTCAGGAATTGATGAGCGACAAGACGAAAATCATGTTTCCGATGACGACCAGCGAAGACAC
GGCGTTGCATCTCGCCGTGTACAGCGGAGAAGAAGAGCCGACGAGAGCTCTACTGGCGGCGATGTCGGAGATGGATGAAGTATTCTGGAGGAACAGAGCCCAAAATACGC
CGTTGCACGAAGCCGCCGCCGTCGGGAATCTGGCCGCCGTCAAGCTTCTGGTGGAGTACAACAAAACGGATCTGTTGGCGGAGAACATTTACGGCGAAACGCCTCTGTTC
AGAGCGGCCAGATTCGGACACCTTCACATCGTCGAGTACTTTCTGGAAGATTGTGAAGACTTCTTTTCTCGATCGCCGATAAATTGGACCACAGGGAAGCAAATGCCGAT
CATTCACGCGGCTATTCAGAGCGAGAAATTCGAAGTGGTTCTGAAGCTGGTTGATTTTGACAACTCGCTATTGGAGATGAAGGATGAAGCAGGCAACACAGCTCTCCATG
TGCTCGCGAACATGCCGGTCGCTTTTCAAAGTGGATCCTCCATGAAATACTTGGACTCAATTATCTATGCTCTGCTTCCTAGCGAGGACGTTTACGAGTATAATTTTACA
AATTTTGAGTCATTCAATAAAGACGACTCTGAAATCCAAAACTCCACCACTTTGAAGAACCAAGAATTAGAAGCACAATGTAGACCTAATCCTCCGCTTAACTTTTGCCA
TTCAAATTGTTCATGGCCGCAATGGAAAGAACTGTACCTCAAAAAGCGGAAGCATAAATTAGCCCTAACAATCGCCCAGATGCTTGCTCGGATTGATTATTCATGGCTCC
AGAAAGCCACGCCGGAGAACATCCAAGTCGACGCCACCGGAGTCCATTTCCGCCGGATTGAAGACGAAACCATCCAAAACCTATCACTGGAGGGCCACATACCAGAAAAT
ACAGACATCGAGTACGACGATCACTACGAAACTCCACTCCTCCTCGCCGCCGCCAGGGGAATCGTCGAGATCGTCGAGAAGATCGTCGAAGCGCATCCTCAGGCCGTCGA
TTACAGAGAGCGATGCAATAGAGGAGAATCAATGGATTTCTTCAACACAATGCAAATCCCTTACAAAACGGTGATACAAGAGGAATGGAAGAATCTGAATCAAGAATTGA
TCGACAAGACGAGAATCGTGTTTCCAATGACGACCAGCGGAGACACGGCGTTGCATCTCGCCGTGTACAGCGGAGAAGAAGAGCCGACGAGAGCTCTACTGGCGACGATG
TCGGAGATAGATGAAGAATTCTGGAAGAACAGAGCCGAAAATACGCCGTTGCACGAAGCCGCCGCCGTCGGGAATCTGGCCGCCGTCAAGCTTCTGGTGGAGTACAACAA
AACGGATCTGTTGGCGATGAATATTTACGGCGAAACGCCTCTGTTCAGAGCGGCCAGATTCGGACACCTTCACATCGTCGAGTACTTTCTGGAAGATTGTGAAGATTTCT
TTTCTCGATCGCCGATAAATTGGACCACGGGGAAGCAAATGCCGATCATTCACGCGGCTATTCAGAGCGAGAAATTCGATGTGGTTTTGAGATTGGTTGATTTCGAAAAA
TCACTGTTGGAGATGAAGGATCCAGAAGACAAAACAGCTCTTCATGTGCTCGCAAACATGCCGCTCGCTTTTCGAAGTGGATCCCCCCTAAAATTCGTCGAATCAATTAT
CTATAAAGTGCTTCCTAGTGATGACATTTACAAATATAATTTTTCAAATTTTGGGTCTTCCGACAAAGACGAATCTGAAGGCTCTAGTGCTACGAAAAACAAGAAGAACG
AAGATTTAGAAGCACAACGTAGTTCTGATCGTGGACTTCATTTTTGCCATTCAAATTGCTGGTGGCCGCAATGGAAACAATTGTACCTGAAAAAGCGGGACCACAAATTG
GCTGTAACCATCACCAAGATGCTTGCTCAGATCGATTATTCATGGCTCAACACCAAACCGACGCCGGAGAACAATGAAGTCGACTCCACCGGCGTCCACCTCCAGATCGA
CGGCAAAGACTTCCCCACAGACACTAACCAAAGGCCGGAGGACGACAAATCGGAAACTAAAGACATCGAGTACGACGATCACCACGAAACTCCGCTCCTCCTCGCCGCCG
CCAACGGAATCATCGAGATCGTCGAGGCGATCGTCGATGCGCATCCTCAGGCCGTCGATTACGACGACTATCTCAAGATCGACCACTTTGGGACGACTATCCCAAGTCTG
ACCCTCCTCGAACGACTATCCAAGGCTCGACCATCTCAGGTCAACCACCCGAAAACCGATCACATGACTCGACCGCCCATGGTCAACCGCGTCAAACTCGACCATTCAAG
GCCGACAGAATGGGGCTCAACCCTCACTCTTACAAAAGTATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAATCCCTTACAAAACGGTGATAAAAGAGGAATGGAAGAATCTGCTTCAGGAATTGATGAGCGACAAGACGAAAATCATGTTTCCGATGACGACCAGCGAAGACAC
GGCGTTGCATCTCGCCGTGTACAGCGGAGAAGAAGAGCCGACGAGAGCTCTACTGGCGGCGATGTCGGAGATGGATGAAGTATTCTGGAGGAACAGAGCCCAAAATACGC
CGTTGCACGAAGCCGCCGCCGTCGGGAATCTGGCCGCCGTCAAGCTTCTGGTGGAGTACAACAAAACGGATCTGTTGGCGGAGAACATTTACGGCGAAACGCCTCTGTTC
AGAGCGGCCAGATTCGGACACCTTCACATCGTCGAGTACTTTCTGGAAGATTGTGAAGACTTCTTTTCTCGATCGCCGATAAATTGGACCACAGGGAAGCAAATGCCGAT
CATTCACGCGGCTATTCAGAGCGAGAAATTCGAAGTGGTTCTGAAGCTGGTTGATTTTGACAACTCGCTATTGGAGATGAAGGATGAAGCAGGCAACACAGCTCTCCATG
TGCTCGCGAACATGCCGGTCGCTTTTCAAAGTGGATCCTCCATGAAATACTTGGACTCAATTATCTATGCTCTGCTTCCTAGCGAGGACGTTTACGAGTATAATTTTACA
AATTTTGAGTCATTCAATAAAGACGACTCTGAAATCCAAAACTCCACCACTTTGAAGAACCAAGAATTAGAAGCACAATGTAGACCTAATCCTCCGCTTAACTTTTGCCA
TTCAAATTGTTCATGGCCGCAATGGAAAGAACTGTACCTCAAAAAGCGGAAGCATAAATTAGCCCTAACAATCGCCCAGATGCTTGCTCGGATTGATTATTCATGGCTCC
AGAAAGCCACGCCGGAGAACATCCAAGTCGACGCCACCGGAGTCCATTTCCGCCGGATTGAAGACGAAACCATCCAAAACCTATCACTGGAGGGCCACATACCAGAAAAT
ACAGACATCGAGTACGACGATCACTACGAAACTCCACTCCTCCTCGCCGCCGCCAGGGGAATCGTCGAGATCGTCGAGAAGATCGTCGAAGCGCATCCTCAGGCCGTCGA
TTACAGAGAGCGATGCAATAGAGGAGAATCAATGGATTTCTTCAACACAATGCAAATCCCTTACAAAACGGTGATACAAGAGGAATGGAAGAATCTGAATCAAGAATTGA
TCGACAAGACGAGAATCGTGTTTCCAATGACGACCAGCGGAGACACGGCGTTGCATCTCGCCGTGTACAGCGGAGAAGAAGAGCCGACGAGAGCTCTACTGGCGACGATG
TCGGAGATAGATGAAGAATTCTGGAAGAACAGAGCCGAAAATACGCCGTTGCACGAAGCCGCCGCCGTCGGGAATCTGGCCGCCGTCAAGCTTCTGGTGGAGTACAACAA
AACGGATCTGTTGGCGATGAATATTTACGGCGAAACGCCTCTGTTCAGAGCGGCCAGATTCGGACACCTTCACATCGTCGAGTACTTTCTGGAAGATTGTGAAGATTTCT
TTTCTCGATCGCCGATAAATTGGACCACGGGGAAGCAAATGCCGATCATTCACGCGGCTATTCAGAGCGAGAAATTCGATGTGGTTTTGAGATTGGTTGATTTCGAAAAA
TCACTGTTGGAGATGAAGGATCCAGAAGACAAAACAGCTCTTCATGTGCTCGCAAACATGCCGCTCGCTTTTCGAAGTGGATCCCCCCTAAAATTCGTCGAATCAATTAT
CTATAAAGTGCTTCCTAGTGATGACATTTACAAATATAATTTTTCAAATTTTGGGTCTTCCGACAAAGACGAATCTGAAGGCTCTAGTGCTACGAAAAACAAGAAGAACG
AAGATTTAGAAGCACAACGTAGTTCTGATCGTGGACTTCATTTTTGCCATTCAAATTGCTGGTGGCCGCAATGGAAACAATTGTACCTGAAAAAGCGGGACCACAAATTG
GCTGTAACCATCACCAAGATGCTTGCTCAGATCGATTATTCATGGCTCAACACCAAACCGACGCCGGAGAACAATGAAGTCGACTCCACCGGCGTCCACCTCCAGATCGA
CGGCAAAGACTTCCCCACAGACACTAACCAAAGGCCGGAGGACGACAAATCGGAAACTAAAGACATCGAGTACGACGATCACCACGAAACTCCGCTCCTCCTCGCCGCCG
CCAACGGAATCATCGAGATCGTCGAGGCGATCGTCGATGCGCATCCTCAGGCCGTCGATTACGACGACTATCTCAAGATCGACCACTTTGGGACGACTATCCCAAGTCTG
ACCCTCCTCGAACGACTATCCAAGGCTCGACCATCTCAGGTCAACCACCCGAAAACCGATCACATGACTCGACCGCCCATGGTCAACCGCGTCAAACTCGACCATTCAAG
GCCGACAGAATGGGGCTCAACCCTCACTCTTACAAAAGTATTATAG
Protein sequenceShow/hide protein sequence
MQIPYKTVIKEEWKNLLQELMSDKTKIMFPMTTSEDTALHLAVYSGEEEPTRALLAAMSEMDEVFWRNRAQNTPLHEAAAVGNLAAVKLLVEYNKTDLLAENIYGETPLF
RAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFEVVLKLVDFDNSLLEMKDEAGNTALHVLANMPVAFQSGSSMKYLDSIIYALLPSEDVYEYNFT
NFESFNKDDSEIQNSTTLKNQELEAQCRPNPPLNFCHSNCSWPQWKELYLKKRKHKLALTIAQMLARIDYSWLQKATPENIQVDATGVHFRRIEDETIQNLSLEGHIPEN
TDIEYDDHYETPLLLAAARGIVEIVEKIVEAHPQAVDYRERCNRGESMDFFNTMQIPYKTVIQEEWKNLNQELIDKTRIVFPMTTSGDTALHLAVYSGEEEPTRALLATM
SEIDEEFWKNRAENTPLHEAAAVGNLAAVKLLVEYNKTDLLAMNIYGETPLFRAARFGHLHIVEYFLEDCEDFFSRSPINWTTGKQMPIIHAAIQSEKFDVVLRLVDFEK
SLLEMKDPEDKTALHVLANMPLAFRSGSPLKFVESIIYKVLPSDDIYKYNFSNFGSSDKDESEGSSATKNKKNEDLEAQRSSDRGLHFCHSNCWWPQWKQLYLKKRDHKL
AVTITKMLAQIDYSWLNTKPTPENNEVDSTGVHLQIDGKDFPTDTNQRPEDDKSETKDIEYDDHHETPLLLAAANGIIEIVEAIVDAHPQAVDYDDYLKIDHFGTTIPSL
TLLERLSKARPSQVNHPKTDHMTRPPMVNRVKLDHSRPTEWGSTLTLTKVL