; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1436 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1436
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLate embryogenesis abundant protein
Genome locationMC06:21654820..21655458
RNA-Seq ExpressionMC06g1436
SyntenyMC06g1436
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042716.1 late embryogenesis abundant protein [Cucumis melo var. makuwa]5.27e-8865.58Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHKTD E  LAT   L  ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI+LS+ ++  FS  N  NS+S +S +L+  A+
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA+AKG+KRMN+TV+ +A   S S+      GIL LSSFVKLRGRVRLIH+FR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

KAG6579535.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia]3.18e-8567.28Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE----LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLI
        M +DSQ+FP+AHYQAHHK+D E    L T  ALK ERSNKCFIYVFSAFVFL VA+LIFAL+VLRVN P++  SS ++A FS  +NTNS+S  S NLTL 
Subjt:  MGEDSQTFPLAHYQAHHKTDAE----LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLI

Query:  ALIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS
        A +AVDNSNFGPFNFD  ++G IYAGA+VGQTTTG GR KAKGTK MN+TV A+A    N S     S +L LSSF  LRGRVRLIHIFR+R +SEI CS
Subjt:  ALIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS

Query:  LNLDLNSRQIQHNWVCE
        + LDLN+ QIQHNWVCE
Subjt:  LNLDLNSRQIQHNWVCE

XP_004143966.1 late embryogenesis abundant protein At1g64065 [Cucumis sativus]3.52e-8666.05Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHK + E  LAT   L+ ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI LSS +    S  NNTNS+S +S NL+  A 
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA AKG+KRMN+TV+ +A   S S+      GIL  SSFVKLRGRVRLIHIFR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

XP_008437349.1 PREDICTED: late embryogenesis abundant protein At1g64065 [Cucumis melo]3.71e-8865.58Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHKTD E  LAT   L  ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI+LS+ ++  FS  N  NS+S +S +L+  A+
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA+AKG+KRMN+TV+ +A   S S+      GIL LSSFVKLRGRVRLIH+FR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

XP_038875090.1 late embryogenesis abundant protein At1g64065 [Benincasa hispida]8.98e-8766.67Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNS-SFNLTLIA
        M EDSQ+FPLAHYQAHHK+D E  LAT   L+ ERSNKCFIYVFS FVFL VA+LIFAL+VLRVN PSI LSS ++  FS    TN+ S+S S NLT+IA
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNS-SFNLTLIA

Query:  LIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSL
           VDNSNFGPFNFD+G +GL+Y GA+VG+ +TG GRA+AKG+KRMN+T++A+A    N S+     GIL L+SFVKLRGRVRLIHIFR+RT+SEI CS+
Subjt:  LIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSL

Query:  NLDLNSRQIQHNWVCE
        NLD+N+ QIQ+NWVCE
Subjt:  NLDLNSRQIQHNWVCE

TrEMBL top hitse value%identityAlignment
A0A0A0KQT7 LEA_2 domain-containing protein1.70e-8666.05Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHK + E  LAT   L+ ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI LSS +    S  NNTNS+S +S NL+  A 
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA AKG+KRMN+TV+ +A   S S+      GIL  SSFVKLRGRVRLIHIFR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

A0A1S3ATY3 late embryogenesis abundant protein At1g640651.80e-8865.58Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHKTD E  LAT   L  ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI+LS+ ++  FS  N  NS+S +S +L+  A+
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA+AKG+KRMN+TV+ +A   S S+      GIL LSSFVKLRGRVRLIH+FR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

A0A5A7TL68 Late embryogenesis abundant protein2.55e-8865.58Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL
        MGEDSQ+FPLAHYQAHHKTD E  LAT   L  ERSNKCFIY+FS FVFL VALLIFAL+VLRVN PSI+LS+ ++  FS  N  NS+S +S +L+  A+
Subjt:  MGEDSQTFPLAHYQAHHKTDAE--LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIAL

Query:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN
          VDNSNFGPFNFD+G +GL+Y G + G+ +TG GRA+AKG+KRMN+TV+ +A   S S+      GIL LSSFVKLRGRVRLIH+FR+R +SEI+CS+N
Subjt:  IAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLN

Query:  LDLNSRQIQHNWVCE
        LDLN+ QIQHNWVCE
Subjt:  LDLNSRQIQHNWVCE

A0A6J1H2C0 uncharacterized protein LOC1114597391.39e-6759.15Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAELATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA
        MGE S +FPL H QAHHKT          KNE SNKCFIY+FS+FVFLCVALLIF+L+VLRVN P+IDLSS ++  FS  +NTNS+S SS NLTLIA  +
Subjt:  MGEDSQTFPLAHYQAHHKTDAELATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA

Query:  VDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLNLD
        +DNSNFGPF FD   +  +Y G +VG+ +TG GRA+AKGT RMN++V+A+    S+   G   SGIL +SSF K  GR+ LIH+ RKR  SEI+CS+NLD
Subjt:  VDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLNLD

Query:  LNSRQIQHNWVCE
        LN+ QIQ  WVC+
Subjt:  LNSRQIQHNWVCE

A0A6J1I3M2 uncharacterized protein LOC1114688753.10e-8566.36Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAE----LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLI
        M +DSQ+FP+AHY+AHHK+D E    L T  AL+ ERSNKCFIYVFSAFVFL VA+LIFAL+VLRVN P++  SS ++A FS  +NTNS+S  S NLT+ 
Subjt:  MGEDSQTFPLAHYQAHHKTDAE----LATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLI

Query:  ALIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS
        A +AVDNSNFGPFNFD  ++G IYAGA+VGQ+TTG GRAKAKGTK MN+TV A+AN   N S     S +L LSSF  LRGRVRLIHIFR+R +SEI+CS
Subjt:  ALIAVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS

Query:  LNLDLNSRQIQHNWVCE
        + LDLN+ QIQHNWVCE
Subjt:  LNLDLNSRQIQHNWVCE

SwissProt top hitse value%identityAlignment
Q6DST1 Late embryogenesis abundant protein At1g640654.4e-1432.57Show/hide
Query:  DSQTFPLAHYQAHHKTDAELA--TITALKNER-SNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA
        D     LA  + + ++D E +   I   K E    KC +Y  +  V +    LI + + LR++ P I+  S  ++     +  NST N  FN TL++ I+
Subjt:  DSQTFPLAHYQAHHKTDAELA--TITALKNER-SNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA

Query:  VDNSNFGPFNFDDGALGLIYAG-AVVGQTTTGPGRAKAKGTKRMNLTVDAAANF---NSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKR-TTSEINC
        + NSNFG F F+D  L ++YA   VVG+T     R +A  T R+   V    +F   ++     D R G L+L S  ++RGR++++   RKR   S ++C
Subjt:  VDNSNFGPFNFDDGALGLIYAG-AVVGQTTTGPGRAKAKGTKRMNLTVDAAANF---NSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKR-TTSEINC

Query:  SLNLDLNSRQIQHNWVCE
        ++ L+L  R IQ N +CE
Subjt:  SLNLDLNSRQIQHNWVCE

Arabidopsis top hitse value%identityAlignment
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.1e-1532.57Show/hide
Query:  DSQTFPLAHYQAHHKTDAELA--TITALKNER-SNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA
        D     LA  + + ++D E +   I   K E    KC +Y  +  V +    LI + + LR++ P I+  S  ++     +  NST N  FN TL++ I+
Subjt:  DSQTFPLAHYQAHHKTDAELA--TITALKNER-SNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIA

Query:  VDNSNFGPFNFDDGALGLIYAG-AVVGQTTTGPGRAKAKGTKRMNLTVDAAANF---NSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKR-TTSEINC
        + NSNFG F F+D  L ++YA   VVG+T     R +A  T R+   V    +F   ++     D R G L+L S  ++RGR++++   RKR   S ++C
Subjt:  VDNSNFGPFNFDDGALGLIYAG-AVVGQTTTGPGRAKAKGTKRMNLTVDAAANF---NSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKR-TTSEINC

Query:  SLNLDLNSRQIQHNWVCE
        ++ L+L  R IQ N +CE
Subjt:  SLNLDLNSRQIQHNWVCE

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.5e-1326.07Show/hide
Query:  MGEDSQTFPLAHYQAHHKTDAELATITALKNERSN-KCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALI
        M +     PLA       +D   + I      R+  KC I V +  + L   +L     V RV  P I ++   +        TN       N+++I  +
Subjt:  MGEDSQTFPLAHYQAHHKTDAELATITALKNERSN-KCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALI

Query:  AVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAAN-FNSNSSTGD--FRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS
        +V N N   F + +    + Y G +VG+    PG+A+   T RMN+TVD   +   S+   G    RSG++ + S+ ++ G+V+++ I +K  T ++NC+
Subjt:  AVDNSNFGPFNFDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAAN-FNSNSSTGD--FRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCS

Query:  LNLDLNSRQIQ
        + +++  + IQ
Subjt:  LNLDLNSRQIQ

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.5e-0923.81Show/hide
Query:  AELATITALKNERSNK-CFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIAVDNSNFGPFNFDDGALGLI
        A   T   L+ +R+ K C  +     + + + ++I A  + +   P+  + S T+       N         NLTL   +++ N N   F++D  +  L 
Subjt:  AELATITALKNERSNK-CFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIAVDNSNFGPFNFDDGALGLI

Query:  YAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAAN--FNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLNLDLNSRQI
        Y G V+G+      R  A+ T  +N+T+   A+   +      D  +G++ L++FVK+ G+V ++ IF+ +  S  +C L++ ++ R +
Subjt:  YAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAAN--FNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLNLDLNSRQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAAGACAGCCAAACTTTTCCATTGGCGCACTACCAAGCTCACCACAAGACCGACGCCGAGCTCGCCACCATCACCGCCCTCAAAAACGAGCGTTCCAACAAATG
CTTCATCTACGTCTTCTCCGCCTTCGTCTTCCTCTGCGTCGCCCTCCTCATCTTCGCCCTCGTCGTCCTCCGCGTCAATCCCCCCTCCATCGACCTCTCTTCCGCCACCC
TCGCCAATTTCTCTTTCCACAACAACACCAATTCCACTTCCAATTCCTCCTTCAATCTCACCTTGATTGCCCTGATCGCCGTCGACAACTCCAACTTCGGTCCCTTCAAT
TTCGACGACGGCGCCCTCGGTTTGATCTACGCCGGCGCCGTCGTCGGCCAGACCACCACCGGCCCCGGCAGGGCCAAGGCCAAGGGCACCAAGAGGATGAACCTCACCGT
CGACGCCGCCGCCAATTTCAATTCCAATTCCAGCACCGGAGATTTCAGATCTGGAATTTTGAAGCTGAGTAGTTTTGTGAAATTGAGAGGCAGAGTGCGTTTGATTCATA
TCTTCCGGAAGAGGACGACGTCGGAGATCAACTGCTCCTTGAATTTGGATTTGAATTCTCGTCAAATCCAGCACAATTGGGTTTGCGAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAAGACAGCCAAACTTTTCCATTGGCGCACTACCAAGCTCACCACAAGACCGACGCCGAGCTCGCCACCATCACCGCCCTCAAAAACGAGCGTTCCAACAAATG
CTTCATCTACGTCTTCTCCGCCTTCGTCTTCCTCTGCGTCGCCCTCCTCATCTTCGCCCTCGTCGTCCTCCGCGTCAATCCCCCCTCCATCGACCTCTCTTCCGCCACCC
TCGCCAATTTCTCTTTCCACAACAACACCAATTCCACTTCCAATTCCTCCTTCAATCTCACCTTGATTGCCCTGATCGCCGTCGACAACTCCAACTTCGGTCCCTTCAAT
TTCGACGACGGCGCCCTCGGTTTGATCTACGCCGGCGCCGTCGTCGGCCAGACCACCACCGGCCCCGGCAGGGCCAAGGCCAAGGGCACCAAGAGGATGAACCTCACCGT
CGACGCCGCCGCCAATTTCAATTCCAATTCCAGCACCGGAGATTTCAGATCTGGAATTTTGAAGCTGAGTAGTTTTGTGAAATTGAGAGGCAGAGTGCGTTTGATTCATA
TCTTCCGGAAGAGGACGACGTCGGAGATCAACTGCTCCTTGAATTTGGATTTGAATTCTCGTCAAATCCAGCACAATTGGGTTTGCGAG
Protein sequenceShow/hide protein sequence
MGEDSQTFPLAHYQAHHKTDAELATITALKNERSNKCFIYVFSAFVFLCVALLIFALVVLRVNPPSIDLSSATLANFSFHNNTNSTSNSSFNLTLIALIAVDNSNFGPFN
FDDGALGLIYAGAVVGQTTTGPGRAKAKGTKRMNLTVDAAANFNSNSSTGDFRSGILKLSSFVKLRGRVRLIHIFRKRTTSEINCSLNLDLNSRQIQHNWVCE