CuGenDBv2

MC04g0928 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0928
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1218)
Genome location: MC04:16982713..16984739
RNA-Seq Expression: MC04g0928
Synteny: MC04g0928
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
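
The genome location above is given as a `chromosome:start..end` string. A minimal sketch of splitting it into its parts and deriving the span length follows; it assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name rather than anything provided by CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("MC04:16982713..16984739")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption the gene spans 2027 bp on MC04.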


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579221.1 hypothetical protein SDJN03_23669, partial [Cucurbita argyrosperma subsp. sororia]  e value: 4.69e-117  % identity: 86.63
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGWQKHQNEG VGMV    + HEQHDR T GF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

KAG7016734.1 hypothetical protein SDJN02_21844 [Cucurbita argyrosperma subsp. argyrosperma]  e value: 2.32e-117  % identity: 86.63
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGWQKHQNEG VGMV    + HEQHDR T GF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

XP_022152291.1 uncharacterized protein LOC111020044 [Momordica charantia]  e value: 9.04e-138  % identity: 100
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
        SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EKV
        EKV
Subjt:  EKV

XP_022992987.1 uncharacterized protein LOC111489148 [Cucurbita maxima]  e value: 1.34e-116  % identity: 86.14
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGW+KHQNEG VGMV    + HEQHDR T GF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

XP_023550275.1 uncharacterized protein LOC111808498 [Cucurbita pepo subsp. pepo]  e value: 3.85e-116  % identity: 86.14
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         L LGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGWQKHQNEG VGMV    + HEQHDR T GF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KV04 Uncharacterized protein  e value: 1.05e-106  % identity: 79.31
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL VI IIASLHLIAFV AIGAEMRRSTA VVPD+YDETT+CVYDSDAST YGL AFGLLLIS T+LM VTRCLCCGKGL+SGGSTVCA++ F++SW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTK+R  LP+ NLSC  LR GVFAAAAA+TFLSLVFSILYY  HSRADTGGWQKHQNEG +GM  S++ Q EQH+RR   F
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EKV
         KV
Subjt:  EKV

A0A1S3CSX8 uncharacterized protein LOC103504596  e value: 1.05e-106  % identity: 79.31
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MAT+L VI IIASLHLIAFV AIGAEMRRSTA VVPD+YDETT+CVYDSDAST YGL AFGLLLIS T+LM VTRCLCCGKGLKSGGSTVCA++ F++SW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTK+R +LP+ NLSC  LR GVFAAAAA+TFLSLVFSILYY  HSRADTGGWQKHQNEG +GM  S++ Q E H+RR   F
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EKV
        EKV
Subjt:  EKV

A0A6J1DHA0 uncharacterized protein LOC111020044  e value: 4.37e-138  % identity: 100
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
        SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EKV
        EKV
Subjt:  EKV

A0A6J1FLQ5 uncharacterized protein LOC111445108  e value: 3.76e-116  % identity: 86.14
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGWQKHQNEG VGMV    + HEQHDR T  F
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

A0A6J1JRH3 uncharacterized protein LOC111489148  e value: 6.50e-117  % identity: 86.14
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MATSL+ +AIIASLHLIAFV AIGAEMRRSTATV PDEYDETTHCVYDSDAST YGL AFGLLLISHT+LM VTRCLCCGKGLKSGGSTVCAI+LF+VSW
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF
         LFLGAESLLLAGSVRNAYHTKFRGALPIKNLSC+ LR GVFAAAAA+TFLSLVFSILYY+ HSRADTGGW+KHQNEG VGMV    + HEQHDR T GF
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGF

Query:  EK
        EK
Subjt:  EK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G52910.1 Protein of unknown function (DUF1218)  e value: 8.0e-37  % identity: 50.86
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MA+ L VI I+  L LIA  LAI AE RRS   VVPD   E  HC Y SD +T+YG  AF LL IS  ++M  +RC CCGK LK GGS  C I+LF++ W
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKN-LSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKH
          FL AE  LLAGS+RNAYHT +R    I+N  SC  +R GVFAA A+    + + S  YY+++SRA  G    H
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKN-LSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKH

AT1G61065.1 Protein of unknown function (DUF1218)  e value: 1.6e-29  % identity: 46.01
Query:  SLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSWSLF
        S+ ++ ++    LIAF LA+ AE RR+T  +  +  D  ++CVYD D +T  G+ +F +LL S  L+M  +RCLCCG+ L   GS   AI LF+ +W  F
Subjt:  SLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSWSLF

Query:  LGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA
          A+  LLAGSVRNAYHTK+R      + SC +LR GVF A AA   L+ + S LYY+T SRA
Subjt:  LGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA

AT1G68220.1 Protein of unknown function (DUF1218)  e value: 9.1e-57  % identity: 59.49
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGS-TVCAIVLFVVS
        MA S+S++ ++ +LHL+AFV A GAE RRSTA  VPD+YDE T C Y ++AST YG+SAFGLLL+S  ++  VT+CLC GKGL +G S TV AIV FVVS
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGS-TVCAIVLFVVS

Query:  WSLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEG-GVGMVASDIAQHEQH
        W  FLGAE+ LL GS RNAYHTK  G    K LSC  L  GVFAA AA T +SL+ +ILYYL HS+ADTGGW+KHQN+G  +GM     A  +Q+
Subjt:  WSLFLGAESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEG-GVGMVASDIAQHEQH

AT3G15480.1 Protein of unknown function (DUF1218)  e value: 2.2e-34  % identity: 49.1
Query:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW
        MA+ L V+ I+  L LIA  LAI AE RRS   V  D   +  +CVY +D +T+YG  AF LL +S  L+MA +RC CCGK L  GGS  CAI+LF++ W
Subjt:  MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSW

Query:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKN-LSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA
          FL AE  LLA S+RNAYHT++R    +++  SC  +R GVFAA AA T  + + S  YY+ +SRA
Subjt:  SLFLGAESLLLAGSVRNAYHTKFRGALPIKN-LSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA

AT4G27435.1 Protein of unknown function (DUF1218)  e value: 2.0e-35  % identity: 51.88
Query:  VIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSWSLFLGA
        V AI+   +LIAF LA+ AE RRSTA VV D   +  +CVYDSD +T YG+ AF   + S  L+M V+RC CCGK LK GGS   A++LF+VSW  FL A
Subjt:  VIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSWSLFLGA

Query:  ESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA
        E  LLAGSV NAYHTK+R         C TLR GVFAA A+  F + + S  YY  +  A
Subjt:  ESLLLAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRA


Sequences
CDS sequence
ATGGCTACCTCTCTGTCCGTCATAGCCATCATCGCCTCTCTGCACCTCATCGCCTTCGTCCTCGCCATCGGAGCTGAAATGCGCCGGAGCACGGCAACTGTAGTGCCGGA
TGAGTACGACGAGACTACTCACTGCGTCTACGATTCCGATGCCTCCACGGCGTACGGCCTTTCGGCGTTTGGTTTGCTACTCATTAGCCACACACTGCTTATGGCTGTCA
CCAGGTGTCTCTGCTGCGGCAAGGGCTTGAAAAGTGGAGGATCCACTGTCTGTGCCATCGTCCTCTTCGTCGTTTCTTGGAGTCTGTTCTTGGGAGCGGAGTCTTTGTTG
CTGGCTGGGTCCGTGAGGAATGCCTACCACACAAAGTTCAGAGGAGCTTTGCCCATCAAAAACTTGTCGTGCACCACGCTCCGCTGTGGTGTGTTCGCTGCTGCAGCAGC
AATTACATTTCTCTCATTGGTGTTTTCAATCCTTTACTACTTAACGCACTCGAGAGCCGATACTGGAGGCTGGCAGAAGCACCAGAATGAAGGTGGCGTTGGCATGGTGG
CTTCTGATATTGCACAGCATGAGCAGCATGATCGACGAACTGCTGGATTCGAGAAGGTTTAG
mRNA sequence
GACAAAATCAAATTAAAAGGATGAAATTGAAAAATCGGACACGTGACGTAAGAAGAAAGTGCATTTACTCAGAAATAAGAAAATAAAAACAAAAACAAAATTTAAAAAAA
AAAAAAAACATCAGCAAATAGAAAACAAAAAGCATCAAAACCGAAAATTCATCCTATTCTTGAAAAATTGACAGACGAAAAACGAAGAACGAAGAAGAAGAAGAAGAAGA
AGAAGAAGAAGAAGAAGAGTGGCTGCCGCAGATCGGAAGCTTCGCCGGAGTTTGAGTTGGAGTAGTACAGTACAGTACAACAATGGCTACCTCTCTGTCCGTCATAGCCA
TCATCGCCTCTCTGCACCTCATCGCCTTCGTCCTCGCCATCGGAGCTGAAATGCGCCGGAGCACGGCAACTGTAGTGCCGGATGAGTACGACGAGACTACTCACTGCGTC
TACGATTCCGATGCCTCCACGGCGTACGGCCTTTCGGCGTTTGGTTTGCTACTCATTAGCCACACACTGCTTATGGCTGTCACCAGGTGTCTCTGCTGCGGCAAGGGCTT
GAAAAGTGGAGGATCCACTGTCTGTGCCATCGTCCTCTTCGTCGTTTCTTGGAGTCTGTTCTTGGGAGCGGAGTCTTTGTTGCTGGCTGGGTCCGTGAGGAATGCCTACC
ACACAAAGTTCAGAGGAGCTTTGCCCATCAAAAACTTGTCGTGCACCACGCTCCGCTGTGGTGTGTTCGCTGCTGCAGCAGCAATTACATTTCTCTCATTGGTGTTTTCA
ATCCTTTACTACTTAACGCACTCGAGAGCCGATACTGGAGGCTGGCAGAAGCACCAGAATGAAGGTGGCGTTGGCATGGTGGCTTCTGATATTGCACAGCATGAGCAGCA
TGATCGACGAACTGCTGGATTCGAGAAGGTTTAGCTATGGCAGGTTTGGATGATTAGAAGTTGCTTGCAGGGTCAAGGTTGATTAGAGCTTATTCTGGGAAAGAAAGATA
GGTAGATCAAAGTATCAGGTTGTTCTTGCTAAGCTACGAATGTGTAAAGTGATCTCATACTTAAATCTAGATGTGGAAAATGCAGTTGCTTTATGTTTTGGTTTTTAACT
GTGAACTCCAAGATTCATAGTTAAAATGAATTTTGGCCTATTTTCACTCCTTTATATTATTACCTTTATACTTCCAATTTATTAGTTTGTCATCA
Protein sequence
MATSLSVIAIIASLHLIAFVLAIGAEMRRSTATVVPDEYDETTHCVYDSDASTAYGLSAFGLLLISHTLLMAVTRCLCCGKGLKSGGSTVCAIVLFVVSWSLFLGAESLL
LAGSVRNAYHTKFRGALPIKNLSCTTLRCGVFAAAAAITFLSLVFSILYYLTHSRADTGGWQKHQNEGGVGMVASDIAQHEQHDRRTAGFEKV
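
The protein sequence listed here should correspond to the CDS sequence read codon by codon. A minimal sketch of that check follows, assuming the standard nuclear genetic code applies to this gene; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration and are not part of CuGenDB. Only short fragments copied from the start and end of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard
# nuclear code is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string; a single trailing stop codon is dropped.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nt of the CDS above, and its last 12 nt (including the stop).
print(translate("ATGGCTACCTCTCTGTCCGTCATAGCCATC"))
print(translate("GAGAAGGTTTAG"))
```

The two fragments translate to `MATSLSVIAI` and `EKV`, matching the first ten and last three residues of the protein sequence. Passing the full CDS text (line breaks included) to `translate` extends the same comparison to the whole sequence.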