CuGenDBv2

MC01g0338 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0338
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: LEA_2 domain-containing protein
Genome location: MC01:10090263..10090889
RNA-Seq Expression: MC01g0338
Synteny: MC01g0338
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology
GenBank top hits    e value    %identity    Alignment
XP_004142665.1 uncharacterized protein LOC101208230 [Cucumis sativus]    3.05e-99    75.46
Query:  MEIASSAT---KDLKST----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNP
        MEIASS++   KD KST     AA RSR+RRN CIG S+  LLLL+I+I+ILAFTVFKA+RPIT +NSVALADL VSL++A V+VDINVTLIA +A+TNP
Subjt:  MEIASSAT---KDLKST----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNP

Query:  NKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDI
        NKVGFSY NSTA LNYRGELVGEAPI AG+IDA + K+MNITLTIMADRLL K+  VF+D VAGSMPLNTYTRISG+VKILGIF IHVVS+TSCD  +DI
Subjt:  NKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDI

Query:  SSRKIGDQQCNYHTKI
        S RKIGDQQCNYHTKI
Subjt:  SSRKIGDQQCNYHTKI

XP_008463309.1 PREDICTED: uncharacterized protein LOC103501497 [Cucumis melo]    4.05e-102    76.96
Query:  MEIASSAT---KDLKST-----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTN
        MEIASS++   KD KST      AAARSR+RRN CIG S+  LLLL+ILI+ILAFTVFKA+RPIT +NSVALADL VSL++ARV+VDINVTLIA +A+TN
Subjt:  MEIASSAT---KDLKST-----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTN

Query:  PNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTID
        PNKVGFSY NSTA LNYRGELVGEAPI AG+IDA + K+MNITLTIMADRLL K+  VFSDVVAGSMPLNTY RISG+VKILGIF IHVVSTTSCD  +D
Subjt:  PNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTID

Query:  ISSRKIGDQQCNYHTKI
        IS RK+GDQQCNYHTKI
Subjt:  ISSRKIGDQQCNYHTKI

XP_022156243.1 uncharacterized protein LOC111023175 [Momordica charantia]    5.35e-133    100
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
        MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY

Query:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
        SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
Subjt:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD

Query:  QQCNYHTKI
        QQCNYHTKI
Subjt:  QQCNYHTKI

XP_022966458.1 uncharacterized protein LOC111466106 [Cucurbita maxima]    1.80e-100    77.51
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
        MEIASS TKD KS     RSRRRRN CIG S+  +LLL++LI+ILAFTVFKA+RPIT INSVALADL +SL+IAR AV +N+TLI  V++TNPNKVGFSY
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY

Query:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
        SNSTALLNYRGEL+GEAPIP+GRI+A+QSK MNIT+TIMADRLL +S+ V SDVVAGSMPLNTYTRISG+V+ILGIFKI VVS+TSCD TIDIS RKIGD
Subjt:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD

Query:  QQCNYHTKI
        QQC+YHTKI
Subjt:  QQCNYHTKI

XP_038882665.1 uncharacterized protein LOC120073854 [Benincasa hispida]    1.21e-104    80.09
Query:  MEIASSATKDLKSTT--AAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGF
        MEIASS+ KD KST   AAARSRRRRN CIG S+  ++LLV+LI+ILAFTVFKA+RPITAINSV LADL VSL++ARV+VDINVTLIA VA+TNPNKVGF
Subjt:  MEIASSATKDLKSTT--AAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGF

Query:  SYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKI
        SYSNSTA LNYRGELVGEAPI AGRIDA Q K+MNITLTIMADRLL K+  VFSDVVAG+MPLNTYTRISG+V+ILGIF IHVVSTTSCD  + IS RK+
Subjt:  SYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKI

Query:  GDQQCNYHTKI
        GDQQCNYHTKI
Subjt:  GDQQCNYHTKI

TrEMBL top hits    e value    %identity    Alignment
A0A0A0L094 LEA_2 domain-containing protein    1.48e-99    75.46
Query:  MEIASSAT---KDLKST----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNP
        MEIASS++   KD KST     AA RSR+RRN CIG S+  LLLL+I+I+ILAFTVFKA+RPIT +NSVALADL VSL++A V+VDINVTLIA +A+TNP
Subjt:  MEIASSAT---KDLKST----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNP

Query:  NKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDI
        NKVGFSY NSTA LNYRGELVGEAPI AG+IDA + K+MNITLTIMADRLL K+  VF+D VAGSMPLNTYTRISG+VKILGIF IHVVS+TSCD  +DI
Subjt:  NKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDI

Query:  SSRKIGDQQCNYHTKI
        S RKIGDQQCNYHTKI
Subjt:  SSRKIGDQQCNYHTKI

A0A1S3CJB5 uncharacterized protein LOC103501497    1.96e-102    76.96
Query:  MEIASSAT---KDLKST-----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTN
        MEIASS++   KD KST      AAARSR+RRN CIG S+  LLLL+ILI+ILAFTVFKA+RPIT +NSVALADL VSL++ARV+VDINVTLIA +A+TN
Subjt:  MEIASSAT---KDLKST-----TAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTN

Query:  PNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTID
        PNKVGFSY NSTA LNYRGELVGEAPI AG+IDA + K+MNITLTIMADRLL K+  VFSDVVAGSMPLNTY RISG+VKILGIF IHVVSTTSCD  +D
Subjt:  PNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTID

Query:  ISSRKIGDQQCNYHTKI
        IS RK+GDQQCNYHTKI
Subjt:  ISSRKIGDQQCNYHTKI

A0A6J1DQ35 uncharacterized protein LOC111023175    2.59e-133    100
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
        MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY

Query:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
        SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
Subjt:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD

Query:  QQCNYHTKI
        QQCNYHTKI
Subjt:  QQCNYHTKI

A0A6J1EAI5 uncharacterized protein LOC111432307    6.66e-99    77.51
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
        MEIASS TKD KS     RSRRRRN CIG S+  +LLL++LI+ILAFTVFKA+RPITAINSVALADL +SL+IAR AV +N+TLI  V++TNPNKVGFSY
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY

Query:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
        SNSTALLNYRGEL+GEAPIP+GRI+A+QSK MNIT+TIMADRLL +S+ V SDVVAGS+PLNTYTRISG+V+ILGIFKI VVS+TSCD TIDIS RKIGD
Subjt:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD

Query:  QQCNYHTKI
        QQC+YHTKI
Subjt:  QQCNYHTKI

A0A6J1HS80 uncharacterized protein LOC111466106    8.71e-101    77.51
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY
        MEIASS TKD KS     RSRRRRN CIG S+  +LLL++LI+ILAFTVFKA+RPIT INSVALADL +SL+IAR AV +N+TLI  V++TNPNKVGFSY
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSY

Query:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD
        SNSTALLNYRGEL+GEAPIP+GRI+A+QSK MNIT+TIMADRLL +S+ V SDVVAGSMPLNTYTRISG+V+ILGIFKI VVS+TSCD TIDIS RKIGD
Subjt:  SNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGD

Query:  QQCNYHTKI
        QQC+YHTKI
Subjt:  QQCNYHTKI

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT1G64450.1 Glycine-rich protein family    3.5e-11    37.5
Query:  RSRRRRNI--CIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGE
        RS  R N+  C  A++  L+LLV+L L++ FTVFK + P  ++N+V L   AVS + A      N +    VAV NPN+  FS+ +S+  L Y G  VG 
Subjt:  RSRRRRNI--CIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGE

Query:  APIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMP
          IPAG+ID+ + + M  T T+ +  +   S++  S V A  +P
Subjt:  APIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMP

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family    5.7e-06    24.31
Query:  ASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIA---RVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDA
        A L  L+L V+LI+ILA    K ++P   +  VA+  + +S   A        +++T+       NPNKVG  Y  S+  + Y+G  +G A +P    DA
Subjt:  ASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIA---RVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGEAPIPAGRIDA

Query:  DQSKDMNITLTIMADRLLSKSAA--VFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQQCNY
          +K++  T+++    L+   AA  V    +   + L     +  +++++      V  + +C + I    + +  +QC +
Subjt:  DQSKDMNITLTIMADRLLSKSAA--VFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQQCNY

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family    7.5e-22    33.33
Query:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARV-AVDINVTLIAAVAVTNPNKVGFS
        + ++  +  ++K+ T  +R+R + +IC+ A+    L+L  ++L L FTVF+ + PI  +N V +  L       +V  +  N+++I  V+V NPN   F 
Subjt:  MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARV-AVDINVTLIAAVAVTNPNKVGFS

Query:  YSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIG
        YSN+T  + Y+G LVGEA    G+    ++  MN+T+ IM DR+LS          +G + + +YTR+ G+VKI+GI K HV    +C + ++I+ + I 
Subjt:  YSNSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIG

Query:  DQQC
        D  C
Subjt:  DQQC

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family    1.4e-15    27.13
Query:  RRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGEAPIPA
        +RR  CI + +  +L ++ +  ++   VFK + PI    S  +  ++ ++ +    V +N TL   + + NPN   F Y     L+ YR  LVG   +P+
Subjt:  RRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYRGELVGEAPIPA

Query:  GRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQQCNYHTKI
          + A  S  +   L +  D+ ++    +  DV+ G + + T  ++ G++ +LGIFKI + S + C+L +   S  + DQ C+  TK+
Subjt:  GRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQQCNYHTKI

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family    1.3e-45    48.56
Query:  ASSATKDLKSTTAAARSRRRRN--ICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYS
        ASS      +T  A + RR+RN  ICI  ++  +LL+ I+I+ILAFT+FK +RP T I+SV +  L  S++   + V +N+TL   +++ NPN++GFSY 
Subjt:  ASSATKDLKSTTAAARSRRRRN--ICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYS

Query:  NSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQ
        +S+ALLNYRG+++GEAP+PA RI A ++  +NITLT+MADRLLS++  + SDV+AG +PLNT+ +++G+V +L IFKI V S++SCDL+I +S R +  Q
Subjt:  NSTALLNYRGELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQ

Query:  QCNYHTKI
         C Y TK+
Subjt:  QCNYHTKI
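
In the alignment blocks above, the middle line repeats a residue where query and hit are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identical columns in a single block under that assumption. The function name `percent_identity` is illustrative and not part of CuGenDB or BLAST. The %identity column is computed over the full alignment, so the figure for one block will generally differ from the reported value.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns where the midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # A letter in the midline marks an identical residue; '+' and ' ' do not count.
    matches = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * matches / len(query)


# Last block of the XP_004142665.1 alignment (leading padding removed).
query_block = "SSRKIGDQQCNYHTKI"
mid_block = "S RKIGDQQCNYHTKI"
print(percent_identity(query_block, mid_block))  # 93.75 (15 of 16 columns identical)
```

Summing matches and column counts over all blocks of a hit, rather than one block, would be the way to approximate the tabulated value.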


Sequences
CDS sequence
ATGGAAATCGCCTCCTCCGCAACCAAGGATCTCAAATCCACCACCGCCGCCGCCCGCTCCCGGAGACGCCGCAACATCTGCATCGGAGCCTCCCTCGGTGCGCTCCTCCT
CCTCGTCATTCTAATCCTCATTCTGGCCTTCACGGTGTTCAAGGCCAGGCGCCCGATCACCGCCATCAACTCCGTCGCCCTAGCGGATCTCGCCGTCTCGCTCGACATAG
CCCGAGTCGCCGTCGACATCAACGTCACTCTGATCGCCGCCGTCGCCGTTACGAACCCTAACAAGGTCGGTTTCAGCTACTCGAACAGCACCGCGCTGCTCAACTACAGA
GGCGAGCTGGTCGGGGAGGCGCCGATTCCCGCCGGCCGGATCGACGCGGATCAGAGCAAGGACATGAACATCACGCTCACGATCATGGCGGACCGGCTGCTGAGTAAGTC
GGCGGCGGTGTTCTCCGACGTGGTCGCCGGATCGATGCCGTTGAACACGTACACGAGAATTTCAGGGAGGGTGAAGATATTGGGGATTTTCAAGATTCACGTCGTTTCGA
CCACGTCCTGTGATTTGACAATCGACATATCGAGCAGAAAAATTGGAGATCAGCAGTGTAATTATCATACTAAGATC
mRNA sequence
ATGGAAATCGCCTCCTCCGCAACCAAGGATCTCAAATCCACCACCGCCGCCGCCCGCTCCCGGAGACGCCGCAACATCTGCATCGGAGCCTCCCTCGGTGCGCTCCTCCT
CCTCGTCATTCTAATCCTCATTCTGGCCTTCACGGTGTTCAAGGCCAGGCGCCCGATCACCGCCATCAACTCCGTCGCCCTAGCGGATCTCGCCGTCTCGCTCGACATAG
CCCGAGTCGCCGTCGACATCAACGTCACTCTGATCGCCGCCGTCGCCGTTACGAACCCTAACAAGGTCGGTTTCAGCTACTCGAACAGCACCGCGCTGCTCAACTACAGA
GGCGAGCTGGTCGGGGAGGCGCCGATTCCCGCCGGCCGGATCGACGCGGATCAGAGCAAGGACATGAACATCACGCTCACGATCATGGCGGACCGGCTGCTGAGTAAGTC
GGCGGCGGTGTTCTCCGACGTGGTCGCCGGATCGATGCCGTTGAACACGTACACGAGAATTTCAGGGAGGGTGAAGATATTGGGGATTTTCAAGATTCACGTCGTTTCGA
CCACGTCCTGTGATTTGACAATCGACATATCGAGCAGAAAAATTGGAGATCAGCAGTGTAATTATCATACTAAGATC
Protein sequence
MEIASSATKDLKSTTAAARSRRRRNICIGASLGALLLLVILILILAFTVFKARRPITAINSVALADLAVSLDIARVAVDINVTLIAAVAVTNPNKVGFSYSNSTALLNYR
GELVGEAPIPAGRIDADQSKDMNITLTIMADRLLSKSAAVFSDVVAGSMPLNTYTRISGRVKILGIFKIHVVSTTSCDLTIDISSRKIGDQQCNYHTKI
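
The protein sequence is expected to be the CDS read codon by codon. Below is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The name `translate` is illustrative and not a CuGenDB function, and only the first 30 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, one residue per codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGAAATCGCCTCCTCCGCAACCAAGGAT"
print(translate(cds_start))  # MEIASSATKD

# Length of the genome location MC01:10090263..10090889 (inclusive coordinates).
span = 10090889 - 10090263 + 1
print(span, span // 3)  # 627 209
```

The translated prefix matches the start of the protein sequence, and the 627-nt span would correspond to 209 codons if the whole span is coding.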