; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g09030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g09030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1218)
Genome locationchr10:6661336..6662371
RNA-Seq ExpressionMoc10g09030
SyntenyMoc10g09030
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600430.1 Protein MODIFYING WALL LIGNIN-1, partial [Cucurbita argyrosperma subsp. sororia]2.7e-6273.12Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQS-SKRPNLA
        MEKRR  Y L+LSIVVS ALVA VSCIAAELHRTK KDL+LDGK CYLPE++AF YGVAAL CLV+AQVIGN+L C S   NSR KK ++Q   +R NLA
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQS-SKRPNLA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        IILLV+SWASFTVVI+LLSAA+SMSR+Q Y AGWL GECYLVK+GV++A+AILILV+  S V SAV +LRKSLQIDES KTS  PK
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_008451867.1 PREDICTED: uncharacterized protein LOC103493027 [Cucumis melo]2.5e-6372.97Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_011653242.1 protein MODIFYING WALL LIGNIN-2 [Cucumis sativus]3.5e-6574.59Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K R GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LLS ASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ STVGSAVT+     +I+ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_022136395.1 uncharacterized protein LOC111008116 [Momordica charantia]7.4e-92100Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_038895935.1 protein MODIFYING WALL LIGNIN-2-like [Benincasa hispida]4.8e-6776.22Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEK  L YTL+LSI+VSLAL+AFVSC+AAELHRTK  DLKLDGK CYLPE+RAFGYGVAAL CLVMAQVIGNILLC S  FN R+KK S QS KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        I L+VSWASFTVVI+LLSAASSMSRRQPYAAGWLGGECYLVK+GV+VA+A+LIL++  STVGSAVT+     QI++S K++A PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

TrEMBL top hitse value%identityAlignment
A0A0A0KYL9 Uncharacterized protein1.7e-6574.59Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K R GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LLS ASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ STVGSAVT+     +I+ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A1S3BTM2 uncharacterized protein LOC1034930271.2e-6372.97Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A5A7T9R1 DUF1218 domain-containing protein1.2e-6372.97Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A6J1C3S2 uncharacterized protein LOC1110081163.6e-92100Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A6J1J6B1 uncharacterized protein LOC1114816211.5e-6172.04Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQS-SKRPNLA
        MEKRR  Y L+LSIVVS ALVA VSCIAAELHRTK KDL+LDGK CYLPE++AF YGVAAL C V+AQVIGN+L C +   NSR KK ++Q   +R  LA
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQS-SKRPNLA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        IILLV+SWASFTVVI+LLSAA+SMSR+Q Y AGWL GECYLVK+GV++A+AILILV+ GS V SAV +LRKSLQIDES KTS  PK
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-14.4e-3141.95Show/hide
Query:  LVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWAS
        L AF  C++AE  + K           KDLK DG+ CYLPENRAFG G+AALVC+ +AQ++GN+++CR F             ++     IILL+ SW +
Subjt:  LVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWAS

Query:  FTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ
        F V + L+S  +SM+R Q Y  GWL  ECYLVK GVF AS  L + T+ + +G+    ++ SLQ++   K   Q
Subjt:  FTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ

O65708 Protein MODIFYING WALL LIGNIN-24.6e-2843.21Show/hide
Query:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT
        S+V SL LV+F++C AAE  RT+ +D++ D  + CY+P + AFG G AA++C  +AQ++GNI++ R+ +  +R K+          L  +LL++SW++F 
Subjt:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT

Query:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ
        VV+++LS A SMSR Q Y  GWL  +CYLVK GVF AS  L ++ +G+   SA  I  K  Q
Subjt:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)7.0e-3240.22Show/hide
Query:  LTLSIVVSLALVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLA
        L    +    L AF  C++AE  + K           KDLK DG+ CYLPENRAFG G+AALVC+ +AQ++GN+++CR F             ++     
Subjt:  LTLSIVVSLALVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ
        IILL+ SW +F V + L+S  +SM+R Q Y  GWL  ECYLVK GVF AS  L + T+ + +G+    ++ SLQ++   K   Q
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ

AT4G19370.1 Protein of unknown function (DUF1218)3.3e-2943.21Show/hide
Query:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT
        S+V SL LV+F++C AAE  RT+ +D++ D  + CY+P + AFG G AA++C  +AQ++GNI++ R+ +  +R K+          L  +LL++SW++F 
Subjt:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT

Query:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ
        VV+++LS A SMSR Q Y  GWL  +CYLVK GVF AS  L ++ +G+   SA  I  K  Q
Subjt:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ

AT4G21310.1 Protein of unknown function (DUF1218)1.5e-0528.68Show/hide
Query:  RRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNIL---LCRSFKFNSREKKCSNQSSKRPNLAI
        R +G+ + + +++++ + A +  I AE+ + K+K LK+   +C  P   AF YG+AA + LV+A V  N L   LC +       ++   +SS    LA+
Subjt:  RRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNIL---LCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLL---SAASSMSRR
          L+ +W    +   +L   + A+S SR+
Subjt:  ILLVVSWASFTVVIVLL---SAASSMSRR

AT5G17210.1 Protein of unknown function (DUF1218)3.9e-0629.01Show/hide
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGK----QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRP
        ME+R++   +   ++  L L++ V+   AE  R K   + +       +C  P + AF  G  + + L+MAQ+I ++    S  F  R+    ++S+   
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGK----QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRP

Query:  NLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTV
         +++I  VVSW +F +  +VLLS A+        +       CY+VK GVF   A+L LVT+
Subjt:  NLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTV

AT5G17210.2 Protein of unknown function (DUF1218)4.3e-0532.46Show/hide
Query:  QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKA
        +C  P + AF  G  + + L+MAQ+I ++    S  F  R+    ++S+    +++I  VVSW +F +  +VLLS A+        +       CY+VK 
Subjt:  QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKA

Query:  GVFVASAILILVTV
        GVF   A+L LVT+
Subjt:  GVFVASAILILVTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAACGCCGTCTCGGGTACACCTTAACCCTCTCCATAGTTGTCTCGCTCGCGCTCGTCGCCTTTGTTTCGTGTATAGCTGCAGAATTACACAGAACAAAG
ATCAAAGACCTCAAGTTGGATGGGAAGCAGTGTTATTTGCCAGAAAATCGAGCATTTGGATATGGAGTTGCAGCCTTGGTGTGTTTGGTTATGGCTCAAGTTATT
GGGAATATTTTACTTTGCAGAAGTTTCAAGTTCAATTCAAGAGAGAAAAAATGCAGCAATCAATCTTCTAAAAGACCAAATTTAGCCATAATTCTTCTGGTTGTT
TCTTGGGCAAGCTTTACCGTGGTGATCGTGTTGCTGAGCGCGGCGTCGAGTATGAGCAGACGGCAGCCGTACGCGGCGGGCTGGTTAGGCGGCGAGTGCTACTTG
GTGAAGGCCGGCGTATTCGTCGCCTCTGCAATTCTGATCCTCGTCACGGTAGGCTCAACCGTTGGCTCCGCCGTAACAATCTTGAGGAAGAGCCTTCAGATCGAC
GAATCCGGAAAAACTAGCGCACAGCCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAACGCCGTCTCGGGTACACCTTAACCCTCTCCATAGTTGTCTCGCTCGCGCTCGTCGCCTTTGTTTCGTGTATAGCTGCAGAATTACACAGAACAAAG
ATCAAAGACCTCAAGTTGGATGGGAAGCAGTGTTATTTGCCAGAAAATCGAGCATTTGGATATGGAGTTGCAGCCTTGGTGTGTTTGGTTATGGCTCAAGTTATT
GGGAATATTTTACTTTGCAGAAGTTTCAAGTTCAATTCAAGAGAGAAAAAATGCAGCAATCAATCTTCTAAAAGACCAAATTTAGCCATAATTCTTCTGGTTGTT
TCTTGGGCAAGCTTTACCGTGGTGATCGTGTTGCTGAGCGCGGCGTCGAGTATGAGCAGACGGCAGCCGTACGCGGCGGGCTGGTTAGGCGGCGAGTGCTACTTG
GTGAAGGCCGGCGTATTCGTCGCCTCTGCAATTCTGATCCTCGTCACGGTAGGCTCAACCGTTGGCTCCGCCGTAACAATCTTGAGGAAGAGCCTTCAGATCGAC
GAATCCGGAAAAACTAGCGCACAGCCAAAATGA
Protein sequenceShow/hide protein sequence
MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVV
SWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK