CuGenDBv2

MC10g0794 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC10g0794
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1218)
Genome location: MC10:6642872..6644549
RNA-Seq Expression: MC10g0794
Synteny: MC10g0794
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
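
The genome location field packs a chromosome name and a coordinate range into one string. The following is a minimal sketch of how such a string could be split and its span computed. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'CHROM:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC10:6642872..6644549")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1678 bp on MC10.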


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600430.1 Protein MODIFYING WALL LIGNIN-1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.31e-81; % identity: 73.12)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSS-KRPNLA
        MEKRR  Y L+LSIVVS ALVA VSCIAAELHRTK KDL+LDGK CYLPE++AF YGVAAL CLV+AQVIGN+L C S   NSR KK ++Q   +R NLA
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSS-KRPNLA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        IILLV+SWASFTVVI+LLSAA+SMSR+Q Y AGWL GECYLVK+GV++A+AILILV+  S V SAV +LRKSLQIDES KTS  PK
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_008451867.1 PREDICTED: uncharacterized protein LOC103493027 [Cucumis melo] (e value: 1.64e-82; % identity: 72.97)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_011653242.1 protein MODIFYING WALL LIGNIN-2 [Cucumis sativus] (e value: 6.02e-85; % identity: 74.59)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K R GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LLS ASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ STVGSAVT+     +I+ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_022136395.1 uncharacterized protein LOC111008116 [Momordica charantia] (e value: 4.54e-120; % identity: 100)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

XP_038895935.1 protein MODIFYING WALL LIGNIN-2-like [Benincasa hispida] (e value: 2.20e-87; % identity: 76.22)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEK  L YTL+LSI+VSLAL+AFVSC+AAELHRTK  DLKLDGK CYLPE+RAFGYGVAAL CLVMAQVIGNILLC S  FN R+KK S QS KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        I L+VSWASFTVVI+LLSAASSMSRRQPYAAGWLGGECYLVK+GV+VA+A+LIL++  STVGSAVT+     QI++S K++A PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KYL9 Uncharacterized protein (e value: 2.91e-85; % identity: 74.59)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K R GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LLS ASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ STVGSAVT+     +I+ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A1S3BTM2 uncharacterized protein LOC103493027 (e value: 7.94e-83; % identity: 72.97)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A5A7T9R1 DUF1218 domain-containing protein (e value: 7.94e-83; % identity: 72.97)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        M K   GY L+LSIVVSLAL+AFVSC+AAELHRTK KDLKLDGK CYLPE++AFGYGVAAL CLVMAQVIGNILLC S   NSREKK S Q  KRPNLA 
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
          LVVSWASFTVVI+LL AASSMSR+QPYA GWLGGECYLVK+GV+VA+AILIL+++ +TVGSAVT+     +++ES K++  PK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A6J1C3S2 uncharacterized protein LOC111008116 (e value: 2.20e-120; % identity: 100)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
        MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
Subjt:  ILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

A0A6J1J6B1 uncharacterized protein LOC111481621 (e value: 3.76e-80; % identity: 72.04)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPN-LA
        MEKRR  Y L+LSIVVS ALVA VSCIAAELHRTK KDL+LDGK CYLPE++AF YGVAAL C V+AQVIGN+L C +   NSR KK ++Q   R + LA
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPN-LA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
        IILLV+SWASFTVVI+LLSAA+SMSR+Q Y AGWL GECYLVK+GV++A+AILILV+ GS V SAV +LRKSLQIDES KTS  PK
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK

SwissProt top hits (e value, % identity, alignment)
A2RVU1 Protein MODIFYING WALL LIGNIN-1 (e value: 4.4e-31; % identity: 41.95)
Query:  LVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWAS
        L AF  C++AE  + K           KDLK DG+ CYLPENRAFG G+AALVC+ +AQ++GN+++CR F             ++     IILL+ SW +
Subjt:  LVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWAS

Query:  FTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ
        F V + L+S  +SM+R Q Y  GWL  ECYLVK GVF AS  L + T+ + +G+    ++ SLQ++   K   Q
Subjt:  FTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ

O65708 Protein MODIFYING WALL LIGNIN-2 (e value: 4.6e-28; % identity: 43.21)
Query:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT
        S+V SL LV+F++C AAE  RT+ +D++ D  + CY+P + AFG G AA++C  +AQ++GNI++ R+ +  +R K+          L  +LL++SW++F 
Subjt:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT

Query:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ
        VV+++LS A SMSR Q Y  GWL  +CYLVK GVF AS  L ++ +G+   SA  I  K  Q
Subjt:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G31720.1 Protein of unknown function (DUF1218) (e value: 7.0e-32; % identity: 40.22)
Query:  LTLSIVVSLALVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLA
        L    +    L AF  C++AE  + K           KDLK DG+ CYLPENRAFG G+AALVC+ +AQ++GN+++CR F             ++     
Subjt:  LTLSIVVSLALVAFVSCIAAELHRTKI----------KDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLA

Query:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ
        IILL+ SW +F V + L+S  +SM+R Q Y  GWL  ECYLVK GVF AS  L + T+ + +G+    ++ SLQ++   K   Q
Subjt:  IILLVVSWASFTVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQ

AT4G19370.1 Protein of unknown function (DUF1218) (e value: 3.3e-29; % identity: 43.21)
Query:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT
        S+V SL LV+F++C AAE  RT+ +D++ D  + CY+P + AFG G AA++C  +AQ++GNI++ R+ +  +R K+          L  +LL++SW++F 
Subjt:  SIVVSLALVAFVSCIAAELHRTKIKDLKLD-GKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFT

Query:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ
        VV+++LS A SMSR Q Y  GWL  +CYLVK GVF AS  L ++ +G+   SA  I  K  Q
Subjt:  VVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQ

AT4G21310.1 Protein of unknown function (DUF1218) (e value: 1.5e-05; % identity: 28.68)
Query:  RRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNIL---LCRSFKFNSREKKCSNQSSKRPNLAI
        R +G+ + + +++++ + A +  I AE+ + K+K LK+   +C  P   AF YG+AA + LV+A V  N L   LC +       ++   +SS    LA+
Subjt:  RRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNIL---LCRSFKFNSREKKCSNQSSKRPNLAI

Query:  ILLVVSWASFTVVIVLL---SAASSMSRR
          L+ +W    +   +L   + A+S SR+
Subjt:  ILLVVSWASFTVVIVLL---SAASSMSRR

AT5G17210.1 Protein of unknown function (DUF1218) (e value: 3.9e-06; % identity: 29.01)
Query:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGK----QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRP
        ME+R++   +   ++  L L++ V+   AE  R K   + +       +C  P + AF  G  + + L+MAQ+I ++    S  F  R+    ++S+   
Subjt:  MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGK----QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRP

Query:  NLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTV
         +++I  VVSW +F +  +VLLS A+        +       CY+VK GVF   A+L LVT+
Subjt:  NLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTV

AT5G17210.2 Protein of unknown function (DUF1218) (e value: 4.3e-05; % identity: 32.46)
Query:  QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKA
        +C  P + AF  G  + + L+MAQ+I ++    S  F  R+    ++S+    +++I  VVSW +F +  +VLLS A+        +       CY+VK 
Subjt:  QCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASFTVV-IVLLSAASSMSRRQPYAAGWLGGECYLVKA

Query:  GVFVASAILILVTV
        GVF   A+L LVT+
Subjt:  GVFVASAILILVTV
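
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The reported % identity is consistent with identical columns divided by alignment length. The sketch below illustrates that calculation on a short fragment; the function name `percent_identity` is hypothetical, and the midline is padded to the query length on the assumption that trailing spaces may have been lost in extraction.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    `query` is the aligned query line (gaps included) and `midline` is the
    match line printed beneath it. Letters in the midline mark identities.
    """
    if not query:
        raise ValueError("empty alignment")
    # Restore trailing spaces that may have been stripped from the midline.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded[:len(query)] if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# First ten columns of the first GenBank alignment, used as a toy example.
print(percent_identity("MEKRRLGYTL", "MEKRR  Y L"))
```

For a full hit, the query and midline blocks would be concatenated across all alignment rows before calling the function.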


Sequences
CDS sequence
ATGGAAAAACGCCGTCTCGGGTACACCTTAACCCTCTCCATAGTTGTCTCGCTCGCGCTCGTCGCCTTTGTTTCGTGTATAGCTGCAGAATTACACAGAACAAAGATCAA
AGACCTCAAGTTGGATGGGAAGCAGTGTTATTTGCCAGAAAATCGAGCATTTGGATATGGAGTTGCAGCCTTGGTGTGTTTGGTTATGGCTCAAGTTATTGGGAATATTT
TACTTTGCAGAAGTTTCAAGTTCAATTCAAGAGAGAAAAAATGCAGCAATCAATCTTCTAAAAGACCAAATTTAGCCATAATTCTTCTGGTTGTTTCTTGGGCAAGCTTT
ACCGTGGTGATCGTGTTGCTGAGCGCGGCGTCGAGTATGAGCAGACGGCAGCCGTACGCGGCGGGCTGGTTAGGCGGCGAGTGCTACTTGGTGAAGGCCGGCGTATTCGT
CGCCTCTGCAATTCTGATCCTCGTCACGGTAGGCTCAACCGTTGGCTCCGCCGTAACAATCTTGAGGAAGAGCCTTCAGATCGACGAATCCGGAAAAACTAGCGCACAGC
CAAAATGA
mRNA sequence
TGTTGAAGTAATTAGTTCACAAATTCCCTTTCAAACTTGTAATTTTCCTCCATCCATTATCACATACATATATCCAAAATCCCAATTGGACATAGCGTACCAATATTGAA
TTAAATTGACCAAATTGCCAAATCTAAGTCAACTTTACCCCCAGAAAACTAATAATAAAAGAAAAAAAAATTAAACTACCACACTTTCCAAGTTTCCATTCAGGTTGGTC
TCAAACAAACCAAATTGGTATCTCTGTTTCCAAGCAACTAAAAAATGCCAATTGAAAGAAAAATTTCAACTACTAACTCAAAAGTAGCAAGCAACCCCATGAGCTCCCAA
TTCCTCTTTAGTCACATTCAAGCTACTCCTTTCTCTGTGCTTTAAATTCTTCCTTCACAAAACACCACAGCCCGCCATGGAAAAACGCCGTCTCGGGTACACCTTAACCC
TCTCCATAGTTGTCTCGCTCGCGCTCGTCGCCTTTGTTTCGTGTATAGCTGCAGAATTACACAGAACAAAGATCAAAGACCTCAAGTTGGATGGGAAGCAGTGTTATTTG
CCAGAAAATCGAGCATTTGGATATGGAGTTGCAGCCTTGGTGTGTTTGGTTATGGCTCAAGTTATTGGGAATATTTTACTTTGCAGAAGTTTCAAGTTCAATTCAAGAGA
GAAAAAATGCAGCAATCAATCTTCTAAAAGACCAAATTTAGCCATAATTCTTCTGGTTGTTTCTTGGGCAAGCTTTACCGTGGTGATCGTGTTGCTGAGCGCGGCGTCGA
GTATGAGCAGACGGCAGCCGTACGCGGCGGGCTGGTTAGGCGGCGAGTGCTACTTGGTGAAGGCCGGCGTATTCGTCGCCTCTGCAATTCTGATCCTCGTCACGGTAGGC
TCAACCGTTGGCTCCGCCGTAACAATCTTGAGGAAGAGCCTTCAGATCGACGAATCCGGAAAAACTAGCGCACAGCCAAAATGAGAGAGAGGGAAGAAGATAAAATAAAA
GCGTCGGAACGGCGGCGGCCGGTGGCCGGTTGCCGGCGGCGTGAACGCCATTGGATGAGAGCTCCCCCGGTCGCCGTATTCCTGTGGATGGCGCCTCCCCAAGACGGGCC
GCGAGATCTCGAGTTCACAGTTTTATTTTATTTTATTTTTTCATAATTGTAAATCGGGGAAAATGCACACGAATTGGATAATGGATAGTTTATTTGGCAC
Protein sequence
MEKRRLGYTLTLSIVVSLALVAFVSCIAAELHRTKIKDLKLDGKQCYLPENRAFGYGVAALVCLVMAQVIGNILLCRSFKFNSREKKCSNQSSKRPNLAIILLVVSWASF
TVVIVLLSAASSMSRRQPYAAGWLGGECYLVKAGVFVASAILILVTVGSTVGSAVTILRKSLQIDESGKTSAQPK
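
The protein sequence is the conceptual translation of the CDS sequence listed above. As a minimal sketch, the relationship can be checked with a small translator; this assumes the standard nuclear genetic code, which the record does not state explicitly, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
# ASSUMPTION: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed in this record.
print(translate("ATGGAAAAACGCCGTCTCGGGTACACCTTA"))
print(translate("GCACAGCCAAAATGA"))
```

The two fragments translate to `MEKRRLGYTL` and `AQPK`, matching the start and end of the protein sequence; passing the full CDS would allow the whole sequence to be compared in the same way.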