; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1632 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1632
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUnknown protein
Genome locationMC08:24208642..24210992
RNA-Seq ExpressionMC08g1632
SyntenyMC08g1632
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606925.1 hypothetical protein SDJN03_00267, partial [Cucurbita argyrosperma subsp. sororia]3.08e-10369.73Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        M GL+ MAT++RE  IDLEGG  +SE+D ++E +S SK H RK   RLRSGFLC DGSI+RG SFASSSN+T+L KL VDENVELL+DKS +GEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        AL EKKKN+KE IK NGK+HKPPRPP GPSLDAADRI V+E+T+LA KKRA VERIKALKKMKAEKTSSFNS++PALFITLLFFV+IIFQGMSA GSA++
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN
         ESPVP+   SA LIS ++HSS+S P  VN PQS  L+FA          GEA  VEDLKN
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN

XP_004147994.1 uncharacterized protein LOC101214824 [Cucumis sativus]3.61e-10672.28Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG ++ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR   FASSSN+T+L KL VDENVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EK  N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRATVERIKALKKMKAEKTSSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
          S+SP P+VGGSA LI  V+HS Q  S P  VN P+S  LNFA K TSD  +AV EA LVE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

XP_008447470.1 PREDICTED: uncharacterized protein LOC103489909 [Cucumis melo]1.96e-10370.79Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EK  N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRATVERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
          S+SP P+VGG A LI  V+HS Q  S P  VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

XP_022157030.1 uncharacterized protein LOC111023856 [Momordica charantia]9.80e-169100Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
        SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

XP_038903260.1 uncharacterized protein LOC120089897 [Benincasa hispida]1.79e-10773.11Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGL+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTF RLRSGFL  DGSISR  SFASSSN+T+L KL VDENVEL MD SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EKK N+K KI   GK+HKPPRPP GPSLDAADRI V+E+TELA KKRATVERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
          S+SP P+VGGSA LI  V+HS QS P  VN PQS  LNFA K TSD  A+ EAR VE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

TrEMBL top hitse value%identityAlignment
A0A0A0LB07 Uncharacterized protein1.75e-10672.28Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG ++ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR   FASSSN+T+L KL VDENVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EK  N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRATVERIKALKKMKAEKTSSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
          S+SP P+VGGSA LI  V+HS Q  S P  VN P+S  LNFA K TSD  +AV EA LVE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A1S3BHI9 uncharacterized protein LOC1034899099.48e-10470.79Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EK  N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRATVERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
          S+SP P+VGG A LI  V+HS Q  S P  VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A5D3DAG0 Putative transmembrane protein9.48e-10470.79Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        A  EK  N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRATVERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGMSAKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
          S+SP P+VGG A LI  V+HS Q  S P  VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQ--SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A6J1DS03 uncharacterized protein LOC1110238564.75e-169100Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
        SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

A0A6J1G9J4 uncharacterized protein LOC1114521941.63e-10068.58Show/hide
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        M GL+ MAT++RE  IDLEGG  +SE+D ++E +S SK H RK   RLRSGFLC DGSI+RG SFASSSN+T+L KL VDENVELL+DKS +GEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV
        AL EKKKN+KE IK NGK+HKPPRPP  PSLDAADRI V+E+T+LA KKRA VERIKALKK KAEKTSSFNS++PALFITLLFFV+IIFQGMSA GSA++
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN
         ESPVP+   SA LI  ++HSS+S P  VN PQS  L+FA          GEA  VEDLKN
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein3.1e-1730.08Show/hide
Query:  LNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALV
        +++M + E++L +D+E G S   ++S ++                         ++S    ++  +N     K++ D +  L+ D++      + L   +
Subjt:  LNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALV

Query:  EKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERI-KALKKMKAEKTSSFNSSIP--ALFITLLFFVIIIFQGMSAKGSAMV
         +KK    K KK+ K  KPPRPP GPSL   DR ++R+I ELA +KRA +ER+ K+LK++KA KTS  +  I   ++ IT +FF  ++FQG S   S+M 
Subjt:  EKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERI-KALKKMKAEKTSSFNSSIP--ALFITLLFFVIIIFQGMSAKGSAMV

Query:  SE-SPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTS
        S+ SP PTV  +  +IS   ++  +P ++ +   +  L +  K  S
Subjt:  SE-SPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTS

AT3G17120.1 unknown protein1.0e-1543.7Show/hide
Query:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMSAKGSAMVSE
        + + KEK KK+    KPPRPP GPSLDAAD+ L+REI ELA  KRA +ER++ALKK +A K +S  SS   + A   T +FF +++FQG+S + +    +
Subjt:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMSAKGSAMVSE

Query:  SPVPTVGGSASLISSVRHS
        S +   G +     SV+++
Subjt:  SPVPTVGGSASLISSVRHS

AT3G17120.2 unknown protein1.0e-1543.7Show/hide
Query:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMSAKGSAMVSE
        + + KEK KK+    KPPRPP GPSLDAAD+ L+REI ELA  KRA +ER++ALKK +A K +S  SS   + A   T +FF +++FQG+S + +    +
Subjt:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMSAKGSAMVSE

Query:  SPVPTVGGSASLISSVRHS
        S +   G +     SV+++
Subjt:  SPVPTVGGSASLISSVRHS

AT4G01960.1 unknown protein1.3e-1833.33Show/hide
Query:  KERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNL
        +E +L +D+E G  +  ++      S ++  S      + SG L  DG                 S+ S D+ V+ LM +    E+  +   L + K + 
Subjt:  KERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNL

Query:  KEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMVSE-SPVPTV
         +K KK  K  KPPRPP GP L A D+ L+REITELA +KRA +ER+K L+++KA K+SS  SSI A+ +T++FFV +IFQG     +++ S+ SP P  
Subjt:  KEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMVSE-SPVPTV

Query:  GGSASLISSVRHSSQSPPQKVN
          +  ++S   ++  +P ++++
Subjt:  GGSASLISSVRHSSQSPPQKVN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTTTAAATCGTATGGCTACAAAAGAGAGAGAGTTGGTGATAGACCTTGAAGGTGGTGGTAGTAGTAGTGAAGAGGATTCGAACAATGAACCTGAATCAATATC
AAAAGTACATTCTAGAAAAACCTTTGGCAGGCTTCGGAGTGGGTTTCTGTGCCCCGATGGATCTATAAGTCGAGGTTCTAGCTTTGCTTCCAGTAGTAATGCCACCAGGC
TCAGTAAGCTTAGTGTTGATGAGAATGTGGAGTTGTTGATGGACAAGAGTTCAGAGGGAGAGAAGAGAAGAGAACTTGGGGCTCTTGTCGAGAAGAAGAAGAACCTGAAA
GAGAAGATTAAAAAGAATGGAAAGATTCACAAGCCACCACGACCTCCGACGGGTCCATCGCTTGATGCTGCTGACAGGATTTTGGTTAGGGAGATCACAGAGTTGGCATC
GAAAAAGCGTGCCACTGTTGAGCGAATAAAAGCTTTGAAAAAGATGAAAGCAGAGAAAACATCTTCATTCAACAGCAGCATACCTGCCTTATTTATCACATTGCTTTTCT
TTGTAATTATCATCTTTCAAGGAATGAGTGCTAAAGGTAGTGCTATGGTGTCGGAGTCTCCTGTGCCTACTGTTGGTGGGAGCGCAAGCTTGATATCTAGTGTTCGGCAT
TCATCTCAATCTCCTCCCCAAAAAGTTAACAGACCTCAATCCCGCTTTCTCAATTTTGCAGAAAAGACAACTTCTGATCGTTCTGCCGTAGGAGAAGCGAGATTGGTGGA
AGATTTGAAGAACCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCCGCACCTCACAATTTCAGATCTGAAATTCAATTCTCTTCCATTTCGGGTTTCCCCCAAATTTCTTCTCTCATCTCGTTTCTTCGAGGGTTTATATTCAGTTTCT
TTATTCCCCAGATCTCTTTCTGGGTTCTGTTGTTAGATCCCTCCATGGATTTTAGCTGTAAAATGGTATAAATGGCTTCTACTTGTTGGTCTACGATCACCCGGTTCACA
AGAGGGTTAAGAAGATGAGAAGAAAGGAAGGCCCCTTCGATTCTGGATCCTGGTAGGATGAGTGCCCTGCCGAAATGGAAAGTGAAACCTCATTAGATGGGTGGTTTAAA
TCGTATGGCTACAAAAGAGAGAGAGTTGGTGATAGACCTTGAAGGTGGTGGTAGTAGTAGTGAAGAGGATTCGAACAATGAACCTGAATCAATATCAAAAGTACATTCTA
GAAAAACCTTTGGCAGGCTTCGGAGTGGGTTTCTGTGCCCCGATGGATCTATAAGTCGAGGTTCTAGCTTTGCTTCCAGTAGTAATGCCACCAGGCTCAGTAAGCTTAGT
GTTGATGAGAATGTGGAGTTGTTGATGGACAAGAGTTCAGAGGGAGAGAAGAGAAGAGAACTTGGGGCTCTTGTCGAGAAGAAGAAGAACCTGAAAGAGAAGATTAAAAA
GAATGGAAAGATTCACAAGCCACCACGACCTCCGACGGGTCCATCGCTTGATGCTGCTGACAGGATTTTGGTTAGGGAGATCACAGAGTTGGCATCGAAAAAGCGTGCCA
CTGTTGAGCGAATAAAAGCTTTGAAAAAGATGAAAGCAGAGAAAACATCTTCATTCAACAGCAGCATACCTGCCTTATTTATCACATTGCTTTTCTTTGTAATTATCATC
TTTCAAGGAATGAGTGCTAAAGGTAGTGCTATGGTGTCGGAGTCTCCTGTGCCTACTGTTGGTGGGAGCGCAAGCTTGATATCTAGTGTTCGGCATTCATCTCAATCTCC
TCCCCAAAAAGTTAACAGACCTCAATCCCGCTTTCTCAATTTTGCAGAAAAGACAACTTCTGATCGTTCTGCCGTAGGAGAAGCGAGATTGGTGGAAGATTTGAAGAACC
ACTGAAGTTCAGTGTTAAATAATCTGGTAAAACAACTGAAACTTTTCATACCTTATATAACTTGATTGTGAAGGCTTCAGACTGCTTTGGTGTCTGTATATATGCAAAGG
CCTCAAAGCTTCCTATGATCTTTTGGTAGGCTTTTCTTGTACAGAGAGAGTTCACATTTTATTGAACCATATTATTTTTTTGTTCTGTTAAAGTCCAATTTGAAATTGGT
ATGCTCTCACCAGAAACCCAGAACGTTGAACTTACTCACATGGACTTCAAAAGAATTGATGGTGTATACAAGATTTATATAAATATATTGACATTTGGAGACACAATTTC
Protein sequenceShow/hide protein sequence
MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNLK
EKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRATVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMSAKGSAMVSESPVPTVGGSASLISSVRH
SSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH