CuGenDBv2

MS018961 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS018961
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold20:244924..246041
RNA-Seq Expression: MS018961
Synteny: MS018961
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606925.1 hypothetical protein SDJN03_00267, partial [Cucurbita argyrosperma subsp. sororia]; e value: 9.9e-82; % identity: 69.35
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        M GL+ MAT++RE  IDLEGG  +SE+D ++E +S SK H RK   RLRSGFLC DGSI+RG SFASSSN+T+L KL VDENVELL+DKS +GEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV
        AL EKKKN+KE IK NGK+HKPPRPP GPSLDAADRI V+E+T+LA KKRA+VERIKALKKMKAEKTSSFNS++PALFITLLFFV+IIFQGM A GSA++
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN
         ESPVP+   SA LI S++HSS+S P  VN PQS  L+FA          GEA  VEDLKN
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN

XP_004147994.1 uncharacterized protein LOC101214824 [Cucumis sativus]; e value: 2.3e-83; % identity: 71.05
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG ++ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR   FASSSN+T+L KL VDENVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAM-
        A  E K N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRA+VERIKALKKMKAEKTSSFNSS+PALFITLLFFV+IIFQGM AKGS M 
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAM-

Query:  -VSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
         VS+SP P+VGGSA LI  V+HS Q      VN P+S  LNFA K TSD  +AV EA LVE+LKNH
Subjt:  -VSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

XP_008447470.1 PREDICTED: uncharacterized protein LOC103489909 [Cucumis melo]; e value: 2.9e-81; % identity: 69.55
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A
        A  E K N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRA+VERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGM AKGS   
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A

Query:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
        MVS+SP P+VGG A LI  V+HS Q      VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

XP_022157030.1 uncharacterized protein LOC111023856 [Momordica charantia]; e value: 5.2e-131; % identity: 99.24
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV
        ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRA+VERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGM AKGSAMV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
        SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

XP_038903260.1 uncharacterized protein LOC120089897 [Benincasa hispida]; e value: 2.1e-84; % identity: 72.35
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGL+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTF RLRSGFL  DGSISR  SFASSSN+T+L KL VDENVEL MD SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV
        A  E KKN+K KI   GK+HKPPRPP GPSLDAADRI V+E+TELA KKRA+VERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGM AKGS MV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV

Query:  --SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
          S+SP P+VGGSA LI  V+HS QS P  VN PQS  LNFA K TSD  A+ EAR VE+LKNH
Subjt:  --SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LB07 Uncharacterized protein; e value: 1.1e-83; % identity: 71.05
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG ++ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR   FASSSN+T+L KL VDENVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAM-
        A  E K N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRA+VERIKALKKMKAEKTSSFNSS+PALFITLLFFV+IIFQGM AKGS M 
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAM-

Query:  -VSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
         VS+SP P+VGGSA LI  V+HS Q      VN P+S  LNFA K TSD  +AV EA LVE+LKNH
Subjt:  -VSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A1S3BHI9 uncharacterized protein LOC103489909; e value: 1.4e-81; % identity: 69.55
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A
        A  E K N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRA+VERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGM AKGS   
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A

Query:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
        MVS+SP P+VGG A LI  V+HS Q      VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A5D3DAG0 Putative transmembrane protein; e value: 1.4e-81; % identity: 69.55
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MG L+ MATK+RE  IDLEGGG++SE+D ++E +S SK H+RKTFGRLRSGFL  D S+SR  +FASSSN+T+L KL VD+NVELLM+ SS+GEKRRE G
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A
        A  E K N+K KIKKNGK+HKPPRPP GPSLDAADRI VREI ELA KKRA+VERIKALKKMKAEK SSFNSS+PALFITLLFFV+IIFQGM AKGS   
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGS--A

Query:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH
        MVS+SP P+VGG A LI  V+HS Q      VN P+S  LNFA K TSD  +AV E   VE+LKNH
Subjt:  MVSESPVPTVGGSASLISSVRHSSQ-SPPQKVNRPQSRFLNFAEKTTSD-RSAVGEARLVEDLKNH

A0A6J1DS03 uncharacterized protein LOC111023856; e value: 2.5e-131; % identity: 99.24
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV
        ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRA+VERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGM AKGSAMV
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
        SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH

A0A6J1G9J4 uncharacterized protein LOC111452194; e value: 9.9e-80; % identity: 68.2
Query:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG
        M GL+ MAT++RE  IDLEGG  +SE+D ++E +S SK H RK   RLRSGFLC DGSI+RG SFASSSN+T+L KL VDENVELL+DKS +GEKRRELG
Subjt:  MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELG

Query:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV
        AL EKKKN+KE IK NGK+HKPPRPP  PSLDAADRI V+E+T+LA KKRA+VERIKALKK KAEKTSSFNS++PALFITLLFFV+IIFQGM A GSA++
Subjt:  ALVEKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMV

Query:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN
         ESPVP+   SA LI  ++HSS+S P  VN PQS  L+FA          GEA  VEDLKN
Subjt:  SESPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G02380.1 unknown protein; e value: 9.1e-17; % identity: 29.67
Query:  LNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALV
        +++M + E++L +D+E G S   ++S ++                         ++S    ++  +N     K++ D +  L+ D++      + L   +
Subjt:  LNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALV

Query:  EKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERI-KALKKMKAEKTSSFNSSIP--ALFITLLFFVIIIFQGMGAKGSAMV
         +KK    K KK+ K  KPPRPP GPSL   DR ++R+I ELA +KRA +ER+ K+LK++KA KTS  +  I   ++ IT +FF  ++FQG     S+M 
Subjt:  EKKKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERI-KALKKMKAEKTSSFNSSIP--ALFITLLFFVIIIFQGMGAKGSAMV

Query:  SE-SPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTS
        S+ SP PTV  +  +IS   ++  +P ++ +   +  L +  K  S
Subjt:  SE-SPVPTVGGSASLISSVRHSSQSPPQKVNRPQSRFLNFAEKTTS

AT3G17120.1 unknown protein; e value: 2.9e-15; % identity: 42.86
Query:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMGAKGSAMVSE
        + + KEK KK+    KPPRPP GPSLDAAD+ L+REI ELA  KRA +ER++ALKK +A K +S  SS   + A   T +FF +++FQG+  + +    +
Subjt:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMGAKGSAMVSE

Query:  SPVPTVGGSASLISSVRHS
        S +   G +     SV+++
Subjt:  SPVPTVGGSASLISSVRHS

AT3G17120.2 unknown protein; e value: 2.9e-15; % identity: 42.86
Query:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMGAKGSAMVSE
        + + KEK KK+    KPPRPP GPSLDAAD+ L+REI ELA  KRA +ER++ALKK +A K +S  SS   + A   T +FF +++FQG+  + +    +
Subjt:  KKNLKEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSS---IPALFITLLFFVIIIFQGMGAKGSAMVSE

Query:  SPVPTVGGSASLISSVRHS
        S +   G +     SV+++
Subjt:  SPVPTVGGSASLISSVRHS

AT4G01960.1 unknown protein; e value: 1.3e-18; % identity: 33.33
Query:  KERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNL
        +E +L +D+E G  +  ++      S ++  S      + SG L  DG                 S+ S D+ V+ LM +    E+  +   L + K + 
Subjt:  KERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNL

Query:  KEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMVSE-SPVPTV
         +K KK  K  KPPRPP GP L A D+ L+REITELA +KRA +ER+K L+++KA K+SS  SSI A+ +T++FFV +IFQG     +++ S+ SP P  
Subjt:  KEKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMVSE-SPVPTV

Query:  GGSASLISSVRHSSQSPPQKVN
          +  ++S   ++  +P ++++
Subjt:  GGSASLISSVRHSSQSPPQKVN
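
The % identity reported for each hit can be approximated from the middle line of its alignment, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes identity is counted as identical columns divided by aligned columns; the page does not state which denominator it uses, and the function name `percent_identity` is hypothetical.

```python
from typing import Optional


def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Letters count as identical residues. '+' (conservative substitution)
    and spaces (mismatch or gap) do not. Pass aligned_length explicitly if
    trailing spaces may have been stripped from the midline.
    """
    length = aligned_length if aligned_length is not None else len(midline)
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# Short fragment of the XP_022157030.1 midline (query residues ASKKRASVERIKALKK).
fragment = "ASKKRA+VERIKALKK"
print(round(percent_identity(fragment), 2))  # 15 identical of 16 columns -> 93.75
```

Over the whole XP_022157030.1 alignment, 260 of 262 columns are identical, which under this assumption gives 99.24 and agrees with the value listed for that hit.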


Sequences
CDS sequence
ATGGGTGGTTTAAATCGTATGGCTACAAAAGAGAGAGAGTTGGTGATAGACCTTGAAGGTGGTGGTAGTAGTAGTGAAGAGGATTCGAACAATGAACCTGAATCAATATC
AAAAGTACATTCTAGAAAAACCTTTGGCAGGCTTCGGAGTGGGTTTCTGTGCCCCGATGGATCTATAAGTCGAGGTTCTAGCTTTGCTTCCAGTAGTAATGCCACCAGGC
TCAGTAAGCTTAGTGTTGATGAGAATGTGGAGTTGTTGATGGACAAGAGTTCAGAGGGAGAGAAGAGAAGAGAACTTGGGGCTCTTGTCGAGAAGAAGAAGAACCTGAAA
GAGAAGATTAAAAAGAATGGAAAGATTCACAAGCCACCACGGCCTCCGACGGGTCCATCGCTTGATGCTGCTGACAGGATTTTGGTTAGGGAGATCACAGAGTTGGCATC
GAAAAAGCGTGCCAGTGTTGAGCGAATAAAAGCTTTGAAAAAGATGAAAGCAGAGAAAACATCTTCATTCAACAGCAGCATACCTGCCTTATTTATCACATTGCTTTTCT
TTGTAATTATCATCTTTCAAGGTATGGGTGCTAAAGGTAGTGCTATGGTGTCGGAGTCTCCTGTGCCTACTGTTGGTGGGAGCGCAAGCTTGATATCTAGTGTTCGGCAT
TCATCTCAATCTCCTCCCCAAAAAGTTAACAGACCTCAATCCCGCTTTCTCAATTTTGCAGAAAAGACAACTTCTGATCGTTCTGCCGTAGGAGAAGCGAGATTGGTGGA
AGATTTGAAGAACCAC
mRNA sequence
ATGGGTGGTTTAAATCGTATGGCTACAAAAGAGAGAGAGTTGGTGATAGACCTTGAAGGTGGTGGTAGTAGTAGTGAAGAGGATTCGAACAATGAACCTGAATCAATATC
AAAAGTACATTCTAGAAAAACCTTTGGCAGGCTTCGGAGTGGGTTTCTGTGCCCCGATGGATCTATAAGTCGAGGTTCTAGCTTTGCTTCCAGTAGTAATGCCACCAGGC
TCAGTAAGCTTAGTGTTGATGAGAATGTGGAGTTGTTGATGGACAAGAGTTCAGAGGGAGAGAAGAGAAGAGAACTTGGGGCTCTTGTCGAGAAGAAGAAGAACCTGAAA
GAGAAGATTAAAAAGAATGGAAAGATTCACAAGCCACCACGGCCTCCGACGGGTCCATCGCTTGATGCTGCTGACAGGATTTTGGTTAGGGAGATCACAGAGTTGGCATC
GAAAAAGCGTGCCAGTGTTGAGCGAATAAAAGCTTTGAAAAAGATGAAAGCAGAGAAAACATCTTCATTCAACAGCAGCATACCTGCCTTATTTATCACATTGCTTTTCT
TTGTAATTATCATCTTTCAAGGTATGGGTGCTAAAGGTAGTGCTATGGTGTCGGAGTCTCCTGTGCCTACTGTTGGTGGGAGCGCAAGCTTGATATCTAGTGTTCGGCAT
TCATCTCAATCTCCTCCCCAAAAAGTTAACAGACCTCAATCCCGCTTTCTCAATTTTGCAGAAAAGACAACTTCTGATCGTTCTGCCGTAGGAGAAGCGAGATTGGTGGA
AGATTTGAAGAACCAC
Protein sequence
MGGLNRMATKERELVIDLEGGGSSSEEDSNNEPESISKVHSRKTFGRLRSGFLCPDGSISRGSSFASSSNATRLSKLSVDENVELLMDKSSEGEKRRELGALVEKKKNLK
EKIKKNGKIHKPPRPPTGPSLDAADRILVREITELASKKRASVERIKALKKMKAEKTSSFNSSIPALFITLLFFVIIIFQGMGAKGSAMVSESPVPTVGGSASLISSVRH
SSQSPPQKVNRPQSRFLNFAEKTTSDRSAVGEARLVEDLKNH
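
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acids ('*' = stop)."""
    # Drop whitespace so a line-wrapped sequence can be pasted in directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGGGTGGTTTAAATCGTATGGCTACAAAA"))  # MGGLNRMATK
```

The output matches the first ten residues of the protein sequence; pasting the full CDS into `translate` allows the same comparison over the entire record.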