; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:15926451..15929550
RNA-Seq ExpressionMoc04g21870
SyntenyMoc04g21870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.6e-11059.52Show/hide
Query:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER
        +GRKIGTLVTDKLLLESGLLDYNPLVRPIEA RPNSELAMVCGFTSSVK KSKGRAHALK VQSSDP TPAVDQ+AAQDQAGPSS  PTPVIELDSTGER
Subjt:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER

Query:  SREKRSRSESEALDVSPLREPRRSVDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQGVHCLHPLSSHDQGRAGWKGGIG
        SREKRSRSESEALDVSPLRE R                                                                              
Subjt:  SREKRSRSESEALDVSPLREPRRSVDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQGVHCLHPLSSHDQGRAGWKGGIG

Query:  SEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKE
         E + EL                                   +REDERHKAHLRA HAITKGLEKEKFQLLKEKDD+LQA E KDA IGRL AELKAEKE
Subjt:  SEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKE

Query:  RLSNGTLLEAAFRQHPDFDG--------------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEV
        RL+NG LLEAAFRQHPDFDG                            GDLKKRYAEKWASGPNGT GPASLVDKYVRDLDSDYSDL+EDE PSQ+PTEV
Subjt:  RLSNGTLLEAAFRQHPDFDG--------------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEV

Query:  GTTQDGAPSQQNGSQEVNLL
        GTTQ+G PSQQ+GSQEVNLL
Subjt:  GTTQDGAPSQQNGSQEVNLL

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]9.6e-6051.74Show/hide
Query:  MRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGS
        MRFRM+ SSSGVKDQVSRISA  LDRCLRRAS+FV   G              +  +H          G +     ERE   S  L      +G   +  
Subjt:  MRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGS

Query:  GRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG-----
        G          ++    ++E E+HKAHLRA HAITKGLEKEKFQLLKEKDDL Q  E KDA+IGRLT ELK  KERL++G LLE +FRQHP+FDG     
Subjt:  GRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG-----

Query:  ---------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNG
                                DLKKRY+E WASGPNGTPGP SLVDKYVR+LDSDYSD+ E++APSQ+PT+VGTTQ+ APSQ  G
Subjt:  ---------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]5.1e-6954.13Show/hide
Query:  MGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYA
        MGGT DV+ RFRM+PSSSGVKDQVSRISA  LDRCL+RASKFV   G              V  +H          G +     ERE   S  L      
Subjt:  MGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYA

Query:  QGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPD
        +G   +  G  G       ++    ++E E+HKAHLRA HAITKGLEKEKFQLLKEKDDL Q  EGKD +IGRLTAELK  KERL+NG+LLE +FRQH D
Subjt:  QGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPD

Query:  FDGSTGD--------------------------LKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEV
        FDG   D                          LKK+Y+EKWASGPNGTPGP SLV KYVR+LDSDYSD+ E++APSQ+P E+GTTQ+  PSQQ+GSQEV
Subjt:  FDGSTGD--------------------------LKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEV

Query:  NLL
        NLL
Subjt:  NLL

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia]4.0e-5062.92Show/hide
Query:  EREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG----------------------
        ++E E+HKAHL A HAITK +EKEKFQLLKEKDDL QA E  DA IGRL+ ELK  KERL+NG LLE AF+QHPDFDG                      
Subjt:  EREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG----------------------

Query:  ----STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEVNLL
               D+KK+Y+EKWASGPNGTPGP SLVDKYVR+LDSDYSD+ E +APSQ+P EVGTTQ+  PSQ  GSQEVNLL
Subjt:  ----STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEVNLL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.8e-11155.84Show/hide
Query:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER
        + RKI TLVTDKLLLESGLLDYNPLVR IEA RPNSELAMVCGFT SVK KSKGRAHALKTV  ++P TP V ++ AQ  +GPSS VPTPVIELD +G R
Subjt:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER

Query:  SREKRSRSESEALDVSPLREPRRS-------------------------------VDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRR
        S EKRSR ESEALDVSPL E R                                 VDDPEARM GTS+V+MRF M+PSSSGVKDQVSRISA  LDR LRR
Subjt:  SREKRSRSESEALDVSPLREPRRS-------------------------------VDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRR

Query:  ASKFVVTQG-----------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAH
        ASKFV   G                 +H    + +   GR   +     ERE   +  L      +G   +  G          ++    ++E E+HKAH
Subjt:  ASKFVVTQG-----------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAH

Query:  LRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGSTGD--------------------------LK
        LRA HAITKGLEKEKFQLLKEKDDL Q  E KDA+IGRLT ELK  KERL+NGTLLE +FRQHPDFDG   D                          LK
Subjt:  LRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGSTGD--------------------------LK

Query:  KRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGS
        K+Y+EKWASGPNGTP P SLVDKYVR+LDSDYSD+ E++APSQ+P EVGTTQ+  PSQQ GS
Subjt:  KRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124677.5e-11159.52Show/hide
Query:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER
        +GRKIGTLVTDKLLLESGLLDYNPLVRPIEA RPNSELAMVCGFTSSVK KSKGRAHALK VQSSDP TPAVDQ+AAQDQAGPSS  PTPVIELDSTGER
Subjt:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER

Query:  SREKRSRSESEALDVSPLREPRRSVDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQGVHCLHPLSSHDQGRAGWKGGIG
        SREKRSRSESEALDVSPLRE R                                                                              
Subjt:  SREKRSRSESEALDVSPLREPRRSVDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQGVHCLHPLSSHDQGRAGWKGGIG

Query:  SEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKE
         E + EL                                   +REDERHKAHLRA HAITKGLEKEKFQLLKEKDD+LQA E KDA IGRL AELKAEKE
Subjt:  SEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKE

Query:  RLSNGTLLEAAFRQHPDFDG--------------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEV
        RL+NG LLEAAFRQHPDFDG                            GDLKKRYAEKWASGPNGT GPASLVDKYVRDLDSDYSDL+EDE PSQ+PTEV
Subjt:  RLSNGTLLEAAFRQHPDFDG--------------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEV

Query:  GTTQDGAPSQQNGSQEVNLL
        GTTQ+G PSQQ+GSQEVNLL
Subjt:  GTTQDGAPSQQNGSQEVNLL

A0A6J1D1N9 uncharacterized protein LOC1110161934.6e-6051.74Show/hide
Query:  MRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGS
        MRFRM+ SSSGVKDQVSRISA  LDRCLRRAS+FV   G              +  +H          G +     ERE   S  L      +G   +  
Subjt:  MRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGS

Query:  GRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG-----
        G          ++    ++E E+HKAHLRA HAITKGLEKEKFQLLKEKDDL Q  E KDA+IGRLT ELK  KERL++G LLE +FRQHP+FDG     
Subjt:  GRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG-----

Query:  ---------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNG
                                DLKKRY+E WASGPNGTPGP SLVDKYVR+LDSDYSD+ E++APSQ+PT+VGTTQ+ APSQ  G
Subjt:  ---------------------STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNG

A0A6J1DF31 uncharacterized protein LOC1110199092.4e-6954.13Show/hide
Query:  MGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYA
        MGGT DV+ RFRM+PSSSGVKDQVSRISA  LDRCL+RASKFV   G              V  +H          G +     ERE   S  L      
Subjt:  MGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRRASKFVVTQG--------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYA

Query:  QGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPD
        +G   +  G  G       ++    ++E E+HKAHLRA HAITKGLEKEKFQLLKEKDDL Q  EGKD +IGRLTAELK  KERL+NG+LLE +FRQH D
Subjt:  QGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPD

Query:  FDGSTGD--------------------------LKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEV
        FDG   D                          LKK+Y+EKWASGPNGTPGP SLV KYVR+LDSDYSD+ E++APSQ+P E+GTTQ+  PSQQ+GSQEV
Subjt:  FDGSTGD--------------------------LKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEV

Query:  NLL
        NLL
Subjt:  NLL

A0A6J1DZB3 uncharacterized protein LOC1110256658.9e-11255.84Show/hide
Query:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER
        + RKI TLVTDKLLLESGLLDYNPLVR IEA RPNSELAMVCGFT SVK KSKGRAHALKTV  ++P TP V ++ AQ  +GPSS VPTPVIELD +G R
Subjt:  KGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSKGRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGER

Query:  SREKRSRSESEALDVSPLREPRRS-------------------------------VDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRR
        S EKRSR ESEALDVSPL E R                                 VDDPEARM GTS+V+MRF M+PSSSGVKDQVSRISA  LDR LRR
Subjt:  SREKRSRSESEALDVSPLREPRRS-------------------------------VDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASLDRCLRR

Query:  ASKFVVTQG-----------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAH
        ASKFV   G                 +H    + +   GR   +     ERE   +  L      +G   +  G          ++    ++E E+HKAH
Subjt:  ASKFVVTQG-----------------VHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAH

Query:  LRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGSTGD--------------------------LK
        LRA HAITKGLEKEKFQLLKEKDDL Q  E KDA+IGRLT ELK  KERL+NGTLLE +FRQHPDFDG   D                          LK
Subjt:  LRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGSTGD--------------------------LK

Query:  KRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGS
        K+Y+EKWASGPNGTP P SLVDKYVR+LDSDYSD+ E++APSQ+P EVGTTQ+  PSQQ GS
Subjt:  KRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGS

A0A6J1DZB5 uncharacterized protein LOC1110248981.9e-5062.92Show/hide
Query:  EREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG----------------------
        ++E E+HKAHL A HAITK +EKEKFQLLKEKDDL QA E  DA IGRL+ ELK  KERL+NG LLE AF+QHPDFDG                      
Subjt:  EREDERHKAHLRATHAITKGLEKEKFQLLKEKDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDG----------------------

Query:  ----STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEVNLL
               D+KK+Y+EKWASGPNGTPGP SLVDKYVR+LDSDYSD+ E +APSQ+P EVGTTQ+  PSQ  GSQEVNLL
Subjt:  ----STGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPSQQNGSQEVNLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCAAAACGTAGACAGCCAAAAACCCCGAAGGAACTGATAATCTACCAAAGAACTAGTGAGAATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCT
CCGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGAGTAATGGATCCGACAGTACACACGACCGGCGGTTATGTGTCTTTTTCTCATATTGGACCTGTCGGGTTCC
GAGCAGATCGGACCCTAGTCAGGCCGTTGCGTATCTCGAGGAGATCCCAACCGCTCGTTGATTACACGTGTACGGCGCAGAGGTTTTTCCGATCAGCTATAAATAGTTCC
GAAACTTCAGGGGGTTTTAACATTCCGAATGACATCCTCCTCAGGATTCCAGAGGAAGGGAAAGAGCTGACAATCCCCAGAGGGATGGGTCACTCTTATCTCAAGATGTT
TGAGTACGGTCCTCAAGCTTCCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACTGGCTCTTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCT
TTAGCCATTCTTTTTTGTTGCGAGCTCGGGATGAGGACGAGGCCGAGCTGCTAGTGTTGATCAGCTCCTTGGATGTTTTGAGCCAAGAGGATAGCCAAAAAACCTGGTCG
GTACTATATGTGCACAAGGAAGGGCGCGAGTGGTATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGCAAGTGGTTCTTTGCCTCTGGAGAGTGTATCAATCAAG
CCGATTCCCGAGCTCAATCAAGCCACTTTGGACACCCTCAATTCTACAAGGACAACTTTCCAAGGGCCGGAAGATCGGGACCTTGGTCACCGATAAGCTGTTGCTAGAAT
CAGGGCTACTGGACTACAATCCTTTAGTTCGTCCGATTGAAGCTTTGAGGCCAAACTCTGAGCTCGCCATGGTGTGTGGATTTACGAGCAGCGTGAAACATAAGTCTAAG
GGCCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTGATCCTTCTACCCCTGCTGTGGATCAGAGTGCAGCTCAGGACCAGGCGGGTCCATCTTCTGAAGTTCCTACTCC
AGTGATCGAGTTGGATTCTACTGGAGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAGGCATTGGACGTGTCGCCTCTTCGCGAGCCACGTCGATCTGTGGATG
ATCCTGAAGCTCGGATGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGAAACCGTCAAGCTCCGGGGTGAAGGACCAAGTGTCACGCATCTCGGCCGCGAGCTTG
GATCGCTGCCTCAGGAGAGCGTCCAAGTTTGTAGTGACCCAGGGCGTTCACTGCCTCCATCCACTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCATTGG
CAGCGAAAGAGAGGGCGAACTCTCTGTCTGCCTTGGAGGGTTCCACTACGCTCAAGGGCGAGCTGCTGAAGGCTCGGGGCGAGGTGGATGTACTGAGGGCCGAGGTAGAA
GCCAAGGCCGAACTGCTGAAAGGGAGGATGAAAGGCATAAGGCCCACCTCCGAGCTACCCACGCCATCACTAAAGGGCTGGAGAAGGAGAAGTTCCAACTCCTTAAGGAG
AAGGACGACCTGCTCCAGGCCTTCGAAGGGAAAGACGCTACAATTGGGCGTCTTACTGCCGAGCTAAAGGCGGAGAAGGAGCGCCTCTCCAATGGAACTCTTCTTGAAGC
AGCCTTCAGGCAACACCCAGATTTTGATGGCTCGACCGGCGATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCCAATGGCACTCCAGGTCCTGCCTCTCTAG
TGGACAAGTATGTCAGAGATCTGGACTCTGACTACTCCGACCTGAATGAAGACGAGGCCCCTAGTCAGGATCCTACTGAGGTCGGCACTACCCAGGATGGGGCTCCTTCT
CAGCAGAACGGATCTCAGGAGGTCAACCTTCTGGTTCTCAAGGCGAGCTATCTTCTCACCTCGGGAGCGGCTGAGCTTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCAAAACGTAGACAGCCAAAAACCCCGAAGGAACTGATAATCTACCAAAGAACTAGTGAGAATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCT
CCGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGAGTAATGGATCCGACAGTACACACGACCGGCGGTTATGTGTCTTTTTCTCATATTGGACCTGTCGGGTTCC
GAGCAGATCGGACCCTAGTCAGGCCGTTGCGTATCTCGAGGAGATCCCAACCGCTCGTTGATTACACGTGTACGGCGCAGAGGTTTTTCCGATCAGCTATAAATAGTTCC
GAAACTTCAGGGGGTTTTAACATTCCGAATGACATCCTCCTCAGGATTCCAGAGGAAGGGAAAGAGCTGACAATCCCCAGAGGGATGGGTCACTCTTATCTCAAGATGTT
TGAGTACGGTCCTCAAGCTTCCCCTTCATCCTTTCGCCCAGGAGTTCTTAAACCGAACTGGACTGGCTCTTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCT
TTAGCCATTCTTTTTTGTTGCGAGCTCGGGATGAGGACGAGGCCGAGCTGCTAGTGTTGATCAGCTCCTTGGATGTTTTGAGCCAAGAGGATAGCCAAAAAACCTGGTCG
GTACTATATGTGCACAAGGAAGGGCGCGAGTGGTATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGCAAGTGGTTCTTTGCCTCTGGAGAGTGTATCAATCAAG
CCGATTCCCGAGCTCAATCAAGCCACTTTGGACACCCTCAATTCTACAAGGACAACTTTCCAAGGGCCGGAAGATCGGGACCTTGGTCACCGATAAGCTGTTGCTAGAAT
CAGGGCTACTGGACTACAATCCTTTAGTTCGTCCGATTGAAGCTTTGAGGCCAAACTCTGAGCTCGCCATGGTGTGTGGATTTACGAGCAGCGTGAAACATAAGTCTAAG
GGCCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTGATCCTTCTACCCCTGCTGTGGATCAGAGTGCAGCTCAGGACCAGGCGGGTCCATCTTCTGAAGTTCCTACTCC
AGTGATCGAGTTGGATTCTACTGGAGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCCGAGGCATTGGACGTGTCGCCTCTTCGCGAGCCACGTCGATCTGTGGATG
ATCCTGAAGCTCGGATGGGGGGGACATCCGACGTGAAGATGCGGTTCAGAATGAAACCGTCAAGCTCCGGGGTGAAGGACCAAGTGTCACGCATCTCGGCCGCGAGCTTG
GATCGCTGCCTCAGGAGAGCGTCCAAGTTTGTAGTGACCCAGGGCGTTCACTGCCTCCATCCACTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGGCATTGG
CAGCGAAAGAGAGGGCGAACTCTCTGTCTGCCTTGGAGGGTTCCACTACGCTCAAGGGCGAGCTGCTGAAGGCTCGGGGCGAGGTGGATGTACTGAGGGCCGAGGTAGAA
GCCAAGGCCGAACTGCTGAAAGGGAGGATGAAAGGCATAAGGCCCACCTCCGAGCTACCCACGCCATCACTAAAGGGCTGGAGAAGGAGAAGTTCCAACTCCTTAAGGAG
AAGGACGACCTGCTCCAGGCCTTCGAAGGGAAAGACGCTACAATTGGGCGTCTTACTGCCGAGCTAAAGGCGGAGAAGGAGCGCCTCTCCAATGGAACTCTTCTTGAAGC
AGCCTTCAGGCAACACCCAGATTTTGATGGCTCGACCGGCGATCTGAAGAAGAGGTATGCTGAGAAATGGGCTTCTGGGCCCAATGGCACTCCAGGTCCTGCCTCTCTAG
TGGACAAGTATGTCAGAGATCTGGACTCTGACTACTCCGACCTGAATGAAGACGAGGCCCCTAGTCAGGATCCTACTGAGGTCGGCACTACCCAGGATGGGGCTCCTTCT
CAGCAGAACGGATCTCAGGAGGTCAACCTTCTGGTTCTCAAGGCGAGCTATCTTCTCACCTCGGGAGCGGCTGAGCTTCATTAG
Protein sequenceShow/hide protein sequence
MEAKRRQPKTPKELIIYQRTSENLHNGSSRIELEPGLRSDLNTRVDLHKRVMDPTVHTTGGYVSFSHIGPVGFRADRTLVRPLRISRRSQPLVDYTCTAQRFFRSAINSS
ETSGGFNIPNDILLRIPEEGKELTIPRGMGHSYLKMFEYGPQASPSSFRPGVLKPNWTGSCSSGPQWVGCHFCFSHSFLLRARDEDEAELLVLISSLDVLSQEDSQKTWS
VLYVHKEGREWYSQGADLHQGMGRQVVLCLWRVYQSSRFPSSIKPLWTPSILQGQLSKGRKIGTLVTDKLLLESGLLDYNPLVRPIEALRPNSELAMVCGFTSSVKHKSK
GRAHALKTVQSSDPSTPAVDQSAAQDQAGPSSEVPTPVIELDSTGERSREKRSRSESEALDVSPLREPRRSVDDPEARMGGTSDVKMRFRMKPSSSGVKDQVSRISAASL
DRCLRRASKFVVTQGVHCLHPLSSHDQGRAGWKGGIGSEREGELSVCLGGFHYAQGRAAEGSGRGGCTEGRGRSQGRTAEREDERHKAHLRATHAITKGLEKEKFQLLKE
KDDLLQAFEGKDATIGRLTAELKAEKERLSNGTLLEAAFRQHPDFDGSTGDLKKRYAEKWASGPNGTPGPASLVDKYVRDLDSDYSDLNEDEAPSQDPTEVGTTQDGAPS
QQNGSQEVNLLVLKASYLLTSGAAELH