; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr4:9717029..9719228
RNA-Seq ExpressionMoc04g12640
SyntenyMoc04g12640
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]5.2e-17487.03Show/hide
Query:  VEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRK
        VEH GE +KSH QRLEP VSIGQKRKGKEV+TD EI  DGSSRRLTPK SSMENRDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNMPE EDTENEVVR 
Subjt:  VEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRK

Query:  DIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRRE
        D QEPSPLDTP EGAF SHQALP GATGSTPLAT+EY T MATLPGVRD+H IPSNAVNPL C TGR KVGENSTQE  REEDPEDTQT++EMFQYKRRE
Subjt:  DIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRRE

Query:  KKSSKRRAVHTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVNTIELDLSKGDEVETQWNTANLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARD
         KSSKRRAV   KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPV+ IELDLS+G++VET WN ANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR 
Subjt:  KKSSKRRAVHTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVNTIELDLSKGDEVETQWNTANLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARD

Query:  KEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  KEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]2.8e-9559.42Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINP---------------------------------------------------------------
        M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPP+INP                                                               
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINP---------------------------------------------------------------

Query:  --------------------------------------------------GVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG
                                                          GVEH GEQ+KSH QRLEPE+SIGQKRKGKEV+TD EI EDGSSRRL PK 
Subjt:  --------------------------------------------------GVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG

Query:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD
        SSME RDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNM E EDTENEVVR D QEPSPLDTPTEGAF SHQ LP GATGSTPL TDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD

Query:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMF
        +H IPSNAVNPLGCDTGRSKVGENSTQEPT EEDPEDTQT++EMF
Subjt:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMF

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]1.2e-9658.54Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV------------------------------------------------------------
        +M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPP+INPG+                                                            
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV------------------------------------------------------------

Query:  -----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPK
                                                             EH GEQ+K H  RLEP VSIGQKRKGKEV+TD EIEEDGSSRRLTPK
Subjt:  -----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPK

Query:  GSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVR
         S+MENRDEEQFYSS  IIT  DGNDDFLLVSRG+CSNMPE EDTENEVVR D QEPSPLDTPTEGAF SHQ LP GATGSTPLATDEYVTPMATLPGVR
Subjt:  GSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVR

Query:  DSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSK
        D+H IPSN VNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+++MFQYKRREKK  K
Subjt:  DSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSK

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]4.5e-10160.83Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV-------------------------------------------------------------
        M PHGYVNFQ LPTLNIPQNSEFRA +PQQLPP+INPG+                                                             
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV-------------------------------------------------------------

Query:  ----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG
                                                            EH GEQ+KSH QRLEP VSIGQKRKGKEV++D EIEEDGSSRRLTPK 
Subjt:  ----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG

Query:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNMPE EDTENEVVR D QEPS LDTPTEGAF SHQ LP GATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD

Query:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAV
        +H IPSNAVNPLGCDTGRSKVGENSTQEPT EED EDTQT++EMFQYKRREKKSSKRRAV
Subjt:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAV

XP_022158884.1 uncharacterized protein LOC111025345 [Momordica charantia]5.5e-9979.22Show/hide
Query:  NFQQLPTLNIPQNSEFRAANPQQLPPIINPGVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITS
        N  QL  L  P     R   P  +P  ++ GVEH GEQ+KSH Q LEP VSIGQKRK  EV+TD E EEDGSSRRLTPK SSMENRD+EQFYSS LIIT 
Subjt:  NFQQLPTLNIPQNSEFRAANPQQLPPIINPGVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITS

Query:  GDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSK
        GDGNDDFLLVSR NCSNM EMEDTENEVVRKD QEPSPLDTPT+GAFFSHQALP GATGSTPLATDEY+TPMATLPGVRD+HAIPSNAVN LGCDTGR K
Subjt:  GDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSK

Query:  VGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAVHTKKPTVPMNEPKTR
        VGENSTQEPTR++DPEDTQTL+EMFQYKRREKKSSK RAV TKKPTVPMN   TR
Subjt:  VGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAVHTKKPTVPMNEPKTR

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195151.5e-17487.3Show/hide
Query:  VEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRK
        VEH GE +KSH QRLEP VSIGQKRKGKEV+TD EI  DGSSRRLTPK SSMENRDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNMPE EDTENEVVR 
Subjt:  VEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRK

Query:  DIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRRE
        D QEPSPLDTP EGAF SHQALP GATGSTPLATDEY T MATLPGVRD+H IPSNAVNPL C TGR KVGENSTQE  REEDPEDTQT++EMFQYKRRE
Subjt:  DIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRRE

Query:  KKSSKRRAVHTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVNTIELDLSKGDEVETQWNTANLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARD
         KSSKRRAV   KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPV+ IELDLS+G++VET WN ANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR 
Subjt:  KKSSKRRAVHTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVNTIELDLSKGDEVETQWNTANLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARD

Query:  KEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  KEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

A0A6J1DWG3 uncharacterized protein LOC1110237631.4e-9559.42Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINP---------------------------------------------------------------
        M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPP+INP                                                               
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINP---------------------------------------------------------------

Query:  --------------------------------------------------GVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG
                                                          GVEH GEQ+KSH QRLEPE+SIGQKRKGKEV+TD EI EDGSSRRL PK 
Subjt:  --------------------------------------------------GVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG

Query:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD
        SSME RDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNM E EDTENEVVR D QEPSPLDTPTEGAF SHQ LP GATGSTPL TDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD

Query:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMF
        +H IPSNAVNPLGCDTGRSKVGENSTQEPT EEDPEDTQT++EMF
Subjt:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMF

A0A6J1DX11 uncharacterized protein LOC1110248605.6e-9758.54Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV------------------------------------------------------------
        +M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPP+INPG+                                                            
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV------------------------------------------------------------

Query:  -----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPK
                                                             EH GEQ+K H  RLEP VSIGQKRKGKEV+TD EIEEDGSSRRLTPK
Subjt:  -----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPK

Query:  GSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVR
         S+MENRDEEQFYSS  IIT  DGNDDFLLVSRG+CSNMPE EDTENEVVR D QEPSPLDTPTEGAF SHQ LP GATGSTPLATDEYVTPMATLPGVR
Subjt:  GSSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVR

Query:  DSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSK
        D+H IPSN VNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+++MFQYKRREKK  K
Subjt:  DSHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSK

A0A6J1DY94 uncharacterized protein LOC1110253162.2e-10160.83Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV-------------------------------------------------------------
        M PHGYVNFQ LPTLNIPQNSEFRA +PQQLPP+INPG+                                                             
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGV-------------------------------------------------------------

Query:  ----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG
                                                            EH GEQ+KSH QRLEP VSIGQKRKGKEV++D EIEEDGSSRRLTPK 
Subjt:  ----------------------------------------------------EHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKG

Query:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIIT GDGNDDFLLVSRGNCSNMPE EDTENEVVR D QEPS LDTPTEGAF SHQ LP GATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITSGDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRD

Query:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAV
        +H IPSNAVNPLGCDTGRSKVGENSTQEPT EED EDTQT++EMFQYKRREKKSSKRRAV
Subjt:  SHAIPSNAVNPLGCDTGRSKVGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAV

A0A6J1E0Q9 uncharacterized protein LOC1110253452.7e-9979.22Show/hide
Query:  NFQQLPTLNIPQNSEFRAANPQQLPPIINPGVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITS
        N  QL  L  P     R   P  +P  ++ GVEH GEQ+KSH Q LEP VSIGQKRK  EV+TD E EEDGSSRRLTPK SSMENRD+EQFYSS LIIT 
Subjt:  NFQQLPTLNIPQNSEFRAANPQQLPPIINPGVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITS

Query:  GDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSK
        GDGNDDFLLVSR NCSNM EMEDTENEVVRKD QEPSPLDTPT+GAFFSHQALP GATGSTPLATDEY+TPMATLPGVRD+HAIPSNAVN LGCDTGR K
Subjt:  GDGNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSK

Query:  VGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAVHTKKPTVPMNEPKTR
        VGENSTQEPTR++DPEDTQTL+EMFQYKRREKKSSK RAV TKKPTVPMN   TR
Subjt:  VGENSTQEPTREEDPEDTQTLKEMFQYKRREKKSSKRRAVHTKKPTVPMNEPKTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGCCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGCAAATCCTCAACAACTTCCTCCAATCATAAA
TCCGGGAGTTGAGCATGAAGGAGAGCAAAAGAAGAGCCATAAACAAAGGCTAGAGCCCGAGGTCTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGCTGACGGATCTAG
AAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGGTTCATCTATGGAGAACAGGGATGAGGAGCAGTTTTATTCATCTCCGTTGATTATTACATCTGGAGAT
GGTAATGATGATTTTCTACTGGTTTCACGAGGCAATTGTTCAAATATGCCAGAAATGGAGGATACCGAAAACGAAGTGGTAAGAAAAGATATTCAGGAACCTTCCCCATT
GGATACACCTACAGAAGGAGCATTTTTTAGTCACCAAGCGTTGCCTATTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTAC
CAGGGGTAAGGGATTCTCACGCTATTCCTTCTAATGCAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGAGAA
GAAGATCCTGAGGACACGCAGACGTTAAAGGAAATGTTCCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCATACTAAGAAGCCGACAGTGCCCAT
GAATGAACCTAAAACAAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTAATACAATCGAACTAGACTTGTCTAAGGGAGATG
AGGTCGAGACGCAATGGAACACGGCGAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGACCTCAATCTAGGAGATGTGCCT
GACGATTGGAGGGAGACCGCCAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTACGCTGCTGTCCATCCCCA
GTCACATATAGCCATTGTGCGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATTAGAGATGCTGTGGGAAATAAGATGTTAG
TGACTCCGACTCTAGAACAGCTCGGTGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTTATGGCAAGGTACGACTAAAACCCGAGGATGTT
TCACTAGCTGTTGCAGGATGGTTATATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGTCATGCTAAA
GGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGCTTGGTCACTTCTTTATGCTTGC
GACAAGGTGTGCAGCTCCCTACGGATCAAATTAAGAGAGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAGGGCAGGTACT
GGGTTGTCTCCTACATCGGAGATCCGTCGTCTCCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGA
TTTTGCAGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTGATCCTTGTCCACAACCTCCAACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGGCCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGCAAATCCTCAACAACTTCCTCCAATCATAAA
TCCGGGAGTTGAGCATGAAGGAGAGCAAAAGAAGAGCCATAAACAAAGGCTAGAGCCCGAGGTCTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGCTGACGGATCTAG
AAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGGTTCATCTATGGAGAACAGGGATGAGGAGCAGTTTTATTCATCTCCGTTGATTATTACATCTGGAGAT
GGTAATGATGATTTTCTACTGGTTTCACGAGGCAATTGTTCAAATATGCCAGAAATGGAGGATACCGAAAACGAAGTGGTAAGAAAAGATATTCAGGAACCTTCCCCATT
GGATACACCTACAGAAGGAGCATTTTTTAGTCACCAAGCGTTGCCTATTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTAC
CAGGGGTAAGGGATTCTCACGCTATTCCTTCTAATGCAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGAGAA
GAAGATCCTGAGGACACGCAGACGTTAAAGGAAATGTTCCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCATACTAAGAAGCCGACAGTGCCCAT
GAATGAACCTAAAACAAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTAATACAATCGAACTAGACTTGTCTAAGGGAGATG
AGGTCGAGACGCAATGGAACACGGCGAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGACCTCAATCTAGGAGATGTGCCT
GACGATTGGAGGGAGACCGCCAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTACGCTGCTGTCCATCCCCA
GTCACATATAGCCATTGTGCGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATTAGAGATGCTGTGGGAAATAAGATGTTAG
TGACTCCGACTCTAGAACAGCTCGGTGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTTATGGCAAGGTACGACTAAAACCCGAGGATGTT
TCACTAGCTGTTGCAGGATGGTTATATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGTCATGCTAAA
GGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGCTTGGTCACTTCTTTATGCTTGC
GACAAGGTGTGCAGCTCCCTACGGATCAAATTAAGAGAGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAGGGCAGGTACT
GGGTTGTCTCCTACATCGGAGATCCGTCGTCTCCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGA
TTTTGCAGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTGATCCTTGTCCACAACCTCCAACTTCATGA
Protein sequenceShow/hide protein sequence
MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPIINPGVEHEGEQKKSHKQRLEPEVSIGQKRKGKEVLTDLEIEEDGSSRRLTPKGSSMENRDEEQFYSSPLIITSGD
GNDDFLLVSRGNCSNMPEMEDTENEVVRKDIQEPSPLDTPTEGAFFSHQALPIGATGSTPLATDEYVTPMATLPGVRDSHAIPSNAVNPLGCDTGRSKVGENSTQEPTRE
EDPEDTQTLKEMFQYKRREKKSSKRRAVHTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVNTIELDLSKGDEVETQWNTANLATRTSLMKSRKIMTELGFDLNLGDVP
DDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLGEALECVGKPSATWDLTTYGKVRLKPEDV
SLAVAGWLYIVKNRILPTEHDEHVTQDRALLVYVMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPTDQIKRDAPVVEEKNIRRIIAHALQRRAGT
GLSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSSSTDTDPCPQPPTS