; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19620
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr2:14573100..14575246
RNA-Seq ExpressionMoc02g19620
SyntenyMoc02g19620
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.6e-12358.06Show/hide
Query:  INPASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQ
        +  ASVHPYGMPNPSTLFPSLPPYYGM HWVPPHNYGMISPRVP PPQLPFLERGPQAPQLNPNILSTFN G LKP EPP MPIPTNM MD RVEHGGE 
Subjt:  INPASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQ

Query:  EKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLHREMVMMIFYWFYEA---------------------------------IVR--------
        EKSH QRLEP VSIGQKRKG      P          L      M       FY +                                 +VR        
Subjt:  EKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLHREMVMMIFYWFYEA---------------------------------IVR--------

Query:  --------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGVRDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSK
                 C  + +P    ATGSTPLAT+EY T MATLPGVRD H I SN+VNPL C TGR                             KRRE KSSK
Subjt:  --------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGVRDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSK

Query:  RRAVQTKKPTMPMDEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAVNLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARDKEWRP
        RRAVQ  KPT+PM+EPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWRP
Subjt:  RRAVQTKKPTMPMDEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAVNLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARDKEWRP

Query:  LIHPY----------------------------------NINYTFNIKNIRDAVGNKMLVTPTLE
        LI P                                    INYTFNIKNI+DAVGNKMLVTPTLE
Subjt:  LIHPY----------------------------------NINYTFNIKNIRDAVGNKMLVTPTLE

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.7e-8055.59Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP
        +M PHGYVNFQQLPT NIPQ+SEFRA+NPQQL PMIN                        PASVHPYGMPNPSTLFPSL PYYGM HWVPP NYGMISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLL
        RVPPPPQL FLERGPQAPQLNPNILSTFNMG LKPLEPPRMP PTNMPMD   EHGGEQEKSHS RLEPGV IGQKRKG      P          L   
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLL

Query:  HREMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG
           M       FY +                                 +VR                 C  + +P    ATGS PLATDEY+T MATLPG
Subjt:  HREMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG

Query:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSK
        VRD H I SN+VNPLGCDTGRS+  E  + +
Subjt:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSK

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]1.8e-8555.11Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR
        M PHGYVNFQQLPTLNIPQNSEFRA+NPQQL PMINP                        ASVHPYGMPNPSTLFPS PPYYGM HWVPPHNYGMISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH
        VPPPPQLPFLERGPQ PQLNPN LSTFNMG LKPLEPPRMPIPTNMPMD  VEHGGEQEKSH QRLEP +SIGQKRKG      P          L    
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH

Query:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV
          M       FY +                                 +VR                 C  + +P    ATGSTPL TDEY+TPMATLPGV
Subjt:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV

Query:  RDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK
        RD H I SN+VNPLGCDTGRSK  E          T++PT   D   T+  +
Subjt:  RDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]2.6e-8454.11Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP
        +M PHGYVNFQQLPTLNIPQNSEFRA+NPQQL PMIN                        PASVHPYGMPNPSTLFPSLPPYYGM HW+PP NYGMISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMR-------------------
        +VPPPPQLPFLERGPQAPQL+PNILST+NMG LKPLEPPRMP PTNMPMD   EHGGEQEK HS RLEPGVSIGQKRKG                     
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMR-------------------

Query:  --------------SSFIPLHLLHREMVMMIFYWF----------YEAIVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG
                      SSFI       +  +++                 +VR                 C  + +P    ATGSTPLATDEY+TPMATLPG
Subjt:  --------------SSFIPLHLLHREMVMMIFYWF----------YEAIVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG

Query:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK
        VRD H I SN+VNPLGCDTGRSK  E          T++PT   D   T+  +
Subjt:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]2.5e-8756.08Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR
        M PHGYVNFQ LPTLNIPQNSEFRA++PQQL PMIN                        PASVHPYGMPNPSTLFPSLPPYYGM HWVPPHNYGMISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH
        VPPPPQLPFLERGPQAPQLNPNILSTFNMG LKPLEP RM IPTNMPMDT  EHGGEQEKSHSQRLEPGVSIGQKRKG      P          L    
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH

Query:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV
          M       FY +                                 +VR                 C  + +P    ATGSTPLATDEY+TPMATLPGV
Subjt:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV

Query:  RDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSKRRAV
        RD H I SN+VNPLGCDTGRS                            KRREKKSSKRRAV
Subjt:  RDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSKRRAV

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195154.7e-12458.28Show/hide
Query:  INPASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQ
        +  ASVHPYGMPNPSTLFPSLPPYYGM HWVPPHNYGMISPRVP PPQLPFLERGPQAPQLNPNILSTFN G LKP EPP MPIPTNM MD RVEHGGE 
Subjt:  INPASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQ

Query:  EKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLHREMVMMIFYWFYEA---------------------------------IVR--------
        EKSH QRLEP VSIGQKRKG      P          L      M       FY +                                 +VR        
Subjt:  EKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLHREMVMMIFYWFYEA---------------------------------IVR--------

Query:  --------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGVRDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSK
                 C  + +P    ATGSTPLATDEY T MATLPGVRD H I SN+VNPL C TGR                             KRRE KSSK
Subjt:  --------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGVRDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSK

Query:  RRAVQTKKPTMPMDEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAVNLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARDKEWRP
        RRAVQ  KPT+PM+EPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWRP
Subjt:  RRAVQTKKPTMPMDEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAVNLATRTSLMKSRKIMTELGFDLNLGDVPDDWRETARDKEWRP

Query:  LIHPY----------------------------------NINYTFNIKNIRDAVGNKMLVTPTLE
        LI P                                    INYTFNIKNI+DAVGNKMLVTPTLE
Subjt:  LIHPY----------------------------------NINYTFNIKNIRDAVGNKMLVTPTLE

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201108.3e-8155.59Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP
        +M PHGYVNFQQLPT NIPQ+SEFRA+NPQQL PMIN                        PASVHPYGMPNPSTLFPSL PYYGM HWVPP NYGMISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLL
        RVPPPPQL FLERGPQAPQLNPNILSTFNMG LKPLEPPRMP PTNMPMD   EHGGEQEKSHS RLEPGV IGQKRKG      P          L   
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLL

Query:  HREMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG
           M       FY +                                 +VR                 C  + +P    ATGS PLATDEY+T MATLPG
Subjt:  HREMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG

Query:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSK
        VRD H I SN+VNPLGCDTGRS+  E  + +
Subjt:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSK

A0A6J1DWG3 uncharacterized protein LOC1110237638.6e-8655.11Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR
        M PHGYVNFQQLPTLNIPQNSEFRA+NPQQL PMINP                        ASVHPYGMPNPSTLFPS PPYYGM HWVPPHNYGMISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH
        VPPPPQLPFLERGPQ PQLNPN LSTFNMG LKPLEPPRMPIPTNMPMD  VEHGGEQEKSH QRLEP +SIGQKRKG      P          L    
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH

Query:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV
          M       FY +                                 +VR                 C  + +P    ATGSTPL TDEY+TPMATLPGV
Subjt:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV

Query:  RDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK
        RD H I SN+VNPLGCDTGRSK  E          T++PT   D   T+  +
Subjt:  RDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK

A0A6J1DX11 uncharacterized protein LOC1110248601.2e-8454.11Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP
        +M PHGYVNFQQLPTLNIPQNSEFRA+NPQQL PMIN                        PASVHPYGMPNPSTLFPSLPPYYGM HW+PP NYGMISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMR-------------------
        +VPPPPQLPFLERGPQAPQL+PNILST+NMG LKPLEPPRMP PTNMPMD   EHGGEQEK HS RLEPGVSIGQKRKG                     
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMR-------------------

Query:  --------------SSFIPLHLLHREMVMMIFYWF----------YEAIVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG
                      SSFI       +  +++                 +VR                 C  + +P    ATGSTPLATDEY+TPMATLPG
Subjt:  --------------SSFIPLHLLHREMVMMIFYWF----------YEAIVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPG

Query:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK
        VRD H I SN+VNPLGCDTGRSK  E          T++PT   D   T+  +
Subjt:  VRDVHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAK

A0A6J1DY94 uncharacterized protein LOC1110253161.2e-8756.08Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR
        M PHGYVNFQ LPTLNIPQNSEFRA++PQQL PMIN                        PASVHPYGMPNPSTLFPSLPPYYGM HWVPPHNYGMISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH
        VPPPPQLPFLERGPQAPQLNPNILSTFNMG LKPLEP RM IPTNMPMDT  EHGGEQEKSHSQRLEPGVSIGQKRKG      P          L    
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGPLKPLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIP----------LHLLH

Query:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV
          M       FY +                                 +VR                 C  + +P    ATGSTPLATDEY+TPMATLPGV
Subjt:  REMVMMIFYWFYEA---------------------------------IVR----------------ICRKRRIPRTMWATGSTPLATDEYITPMATLPGV

Query:  RDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSKRRAV
        RD H I SN+VNPLGCDTGRS                            KRREKKSSKRRAV
Subjt:  RDVHAILSNSVNPLGCDTGRS----------------------------KRREKKSSKRRAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGTCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTAAAAATCCTCAACAACTTTCTCCAATGATCAA
TCCGGCTAGTGTTCATCCTTATGGAATGCCCAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGAGTCATTGGGTACCTCCACACAATTATGGGATGA
TTTCACCTAGGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGACCACTAAAA
CCCTTAGAGCCTCCTAGGATGCCAATCCCAACCAATATGCCAATGGATACAAGAGTTGAGCATGGAGGAGAGCAAGAGAAGAGTCATAGCCAGAGGCTAGAGCCCGGGGT
TTCGATAGGGCAAAAGAGGAAGGGGATGAGGAGCAGTTTTATTCCTCTCCATTTATTACACCGGGAGATGGTAATGATGATTTTCTACTGGTTTTACGAGGCAATTGTTC
GGATATGCCGGAAACGGAGGATACCGAGAACGATGTGGGCAACTGGATCTACTCCGTTGGCAACAGATGAGTATATCACACCGATGGCCACTCTACCAGGGGTAAGGGAT
GTTCACGCTATTCTTTCTAATTCAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGAGGCGGGAGAAGAAGAGTTCAAAACGTCGTGCAGTTCAGACTAAGAAGCC
GACAATGCCTATGGATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTAGACTTGT
CCGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGTGAATTTAGCCACTCGTACTTCATTAATGAAATCCCGTAAGATTATGACCGAATTAGGATTCGACCTCAATCTA
GGCGATGTGCCTGACGATTGGAGGGAGACCGCTAGAGATAAAGAATGGAGACCACTCATTCACCCATACAATATCAACTACACCTTCAACATTAAGAATATCAGAGATGC
TGTGGGAAATAAGATGTTAGTGACTCCGACTCTAGAACAGCTCGGATGGTTATATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATA
GGGCACTGTTGGTTTATGCCATGCTTAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCAC
CCACGCTTGGTCACTTCTTTATGCTTGCGACAAGGTGTGCAGCTCCCTGCGGATCAAATTAAGAGAGATGCGCCAGTTTTGGAAGAGAAGAATATTCGGAGAATTATCGC
CCATGCGCTACAAAGAAGGGAAGGTACTGGGACGTCTCCGAGAGGAGAACCAACAGCTGCGAGATCAGATTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCA
TTGGATTTTGCGGTTTTACCTTCATGGTCTCTAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGGTCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTAAAAATCCTCAACAACTTTCTCCAATGATCAA
TCCGGCTAGTGTTCATCCTTATGGAATGCCCAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGAGTCATTGGGTACCTCCACACAATTATGGGATGA
TTTCACCTAGGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGACCACTAAAA
CCCTTAGAGCCTCCTAGGATGCCAATCCCAACCAATATGCCAATGGATACAAGAGTTGAGCATGGAGGAGAGCAAGAGAAGAGTCATAGCCAGAGGCTAGAGCCCGGGGT
TTCGATAGGGCAAAAGAGGAAGGGGATGAGGAGCAGTTTTATTCCTCTCCATTTATTACACCGGGAGATGGTAATGATGATTTTCTACTGGTTTTACGAGGCAATTGTTC
GGATATGCCGGAAACGGAGGATACCGAGAACGATGTGGGCAACTGGATCTACTCCGTTGGCAACAGATGAGTATATCACACCGATGGCCACTCTACCAGGGGTAAGGGAT
GTTCACGCTATTCTTTCTAATTCAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGAGGCGGGAGAAGAAGAGTTCAAAACGTCGTGCAGTTCAGACTAAGAAGCC
GACAATGCCTATGGATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTAGACTTGT
CCGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGTGAATTTAGCCACTCGTACTTCATTAATGAAATCCCGTAAGATTATGACCGAATTAGGATTCGACCTCAATCTA
GGCGATGTGCCTGACGATTGGAGGGAGACCGCTAGAGATAAAGAATGGAGACCACTCATTCACCCATACAATATCAACTACACCTTCAACATTAAGAATATCAGAGATGC
TGTGGGAAATAAGATGTTAGTGACTCCGACTCTAGAACAGCTCGGATGGTTATATATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATA
GGGCACTGTTGGTTTATGCCATGCTTAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCAC
CCACGCTTGGTCACTTCTTTATGCTTGCGACAAGGTGTGCAGCTCCCTGCGGATCAAATTAAGAGAGATGCGCCAGTTTTGGAAGAGAAGAATATTCGGAGAATTATCGC
CCATGCGCTACAAAGAAGGGAAGGTACTGGGACGTCTCCGAGAGGAGAACCAACAGCTGCGAGATCAGATTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCA
TTGGATTTTGCGGTTTTACCTTCATGGTCTCTAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGGCACTGA
Protein sequenceShow/hide protein sequence
MMGPHGYVNFQQLPTLNIPQNSEFRAKNPQQLSPMINPASVHPYGMPNPSTLFPSLPPYYGMSHWVPPHNYGMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGPLK
PLEPPRMPIPTNMPMDTRVEHGGEQEKSHSQRLEPGVSIGQKRKGMRSSFIPLHLLHREMVMMIFYWFYEAIVRICRKRRIPRTMWATGSTPLATDEYITPMATLPGVRD
VHAILSNSVNPLGCDTGRSKRREKKSSKRRAVQTKKPTMPMDEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAVNLATRTSLMKSRKIMTELGFDLNL
GDVPDDWRETARDKEWRPLIHPYNINYTFNIKNIRDAVGNKMLVTPTLEQLGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
PRLVTSLCLRQGVQLPADQIKRDAPVLEEKNIRRIIAHALQRREGTGTSPRGEPTAARSDSRSRATYLQLEGFIGFCGFTFMVSSASCYPWSSIFQYRH