; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g21960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g21960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr5:15719749..15721950
RNA-Seq ExpressionMoc05g21960
SyntenyMoc05g21960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.4e-17672.96Show/hide
Query:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHG
        P  +  ASVHPYGMPNPSTLFPSLPPYYGMG WVPP NY MISPRVP PPQLPFLERGPQAPQLNPNILSTFN G LKP EPP MPIPTNM MD  VEHG
Subjt:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHG

Query:  GEQEKSHRQRLKP-------------------------------------------------------------------------EGTENEVVRKDTQE
        GE EKSH QRL+P                                                                         E TENEVVR DTQE
Subjt:  GEQEKSHRQRLKP-------------------------------------------------------------------------EGTENEVVRKDTQE

Query:  PSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRDAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSL
        PSPLDTP EGAF SHQALPTGATGSTPLAT+EY T +ATLPGVRDAH IPSNAVNPL C TGR KVGENSTQE  +EEDPEDTQT+REMFQYK+RE KS 
Subjt:  PSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRDAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSL

Query:  KRRAVQTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPRPVDTIELDLSEGEEVETQWNAANLATRTSLMKSCKIMTELGFDLNLGDVPDDWRETARDKEWR
        KRRAVQ  KPTVPMNEPKTRAAKAKAAEAKKKVVAP PVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWR
Subjt:  KRRAVQTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPRPVDTIELDLSEGEEVETQWNAANLATRTSLMKSCKIMTELGFDLNLGDVPDDWRETARDKEWR

Query:  PLIQPIQCEALELVREFYADIHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        PLIQPIQCEALELVREFYA  HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  PLIQPIQCEALELVREFYADIHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]9.9e-10963.31Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP
        +M PHGYVNFQQLPT NIPQ+SEFRA NPQQLPPMIN                        PASVHPYGMPNPSTLFPSL PYYGMG WVPP NY MISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------
        RVPPPPQL FLERGPQAPQLNPNILSTFNMGQLKPLEPP MP PTNMPMD G EHGGEQEKSH  RL+P                               
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------

Query:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR
                                                  E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGS PLATDEYVT +ATLPGVR
Subjt:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR

Query:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK
        DAH IPSN VNPLGCDTGRS+VGENSTQEPT EED ED QT+REMFQYK+REK S K
Subjt:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]5.6e-11265.51Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR
        M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPPMINP                        ASVHPYGMPNPSTLFPS PPYYGMG WVPP NY MISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------
        VPPPPQLPFLERGPQ PQLNPN LSTFNMG LKPLEPP MPIPTNMPMD GVEHGGEQEKSH QRL+P                                
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------

Query:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD
                                                 E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGSTPL TDEYVTP+ATLPGVRD
Subjt:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD

Query:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMF
        AH IPSNAVNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+REMF
Subjt:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMF

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]2.1e-11464.43Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP
        +M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMG W+PPQNY MISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------
        +VPPPPQLPFLERGPQAPQL+PNILST+NMGQLKPLEPP MP PTNMPMD G EHGGEQEK H  RL+P                               
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------

Query:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR
                                                  E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGSTPLATDEYVTP+ATLPGVR
Subjt:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR

Query:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK
        DAH IPSN VNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+R+MFQYK+REKK  K
Subjt:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]1.6e-11465.83Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR
        M PHGYVNFQ LPTLNIPQNSEFRA +PQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMG WVPP NY MISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------
        VPPPPQLPFLERGPQAPQLNPNILSTFNMG LKPLEP  M IPTNMPMDTG EHGGEQEKSH QRL+P                                
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------

Query:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD
                                                 E TENEVVR DTQEPS LDTPTEGAF SHQ LPTGATGSTPLATDEYVTP+ATLPGVRD
Subjt:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD

Query:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLKRRAV
        AH IPSNAVNPLGCDTGRSKVGENSTQEPT EED EDTQT+REMFQYK+REKKS KRRAV
Subjt:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLKRRAV

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195154.1e-17773.18Show/hide
Query:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHG
        P  +  ASVHPYGMPNPSTLFPSLPPYYGMG WVPP NY MISPRVP PPQLPFLERGPQAPQLNPNILSTFN G LKP EPP MPIPTNM MD  VEHG
Subjt:  PPMINPASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHG

Query:  GEQEKSHRQRLKP-------------------------------------------------------------------------EGTENEVVRKDTQE
        GE EKSH QRL+P                                                                         E TENEVVR DTQE
Subjt:  GEQEKSHRQRLKP-------------------------------------------------------------------------EGTENEVVRKDTQE

Query:  PSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRDAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSL
        PSPLDTP EGAF SHQALPTGATGSTPLATDEY T +ATLPGVRDAH IPSNAVNPL C TGR KVGENSTQE  +EEDPEDTQT+REMFQYK+RE KS 
Subjt:  PSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRDAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSL

Query:  KRRAVQTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPRPVDTIELDLSEGEEVETQWNAANLATRTSLMKSCKIMTELGFDLNLGDVPDDWRETARDKEWR
        KRRAVQ  KPTVPMNEPKTRAAKAKAAEAKKKVVAP PVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWR
Subjt:  KRRAVQTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPRPVDTIELDLSEGEEVETQWNAANLATRTSLMKSCKIMTELGFDLNLGDVPDDWRETARDKEWR

Query:  PLIQPIQCEALELVREFYADIHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        PLIQPIQCEALELVREFYA  HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  PLIQPIQCEALELVREFYADIHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201104.8e-10963.31Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP
        +M PHGYVNFQQLPT NIPQ+SEFRA NPQQLPPMIN                        PASVHPYGMPNPSTLFPSL PYYGMG WVPP NY MISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------
        RVPPPPQL FLERGPQAPQLNPNILSTFNMGQLKPLEPP MP PTNMPMD G EHGGEQEKSH  RL+P                               
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------

Query:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR
                                                  E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGS PLATDEYVT +ATLPGVR
Subjt:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR

Query:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK
        DAH IPSN VNPLGCDTGRS+VGENSTQEPT EED ED QT+REMFQYK+REK S K
Subjt:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK

A0A6J1DWG3 uncharacterized protein LOC1110237632.7e-11265.51Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR
        M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPPMINP                        ASVHPYGMPNPSTLFPS PPYYGMG WVPP NY MISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMINP------------------------ASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------
        VPPPPQLPFLERGPQ PQLNPN LSTFNMG LKPLEPP MPIPTNMPMD GVEHGGEQEKSH QRL+P                                
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------

Query:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD
                                                 E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGSTPL TDEYVTP+ATLPGVRD
Subjt:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD

Query:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMF
        AH IPSNAVNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+REMF
Subjt:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMF

A0A6J1DX11 uncharacterized protein LOC1110248609.9e-11564.43Show/hide
Query:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP
        +M PHGYVNFQQLPTLNIPQNSEFRA NPQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMG W+PPQNY MISP
Subjt:  MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISP

Query:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------
        +VPPPPQLPFLERGPQAPQL+PNILST+NMGQLKPLEPP MP PTNMPMD G EHGGEQEK H  RL+P                               
Subjt:  RVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP-------------------------------

Query:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR
                                                  E TENEVVR DTQEPSPLDTPTEGAF SHQ LPTGATGSTPLATDEYVTP+ATLPGVR
Subjt:  ------------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVR

Query:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK
        DAH IPSN VNPLGCDTGRSKVGENSTQEPT EEDPEDTQT+R+MFQYK+REKK  K
Subjt:  DAHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLK

A0A6J1DY94 uncharacterized protein LOC1110253167.6e-11565.83Show/hide
Query:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR
        M PHGYVNFQ LPTLNIPQNSEFRA +PQQLPPMIN                        PASVHPYGMPNPSTLFPSLPPYYGMG WVPP NY MISPR
Subjt:  MGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMIN------------------------PASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPR

Query:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------
        VPPPPQLPFLERGPQAPQLNPNILSTFNMG LKPLEP  M IPTNMPMDTG EHGGEQEKSH QRL+P                                
Subjt:  VPPPPQLPFLERGPQAPQLNPNILSTFNMGQLKPLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKP--------------------------------

Query:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD
                                                 E TENEVVR DTQEPS LDTPTEGAF SHQ LPTGATGSTPLATDEYVTP+ATLPGVRD
Subjt:  -----------------------------------------EGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRD

Query:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLKRRAV
        AH IPSNAVNPLGCDTGRSKVGENSTQEPT EED EDTQT+REMFQYK+REKKS KRRAV
Subjt:  AHAIPSNAVNPLGCDTGRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLKRRAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGCCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGCAAATCCTCAACAACTTCCTCCAATGATCAA
TCCGGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTAAGTGGGTACCTCCACAAAATTATGAGATGA
TTTCACCTAGGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGACAACTAAAA
CCCTTAGAGCCTCCTATGATGCCAATCCCAACCAATATGCCAATGGATACAGGAGTTGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGACAGAGGCTAAAGCCCGAGGG
TACCGAAAACGAAGTGGTAAGAAAAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTTTAGTCACCAAGCGTTGCCTACTGGGGCAACTGGAT
CTACTCCGTTGGCAACGGATGAGTATGTCACACCGATAGCCACTCTACCAGGGGTAAGGGATGCTCACGCTATTCCTTCTAATGCAGTTAACCCGCTTGGGTGTGATACA
GGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAAAGAAGAAGATCCCGAGGACACGCAGACGTTAAGGGAAATGTTCCAATACAAGAAGCGGGAGAAAAA
GAGTTTAAAACGTCGTGCAGTTCAGACTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACAAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGG
CACCTAGGCCAGTTGATACAATCGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGCAATGGAACGCGGCGAATTTAGCCACTCGAACTTCATTAATGAAATCCTGT
AAGATTATGACAGAATTGGGATTCGACCTCAATCTAGGAGATGTGCCTGACGATTGGAGGGAGACCGCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATG
TGAGGCTTTGGAGTTAGTCAGAGAGTTCTACGCTGATATCCATCCCCAGTCACATATAGCCATAGTGCGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACA
CCTTCAACATTGAGAATATCAGAGATGCTGTGGGAAATAAGATGTTAGTGACTCCGACTCTAGAACAGCTCGGTGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACT
TGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCACTAGCTGCTGCAGGATGGTTATATATAGTTAAAAACAGAATTCTGCCAACGGAGCATAA
TGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAAGTATGGAGAATTGATCAATACCAGTATCCATGAGTCTGCCCACCGGA
CACGTGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAGGGCAGGTACTGGGACGTCTCCTACATCGGAGATCCGTCGTCTC
CGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAACTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACATTCATGGCCTCCAGCACT
AGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTAATCCTAGTCCACAACCTCCAACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGGCCCACATGGTTATGTAAATTTTCAGCAGCTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGCAAATCCTCAACAACTTCCTCCAATGATCAA
TCCGGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTAAGTGGGTACCTCCACAAAATTATGAGATGA
TTTCACCTAGGGTTCCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGACAACTAAAA
CCCTTAGAGCCTCCTATGATGCCAATCCCAACCAATATGCCAATGGATACAGGAGTTGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGACAGAGGCTAAAGCCCGAGGG
TACCGAAAACGAAGTGGTAAGAAAAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTTTAGTCACCAAGCGTTGCCTACTGGGGCAACTGGAT
CTACTCCGTTGGCAACGGATGAGTATGTCACACCGATAGCCACTCTACCAGGGGTAAGGGATGCTCACGCTATTCCTTCTAATGCAGTTAACCCGCTTGGGTGTGATACA
GGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAAAGAAGAAGATCCCGAGGACACGCAGACGTTAAGGGAAATGTTCCAATACAAGAAGCGGGAGAAAAA
GAGTTTAAAACGTCGTGCAGTTCAGACTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACAAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGG
CACCTAGGCCAGTTGATACAATCGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGCAATGGAACGCGGCGAATTTAGCCACTCGAACTTCATTAATGAAATCCTGT
AAGATTATGACAGAATTGGGATTCGACCTCAATCTAGGAGATGTGCCTGACGATTGGAGGGAGACCGCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATG
TGAGGCTTTGGAGTTAGTCAGAGAGTTCTACGCTGATATCCATCCCCAGTCACATATAGCCATAGTGCGTGGGAAGGAAATAAGGTTTGATGCCACTCAGATCAACTACA
CCTTCAACATTGAGAATATCAGAGATGCTGTGGGAAATAAGATGTTAGTGACTCCGACTCTAGAACAGCTCGGTGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACT
TGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCACTAGCTGCTGCAGGATGGTTATATATAGTTAAAAACAGAATTCTGCCAACGGAGCATAA
TGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAAGTATGGAGAATTGATCAATACCAGTATCCATGAGTCTGCCCACCGGA
CACGTGATGCCCCAGTTGTGGAAGAGAAGAATATTCGGAGAATTATCGCCCATGCGCTACAAAGAAGGGCAGGTACTGGGACGTCTCCTACATCGGAGATCCGTCGTCTC
CGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAACTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACATTCATGGCCTCCAGCACT
AGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACACTAATCCTAGTCCACAACCTCCAACTTCATGA
Protein sequenceShow/hide protein sequence
MMGPHGYVNFQQLPTLNIPQNSEFRAANPQQLPPMINPASVHPYGMPNPSTLFPSLPPYYGMGKWVPPQNYEMISPRVPPPPQLPFLERGPQAPQLNPNILSTFNMGQLK
PLEPPMMPIPTNMPMDTGVEHGGEQEKSHRQRLKPEGTENEVVRKDTQEPSPLDTPTEGAFFSHQALPTGATGSTPLATDEYVTPIATLPGVRDAHAIPSNAVNPLGCDT
GRSKVGENSTQEPTKEEDPEDTQTLREMFQYKKREKKSLKRRAVQTKKPTVPMNEPKTRAAKAKAAEAKKKVVAPRPVDTIELDLSEGEEVETQWNAANLATRTSLMKSC
KIMTELGFDLNLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYADIHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLGEALECVGKPSAT
WDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHNEHVTQDRALLVYAMLKGIDVKYGELINTSIHESAHRTRDAPVVEEKNIRRIIAHALQRRAGTGTSPTSEIRRL
REENQQLRDQVRELVQHIYNLRASLDFAVLHSWPPALAAILGHPSSSTDTNPSPQPPTS