CuGenDBv2

Moc04g14980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g14980
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome location: chr4:11403463..11405489
RNA-Seq Expression: Moc04g14980
Synteny: Moc04g14980
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
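
The genome location field can be split into chromosome, start, and end for downstream use. The sketch below is a minimal illustration: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:11403463..11405489")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption, the gene spans 2027 bp on chr4.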


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia] (e value: 1.2e-205; % identity: 82.39)
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGEQEKC
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHN GMISPRV  PPQLPFLERG QAPQLNPNILSTFN G LKP EPP MPIPTN+ +D  VEHGGE EK 
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGEQEKC

Query:  HSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT
        H QRLEP VSI QKRKGKEVMTDPEI  D SSR LTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPSPLDT
Subjt:  HSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT

Query:  PTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT-----------------DDKGNVPIQE------AEKKSSKRRAVQ
        P EGAFCSH+ LPTGATGSTPLAT+EY T MATLPGVRDAH IPSNAVNPL C T                 D +    I+E       E KSSKRRAVQ
Subjt:  PTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT-----------------DDKGNVPIQE------AEKKSSKRRAVQ

Query:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFHPTLGDVPDYWRETARDKEWRPLIQPI
          KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGF  TLGDVPD WR+TAR KEWRPLIQPI
Subjt:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFHPTLGDVPDYWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVCGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        QCEALELVREFYAA HPQSHIAIV GKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  QCEALELVREFYAAVHPQSHIAIVCGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia] (e value: 7.0e-129; % identity: 90.35)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP N GMISPRV PPPQL FLERG QAPQLNPNILSTFNMGQLKPLEPPRMP PTN+P+D G EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK HS RLEPGV I QKRKGKEVM DPEIEED SSR LTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRG+CSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGS PLATDEYVT MATLPGVRDAHTIPSN VNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia] (e value: 6.4e-130; % identity: 89.96)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHN GMISPRV PPPQLPFLERG Q PQLNPN LSTFNMG LKPLEPPRMPIPTN+P+DAGVEHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK H QRLEP +SI QKRKGKEVMTDPEI ED SSR L PKDSSME RDEEQFYSSPLIITP DGNDDFLLVSRGNCSNM E EDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGSTPL TDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia] (e value: 7.3e-134; % identity: 91.12)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP N GMISP+V PPPQLPFLERG QAPQL+PNILST+NMGQLKPLEPPRMP PTN+P+DAG EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEKCHS RLEPGVSI QKRKGKEVMTDPEIEED SSR LTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRG+CSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSN VNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia] (e value: 7.3e-134; % identity: 83.17)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHN GMISPRV PPPQLPFLERG QAPQLNPNILSTFNMG LKPLEP RM IPTN+P+D G EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK HSQRLEPGVSI QKRKGKEVM+DPEIEED SSR LTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT------DDKGNVPIQE-----------------AEKKSSKR
         LDTPTEGAFCSH+ELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT      ++    P  E                  EKKSSKR
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT------DDKGNVPIQE-----------------AEKKSSKR

Query:  RAV
        RAV
Subjt:  RAV
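
In the alignment blocks above, the unlabeled middle line marks each column: a letter means the residues are identical, `+` means a similar residue, and a blank means a mismatch or gap. As a rough sketch, the identity of a single displayed block can be estimated from that middle line. The function name `percent_identity` is hypothetical, and the % identity values reported for each hit cover the whole alignment, so a per-block figure will generally differ from them.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    The query line fixes the number of columns; the midline is padded
    because trailing blanks are often lost when text is copied.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    padded = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in padded)
    return 100.0 * identities / columns


# Ten columns excerpted from one block: nine identities and one '+' column.
print(percent_identity("CSHRELPTGA", "CSH+ELPTGA"))
```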

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 (e value: 3.4e-206; % identity: 82.61)
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGEQEKC
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHN GMISPRV  PPQLPFLERG QAPQLNPNILSTFN G LKP EPP MPIPTN+ +D  VEHGGE EK 
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGEQEKC

Query:  HSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT
        H QRLEP VSI QKRKGKEVMTDPEI  D SSR LTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPSPLDT
Subjt:  HSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT

Query:  PTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT-----------------DDKGNVPIQE------AEKKSSKRRAVQ
        P EGAFCSH+ LPTGATGSTPLATDEY T MATLPGVRDAH IPSNAVNPL C T                 D +    I+E       E KSSKRRAVQ
Subjt:  PTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT-----------------DDKGNVPIQE------AEKKSSKRRAVQ

Query:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFHPTLGDVPDYWRETARDKEWRPLIQPI
          KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGF  TLGDVPD WR+TAR KEWRPLIQPI
Subjt:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFHPTLGDVPDYWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVCGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        QCEALELVREFYAA HPQSHIAIV GKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  QCEALELVREFYAAVHPQSHIAIVCGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 (e value: 3.4e-129; % identity: 90.35)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP N GMISPRV PPPQL FLERG QAPQLNPNILSTFNMGQLKPLEPPRMP PTN+P+D G EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK HS RLEPGV I QKRKGKEVM DPEIEED SSR LTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRG+CSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGS PLATDEYVT MATLPGVRDAHTIPSN VNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

A0A6J1DWG3 uncharacterized protein LOC111023763 (e value: 3.1e-130; % identity: 89.96)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHN GMISPRV PPPQLPFLERG Q PQLNPN LSTFNMG LKPLEPPRMPIPTN+P+DAGVEHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK H QRLEP +SI QKRKGKEVMTDPEI ED SSR L PKDSSME RDEEQFYSSPLIITP DGNDDFLLVSRGNCSNM E EDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGSTPL TDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

A0A6J1DX11 uncharacterized protein LOC111024860 (e value: 3.5e-134; % identity: 91.12)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP N GMISP+V PPPQLPFLERG QAPQL+PNILST+NMGQLKPLEPPRMP PTN+P+DAG EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEKCHS RLEPGVSI QKRKGKEVMTDPEIEED SSR LTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRG+CSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT
        PLDTPTEGAFCSH+ELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSN VNPLGCDT
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT

A0A6J1DY94 uncharacterized protein LOC111025316 (e value: 3.5e-134; % identity: 83.17)
Query:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE
        MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHN GMISPRV PPPQLPFLERG QAPQLNPNILSTFNMG LKPLEP RM IPTN+P+D G EHGGE
Subjt:  MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGE

Query:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS
        QEK HSQRLEPGVSI QKRKGKEVM+DPEIEED SSR LTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPS
Subjt:  QEKCHSQRLEPGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPS

Query:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT------DDKGNVPIQE-----------------AEKKSSKR
         LDTPTEGAFCSH+ELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT      ++    P  E                  EKKSSKR
Subjt:  PLDTPTEGAFCSHRELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDT------DDKGNVPIQE-----------------AEKKSSKR

Query:  RAV
        RAV
Subjt:  RAV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCAAATTCCAGCAAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTG
TGGGATGATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGATCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGAC
AACTAAAACCCTTAGAGCCTCCTAGGATGCCAATCCCAACCAATTTGCCAATCGATGCAGGAGTTGAGCATGGAGGAGAGCAAGAGAAGTGCCATAGCCAGAGGCTAGAG
CCCGGGGTTTCGATACGGCAAAAGAGGAAGGGCAAGGAGGTGATGACAGACCCAGAAATTGAGGAAGATGAGAGTAGTAGGCATCTGACGCCTAAGGATTCGTCTATGGA
GAACAGGGATGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGCAATGATGATTTTCTACTAGTTTCCCGAGGCAATTGTTCAAATATGCCGGAAA
CAGAGGATACCGACAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTGTAGTCACCGAGAGTTGCCTACTGGGGCA
ACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATGCAGTTAACCCACTTGGGTG
TGATACAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAA
CGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAA
TGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCCACCCCACTCTAGGAGATGTGCCTGATTATTGGAGGGA
GACCGCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCTCAGTCACATATAGCCA
TAGTGTGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTA
GAACATCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTAC
AGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGTATAGATGTGA
ATTCTGAAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGAGAAGGTGTACAG
CTGCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAATAATATTCGGCGAATTATCGCCCATGCGCTACAAAGAAGGGAAGGTACTGGGATGTCTTCTAC
ATCGGAGATCCGTCGTCTCCGAGAGGAGAACCAACAGTTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGTTTCATTGGATTTTGCAATTTTAC
CTTCATGGTCTTCAGTGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACATTGATCCTAGTCCACAACCTCCAACTTCATGA
mRNA sequence
ATGCAAATTCCAGCAAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTG
TGGGATGATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGATCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTCAACATGGGAC
AACTAAAACCCTTAGAGCCTCCTAGGATGCCAATCCCAACCAATTTGCCAATCGATGCAGGAGTTGAGCATGGAGGAGAGCAAGAGAAGTGCCATAGCCAGAGGCTAGAG
CCCGGGGTTTCGATACGGCAAAAGAGGAAGGGCAAGGAGGTGATGACAGACCCAGAAATTGAGGAAGATGAGAGTAGTAGGCATCTGACGCCTAAGGATTCGTCTATGGA
GAACAGGGATGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGCAATGATGATTTTCTACTAGTTTCCCGAGGCAATTGTTCAAATATGCCGGAAA
CAGAGGATACCGACAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCATTTTGTAGTCACCGAGAGTTGCCTACTGGGGCA
ACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATGCAGTTAACCCACTTGGGTG
TGATACAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAA
CGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAA
TGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCCACCCCACTCTAGGAGATGTGCCTGATTATTGGAGGGA
GACCGCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCTCAGTCACATATAGCCA
TAGTGTGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTA
GAACATCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTAC
AGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGTATAGATGTGA
ATTCTGAAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGAGAAGGTGTACAG
CTGCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAATAATATTCGGCGAATTATCGCCCATGCGCTACAAAGAAGGGAAGGTACTGGGATGTCTTCTAC
ATCGGAGATCCGTCGTCTCCGAGAGGAGAACCAACAGTTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGTTTCATTGGATTTTGCAATTTTAC
CTTCATGGTCTTCAGTGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACCGACATTGATCCTAGTCCACAACCTCCAACTTCATGA
Protein sequence
MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNCGMISPRVSPPPQLPFLERGSQAPQLNPNILSTFNMGQLKPLEPPRMPIPTNLPIDAGVEHGGEQEKCHSQRLE
PGVSIRQKRKGKEVMTDPEIEEDESSRHLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGAFCSHRELPTGA
TGSTPLATDEYVTPMATLPGVRDAHTIPSNAVNPLGCDTDDKGNVPIQEAEKKSSKRRAVQAKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETK
WNAANLATRTSLMKSRKIMTELGFHPTLGDVPDYWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVCGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
EHLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAATGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNSEELINTSIHECAHRTRGKLYHPRLVTSLCLREGVQ
LPEDQIKRDAPIVEENNIRRIIAHALQRREGTGMSSTSEIRRLREENQQLRDQVREVVQHIYNLRVSLDFAILPSWSSVLAAILGHPSSSTDIDPSPQPPTS
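
The CDS and protein sequences listed here can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly. `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) into one-letter amino acids.

    Stop codons become '*'; codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 36 bases of the CDS listed above.
print(translate("ATGCAAATTCCAGCAAGTGTTCATCCTTATGGAATG"))
```

The printed peptide, MQIPASVHPYGM, matches the start of the protein sequence. Passing the full CDS (with its line breaks) should reproduce the listed protein followed by `*` for the terminal TGA stop codon.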