; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr6:15863057..15864820
RNA-Seq ExpressionMoc06g20280
SyntenyMoc06g20280
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.3e-17788.27Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        M MD   EHGGE EKSH  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLT KDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTP EGAFCSHQ LPT ATGSTPLAT+EY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDW
        QYKR+E KSSKRRAVQ  KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDW
Subjt:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDW

Query:  RETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL
        R+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTL
Subjt:  RETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]1.2e-10393.84Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLT KDS+MENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTPTEGAFCSHQELPT ATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSK
        QYKR+E+ SSK
Subjt:  QYKRQERKSSK

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]1.4e-10494.31Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLT KDSTMENRDEEQFYSS  IITPEDGNDDFLLVSRGHCSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTPTEGAFCSHQELPT ATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED EDTQTIR+MF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSK
        QYKR+E+K  K
Subjt:  QYKRQERKSSK

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia]9.8e-10694.42Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLT KDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPS LDTPTEGAFCSHQELPT ATGSTPLATDEYVTPMATLPGVRDAHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAV
        QYKR+E+KSSKRRAV
Subjt:  QYKRQERKSSKRRAV

XP_022158884.1 uncharacterized protein LOC111025345 [Momordica charantia]1.3e-9482.17Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD G EHGGEQEKSHS  LEPGVSIGQKRK  EVM D E EEDGSSRRLT KDS+MENRD+EQFYSS LIITP DGNDDFLLVSR +CSNM E EDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVR DTQEPSPLDTPT+GAF SHQ LPT ATGSTPLATDEY+TPMATLPGVRDAH IPSN VN LGCDTGR KVGENSTQEPT ++D EDTQT+REMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTR
        QYKR+E+KSSK RAVQ KKPTVP+N   TR
Subjt:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTR

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195153.6e-17888.53Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        M MD   EHGGE EKSH  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLT KDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTP EGAFCSHQ LPT ATGSTPLATDEY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDW
        QYKR+E KSSKRRAVQ  KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDW
Subjt:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDW

Query:  RETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL
        R+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTL
Subjt:  RETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201105.8e-10493.84Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLT KDS+MENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTPTEGAFCSHQELPT ATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSK
        QYKR+E+ SSK
Subjt:  QYKRQERKSSK

A0A6J1DX11 uncharacterized protein LOC1110248606.9e-10594.31Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLT KDSTMENRDEEQFYSS  IITPEDGNDDFLLVSRGHCSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPSPLDTPTEGAFCSHQELPT ATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED EDTQTIR+MF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSK
        QYKR+E+K  K
Subjt:  QYKRQERKSSK

A0A6J1DY94 uncharacterized protein LOC1110253164.8e-10694.42Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLT KDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVRTDTQEPS LDTPTEGAFCSHQELPT ATGSTPLATDEYVTPMATLPGVRDAHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAV
        QYKR+E+KSSKRRAV
Subjt:  QYKRQERKSSKRRAV

A0A6J1E0Q9 uncharacterized protein LOC1110253456.5e-9582.17Show/hide
Query:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE
        MPMD G EHGGEQEKSHS  LEPGVSIGQKRK  EVM D E EEDGSSRRLT KDS+MENRD+EQFYSS LIITP DGNDDFLLVSR +CSNM E EDTE
Subjt:  MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTE

Query:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
        NEVVR DTQEPSPLDTPT+GAF SHQ LPT ATGSTPLATDEY+TPMATLPGVRDAH IPSN VN LGCDTGR KVGENSTQEPT ++D EDTQT+REMF
Subjt:  NEVVRTDTQEPSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

Query:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTR
        QYKR+E+KSSK RAVQ KKPTVP+N   TR
Subjt:  QYKRQERKSSKRRAVQAKKPTVPVNEPKTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGAT
GATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCATAAGGATTCAACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTA
CACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCGGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAA
CCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTAGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGAT
GGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAAC
CTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCAGGAGAGAAAAAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCA
ACAGTGCCCGTGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTAGACTTGTC
TGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAG
GAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCT
GTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAA
TAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACACATGGCAAGGTACGACTAAAAC
CCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTAT
GCCCTGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTC
TTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAA
GGGAAGGTATTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGG
GCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTC
ATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGAT
GATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCATAAGGATTCAACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTA
CACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCGGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAA
CCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTAGGGCAACTGGATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGAT
GGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAAC
CTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCAGGAGAGAAAAAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCA
ACAGTGCCCGTGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTAGACTTGTC
TGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAG
GAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCT
GTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAA
TAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACACATGGCAAGGTACGACTAAAAC
CCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTAT
GCCCTGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTC
TTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAA
GGGAAGGTATTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGG
GCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGCACTGACACTGATCCTAGTCCACAACCTCCGACTTC
ATGA
Protein sequenceShow/hide protein sequence
MPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTHKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQE
PSPLDTPTEGAFCSHQELPTRATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRQERKSSKRRAVQAKKP
TVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAA
VHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTLAQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVY
ALLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGIGMSPTSEIRRLREENQQLRDQVREVVQHIYNLR
ASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS