CuGenDBv2

Moc07g04500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g04500
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111020110
Genome location: chr7:3881503..3890093
RNA-Seq Expression: Moc07g04500
Synteny: Moc07g04500
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
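
The genome location field follows a `seqid:start..end` pattern. The sketch below shows one way to parse it and derive the length of the locus. The names `Region` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Region(seqid, int(start), int(end))


region = parse_location("chr7:3881503..3890093")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus for Moc07g04500 spans 8591 bp on chr7.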


Homology
GenBank top hits (e value, %identity, alignment)
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]; e value: 2.0e-223; %identity: 88.02
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNI STFN G LKP EPP MP PTNM MD   EHGGE EKS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKS

Query:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDT
        H  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+PETEDTENEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDT

Query:  PTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAVQ
        P EGAFCSHQ LPTGATGSTPLAT+EY T   TLPGVRD H IPSN VN L C TGR KVGENSTQE   EED EDTQTIREMFQYKRRE KSSKRRAVQ
Subjt:  PTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAVQ

Query:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKTMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI
          KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  K MTELGFDLTLGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKTMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]; e value: 2.1e-23; %identity: 93.65
Query:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL
        EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP+
Subjt:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]; e value: 3.2e-189; %identity: 93.33
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNI STFNMG LKPLEP RM  PTNMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+PETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTP  TLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD

Query:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAV
         HTIPSN VN LGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRE+KSSKRRAV
Subjt:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAV

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]; e value: 4.0e-224; %identity: 94.47
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPE
        PTPTNMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDF+LVSRGHCSN+PE
Subjt:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT   TLPGVRD HTIPSNTVN LGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRRERKSSK
        IREMFQYKRRE+ SSK
Subjt:  IREMFQYKRRERKSSK

XP_022156936.1 uncharacterized protein LOC111023763 [Momordica charantia]; e value: 5.2e-176; %identity: 90.43
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINPGMYQPFMFN VPSYHFPLSQMQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQ PQLNPN  STFNMG LKPLEPPRMP PTNMPMDAG EHGGEQEKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRRL PKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD
        SSME RDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+ E EDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPL TDEYVTP  TLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD

Query:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
         HTIPSN VN LGCDTGRSKVGENSTQEPTSEED EDTQTIREMF
Subjt:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]; e value: 1.6e-228; %identity: 93.84
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDF+LVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGH

Query:  CSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEED
        CSN+PETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTP  TLPGVRD HTIPSNTVN LGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRRERKSSK
         EDTQTIR+MFQYKRRE+K  K
Subjt:  LEDTQTIREMFQYKRRERKSSK

TrEMBL top hits (e value, %identity, alignment)
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515; e value: 5.6e-224; %identity: 88.24
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNI STFN G LKP EPP MP PTNM MD   EHGGE EKS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKS

Query:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDT
        H  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+PETEDTENEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDT

Query:  PTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAVQ
        P EGAFCSHQ LPTGATGSTPLATDEY T   TLPGVRD H IPSN VN L C TGR KVGENSTQE   EED EDTQTIREMFQYKRRE KSSKRRAVQ
Subjt:  PTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAVQ

Query:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKTMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI
          KPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  K MTELGFDLTLGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKTMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515; e value: 1.0e-23; %identity: 93.65
Query:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL
        EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP+
Subjt:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515; e value: 1.5e-189; %identity: 93.33
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNI STFNMG LKPLEP RM  PTNMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+PETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTP  TLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD

Query:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAV
         HTIPSN VN LGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRE+KSSKRRAV
Subjt:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAV

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110; e value: 1.9e-224; %identity: 94.47
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPE
        PTPTNMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDF+LVSRGHCSN+PE
Subjt:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT   TLPGVRD HTIPSNTVN LGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRRERKSSK
        IREMFQYKRRE+ SSK
Subjt:  IREMFQYKRRERKSSK

A0A6J1DWG3 uncharacterized protein LOC111023763; e value: 2.5e-176; %identity: 90.43
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINPGMYQPFMFN VPSYHFPLSQMQI ASVHPYGMPNPSTLFPS PPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQ PQLNPN  STFNMG LKPLEPPRMP PTNMPMDAG EHGGEQEKSH  RLEP +SIGQKRKGKEVM DPEI EDGSSRRL PKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD
        SSME RDEEQFYSSPLIITP DGNDDF+LVSRG+CSN+ E EDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPL TDEYVTP  TLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRD

Query:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF
         HTIPSN VN LGCDTGRSKVGENSTQEPTSEED EDTQTIREMF
Subjt:  THTIPSNTVNLLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMF

A0A6J1DX11 uncharacterized protein LOC111024860; e value: 7.5e-229; %identity: 93.84
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDF+LVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFVLVSRGH

Query:  CSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEED
        CSN+PETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTP  TLPGVRD HTIPSNTVN LGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRRERKSSK
         EDTQTIR+MFQYKRRE+K  K
Subjt:  LEDTQTIREMFQYKRRERKSSK

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found
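
The %identity figures reported with each hit can be sanity-checked against the alignment text. The sketch below counts the columns where the middle line of a BLAST-style alignment repeats the query residue and divides by the alignment length. The function name `percent_identity` is a hypothetical helper, and the convention that gap columns count toward the alignment length is an assumption about how the page computes the value. The input is the 63-residue alignment against XP_022151603.1 listed above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline letter matches the query.

    Assumption: '+' (conservative substitution) and ' ' (mismatch or gap)
    in the midline are not identities, and every column, including gap
    columns, counts toward the alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return round(100.0 * identical / len(query), 2)


# Query and midline copied from the 63-residue alignment in this record.
query = "EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL"
midline = "EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP+"

print(percent_identity(query, midline))
```

For this alignment, 59 of the 63 columns are identical, which agrees with the 93.65 shown for that hit. Longer alignments that are wrapped over several lines would need their segments concatenated before the call.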

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
ACAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTC
AACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGA
TGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTTTGCACTCCTGCCCCTGTTTGCC
AAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGTAACAATAGGAACTTT
AACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTTTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAA
GCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATA
TGATGAAGAAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGA
CCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCC
AACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTG
CTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAAT
CCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCT
AGGGGATAACGACACTTTACCACTACCAAAGAGAAGAAGAAATTTCCAGCATTTGCTTACATCTATGGCTTCCTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTC
AAGCTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACA
TTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTT
TAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCTCAAATGCAAA
TTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGTATG
ATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAA
ACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCGGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGG
TTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGG
GACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTGTACTGGTTTCACGAGGACATTGTTCTAATATACCGGAAACAGAGGA
TACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGAT
CTACTCCGTTGGCAACAGATGAGTATGTCACACCGACGACCACTCTACCAGGGGTAAGGGATACTCACACTATTCCTTCTAATACAGTTAACCTGCTTGGGTGTGATACA
GGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGGAGAGAAA
GAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGG
CACCTGGGCCAGTTGATACAATTGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGT
AAGACTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATG
TGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAATCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACA
CCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCGGCTACT
TGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGA
TGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCGTAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAA
CACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATCCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAG
AATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTACGAGATCA
GGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCA
GTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAG
ACAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGG
ATCATGCGAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTC
AACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGA
TGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAACCTGTGCAGTCGGATTTTGCACTCCTGCCCCTGTTTGCC
AAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGTAACAATAGGAACTTT
AACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTTTCATGGGGAGGTCAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAA
GCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATA
TGATGAAGAAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGA
CCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCC
AACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTG
CTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAAT
CCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCT
AGGGGATAACGACACTTTACCACTACCAAAGAGAAGAAGAAATTTCCAGCATTTGCTTACATCTATGGCTTCCTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTC
AAGCTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACA
TTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTT
TAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCTCAAATGCAAA
TTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGTATG
ATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAA
ACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCGGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGG
TTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGG
GACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTGTACTGGTTTCACGAGGACATTGTTCTAATATACCGGAAACAGAGGA
TACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGAT
CTACTCCGTTGGCAACAGATGAGTATGTCACACCGACGACCACTCTACCAGGGGTAAGGGATACTCACACTATTCCTTCTAATACAGTTAACCTGCTTGGGTGTGATACA
GGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGGAGAGAAA
GAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGG
CACCTGGGCCAGTTGATACAATTGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGT
AAGACTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATG
TGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAATCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACA
CCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCGGCTACT
TGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGA
TGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCGTAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAA
CACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATCCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAG
AATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTACGAGATCA
GGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCA
GTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKERQGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIV
NPIPAHANFELKPMITKEARSSWSFGSGHCDLDAKRDGYNEPEAERDGIGNKKSISHADTTCAVGFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNNRNF
NPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKKYMARTDAVIQSQAASMRNFETQLGQLANELKNR
PQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDLLVTEIEEELDKIAEGPEDVAN
PIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPLPKRRRNFQHLLTSMASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQT
FECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGM
ISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENR
DEEQFYSSPLIITPEDGNDDFVLVSRGHCSNIPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPTTTLPGVRDTHTIPSNTVNLLGCDT
GRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRERKSSKRRAVQAKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSR
KTMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLAQLDEALACVGKPSAT
WDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGVDVNYGELINTSIHECAHRTRGKLYHPRLVTSLSLRQGVQLPEDQIKRDAPIVEEK
NIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS
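
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal translator that assumes the standard genetic code; the record does not state which translation table was used, and `translate` and `CODON_TABLE` are hypothetical names introduced for illustration. It is applied here to the first and last few codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order
# so that the amino-acid string lines up with TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons and last four codons of the CDS in this record.
print(translate("ATGAGCACAAGATCT"))
print(translate("CCGACTTCATGA"))
```

The two fragments translate to MSTRS and PTS followed by a stop, matching the start and end of the protein sequence listed above. Passing the full CDS, with line breaks left in, should allow the same comparison over the whole sequence.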