CuGenDBv2

Moc08g18540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g18540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome location: chr8:14016855..14024895
RNA-Seq Expression: Moc08g18540
Synteny: Moc08g18540
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
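The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes both coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr8:14016855..14024895")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end - start + 1
print(chrom, start, end, span_bp)  # chr8 14016855 14024895 8041
```

Under that assumption the gene model spans 8,041 bp on chr8.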


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]  e value: 2.7e-235  % identity: 43.36
Query:  MTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTVPVI
        MTKCSS AVGSPLPMKCNDPGSFTIP SIGGKNL  EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDT+PVI
Subjt:  MTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTVPVI

Query:  ITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIE-------------RFSLGKALNRPIAYALQFHQ-------------
        + SNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKN IE             R  + K L+  + Y +                 
Subjt:  ITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIE-------------RFSLGKALNRPIAYALQFHQ-------------

Query:  -----NPHNSLLP---------LVFFGK----------PTPFITLLFD---------------------FQPSPKRR------------RNFQHLLTSMA
             N  N L+           + + K          P PFI  + D                       P  + +            R     L +  
Subjt:  -----NPHNSLLP---------LVFFGK----------PTPFITLLFD---------------------FQPSPKRR------------RNFQHLLTSMA

Query:  SSSQHTSLPTSSTQAPNATSI-----------------------------------PFPP-----LENFQHHIG--------------------------
        ++ Q   +   S    N   +                                     PP      E+F  H G                          
Subjt:  SSSQHTSLPTSSTQAPNATSI-----------------------------------PFPP-----LENFQHHIG--------------------------

Query:  --------------------------------------SDSRLAAVRGGNP-------------------------------------------------
                                              SD  + A+ G                                                    
Subjt:  --------------------------------------SDSRLAAVRGGNP-------------------------------------------------

Query:  --------------------------LQTF------------QCPPSQAPTQRPIMCPHG------YVNFQQLPTLNIPQNSEFR----------AENPQ
                                  LQ F            Q     +  + P +C  G      +++ Q L   ++P  ++              N Q
Subjt:  --------------------------LQTF------------QCPPSQAPTQRPIMCPHG------YVNFQQLPTLNIPQNSEFR----------AENPQ

Query:  QLPPMINPGMY----QPFMF------------------------------------------------------NPVPS---------------------
        Q+  ++N   +    +PF++                                                       P PS                     
Subjt:  QLPPMINPGMY----QPFMF------------------------------------------------------NPVPS---------------------

Query:  ---------------------------------YHFPLSQMQ----------------------------------------------------------
                                          HF    MQ                                                          
Subjt:  ---------------------------------YHFPLSQMQ----------------------------------------------------------

Query:  ---------------------IP-----------------------------------------------------------------ASVHPYGMPNPS
                             +P                                                                 ASVHPYGMPNPS
Subjt:  ---------------------IP-----------------------------------------------------------------ASVHPYGMPNPS

Query:  TLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIG
        TLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNI STFN G LKP EPP MP PTNM MD   EHGGE EKSH  RLEP VSIG
Subjt:  TLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIG

Query:  QKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQEL
        QKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPSPLDTP EGAFCSHQ L
Subjt:  QKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQEL

Query:  PTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAVQAKKPTVPMNEPK
        PTGATGSTPLAT+EY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKR E KSSKRRAVQ  KPTVPMNEPK
Subjt:  PTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAVQAKKPTVPMNEPK

Query:  TRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSH-------------------------KEWRPLIQPIQCEALELVREFY
        TRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK H                         KEWRPLIQPIQCEALELVREFY
Subjt:  TRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSH-------------------------KEWRPLIQPIQCEALELVREFY

Query:  AAVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL
        AA HPQSHIAIV GKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTL
Subjt:  AAVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]  e value: 9.6e-225  % identity: 94.47
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHH+ SDSRLAAVRGGNPLQTF+CPPSQAPTQ PIM PHGYVNFQQLPT NIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
        PTPTNMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRWERKSSK
        IREMFQYKR E+ SSK
Subjt:  IREMFQYKRWERKSSK

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]  e value: 8.1e-06  % identity: 96.77
Query:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSS
        MPNYAKFLKDIVSRKKKIGEHELVAMTKCS+
Subjt:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSS
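The % identity value reported for the short alignment above can be recomputed from its Query and middle lines. This is a rough sketch that assumes the usual BLAST-style convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the alignment length is taken as the number of columns in the Query line.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline repeats the query residue."""
    # Only letters on the midline count as identities; '+' and ' ' do not.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return round(100.0 * identical / len(query), 2)


query = "MPNYAKFLKDIVSRKKKIGEHELVAMTKCSS"
midline = "MPNYAKFLKDIVSRKKKIGEHELVAMTKCS+"
print(percent_identity(query, midline))  # 96.77
```

Thirty of the 31 aligned columns are identical, which reproduces the 96.77 shown for this hit.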

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]  e value: 3.3e-193  % identity: 95
Query:  MCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M PHGYVNFQ LPTLNIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNI STFNMG LKPLEP RM  PTNMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR E+KSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAV

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]  e value: 2.6e-230  % identity: 94.31
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHH+ SDSRLAAVRGGNPLQTF+CPPSQAPTQ PIM PHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRWERKSSK
         EDTQTIR+MFQYKR E+K  K
Subjt:  LEDTQTIREMFQYKRWERKSSK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]  e value: 1.3e-192  % identity: 65.93
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNERLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIFSFCIENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLI            C               
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNERLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIFSFCIENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYIARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEY+ARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYIARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515  e value: 1.0e-235  % identity: 43.43
Query:  MTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTVPVI
        MTKCSS AVGSPLPMKCNDPGSFTIP SIGGKNL  EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDT+PVI
Subjt:  MTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTVPVI

Query:  ITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIE-------------RFSLGKALNRPIAYALQFHQ-------------
        + SNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKN IE             R  + K L+  + Y +                 
Subjt:  ITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIE-------------RFSLGKALNRPIAYALQFHQ-------------

Query:  -----NPHNSLLP---------LVFFGK----------PTPFITLLFD---------------------FQPSPKRR------------RNFQHLLTSMA
             N  N L+           + + K          P PFI  + D                       P  + +            R     L +  
Subjt:  -----NPHNSLLP---------LVFFGK----------PTPFITLLFD---------------------FQPSPKRR------------RNFQHLLTSMA

Query:  SSSQHTSLPTSSTQAPNATSI-----------------------------------PFPP-----LENFQHHIG--------------------------
        ++ Q   +   S    N   +                                     PP      E+F  H G                          
Subjt:  SSSQHTSLPTSSTQAPNATSI-----------------------------------PFPP-----LENFQHHIG--------------------------

Query:  --------------------------------------SDSRLAAVRGGNP-------------------------------------------------
                                              SD  + A+ G                                                    
Subjt:  --------------------------------------SDSRLAAVRGGNP-------------------------------------------------

Query:  --------------------------LQTF------------QCPPSQAPTQRPIMCPHG------YVNFQQLPTLNIPQNSEFR----------AENPQ
                                  LQ F            Q     +  + P +C  G      +++ Q L   ++P  ++              N Q
Subjt:  --------------------------LQTF------------QCPPSQAPTQRPIMCPHG------YVNFQQLPTLNIPQNSEFR----------AENPQ

Query:  QLPPMINPGMY----QPFMF------------------------------------------------------NPVPS---------------------
        Q+  ++N   +    +PF++                                                       P PS                     
Subjt:  QLPPMINPGMY----QPFMF------------------------------------------------------NPVPS---------------------

Query:  ---------------------------------YHFPLSQMQ----------------------------------------------------------
                                          HF    MQ                                                          
Subjt:  ---------------------------------YHFPLSQMQ----------------------------------------------------------

Query:  ---------------------IP-----------------------------------------------------------------ASVHPYGMPNPS
                             +P                                                                 ASVHPYGMPNPS
Subjt:  ---------------------IP-----------------------------------------------------------------ASVHPYGMPNPS

Query:  TLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIG
        TLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNI STFN G LKP EPP MP PTNM MD   EHGGE EKSH  RLEP VSIG
Subjt:  TLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIG

Query:  QKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQEL
        QKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPSPLDTP EGAFCSHQ L
Subjt:  QKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQEL

Query:  PTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAVQAKKPTVPMNEPK
        PTGATGSTPLATDEY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKR E KSSKRRAVQ  KPTVPMNEPK
Subjt:  PTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAVQAKKPTVPMNEPK

Query:  TRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSH-------------------------KEWRPLIQPIQCEALELVREFY
        TRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK H                         KEWRPLIQPIQCEALELVREFY
Subjt:  TRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSH-------------------------KEWRPLIQPIQCEALELVREFY

Query:  AAVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL
        AA HPQSHIAIV GKEIRFDATQINYTFNIKNI+DAVGNKMLVTPTL
Subjt:  AAVHPQSHIAIVCGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTL

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110  e value: 4.6e-225  % identity: 94.47
Query:  HTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQAPNATSIPFPPLENFQHH+ SDSRLAAVRGGNPLQTF+CPPSQAPTQ PIM PHGYVNFQQLPT NIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
        PTPTNMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGS PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRWERKSSK
        IREMFQYKR E+ SSK
Subjt:  IREMFQYKRWERKSSK

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110  e value: 3.9e-06  % identity: 96.77
Query:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSS
        MPNYAKFLKDIVSRKKKIGEHELVAMTKCS+
Subjt:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSS

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110  e value: 1.6e-193  % identity: 95
Query:  MCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M PHGYVNFQ LPTLNIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNI STFNMG LKPLEP RM  PTNMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR E+KSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAV

A0A6J1DW02 uncharacterized protein LOC111024897  e value: 6.1e-193  % identity: 65.93
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGI-------------------------------------------------------------------------------------------
        AFQNFDSGI                                                                                           
Subjt:  AFQNFDSGI-------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
                                                              IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW
Subjt:  ------------------------------------------------------IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNERLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIFSFCIENHIYDNCPHNPASVFYVGHGNN
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMN+RLKEMALGIKNPLAT IQPVQSDYCT APVCQVNDLI            C               
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNERLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIFSFCIENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYIARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEY+ARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYIARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKGN+DT SVPPQ
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQ

A0A6J1DX11 uncharacterized protein LOC111024860  e value: 1.3e-230  % identity: 94.31
Query:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST APNATSIPFPPLENFQHH+ SDSRLAAVRGGNPLQTF+CPPSQAPTQ PIM PHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTPTNMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRWERKSSK
         EDTQTIR+MFQYKR E+K  K
Subjt:  LEDTQTIREMFQYKRWERKSSK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCACCACTAAGCACCAGCGCCCATGCCGCCCTCGTGCCAGCGCTTGCCCCCAGCGCTTGCCGTCCACACCGCGCGCCCCTACCGCTTGCCCATGCTGTCCCACCCGC
ACTGATTACGCTTGTTTACATGCGTAAAAAGAAGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAA
CTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATT
CCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAAT
GCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACG
GAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAA
GATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACGAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGC
CACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTTTTCATTTTGCATTGAAAACCATATTTATGATAATTGTC
CACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACGTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGG
GGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAA
TCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATAGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAA
TGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAG
TGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATAGTAAAGATACCAGAAAATCC
AACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAG
TAAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTATAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAG
TTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAA
TGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATC
CTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTA
GGGGATAACGACACTGTACCGGTTATTATAACTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACAAT
AGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAGATTCAGTTTGGGGAAAGCTTTAAATAGAC
CCATTGCCTATGCTTTACAATTCCACCAAAACCCACACAACTCTCTTCTTCCCCTTGTTTTTTTCGGCAAACCCACTCCCTTCATCACCCTTCTTTTCGATTTTCAGCCA
TCACCAAAAAGAAGAAGAAATTTCCAGCATTTGCTTACATCTATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAATGCCACAAGCAT
TCCTTTTCCACCATTGGAGAACTTCCAACACCACATAGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTCAATGTCCTCCATCCCAAG
CTCCTACACAACGTCCTATAATGTGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAA
CTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTA
TGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGTATGATTTCACCTAGAGTTTCCCCAC
CACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATG
CCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAA
GGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCAT
CTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGA
ACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGA
GTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGA
ATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGTGGGAGAGAAAGAGTTCAAAACGTCGTGCAGTT
CAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAAT
TGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCACAAAGAGTGGAGACCACTCATTC
AGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGTGCGGGAAGGAAATACGGTTTGATGCCACT
CAGATCAACTACACCTTTAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGTTTGATGAGGCTTTAGAATGTGTTGGGAA
GCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAATCCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGC
CAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAG
TGTGCCCACCGAACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAAT
TGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAAC
AGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGT
CATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequence
ATGCCACCACTAAGCACCAGCGCCCATGCCGCCCTCGTGCCAGCGCTTGCCCCCAGCGCTTGCCGTCCACACCGCGCGCCCCTACCGCTTGCCCATGCTGTCCCACCCGC
ACTGATTACGCTTGTTTACATGCGTAAAAAGAAGGGTAAGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCCTTCGAAAAA
CTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATT
CCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGACGTGGCAAT
GCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACG
GAGCCTTTACAAAGAAGACATTCAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAGCAA
GATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACGAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCATTAGC
CACGCCGATACAACCTGTGCAGTCGGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTTTTCATTTTGCATTGAAAACCATATTTATGATAATTGTC
CACATAACCCTGCTTCCGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACGTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGG
GGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAACAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAA
TCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATAGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAA
TGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAG
TGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATAGTAAAGATACCAGAAAATCC
AACAACACCAGAAAAAGAAAATATTAGAAAAGGTAATGAGGACACCCCGAGTGTTCCTCCACAGGTACAGCCTTCTTTTACCCCTACTCCCCCGTTTCCACAAAGATTAG
TAAAGAAAAATAATGATTCCCAGTTTAGAAAATTTTTAGATATTATAAAACAACTGCATATAAATATACCATTAGTAGATGCTCTAGAACAGATGCCAAATTATGCTAAG
TTTTTGAAAGATATAGTTTCTAGGAAGAAAAAAATAGGAGAGCATGAACTGGTAGCCATGACAAAATGTAGTAGTGAAGCTGTAGGCAGCCCGCTACCCATGAAATGTAA
TGATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGAAAAAACTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATC
CTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTA
GGGGATAACGACACTGTACCGGTTATTATAACTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACAAT
AGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGAGATTCAGTTTGGGGAAAGCTTTAAATAGAC
CCATTGCCTATGCTTTACAATTCCACCAAAACCCACACAACTCTCTTCTTCCCCTTGTTTTTTTCGGCAAACCCACTCCCTTCATCACCCTTCTTTTCGATTTTCAGCCA
TCACCAAAAAGAAGAAGAAATTTCCAGCATTTGCTTACATCTATGGCTTCTTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAGCTCCAAATGCCACAAGCAT
TCCTTTTCCACCATTGGAGAACTTCCAACACCACATAGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTCAATGTCCTCCATCCCAAG
CTCCTACACAACGTCCTATAATGTGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACCCTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAA
CTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCCTTGTCCCAAATGCAAATTCCAGCTAGTGTTCATCCTTA
TGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGTATGATTTCACCTAGAGTTTCCCCAC
CACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTCAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATG
CCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCATAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAA
GGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCAT
CTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGA
ACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGCGTTTTGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCCGTTGGCAACGGATGA
GTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGGATGCTCACACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGA
ATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGACACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGTGGGAGAGAAAGAGTTCAAAACGTCGTGCAGTT
CAGGCTAAGAAGCCGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAAT
TGAACTAGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCACAAAGAGTGGAGACCACTCATTC
AGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGTGCGGGAAGGAAATACGGTTTGATGCCACT
CAGATCAACTACACCTTTAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTAGCACAGTTTGATGAGGCTTTAGAATGTGTTGGGAA
GCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAATCCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGC
CAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAG
TGTGCCCACCGAACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAAT
TGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAAC
AGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGT
CATCCATCTCCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequence
MPPLSTSAHAALVPALAPSACRPHRAPLPLAHAVPPALITLVYMRKKKGKLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADI
PPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQ
DPAGVLALDIATSMQKEMVTMNERLKEMALGIKNPLATPIQPVQSDYCTPAPVCQVNDLIFSFCIENHIYDNCPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSW
GGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYIARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQ
CKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGNEDTPSVPPQVQPSFTPTPPFPQRLVKKNNDSQFRKFLDIIKQLHINIPLVDALEQMPNYAK
FLKDIVSRKKKIGEHELVAMTKCSSEAVGSPLPMKCNDPGSFTIPCSIGGKNLVTEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYL
GDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIERFSLGKALNRPIAYALQFHQNPHNSLLPLVFFGKPTPFITLLFDFQP
SPKRRRNFQHLLTSMASSSQHTSLPTSSTQAPNATSIPFPPLENFQHHIGSDSRLAAVRGGNPLQTFQCPPSQAPTQRPIMCPHGYVNFQQLPTLNIPQNSEFRAENPQQ
LPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNISSTFNMGQLKPLEPPRM
PTPTNMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVR
TDTQEPSPLDTPTEGAFCSHQELPTGATGSTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRWERKSSKRRAV
QAKKPTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSHKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVCGKEIRFDAT
QINYTFNIKNIRDAVGNKMLVTPTLAQFDEALECVGKPSATWDLTTHGKVRLKSEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHE
CAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILG
HPSPSTDTDPSPQPPTS
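As a consistency check between the CDS and protein sequences listed above, the two ends of the CDS can be translated and compared with the ends of the protein. The sketch below assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first 36 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGCCACCACTAAGCACCAGCGCCCATGCCGCCCTC"  # first 36 nt of the CDS
cds_end = "CCTCCGACTTCATGA"                        # last 15 nt of the CDS

print(translate(cds_start))  # MPPLSTSAHAAL
print(translate(cds_end))    # PPTS*
```

Both fragments agree with the listed protein, which begins MPPLSTSAHAAL and ends PPTS; the final TGA codon is the stop.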