; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111020110
Genome locationchr4:11310114..11320210
RNA-Seq ExpressionMoc04g14810
SyntenyMoc04g14810
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.8e-22388.02Show/hide
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGM+SPRVP P QLPFLERG QAPQLNPNI STFN G LKP EPP MP P NM MD   EHGGE EKS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKS

Query:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDT
        H  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDT

Query:  PTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAVQ
        P EGAF SHQ LPTGAT STPLAT+EY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKRRE KSSKRRAVQ
Subjt:  PTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAVQ

Query:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLALGDVPDDWRETARDKEWRPLIQPI
          KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLALGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]2.5e-2393.65Show/hide
Query:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL
        EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP+
Subjt:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]9.9e-19093.89Show/hide
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGM+SPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPR

Query:  VPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERG QAPQLNPNI STFNMG LKPLEP RM  P NMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAF SHQELPTGAT STPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia]4.7e-22494.71Show/hide
Query:  HTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQ PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGM+SPRVPPP QL FLERG QAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
        PTP NMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAF SHQELPTGAT S PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRREKKSSK
        IREMFQYKRREK SSK
Subjt:  IREMFQYKRREKKSSK

XP_022158357.1 uncharacterized protein LOC111024860 [Momordica charantia]1.8e-22894.08Show/hide
Query:  MASSSQHTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST  PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGM+SP+VPPP QLPFLERG QAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTP NMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAF SHQELPTGAT STPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRREKKSSK
         EDTQTIR+MFQYKRREKK  K
Subjt:  LEDTQTIREMFQYKRREKKSSK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]5.8e-29187.88Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAQANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIATAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPA  NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIA AFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAQANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIATAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCSNHGLPACIQIEHFFRGLDHPTKMMLNNAANEAFTKKTLNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAAN AFTKKT NEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCSNHGLPACIQIEHFFRGLDHPTKMMLNNAANEAFTKKTLNEIVDILNDLASHNELW

Query:  CSQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQ SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSD+CT APVCQVNDLIC                           
Subjt:  CSQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195155.1e-22488.24Show/hide
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGM+SPRVP P QLPFLERG QAPQLNPNI STFN G LKP EPP MP P NM MD   EHGGE EKS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKS

Query:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDT
        H  RLEP VSIGQKRKGKEVM DPEI  DGSSRRLTPKDSSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDT

Query:  PTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAVQ
        P EGAF SHQ LPTGAT STPLATDEY T MATLPGVRDAH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKRRE KSSKRRAVQ
Subjt:  PTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAVQ

Query:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLALGDVPDDWRETARDKEWRPLIQPI
          KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDL LGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLALGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195151.2e-2393.65Show/hide
Query:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL
        EIEEELDKIAEGPEDV +P+EKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLP+
Subjt:  EIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPL

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195154.8e-19093.89Show/hide
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGM+SPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPR

Query:  VPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        VPPP QLPFLERG QAPQLNPNI STFNMG LKPLEP RM  P NMPMD GDEHGGEQEKSHS RLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRD
        SSMENRDEEQFYSSPLIITP DGNDDFLLVSRG+CSNMPETEDTENEVVRTDTQEPS LDTPTEGAF SHQELPTGAT STPLATDEYVTPMATLPGVRD
Subjt:  SSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRD

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAV

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110201102.3e-22494.71Show/hide
Query:  HTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY
        HTSLPTSSTQ PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMY
Subjt:  HTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMY

Query:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRM
        QPFMFN VPSYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGM+SPRVPPP QL FLERG QAPQLNPNI STFNMGQLKPLEPPRM
Subjt:  QPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRM

Query:  PTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE
        PTP NMPMD GDEHGGEQEKSHSHRLEPGV IGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIIT EDGNDDFLLVSRGHCSNMPE
Subjt:  PTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPE

Query:  TEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT
        TEDTENEVVRTDTQEPSPLDTPTEGAF SHQELPTGAT S PLATDEYVT MATLPGVRDAHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QT
Subjt:  TEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQT

Query:  IREMFQYKRREKKSSK
        IREMFQYKRREK SSK
Subjt:  IREMFQYKRREKKSSK

A0A6J1DW02 uncharacterized protein LOC1110248972.8e-29187.88Show/hide
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDSGIVNPIPAQANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIATAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE
        AFQNFDSGIVNPIPA  NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIA AFRLPGITDDA  LTLFPFSLKDQAR  LNAFP GSITTWGSLVE
Subjt:  AFQNFDSGIVNPIPAQANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIATAFRLPGITDDALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCSNHGLPACIQIEHFFRGLDHPTKMMLNNAANEAFTKKTLNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLDHPTKMMLNNAAN AFTKKT NEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCSNHGLPACIQIEHFFRGLDHPTKMMLNNAANEAFTKKTLNEIVDILNDLASHNELW

Query:  CSQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN
        CSQ SRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSD+CT APVCQVNDLIC                           
Subjt:  CSQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVGHGNN

Query:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE
                     WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF 
Subjt:  RNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFE

Query:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        TQLG LANELKNRPQGSFPGHTELP+REGKEQCKAVTLRSGL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  TQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DX11 uncharacterized protein LOC1110248608.9e-22994.08Show/hide
Query:  MASSSQHTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM
        MASSSQHTSLPTSST  PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPM
Subjt:  MASSSQHTSLPTSSTQTPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPM

Query:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKP
        INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGM+SP+VPPP QLPFLERG QAPQL+PNI ST+NMGQLKP
Subjt:  INPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKP

Query:  LEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH
        LEPPRMPTP NMPMDAGDEHGGEQEK HSHRLEPGVSIGQKRKGKEVM DPEIEEDGSSRRLTPKDS+MENRDEEQFYSS  IITPEDGNDDFLLVSRGH
Subjt:  LEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGH

Query:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
        CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAF SHQELPTGAT STPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED
Subjt:  CSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPGVRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED

Query:  LEDTQTIREMFQYKRREKKSSK
         EDTQTIR+MFQYKRREKK  K
Subjt:  LEDTQTIREMFQYKRREKKSSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTACCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACAATTTGGGTACGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCAAGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAACTGCATTTCGATTACCTGGTATAACAGAC
GATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGTCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTA
AAGAACTAATCAGGAAATGTTCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCC
AACGAAGCCTTTACAAAGAAGACATTGAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAATATCTAGGGCAGCACCAAAGAA
GCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCAT
TAGCCACGCCGATACAACCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAAT
TGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTC
ATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGT
ATAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCA
TCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGA
ACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACCGAACCAATTGTAAAGATACCAGAAA
ATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGTTAGTT
ACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGT
GGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCACTACCAAAGAGAAGAAGGAATTTCCAGCATT
TGCTTACATCTATGGCTTCCTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAACTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACAC
CACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACA
TGGTTATGTTAATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACC
AACCGTTTATGTTTAACCCGGTTCCTTCTTATCATTTTCCCTTGTCTCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCT
AGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGGATGCTTTCACCTAGGGTTCCCCCACCAGCTCAACTTCCATTCCTAGAAAGAGGATC
TCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTTAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAATCAATATGCCAATGGATGCAG
GAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCACAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATT
GAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAA
TGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATA
CACCTACAGAAGGAGCATTTGGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGAATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGG
GTAAGGGATGCTCATACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGA
TCTTGAGGACACGCAGACGATAAGGGAAATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGACGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCCGTGAATG
AACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTC
GAGACGAAATGGAACGCAGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTTGATCTCGCCCTAGGTGATGTGCCTGATGA
TTGGAGGGAGACCGCCAGAGACAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAATTCTATGCTGCAGTCCATCCCCAGTCAC
ATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACT
CCGACTCTAGCACAGCTTGATGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCT
AGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTAAAGGGCA
TAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAA
GGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAAAAGAATATTCGGCGTATTATTGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGAT
GTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCACCAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTG
CGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTATTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTACCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACAATTTGGGTACGTTAAAGTGCATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAGCGGACCC
TTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAAGAGAGAGAAGGTGAAATCAGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATG
GCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGTAATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAGATGGCGGACAACAGAGA
CGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTTGATTCAGGGATAGTCAACCCTATTCCAGCCCAAGCAAACTTTGAGCTTAAACCAATGATGTTCCAAA
TGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACATGAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAACTGCATTTCGATTACCTGGTATAACAGAC
GATGCTCTGAGACTAACACTTTTTCCATTTTCTTTGAAGGACCAAGCTAGAGCATGGCTCAATGCATTTCCACCGGGATCCATCACCACATGGGGGTCGTTAGTGGAGAA
GTTCTTAACAAAATTCTTCCCACCTACTCGTCACGCTGATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAGAGATTTA
AAGAACTAATCAGGAAATGTTCGAACCATGGCTTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGCTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCC
AACGAAGCCTTTACAAAGAAGACATTGAACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAATATCTAGGGCAGCACCAAAGAA
GCAAGATCCAGCTGGAGTTTTGGCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCATTGGGAATAAAAAATCCAT
TAGCCACGCCGATACAACCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATCTATGATAAT
TGTCCACATAACCCTGCTTCTGTTTTTTATGTAGGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTC
ATGGGGAGGACAAGGAGGTTCGAGCGGTTTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCACCACCGCAACAGCAGT
ATAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAGTACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCA
TCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGA
ACAGTGCAAAGCTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACCGAACCAATTGTAAAGATACCAGAAA
ATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTATGACTCGTTAGTT
ACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATGTGGCTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGT
GGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCACTACCAAAGAGAAGAAGGAATTTCCAGCATT
TGCTTACATCTATGGCTTCCTCTTCTCAACATACTTCACTTCCTACCTCTTCTACTCAAACTCCAAATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACAC
CACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTTGAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACA
TGGTTATGTTAATTTTCAGCAACTACCCACCTTTAATATACCTCAAAACAGTGAGTTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACC
AACCGTTTATGTTTAACCCGGTTCCTTCTTATCATTTTCCCTTGTCTCAAATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCT
AGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACATAATTATGGGATGCTTTCACCTAGGGTTCCCCCACCAGCTCAACTTCCATTCCTAGAAAGAGGATC
TCAAGCACCCCAATTGAACCCTAACATATCGAGTACCTTTAACATGGGACAACTAAAACCCTTAGAGCCTCCTAGGATGCCAACCCCAATCAATATGCCAATGGATGCAG
GAGATGAGCATGGAGGAGAGCAAGAGAAGAGCCACAGCCATAGGCTAGAGCCCGGGGTTTCGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATT
GAGGAAGATGGGAGTAGTAGGCGTCTGACGCCTAAGGATTCATCTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAA
TGATGATTTTCTACTGGTTTCACGAGGACATTGTTCAAATATGCCAGAAACAGAGGATACCGAGAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATA
CACCTACAGAAGGAGCATTTGGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGAATCTACTCCGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGG
GTAAGGGATGCTCATACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGA
TCTTGAGGACACGCAGACGATAAGGGAAATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGACGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCCGTGAATG
AACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTC
GAGACGAAATGGAACGCAGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTTGATCTCGCCCTAGGTGATGTGCCTGATGA
TTGGAGGGAGACCGCCAGAGACAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAATTCTATGCTGCAGTCCATCCCCAGTCAC
ATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACT
CCGACTCTAGCACAGCTTGATGAGGCTTTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCT
AGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTAAAGGGCA
TAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAA
GGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAAAAGAATATTCGGCGTATTATTGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGAT
GTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCACCAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTG
CGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTATTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNWFSTNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHNLGTLKCMSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEISPESEVESTSTSM
ADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNFDSGIVNPIPAQANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIATAFRLPGITD
DALRLTLFPFSLKDQARAWLNAFPPGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCSNHGLPACIQIEHFFRGLDHPTKMMLNNAA
NEAFTKKTLNEIVDILNDLASHNELWCSQISRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAPVCQVNDLICSFCSENHIYDN
CPHNPASVFYVGHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAA
SMRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFYDSLV
TEIEEELDKIAEGPEDVANPIEKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPLPKRRRNFQHLLTSMASSSQHTSLPTSSTQTPNATSIPFPPLENFQH
HMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFP
SLPPYYGMGHWVPPHNYGMLSPRVPPPAQLPFLERGSQAPQLNPNISSTFNMGQLKPLEPPRMPTPINMPMDAGDEHGGEQEKSHSHRLEPGVSIGQKRKGKEVMIDPEI
EEDGSSRRLTPKDSSMENRDEEQFYSSPLIITPEDGNDDFLLVSRGHCSNMPETEDTENEVVRTDTQEPSPLDTPTEGAFGSHQELPTGATESTPLATDEYVTPMATLPG
VRDAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEV
ETKWNAANLATRTSLMKSRKIMTELGFDLALGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVT
PTLAQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQ
GVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENHQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSIDTDPSPQPPTS