CuGenDBv2

Moc03g14880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g14880
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome location: chr3:10015813..10022055
RNA-Seq Expression: Moc03g14880
Synteny: Moc03g14880
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
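
The genome location above is written as `chromosome:start..end`. As a minimal sketch, the span can be parsed and its length computed as shown here. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts and a span length.

    Assumption: coordinates are 1-based and inclusive, so the length is
    end - start + 1. Adjust if the source uses another convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr3:10015813..10022055")
```

Under the inclusive-coordinate assumption, `span` is the number of genomic bases covered by the gene model, introns included.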


Homology
GenBank top hits (columns: e value, %identity, alignment)
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia] (e value: 4.3e-224; %identity: 87.58)
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNILSTFN G L P EPP MP PTNM MD   EHGGE +KS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKS

Query:  HSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT
        H  RLEP V IGQKRKGKEVM DPEI  DGSSRRLTPKDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT

Query:  PTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAVQ
        P EG FCSHQ LPTGATGST LAT+EY T MATLPGVR+AH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKRR+ KSSKRRAVQ
Subjt:  PTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAVQ

Query:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI
          KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYATLHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYA  HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYATLHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia] (e value: 7.2e-54; %identity: 90.76)
Query:  EIEEELDKMVEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYVYLGDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP
        EIEEELDK+ EGPEDVT+P+EKIQKEECKSLLPSIVEPPTLEQKPL SHLKY YLGDNDT+PVI+ SNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP
Subjt:  EIEEELDKMVEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYVYLGDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP

Query:  AFCMHKILLEEDAKNFIET
        AFCMHKILLEEDAKN IE+
Subjt:  AFCMHKILLEEDAKNFIET

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia] (e value: 4.5e-221; %identity: 93.58)
Query:  PNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY
        PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY
Subjt:  PNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY

Query:  HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAG
        HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNILST+NMGQL PLEPPRMPTPTNMPMDAG
Subjt:  HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAG

Query:  DEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRT
        DEHGGEQ+K HSHRLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKDSTMENRDEEQFYSS  IITPEDGNDDFLLVSRG+CSNMPETEDT+NEVVRT
Subjt:  DEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRT

Query:  DTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRK
        DTQEPSPLDTPTEG FCSHQELPTGATGST LATDEYVTPMATLPGVR+AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED EDTQTIR+MFQYKRR+
Subjt:  DTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRK

Query:  RKSSK
        +K  K
Subjt:  RKSSK

XP_022152366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 [Momordica charantia] (e value: 1.4e-219; %identity: 93.61)
Query:  ETPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVP
        + PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMYQPFMFN VP
Subjt:  ETPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVP

Query:  SYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMD
        SYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNILSTFNMGQL PLEPPRMPTPTNMPMD
Subjt:  SYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMD

Query:  AGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVV
         GDEHGGEQ+KSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDS+MENRDEEQFYSSPLIIT EDGNDDFLLVSRG+CSNMPETEDT+NEVV
Subjt:  AGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVV

Query:  RTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR
        RTDTQEPSPLDTPTEG FCSHQELPTGATGS  LATDEYVT MATLPGVR+AHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QTIREMFQYKR
Subjt:  RTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR

Query:  RKRKSSK
        R++ SSK
Subjt:  RKRKSSK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] (e value: 4.0e-246; %identity: 82.6)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEIRPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEI PESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEIRPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDKGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANEFRLP--------------------------------------VE
        AFQNFD GIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIAN FRLP                                      VE
Subjt:  AFQNFDKGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANEFRLP--------------------------------------VE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDNPTKMMLNNAANGAFAKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLD+PTKMMLNNAANGAF KKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDNPTKMMLNNAANGAFAKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAP-----DM------------GTIGTLTHIRTPTTKNK
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSD+CT AP     D+            G  G+    +  + +NK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAP-----DM------------GTIGTLTHIRTPTTKNK

Query:  QFYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS
        Q YVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS
Subjt:  QFYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

XP_022158852.1 uncharacterized protein LOC111025316 [Momordica charantia] (e value: 5.3e-190; %identity: 93.06)
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNILSTFNMG L PLEP RM  PTNMPMD GDEHGGEQ+KSHS RLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  STMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRN
        S+MENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPS LDTPTEG FCSHQELPTGATGST LATDEYVTPMATLPGVR+
Subjt:  STMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRN

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRR++KSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAV

TrEMBL top hits (columns: e value, %identity, alignment)
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 (e value: 1.2e-224; %identity: 87.8)
Query:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKS
        ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRV  PPQLPFLERGPQAPQLNPNILSTFN G L P EPP MP PTNM MD   EHGGE +KS
Subjt:  ASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKS

Query:  HSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT
        H  RLEP V IGQKRKGKEVM DPEI  DGSSRRLTPKDS+MENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPSPLDT
Subjt:  HSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDT

Query:  PTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAVQ
        P EG FCSHQ LPTGATGST LATDEY T MATLPGVR+AH IPSN VNPL C TGR KVGENSTQE   EED EDTQTIREMFQYKRR+ KSSKRRAVQ
Subjt:  PTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAVQ

Query:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI
          KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPI
Subjt:  AKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPI

Query:  QCEALELVREFYATLHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL
        QCEALELVREFYA  HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTL
Subjt:  QCEALELVREFYATLHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTL

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 (e value: 4.6e-54; %identity: 90.76)
Query:  EIEEELDKMVEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYVYLGDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP
        EIEEELDK+ EGPEDVT+P+EKIQKEECKSLLPSIVEPPTLEQKPL SHLKY YLGDNDT+PVI+ SNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP
Subjt:  EIEEELDKMVEGPEDVTNPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYVYLGDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISP

Query:  AFCMHKILLEEDAKNFIET
        AFCMHKILLEEDAKN IE+
Subjt:  AFCMHKILLEEDAKNFIET

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 (e value: 2.2e-221; %identity: 93.58)
Query:  PNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY
        PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPT NIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY
Subjt:  PNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSY

Query:  HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAG
        HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHW+PP NYGMISP+V PPPQLPFLERGPQAPQL+PNILST+NMGQL PLEPPRMPTPTNMPMDAG
Subjt:  HFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAG

Query:  DEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRT
        DEHGGEQ+K HSHRLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKDSTMENRDEEQFYSS  IITPEDGNDDFLLVSRG+CSNMPETEDT+NEVVRT
Subjt:  DEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRT

Query:  DTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRK
        DTQEPSPLDTPTEG FCSHQELPTGATGST LATDEYVTPMATLPGVR+AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEED EDTQTIR+MFQYKRR+
Subjt:  DTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRK

Query:  RKSSK
        +K  K
Subjt:  RKSSK

A0A6J1DHH9 LOW QUALITY PROTEIN: uncharacterized protein LOC111020110 (e value: 6.9e-220; %identity: 93.61)
Query:  ETPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVP
        + PNATSIPFPPLENFQHHM SDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQ+SEFRAENPQQLPPMINPGMYQPFMFN VP
Subjt:  ETPNATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVP

Query:  SYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMD
        SYHFPLSQMQIPASVHPYGMPNPSTLFPSL PYYGMGHWVPP NYGMISPRV PPPQL FLERGPQAPQLNPNILSTFNMGQL PLEPPRMPTPTNMPMD
Subjt:  SYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMD

Query:  AGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVV
         GDEHGGEQ+KSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDS+MENRDEEQFYSSPLIIT EDGNDDFLLVSRG+CSNMPETEDT+NEVV
Subjt:  AGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVV

Query:  RTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR
        RTDTQEPSPLDTPTEG FCSHQELPTGATGS  LATDEYVT MATLPGVR+AHTIPSNTVNPLGCDTGRS+VGENSTQEPTSEEDLED QTIREMFQYKR
Subjt:  RTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKR

Query:  RKRKSSK
        R++ SSK
Subjt:  RKRKSSK

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 1.9e-246; %identity: 82.6)
Query:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEIRPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT
        MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLE QKEREGEI PESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFN+IQMADNRDVAMREYAAT
Subjt:  MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEIRPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAAT

Query:  AFQNFDKGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANEFRLP--------------------------------------VE
        AFQNFD GIVNPIPAH NFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIAN FRLP                                      VE
Subjt:  AFQNFDKGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANEFRLP--------------------------------------VE

Query:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDNPTKMMLNNAANGAFAKKTFNEIVDILNDLASHNELW
        KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKC NHGLPAC QIEHFFRGLD+PTKMMLNNAANGAF KKTFNEIVDILNDLASHNELW
Subjt:  KFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHGLPACIQIEHFFRGLDNPTKMMLNNAANGAFAKKTFNEIVDILNDLASHNELW

Query:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAP-----DM------------GTIGTLTHIRTPTTKNK
        CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAT IQPVQSD+CT AP     D+            G  G+    +  + +NK
Subjt:  CSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQPVQSDFCTPAP-----DM------------GTIGTLTHIRTPTTKNK

Query:  QFYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS
        Q YVPPTQQ+IPPPQQQYNQRTQTPP+QNNNSNLENMMKEYMARTDAVIQSQAASMRNF TQLG LANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS
Subjt:  QFYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQGSFPGHTELPRREGKEQCKAVTLRS

Query:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS
        GL YDGPTMPTTDVQIPST+P VKIPENPTTPEKENIRKG+ +  S
Subjt:  GLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECS

A0A6J1DY94 uncharacterized protein LOC111025316 (e value: 2.6e-190; %identity: 93.06)
Query:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
        M+PHGYVNFQ LPT NIPQNSEFRAE+PQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR
Subjt:  MSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQMQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPR

Query:  VSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKD
        V PPPQLPFLERGPQAPQLNPNILSTFNMG L PLEP RM  PTNMPMD GDEHGGEQ+KSHS RLEPGV IGQKRKGKEVM DPEIEEDGSSRRLTPKD
Subjt:  VSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKSHSHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKD

Query:  STMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRN
        S+MENRDEEQFYSSPLIITP DGNDDFLLVSRGNCSNMPETEDT+NEVVRTDTQEPS LDTPTEG FCSHQELPTGATGST LATDEYVTPMATLPGVR+
Subjt:  STMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGVFCSHQELPTGATGSTLLATDEYVTPMATLPGVRN

Query:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAV
        AHTIPSN VNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRR++KSSKRRAV
Subjt:  AHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAV

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
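
The alignment blocks above place a match line between each Query and Subjt row. The sketch below assumes the usual BLAST-style convention for that line (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap); the page does not document this, so treat it as an assumption. The function name `alignment_stats` is hypothetical. The per-block value it returns need not equal the %identity reported for a hit, which is computed over the full alignment.

```python
def alignment_stats(query, midline):
    """Count identities and positives in one aligned block.

    `query` is the residue string of a Query row and `midline` is the
    match line beneath it, both with the row label and leading padding
    removed. Assumed convention: letter = identity, '+' = conservative
    substitution, space = mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("match line is longer than the query row")
    # Trailing spaces are often stripped in copied text; restore them.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": 100.0 * identities / length,
    }


# Short final block of the second GenBank hit shown above.
stats = alignment_stats("AFCMHKILLEEDAKNFIET", "AFCMHKILLEEDAKN IE+")
```

Summing `identities` and `length` over every block of a hit before dividing gives a figure for the displayed alignment as a whole.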

Sequences
CDS sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAACGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAA
GAGAGAGAAGGTGAAATCCGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGT
AATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAAATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTT
GATAAAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACAT
GAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGAATTTCGATTACCTGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCT
GATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAAAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGC
TTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGTTTAGATAATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTGCAAAGAAGACATTC
AACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAACAAGATCCAGCTGGAGTTTTG
GCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAA
CCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCAAGAACAAACAGTTCTATGTTCCA
CCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAG
TACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAA
GGTTCTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCA
ACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAGAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGC
TCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATGGTAGAAGGACCGGAAGAT
GTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGTCGTCGCATTTG
AAATATGTGTATCTAGGGGATAACGACACTGTACCGGTTATTATAACTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAA
AAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGACTCCA
AATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTT
GAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACTTTTAATATACCTCAAAACAGTGAG
TTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCATTGTCTCAA
ATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACAT
AATTATGGTATGATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTC
AACATGGGACAACTAATACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGACAAGAGCCAT
AGCCATAGGCTAGAGCCCGGGGTGTTGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACA
CCTAAGGATTCGACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGA
GGCAATTGTTCAAATATGCCGGAAACAGAGGATACCGACAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGTGTTT
TGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCTGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGAATGCTCAC
ACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGAC
ACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGAAGAGAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCTGTGAATGAACCT
AAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTC
GAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCT
GATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTTAGAGAGTTCTATGCTACTCTCCAT
CCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAAT
AAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCTGCTACTTGGGATTTGACTACTCATGGCAAGGTACGACTA
AAACCCGAGAATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTG
CTGGTTTATGCCATGCTAAAGGGCATAAATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCA
CGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATC
GCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATGAGGTTCGAGAAGTC
GTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGAC
ATTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequence
ATGAGCACAAGATCTTTTCTTCTACCCCTTGACCCTGAGATTGAACGGACCCTTCGAAAAACTCGAAAGGAGCAAAGACTTCGAAAACAACTAGAAAAGCAAAAA
GAGAGAGAAGGTGAAATCCGTCCTGAAAGTGAGGTAGAGAGTACGAGCACATCAATGGCAGATATTCCACCTCGTGATCCGGTTGATCCACCTGCTGTTAACGGT
AATATGAGGGATCATGCAAGAAATGATGAATTCAACCATATCCAAATGGCGGACAACAGAGACGTGGCAATGCGAGAATATGCCGCCACGGCTTTTCAGAACTTT
GATAAAGGGATAGTCAACCCTATTCCAGCCCACGCAAACTTTGAGCTTAAACCAATGATGTTCCAAATGTTGCAGACAATTGGACATTTTGGAGGGCAGGAACAT
GAGGATCCACATGATCATCTGAAATCATTCATTCAAATTGCAAATGAATTTCGATTACCTGTGGAGAAGTTCTTAACAAAATTCTTCCCACCTACTCGCCACGCT
GATATCAGAGAGGAGATCATCTCCTTTAGACAGTATGATCGTGAACCTGTTCACGAGGCGTGGGAAAGATTTAAAGAACTAATCAGGAAATGTCCGAACCATGGC
TTGCCGGCATGCATCCAGATAGAACATTTCTTTAGAGGTTTAGATAATCCTACTAAGATGATGCTAAACAATGCTGCCAATGGAGCCTTTGCAAAGAAGACATTC
AACGAGATAGTTGACATCCTAAATGACTTAGCTTCACACAACGAACTATGGTGTTCGCAAAGATCTAGGGCAGCACCAAAGAAACAAGATCCAGCTGGAGTTTTG
GCTCTGGACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTAAAAGAGATGGCATTGGGAATAAAAAATCCATTAGCCACGCCGATACAA
CCTGTGCAGTCGGATTTTTGCACTCCTGCCCCTGACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCAAGAACAAACAGTTCTATGTTCCA
CCTACACAGCAATACATCCCACCACCGCAACAGCAGTACAATCAGAGAACACAGACTCCACCAGTTCAAAATAACAACTCAAATCTTGAGAATATGATGAAGGAG
TACATGGCCCGAACCGACGCAGTGATACAATCCCAAGCGGCATCAATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAA
GGTTCTTTTCCAGGCCATACTGAATTACCAAGACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGGCATATGATGGACCAACAATGCCA
ACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATACCAGAAAATCCAACAACACCAGAGAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGC
TCTGCTATAACTAGCTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATGGTAGAAGGACCGGAAGAT
GTGACTAATCCTATTGAAAAAATACAAAAAGAAGAATGCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCAGAAGCCATTGTCGTCGCATTTG
AAATATGTGTATCTAGGGGATAACGACACTGTACCGGTTATTATAACTTCCAATTTATCACCTACTAATGAATATTCTTTATTGCAGATTTTGGAGAAGCACAAA
AAGGCCATTGGATGGACGATAGCAGATATCCGAGGGATAAGCCCGGCCTTTTGCATGCACAAAATCTTATTGGAAGAAGATGCTAAGAACTTTATCGAGACTCCA
AATGCCACAAGCATTCCTTTTCCACCATTGGAGAACTTCCAACACCACATGGGTTCAGATTCTAGGCTAGCTGCTGTTAGGGGAGGAAACCCGCTTCAAACATTT
GAATGTCCTCCATCCCAAGCTCCTACACAACATCCTATAATGAGCCCACATGGTTATGTTAATTTTCAGCAACTACCCACTTTTAATATACCTCAAAACAGTGAG
TTTAGGGCTGAAAATCCTCAACAACTTCCTCCAATGATCAATCCGGGTATGTACCAACCGTTTATGTTTAACCCGGTTCCTTCCTATCATTTTCCATTGTCTCAA
ATGCAAATTCCAGCTAGTGTTCATCCTTATGGAATGCCAAACCCATCTACTCTATTTCCTAGCCTTCCACCTTATTATGGAATGGGTCATTGGGTACCTCCACAT
AATTATGGTATGATTTCACCTAGAGTTTCCCCACCACCTCAACTTCCATTCCTAGAAAGAGGACCTCAAGCACCCCAATTGAACCCTAACATATTGAGTACCTTC
AACATGGGACAACTAATACCCTTAGAGCCTCCTAGGATGCCAACCCCAACCAATATGCCAATGGATGCAGGAGATGAGCATGGAGGAGAGCAAGACAAGAGCCAT
AGCCATAGGCTAGAGCCCGGGGTGTTGATAGGGCAAAAGAGGAAGGGCAAGGAGGTGATGATAGACCCAGAAATTGAGGAAGATGGGAGTAGTAGGCGTCTGACA
CCTAAGGATTCGACTATGGAGAACAGGGACGAGGAGCAGTTTTATTCATCTCCATTGATTATTACACCGGAAGATGGTAATGATGATTTTCTACTGGTTTCACGA
GGCAATTGTTCAAATATGCCGGAAACAGAGGATACCGACAACGAAGTGGTGAGAACAGATACTCAGGAACCTTCCCCATTGGATACACCTACAGAAGGAGTGTTT
TGTAGTCACCAAGAGTTGCCTACTGGGGCAACTGGATCTACTCTGTTGGCAACGGATGAGTATGTCACACCGATGGCCACTCTACCAGGGGTAAGGAATGCTCAC
ACTATTCCTTCTAATACAGTTAACCCGCTTGGGTGTGATACAGGTAGATCCAAGGTTGGGGAGAATAGTACTCAAGAACCTACCAGCGAAGAAGATCTTGAGGAC
ACGCAGACGATAAGGGAAATGTTCCAATACAAGAGGCGGAAGAGAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCTGTGAATGAACCT
AAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTC
GAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCT
GATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTTAGAGAGTTCTATGCTACTCTCCAT
CCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAAT
AAGATGTTAGTGACTCCGACTCTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCTGCTACTTGGGATTTGACTACTCATGGCAAGGTACGACTA
AAACCCGAGAATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTG
CTGGTTTATGCCATGCTAAAGGGCATAAATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCA
CGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATC
GCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATGAGGTTCGAGAAGTC
GTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGAC
ATTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequence
MSTRSFLLPLDPEIERTLRKTRKEQRLRKQLEKQKEREGEIRPESEVESTSTSMADIPPRDPVDPPAVNGNMRDHARNDEFNHIQMADNRDVAMREYAATAFQNF
DKGIVNPIPAHANFELKPMMFQMLQTIGHFGGQEHEDPHDHLKSFIQIANEFRLPVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCPNHG
LPACIQIEHFFRGLDNPTKMMLNNAANGAFAKKTFNEIVDILNDLASHNELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATPIQ
PVQSDFCTPAPDMGTIGTLTHIRTPTTKNKQFYVPPTQQYIPPPQQQYNQRTQTPPVQNNNSNLENMMKEYMARTDAVIQSQAASMRNFETQLGQLANELKNRPQ
GSFPGHTELPRREGKEQCKAVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAITSLNPVMFDEFYDLLVTEIEEELDKMVEGPED
VTNPIEKIQKEECKSLLPSIVEPPTLEQKPLSSHLKYVYLGDNDTVPVIITSNLSPTNEYSLLQILEKHKKAIGWTIADIRGISPAFCMHKILLEEDAKNFIETP
NATSIPFPPLENFQHHMGSDSRLAAVRGGNPLQTFECPPSQAPTQHPIMSPHGYVNFQQLPTFNIPQNSEFRAENPQQLPPMINPGMYQPFMFNPVPSYHFPLSQ
MQIPASVHPYGMPNPSTLFPSLPPYYGMGHWVPPHNYGMISPRVSPPPQLPFLERGPQAPQLNPNILSTFNMGQLIPLEPPRMPTPTNMPMDAGDEHGGEQDKSH
SHRLEPGVLIGQKRKGKEVMIDPEIEEDGSSRRLTPKDSTMENRDEEQFYSSPLIITPEDGNDDFLLVSRGNCSNMPETEDTDNEVVRTDTQEPSPLDTPTEGVF
CSHQELPTGATGSTLLATDEYVTPMATLPGVRNAHTIPSNTVNPLGCDTGRSKVGENSTQEPTSEEDLEDTQTIREMFQYKRRKRKSSKRRAVQAKKPTVPVNEP
KTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYATLH
PQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLAQLDEALACVGKPSATWDLTTHGKVRLKPENVSLAAAGWLYIVKNRILPTEHDEHVTQDRAL
LVYAMLKGINVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDEVREV
VQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDIDPSPQPPTS
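
One way to check that the CDS and protein sequences listed above agree is to translate the CDS and compare the result with the protein. The sketch below assumes the standard genetic code and an in-frame CDS that starts at the first base; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence into a protein string.

    Whitespace and line breaks are ignored. With to_stop=True the
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last eleven codons of the CDS listed above.
head = translate("ATGAGCACAAGATCTTTTCTTCTACCC")
tail = translate("ATTGATCCTAGTCCACAACCTCCGACTTCATGA")
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the concatenated protein lines gives a complete consistency check.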