; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1667 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1667
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionVQ domain-containing protein
Genome locationMC06:24115339..24118006
RNA-Seq ExpressionMC06g1667
SyntenyMC06g1667
Gene Ontology termsGO:0009960 - endosperm development (biological process)
GO:0080113 - regulation of seed growth (biological process)
InterPro domainsIPR008889 - VQ
IPR039612 - VQ motif-containing protein 5/9/14
IPR039825 - VQ motif-containing protein 5/14


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148477.1 protein HAIKU1 [Cucumis sativus]3.11e-16880.3Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPSTQQ      PQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQRIRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ
        LTPINRP   PP+PVS+ PPQ+PYYNG  R AQ  CDQSST M QGQPA TQ  PQ IP DS+WPK A+SPISAYMRYLQSSAIDSP +GNQA    QAQ
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP
        V GQ+QNQVA SG    PDP +P    ST+  PVPSL NFPP Q++SP+ FPSPTQFHVPSPS YLNLLSPQSPYPLLSPG+RFPPPLSPNFAFSPMAQP
Subjt:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP

Query:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        GILGP P PPLSPGLVFP SPSGLFPLLSPRWRDW
Subjt:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

XP_008465974.1 PREDICTED: protein HAIKU1-like [Cucumis melo]1.20e-16579.4Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPSTQQ      PQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQRIRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ
        LTPINR H  PPVPVS+  P +PYYNG  R AQ  CDQSST M QGQPA TQ  PQ I  DS WPK A+SPISAYMRYLQ+SA+DSP +GNQ+   PQAQ
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP
        V GQVQNQVA SG    PDP +P    ST+  PVPS PNFPP Q+NSP+ FPSPTQFHVPSPS YLNLLSP SPYPLLSPG+RFPPPLSPNFAFSPMAQP
Subjt:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP

Query:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        GILGPGP PPLSPGL+FP SPSGLFPLLSPRWRDW
Subjt:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

XP_022159730.1 protein HAIKU1-like [Momordica charantia]2.91e-236100Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT
        MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT

Query:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ
        PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ
Subjt:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ

Query:  NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP
        NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP
Subjt:  NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP

Query:  HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS
        HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS
Subjt:  HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS

XP_038888616.1 protein HAIKU1-like [Benincasa hispida]6.35e-16679.28Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPS+QQ          PQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQ+IRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQV
        L PINRP VPPPV VS+ PPQ+PYYNG  RPA+  CDQSS M QGQPA TQ  PQ IP DS+WPK A+SPISAYMRYLQSSA+DSP IGNQA   PQAQV
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQV

Query:  SGQVQNQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGI
         GQVQNQVA S LPP    P  A   +   PVPSLPNFPP Q+NSP+ FPSPTQFHVPSPS YLNLLSPQSPYPLLSPG RFPPPLSPNF+FSPMAQPGI
Subjt:  SGQVQNQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGI

Query:  LGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        LGPGP PPLSPGL+FP SPSGLFPLLSPRWRDW
Subjt:  LGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

XP_040989596.1 protein HAIKU1-like [Juglans microcarpa x Juglans regia]1.86e-11863.58Show/hide
Query:  MDYSKNKQN-EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGS-SQEP-SRPPQNPAKQQSLRLQRIRPP
        MD SKN+ N +HLGVNKMGKNIKKSPLHQPNFG+ P+ QQ          PQPQVYNI+KNDFRNIVQQLTGS SQEP  RPP NP K Q++RLQ+IRPP
Subjt:  MDYSKNKQN-EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGS-SQEP-SRPPQNPAKQQSLRLQRIRPP

Query:  PLTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ
        PLTPINRPH+PPP+PV   PP + Y N  +RP Q +          QP+ T   P L PGD IWP TAESPISAYMRYLQ+S +D    GNQA    Q Q
Subjt:  PLTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPAT-HPSTSCPVPSLPNFPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQ
          GQ+Q Q  ++ L   P+PP+PA   P  + P P +PN    Q N P+  PSPT QF +PSPSGYLN +SP+SPYPLLSPGM+FPP LSPNFAFSPMAQ
Subjt:  VSGQVQNQVAASGLPPRPDPPIPAT-HPSTSCPVPSLPNFPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQ

Query:  PGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRD
         GILGPGP PPLSPGL FP SPSG FP+LSPRWRD
Subjt:  PGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRD

TrEMBL top hitse value%identityAlignment
A0A0A0LI32 VQ domain-containing protein1.51e-16880.3Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPSTQQ      PQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQRIRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ
        LTPINRP   PP+PVS+ PPQ+PYYNG  R AQ  CDQSST M QGQPA TQ  PQ IP DS+WPK A+SPISAYMRYLQSSAIDSP +GNQA    QAQ
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAP---QAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP
        V GQ+QNQVA SG    PDP +P    ST+  PVPSL NFPP Q++SP+ FPSPTQFHVPSPS YLNLLSPQSPYPLLSPG+RFPPPLSPNFAFSPMAQP
Subjt:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP

Query:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        GILGP P PPLSPGLVFP SPSGLFPLLSPRWRDW
Subjt:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

A0A1S3CQ56 protein HAIKU1-like5.82e-16679.4Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPSTQQ      PQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQRIRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ
        LTPINR H  PPVPVS+  P +PYYNG  R AQ  CDQSST M QGQPA TQ  PQ I  DS WPK A+SPISAYMRYLQ+SA+DSP +GNQ+   PQAQ
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP
        V GQVQNQVA SG    PDP +P    ST+  PVPS PNFPP Q+NSP+ FPSPTQFHVPSPS YLNLLSP SPYPLLSPG+RFPPPLSPNFAFSPMAQP
Subjt:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP

Query:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        GILGPGP PPLSPGL+FP SPSGLFPLLSPRWRDW
Subjt:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

A0A5A7T8P5 Protein HAIKU1-like5.82e-16679.4Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP
        MDYSKNKQNE LGVNK+GKNIKKSPLHQPNFG+IPSTQQ      PQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS RPPQNPA KQQSLRLQRIRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS-RPPQNPA-KQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ
        LTPINR H  PPVPVS+  P +PYYNG  R AQ  CDQSST M QGQPA TQ  PQ I  DS WPK A+SPISAYMRYLQ+SA+DSP +GNQ+   PQAQ
Subjt:  LTPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSST-MLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA---PQAQ

Query:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP
        V GQVQNQVA SG    PDP +P    ST+  PVPS PNFPP Q+NSP+ FPSPTQFHVPSPS YLNLLSP SPYPLLSPG+RFPPPLSPNFAFSPMAQP
Subjt:  VSGQVQNQVAASGLPPRPDPPIPATHPSTSC-PVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQP

Query:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        GILGPGP PPLSPGL+FP SPSGLFPLLSPRWRDW
Subjt:  GILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

A0A6J1DZK6 protein HAIKU1-like1.41e-236100Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT
        MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLT

Query:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ
        PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ
Subjt:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQ

Query:  NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP
        NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP
Subjt:  NQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGP

Query:  HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS
        HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS
Subjt:  HPPLSPGLVFPWSPSGLFPLLSPRWRDWQS

A0A6M2ERX8 VQ domain-containing protein1.36e-11661.22Show/hide
Query:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGS-SQEP-SRPPQNPAKQQSLRLQRIRPPP
        MD SKN+Q++HLGVNK+GKNIKKSPLHQPNFG+ P+ QQ          PQPQVYNI+KNDFRNIVQQLTGS SQEP  RPP NP K QS+RLQ+IRPPP
Subjt:  MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGS-SQEP-SRPPQNPAKQQSLRLQRIRPPP

Query:  LTPINRPHVPPPVPV-SMTPPQIPYYNGLIRPAQ-PQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA------
        LTP+NRPHVPPP P  ++ PP +PY+N  +RP   P   Q      G P+ T   P L PGDS W  TAESPISAYMRYLQ+S ID  S GNQA      
Subjt:  LTPINRPHVPPPVPV-SMTPPQIPYYNGLIRPAQ-PQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQA------

Query:  ----PQAQVSGQVQNQVAASGLPPRPD-PPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPN
            PQ     Q Q+Q  + GL P P  PP+P   P  + PVP +PN P  + N P   PSPT QF +PSP+GY+NLLSP+SPYPLLSPG++F P L+PN
Subjt:  ----PQAQVSGQVQNQVAASGLPPRPD-PPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPN

Query:  FAFSPMAQPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRD
        FAFSPMAQ G+LGPGP PPLSPG  FP SPSG FP  SPRWRD
Subjt:  FAFSPMAQPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRD

SwissProt top hitse value%identityAlignment
O82170 Protein HAIKU18.7e-5441.01Show/hide
Query:  KQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS--RPPQNPA-KQQSLRLQRIRPPPLTPIN
        +QN+HLGVN++GKNI+KSPLHQ  F +  S         P+ Q QPQVYNI+KNDFR+IVQQLTGS    S  RPPQN + + Q+ RLQRIRP PLT +N
Subjt:  KQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS--RPPQNPA-KQQSLRLQRIRPPPLTPIN

Query:  RPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------------------
        RP VP P   SM PPQ          + PQ  +        P +TQQ P +   D  W  TAESP+S YMRYLQSS  DS                    
Subjt:  RPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------------------

Query:  ---------------------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLP
                             P I     ++ +  Q Q+Q      P       P P P                    P P  H    P  + PVP  P
Subjt:  ---------------------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLP

Query:  NFP-PTQANSPSFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFP
          P P  +     FPSP            +QF  PSP+GY N+ SP+SPYPLLSPG+++P PL+PNF+FS +AQ G LGPG       P PP SPGL+FP
Subjt:  NFP-PTQANSPSFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFP

Query:  WSPSGLFPLLSPRWRDW
         SPSG FP+ SPRW D+
Subjt:  WSPSGLFPLLSPRWRDW

Q9M9F0 VQ motif-containing protein 94.8e-1234.56Show/hide
Query:  VNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQP-------QPQPQVYNINKNDFRNIVQQLTGS------SQEPSRPPQNPAKQQSLRLQRIRPPPLT
        +NK+   I K P +  +  S+ + +   P P P         Q QP VYNINKNDFR++VQ+LTGS      S  P +P  +P  QQS RL RIRPPPL 
Subjt:  VNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQP-------QPQPQVYNINKNDFRNIVQQLTGS------SQEPSRPPQNPAKQQSLRLQRIRPPPLT

Query:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSS--AIDSPSIGNQAPQAQVSGQ
                  V V   PP +   + LI       +Q+ T +      T     L P   +    AESP+S+YMRYLQ+S  AIDS               
Subjt:  PINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSS--AIDSPSIGNQAPQAQVSGQ

Query:  VQNQVAASGLPPRPDPPIPATHPSTSCPVPSLPN-FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMAQP
          N+   SGL P      P  +       PS  N FPP     PS   S T    +P+P  +    SP+SPY LLSP +   P      F  SP   P
Subjt:  VQNQVAASGLPPRPDPPIPATHPSTSCPVPSLPN-FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMAQP

Arabidopsis top hitse value%identityAlignment
AT1G32610.1 hydroxyproline-rich glycoprotein family protein1.1e-3239.47Show/hide
Query:  EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRP---PQNPAKQQSLRLQRIRPPPLTPINRPH
        + LGVNK+GKNIKKSPL                       PQPQ Y+++ NDF +IVQQLT S    S P   P+N  K Q    Q+IRP     INRP 
Subjt:  EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRP---PQNPAKQQSLRLQRIRPPPLTPINRPH

Query:  VPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------PSIGNQAPQAQVSGQ
        VPPPV         P +  + RP               P      P +  GD     TAES +S YMRY QSS  DS        PS  NQ  Q QV GQ
Subjt:  VPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------PSIGNQAPQAQVSGQ

Query:  VQNQVAAS---GLPPRPDPPIPATHPSTSCPVPSLPN--FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMA
         Q+    S       R  P +P   P    P   + N   P  + N     P+PT Q+   SP+ Y NLLSP+SP PLLS G+++PPPL+P N+ FS M 
Subjt:  VQNQVAAS---GLPPRPDPPIPATHPSTSCPVPSLPN--FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMA

Query:  QPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        QPGILGPG  P      +   SP G+ P+ S RWR +
Subjt:  QPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

AT1G32610.2 hydroxyproline-rich glycoprotein family protein1.1e-3239.47Show/hide
Query:  EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRP---PQNPAKQQSLRLQRIRPPPLTPINRPH
        + LGVNK+GKNIKKSPL                       PQPQ Y+++ NDF +IVQQLT S    S P   P+N  K Q    Q+IRP     INRP 
Subjt:  EHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRP---PQNPAKQQSLRLQRIRPPPLTPINRPH

Query:  VPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------PSIGNQAPQAQVSGQ
        VPPPV         P +  + RP               P      P +  GD     TAES +S YMRY QSS  DS        PS  NQ  Q QV GQ
Subjt:  VPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------PSIGNQAPQAQVSGQ

Query:  VQNQVAAS---GLPPRPDPPIPATHPSTSCPVPSLPN--FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMA
         Q+    S       R  P +P   P    P   + N   P  + N     P+PT Q+   SP+ Y NLLSP+SP PLLS G+++PPPL+P N+ FS M 
Subjt:  VQNQVAAS---GLPPRPDPPIPATHPSTSCPVPSLPN--FPPTQANSPSFFPSPT-QFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSP-NFAFSPMA

Query:  QPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW
        QPGILGPG  P      +   SP G+ P+ S RWR +
Subjt:  QPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDW

AT2G35230.1 VQ motif-containing protein6.2e-5541.01Show/hide
Query:  KQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS--RPPQNPA-KQQSLRLQRIRPPPLTPIN
        +QN+HLGVN++GKNI+KSPLHQ  F +  S         P+ Q QPQVYNI+KNDFR+IVQQLTGS    S  RPPQN + + Q+ RLQRIRP PLT +N
Subjt:  KQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPS--RPPQNPA-KQQSLRLQRIRPPPLTPIN

Query:  RPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------------------
        RP VP P   SM PPQ          + PQ  +        P +TQQ P +   D  W  TAESP+S YMRYLQSS  DS                    
Subjt:  RPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS--------------------

Query:  ---------------------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLP
                             P I     ++ +  Q Q+Q      P       P P P                    P P  H    P  + PVP  P
Subjt:  ---------------------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLP

Query:  NFP-PTQANSPSFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFP
          P P  +     FPSP            +QF  PSP+GY N+ SP+SPYPLLSPG+++P PL+PNF+FS +AQ G LGPG       P PP SPGL+FP
Subjt:  NFP-PTQANSPSFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFP

Query:  WSPSGLFPLLSPRWRDW
         SPSG FP+ SPRW D+
Subjt:  WSPSGLFPLLSPRWRDW

AT2G35230.2 VQ motif-containing protein2.1e-2634.64Show/hide
Query:  MTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS-------------------------------
        M PPQ          + PQ  +        P +TQQ P +   D  W  TAESP+S YMRYLQSS  DS                               
Subjt:  MTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDS-------------------------------

Query:  ----------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLPNFP-PTQANSP
                  P I     ++ +  Q Q+Q      P       P P P                    P P  H    P  + PVP  P  P P  +   
Subjt:  ----------PSIGNQAPQAQVSGQVQNQVAASGLP-------PRPDP--------------------PIPATH----PSTSCPVPSLPNFP-PTQANSP

Query:  SFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFPWSPSGLFPLLS
          FPSP            +QF  PSP+GY N+ SP+SPYPLLSPG+++P PL+PNF+FS +AQ G LGPG       P PP SPGL+FP SPSG FP+ S
Subjt:  SFFPSP------------TQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG-------PHPPLSPGLVFPWSPSGLFPLLS

Query:  PRWRDW
        PRW D+
Subjt:  PRWRDW

AT5G46780.1 VQ motif-containing protein3.8e-2035.06Show/hide
Query:  DYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTG-SSQEPSRPPQ-NPAKQQSLRLQRIRPPPL
        +++ +  + HLGVNKMGKNI+K P +Q N                Q  PQ  VYNINK DFR+IVQQLTG  S     PPQ N  K  + RL ++RP PL
Subjt:  DYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTG-SSQEPSRPPQ-NPAKQQSLRLQRIRPPPL

Query:  TPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQV
        T +N P  PPP P    PP                      +Q  P +++      P +      AESPISAYMRYL    I+S  +GN+        Q 
Subjt:  TPINRPHVPPPVPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQV

Query:  QNQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG
        QNQ      P +P   +  +H +        PN        P  F SP      SP        P+SP+PL           SPNFAFSP    G     
Subjt:  QNQVAASGLPPRPDPPIPATHPSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPG

Query:  PHPPLSPGLVFPWSPSGLFPLLSPRWRD
          PP SPG          FPLLSP W++
Subjt:  PHPPLSPGLVFPWSPSGLFPLLSPRWRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTACTCGAAGAACAAACAGAATGAGCACTTGGGCGTGAACAAAATGGGGAAGAATATAAAGAAAAGTCCACTGCACCAACCTAATTTTGGTAGCATTCCTTCTAC
ACAACAGCCCCATCCTCAGCCTCAGCCTCAGCCTCAGCCTCAGCCTCAGGTTTACAACATCAACAAGAATGATTTTCGGAATATTGTTCAGCAGCTGACTGGTTCTTCCC
AAGAGCCGTCTAGACCACCTCAAAATCCAGCCAAACAACAAAGCTTGAGACTGCAAAGGATACGACCTCCCCCATTAACACCTATCAATCGGCCCCATGTTCCACCTCCA
GTACCCGTTTCCATGACCCCGCCACAGATTCCTTATTACAATGGCTTGATCAGGCCTGCACAACCACAATGTGATCAGTCATCAACAATGCTTCAGGGGCAGCCAGCATC
CACACAACAAGCGCCTCAATTGATACCTGGAGACTCAATTTGGCCAAAGACTGCTGAGTCTCCAATATCTGCCTACATGCGTTATCTTCAAAGTTCAGCTATAGATTCCC
CTTCGATAGGAAACCAGGCTCCACAAGCACAAGTTTCAGGTCAAGTTCAAAACCAAGTGGCTGCCTCTGGTTTACCGCCCAGGCCCGACCCACCCATTCCTGCCACTCAT
CCTAGTACAAGTTGTCCTGTGCCATCTCTTCCTAATTTTCCTCCCACCCAAGCAAACAGTCCTTCATTTTTTCCCTCTCCTACACAGTTTCATGTGCCCTCTCCTTCTGG
TTACTTAAATTTGTTATCACCACAGTCACCTTATCCATTGCTTTCACCTGGAATGCGGTTTCCTCCACCACTGAGTCCTAATTTTGCATTTTCACCCATGGCCCAACCAG
GGATTTTAGGTCCTGGGCCTCATCCTCCGCTTTCTCCTGGCCTTGTATTCCCATGGTCTCCATCAGGATTATTCCCCCTACTGAGTCCAAGATGGAGAGATTGGCAGTCC
TAG
mRNA sequenceShow/hide mRNA sequence
CTGTTGCGCTATTTCATTTCATTGTGGACCAAAAAGATCCGGTGTTTTCACTTTCAGCCGATCCATCGCTAGGGTTTCGGCTTCATCAGCTCATTATTGTTCCGAACAAT
TGTCGACGGATTGGATAAAGAAGATCCAGAGGAATTCAATTTTCTTAGGGATTTGCTTTCTGATTCATTATCCTTTTAGGAGTCTTCTTTTTCCTGTTGGGCGGTTTTGG
GAAAGATTACGATTACTGGGCTTTCTTGTTATCCTGGTCTGAGCATTGAGCAAGTTAAGACTGGGGTTTGAGGAAGTTTTTTTTTTTTAAGGCTTTAAGTTGGGGGATAA
GTCATTGCTTCTCAAATATCTGCTCTGTAGTTCACCTTTTTTTCTTGTTCTTCCTGCTTTTGGACCCCTTTGTGCTTGAATAACTCAAGGGGTGTTGATTTGAGCTTCAA
TTTGAGACTTCTGGTTGATTGGTTGAAGATTTGAGAGTGTTTGAAGCCGTGTATAATGTTTGGCAAGAGAGTGGTTGATTGAGGTTTGAATTTATATTCTAGATTTGTAG
ATTTGTCACGCCGAGGAAATCTGCTGGAGTTTGATCGATCCAGAGTTTTGGATTATGAGTATTTGATTGTGCTCGTGGTTGAGTTTTAAGCCTTTTTATTTTCTGGTTAT
GGATTACTCGAAGAACAAACAGAATGAGCACTTGGGCGTGAACAAAATGGGGAAGAATATAAAGAAAAGTCCACTGCACCAACCTAATTTTGGTAGCATTCCTTCTACAC
AACAGCCCCATCCTCAGCCTCAGCCTCAGCCTCAGCCTCAGCCTCAGGTTTACAACATCAACAAGAATGATTTTCGGAATATTGTTCAGCAGCTGACTGGTTCTTCCCAA
GAGCCGTCTAGACCACCTCAAAATCCAGCCAAACAACAAAGCTTGAGACTGCAAAGGATACGACCTCCCCCATTAACACCTATCAATCGGCCCCATGTTCCACCTCCAGT
ACCCGTTTCCATGACCCCGCCACAGATTCCTTATTACAATGGCTTGATCAGGCCTGCACAACCACAATGTGATCAGTCATCAACAATGCTTCAGGGGCAGCCAGCATCCA
CACAACAAGCGCCTCAATTGATACCTGGAGACTCAATTTGGCCAAAGACTGCTGAGTCTCCAATATCTGCCTACATGCGTTATCTTCAAAGTTCAGCTATAGATTCCCCT
TCGATAGGAAACCAGGCTCCACAAGCACAAGTTTCAGGTCAAGTTCAAAACCAAGTGGCTGCCTCTGGTTTACCGCCCAGGCCCGACCCACCCATTCCTGCCACTCATCC
TAGTACAAGTTGTCCTGTGCCATCTCTTCCTAATTTTCCTCCCACCCAAGCAAACAGTCCTTCATTTTTTCCCTCTCCTACACAGTTTCATGTGCCCTCTCCTTCTGGTT
ACTTAAATTTGTTATCACCACAGTCACCTTATCCATTGCTTTCACCTGGAATGCGGTTTCCTCCACCACTGAGTCCTAATTTTGCATTTTCACCCATGGCCCAACCAGGG
ATTTTAGGTCCTGGGCCTCATCCTCCGCTTTCTCCTGGCCTTGTATTCCCATGGTCTCCATCAGGATTATTCCCCCTACTGAGTCCAAGATGGAGAGATTGGCAGTCCTA
GTTTTCAGTGTATTATATTTTAAAGTGCCAAGCTTACCTTACCACTTCCTCACAGGAGATTGTAACATATAGAGGTGGTGGAAAAGGTCATTTTGGTTGTTTGCACTGTC
TGTTGCTTATTTTACATTCTTTGTTTTGCTTTTGTCTACACCCCATTTTGCAGCTTGATCAGCTTATGGATCCTGTTTCAGCTACTTTACCCTCCTTTTGGAGGAAGAGA
AAGAAAAAAGAAAGCATCAAGTAAAGAGGTTTTCATTAAGCAATGATCAATAGTTCAGGCCATTGGGCTTCATATTGAATGGAAAAGAGCTGTAGCACGAAAGTTAATCA
CAGAATGTCAAATATCAAGATACGTTATAGTTGAAGATTTCTTATCATGTTGTAGTTGTGGATTATTATGTAGGAAGTTTGGTTTCCTTTCATTTAATTCATTGAACTCA
TTTGTTGAAGGAACTGTGAAGTGGAGAAATCAGATATATACTGCATGTTGTTCTGTTCAAGGAAGTACATAGTTTGATCTACGAACTCTTTGCTGAGGAAGGATTCAAGT
TGCTTATAAAATTGTGGGTTATTTTGGGTGTTTCTTATGCAAATTTGAGCCTAGCTCAATTGGTTAAGTCACTATCTTTGATCAAAATATTATGAGTTCGAATCTCCACC
TCAGCAAAAAAAGTGTCTCTTTCCTTTATATATTTCAATTCTCGCCCACCTAAAAACTATGGTCAAAATAAATCTTAAACACGATGAAGGTAACTAAAATCATGTCCATG
ACTTCAATTTTCATGATGTGTTATCCATTCTTCCCACCTTTCCTTTAAATAAAGATTCTATTGC
Protein sequenceShow/hide protein sequence
MDYSKNKQNEHLGVNKMGKNIKKSPLHQPNFGSIPSTQQPHPQPQPQPQPQPQVYNINKNDFRNIVQQLTGSSQEPSRPPQNPAKQQSLRLQRIRPPPLTPINRPHVPPP
VPVSMTPPQIPYYNGLIRPAQPQCDQSSTMLQGQPASTQQAPQLIPGDSIWPKTAESPISAYMRYLQSSAIDSPSIGNQAPQAQVSGQVQNQVAASGLPPRPDPPIPATH
PSTSCPVPSLPNFPPTQANSPSFFPSPTQFHVPSPSGYLNLLSPQSPYPLLSPGMRFPPPLSPNFAFSPMAQPGILGPGPHPPLSPGLVFPWSPSGLFPLLSPRWRDWQS