CuGenDBv2

CmaCh07G012020 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh07G012020
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: KH domain-containing protein
Genome location: Cma_Chr07:6601472..6607764
RNA-Seq Expression: CmaCh07G012020
Synteny: CmaCh07G012020
Gene Ontology terms:
  GO:0048024 - regulation of mRNA splicing, via spliceosome (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003729 - mRNA binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
InterPro domains:
  IPR004087 - K Homology domain
  IPR004088 - K Homology domain, type 1
  IPR032377 - STAR protein, homodimerisation region
  IPR036612 - K Homology domain, type 1 superfamily
  IPR045071 - KH domain-containing BBP-like
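
The Genome location field packs the chromosome name and both coordinates into one string. A minimal Python sketch of splitting it and computing the gene span is given here. The helper name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cma_Chr07:6601472..6607764")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Cma_Chr07 6601472 6607764 6293
```

Under that assumption the gene spans 6293 bp of Cma_Chr07.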


Homology
GenBank top hits | e value | % identity | Alignment
XP_022925242.1 KH domain-containing protein At3g08620-like isoform X1 [Cucurbita moschata] | 7.7e-157 | 99.65
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFM VLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

XP_022925243.1 KH domain-containing protein At3g08620-like isoform X2 [Cucurbita moschata] | 7.2e-155 | 99.29
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFM VLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQE RLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

XP_022966419.1 KH domain-containing protein At3g08620-like isoform X1 [Cucurbita maxima] | 9.1e-158 | 100
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

XP_022966420.1 KH domain-containing protein At3g08620-like isoform X2 [Cucurbita maxima] | 8.5e-156 | 99.65
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQE RLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

XP_023518806.1 KH domain-containing protein At3g08620-like isoform X1 [Cucurbita pepo subsp. pepo] | 2.2e-156 | 99.29
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDV+SQYLSELLAEHQKFGPFM VLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CIV3 KH domain-containing protein At2g38610 | 1.0e-151 | 96.81
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFM VLPIC RLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNL+SNVSS GLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQE RLSR PGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIE DLP N+VDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

A0A6J1EB88 KH domain-containing protein At3g08620-like isoform X1 | 3.7e-157 | 99.65
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFM VLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

A0A6J1EEN1 KH domain-containing protein At3g08620-like isoform X2 | 3.5e-155 | 99.29
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFM VLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQE RLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

A0A6J1HRK9 KH domain-containing protein At3g08620-like isoform X2 | 4.1e-156 | 99.65
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQE RLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

A0A6J1HTR7 KH domain-containing protein At3g08620-like isoform X1 | 4.4e-158 | 100
Query:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
        MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG
Subjt:  MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSG

Query:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
        WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
Subjt:  WNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

SwissProt top hits | e value | % identity | Alignment
Q0WLR1 KH domain-containing protein At4g26480 | 1.9e-78 | 55.71
Query:  SGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSGW
        S   S NFS    + P      +   +YLSELLAE  K  PF+ VLP   RL+NQEILRV+ ++ N     L + R   PSP+AS  +  N S A ++GW
Subjt:  SGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSGW

Query:  -NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
         +  P E+ +S  P    +W ++P S S L VKR +R++IPVD YPN+NFVGRLLGPRGNSLKRVE +T CRV IRG+GSIKDP KE+ +RG+PGYEHLN
Subjt:  -NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT
        EPLHIL+E +LP+ IVD RL QA+EI+++LL PV+E+HD+ K+QQLRELA+LN S REE     GS+SP+NS GMKRAKT
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT

Q75GR5 KH domain-containing protein SPIN1 | 2.1e-112 | 72.44
Query:  MSGMYSTNFSPARTASPHIRTTP-DVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLS
        MSG+YS  FSPAR  SP IR+ P DVDSQYL+ELLAEHQK GPFM VLPICS+LL+QEI+RVS ++ N GF D DR R RSPSPM+S N  SN S  G S
Subjt:  MSGMYSTNFSPARTASPHIRTTP-DVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLS

Query:  GWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHL
         WNGL QE RL  P G +MDWQ AP SPSS  VK+ILRL++PVD+YPNFNFVGR+LGPRGNSLKRVE +TGCRV+IRGKGSIKDP KE+KLRG+PGYEHL
Subjt:  GWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHL

Query:  NEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        ++PLHILIE + P +I+D RLR AQE+IEELLKPVDES D+ KRQQLRELAMLNS+ RE+SP P GSVSPF++ GMKRAKTG+
Subjt:  NEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

Q8GYR4 KH domain-containing protein At3g08620 | 3.0e-127 | 79.93
Query:  MSGMYS-TNFSPARTASPHIRT-TPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGL
        MSG+Y+  NFSP+R ASP IRT + DVDSQY+S+LLAEHQK GPFM VLPICSRLLNQEI R++GMM NQGF D DRLRHRSPSPMAS NL+SNVS  GL
Subjt:  MSGMYS-TNFSPARTASPHIRT-TPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGL

Query:  SGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEH
         GWNGLP E R+  P GM M+WQ APASPSS  VKRILRL++PVDTYPNFNFVGRLLGPRGNSLKRVE TTGCRVYIRGKGSIKDP+KEEKL+G+PGYEH
Subjt:  SGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEH

Query:  LNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        LNE LHILIE DLP++IVDI+LRQAQEIIEEL+KPVDES DYIKRQQLRELA+LNS+ RE SPGP GSVSPFNS+ MKR KTGR
Subjt:  LNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

Q9FKT4 KH domain-containing protein At5g56140 | 2.2e-77 | 55.24
Query:  YSTNFS--PARTASPH----IRTTPDV---DSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSS
        YS++ S  P+   SP+    +R+   V     +YLSELLAE  K  PF+ VLP   RLLNQEILRV+ ++ N        L H  PSP+AS  +  N + 
Subjt:  YSTNFS--PARTASPH----IRTTPDV---DSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSS

Query:  AGLSGW-NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRP
        A ++GW +  P E+ +   PG   +W ++P S S L  KR +R++IPVD YPNFNFVGRLLGPRGNSLKRVE +T CRV IRG+GSIKDP KEE +RG+P
Subjt:  AGLSGW-NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRP

Query:  GYEHLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT
        GYEHLNEPLHIL+E +LP+ IVD RL QA+EI+++LL P++E+HD  K+QQLRELA+LN + REE     GSVSP+NS GMKRAKT
Subjt:  GYEHLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT

Q9ZVI3 KH domain-containing protein At2g38610 | 7.3e-126 | 82.58
Query:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG
        MSG+Y  S+ FSPAR ASP IR+TP++D SQYL+ELLAEHQK  PFM VLPICSRLLNQE+ RVSGMMSNQGF D DRLRHRSPSPMASSNL+SNVS+ G
Subjt:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG

Query:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE
        L GWNGL QE RLS  PGMTMDWQ AP SPSS TVKRILRLEIPVD YPNFNFVGRLLGPRGNSLKRVE TTGCRV+IRGKGSIKDP+KE+KLRGRPGYE
Subjt:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE

Query:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG
        HLNE LHILIE DLP +IV+IRLRQAQEIIEELLKPVDES D+IKRQQLRELA+LNS + REESPGP  GGSVSPFNSSG KR KTG
Subjt:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG

Arabidopsis top hits | e value | % identity | Alignment
AT2G38610.1 RNA-binding KH domain-containing protein | 5.2e-127 | 82.58
Query:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG
        MSG+Y  S+ FSPAR ASP IR+TP++D SQYL+ELLAEHQK  PFM VLPICSRLLNQE+ RVSGMMSNQGF D DRLRHRSPSPMASSNL+SNVS+ G
Subjt:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG

Query:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE
        L GWNGL QE RLS  PGMTMDWQ AP SPSS TVKRILRLEIPVD YPNFNFVGRLLGPRGNSLKRVE TTGCRV+IRGKGSIKDP+KE+KLRGRPGYE
Subjt:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE

Query:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG
        HLNE LHILIE DLP +IV+IRLRQAQEIIEELLKPVDES D+IKRQQLRELA+LNS + REESPGP  GGSVSPFNSSG KR KTG
Subjt:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG

AT2G38610.2 RNA-binding KH domain-containing protein | 5.2e-127 | 82.58
Query:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG
        MSG+Y  S+ FSPAR ASP IR+TP++D SQYL+ELLAEHQK  PFM VLPICSRLLNQE+ RVSGMMSNQGF D DRLRHRSPSPMASSNL+SNVS+ G
Subjt:  MSGMY--STNFSPARTASPHIRTTPDVD-SQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAG

Query:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE
        L GWNGL QE RLS  PGMTMDWQ AP SPSS TVKRILRLEIPVD YPNFNFVGRLLGPRGNSLKRVE TTGCRV+IRGKGSIKDP+KE+KLRGRPGYE
Subjt:  LSGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYE

Query:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG
        HLNE LHILIE DLP +IV+IRLRQAQEIIEELLKPVDES D+IKRQQLRELA+LNS + REESPGP  GGSVSPFNSSG KR KTG
Subjt:  HLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNS-SFREESPGP--GGSVSPFNSSGMKRAKTG

AT3G08620.1 RNA-binding KH domain-containing protein | 2.1e-128 | 79.93
Query:  MSGMYS-TNFSPARTASPHIRT-TPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGL
        MSG+Y+  NFSP+R ASP IRT + DVDSQY+S+LLAEHQK GPFM VLPICSRLLNQEI R++GMM NQGF D DRLRHRSPSPMAS NL+SNVS  GL
Subjt:  MSGMYS-TNFSPARTASPHIRT-TPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGL

Query:  SGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEH
         GWNGLP E R+  P GM M+WQ APASPSS  VKRILRL++PVDTYPNFNFVGRLLGPRGNSLKRVE TTGCRVYIRGKGSIKDP+KEEKL+G+PGYEH
Subjt:  SGWNGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEH

Query:  LNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR
        LNE LHILIE DLP++IVDI+LRQAQEIIEEL+KPVDES DYIKRQQLRELA+LNS+ RE SPGP GSVSPFNS+ MKR KTGR
Subjt:  LNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR

AT4G26480.1 RNA-binding KH domain-containing protein | 1.4e-79 | 55.71
Query:  SGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSGW
        S   S NFS    + P      +   +YLSELLAE  K  PF+ VLP   RL+NQEILRV+ ++ N     L + R   PSP+AS  +  N S A ++GW
Subjt:  SGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSGW

Query:  -NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN
         +  P E+ +S  P    +W ++P S S L VKR +R++IPVD YPN+NFVGRLLGPRGNSLKRVE +T CRV IRG+GSIKDP KE+ +RG+PGYEHLN
Subjt:  -NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLN

Query:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT
        EPLHIL+E +LP+ IVD RL QA+EI+++LL PV+E+HD+ K+QQLRELA+LN S REE     GS+SP+NS GMKRAKT
Subjt:  EPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT

AT5G56140.1 RNA-binding KH domain-containing protein | 1.5e-78 | 55.24
Query:  YSTNFS--PARTASPH----IRTTPDV---DSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSS
        YS++ S  P+   SP+    +R+   V     +YLSELLAE  K  PF+ VLP   RLLNQEILRV+ ++ N        L H  PSP+AS  +  N + 
Subjt:  YSTNFS--PARTASPH----IRTTPDV---DSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSS

Query:  AGLSGW-NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRP
        A ++GW +  P E+ +   PG   +W ++P S S L  KR +R++IPVD YPNFNFVGRLLGPRGNSLKRVE +T CRV IRG+GSIKDP KEE +RG+P
Subjt:  AGLSGW-NGLPQEQRLSRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRP

Query:  GYEHLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT
        GYEHLNEPLHIL+E +LP+ IVD RL QA+EI+++LL P++E+HD  K+QQLRELA+LN + REE     GSVSP+NS GMKRAKT
Subjt:  GYEHLNEPLHILIEGDLPVNIVDIRLRQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKT
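
The % identity column can be related to the alignment rows with a short Python sketch. It assumes the unlabeled middle row follows the usual BLAST convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch) and works from the Query and middle rows only. The function name count_identities is hypothetical.

```python
def count_identities(query, midline):
    """Return (identical, aligned) for one alignment block.

    A column is identical when the midline repeats the query residue.
    """
    # Trailing spaces in the midline may have been stripped; pad to query length.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    return identical, len(query)


# A 15-residue window taken from the first GenBank hit.
identical, aligned = count_identities("HQKFGPFMHVLPICS", "HQKFGPFM VLPICS")
print(identical, aligned)  # 14 15

# One mismatch over the full 282-column alignment gives the reported value.
print(round(100 * 281 / 282, 2))  # 99.65
```

Summing the counts over all blocks of one hit and dividing by the total number of aligned columns gives the percentage for that hit.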


Sequences
CDS sequence
ATGTCAGGTATGTACAGTACCAATTTCTCCCCGGCAAGAACTGCTTCGCCTCACATCAGAACCACTCCTGATGTCGACAGTCAGTACTTGTCGGAGCTTTTGGCAGAGCA
TCAGAAGTTCGGCCCCTTCATGCATGTTCTTCCGATCTGCAGCCGACTTTTGAATCAAGAGATATTGCGGGTTTCTGGAATGATGTCCAACCAGGGTTTCTGTGACCTCG
ATAGGCTTCGGCATAGAAGTCCTAGTCCAATGGCTTCTTCGAACCTTATATCTAATGTTAGCAGCGCAGGCCTGAGTGGCTGGAATGGTCTTCCTCAGGAGCAGAGGTTA
AGTAGACCCCCTGGGATGACAATGGACTGGCAAAGTGCACCTGCAAGTCCTAGCTCGCTCACAGTAAAGAGGATTCTACGCCTGGAAATACCTGTGGATACTTACCCCAA
TTTCAATTTTGTAGGGCGGCTTTTGGGACCTCGTGGTAATTCCTTGAAACGGGTGGAAGTTACTACAGGTTGTCGTGTATACATTCGAGGGAAAGGATCGATAAAGGATC
CAGACAAGGAAGAGAAATTAAGGGGACGACCTGGCTATGAGCACTTGAACGAACCGTTGCACATCTTGATTGAGGGGGATTTACCAGTCAATATTGTTGATATAAGGCTC
AGACAAGCACAGGAAATTATTGAAGAACTGCTCAAACCTGTGGACGAGAGCCATGATTACATTAAGAGGCAGCAGTTACGCGAACTCGCAATGCTAAACTCGAGCTTCCG
AGAAGAAAGTCCAGGACCTGGTGGTAGTGTCTCTCCATTCAATTCAAGTGGAATGAAACGTGCCAAAACAGGTCGTTAA
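
The CDS above is 849 nt long (283 codons including the stop), which corresponds to the 282-residue protein listed in this record. A minimal Python sketch of checking that correspondence on the first and last codons is given here. It assumes the standard genetic code (NCBI translation table 1), and the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 15 nt of the CDS in this record.
print(translate("ATGTCAGGTATGTACAGTACCAATTTCTCC"))  # MSGMYSTNFS
print(translate("AAAACAGGTCGTTAA"))  # KTGR*
```

Both fragments agree with the start (MSGMYSTNFS) and end (KTGR) of the protein sequence in this record.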
mRNA sequence
ACAATCATAGATTTTCCACATTTCAGCCTCCATATTTACAGGCATCCACCCTTACACTCCTTAGCTCCGCCTCCGCCTCCCTCGTCGGAGCTCTCCTCATCCACCGTATT
CTCCGGCGTCATCGCCGCCTCAATCTTCCAAAGACCCTTCTTCTACTTTCCAATCCCCAATTCTCTGTATCCATCTCTGCTCAGAGCAATAAACACAAAGTGCTATGTCA
GGTATGTACAGTACCAATTTCTCCCCGGCAAGAACTGCTTCGCCTCACATCAGAACCACTCCTGATGTCGACAGTCAGTACTTGTCGGAGCTTTTGGCAGAGCATCAGAA
GTTCGGCCCCTTCATGCATGTTCTTCCGATCTGCAGCCGACTTTTGAATCAAGAGATATTGCGGGTTTCTGGAATGATGTCCAACCAGGGTTTCTGTGACCTCGATAGGC
TTCGGCATAGAAGTCCTAGTCCAATGGCTTCTTCGAACCTTATATCTAATGTTAGCAGCGCAGGCCTGAGTGGCTGGAATGGTCTTCCTCAGGAGCAGAGGTTAAGTAGA
CCCCCTGGGATGACAATGGACTGGCAAAGTGCACCTGCAAGTCCTAGCTCGCTCACAGTAAAGAGGATTCTACGCCTGGAAATACCTGTGGATACTTACCCCAATTTCAA
TTTTGTAGGGCGGCTTTTGGGACCTCGTGGTAATTCCTTGAAACGGGTGGAAGTTACTACAGGTTGTCGTGTATACATTCGAGGGAAAGGATCGATAAAGGATCCAGACA
AGGAAGAGAAATTAAGGGGACGACCTGGCTATGAGCACTTGAACGAACCGTTGCACATCTTGATTGAGGGGGATTTACCAGTCAATATTGTTGATATAAGGCTCAGACAA
GCACAGGAAATTATTGAAGAACTGCTCAAACCTGTGGACGAGAGCCATGATTACATTAAGAGGCAGCAGTTACGCGAACTCGCAATGCTAAACTCGAGCTTCCGAGAAGA
AAGTCCAGGACCTGGTGGTAGTGTCTCTCCATTCAATTCAAGTGGAATGAAACGTGCCAAAACAGGTCGTTAAGCGCTCACATCATTCTCTACCCTAAACAATTTCCACC
CCAGTCCAGAAATTCGGTCACTCGTATACCTCAAAGTGAATCACAACCAACCGTGTGACATTGCTACAATCACGAGAAAGGGGAGAGCCTAACTCGAGTTCCCGGTGAGC
GTTCTGCTGGCTTTTTCTGTGTTTCTTTAATATTTTTGACAGAAAAGACTTGAAAAAGAAAATAAAAGAGAGGGTGAAAGAAAACAAAAAAAGTGTTAGTATCAGTAGGG
AATTCTCACGTTGAGTGTAGGTTTGAATAGGTCAGTTTTTCCAACAAAGTTATATATTCATTGGATTTATTCTTGTGTTAGTGACTGCTGAGGAGTGTGTAGTTTCTACA
TTATTGATGAAAAAACATGAAGAACATTAGGAAGTGATGATGTTGTACTGTTTGATATCTGTTAGGGTATTGTATGCAATGTCAATGTCAATGTCAATTATATTTTTGGT
GCCACTCGGTTTGTGTTTGCTATCAGTTTCATAATAGATTCTTCCTTTTTGTCTATCATTTTGTGAAAAAATCGCAAATATATGCCTTGACTACTGACAATGTGTTCTTG
G
Protein sequence
MSGMYSTNFSPARTASPHIRTTPDVDSQYLSELLAEHQKFGPFMHVLPICSRLLNQEILRVSGMMSNQGFCDLDRLRHRSPSPMASSNLISNVSSAGLSGWNGLPQEQRL
SRPPGMTMDWQSAPASPSSLTVKRILRLEIPVDTYPNFNFVGRLLGPRGNSLKRVEVTTGCRVYIRGKGSIKDPDKEEKLRGRPGYEHLNEPLHILIEGDLPVNIVDIRL
RQAQEIIEELLKPVDESHDYIKRQQLRELAMLNSSFREESPGPGGSVSPFNSSGMKRAKTGR