; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G218100 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G218100
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmU531Chr11:26464354..26466456
RNA-Seq ExpressionCmUC11G218100
SyntenyCmUC11G218100
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591691.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.7e-25790.67Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SP RLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY SILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNG CKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVFDEMKSLGM+PDT+SYTTLMNCLCRTGRV+EATEL Q+MKD+DCRADT+TLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIVS
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL++EE+L VS
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIVS

XP_023536308.1 pentatricopeptide repeat-containing protein At5g18475 isoform X1 [Cucurbita pepo subsp. pepo]6.1e-25791.22Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SPTRLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVFDEMKSLGM+PDT+SYTTLMNCLCRTGRV+EA ELLQQMKD+DCRAD VTLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL+ EE+L
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL

XP_023536309.1 pentatricopeptide repeat-containing protein At5g18475 isoform X2 [Cucurbita pepo subsp. pepo]6.1e-25791.22Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SPTRLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVFDEMKSLGM+PDT+SYTTLMNCLCRTGRV+EA ELLQQMKD+DCRAD VTLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL+ EE+L
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL

XP_023536311.1 pentatricopeptide repeat-containing protein At5g18475 isoform X4 [Cucurbita pepo subsp. pepo]6.1e-25791.06Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SPTRLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVFDEMKSLGM+PDT+SYTTLMNCLCRTGRV+EA ELLQQMKD+DCRAD VTLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIV
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL+ EE+L V
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIV

XP_038900183.1 pentatricopeptide repeat-containing protein At5g18475 [Benincasa hispida]2.7e-26592.63Show/hide
Query:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ
        MNLA+FSSRFMNLAIV VSKAIIS+S  PTRLFTTS + QKKSKSSYISHETA+RLIKHERDPQHALE+FNMVSEQKGFNHNNATYASILQKLAKSKKFQ
Subjt:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ

Query:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH
        AIDGVLHQMTYDTCKLHEGIFL+LMKH+SKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVES+RVDLA+KLLVNARS+LNLRPNTCIFNILVKHH
Subjt:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH

Query:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY
        CR  DLQAAFEVV+EMKSARVSYP+LITYSTL+GGLCESGKL+EAIELFEEMVSKDKILPDALTYNILINGFCQ GKVDRARRIVEFMKSNGC+PNVFNY
Subjt:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY

Query:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL
        SVLMNGFCKEGRLQEAKEVFDEMKSLGM+PDTISYTTL+NCLCRTGRV+EATELLQQMK +DCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL
Subjt:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL

Query:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT
        NKGSYRIVLN LSQKGELKRA ELLGLMLNRGFVPHYATSNNLLVLLCN+GMV  AVESLFGLLEMGFKPEP SWFTLVDLICRERKMLPVFELL+EL+T
Subjt:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT

Query:  EE
        EE
Subjt:  EE

TrEMBL top hitse value%identityAlignment
A0A0A0LGW7 Uncharacterized protein1.1e-25487.7Show/hide
Query:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ
        MN+A+FSSRF NLAI WVSK +IS+S +  RLF TS + QKKSKSSYISHETAI+LIK+ERDPQHAL+IFNMVSEQ+GFNHN+ATYASI+Q LAK KKFQ
Subjt:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ

Query:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH
        AIDGVLHQMTYDTCK+HEGIFLNLMKH+SKSSMHERVLDMFYAI+SIVR+KPSLKAIS CLNLLVES+RVDLARKLLVNARSKLNLRPNTCIFNILVKHH
Subjt:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH

Query:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY
        CR GDLQAAFEVV+EMKSARVSYPNL+TYSTL+GGLCE+GKLKEAIE FEEMVSKD ILPDALTYNILINGFCQ GKVDRAR I+EFMKSNGC+PNVFNY
Subjt:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY

Query:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL
        SVLMNG+CKEGRLQEAKEVF+E+KSLGM+PDTISYTTL+NCLCRTGRV+EATELLQQMKDKDCRADTVT NVMLGGLCREGRF+EALDMVQKLPFEGFYL
Subjt:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL

Query:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT
        NKGSYRIVLN L+QKGEL++A ELLGLMLNRGFVPH+ATSN LL+LLCN+GMVK AVESL GLLEMGFKPE ESWFTLVDLICRERKMLPVFELLD LVT
Subjt:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT

Query:  EEYL
        +EYL
Subjt:  EEYL

A0A5A7T5D4 Pentatricopeptide repeat-containing protein1.2e-25387.9Show/hide
Query:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ
        MNLA+FSSRF NLAI WVSK   S+S +  RLF TS + QKKSKSSY+SHETAI+LIK+ERDPQHAL+IFNMVSEQ+GFNHN+ATYASI+QKLAKSKKFQ
Subjt:  MNLAVFSSRFMNLAIVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQ

Query:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH
        AIDGVLHQMTYDTCK+HEGIFLNLMKH+S SSMHERVLDMFYAI+SIVR+KPSLKA S CLNLLVES+RVDLARKLLVNARSKLNLRPNTCIFNILVKHH
Subjt:  AIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHH

Query:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY
        CR GDLQAAFEVV+EMKSARVSYPNL+TYSTL+GGLCE+GKLKEAIE FEEMVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC PNVFNY
Subjt:  CRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNY

Query:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL
        S LMNG+CKEGRLQEAKEVF+E+KSLGM+PDTISYTTL+NCLCRTGRV+EATELLQQMKDKDCRADTVT NVMLGGLCREGRFEEALDMVQKLPFEGFYL
Subjt:  SVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYL

Query:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT
        NKGSYRIVLN L+QKGEL+RA ELLGLMLNRGFVPH+ATSN LL+LLCN+GMVK AVESL GLLEMGFKPE ESWFTLV+LICRERKMLP+FELLDELVT
Subjt:  NKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT

Query:  EEYL
        E+YL
Subjt:  EEYL

A0A6J1FAE2 pentatricopeptide repeat-containing protein At5g18475 isoform X18.6e-25790.82Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SP RLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVF+EMKSLGM+PDT+SYTTLMNCLCRTGRV+EATELLQ+MKD+DCRADT+TLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL+ EE+L
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL

A0A6J1FFA8 pentatricopeptide repeat-containing protein At5g18475 isoform X28.6e-25790.82Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IVW+SKAIIS+  SP RLFTTSI+AQKKSKSSYISHETAIRLIK+ERDPQHALEIFNMVSEQKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVKHHCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVF+EMKSLGM+PDT+SYTTLMNCLCRTGRV+EATELLQ+MKD+DCRADT+TLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL
        KGELKRA ELLGLMLNRGFVPHYATSN+LLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDEL+ EE+L
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYL

A0A6J1IJ31 pentatricopeptide repeat-containing protein At5g184751.6e-25590.47Show/hide
Query:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC
        IV +SKAIIS+S SPTRLFTTSI+AQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVS+QKGFNHNNATY  ILQ+LAKSKKFQAIDGVLHQMTYDTC
Subjt:  IVWVSKAIISESTSPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTC

Query:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR
        +LHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAIS CLNLLVE++RVDLARKLLVNA SKLNL PNTCIFNILVK+HCR G+LQAAFEVVR
Subjt:  KLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVR

Query:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ
        EMKSARVSYPNLITYSTL+GGLC+SGKLKEAIELFE MVSKDKILPDALTYNILINGFCQ GKVDRAR+IVEFMKSNGC+PNVFNYSVLMNGFCKEG+LQ
Subjt:  EMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQ

Query:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ
        EAKEVFDEMKSLGM+PDT+ +TTLMNCLCRTGRV+EATELLQQMKD+DCRAD+VTLNV+LGGLCREGRFEEALDMVQKLPFEG+YLNKGSYRIVLN LSQ
Subjt:  EAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ

Query:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIVS
         GELKRA +LLGLMLNRGFVPHYATSNNLLVLLCNSGMVK AVESL GLLEMGFKPEP+SWF+LVDLICRE+K+LPVFELLDELV EE+L VS
Subjt:  KGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIVS

SwissProt top hitse value%identityAlignment
Q3E9F0 Pentatricopeptide repeat-containing protein At5g184752.0e-15755.47Show/hide
Query:  RFMNLAIVWVSKAIISE-STSPTRLFTTSIK-AQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVL
        RF + +  WVS    SE    P+    +SI   +   K+ +ISHE+A+ L+K ERDPQ  L+IFN  S+QKGFNHNNATY+ +L  L + KKF A+D +L
Subjt:  RFMNLAIVWVSKAIISE-STSPTRLFTTSIK-AQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVL

Query:  HQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDL
        HQM Y+TC+  E +FLNLM+H+S+S +H++V++MF  IQ I R KPSL AIS CLNLL++S  V+L+RKLL+ A+  L L+PNTCIFNILVKHHC+ GD+
Subjt:  HQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDL

Query:  QAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNG
          AF VV EMK + +SYPN ITYSTL+  L    + KEA+ELFE+M+SK+ I PD +T+N++INGFC+AG+V+RA++I++FMK NGCNPNV+NYS LMNG
Subjt:  QAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNG

Query:  FCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYR
        FCK G++QEAK+ FDE+K  G++ DT+ YTTLMNC CR G  +EA +LL +MK   CRADT+T NV+L GL  EGR EEAL M+ +   EG +LNKGSYR
Subjt:  FCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYR

Query:  IVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT
        I+LN L   GEL++A + L +M  RG  PH+AT N L+V LC SG  +  V  L G L +G  P P+SW  +V+ IC+ERK++ VFELLD LV+
Subjt:  IVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099004.8e-5535.37Show/hide
Query:  PNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEF
        P+   +N+++  +C+ G++  A  V+  M  +    P+++TY+T+L  LC+SGKLK+A+E+ + M+ +D   PD +TY ILI   C+   V  A ++++ 
Subjt:  PNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEF

Query:  MKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEAL
        M+  GC P+V  Y+VL+NG CKEGRL EA +  ++M S G +P+ I++  ++  +C TGR  +A +LL  M  K      VT N+++  LCR+G    A+
Subjt:  MKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEAL

Query:  DMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERK
        D+++K+P  G   N  SY  +L+   ++ ++ RA E L  M++RG  P   T N +L  LC  G V+ AVE L  L   G  P   ++ T++D + +  K
Subjt:  DMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERK

Query:  MLPVFELLDEL
             +LLDE+
Subjt:  MLPVFELLDEL

Q9FFE3 Pentatricopeptide repeat-containing protein At5g16420, mitochondrial1.5e-5630.23Show/hide
Query:  IRLIKHERDPQHALEIFNMVSE-QKGFNHNNATYASILQKLAKSKKFQAIDGVLHQM--TYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQ
        + +I  +++   AL+IF    +   GF HN  TY SIL KL++++ F  ++ ++  +  +Y   K  E +F++L+++Y  +  +E  + +F  I      
Subjt:  IRLIKHERDPQHALEIFNMVSE-QKGFNHNNATYASILQKLAKSKKFQAIDGVLHQM--TYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQ

Query:  KPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFE
        K S+++++  LN+L+++ R DL   +  N++    + PN    N+LVK  C+  D+++A++V+ E+ S  +  PNL+TY+T+LGG    G ++ A  + E
Subjt:  KPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFE

Query:  EMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNE
        EM+ +    PDA TY +L++G+C+ G+   A  +++ M+ N   PN   Y V++   CKE +  EA+ +FDEM      PD+     +++ LC   +V+E
Subjt:  EMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNE

Query:  ATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNS
        A  L ++M   +C  D   L+ ++  LC+EGR  EA  +  +   +G   +  +Y  ++  + +KGEL  A  L   M  R   P+  T N L+  L  +
Subjt:  ATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNS

Query:  GMVKGAVESLFGLLEMGFKPEPESWFTLVD
        G VK  V  L  +LE+G  P   ++  L +
Subjt:  GMVKGAVESLFGLLEMGFKPEPESWFTLVD

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461001.2e-5631.53Show/hide
Query:  SSYISHETAIRLIKHERDPQHALEIFNMVSEQ--KGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFY
        S  I+    I+L++ E+D + ++ +F+  + +   G+ H+ +++  ++ +L  + KF+A + ++ +M  + C + E I L++ + Y +       L +F+
Subjt:  SSYISHETAIRLIKHERDPQHALEIFNMVSEQ--KGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFY

Query:  AIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCR-IGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGK
         ++      PS KA    L +LVE N+++LA K   N R ++ L P     N+L+K  CR  G + A  ++  EM   R   P+  TY TL+ GLC  G+
Subjt:  AIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCR-IGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGK

Query:  LKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNC
        + EA +LF EMV KD   P  +TY  LING C +  VD A R +E MKS G  PNVF YS LM+G CK+GR  +A E+F+ M + G RP+ ++YTTL+  
Subjt:  LKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNC

Query:  LCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ------KGELKRANELLGLMLNRGFVP
        LC+  ++ EA ELL +M  +  + D      ++ G C   +F EA + + ++   G   N+ ++ I +   ++           RA  L   M +RG   
Subjt:  LCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ------KGELKRANELLGLMLNRGFVP

Query:  HYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLV
           T  +L+  LC  G  + AV+ +  ++  G  P   +W  L+
Subjt:  HYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLV

Q9LQQ1 Pentatricopeptide repeat-containing protein At1g07740, mitochondrial4.4e-5631.63Show/hide
Query:  IKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKA
        +K   DP+ AL +F+   E  GF H+  +Y+S++ KLAKS+ F A+D +L  + Y   +  E +F+ L++HY K+   ++ +D+F+ I S    + ++++
Subjt:  IKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKA

Query:  ISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKD
        ++  +N+LV++  ++ A+     A+  + LRPN+  FNIL+K      D +AA +V  EM    V  P+++TY++L+G LC +  + +A  L E+M+ K 
Subjt:  ISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKD

Query:  KILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQ
        +I P+A+T+ +L+ G C  G+ + A++++  M+  GC P + NY +LM+   K GR+ EAK +  EMK   ++PD + Y  L+N LC   RV EA  +L 
Subjt:  KILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQ

Query:  QMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLC
        +M+ K C+ +  T  +M+ G CR   F+  L+++  +          ++  ++  L + G L  A  +L +M  +          NLL  LC
Subjt:  QMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLC

Arabidopsis top hitse value%identityAlignment
AT1G07740.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-5731.63Show/hide
Query:  IKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKA
        +K   DP+ AL +F+   E  GF H+  +Y+S++ KLAKS+ F A+D +L  + Y   +  E +F+ L++HY K+   ++ +D+F+ I S    + ++++
Subjt:  IKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKA

Query:  ISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKD
        ++  +N+LV++  ++ A+     A+  + LRPN+  FNIL+K      D +AA +V  EM    V  P+++TY++L+G LC +  + +A  L E+M+ K 
Subjt:  ISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKD

Query:  KILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQ
        +I P+A+T+ +L+ G C  G+ + A++++  M+  GC P + NY +LM+   K GR+ EAK +  EMK   ++PD + Y  L+N LC   RV EA  +L 
Subjt:  KILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQ

Query:  QMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLC
        +M+ K C+ +  T  +M+ G CR   F+  L+++  +          ++  ++  L + G L  A  +L +M  +          NLL  LC
Subjt:  QMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLC

AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.4e-5635.37Show/hide
Query:  PNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEF
        P+   +N+++  +C+ G++  A  V+  M  +    P+++TY+T+L  LC+SGKLK+A+E+ + M+ +D   PD +TY ILI   C+   V  A ++++ 
Subjt:  PNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEF

Query:  MKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEAL
        M+  GC P+V  Y+VL+NG CKEGRL EA +  ++M S G +P+ I++  ++  +C TGR  +A +LL  M  K      VT N+++  LCR+G    A+
Subjt:  MKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEAL

Query:  DMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERK
        D+++K+P  G   N  SY  +L+   ++ ++ RA E L  M++RG  P   T N +L  LC  G V+ AVE L  L   G  P   ++ T++D + +  K
Subjt:  DMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERK

Query:  MLPVFELLDEL
             +LLDE+
Subjt:  MLPVFELLDEL

AT5G16420.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.1e-5730.23Show/hide
Query:  IRLIKHERDPQHALEIFNMVSE-QKGFNHNNATYASILQKLAKSKKFQAIDGVLHQM--TYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQ
        + +I  +++   AL+IF    +   GF HN  TY SIL KL++++ F  ++ ++  +  +Y   K  E +F++L+++Y  +  +E  + +F  I      
Subjt:  IRLIKHERDPQHALEIFNMVSE-QKGFNHNNATYASILQKLAKSKKFQAIDGVLHQM--TYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQ

Query:  KPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFE
        K S+++++  LN+L+++ R DL   +  N++    + PN    N+LVK  C+  D+++A++V+ E+ S  +  PNL+TY+T+LGG    G ++ A  + E
Subjt:  KPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFE

Query:  EMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNE
        EM+ +    PDA TY +L++G+C+ G+   A  +++ M+ N   PN   Y V++   CKE +  EA+ +FDEM      PD+     +++ LC   +V+E
Subjt:  EMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNE

Query:  ATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNS
        A  L ++M   +C  D   L+ ++  LC+EGR  EA  +  +   +G   +  +Y  ++  + +KGEL  A  L   M  R   P+  T N L+  L  +
Subjt:  ATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNS

Query:  GMVKGAVESLFGLLEMGFKPEPESWFTLVD
        G VK  V  L  +LE+G  P   ++  L +
Subjt:  GMVKGAVESLFGLLEMGFKPEPESWFTLVD

AT5G18475.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-15855.47Show/hide
Query:  RFMNLAIVWVSKAIISE-STSPTRLFTTSIK-AQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVL
        RF + +  WVS    SE    P+    +SI   +   K+ +ISHE+A+ L+K ERDPQ  L+IFN  S+QKGFNHNNATY+ +L  L + KKF A+D +L
Subjt:  RFMNLAIVWVSKAIISE-STSPTRLFTTSIK-AQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVL

Query:  HQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDL
        HQM Y+TC+  E +FLNLM+H+S+S +H++V++MF  IQ I R KPSL AIS CLNLL++S  V+L+RKLL+ A+  L L+PNTCIFNILVKHHC+ GD+
Subjt:  HQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDL

Query:  QAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNG
          AF VV EMK + +SYPN ITYSTL+  L    + KEA+ELFE+M+SK+ I PD +T+N++INGFC+AG+V+RA++I++FMK NGCNPNV+NYS LMNG
Subjt:  QAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNG

Query:  FCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYR
        FCK G++QEAK+ FDE+K  G++ DT+ YTTLMNC CR G  +EA +LL +MK   CRADT+T NV+L GL  EGR EEAL M+ +   EG +LNKGSYR
Subjt:  FCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYR

Query:  IVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT
        I+LN L   GEL++A + L +M  RG  PH+AT N L+V LC SG  +  V  L G L +G  P P+SW  +V+ IC+ERK++ VFELLD LV+
Subjt:  IVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLVDLICRERKMLPVFELLDELVT

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein8.2e-5831.53Show/hide
Query:  SSYISHETAIRLIKHERDPQHALEIFNMVSEQ--KGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFY
        S  I+    I+L++ E+D + ++ +F+  + +   G+ H+ +++  ++ +L  + KF+A + ++ +M  + C + E I L++ + Y +       L +F+
Subjt:  SSYISHETAIRLIKHERDPQHALEIFNMVSEQ--KGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERVLDMFY

Query:  AIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCR-IGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGK
         ++      PS KA    L +LVE N+++LA K   N R ++ L P     N+L+K  CR  G + A  ++  EM   R   P+  TY TL+ GLC  G+
Subjt:  AIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCR-IGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGK

Query:  LKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNC
        + EA +LF EMV KD   P  +TY  LING C +  VD A R +E MKS G  PNVF YS LM+G CK+GR  +A E+F+ M + G RP+ ++YTTL+  
Subjt:  LKEAIELFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNC

Query:  LCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ------KGELKRANELLGLMLNRGFVP
        LC+  ++ EA ELL +M  +  + D      ++ G C   +F EA + + ++   G   N+ ++ I +   ++           RA  L   M +RG   
Subjt:  LCRTGRVNEATELLQQMKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQ------KGELKRANELLGLMLNRGFVP

Query:  HYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLV
           T  +L+  LC  G  + AV+ +  ++  G  P   +W  L+
Subjt:  HYATSNNLLVLLCNSGMVKGAVESLFGLLEMGFKPEPESWFTLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAGTCTCCCCAACTTTATTGCCCCTCCCTCTGTCGCACTCACCTCCTCCCTCACGCCGCTGGCCGATCTCTGGCGAGACCTCCTCAATCTCTTGCGCCGCCTAGA
CGACCCCGTCTCAGTGCCGGCGACGCATCTTTTCGACTCTCCTTCGATCACCGTCGCACCTACCAGTTTTAAAGCGGCTACTCATATGAAATTCATGATAGATGTGAACA
AACATGTAGGATTAGCAAGCGAGTTTTTCATGAATCTTGCCGTCTTCTCCTCTCGTTTCATGAATCTTGCTATTGTTTGGGTTTCAAAAGCAATAATCTCCGAGAGCACT
AGTCCTACGAGACTCTTTACTACCTCCATAAAAGCTCAGAAAAAATCCAAATCCAGTTACATTTCTCACGAAACGGCCATAAGGTTAATCAAACATGAAAGAGATCCTCA
ACATGCCCTTGAAATATTCAACATGGTATCAGAGCAGAAAGGATTCAATCACAATAACGCCACTTATGCAAGCATTCTTCAAAAGCTTGCGAAGTCCAAGAAGTTTCAGG
CTATTGATGGAGTTTTGCATCAAATGACATATGACACCTGCAAATTACACGAGGGTATATTCCTTAATCTCATGAAGCATTATTCAAAGTCTTCTATGCACGAAAGAGTT
CTTGACATGTTTTATGCCATCCAGTCGATCGTTCGTCAGAAGCCTTCTCTTAAAGCGATCAGCATGTGTCTCAATCTTCTCGTCGAGTCCAATCGGGTTGATCTAGCCAG
GAAATTGCTTGTGAATGCCAGGAGTAAGCTCAACTTAAGACCAAACACTTGCATTTTCAACATTTTAGTTAAGCACCATTGCAGAATTGGAGATCTTCAAGCTGCATTTG
AGGTTGTGAGGGAAATGAAAAGTGCTAGAGTCTCTTATCCTAATCTGATCACCTACTCAACTCTGCTAGGTGGCCTTTGTGAAAGTGGAAAACTTAAAGAAGCCATTGAA
CTTTTTGAAGAAATGGTTTCAAAGGACAAGATCTTGCCTGATGCCTTGACTTACAATATTTTGATCAATGGTTTTTGTCAAGCAGGGAAAGTAGACCGTGCAAGGAGAAT
AGTTGAGTTCATGAAAAGCAATGGATGTAATCCTAATGTATTCAATTACTCTGTCTTAATGAATGGCTTCTGTAAAGAGGGAAGATTGCAAGAGGCAAAGGAGGTTTTTG
ATGAAATGAAGAGCCTCGGGATGAGACCCGATACAATCAGCTACACTACTTTAATGAACTGCCTATGTAGAACTGGAAGAGTCAATGAGGCGACAGAGTTACTCCAGCAG
ATGAAGGACAAAGATTGCAGAGCTGATACCGTGACATTGAACGTGATGCTTGGAGGGCTATGTCGAGAAGGTAGGTTTGAAGAGGCTCTTGATATGGTGCAGAAGCTTCC
TTTTGAAGGTTTCTATTTGAACAAAGGCAGCTATAGGATTGTGTTGAATTGCCTTTCTCAAAAAGGAGAATTGAAAAGGGCTAATGAGTTGTTGGGTCTGATGTTGAATA
GGGGTTTTGTACCTCACTATGCAACTTCAAATAATTTGCTGGTTCTTCTTTGTAACAGTGGAATGGTGAAGGGTGCTGTAGAATCTTTGTTTGGGTTGTTAGAAATGGGC
TTCAAACCTGAGCCTGAATCTTGGTTTACTTTGGTTGATTTGATCTGCAGGGAGAGAAAAATGTTGCCTGTATTTGAATTGCTTGATGAGTTGGTCACTGAAGAGTATTT
AATTGTAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATAGTCTCCCCAACTTTATTGCCCCTCCCTCTGTCGCACTCACCTCCTCCCTCACGCCGCTGGCCGATCTCTGGCGAGACCTCCTCAATCTCTTGCGCCGCCTAGA
CGACCCCGTCTCAGTGCCGGCGACGCATCTTTTCGACTCTCCTTCGATCACCGTCGCACCTACCAGTTTTAAAGCGGCTACTCATATGAAATTCATGATAGATGTGAACA
AACATGTAGGATTAGCAAGCGAGTTTTTCATGAATCTTGCCGTCTTCTCCTCTCGTTTCATGAATCTTGCTATTGTTTGGGTTTCAAAAGCAATAATCTCCGAGAGCACT
AGTCCTACGAGACTCTTTACTACCTCCATAAAAGCTCAGAAAAAATCCAAATCCAGTTACATTTCTCACGAAACGGCCATAAGGTTAATCAAACATGAAAGAGATCCTCA
ACATGCCCTTGAAATATTCAACATGGTATCAGAGCAGAAAGGATTCAATCACAATAACGCCACTTATGCAAGCATTCTTCAAAAGCTTGCGAAGTCCAAGAAGTTTCAGG
CTATTGATGGAGTTTTGCATCAAATGACATATGACACCTGCAAATTACACGAGGGTATATTCCTTAATCTCATGAAGCATTATTCAAAGTCTTCTATGCACGAAAGAGTT
CTTGACATGTTTTATGCCATCCAGTCGATCGTTCGTCAGAAGCCTTCTCTTAAAGCGATCAGCATGTGTCTCAATCTTCTCGTCGAGTCCAATCGGGTTGATCTAGCCAG
GAAATTGCTTGTGAATGCCAGGAGTAAGCTCAACTTAAGACCAAACACTTGCATTTTCAACATTTTAGTTAAGCACCATTGCAGAATTGGAGATCTTCAAGCTGCATTTG
AGGTTGTGAGGGAAATGAAAAGTGCTAGAGTCTCTTATCCTAATCTGATCACCTACTCAACTCTGCTAGGTGGCCTTTGTGAAAGTGGAAAACTTAAAGAAGCCATTGAA
CTTTTTGAAGAAATGGTTTCAAAGGACAAGATCTTGCCTGATGCCTTGACTTACAATATTTTGATCAATGGTTTTTGTCAAGCAGGGAAAGTAGACCGTGCAAGGAGAAT
AGTTGAGTTCATGAAAAGCAATGGATGTAATCCTAATGTATTCAATTACTCTGTCTTAATGAATGGCTTCTGTAAAGAGGGAAGATTGCAAGAGGCAAAGGAGGTTTTTG
ATGAAATGAAGAGCCTCGGGATGAGACCCGATACAATCAGCTACACTACTTTAATGAACTGCCTATGTAGAACTGGAAGAGTCAATGAGGCGACAGAGTTACTCCAGCAG
ATGAAGGACAAAGATTGCAGAGCTGATACCGTGACATTGAACGTGATGCTTGGAGGGCTATGTCGAGAAGGTAGGTTTGAAGAGGCTCTTGATATGGTGCAGAAGCTTCC
TTTTGAAGGTTTCTATTTGAACAAAGGCAGCTATAGGATTGTGTTGAATTGCCTTTCTCAAAAAGGAGAATTGAAAAGGGCTAATGAGTTGTTGGGTCTGATGTTGAATA
GGGGTTTTGTACCTCACTATGCAACTTCAAATAATTTGCTGGTTCTTCTTTGTAACAGTGGAATGGTGAAGGGTGCTGTAGAATCTTTGTTTGGGTTGTTAGAAATGGGC
TTCAAACCTGAGCCTGAATCTTGGTTTACTTTGGTTGATTTGATCTGCAGGGAGAGAAAAATGTTGCCTGTATTTGAATTGCTTGATGAGTTGGTCACTGAAGAGTATTT
AATTGTAAGTTAA
Protein sequenceShow/hide protein sequence
MYSLPNFIAPPSVALTSSLTPLADLWRDLLNLLRRLDDPVSVPATHLFDSPSITVAPTSFKAATHMKFMIDVNKHVGLASEFFMNLAVFSSRFMNLAIVWVSKAIISEST
SPTRLFTTSIKAQKKSKSSYISHETAIRLIKHERDPQHALEIFNMVSEQKGFNHNNATYASILQKLAKSKKFQAIDGVLHQMTYDTCKLHEGIFLNLMKHYSKSSMHERV
LDMFYAIQSIVRQKPSLKAISMCLNLLVESNRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRIGDLQAAFEVVREMKSARVSYPNLITYSTLLGGLCESGKLKEAIE
LFEEMVSKDKILPDALTYNILINGFCQAGKVDRARRIVEFMKSNGCNPNVFNYSVLMNGFCKEGRLQEAKEVFDEMKSLGMRPDTISYTTLMNCLCRTGRVNEATELLQQ
MKDKDCRADTVTLNVMLGGLCREGRFEEALDMVQKLPFEGFYLNKGSYRIVLNCLSQKGELKRANELLGLMLNRGFVPHYATSNNLLVLLCNSGMVKGAVESLFGLLEMG
FKPEPESWFTLVDLICRERKMLPVFELLDELVTEEYLIVS