; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G012030 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G012030
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationGy14Chr2:11942489..11945319
RNA-Seq ExpressionCsGy2G012030
SyntenyCsGy2G012030
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045943.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.094.19Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+T RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

XP_004145397.1 pentatricopeptide repeat-containing protein At2g15980 [Cucumis sativus]0.0100Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

XP_008459266.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15980 [Cucumis melo]0.093.99Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

XP_023512842.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita pepo subsp. pepo]1.84e-28881.36Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST +L FS SF SSSP   PSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDI+LQIKNN HL L FFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQTAIRA  LED D+ S  ERFS SRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+FREVFGL+CEIEE++VK+K RVSPNVHTFNTLM CFY+DG  GRVKEIWDQLADS S PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE D VAYNTIIGGFCKAG+  RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
         + R LCAE RLLEALDVFGFAIE+++FCPTMET+E+LIN LCQEGK+E AFKLQAQMVG+GFKPN KIYQSFIDAY+KEGN EMV+KL +E+ EIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

XP_038901621.1 pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida]1.94e-30185.57Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MS PLL+RTL PI NST +L FS SF SSSPP +PSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDILLQIKNNPHLALRFF WTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNL+SYST+IHILARGRLRTHAKDVIQTAIRAA+LED D+ SK ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKL+S+I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQV TLNSLILLVSK QGAN  YAIF+EVFGLDCEIEEE+VKLK  VSPNVHTFNTLM+CFY+DG  GRVK+IWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
         CEEKR GEAEELW EMK+KKLE D VAYNTIIGGFCKAG+ HRAEEFYREMELSGIESTFST EHLINGYC+TGDVDSALLVYKDMRRK+F+ NA  LE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
         LIR LCAE RLLEALDVF FAIE S+FCPT+ET+E+LIN LCQEGKIE AFKLQAQMVG+GFKPNLKIYQSFIDAY KEGN EMVEKL KE+ EIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

TrEMBL top hitse value%identityAlignment
A0A0A0LIN1 Uncharacterized protein0.0100Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

A0A1S3CAB5 pentatricopeptide repeat-containing protein At2g159800.093.99Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

A0A5A7TQU6 Pentatricopeptide repeat-containing protein0.094.19Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+T RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

A0A5D3CQ25 Pentatricopeptide repeat-containing protein0.093.99Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g159802.08e-28780.76Show/hide
Query:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST +L FS SF SSSP   P PSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDI+LQIKNN HL LRFFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQ AIRA  LED D+ S+ ERFS SRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+FREVFGL+CEIEEE+VK+K R SPNVHTFNTLM CFY+DG  GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTV

Query:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE D VAYNTIIGGFCKAG+  RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLE

Query:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
         + R LC E RLLEALDVFGFA E+++FCPTMET+E+LIN LCQ+GK+E AFKLQAQMVG+GFKPN KIYQSFIDAY+KEGN EMV+KL +E+ EIQLS
Subjt:  GLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial7.5e-4128.03Show/hide
Query:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD
        +L++IK +  L L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD
Subjt:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS
        +  + L+D   L  +  +   + + G+   V + N  +  +SK C     A  +FRE               V    C+   I+E H     ++LKG  +
Subjt:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS

Query:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME
        P+V +++T+++ + R G   +V ++ + +      PNSY Y  ++ +LC   +  EAEE + EM  + + PD V Y T+I GFCK G    A +F+ EM 
Subjt:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME

Query:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK
           I     T   +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    I+ +   P + T+  LI+ LC+EG ++ A +
Subjt:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK

Query:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        L  +M   G +PN+  Y S ++   K GN E   KL  E     L+
Subjt:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic6.0e-3827.27Show/hide
Query:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKT-----ERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRML
        N ++++TLIH L      + A  +I   +      D   Y        +R      L L + + K   +  +   ++  +I AL + K ++ ++ +   +
Subjt:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKT-----ERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRML

Query:  RSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMT
         ++GI P V T NSLI  +      + A  +  ++             ++ +++PNV TF+ L+D F ++G     ++++D++   +  P+ ++YS L+ 
Subjt:  RSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMT

Query:  VLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTL
          C   R  EA+ ++E M  K   P+VV YNT+I GFCKA       E +REM   G+     T   LI G    GD D A  ++K M       +  T 
Subjt:  VLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTL

Query:  EGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE
          L+  LC   +L +AL VF + ++ S   P + T+ I+I  +C+ GK+E  + L   +  +G KPN+ IY + I  + ++G  E  + L++EM E
Subjt:  EGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE

Q9SH26 Pentatricopeptide repeat-containing protein At1g634001.2e-3828.35Show/hide
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGR
        ++  +I +L   +  D ++ +   + ++G+ P V T +SLI  +   +  + A  +  ++             ++ +++PNV TFN L+D F ++G    
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGR

Query:  VKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+ ++YS L+   C   R  EA+ ++E M  K   P+VV YNT+I GFCKA       E +REM   G+     T   LI+G+   
Subjt:  VKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFI
         D D+A +V+K M       N  T   L+  LC   +L +A+ VF + ++ S   PT+ T+ I+I  +C+ GK+E  + L   +  +G KP++ IY + I
Subjt:  GDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFI

Query:  DAYTKEGNAEMVEKLWKEMHE
          + ++G  E  + L+++M E
Subjt:  DAYTKEGNAEMVEKLWKEMHE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial1.2e-3826.9Show/hide
Query:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSR-PLKLFETLVKTYKRCGSAP--FVFDLLIKALLDSKKLDSSIEIVRMLRS
        N ++++TLIH L      + A  +I   +      D   Y         R    L   L+   ++    P   +++ +I  L   K +D ++ + + + +
Subjt:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSR-PLKLFETLVKTYKRCGSAP--FVFDLLIKALLDSKKLDSSIEIVRMLRS

Query:  RGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVL
        +GI P V T +SLI  +      + A  +  ++             ++ +++P+V TF+ L+D F ++G     ++++D++   +  P+  +YS L+   
Subjt:  RGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVL

Query:  CEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEG
        C   R  EA++++E M  K   PDVV YNT+I GFCK        E +REM   G+     T   LI G    GD D A  ++K+M       N  T   
Subjt:  CEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEG

Query:  LIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE
        L+  LC   +L +A+ VF + ++ S   PT+ T+ I+I  +C+ GK+E  + L   +  +G KP++  Y + I  + ++G+ E  + L+KEM E
Subjt:  LIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE

Q9XIM8 Pentatricopeptide repeat-containing protein At2g159802.4e-12446.51Show/hide
Query:  MSGPLLKRTLRPI--GNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQN
        MS  +L+R L P         L  S   + SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPI--GNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADS-NSTPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++REVFGLD    +E  K+ G++ PN  TFN++M  FYR+G    V+ IW ++ +    +PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADS-NSTPNSYSYSIL

Query:  MTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNAS
        M   C      EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNAS

Query:  TLEGLIRMLCAER---RLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMH
        T+E L+  LC +R   R++EA D+   A+  + F P+   +E+L+  LC++GK++ A  +QA+MVG+GFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIRMLCAER---RLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMH

Query:  E
        E
Subjt:  E

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.3e-4228.03Show/hide
Query:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD
        +L++IK +  L L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD
Subjt:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS
        +  + L+D   L  +  +   + + G+   V + N  +  +SK C     A  +FRE               V    C+   I+E H     ++LKG  +
Subjt:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS

Query:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME
        P+V +++T+++ + R G   +V ++ + +      PNSY Y  ++ +LC   +  EAEE + EM  + + PD V Y T+I GFCK G    A +F+ EM 
Subjt:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME

Query:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK
           I     T   +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    I+ +   P + T+  LI+ LC+EG ++ A +
Subjt:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK

Query:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        L  +M   G +PN+  Y S ++   K GN E   KL  E     L+
Subjt:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein5.3e-4228.03Show/hide
Query:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD
        +L++IK +  L L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD
Subjt:  ILLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFD

Query:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS
        +  + L+D   L  +  +   + + G+   V + N  +  +SK C     A  +FRE               V    C+   I+E H     ++LKG  +
Subjt:  LLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFRE---------------VFGLDCE---IEEEH-----VKLKGRVS

Query:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME
        P+V +++T+++ + R G   +V ++ + +      PNSY Y  ++ +LC   +  EAEE + EM  + + PD V Y T+I GFCK G    A +F+ EM 
Subjt:  PNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREME

Query:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK
           I     T   +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    I+ +   P + T+  LI+ LC+EG ++ A +
Subjt:  LSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFK

Query:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS
        L  +M   G +PN+  Y S ++   K GN E   KL  E     L+
Subjt:  LQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS

AT1G62670.1 rna processing factor 28.5e-4026.9Show/hide
Query:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSR-PLKLFETLVKTYKRCGSAP--FVFDLLIKALLDSKKLDSSIEIVRMLRS
        N ++++TLIH L      + A  +I   +      D   Y         R    L   L+   ++    P   +++ +I  L   K +D ++ + + + +
Subjt:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSR-PLKLFETLVKTYKRCGSAP--FVFDLLIKALLDSKKLDSSIEIVRMLRS

Query:  RGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVL
        +GI P V T +SLI  +      + A  +  ++             ++ +++P+V TF+ L+D F ++G     ++++D++   +  P+  +YS L+   
Subjt:  RGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVL

Query:  CEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEG
        C   R  EA++++E M  K   PDVV YNT+I GFCK        E +REM   G+     T   LI G    GD D A  ++K+M       N  T   
Subjt:  CEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEG

Query:  LIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE
        L+  LC   +L +A+ VF + ++ S   PT+ T+ I+I  +C+ GK+E  + L   +  +G KP++  Y + I  + ++G+ E  + L+KEM E
Subjt:  LIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHE

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein8.5e-4028.35Show/hide
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGR
        ++  +I +L   +  D ++ +   + ++G+ P V T +SLI  +   +  + A  +  ++             ++ +++PNV TFN L+D F ++G    
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGR

Query:  VKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+ ++YS L+   C   R  EA+ ++E M  K   P+VV YNT+I GFCKA       E +REM   G+     T   LI+G+   
Subjt:  VKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFI
         D D+A +V+K M       N  T   L+  LC   +L +A+ VF + ++ S   PT+ T+ I+I  +C+ GK+E  + L   +  +G KP++ IY + I
Subjt:  GDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFI

Query:  DAYTKEGNAEMVEKLWKEMHE
          + ++G  E  + L+++M E
Subjt:  DAYTKEGNAEMVEKLWKEMHE

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-12546.51Show/hide
Query:  MSGPLLKRTLRPI--GNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQN
        MS  +L+R L P         L  S   + SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPI--GNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADS-NSTPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++REVFGLD    +E  K+ G++ PN  TFN++M  FYR+G    V+ IW ++ +    +PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADS-NSTPNSYSYSIL

Query:  MTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNAS
        M   C      EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYNTIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNAS

Query:  TLEGLIRMLCAER---RLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMH
        T+E L+  LC +R   R++EA D+   A+  + F P+   +E+L+  LC++GK++ A  +QA+MVG+GFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIRMLCAER---RLLEALDVFGFAIEYSSFCPTMETFEILINELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMH

Query:  E
        E
Subjt:  E


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGGTCCGTTGCTTAAACGAACCCTCCGCCCGATCGGAAACTCCACCGTTCACCTCCAATTTTCCTCTTCCTTTTCCTCATCCTCACCGCCCACCGACCCTTCGCC
GTCGACGAAACCCTCAATCTCCACTGTCGTTTCAGTTCTCACTCACCAACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCG
AGTTTTCCGATATCCTTCTCCAAATCAAGAACAATCCTCATCTCGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCG
ACCCTCATTCACATCCTTGCTCGCGGTCGACTCAGAACTCACGCCAAGGATGTTATTCAAACTGCCATTAGGGCTGCGCAGCTCGAAGATAGCGATAATTATTCTAAAAC
TGAGCGGTTCTCTCCTTCGAGGCCTTTGAAGCTTTTTGAAACCCTCGTCAAGACGTATAAACGGTGTGGCTCTGCCCCATTTGTGTTTGATTTATTGATTAAAGCTCTTC
TGGATTCTAAAAAGCTCGATTCATCCATTGAAATTGTTAGAATGTTACGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAA
TGCCAGGGGGCTAATGTAGCTTATGCAATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAATTGAGGAAGAACATGTGAAATTGAAGGGTAGAGTCAGTCCTAATGTTCA
TACTTTTAACACATTAATGGACTGTTTTTATCGAGATGGGTTTGCAGGGAGGGTGAAGGAGATTTGGGATCAATTAGCAGATTCAAATTCGACTCCAAACAGCTATAGTT
ATAGTATTCTAATGACAGTTTTATGTGAAGAGAAAAGAACGGGAGAAGCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAGCCTGATGTTGTAGCTTACAAT
ACTATAATTGGAGGATTTTGTAAAGCAGGACATACTCATAGAGCTGAAGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCT
CATCAATGGCTATTGTGATACTGGAGATGTTGATTCTGCATTACTTGTGTATAAGGATATGCGTAGGAAACAGTTTAGTCTCAACGCATCGACGCTTGAAGGACTTATTA
GAATGTTGTGTGCTGAGAGAAGGCTTTTAGAAGCTTTAGATGTTTTTGGTTTTGCCATTGAATACTCTAGCTTTTGTCCTACAATGGAAACTTTTGAAATTCTGATAAAT
GAGTTGTGTCAAGAAGGGAAAATTGAAGGTGCATTTAAGCTTCAAGCGCAGATGGTAGGGAGAGGTTTTAAGCCAAATTTGAAGATTTATCAATCATTTATCGATGCTTA
CACGAAAGAAGGAAATGCAGAAATGGTTGAGAAATTGTGGAAGGAAATGCATGAAATCCAGCTGAGTTGA
mRNA sequenceShow/hide mRNA sequence
CTACACGAAACTTAATTACTCAAAATATGAATATAAGAAAGTGAGTGAGTTCTAAGTAAAAGTAAACTTTATAAACTTGTAAAACAAATTGATGTATTAAATAGATTGTG
AATTTTATGAAATCACGAAGGAACGGGAGAGTCATTTGATACAAACTTATTGAATTGTGTTTTTTTTTCAGGTCTCTCCTTCTTGCAAAGTTTAATGTCCGGTCCGTTGC
TTAAACGAACCCTCCGCCCGATCGGAAACTCCACCGTTCACCTCCAATTTTCCTCTTCCTTTTCCTCATCCTCACCGCCCACCGACCCTTCGCCGTCGACGAAACCCTCA
ATCTCCACTGTCGTTTCAGTTCTCACTCACCAACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCGAGTTTTCCGATATCCT
TCTCCAAATCAAGAACAATCCTCATCTCGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCGACCCTCATTCACATCC
TTGCTCGCGGTCGACTCAGAACTCACGCCAAGGATGTTATTCAAACTGCCATTAGGGCTGCGCAGCTCGAAGATAGCGATAATTATTCTAAAACTGAGCGGTTCTCTCCT
TCGAGGCCTTTGAAGCTTTTTGAAACCCTCGTCAAGACGTATAAACGGTGTGGCTCTGCCCCATTTGTGTTTGATTTATTGATTAAAGCTCTTCTGGATTCTAAAAAGCT
CGATTCATCCATTGAAATTGTTAGAATGTTACGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAATGCCAGGGGGCTAATG
TAGCTTATGCAATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAATTGAGGAAGAACATGTGAAATTGAAGGGTAGAGTCAGTCCTAATGTTCATACTTTTAACACATTA
ATGGACTGTTTTTATCGAGATGGGTTTGCAGGGAGGGTGAAGGAGATTTGGGATCAATTAGCAGATTCAAATTCGACTCCAAACAGCTATAGTTATAGTATTCTAATGAC
AGTTTTATGTGAAGAGAAAAGAACGGGAGAAGCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAGCCTGATGTTGTAGCTTACAATACTATAATTGGAGGAT
TTTGTAAAGCAGGACATACTCATAGAGCTGAAGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCTCATCAATGGCTATTGT
GATACTGGAGATGTTGATTCTGCATTACTTGTGTATAAGGATATGCGTAGGAAACAGTTTAGTCTCAACGCATCGACGCTTGAAGGACTTATTAGAATGTTGTGTGCTGA
GAGAAGGCTTTTAGAAGCTTTAGATGTTTTTGGTTTTGCCATTGAATACTCTAGCTTTTGTCCTACAATGGAAACTTTTGAAATTCTGATAAATGAGTTGTGTCAAGAAG
GGAAAATTGAAGGTGCATTTAAGCTTCAAGCGCAGATGGTAGGGAGAGGTTTTAAGCCAAATTTGAAGATTTATCAATCATTTATCGATGCTTACACGAAAGAAGGAAAT
GCAGAAATGGTTGAGAAATTGTGGAAGGAAATGCATGAAATCCAGCTGAGTTGAGAAGGGAATTGTATTGCATTGTATCTCGCCATCTCCAATGCAGCTGATCCGGAAGA
AGTTTCTATTGGCAAGAGAGGGATAGGACATATAAAAGGTTGGATCATAGTGGGAATCAAATGAGAGCTTGCAAAGTGTAGACGCTATGGAGGACAAGAAGGCCGATGCG
AGCTAGTTGGACCATGTGTGGTCTAGTTGCCTCACACTCTCTTCAATGGTGCCTCACTTGGATCTGTTTGCAACTCATTTAGTGTCCCTCATCTGGGCTTTCCTCATAGG
TCCAAAATTGTTCCTGCCAAACCAAACAAATTTATTCTTTAAAGCAATTGAAAATCTGTGAGGGTATTATTAGTTTATGACATTTTTTTTTCTTGTAAATATTTTGATTT
TGAAAAGTATTTAAATTGAAAGGTAGGTAGAGAGGAGAGAGAGATTGGTGGTTGATAATTGTAAGAAAGGGAAGCAAGTGTACTTAGATTCCAACATTTATTTGCTTTGT
ACCAAAAATTCTCAATACTTGGATCACTTGTGTGAATTGTTCTTT
Protein sequenceShow/hide protein sequence
MSGPLLKRTLRPIGNSTVHLQFSSSFSSSSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHLALRFFLWTQNKSLCNHNLISYS
TLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTERFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK
CQGANVAYAIFREVFGLDCEIEEEHVKLKGRVSPNVHTFNTLMDCFYRDGFAGRVKEIWDQLADSNSTPNSYSYSILMTVLCEEKRTGEAEELWEEMKMKKLEPDVVAYN
TIIGGFCKAGHTHRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKQFSLNASTLEGLIRMLCAERRLLEALDVFGFAIEYSSFCPTMETFEILIN
ELCQEGKIEGAFKLQAQMVGRGFKPNLKIYQSFIDAYTKEGNAEMVEKLWKEMHEIQLS