CuGenDBv2

Pay0003231 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0003231
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr11:4613493..4615149
RNA-Seq Expression: Pay0003231
Synteny: Pay0003231
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
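
The genome location above is given as a `chromosome:start..end` string. As a minimal sketch, it can be split into its parts and the span length computed as shown below. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Hypothetical helper, not a CuGenDB API: matches strings of the form
# "chrom:start..end", such as the "Genome location" field of this record.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (chromosome, start, end) parsed from a 'chrom:start..end' string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if start > end:
        raise ValueError("start must not exceed end")
    return match.group("chrom"), start, end


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:4613493..4615149")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1657 bases on chr11.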


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045943.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e value: 9.9e-277 | % identity: 98
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGN QRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSF         LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_004145397.1 pentatricopeptide repeat-containing protein At2g15980 [Cucumis sativus] | e value: 1.7e-260 | % identity: 92.38
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSF         LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_008459266.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15980 [Cucumis melo] | e value: 3.4e-277 | % identity: 98.2
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSF         LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_023512842.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita pepo subsp. pepo] | e value: 1.0e-228 | % identity: 81.76
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST NL FS SFFSSSP A PSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVLQIKNN HL L FFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQTAIRA  LED D+ S  ERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+F EVFGL+CEIE+++VK+K RVSPNVHTFNTLM CFYQDG VGRVKEIWDQLADS SIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLE+D VAYNTIIGGFCKAGN +RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         +   LCAE RLLEALDVFGFA+E ++F         LIN LCQEGK+E AFKLQAQMVGKGFKPN KIYQSFIDAY KEGN EMV+KLG+E+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_038901621.1 pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida] | e value: 1.9e-240 | % identity: 86.37
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLL+RTL PI NST NL FS SFFSSSPP EPSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDI+LQIKNNPHLALRFF WTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNL+SYST+IHILARGRLRTHAKDVIQTAIRAAELED D+ S+ ERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKL+S+I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQV TLNSLILLVSK QGAN  YAIF EVFGLDCEIE+E+VKLK  VSPNVHTFNTLM+CFYQDG VGRVK+IWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
         CEEKRMGEAEELW EMK+KKLELD VAYNTIIGGFCKAGN  RAEEFYREMELSGIESTFST EHLINGYC+TGDVDSALLVYKDMRRKRF+ NA  LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         LI  LCAE RLLEALDVF FA+EDS+F         LIN LCQEGKIE AFKLQAQMVGKGFKPNLKIYQSFIDAY+KEGN EMVEKLGKE+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LIN1 Uncharacterized protein | e value: 8.2e-261 | % identity: 92.38
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSF         LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A1S3CAB5 pentatricopeptide repeat-containing protein At2g15980 | e value: 1.6e-277 | % identity: 98.2
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSF         LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A5A7TQU6 Pentatricopeptide repeat-containing protein | e value: 4.8e-277 | % identity: 98
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGN QRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSF         LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A5D3CQ25 Pentatricopeptide repeat-containing protein | e value: 1.6e-277 | % identity: 98.2
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSF         LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g15980 | e value: 2.4e-228 | % identity: 81.36
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST NL FS SFFSSSP A P PSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQ AIRA  LED D+ S+ ERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+F EVFGL+CEIE+E+VK+K R SPNVHTFNTLM CFYQDG VGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLE+D VAYNTIIGGFCKAGN +RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         +   LC E RLLEALDVFGFA E ++F         LIN LCQ+GK+E AFKLQAQMVGKGFKPN KIYQSFIDAY KEGN EMV+KLG+E+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSF--------FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

SwissProt top hits (e value, % identity, alignment)
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial | e value: 2.1e-35 | % identity: 25.66
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++        +   LI+ LC+EG ++ A +L  +M   G +P
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP

Query:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        N+  Y S ++   K GN E   KL  E     L+
Subjt:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400 | e value: 1.1e-33 | % identity: 27.19
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        ++  +I +L   +  D ++ +   + ++G+ P V T +SLI  +   +  + A  + +++             ++ +++PNV TFN L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+ ++YS L+   C   R+ EA+ ++E M  K    +VV YNT+I GFCKA       E +REM   G+     T   LI+G+   
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID
         D D+A +V+K M       N  T   L++ LC   +L +A+ VF +     +E + +    +I  +C+ GK+E  + L   +  KG KP++ IY + I 
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID

Query:  AYMKEGNAEMVEKLGKEMHE
         + ++G  E  + L ++M E
Subjt:  AYMKEGNAEMVEKLGKEMHE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial | e value: 5.1e-34 | % identity: 27.5
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        +++ +I  L   K +D ++ + + + ++GI P V T +SLI  +      + A  + +++             ++ +++P+V TF+ L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+  +YS L+   C   R+ EA++++E M  K    DVV YNT+I GFCK    +   E +REM   G+     T   LI G    
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID
        GD D A  ++K+M       N  T   L++ LC   +L +A+ VF +     +E + +    +I  +C+ GK+E  + L   +  KG KP++  Y + I 
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID

Query:  AYMKEGNAEMVEKLGKEMHE
         + ++G+ E  + L KEM E
Subjt:  AYMKEGNAEMVEKLGKEMHE

Q9SZ10 Pentatricopeptide repeat-containing protein At4g26680, mitochondrial | e value: 6.7e-34 | % identity: 23.86
Query:  PAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQT
        P   +P  K      V+V      +S W  LN L  +  D     +++L+I+ +  L+L FF W + ++  +H+L +++ ++H L + R    A+ +++ 
Subjt:  PAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQT

Query:  AIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIF
         +    ++               P K+F+ L+ +Y+ C S P VFD L K     KK  ++ +    ++  G  P V + N+ +  +      ++A   +
Subjt:  AIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIF

Query:  TEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNT
         E+              + ++SPN +T N +M  + + G + +  E+   +          SY+ L+A  CE+  +  A +L   M    L+ +VV +NT
Subjt:  TEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNT

Query:  IIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEA------LDVFGFAVED
        +I GFC+A   Q A + + EM+   +     T   LINGY   GD + A   Y+DM       +  T   LI  LC + +  +A      LD        
Subjt:  IIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEA------LDVFGFAVED

Query:  SSF-FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEM
        S+F  LI   C     +  F+L   M+  G  PN + +   + A+ +  + +   ++ +EM
Subjt:  SSF-FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEM

Q9XIM8 Pentatricopeptide repeat-containing protein At2g15980 | e value: 6.5e-122 | % identity: 46.51
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P      +   S S  +  SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++ EVFGLD     E  K+ G++ PN  TFN++M  FY++G    V+ IW ++ +     PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL

Query:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS
        M   C    M EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS

Query:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFF--------LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH
        T+E L+E LC +R   R++EA D+   AV ++ F+        L+  LC++GK++ A  +QA+MVGKGFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFF--------LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH

Query:  E
        E
Subjt:  E

Arabidopsis top hits (e value, % identity, alignment)
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 1.5e-36 | % identity: 25.66
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++        +   LI+ LC+EG ++ A +L  +M   G +P
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP

Query:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        N+  Y S ++   K GN E   KL  E     L+
Subjt:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 1.5e-36 | % identity: 25.66
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++        +   LI+ LC+EG ++ A +L  +M   G +P
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVED-------SSFFLINWLCQEGKIEGAFKLQAQMVGKGFKP

Query:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        N+  Y S ++   K GN E   KL  E     L+
Subjt:  NLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

AT1G62670.1 rna processing factor 2 | e value: 3.6e-35 | % identity: 27.5
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        +++ +I  L   K +D ++ + + + ++GI P V T +SLI  +      + A  + +++             ++ +++P+V TF+ L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+  +YS L+   C   R+ EA++++E M  K    DVV YNT+I GFCK    +   E +REM   G+     T   LI G    
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID
        GD D A  ++K+M       N  T   L++ LC   +L +A+ VF +     +E + +    +I  +C+ GK+E  + L   +  KG KP++  Y + I 
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGF----AVEDSSF---FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFID

Query:  AYMKEGNAEMVEKLGKEMHE
         + ++G+ E  + L KEM E
Subjt:  AYMKEGNAEMVEKLGKEMHE

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 4.6e-123 | % identity: 46.51
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P      +   S S  +  SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++ EVFGLD     E  K+ G++ PN  TFN++M  FY++G    V+ IW ++ +     PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL

Query:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS
        M   C    M EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS

Query:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFF--------LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH
        T+E L+E LC +R   R++EA D+   AV ++ F+        L+  LC++GK++ A  +QA+MVGKGFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFF--------LINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH

Query:  E
        E
Subjt:  E

AT4G26680.1  Tetratricopeptide repeat (TPR)-like superfamily protein  e value: 4.8e-35  %identity: 23.86
Query:  PAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQT
        P   +P  K      V+V      +S W  LN L  +  D     +++L+I+ +  L+L FF W + ++  +H+L +++ ++H L + R    A+ +++ 
Subjt:  PAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQT

Query:  AIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIF
         +    ++               P K+F+ L+ +Y+ C S P VFD L K     KK  ++ +    ++  G  P V + N+ +  +      ++A   +
Subjt:  AIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIF

Query:  TEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNT
         E+              + ++SPN +T N +M  + + G + +  E+   +          SY+ L+A  CE+  +  A +L   M    L+ +VV +NT
Subjt:  TEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNT

Query:  IIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEA------LDVFGFAVED
        +I GFC+A   Q A + + EM+   +     T   LINGY   GD + A   Y+DM       +  T   LI  LC + +  +A      LD        
Subjt:  IIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEA------LDVFGFAVED

Query:  SSF-FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEM
        S+F  LI   C     +  F+L   M+  G  PN + +   + A+ +  + +   ++ +EM
Subjt:  SSF-FLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEM


Sequences
CDS sequence
ATGTCCGGCCCGTTGCTTAAACGAACCCTCCGGCCGATCGGAAACTCCACCGTTAACCTCCAATTTTCCTCTTCCTTTTTCTCATCCTCACCGCCCGCGGAACCTTCGCC
GTCGACGAAACCCTCAATTTCCACTGTCGTTTCAGTTCTCACTCACCAACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCG
AGTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCTCATCTAGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCG
ACCCTCATCCACATCCTTGCTCGCGGTCGACTCAGAACTCATGCCAAGGATGTTATTCAAACCGCCATTAGGGCTGCGGAGCTCGAAGATAGCGATAATTATTCTGAATC
TGAGCGGTTCTCTTCTTCGAGGCCTTTGAAGCTTTTTGAAACCCTCGTCAAGACATATAAACGGTGTGGCTCTGCCCCCTTTGTGTTTGATTTATTGATTAAAGCTCTTC
TGGATTCTAAAAAGCTCGATTCATCCATTGAAATTGTTAGAATGTTACGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAA
TGCCAGGGGGCTAATGTAGCTTATGCAATTTTTACAGAGGTTTTTGGTTTAGATTGTGAAATAGAGAAAGAACATGTGAAATTGAAGGGTAGAGTTAGTCCTAATGTTCA
TACTTTTAACACATTAATGGACTGTTTTTATCAAGATGGGTTTGTAGGGAGGGTGAAGGAGATTTGGGATCAATTGGCTGATTCAAATTCAATTCCAAACAGCTATAGTT
ATAGTATCCTAATGGCAGTTTTATGTGAAGAGAAGAGAATGGGAGAAGCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAACTTGATGTTGTAGCTTACAAT
ACAATAATTGGAGGATTTTGTAAAGCAGGAAATGCTCAGAGAGCTGAAGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCT
CATCAACGGCTATTGTGACACTGGAGATGTTGATTCTGCATTACTTGTGTACAAGGATATGCGTAGGAAACGGTTTAGTCTCAATGCGTCGACGCTTGAAGGACTTATTG
AAGTGTTGTGTGCTGAGAGAAGGCTTTTAGAAGCTTTAGATGTTTTTGGTTTTGCCGTTGAAGACTCTAGCTTTTTTCTGATAAATTGGTTGTGTCAAGAAGGGAAAATT
GAAGGTGCATTTAAGCTTCAAGCGCAGATGGTAGGGAAAGGTTTTAAGCCAAATTTGAAGATTTATCAATCGTTTATCGATGCTTACATGAAAGAAGGAAATGCAGAAAT
GGTTGAGAAATTGGGGAAGGAAATGCATGAAATCCAGCTGAGTTGA
mRNA sequence
ATGTCCGGCCCGTTGCTTAAACGAACCCTCCGGCCGATCGGAAACTCCACCGTTAACCTCCAATTTTCCTCTTCCTTTTTCTCATCCTCACCGCCCGCGGAACCTTCGCC
GTCGACGAAACCCTCAATTTCCACTGTCGTTTCAGTTCTCACTCACCAACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCG
AGTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCTCATCTAGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCG
ACCCTCATCCACATCCTTGCTCGCGGTCGACTCAGAACTCATGCCAAGGATGTTATTCAAACCGCCATTAGGGCTGCGGAGCTCGAAGATAGCGATAATTATTCTGAATC
TGAGCGGTTCTCTTCTTCGAGGCCTTTGAAGCTTTTTGAAACCCTCGTCAAGACATATAAACGGTGTGGCTCTGCCCCCTTTGTGTTTGATTTATTGATTAAAGCTCTTC
TGGATTCTAAAAAGCTCGATTCATCCATTGAAATTGTTAGAATGTTACGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAA
TGCCAGGGGGCTAATGTAGCTTATGCAATTTTTACAGAGGTTTTTGGTTTAGATTGTGAAATAGAGAAAGAACATGTGAAATTGAAGGGTAGAGTTAGTCCTAATGTTCA
TACTTTTAACACATTAATGGACTGTTTTTATCAAGATGGGTTTGTAGGGAGGGTGAAGGAGATTTGGGATCAATTGGCTGATTCAAATTCAATTCCAAACAGCTATAGTT
ATAGTATCCTAATGGCAGTTTTATGTGAAGAGAAGAGAATGGGAGAAGCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAACTTGATGTTGTAGCTTACAAT
ACAATAATTGGAGGATTTTGTAAAGCAGGAAATGCTCAGAGAGCTGAAGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCT
CATCAACGGCTATTGTGACACTGGAGATGTTGATTCTGCATTACTTGTGTACAAGGATATGCGTAGGAAACGGTTTAGTCTCAATGCGTCGACGCTTGAAGGACTTATTG
AAGTGTTGTGTGCTGAGAGAAGGCTTTTAGAAGCTTTAGATGTTTTTGGTTTTGCCGTTGAAGACTCTAGCTTTTTTCTGATAAATTGGTTGTGTCAAGAAGGGAAAATT
GAAGGTGCATTTAAGCTTCAAGCGCAGATGGTAGGGAAAGGTTTTAAGCCAAATTTGAAGATTTATCAATCGTTTATCGATGCTTACATGAAAGAAGGAAATGCAGAAAT
GGTTGAGAAATTGGGGAAGGAAATGCATGAAATCCAGCTGAGTTGA
Protein sequence
MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYS
TLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK
CQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYN
TIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFFLINWLCQEGKI
EGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
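
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` and the table construction are illustrative and not part of CuGenDB.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the Pay0003231 CDS listed above.
print(translate("ATGTCCGGCCCGTTGCTTAAACGAACC"))
```

Passing the full CDS, with its line breaks, to `translate` should give a string that can be compared against the protein sequence listed above.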