CuGenDBv2

IVF0019947 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0019947
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr11:4346806..4351238
RNA-Seq Expression: IVF0019947
Synteny: IVF0019947
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
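The genome location above follows a chrom:start..end pattern. As a minimal sketch of how such a string could be split and the gene span computed, the snippet below uses a hypothetical helper named parse_location. It assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr11:4346806..4351238")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 4433 bp on chr11. With 0-based, half-open coordinates the span would instead be end - start.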


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045943.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 99.8)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGN QRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_004145397.1 pentatricopeptide repeat-containing protein At2g15980 [Cucumis sativus] (e value: 0.0; % identity: 93.99)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_008459266.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15980 [Cucumis melo] (e value: 0.0; % identity: 100)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_023512842.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita pepo subsp. pepo] (e value: 1.06e-296; % identity: 83.17)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST NL FS SFFSSSP A PSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVLQIKNN HL L FFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQTAIRA  LED D+ S  ERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+F EVFGL+CEIE+++VK+K RVSPNVHTFNTLM CFYQDG VGRVKEIWDQLADS SIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLE+D VAYNTIIGGFCKAGN +RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         +   LCAE RLLEALDVFGFA+E ++FCPTMET+E+LIN LCQEGK+E AFKLQAQMVGKGFKPN KIYQSFIDAY KEGN EMV+KLG+E+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

XP_038901621.1 pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida] (e value: 3.35e-311; % identity: 87.58)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLL+RTL PI NST NL FS SFFSSSPP EPSPSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDI+LQIKNNPHLALRFF WTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNL+SYST+IHILARGRLRTHAKDVIQTAIRAAELED D+ S+ ERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKL+S+I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQV TLNSLILLVSK QGAN  YAIF EVFGLDCEIE+E+VKLK  VSPNVHTFNTLM+CFYQDG VGRVK+IWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
         CEEKRMGEAEELW EMK+KKLELD VAYNTIIGGFCKAGN  RAEEFYREMELSGIESTFST EHLINGYC+TGDVDSALLVYKDMRRKRF+ NA  LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         LI  LCAE RLLEALDVF FA+EDS+FCPT+ET+E+LIN LCQEGKIE AFKLQAQMVGKGFKPNLKIYQSFIDAY+KEGN EMVEKLGKE+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LIN1 Uncharacterized protein (e value: 6.4e-269; % identity: 93.99)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTV+LQFSSSF SSSPP +PSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDI+LQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAA+LEDSDNYS++ERFS SRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIF EVFGLDCEIE+EHVKLKGRVSPNVHTFNTLMDCFY+DGF GRVKEIWDQLADSNS PNSYSYSILM V
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKR GEAEELWEEMKMKKLE DVVAYNTIIGGFCKAG+  RAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRK+FSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLI +LCAERRLLEALDVFGFA+E SSFCPTMETFE+LIN LCQEGKIEGAFKLQAQMVG+GFKPNLKIYQSFIDAY KEGNAEMVEKL KEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A1S3CAB5 pentatricopeptide repeat-containing protein At2g15980 (e value: 7.5e-286; % identity: 100)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A5A7TQU6 Pentatricopeptide repeat-containing protein (e value: 2.2e-285; % identity: 99.8)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGN QRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A5D3CQ25 Pentatricopeptide repeat-containing protein (e value: 7.5e-286; % identity: 100)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g15980 (e value: 7.1e-236; % identity: 82.77)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS
        MS PLLKR+L  I NST NL FS SFFSSSP A P PSTKPSISTVVSVLTH RSKSRWRFLNSLCP+GFDPGEFSDIVLQIKNN HL LRFFLWT++KS
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKS

Query:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR
        LCNH+L+SYST+IHILARGRLRTHAKDVIQ AIRA  LED D+ S+ ERFSSSRPLKLFETLVKTYK+CGSAPFVFDLLIKALLDSKKLD +I+IVRMLR
Subjt:  LCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLR

Query:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV
        SRGISPQ+ TLNSLIL +SKC+GAN  YA+F EVFGL+CEIE+E+VK+K R SPNVHTFNTLM CFYQDG VGRVKEIWDQLADSNSIPNSYSYSILMAV
Subjt:  SRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV

Query:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE
        LCEEKRMGEAEELWEEMKMKKLE+D VAYNTIIGGFCKAGN +RAEEF+REMEL G ESTFST EHLINGYC+TGDVDSALLVYKDMRRK FSLN   LE
Subjt:  LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLE

Query:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
         +   LC E RLLEALDVFGFA E ++FCPTMET+E+LIN LCQ+GK+E AFKLQAQMVGKGFKPN KIYQSFIDAY KEGN EMV+KLG+E+ EIQLS
Subjt:  GLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

SwissProt top hits (e value, % identity, alignment)
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial (e value: 2.7e-38; % identity: 25.98)
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++ +   P + T+  LI+ LC+EG ++ A +L  +M   G +
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK

Query:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        PN+  Y S ++   K GN E   KL  E     L+
Subjt:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic (e value: 6.6e-37; % identity: 27.02)
Query:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSE-----SERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRML
        N ++++TLIH L      + A  +I   +      D   Y        +R      L L + + K   +  +   ++  +I AL + K ++ ++ +   +
Subjt:  NLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSE-----SERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRML

Query:  RSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMA
         ++GI P V T NSLI  +      + A  + +++             ++ +++PNV TF+ L+D F ++G +   ++++D++   +  P+ ++YS L+ 
Subjt:  RSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMA

Query:  VLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTL
          C   R+ EA+ ++E M  K    +VV YNT+I GFCKA   +   E +REM   G+     T   LI G    GD D A  ++K M       +  T 
Subjt:  VLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTL

Query:  EGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHE
          L++ LC   +L +AL VF + ++ S   P + T+ ++I  +C+ GK+E  + L   +  KG KPN+ IY + I  + ++G  E  + L +EM E
Subjt:  EGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHE

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400 (e value: 1.0e-37; % identity: 28.04)
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        ++  +I +L   +  D ++ +   + ++G+ P V T +SLI  +   +  + A  + +++             ++ +++PNV TFN L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+ ++YS L+   C   R+ EA+ ++E M  K    +VV YNT+I GFCKA       E +REM   G+     T   LI+G+   
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI
         D D+A +V+K M       N  T   L++ LC   +L +A+ VF + ++ S   PT+ T+ ++I  +C+ GK+E  + L   +  KG KP++ IY + I
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI

Query:  DAYMKEGNAEMVEKLGKEMHE
          + ++G  E  + L ++M E
Subjt:  DAYMKEGNAEMVEKLGKEMHE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial (e value: 4.5e-38; % identity: 28.35)
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        +++ +I  L   K +D ++ + + + ++GI P V T +SLI  +      + A  + +++             ++ +++P+V TF+ L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+  +YS L+   C   R+ EA++++E M  K    DVV YNT+I GFCK    +   E +REM   G+     T   LI G    
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI
        GD D A  ++K+M       N  T   L++ LC   +L +A+ VF + ++ S   PT+ T+ ++I  +C+ GK+E  + L   +  KG KP++  Y + I
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI

Query:  DAYMKEGNAEMVEKLGKEMHE
          + ++G+ E  + L KEM E
Subjt:  DAYMKEGNAEMVEKLGKEMHE

Q9XIM8 Pentatricopeptide repeat-containing protein At2g15980 (e value: 2.2e-125; % identity: 46.91)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P      +   S S  +  SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++ EVFGLD     E  K+ G++ PN  TFN++M  FY++G    V+ IW ++ +     PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL

Query:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS
        M   C    M EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS

Query:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH
        T+E L+E LC +R   R++EA D+   AV ++ F P+   +E+L+  LC++GK++ A  +QA+MVGKGFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH

Query:  E
        E
Subjt:  E

Arabidopsis top hits (e value, % identity, alignment)
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.9e-39; % identity: 25.98)
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++ +   P + T+  LI+ LC+EG ++ A +L  +M   G +
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK

Query:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        PN+  Y S ++   K GN E   KL  E     L+
Subjt:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.9e-39; % identity: 25.98)
Query:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL
        +G L+KR TL    N     +LQ     FS+     P P   P          +  + +V+  +R++   R L    C   F       ++++IK +  L
Subjt:  SGPLLKR-TLRPIGN--STVNLQFSSSFFSSSPPAEPSPSTKPS---------ISTVVSVLTHQRSKSRWRFLNSL-CPNGFDPGEFSDIVLQIKNNPHL

Query:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK
         L FF W +++   + NL S   +IH+    +    A+ +I +     +L  +D++           ++ F+ LV TYK  GS P VFD+  + L+D   
Subjt:  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKK

Query:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD
        L  +  +   + + G+   V + N  +  +SK C     A  +F E               V    C++    E  H    ++LKG  +P+V +++T+++
Subjt:  LDSSIEIVRMLRSRGISPQVSTLNSLILLVSK-CQGANVAYAIFTE---------------VFGLDCEI----EKEH----VKLKGRVSPNVHTFNTLMD

Query:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL
         + + G + +V ++ + +      PNSY Y  ++ +LC   ++ EAEE + EM  + +  D V Y T+I GFCK G+ + A +F+ EM    I     T 
Subjt:  CFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTL

Query:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK
          +I+G+C  GD+  A  ++ +M  K    ++ T   LI   C    + +A  V    ++ +   P + T+  LI+ LC+EG ++ A +L  +M   G +
Subjt:  EHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFK

Query:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS
        PN+  Y S ++   K GN E   KL  E     L+
Subjt:  PNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS

AT1G62670.1 rna processing factor 2 (e value: 3.2e-39; % identity: 28.35)
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        +++ +I  L   K +D ++ + + + ++GI P V T +SLI  +      + A  + +++             ++ +++P+V TF+ L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+  +YS L+   C   R+ EA++++E M  K    DVV YNT+I GFCK    +   E +REM   G+     T   LI G    
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI
        GD D A  ++K+M       N  T   L++ LC   +L +A+ VF + ++ S   PT+ T+ ++I  +C+ GK+E  + L   +  KG KP++  Y + I
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI

Query:  DAYMKEGNAEMVEKLGKEMHE
          + ++G+ E  + L KEM E
Subjt:  DAYMKEGNAEMVEKLGKEMHE

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 7.2e-39; % identity: 28.04)
Query:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR
        ++  +I +L   +  D ++ +   + ++G+ P V T +SLI  +   +  + A  + +++             ++ +++PNV TFN L+D F ++G +  
Subjt:  VFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGR

Query:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT
         ++++D++   +  P+ ++YS L+   C   R+ EA+ ++E M  K    +VV YNT+I GFCKA       E +REM   G+     T   LI+G+   
Subjt:  VKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDT

Query:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI
         D D+A +V+K M       N  T   L++ LC   +L +A+ VF + ++ S   PT+ T+ ++I  +C+ GK+E  + L   +  KG KP++ IY + I
Subjt:  GDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFI

Query:  DAYMKEGNAEMVEKLGKEMHE
          + ++G  E  + L ++M E
Subjt:  DAYMKEGNAEMVEKLGKEMHE

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.6e-126; % identity: 46.91)
Query:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN
        MS  +L+R L P      +   S S  +  SSP   PSP + P IS  VS+LTH RSKSRW  L SL P+GF P +FS+I L ++NNPHL+LRFFL+T+ 
Subjt:  MSGPLLKRTLRPIGNSTVNLQFSSSFFS--SSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQN

Query:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM
         SLC+H+  S STLIHIL+R RL++HA ++I+ A+R A  ++ ++          R LK+F +L+K+Y RCGSAPFVFDLLIK+ LDSK++D ++ ++R 
Subjt:  KSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRM

Query:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL
        LRSRGI+ Q+ST N+LI  VS+ +GA+  Y ++ EVFGLD     E  K+ G++ PN  TFN++M  FY++G    V+ IW ++ +     PN YSY++L
Subjt:  LRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADS-NSIPNSYSYSIL

Query:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS
        M   C    M EAE++WEEMK++ +  D+VAYNT+IGG C      +A+E +R+M L GIE T  T EHL+NGYC  GDVDS L+VY++M+RK F  +  
Subjt:  MAVLCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNAS

Query:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH
        T+E L+E LC +R   R++EA D+   AV ++ F P+   +E+L+  LC++GK++ A  +QA+MVGKGFKP+ + Y++FID Y   G+ E    L  EM 
Subjt:  TLEGLIEVLCAER---RLLEALDVFGFAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMH

Query:  E
        E
Subjt:  E


Sequences
CDS sequence
ATGTCCGGCCCGTTGCTTAAACGAACCCTCCGGCCGATCGGAAACTCCACCGTTAACCTCCAATTTTCCTCTTCCTTTTTCTCATCCTCACCGCCCGCGGAACCTTCGCC
GTCGACGAAACCCTCAATTTCCACTGTCGTTTCAGTTCTCACTCACCAACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCG
AGTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCTCATCTAGCCCTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCG
ACCCTCATCCACATCCTTGCTCGCGGTCGACTCAGAACTCATGCCAAGGATGTTATTCAAACCGCCATTAGGGCTGCGGAGCTCGAAGATAGCGATAATTATTCTGAATC
TGAGCGGTTCTCTTCTTCGAGGCCTTTGAAGCTTTTTGAAACCCTCGTCAAGACATATAAACGGTGTGGCTCTGCCCCCTTTGTGTTTGATTTATTGATTAAAGCTCTTC
TGGATTCTAAAAAGCTCGATTCATCCATTGAAATTGTTAGAATGTTACGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAA
TGCCAGGGGGCTAATGTAGCTTATGCAATTTTTACAGAGGTTTTTGGTTTAGATTGTGAAATAGAGAAAGAACATGTGAAATTGAAGGGTAGAGTTAGTCCTAATGTTCA
TACTTTTAACACATTAATGGACTGTTTTTATCAAGATGGGTTTGTAGGGAGGGTGAAGGAGATTTGGGATCAATTGGCTGATTCAAATTCAATTCCAAACAGCTATAGTT
ATAGTATCCTAATGGCAGTTTTATGTGAAGAGAAGAGAATGGGAGAAGCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAACTTGATGTTGTAGCTTACAAT
ACAATAATTGGAGGATTTTGTAAAGCAGGAAATGCTCAGAGAGCTGAAGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCT
CATCAACGGCTATTGTGACACTGGAGATGTTGATTCTGCATTACTTGTGTACAAGGATATGCGTAGGAAACGGTTTAGTCTCAATGCGTCGACGCTTGAAGGACTTATTG
AAGTGTTGTGTGCTGAGAGAAGGCTTTTAGAAGCTTTAGATGTTTTTGGTTTTGCCGTTGAAGACTCTAGCTTTTGTCCTACAATGGAAACTTTTGAAGTTCTGATAAAT
TGGTTGTGTCAAGAAGGGAAAATTGAAGGTGCATTTAAGCTTCAAGCGCAGATGGTAGGGAAAGGTTTTAAGCCAAATTTGAAGATTTATCAATCGTTTATCGATGCTTA
CATGAAAGAAGGAAATGCAGAAATGGTTGAGAAATTGGGGAAGGAAATGCATGAAATCCAGCTGAGTTGA
mRNA sequence
AACCAATCAAAAGAAAAAAAAAAACTCAAAGAAAATGTCTCAAATAAGAGAAATCATTTAGATTATGAAGGAGAAGTCTCAAGTCGCTAAAATTTTCTAATTGAACAACT
AATCACATCAGAGTACATAACGTAGCACTATGTATATGCATATACAACTCCTTATAGTCACACTAATTACTAATGCTAAAATGATCAATCCAATTAGGTTTAGTATTTCA
ACTAGAGATTTTATAGAAATACCTAATTCATAAAAACATATCATCTAAATAGTATCATTCTCTCAACTCAACTCAATTCTACCGATACATAACCCTAAAATATCAAACAA
AGTGACTTCTAAGAATGTGAGTTACATGTCTGAGTAAAACCATATAAACTTCTAAAATAAATTGATGTATTAAATTGATTATGAATTTTATCAAATCACTAAGGAATAGG
AGAGCCATTTGATACAAACTTATTGAATTGTTTTGTTTAGGTCTCACCTTCTTGCAAAGTTTAATGTCCGGCCCGTTGCTTAAACGAACCCTCCGGCCGATCGGAAACTC
CACCGTTAACCTCCAATTTTCCTCTTCCTTTTTCTCATCCTCACCGCCCGCGGAACCTTCGCCGTCGACGAAACCCTCAATTTCCACTGTCGTTTCAGTTCTCACTCACC
AACGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGTCCCAACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTTCTCCAAATCAAGAACAATCCTCATCTAGCC
CTCCGTTTCTTCCTCTGGACTCAGAACAAATCCCTCTGCAATCACAATCTCATTTCTTACTCGACCCTCATCCACATCCTTGCTCGCGGTCGACTCAGAACTCATGCCAA
GGATGTTATTCAAACCGCCATTAGGGCTGCGGAGCTCGAAGATAGCGATAATTATTCTGAATCTGAGCGGTTCTCTTCTTCGAGGCCTTTGAAGCTTTTTGAAACCCTCG
TCAAGACATATAAACGGTGTGGCTCTGCCCCCTTTGTGTTTGATTTATTGATTAAAGCTCTTCTGGATTCTAAAAAGCTCGATTCATCCATTGAAATTGTTAGAATGTTA
CGGTCTCGTGGGATTAGCCCACAAGTTAGTACGTTGAATTCGTTGATTTTGTTGGTGTCAAAATGCCAGGGGGCTAATGTAGCTTATGCAATTTTTACAGAGGTTTTTGG
TTTAGATTGTGAAATAGAGAAAGAACATGTGAAATTGAAGGGTAGAGTTAGTCCTAATGTTCATACTTTTAACACATTAATGGACTGTTTTTATCAAGATGGGTTTGTAG
GGAGGGTGAAGGAGATTTGGGATCAATTGGCTGATTCAAATTCAATTCCAAACAGCTATAGTTATAGTATCCTAATGGCAGTTTTATGTGAAGAGAAGAGAATGGGAGAA
GCAGAGGAATTGTGGGAAGAAATGAAAATGAAGAAGTTGGAACTTGATGTTGTAGCTTACAATACAATAATTGGAGGATTTTGTAAAGCAGGAAATGCTCAGAGAGCTGA
AGAGTTCTATAGAGAAATGGAACTCAGTGGAATAGAGAGTACTTTCTCCACCCTTGAACATCTCATCAACGGCTATTGTGACACTGGAGATGTTGATTCTGCATTACTTG
TGTACAAGGATATGCGTAGGAAACGGTTTAGTCTCAATGCGTCGACGCTTGAAGGACTTATTGAAGTGTTGTGTGCTGAGAGAAGGCTTTTAGAAGCTTTAGATGTTTTT
GGTTTTGCCGTTGAAGACTCTAGCTTTTGTCCTACAATGGAAACTTTTGAAGTTCTGATAAATTGGTTGTGTCAAGAAGGGAAAATTGAAGGTGCATTTAAGCTTCAAGC
GCAGATGGTAGGGAAAGGTTTTAAGCCAAATTTGAAGATTTATCAATCGTTTATCGATGCTTACATGAAAGAAGGAAATGCAGAAATGGTTGAGAAATTGGGGAAGGAAA
TGCATGAAATCCAGCTGAGTTGAGAAGGGAATTGTATCGCATTGTATCTCTCCATCTCCACAATGCAGCTGATCAAGAAGTTTCTATTGGCAAGAGAGGGAAGAAGATGA
ACTTACCTCCTACAAAGTTGTAACATAGTGGGGGCCTAAACCTAGGTTCCTTTCATCCTTCTAGGGATAGGACTTATGAAAGGTTGGATCATAGGGGGAATCAAATGAGA
GCTTGCAAAGTGTAGACTCTTTGGAGGACAAGATGGCCTCAGCCTCCGAGGCCGATGCGAGCTAGTTGGACCAAGTGGTCTAGTTGCCTCACACTCTCTTCAATGGTGCC
TCTCTTGGATCTGTTTGCAACTCATTTAGTGTCCCTCATTTGGGCTTTCCTCGTAGGTCCAAAATTGTTCCAATCAAATGTGATGCAACCAAACAAATTTATTCTTTAAA
GCAATTGAAAATTTGTGAGGTATTATTAGTTTATTACTTTTTCTTTTCTTGAAAATATTTTGTTTTCGAAAAGTATTTAAATTGAAAGGTAGGTAGAGAGGAGAGAGAGA
GATTGGCAATTGATAGTTGTAAGAAAAGGAACCAAGAGTACTTAGATTCCAACATTTCTTTGTTGTGTACCAAAAATTCTCTAGAAAGACTGTATGTGGATTATGGGTTC
TCAATTCTTGGATCACTTGTGTGAAATGTTCTTTCATATGTTCCTACCCAATGAAAAACAAAAAGTAGCTTCATTTTAATTCTCTTCAAACTTAAAAGATAAAAGCAACT
TTGGTCTTAATGACATTAATAACGTGTAACAATTACAAATATAACCATTAAATTTAAAATATTATATAACAATATTTTTAAAAAGATACAAATATAGTAAAATATGTTAG
AGTCATCAATGGTAAAAGTTTATCATGAACAGACAATATATTATAAATATTAGTCTATTTCTCGATAGAAATCTATTTTTGATAGAAGTCTATCGTAGATATATTTTATT
GTATTTATAAATTTTTTGAATGTTGTTATACACTTGATTATTATTCTTAAAATTGTTATTCATTACAATTACTCTAATAAACATTGGTCATAATGAGCTCCGTATAATGG
AAACATTCATCCTTGGAGATTGAAAGTTCAATTTACCCTTCCGAAACTATTGTACTAAAAAATGGCATTGGTTTGTTTTTTTAATAACTTTTCACCTATTATTGGTAGGA
TAATTATTTTTCTTTTGATATTGTAACTGTATATTTTATTTAATTGTATTGATTGTTTTAATGGGTATAAATTTATATATTATACTTACTAAAACTCTATTTAATTTGTG
GACCAATTTTAAGAAATTTGAAAGCTATAAAATTAAAGACACAGACCTTCAAGTGAATGAGTTTGTAATTTAATATACTGCACCCAGGACAAATGTATGCATCCCAAATC
CTTTGTATGTTTTCTGTTGGCTGCTTCTTCTGTCTGACCTCATCTTTCGAGTTGACCTACGTAAGAGTACTCACACCGTTTGGTGCCAAGTAAACCAAACTCCAATGTAT
AAGTTAGTACCATGTATATATGGTTTAGTTGTACATAAAGAATTATATTAGAGTTATTTTTTGAAATTTACCTGCTCTTCTTTTGGGTTGTCAAATGAGGTTATTTATAA
TTCTCTTCTTCTCACACTTACCTATGGACTGTGATTACGATAACACCTCTGATGTCATATTGATTGTGACCCAAGGTTGAAGAAAATTAGGATCAAAGCTCCGTTTAACT
TTTTTCAATTAATACAATTCCTTTAGCTCGATTGATCGGTCTAGAACTTGTGAGCCCGACTTTTCTCCTTTCATACACATAATTCCCTATGTTGCTATCCGGTTTGGACT
CACCCATAAACTTCGTTTATAATGTTCTTTAAAGAGAAAAAAGGGTAAATGAAAC
Protein sequence
MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYS
TLIHILARGRLRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSK
CQGANVAYAIFTEVFGLDCEIEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAVLCEEKRMGEAEELWEEMKMKKLELDVVAYN
TIIGGFCKAGNAQRAEEFYREMELSGIESTFSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFGFAVEDSSFCPTMETFEVLIN
WLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKEGNAEMVEKLGKEMHEIQLS