CuGenDBv2

Sgr013045 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr013045
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153653:46338..47837
RNA-Seq Expression: Sgr013045
Synteny: Sgr013045
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
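
The genome location field packs a contig name and a coordinate range into one string. As a minimal sketch, the Python below splits that string and derives the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("tig00153653:46338..47837")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(contig, start, end, span_length)
```

If the database instead used 0-based, half-open coordinates, the `+ 1` would have to be dropped.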


Homology
GenBank top hits | e value | % identity
KAG6571131.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-238 | 82.6
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKR+L    I NS F LPF PSFFSSSP+  PSPS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVL IKNN HL LRFFLWT++
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNHDLVSYST+IHILARGRLRTHAK VIQTAIRAT LED +  SKC++F   RPLKLFETLVKTYK+CGSAPFVFDLLIK+L+DSKKL+PA+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQIGTLNS+ILW+SKCEGA AGYA+FREVFGL+C ++E+NVK+KA+VSPNVHTFNTLMVCFYQDGLVGRVKEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLE+D VAYNTIIGGFCKAGNIRRAEEFFREMEL G ESTFSTFE+LINGYCE+GDVDSALLVYK+MRRK FS+N   
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LEA+ RGLCA+TRLLEALDV GFATED+N CPT+ETYELLIN LCQ+GK+EAAFK Q+QMVGKGFKPN KIY +FIDAY+KEGNEEMV+KL +E+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

XP_022148504.1 pentatricopeptide repeat-containing protein At2g15980 [Momordica charantia] | 4.6e-245 | 84.74
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKRTLS  SIRN  FKLPF PSF SS      SPSAKPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVLHIKNNPHL+LRFFLWTQN
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLR
        KSLC H+LVSYST+IHILARGRLRTHAK VIQTAIRA ELEDD+  S CKQF RPL+LF+TLVKTYKRCGSAPFVFDLLIK+L+DS+KLEPA+QI+RMLR
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLR

Query:  SRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAV
        SRGISPQ+ TLNS+ILWVSKCEGA AGYAIFREVFGLDC VKEE VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQLT+SNSIPNSYSYSILM V
Subjt:  SRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAV

Query:  FCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLE
        FC+++RMV+AE+LWKEMR+KKLELD VAYNTIIGGFCKAG+I+RAEE FREMELSGIESTFSTFE+LINGYCETGD+DSALLVYK+MRRK+FSLNASTLE
Subjt:  FCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLE

Query:  AIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        AI+RGL A+TRLLEALDV GF TEDSNFCPT+ETYELLIN LC+EG+IEAAFK QAQMVGKGFKP+ K+Y +FIDAYT EGNEEMVEKL KELLEIQL
Subjt:  AIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

XP_022944388.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita moschata] | 4.9e-239 | 83
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKR+L    I NS F LPF PSFFSSSP+  PSPS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVL IKNN HL LRFFLWT++
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNHDLVSYST+IHILARGRLRT AK VIQTAIRAT LED +D SKC++F   RPLKLFETLVKTYK+CGSAPFVFDLLIK+L+DSKKL+PA+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQIGTLNS+ILW+SKCEGA AGYA+FREVFGL+C ++E+NVK+KA+VSPNVHTFNTLMVCFYQDGLVGR KEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLE+D VAYNTIIGGFCKAGNIRRAEEFFREMEL G ESTFSTFE+LINGYCETGDVDSALLVYK+MRRK FSLN   
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LEA+ RGLCA+TRLLEALD+ GFATED+N CPT+ETYELLIN LCQEGK+EAAFK QAQMVGKGFKPN KIY +FIDAY+KEGNEEMV+KL +E+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

XP_023512842.1 pentatricopeptide repeat-containing protein At2g15980 [Cucurbita pepo subsp. pepo] | 3.7e-239 | 83
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKR+L    I NS F LPF PSFFSSSP+  PSPS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVL IKNN HL L FFLWT++
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNHDLVSYST+IHILARGRLRTHAK VIQTAIRAT LED +D S C++F   RPLKLFETLVKTYK+CGSAPFVFDLLIK+L+DSKKL+PA+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQIGTLNS+ILW+SKCEGA AGYA+FREVFGL+C ++E+NVK+KA+VSPNVHTFNTLMVCFYQDGLVGRVKEIWDQL DS SIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLE+D VAYNTIIGGFCKAGN+RRAEEFFREMEL G ESTFSTFE+LINGYCETGDVDSALLVYK+MRRK FSLN   
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LEAI RGLCA+TRLLEALDV GFA E +NFCPT+ETYELLIN LCQEGK+EAAFK QAQMVGKGFKPN KIY +FIDAY+KEGNEEMV+KLG+E+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

XP_038901621.1 pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida] | 2.0e-240 | 84.2
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MS+PLL+RTL    IRNS F LPF  SFFSSSP  EPSPS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDI+L IKNNPHLALRFF WTQN
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNH+LVSYST+IHILARGRLRTHAK VIQTAIRA ELED +D SKC++F   RPLKLFETLVKTYKRCGSAPFVFDLLIK+L+DSKKLE A+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQ+GTLNS+IL VSK +GA AGYAIF+EVFGLDC ++EENVKLKA VSPNVHTFNTLM CFYQDGLVGRVK+IWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AVFCEEKRM +AE+LW EM++KKLELD VAYNTIIGGFCKAGN+ RAEEF+REMELSGIESTFSTFE+LINGYCETGDVDSALLVYK+MRRK F+ NA  
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LE +IRGLCA+TRLLEALDV  FA EDSNFCPT+ETYELLIN LCQEGKIE AFK QAQMVGKGFKPNLKIY +FIDAY KEGNEEMVEKLGKE+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
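
In the alignments above, the unlabelled middle line follows the usual BLASTP convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those symbols for the first ten columns of the first GenBank hit. The helper name `midline_counts` is hypothetical, and the ten-column prefix is illustrative only; it is not expected to reproduce the reported % identity, which is computed over the full alignment.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identical, positives) for a BLASTP midline string."""
    # Letters in the midline mark identical residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    return identical, positives


# First ten alignment columns of the first GenBank hit (KAG6571131.1).
query_prefix = "MSIPLLKRTL"
midline_prefix = "MSIPLLKR+L"

identical, positives = midline_counts(midline_prefix)
aligned_columns = len(query_prefix)
percent_identity = 100.0 * identical / aligned_columns
print(identical, positives, aligned_columns, percent_identity)
```

Applied to a complete alignment, the denominator would be the total number of aligned columns, gaps included.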

TrEMBL top hits | e value | % identity
A0A5A7TQU6 Pentatricopeptide repeat-containing protein | 9.0e-231 | 80.8
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MS PLLKRTL    I NS   L F  SFFSSSP  EPSPS KPSISTVVSVLTH RSKSRWR+LNSLCP+GFDPGEFSDIVL IKNNPHLALRFFLWTQN
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNH+L+SYST+IHILARGRLRTHAK VIQTAIRA ELED ++YS+ ++F   RPLKLFETLVKTYKRCGSAPFVFDLLIK+L+DSKKL+ +++IVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQ+ TLNS+IL VSKC+GA   YAIF EVFGLDC +++E+VKLK +VSPNVHTFNTLM CFYQDG VGRVKEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLELD VAYNTIIGGFCKAGN +RAEEF+REMELSGIESTFST E+LINGYC+TGDVDSALLVYK+MRRK FSLNAST
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LE +I  LCA+ RLLEALDV GFA EDS+FCPT+ET+E+LIN LCQEGKIE AFK QAQMVGKGFKPNLKIY +FIDAY KEGN EMVEKLGKE+ EIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

A0A5D3CQ25 Pentatricopeptide repeat-containing protein | 9.0e-231 | 80.8
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MS PLLKRTL    I NS   L F  SFFSSSP  EPSPS KPSISTVVSVLTH RSKSRWR+LNSLCP+GFDPGEFSDIVL IKNNPHLALRFFLWTQN
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNH+L+SYST+IHILARGRLRTHAK VIQTAIRA ELED ++YS+ ++F   RPLKLFETLVKTYKRCGSAPFVFDLLIK+L+DSKKL+ +++IVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQ+ TLNS+IL VSKC+GA   YAIF EVFGLDC +++E+VKLK +VSPNVHTFNTLM CFYQDG VGRVKEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLELD VAYNTIIGGFCKAGN +RAEEF+REMELSGIESTFST E+LINGYC+TGDVDSALLVYK+MRRK FSLNAST
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LE +I  LCA+ RLLEALDV GFA EDS+FCPT+ET+E+LIN LCQEGKIE AFK QAQMVGKGFKPNLKIY +FIDAY KEGN EMVEKLGKE+ EIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

A0A6J1D472 pentatricopeptide repeat-containing protein At2g15980 | 2.2e-245 | 84.74
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKRTLS  SIRN  FKLPF PSF SS      SPSAKPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVLHIKNNPHL+LRFFLWTQN
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLR
        KSLC H+LVSYST+IHILARGRLRTHAK VIQTAIRA ELEDD+  S CKQF RPL+LF+TLVKTYKRCGSAPFVFDLLIK+L+DS+KLEPA+QI+RMLR
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLR

Query:  SRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAV
        SRGISPQ+ TLNS+ILWVSKCEGA AGYAIFREVFGLDC VKEE VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQLT+SNSIPNSYSYSILM V
Subjt:  SRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAV

Query:  FCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLE
        FC+++RMV+AE+LWKEMR+KKLELD VAYNTIIGGFCKAG+I+RAEE FREMELSGIESTFSTFE+LINGYCETGD+DSALLVYK+MRRK+FSLNASTLE
Subjt:  FCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLE

Query:  AIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        AI+RGL A+TRLLEALDV GF TEDSNFCPT+ETYELLIN LC+EG+IEAAFK QAQMVGKGFKP+ K+Y +FIDAYT EGNEEMVEKL KELLEIQL
Subjt:  AIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

A0A6J1FY05 pentatricopeptide repeat-containing protein At2g15980 | 2.4e-239 | 83
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKR+L    I NS F LPF PSFFSSSP+  PSPS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVL IKNN HL LRFFLWT++
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNHDLVSYST+IHILARGRLRT AK VIQTAIRAT LED +D SKC++F   RPLKLFETLVKTYK+CGSAPFVFDLLIK+L+DSKKL+PA+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQIGTLNS+ILW+SKCEGA AGYA+FREVFGL+C ++E+NVK+KA+VSPNVHTFNTLMVCFYQDGLVGR KEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLE+D VAYNTIIGGFCKAGNIRRAEEFFREMEL G ESTFSTFE+LINGYCETGDVDSALLVYK+MRRK FSLN   
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LEA+ RGLCA+TRLLEALD+ GFATED+N CPT+ETYELLIN LCQEGK+EAAFK QAQMVGKGFKPN KIY +FIDAY+KEGNEEMV+KL +E+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

A0A6J1JGQ1 pentatricopeptide repeat-containing protein At2g15980 | 6.9e-239 | 82.8
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN
        MSIPLLKR+L    I NS F LPF PSFFSSSP+  P PS KPSISTVVSVLTHHRSKSRWR+LNSLCP GFDPGEFSDIVL IKNN HL LRFFLWT++
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQN

Query:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM
        KSLCNHDLVSYST+IHILARGRLRTHAK VIQ AIRAT LEDD+D S+C++F   RPLKLFETLVKTYK+CGSAPFVFDLLIK+L+DSKKL+PA+QIVRM
Subjt:  KSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQF--PRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRM

Query:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM
        LRSRGISPQIGTLNS+IL +SKCEGA AGYA+FREVFGL+C ++EENVK+KA+ SPNVHTFNTLMVCFYQDGLVGRVKEIWDQL DSNSIPNSYSYSILM
Subjt:  LRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILM

Query:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST
        AV CEEKRM +AE+LW+EM+MKKLE+D VAYNTIIGGFCKAGN+RRAEEFFREMEL G ESTFSTFE+LINGYCETGDVDSALLVYK+MRRK FSLN   
Subjt:  AVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNAST

Query:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL
        LEAI RGLC +TRLLEALDV GFATE +NFCPT+ETYELLIN LCQ+GK+EAAFK QAQMVGKGFKPN KIY +FIDAY+KEGNEEMV+KLG+E+LEIQL
Subjt:  LEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQL

SwissProt top hits | e value | % identity
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial | 1.3e-37 | 25.76
Query:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL
        L+KR TLSS       F L      FS+     P P   P          +  + +V+   R++   R   SL P+   F       +++ IK +  L L
Subjt:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL

Query:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA
         FF W +++   + +L S   +IH+    +    A+++I +     +L   + +         ++ F+ LV TYK  GS P VFD+  + L+D   L  A
Subjt:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA

Query:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ
         ++   + + G+   + + N  +  +SK C        +FRE    G+   V   N+                     +LK   +P+V +++T++  + +
Subjt:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ

Query:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI
         G + +V ++ + +      PNSY Y  ++ + C   ++ +AE+ + EM  + +  DTV Y T+I GFCK G+IR A +FF EM    I     T+  +I
Subjt:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI

Query:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK
        +G+C+ GD+  A  ++  M  K    ++ T   +I G C    + +A  V     + +   P + TY  LI+ LC+EG +++A +   +M   G +PN+ 
Subjt:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK

Query:  IYHAFIDAYTKEGN-EEMVEKLGK
         Y++ ++   K GN EE V+ +G+
Subjt:  IYHAFIDAYTKEGN-EEMVEKLGK

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | 3.8e-37 | 25.11
Query:  SFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTH
        S F+SSPSD        +         HH S +            F P   S+++L  +N+  L L+F  W          L      +HIL + +L   
Subjt:  SFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTH

Query:  AKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKA
        A+ + +     T    D++Y+          +F++L +TY  C S   VFDL++KS      ++ A+ IV + ++ G  P + + N+V+        +K 
Subjt:  AKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKA

Query:  GYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDT
          +    VF      KE    L+++VSPNV T+N L+  F   G +     ++D++     +PN  +Y+ L+  +C+ +++     L + M +K LE + 
Subjt:  GYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDT

Query:  VAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDS
        ++YN +I G C+ G ++       EM   G      T+  LI GYC+ G+   AL+++  M R   + +  T  ++I  +C    +  A++ +       
Subjt:  VAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDS

Query:  NFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEE
          CP   TY  L++   Q+G +  A++   +M   GF P++  Y+A I+ +   G  E
Subjt:  NFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEE

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400 | 6.6e-37 | 27.41
Query:  VFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGR
        ++  +I SL   +  + A+ +   + ++G+ P + T +S+I  +   E       +  ++             ++ K++PNV TFN L+  F ++G +  
Subjt:  VFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGR

Query:  VKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCET
         ++++D++   +  P+ ++YS L+  FC   R+ +A+ +++ M  K    + V YNT+I GFCKA  I    E FREM   G+     T+  LI+G+ + 
Subjt:  VKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCET

Query:  GDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFI
         D D+A +V+K M       N  T   ++ GLC + +L +A+ V  +  + S   PTI TY ++I  +C+ GK+E  +     +  KG KP++ IY+  I
Subjt:  GDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFI

Query:  DAYTKEGNEEMVEKLGKELLE
          + ++G +E  + L +++ E
Subjt:  DAYTKEGNEEMVEKLGKELLE

Q9SZ10 Pentatricopeptide repeat-containing protein At4g26680, mitochondrial | 1.1e-39 | 23.85
Query:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA
        +P  K      V+V   H  +S W  LN L  H  D     +++L I+ +  L+L FF W + ++  +H L +++ ++H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA

Query:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL
          ++             P K+F+ L+ +Y+ C S P VFD L K+    KK   A      ++  G  P + + N+   ++S   G             +
Subjt:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL

Query:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC
        D  ++      + K+SPN +T N +M  + + G + +  E+   +          SY+ L+A  CE+  +  A  L   M    L+ + V +NT+I GFC
Subjt:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC

Query:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL
        +A  ++ A + F EM+   +     T+  LINGY + GD + A   Y++M       +  T  A+I GLC   +  +A   +    +  N  P   T+  
Subjt:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL

Query:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL
        LI   C     +  F+    M+  G  PN + ++  + A+ +  + +   ++ +E++
Subjt:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL

Q9XIM8 Pentatricopeptide repeat-containing protein At2g15980 | 7.3e-129 | 47.32
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFS----SSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFL
        MS  +L+R L          + P P +  S    ++ S  PSP + P IS  VS+LTHHRSKSRW  L SL P GF P +FS+I L ++NNPHL+LRFFL
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFS----SSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFL

Query:  WTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIV
        +T+  SLC+HD  S ST+IHIL+R RL++HA  +I+ A+R    ++DED        R LK+F +L+K+Y RCGSAPFVFDLLIKS +DSK+++ AV ++
Subjt:  WTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIV

Query:  RMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDS-NSIPNSYSYS
        R LRSRGI+ QI T N++I  VS+  GA  GY ++REVFGLD    +E  K+  K+ PN  TFN++MV FY++G    V+ IW ++ +     PN YSY+
Subjt:  RMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDS-NSIPNSYSYS

Query:  ILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLN
        +LM  +C    M +AE +W+EM+++ +  D VAYNT+IGG C    + +A+E FR+M L GIE T  T+E+L+NGYC+ GDVDS L+VY+ M+RK F  +
Subjt:  ILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLN

Query:  ASTLEAIIRGLCAD---TRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKE
          T+EA++ GLC D    R++EA D++  A  ++ F P+   YELL+  LC++GK++ A   QA+MVGKGFKP+ + Y AFID Y   G+EE    L  E
Subjt:  ASTLEAIIRGLCAD---TRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKE

Query:  LLE
        + E
Subjt:  LLE

Arabidopsis top hits | e value | % identity
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 9.4e-39 | 25.76
Query:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL
        L+KR TLSS       F L      FS+     P P   P          +  + +V+   R++   R   SL P+   F       +++ IK +  L L
Subjt:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL

Query:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA
         FF W +++   + +L S   +IH+    +    A+++I +     +L   + +         ++ F+ LV TYK  GS P VFD+  + L+D   L  A
Subjt:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA

Query:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ
         ++   + + G+   + + N  +  +SK C        +FRE    G+   V   N+                     +LK   +P+V +++T++  + +
Subjt:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ

Query:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI
         G + +V ++ + +      PNSY Y  ++ + C   ++ +AE+ + EM  + +  DTV Y T+I GFCK G+IR A +FF EM    I     T+  +I
Subjt:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI

Query:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK
        +G+C+ GD+  A  ++  M  K    ++ T   +I G C    + +A  V     + +   P + TY  LI+ LC+EG +++A +   +M   G +PN+ 
Subjt:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK

Query:  IYHAFIDAYTKEGN-EEMVEKLGK
         Y++ ++   K GN EE V+ +G+
Subjt:  IYHAFIDAYTKEGN-EEMVEKLGK

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein | 9.4e-39 | 25.76
Query:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL
        L+KR TLSS       F L      FS+     P P   P          +  + +V+   R++   R   SL P+   F       +++ IK +  L L
Subjt:  LLKR-TLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPS---------ISTVVSVLTHHRSKSRWRYLNSLCPH--GFDPGEFSDIVLHIKNNPHLAL

Query:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA
         FF W +++   + +L S   +IH+    +    A+++I +     +L   + +         ++ F+ LV TYK  GS P VFD+  + L+D   L  A
Subjt:  RFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPA

Query:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ
         ++   + + G+   + + N  +  +SK C        +FRE    G+   V   N+                     +LK   +P+V +++T++  + +
Subjt:  VQIVRMLRSRGISPQIGTLNSVILWVSK-CEGAKAGYAIFREV--FGLDCGVKEENV---------------------KLKAKVSPNVHTFNTLMVCFYQ

Query:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI
         G + +V ++ + +      PNSY Y  ++ + C   ++ +AE+ + EM  + +  DTV Y T+I GFCK G+IR A +FF EM    I     T+  +I
Subjt:  DGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLI

Query:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK
        +G+C+ GD+  A  ++  M  K    ++ T   +I G C    + +A  V     + +   P + TY  LI+ LC+EG +++A +   +M   G +PN+ 
Subjt:  NGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLK

Query:  IYHAFIDAYTKEGN-EEMVEKLGK
         Y++ ++   K GN EE V+ +G+
Subjt:  IYHAFIDAYTKEGN-EEMVEKLGK

AT2G15980.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 5.2e-130 | 47.32
Query:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFS----SSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFL
        MS  +L+R L          + P P +  S    ++ S  PSP + P IS  VS+LTHHRSKSRW  L SL P GF P +FS+I L ++NNPHL+LRFFL
Subjt:  MSIPLLKRTLSSPSIRNSNFKLPFPPSFFS----SSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFL

Query:  WTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIV
        +T+  SLC+HD  S ST+IHIL+R RL++HA  +I+ A+R    ++DED        R LK+F +L+K+Y RCGSAPFVFDLLIKS +DSK+++ AV ++
Subjt:  WTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIV

Query:  RMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDS-NSIPNSYSYS
        R LRSRGI+ QI T N++I  VS+  GA  GY ++REVFGLD    +E  K+  K+ PN  TFN++MV FY++G    V+ IW ++ +     PN YSY+
Subjt:  RMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDS-NSIPNSYSYS

Query:  ILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLN
        +LM  +C    M +AE +W+EM+++ +  D VAYNT+IGG C    + +A+E FR+M L GIE T  T+E+L+NGYC+ GDVDS L+VY+ M+RK F  +
Subjt:  ILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLN

Query:  ASTLEAIIRGLCAD---TRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKE
          T+EA++ GLC D    R++EA D++  A  ++ F P+   YELL+  LC++GK++ A   QA+MVGKGFKP+ + Y AFID Y   G+EE    L  E
Subjt:  ASTLEAIIRGLCAD---TRLLEALDVIGFATEDSNFCPTIETYELLINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKE

Query:  LLE
        + E
Subjt:  LLE

AT4G26680.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 7.7e-41 | 23.85
Query:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA
        +P  K      V+V   H  +S W  LN L  H  D     +++L I+ +  L+L FF W + ++  +H L +++ ++H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA

Query:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL
          ++             P K+F+ L+ +Y+ C S P VFD L K+    KK   A      ++  G  P + + N+   ++S   G             +
Subjt:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL

Query:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC
        D  ++      + K+SPN +T N +M  + + G + +  E+   +          SY+ L+A  CE+  +  A  L   M    L+ + V +NT+I GFC
Subjt:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC

Query:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL
        +A  ++ A + F EM+   +     T+  LINGY + GD + A   Y++M       +  T  A+I GLC   +  +A   +    +  N  P   T+  
Subjt:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL

Query:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL
        LI   C     +  F+    M+  G  PN + ++  + A+ +  + +   ++ +E++
Subjt:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL

AT4G26680.2 Tetratricopeptide repeat (TPR)-like superfamily protein    7.7e-41    23.85
Query:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA
        +P  K      V+V   H  +S W  LN L  H  D     +++L I+ +  L+L FF W + ++  +H L +++ ++H L + R    A+++++  +  
Subjt:  SPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVSYSTIIHILARGRLRTHAKTVIQTAIRA

Query:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL
          ++             P K+F+ L+ +Y+ C S P VFD L K+    KK   A      ++  G  P + + N+   ++S   G             +
Subjt:  TELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSKCEGAKAGYAIFREVFGL

Query:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC
        D  ++      + K+SPN +T N +M  + + G + +  E+   +          SY+ L+A  CE+  +  A  L   M    L+ + V +NT+I GFC
Subjt:  DCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYNTIIGGFC

Query:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL
        +A  ++ A + F EM+   +     T+  LINGY + GD + A   Y++M       +  T  A+I GLC   +  +A   +    +  N  P   T+  
Subjt:  KAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYEL

Query:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL
        LI   C     +  F+    M+  G  PN + ++  + A+ +  + +   ++ +E++
Subjt:  LINSLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELL


Sequences
CDS sequence
ATGTCCATTCCGCTGCTGAAACGCACCCTCTCGTCACCGTCAATCCGAAACTCCAACTTTAAGCTCCCATTTCCCCCTTCCTTCTTCTCCTCCTCACCGTCCGACGAACC
TTCGCCGTCGGCAAAACCCTCAATTTCGACCGTGGTTTCCGTTCTCACTCACCACCGATCAAAATCTCGCTGGAGATACCTCAACTCTCTGTGCCCCCATGGCTTTGATC
CCGGCGAGTTTTCTGACATCGTCCTCCACATCAAGAACAATCCCCATCTTGCCCTCCGCTTTTTCCTCTGGACTCAGAACAAGTCCCTCTGCAATCACGACCTTGTTTCT
TACTCAACCATCATCCACATCCTTGCCCGCGGTCGACTCAGAACTCACGCCAAGACTGTTATTCAGACCGCCATTAGGGCCACCGAGCTAGAAGATGATGAAGATTATTC
CAAATGCAAGCAATTTCCGAGGCCTCTGAAGCTGTTTGAGACCCTCGTGAAGACGTATAAACGGTGTGGGTCTGCTCCCTTCGTGTTTGATTTATTGATTAAATCCCTCA
TAGATTCTAAAAAGCTCGAGCCGGCCGTTCAAATTGTGAGAATGTTGAGGTCTCGTGGGATTAGCCCACAAATTGGTACGTTGAATTCAGTGATTTTATGGGTGTCGAAG
TGCGAGGGGGCTAAAGCAGGTTATGCCATTTTTAGAGAGGTTTTTGGCTTAGATTGTGGAGTTAAGGAAGAAAATGTGAAATTGAAGGCTAAGGTTAGTCCCAATGTGCA
TACTTTTAATACATTAATGGTGTGTTTTTATCAAGATGGTTTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACTGATTCAAATTCAATTCCAAACAGTTACAGTT
ATAGTATTCTGATGGCGGTTTTCTGTGAAGAAAAAAGAATGGTTCAAGCAGAGGATTTGTGGAAAGAAATGAGAATGAAGAAGTTGGAGCTTGATACTGTAGCTTACAAC
ACTATAATTGGAGGGTTTTGTAAAGCAGGAAATATTCGAAGGGCTGAAGAGTTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGTATCT
CATCAATGGCTATTGTGAGACTGGAGATGTTGACTCTGCATTACTTGTGTATAAGAATATGCGCAGGAAACATTTCAGTCTCAACGCGTCGACATTGGAAGCAATTATTA
GAGGGTTATGTGCTGACACTAGGCTTTTAGAAGCTTTAGATGTTATCGGGTTTGCCACGGAAGACTCTAACTTCTGCCCAACAATTGAAACTTACGAACTTCTGATAAAT
AGTTTGTGTCAGGAAGGGAAAATTGAAGCTGCATTTAAGTTTCAGGCACAGATGGTTGGGAAAGGATTTAAACCGAATTTGAAGATTTACCATGCTTTTATCGATGCCTA
TACAAAAGAAGGAAACGAAGAAATGGTCGAGAAGTTGGGGAAGGAATTACTTGAAATCCAGCTGAGGTGA
mRNA sequence
ATGTCCATTCCGCTGCTGAAACGCACCCTCTCGTCACCGTCAATCCGAAACTCCAACTTTAAGCTCCCATTTCCCCCTTCCTTCTTCTCCTCCTCACCGTCCGACGAACC
TTCGCCGTCGGCAAAACCCTCAATTTCGACCGTGGTTTCCGTTCTCACTCACCACCGATCAAAATCTCGCTGGAGATACCTCAACTCTCTGTGCCCCCATGGCTTTGATC
CCGGCGAGTTTTCTGACATCGTCCTCCACATCAAGAACAATCCCCATCTTGCCCTCCGCTTTTTCCTCTGGACTCAGAACAAGTCCCTCTGCAATCACGACCTTGTTTCT
TACTCAACCATCATCCACATCCTTGCCCGCGGTCGACTCAGAACTCACGCCAAGACTGTTATTCAGACCGCCATTAGGGCCACCGAGCTAGAAGATGATGAAGATTATTC
CAAATGCAAGCAATTTCCGAGGCCTCTGAAGCTGTTTGAGACCCTCGTGAAGACGTATAAACGGTGTGGGTCTGCTCCCTTCGTGTTTGATTTATTGATTAAATCCCTCA
TAGATTCTAAAAAGCTCGAGCCGGCCGTTCAAATTGTGAGAATGTTGAGGTCTCGTGGGATTAGCCCACAAATTGGTACGTTGAATTCAGTGATTTTATGGGTGTCGAAG
TGCGAGGGGGCTAAAGCAGGTTATGCCATTTTTAGAGAGGTTTTTGGCTTAGATTGTGGAGTTAAGGAAGAAAATGTGAAATTGAAGGCTAAGGTTAGTCCCAATGTGCA
TACTTTTAATACATTAATGGTGTGTTTTTATCAAGATGGTTTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACTGATTCAAATTCAATTCCAAACAGTTACAGTT
ATAGTATTCTGATGGCGGTTTTCTGTGAAGAAAAAAGAATGGTTCAAGCAGAGGATTTGTGGAAAGAAATGAGAATGAAGAAGTTGGAGCTTGATACTGTAGCTTACAAC
ACTATAATTGGAGGGTTTTGTAAAGCAGGAAATATTCGAAGGGCTGAAGAGTTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGTATCT
CATCAATGGCTATTGTGAGACTGGAGATGTTGACTCTGCATTACTTGTGTATAAGAATATGCGCAGGAAACATTTCAGTCTCAACGCGTCGACATTGGAAGCAATTATTA
GAGGGTTATGTGCTGACACTAGGCTTTTAGAAGCTTTAGATGTTATCGGGTTTGCCACGGAAGACTCTAACTTCTGCCCAACAATTGAAACTTACGAACTTCTGATAAAT
AGTTTGTGTCAGGAAGGGAAAATTGAAGCTGCATTTAAGTTTCAGGCACAGATGGTTGGGAAAGGATTTAAACCGAATTTGAAGATTTACCATGCTTTTATCGATGCCTA
TACAAAAGAAGGAAACGAAGAAATGGTCGAGAAGTTGGGGAAGGAATTACTTGAAATCCAGCTGAGGTGA
Protein sequence
MSIPLLKRTLSSPSIRNSNFKLPFPPSFFSSSPSDEPSPSAKPSISTVVSVLTHHRSKSRWRYLNSLCPHGFDPGEFSDIVLHIKNNPHLALRFFLWTQNKSLCNHDLVS
YSTIIHILARGRLRTHAKTVIQTAIRATELEDDEDYSKCKQFPRPLKLFETLVKTYKRCGSAPFVFDLLIKSLIDSKKLEPAVQIVRMLRSRGISPQIGTLNSVILWVSK
CEGAKAGYAIFREVFGLDCGVKEENVKLKAKVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLTDSNSIPNSYSYSILMAVFCEEKRMVQAEDLWKEMRMKKLELDTVAYN
TIIGGFCKAGNIRRAEEFFREMELSGIESTFSTFEYLINGYCETGDVDSALLVYKNMRRKHFSLNASTLEAIIRGLCADTRLLEALDVIGFATEDSNFCPTIETYELLIN
SLCQEGKIEAAFKFQAQMVGKGFKPNLKIYHAFIDAYTKEGNEEMVEKLGKELLEIQLR
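
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not a tool provided by the database: the `translate` helper and `CODON_TABLE` name are hypothetical, and it assumes the standard (nuclear) codon table applies and that the CDS starts in frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the nuclear standard table applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Drop line breaks and whitespace so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCCATTCCGCTGCTGAAACGC"))  # MSIPLLKR
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the listed protein sequence gives a quick consistency check of the record.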