; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001785 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001785
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold10:450038..452092
RNA-Seq ExpressionSpg001785
SyntenySpg001785
Gene Ontology termsGO:0008380 - RNA splicing (biological process)
GO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577831.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.79Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  + LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG VDVARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+RFDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV NGC+VFEHM+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVP+VG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

KAG7015869.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.79Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  + LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG VDVARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+RFDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV NGC+VFEHM+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVP+VG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

XP_022923215.1 pentatricopeptide repeat-containing protein At2g13600 [Cucurbita moschata]0.0e+0090.79Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  + LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG VDVARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+RFDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV NGC+VFEHM+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLL+EGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVPYVG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

XP_022965365.1 pentatricopeptide repeat-containing protein At2g13600 [Cucurbita maxima]0.0e+0090.2Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  K LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG V VARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+ FDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV +GC+VFE M+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK +IEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNV+RIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVPYVG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

XP_023552034.1 pentatricopeptide repeat-containing protein At2g13600 [Cucurbita pepo subsp. pepo]0.0e+0090.35Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  + LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG VDVARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+RFDEAL YF +MH HGF MNEYSFGSALSACA LQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV NGC+VFEHM+ERDCVSWNAMIVGYAQNGFGNK LG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMR+RGV+K PGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVP VG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

TrEMBL top hitse value%identityAlignment
A0A0A0KJ63 Uncharacterized protein0.0e+0090.2Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NGL KHL GDLL LDSSPF+KLLNQCARS+SARDTSRVHACIIKSPFASE FIQNRLIDVYGKCG VDVARKLFD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH RFDEAL YFA+MH HGFL+NEYSFGSALSACAGLQDLKLGSQIHSL+YRSNYLSDVYMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RV+ AQSVFD MTVRSRVSWNSLITCYEQNGPVDEAL IFVEMIKCGVEPDEVTLASVVSACATI AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR IFD MPIRSVVSETSMVSGYAK S VK ARYMFSNMMVKD+ITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRFQYG++SDVFVGNSLIDMYMKCGSV NGC+VF+HM+E+DCVSWNAMIVGYAQNGFGNKAL VF KMLESGE PDHVTMIGVLC
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGR+YFRSM+AQHGL+PLKDHYTCMVDLLGRAG LEEAK LIEEM M+PDAIVWGSLLAACKVHRNI+LGEYVV+KLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAE  DW NVVR+RKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +KKEIYM+LRT+LQ MK+AGYVPYVGSNE DE+
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

A0A5A7VCZ2 Pentatricopeptide repeat-containing protein0.0e+0089.33Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MARNGL KHL GD L LDSSPF+KLLNQC RS+SARDTSRVHACIIKSPFASE FIQNRLIDVYGKCG VDVARKLFD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMPE DQCSWNSMISGFEQH RF EAL YFA+MH HGFL+NEYSFGSALSACAGLQDLKLGSQIHSL+YRSNYLSDVYMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RV+ AQS FD MTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACATI AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR IFD MPIRSVVSETSMVSGYAK S VK AR MFSNMMVKD+ITWNALIAGCTQNGENEEALILFRLLKRES+WPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRFQYG++SDVFVGNSLIDMYMKCGSV NGC+VF+HM+ERDCVSWNAMIVGYAQNGFGNKAL VF+KMLESGE PDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGR+YFRSM+AQHGL+PLKDHYTCMVDLLGRAG LEEAK LIEEM M+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAE  DW NVVR+RKLMR+RGVIKQPGCSWIEIQG+LNVFMVKDKRH +KKEI M+LRT+L  MK+AGYVPY GSNE DE+
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

A0A6J1E1T7 pentatricopeptide repeat-containing protein At2g13600-like0.0e+0090.79Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MARNGL KHLT D L LDSS FAKLLNQC  SKSARDTS VHACIIK PFASE FIQNRLIDVYGKCG VDVARKLFD LL+RNIFSWN+IICA+TK GF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAV IFE+MPE DQCSWNSMISGFEQH+RFDEALNYFA+MH+HGFLMNEYSFGSALSACAGL+D KLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIF EMIKC VE DEVTLASVVSACATI AI EGQQIHARVVKCDEFR+DLILGNALVDMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRIN+AR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRFQ G+ESD+FVGNSLIDMYMKCGSV NGCKVFEHMV+RDCVSWNAMIVGYAQNGFGN+ALGVF+KMLE GEKPDHVTMIGVLC
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYF+SMSAQHGLV LKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDA++WGSLLAACKVHRNI+LGEYVVEKLLEVD E SGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMRKRGV+K PGCSWIEIQGQLNVFMVKDK+HPKKKEIY LLRTLL+ M+R GYVPYV SNEIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

A0A6J1EB52 pentatricopeptide repeat-containing protein At2g136000.0e+0090.79Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  + LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG VDVARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+RFDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV NGC+VFEHM+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLL+EGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK LIEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNVVRIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVPYVG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

A0A6J1HNH1 pentatricopeptide repeat-containing protein At2g136000.0e+0090.2Show/hide
Query:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF
        MA NG  K LTGDLL LDSSP +KLLNQCARSKSARDTSRVHACIIKSPFASE+FIQNRLIDVYGKCG V VARK+FD +L+RNIFSWN+IICAFTKSGF
Subjt:  MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGF

Query:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG
        LDDAVHIFEKMP+ DQCSWNSMISGFEQH+ FDEAL YF +MH HGF MNEYSFGSALSACAGLQDLK+GSQIHSLIYRSNYLSD+YMGSALVDMYSKCG
Subjt:  LDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCG

Query:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK
        RVDCA+SVFDGMTVRSRVSWNSLITCYEQNGPVDEAL IFVEMI+CGVEPDEVTLASVVSACAT+ AIKEGQQIHARVVKCDEFR+DLILGNAL+DMYAK
Subjt:  RVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAK

Query:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD
        CNRINEAR +FDRMPIRSVVSETSMVSGYAK SSVKAAR MFSNMMVKD+ITWNALIAGCTQNGENEEAL LFRLLKRESVWPTHYTFGNLLNACANLAD
Subjt:  CNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLAD

Query:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC
        LQLGRQAHSHVLKHGFRF+YG ESD+FVGNSLIDMYMKCGSV +GC+VFE M+ERDCVSWNAMIVGYAQNGFGNKALG+F++MLESGEKPDHVTMIGVL 
Subjt:  LQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLC

Query:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL
        ACSHAGLLDEGRHYFRSM A+HGLVPLKDHYTCMVDLLGRAGCLEEAK +IEEMPM+PDAIVWGSLLAACKVHRNI+LGEYVVEKLLEVDPENSGPYVLL
Subjt:  ACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLL

Query:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE
        SNMYAERGDWGNV+RIRKLMR+RGV+KQPGCSWIEIQG+LNVFMVKDKRH +K+EIYMLLRTLLQ MKRAGYVPYVG++EIDEE
Subjt:  SNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDEE

SwissProt top hitse value%identityAlignment
Q9FRI5 Pentatricopeptide repeat-containing protein At1g253606.9e-13036.98Show/hide
Query:  DLLSLDSSPFAKLLNQC--ARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEK
        DL+   ++ +A  L  C   R  S +    VH  II   F     I NRLIDVY K   ++ AR+LFD + + +  +   ++  +  SG +  A  +FEK
Subjt:  DLLSLDSSPFAKLLNQC--ARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEK

Query:  MPEA--DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGR----VD
         P    D   +N+MI+GF  +     A+N F +M   GF  + ++F S L+  A +  D K   Q H+   +S       + +ALV +YSKC      + 
Subjt:  MPEA--DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGR----VD

Query:  CAQSVFDGMTVRSRVSWNSLITCYEQNGPVD--------------------------------EALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEG
         A+ VFD +  +   SW +++T Y +NG  D                                EAL +   M+  G+E DE T  SV+ ACAT   ++ G
Subjt:  CAQSVFDGMTVRSRVSWNSLITCYEQNGPVD--------------------------------EALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEG

Query:  QQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALI
        +Q+HA V++ ++F       N+LV +Y KC + +EAR IF++MP + +VS  +++SGY  +  +  A+ +F  M  K+I++W  +I+G  +NG  EE L 
Subjt:  QQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALI

Query:  LFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNG
        LF  +KRE   P  Y F   + +CA L     G+Q H+ +LK GF      +S +  GN+LI MY KCG V    +VF  M   D VSWNA+I    Q+G
Subjt:  LFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNG

Query:  FGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACK
         G +A+ V+ +ML+ G +PD +T++ VL ACSHAGL+D+GR YF SM   + + P  DHY  ++DLL R+G   +A+ +IE +P +P A +W +LL+ C+
Subjt:  FGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACK

Query:  VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAG
        VH N+ELG    +KL  + PE+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  CSWIE++ Q++ F+V D  HP+ + +Y+ L+ L + M+R G
Subjt:  VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAG

Query:  YVP
        YVP
Subjt:  YVP

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.7e-12635.66Show/hide
Query:  LSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEA
        +S D   F   L+ CA+S++  +  ++H  I+K  +A ++F+QN L+  Y +CG +D ARK+FD + +RN+ SW ++IC + +  F  DAV +F +M   
Subjt:  LSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEA

Query:  DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTV
        ++ + NS+                              +    +SACA L+DL+ G ++++ I  S    +  M SALVDMY KC  +D A+ +FD    
Subjt:  DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTV

Query:  RSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRM
         +    N++ + Y + G   EAL +F  M+  GV PD +++ S +S+C+ +  I  G+  H  V++ + F     + NAL+DMY KC+R + A  IFDRM
Subjt:  RSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRM

Query:  PIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILF-RLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLK
          ++VV+  S+V+GY +   V AA   F  M  K+I++WN +I+G  Q    EEA+ +F  +  +E V     T  ++ +AC +L  L L +  + ++ K
Subjt:  PIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILF-RLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLK

Query:  HGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRH
        +G +       DV +G +L+DM+ +CG   +   +F  +  RD  +W A I   A  G   +A+ +F+ M+E G KPD V  +G L ACSH GL+ +G+ 
Subjt:  HGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRH

Query:  YFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNV
         F SM   HG+ P   HY CMVDLLGRAG LEEA +LIE+MPM P+ ++W SLLAAC+V  N+E+  Y  EK+  + PE +G YVLLSN+YA  G W ++
Subjt:  YFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNV

Query:  VRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE
         ++R  M+++G+ K PG S I+I+G+ + F   D+ HP+   I  +L  + Q     G+VP + +   ++DE+
Subjt:  VRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220702.1e-13435.96Show/hide
Query:  LLNQCARSKSARDTSR-VHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMI
        LL +     + R T++ VH  +IKS     +++ N L++VY K G    ARKLFD +  R  FSWN ++ A++K G +D     F+++P+ D  SW +MI
Subjt:  LLNQCARSKSARDTSR-VHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMI

Query:  SGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSL
         G++   ++ +A+     M   G    +++  + L++ A  + ++ G ++HS I +     +V + ++L++MY+KCG    A+ VFD M VR   SWN++
Subjt:  SGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSL

Query:  ITCYEQNGPVD-------------------------------EALVIFVEMIKCG-VEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILG
        I  + Q G +D                                AL IF +M++   + PD  TLASV+SACA +  +  G+QIH+ +V        ++L 
Subjt:  ITCYEQNGPVD-------------------------------EALVIFVEMIKCG-VEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILG

Query:  NALVDMYAKCNRINEARTIFDRMPIRSVVSE--TSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFG
        NAL+ MY++C  +  AR + ++   + +  E  T+++ GY K   +  A+ +F ++  +D++ W A+I G  Q+G   EA+ LFR +      P  YT  
Subjt:  NALVDMYAKCNRINEARTIFDRMPIRSVVSE--TSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFG

Query:  NLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHM-VERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGE
         +L+  ++LA L  G+Q H   +K G  +       V V N+LI MY K G++ +  + F+ +  ERD VSW +MI+  AQ+G   +AL +F  ML  G 
Subjt:  NLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHM-VERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGE

Query:  KPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLE
        +PDH+T +GV  AC+HAGL+++GR YF  M     ++P   HY CMVDL GRAG L+EA++ IE+MP+ PD + WGSLL+AC+VH+NI+LG+   E+LL 
Subjt:  KPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLE

Query:  VDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE
        ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+D  HP+K EIYM ++ +   +K+ GYVP   S  ++++EE
Subjt:  VDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136008.4e-26965.98Show/hide
Query:  LTGDLLSL-DSSPFAKLLNQCARSK-SARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHI
        L  DL S  DSSPFAKLL+ C +SK SA     VHA +IKS F++EIFIQNRLID Y KCG ++  R++FD +  RNI++WN+++   TK GFLD+A  +
Subjt:  LTGDLLSL-DSSPFAKLLNQCARSK-SARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHI

Query:  FEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQS
        F  MPE DQC+WNSM+SGF QH+R +EAL YFA MH  GF++NEYSF S LSAC+GL D+  G Q+HSLI +S +LSDVY+GSALVDMYSKCG V+ AQ 
Subjt:  FEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQS

Query:  VFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEA
        VFD M  R+ VSWNSLITC+EQNGP  EAL +F  M++  VEPDEVTLASV+SACA++ AIK GQ++H RVVK D+ R+D+IL NA VDMYAKC+RI EA
Subjt:  VFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEA

Query:  RTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA
        R IFD MPIR+V++ETSM+SGYA  +S KAAR MF+ M  +++++WNALIAG TQNGENEEAL LF LLKRESV PTHY+F N+L ACA+LA+L LG QA
Subjt:  RTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA

Query:  HSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGL
        H HVLKHGF+FQ G+E D+FVGNSLIDMY+KCG V  G  VF  M+ERDCVSWNAMI+G+AQNG+GN+AL +F +MLESGEKPDH+TMIGVL AC HAG 
Subjt:  HSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGL

Query:  LDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER
        ++EGRHYF SM+   G+ PL+DHYTCMVDLLGRAG LEEAK +IEEMPM+PD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVLLSNMYAE 
Subjt:  LDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER

Query:  GDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDE
        G W +V+ +RK MRK GV KQPGCSWI+IQG  +VFMVKDK HP+KK+I+ LL  L+  M+       +GS   +E
Subjt:  GDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDE

Q9SY02 Pentatricopeptide repeat-containing protein At4g027506.3e-13139.9Show/hide
Query:  NRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSA
        N +I  Y + G  ++ARKLFD + +R++ SWN +I  + ++  L  A  +FE MPE D CSWN+M+SG+ Q+   D+A + F RM       N+ S+ + 
Subjt:  NRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSA

Query:  LSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLAS
        LSA   +Q+ K+  +   ++++S     +   + L+  + K  ++  A+  FD M VR  VSWN++IT Y Q+G +DEA  +F E        D  T  +
Subjt:  LSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLAS

Query:  VVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALI
        +VS       ++E +++  ++ + +E     +  NA++  Y +  R+  A+ +FD MP R+V +  +M++GYA+   +  A+ +F  M  +D ++W A+I
Subjt:  VVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALI

Query:  AGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDC
        AG +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+      E+  FVGN+L+ MY KCGS+     +F+ M  +D 
Subjt:  AGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDC

Query:  VSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMR
        VSWN MI GY+++GFG  AL  F  M   G KPD  TM+ VL ACSH GL+D+GR YF +M+  +G++P   HY CMVDLLGRAG LE+A  L++ MP  
Subjt:  VSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMR

Query:  PDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIY
        PDA +WG+LL A +VH N EL E   +K+  ++PENSG YVLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ HP+K EI+
Subjt:  PDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIY

Query:  MLLRTLLQLMKRAGYV--PYVGSNEIDEE
          L  L   MK+AGYV    V  ++++EE
Subjt:  MLLRTLLQLMKRAGYV--PYVGSNEIDEE

Arabidopsis top hitse value%identityAlignment
AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-13136.98Show/hide
Query:  DLLSLDSSPFAKLLNQC--ARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEK
        DL+   ++ +A  L  C   R  S +    VH  II   F     I NRLIDVY K   ++ AR+LFD + + +  +   ++  +  SG +  A  +FEK
Subjt:  DLLSLDSSPFAKLLNQC--ARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEK

Query:  MPEA--DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGR----VD
         P    D   +N+MI+GF  +     A+N F +M   GF  + ++F S L+  A +  D K   Q H+   +S       + +ALV +YSKC      + 
Subjt:  MPEA--DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGL-QDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGR----VD

Query:  CAQSVFDGMTVRSRVSWNSLITCYEQNGPVD--------------------------------EALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEG
         A+ VFD +  +   SW +++T Y +NG  D                                EAL +   M+  G+E DE T  SV+ ACAT   ++ G
Subjt:  CAQSVFDGMTVRSRVSWNSLITCYEQNGPVD--------------------------------EALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEG

Query:  QQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALI
        +Q+HA V++ ++F       N+LV +Y KC + +EAR IF++MP + +VS  +++SGY  +  +  A+ +F  M  K+I++W  +I+G  +NG  EE L 
Subjt:  QQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALI

Query:  LFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNG
        LF  +KRE   P  Y F   + +CA L     G+Q H+ +LK GF      +S +  GN+LI MY KCG V    +VF  M   D VSWNA+I    Q+G
Subjt:  LFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNG

Query:  FGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACK
         G +A+ V+ +ML+ G +PD +T++ VL ACSHAGL+D+GR YF SM   + + P  DHY  ++DLL R+G   +A+ +IE +P +P A +W +LL+ C+
Subjt:  FGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACK

Query:  VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAG
        VH N+ELG    +KL  + PE+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  CSWIE++ Q++ F+V D  HP+ + +Y+ L+ L + M+R G
Subjt:  VHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAG

Query:  YVP
        YVP
Subjt:  YVP

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein6.0e-27065.98Show/hide
Query:  LTGDLLSL-DSSPFAKLLNQCARSK-SARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHI
        L  DL S  DSSPFAKLL+ C +SK SA     VHA +IKS F++EIFIQNRLID Y KCG ++  R++FD +  RNI++WN+++   TK GFLD+A  +
Subjt:  LTGDLLSL-DSSPFAKLLNQCARSK-SARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHI

Query:  FEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQS
        F  MPE DQC+WNSM+SGF QH+R +EAL YFA MH  GF++NEYSF S LSAC+GL D+  G Q+HSLI +S +LSDVY+GSALVDMYSKCG V+ AQ 
Subjt:  FEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQS

Query:  VFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEA
        VFD M  R+ VSWNSLITC+EQNGP  EAL +F  M++  VEPDEVTLASV+SACA++ AIK GQ++H RVVK D+ R+D+IL NA VDMYAKC+RI EA
Subjt:  VFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEA

Query:  RTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA
        R IFD MPIR+V++ETSM+SGYA  +S KAAR MF+ M  +++++WNALIAG TQNGENEEAL LF LLKRESV PTHY+F N+L ACA+LA+L LG QA
Subjt:  RTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA

Query:  HSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGL
        H HVLKHGF+FQ G+E D+FVGNSLIDMY+KCG V  G  VF  M+ERDCVSWNAMI+G+AQNG+GN+AL +F +MLESGEKPDH+TMIGVL AC HAG 
Subjt:  HSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGL

Query:  LDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER
        ++EGRHYF SM+   G+ PL+DHYTCMVDLLGRAG LEEAK +IEEMPM+PD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVLLSNMYAE 
Subjt:  LDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAER

Query:  GDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDE
        G W +V+ +RK MRK GV KQPGCSWI+IQG  +VFMVKDK HP+KK+I+ LL  L+  M+       +GS   +E
Subjt:  GDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGSNEIDE

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein1.5e-13535.96Show/hide
Query:  LLNQCARSKSARDTSR-VHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMI
        LL +     + R T++ VH  +IKS     +++ N L++VY K G    ARKLFD +  R  FSWN ++ A++K G +D     F+++P+ D  SW +MI
Subjt:  LLNQCARSKSARDTSR-VHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMI

Query:  SGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSL
         G++   ++ +A+     M   G    +++  + L++ A  + ++ G ++HS I +     +V + ++L++MY+KCG    A+ VFD M VR   SWN++
Subjt:  SGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSL

Query:  ITCYEQNGPVD-------------------------------EALVIFVEMIKCG-VEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILG
        I  + Q G +D                                AL IF +M++   + PD  TLASV+SACA +  +  G+QIH+ +V        ++L 
Subjt:  ITCYEQNGPVD-------------------------------EALVIFVEMIKCG-VEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILG

Query:  NALVDMYAKCNRINEARTIFDRMPIRSVVSE--TSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFG
        NAL+ MY++C  +  AR + ++   + +  E  T+++ GY K   +  A+ +F ++  +D++ W A+I G  Q+G   EA+ LFR +      P  YT  
Subjt:  NALVDMYAKCNRINEARTIFDRMPIRSVVSE--TSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFG

Query:  NLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHM-VERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGE
         +L+  ++LA L  G+Q H   +K G  +       V V N+LI MY K G++ +  + F+ +  ERD VSW +MI+  AQ+G   +AL +F  ML  G 
Subjt:  NLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHM-VERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGE

Query:  KPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLE
        +PDH+T +GV  AC+HAGL+++GR YF  M     ++P   HY CMVDL GRAG L+EA++ IE+MP+ PD + WGSLL+AC+VH+NI+LG+   E+LL 
Subjt:  KPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLE

Query:  VDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE
        ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+D  HP+K EIYM ++ +   +K+ GYVP   S  ++++EE
Subjt:  VDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.9e-12735.66Show/hide
Query:  LSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEA
        +S D   F   L+ CA+S++  +  ++H  I+K  +A ++F+QN L+  Y +CG +D ARK+FD + +RN+ SW ++IC + +  F  DAV +F +M   
Subjt:  LSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEA

Query:  DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTV
        ++ + NS+                              +    +SACA L+DL+ G ++++ I  S    +  M SALVDMY KC  +D A+ +FD    
Subjt:  DQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTV

Query:  RSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRM
         +    N++ + Y + G   EAL +F  M+  GV PD +++ S +S+C+ +  I  G+  H  V++ + F     + NAL+DMY KC+R + A  IFDRM
Subjt:  RSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRM

Query:  PIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILF-RLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLK
          ++VV+  S+V+GY +   V AA   F  M  K+I++WN +I+G  Q    EEA+ +F  +  +E V     T  ++ +AC +L  L L +  + ++ K
Subjt:  PIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILF-RLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLK

Query:  HGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRH
        +G +       DV +G +L+DM+ +CG   +   +F  +  RD  +W A I   A  G   +A+ +F+ M+E G KPD V  +G L ACSH GL+ +G+ 
Subjt:  HGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRH

Query:  YFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNV
         F SM   HG+ P   HY CMVDLLGRAG LEEA +LIE+MPM P+ ++W SLLAAC+V  N+E+  Y  EK+  + PE +G YVLLSN+YA  G W ++
Subjt:  YFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNV

Query:  VRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE
         ++R  M+++G+ K PG S I+I+G+ + F   D+ HP+   I  +L  + Q     G+VP + +   ++DE+
Subjt:  VRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLLRTLLQLMKRAGYVPYVGS--NEIDEE

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.5e-13239.9Show/hide
Query:  NRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSA
        N +I  Y + G  ++ARKLFD + +R++ SWN +I  + ++  L  A  +FE MPE D CSWN+M+SG+ Q+   D+A + F RM       N+ S+ + 
Subjt:  NRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEKMPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSA

Query:  LSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLAS
        LSA   +Q+ K+  +   ++++S     +   + L+  + K  ++  A+  FD M VR  VSWN++IT Y Q+G +DEA  +F E        D  T  +
Subjt:  LSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSWNSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLAS

Query:  VVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALI
        +VS       ++E +++  ++ + +E     +  NA++  Y +  R+  A+ +FD MP R+V +  +M++GYA+   +  A+ +F  M  +D ++W A+I
Subjt:  VVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYAKTSSVKAARYMFSNMMVKDIITWNALI

Query:  AGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDC
        AG +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+      E+  FVGN+L+ MY KCGS+     +F+ M  +D 
Subjt:  AGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCGSVANGCKVFEHMVERDC

Query:  VSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMR
        VSWN MI GY+++GFG  AL  F  M   G KPD  TM+ VL ACSH GL+D+GR YF +M+  +G++P   HY CMVDLLGRAG LE+A  L++ MP  
Subjt:  VSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKLIEEMPMR

Query:  PDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIY
        PDA +WG+LL A +VH N EL E   +K+  ++PENSG YVLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ HP+K EI+
Subjt:  PDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIY

Query:  MLLRTLLQLMKRAGYV--PYVGSNEIDEE
          L  L   MK+AGYV    V  ++++EE
Subjt:  MLLRTLLQLMKRAGYV--PYVGSNEIDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGGAATGGATTGCCTAAACATCTCACGGGTGACCTTTTATCTCTTGATTCGTCGCCTTTTGCCAAGCTCTTGAACCAGTGTGCTCGCTCCAAGTCAGCTAGAGA
CACGAGTCGTGTACATGCTTGCATAATTAAATCGCCCTTTGCTTCCGAAATTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGACGTGTGGACGTTGCTC
GCAAGTTGTTTGATAATTTGCTTGACAGAAATATTTTCTCTTGGAACGCCATCATTTGTGCATTCACTAAGTCCGGGTTTCTTGATGATGCTGTCCACATCTTTGAGAAG
ATGCCTGAAGCTGATCAATGCTCATGGAATTCTATGATTTCGGGGTTTGAACAACATGAACGCTTTGATGAAGCTTTAAATTATTTTGCTCGAATGCATGCTCATGGTTT
TTTGATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGCGCAGGTTTACAAGATTTGAAATTGGGTTCCCAAATCCATAGTTTAATATATAGGTCAAATTATTTAT
CAGATGTGTATATGGGTTCTGCTCTAGTTGATATGTACTCTAAATGTGGAAGAGTTGACTGTGCTCAGAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGG
AACAGCTTGATTACGTGTTATGAACAGAATGGTCCAGTTGATGAAGCTCTTGTTATTTTTGTTGAGATGATCAAATGTGGGGTTGAACCTGATGAGGTAACTCTTGCAAG
TGTTGTTAGTGCATGTGCAACTATCTTGGCGATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAGATGATCTTATTTTAGGCAATGCAT
TGGTTGATATGTATGCAAAATGTAACAGGATTAACGAGGCTAGAACAATTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGCTATGCG
AAAACATCAAGTGTTAAAGCTGCAAGATATATGTTCTCAAATATGATGGTGAAAGATATAATTACTTGGAATGCACTTATTGCAGGGTGTACACAAAATGGAGAGAATGA
AGAGGCACTTATACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTG
GCCGACAGGCTCACTCTCATGTTTTAAAGCATGGATTTCGATTCCAATATGGAAAAGAGTCGGATGTTTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGA
TCAGTTGCGAATGGTTGTAAGGTGTTTGAACATATGGTGGAAAGGGATTGTGTCTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAATAAGGCCCT
TGGAGTTTTCAATAAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTTTGTGCTTGTAGTCATGCCGGGTTGCTTGATGAAGGTCGCCATT
ACTTTCGATCAATGAGTGCACAACATGGTTTGGTGCCCTTAAAGGACCATTACACATGTATGGTTGATTTACTTGGCCGAGCTGGCTGCCTTGAAGAAGCAAAAAAACTA
ATAGAGGAAATGCCAATGCGGCCTGATGCTATTGTCTGGGGATCCTTGCTTGCCGCTTGTAAAGTCCATCGGAACATCGAATTGGGGGAATATGTAGTGGAGAAGCTTTT
AGAGGTAGATCCAGAGAATTCTGGGCCATATGTTCTTCTCTCAAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTGTGAGAATAAGAAAGCTGATGAGAAAGAGAG
GAGTGATTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTCAGTTGAATGTTTTTATGGTTAAAGATAAAAGGCATCCGAAGAAGAAAGAAATCTACATGCTTTTG
AGAACACTTCTACAACTGATGAAACGAGCTGGATATGTCCCATATGTTGGCAGCAATGAGATTGATGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGGAATGGATTGCCTAAACATCTCACGGGTGACCTTTTATCTCTTGATTCGTCGCCTTTTGCCAAGCTCTTGAACCAGTGTGCTCGCTCCAAGTCAGCTAGAGA
CACGAGTCGTGTACATGCTTGCATAATTAAATCGCCCTTTGCTTCCGAAATTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGACGTGTGGACGTTGCTC
GCAAGTTGTTTGATAATTTGCTTGACAGAAATATTTTCTCTTGGAACGCCATCATTTGTGCATTCACTAAGTCCGGGTTTCTTGATGATGCTGTCCACATCTTTGAGAAG
ATGCCTGAAGCTGATCAATGCTCATGGAATTCTATGATTTCGGGGTTTGAACAACATGAACGCTTTGATGAAGCTTTAAATTATTTTGCTCGAATGCATGCTCATGGTTT
TTTGATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGCGCAGGTTTACAAGATTTGAAATTGGGTTCCCAAATCCATAGTTTAATATATAGGTCAAATTATTTAT
CAGATGTGTATATGGGTTCTGCTCTAGTTGATATGTACTCTAAATGTGGAAGAGTTGACTGTGCTCAGAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGG
AACAGCTTGATTACGTGTTATGAACAGAATGGTCCAGTTGATGAAGCTCTTGTTATTTTTGTTGAGATGATCAAATGTGGGGTTGAACCTGATGAGGTAACTCTTGCAAG
TGTTGTTAGTGCATGTGCAACTATCTTGGCGATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAGATGATCTTATTTTAGGCAATGCAT
TGGTTGATATGTATGCAAAATGTAACAGGATTAACGAGGCTAGAACAATTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGCTATGCG
AAAACATCAAGTGTTAAAGCTGCAAGATATATGTTCTCAAATATGATGGTGAAAGATATAATTACTTGGAATGCACTTATTGCAGGGTGTACACAAAATGGAGAGAATGA
AGAGGCACTTATACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTG
GCCGACAGGCTCACTCTCATGTTTTAAAGCATGGATTTCGATTCCAATATGGAAAAGAGTCGGATGTTTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGA
TCAGTTGCGAATGGTTGTAAGGTGTTTGAACATATGGTGGAAAGGGATTGTGTCTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAATAAGGCCCT
TGGAGTTTTCAATAAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTTTGTGCTTGTAGTCATGCCGGGTTGCTTGATGAAGGTCGCCATT
ACTTTCGATCAATGAGTGCACAACATGGTTTGGTGCCCTTAAAGGACCATTACACATGTATGGTTGATTTACTTGGCCGAGCTGGCTGCCTTGAAGAAGCAAAAAAACTA
ATAGAGGAAATGCCAATGCGGCCTGATGCTATTGTCTGGGGATCCTTGCTTGCCGCTTGTAAAGTCCATCGGAACATCGAATTGGGGGAATATGTAGTGGAGAAGCTTTT
AGAGGTAGATCCAGAGAATTCTGGGCCATATGTTCTTCTCTCAAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTGTGAGAATAAGAAAGCTGATGAGAAAGAGAG
GAGTGATTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTCAGTTGAATGTTTTTATGGTTAAAGATAAAAGGCATCCGAAGAAGAAAGAAATCTACATGCTTTTG
AGAACACTTCTACAACTGATGAAACGAGCTGGATATGTCCCATATGTTGGCAGCAATGAGATTGATGAAGAATAG
Protein sequenceShow/hide protein sequence
MARNGLPKHLTGDLLSLDSSPFAKLLNQCARSKSARDTSRVHACIIKSPFASEIFIQNRLIDVYGKCGRVDVARKLFDNLLDRNIFSWNAIICAFTKSGFLDDAVHIFEK
MPEADQCSWNSMISGFEQHERFDEALNYFARMHAHGFLMNEYSFGSALSACAGLQDLKLGSQIHSLIYRSNYLSDVYMGSALVDMYSKCGRVDCAQSVFDGMTVRSRVSW
NSLITCYEQNGPVDEALVIFVEMIKCGVEPDEVTLASVVSACATILAIKEGQQIHARVVKCDEFRDDLILGNALVDMYAKCNRINEARTIFDRMPIRSVVSETSMVSGYA
KTSSVKAARYMFSNMMVKDIITWNALIAGCTQNGENEEALILFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFQYGKESDVFVGNSLIDMYMKCG
SVANGCKVFEHMVERDCVSWNAMIVGYAQNGFGNKALGVFNKMLESGEKPDHVTMIGVLCACSHAGLLDEGRHYFRSMSAQHGLVPLKDHYTCMVDLLGRAGCLEEAKKL
IEEMPMRPDAIVWGSLLAACKVHRNIELGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVVRIRKLMRKRGVIKQPGCSWIEIQGQLNVFMVKDKRHPKKKEIYMLL
RTLLQLMKRAGYVPYVGSNEIDEE