CuGenDBv2

CmoCh20G001480 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G001480
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Cmo_Chr20:723059..725599
RNA-Seq Expression: CmoCh20G001480
Synteny: CmoCh20G001480
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e-value | % identity | alignment
KAG6570443.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value 0.0e+00 | identity 99.31%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQY SEMLAQHLEPTE+HYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

XP_022944174.1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X1 [Cucurbita moschata] | e-value 0.0e+00 | identity 98.34%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVK SWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLF ANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANY LNGVALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQY SEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE LFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

XP_022944175.1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 [Cucurbita moschata] | e-value 0.0e+00 | identity 98.89%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIH MGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE LFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

XP_022944177.1 pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita moschata] | e-value 0.0e+00 | identity 89.15%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ-----------------------------------------
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ                                         
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ-----------------------------------------

Query:  -----------------------------------------------VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI
                                                       VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI
Subjt:  -----------------------------------------------VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI

Query:  RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
        RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
Subjt:  RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG

Query:  FESGGLLAEFV
        FESGGLLAEFV
Subjt:  FESGGLLAEFV

XP_022944178.1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 [Cucurbita moschata] | e-value 0.0e+00 | identity 100%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV
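
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below tallies identities from one such middle line. It is only an illustration: the function name `midline_identity` and the toy sequences are assumptions and are not part of the database. The % identity values listed for each hit are those reported by the database, which may be computed over a different aligned region, so this tally is not guaranteed to reproduce them.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Count identical columns in one BLAST-style alignment row.

    query   -- the aligned query residues for this row
    midline -- the match line printed beneath it (letters = identities)
    Returns (identities, columns, percent identity over these columns).
    """
    # Trailing blanks are often stripped from the midline, so pad it
    # back out to the width of the query row before counting.
    padded = midline.ljust(len(query))
    columns = len(query)
    identities = sum(1 for ch in padded[:columns] if ch.isalpha())
    percent = 100.0 * identities / columns if columns else 0.0
    return identities, columns, percent


if __name__ == "__main__":
    # Toy row (hypothetical, not copied from a hit above):
    # 8 identities, 1 conservative substitution (+), 1 mismatch (blank).
    ids, cols, pct = midline_identity("MLQFCIRSFR", "MLQF IRS+R")
    print(f"{ids}/{cols} identical columns = {pct:.2f}%")
```

For a multi-row alignment, the per-row identity and column counts would be summed before dividing, rather than averaging the per-row percentages.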

TrEMBL top hits | e-value | % identity | alignment
A0A6J1FV31 pentatricopeptide repeat-containing protein At4g14170-like isoform X1 | e-value 0.0e+00 | identity 89.15%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ-----------------------------------------
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ                                         
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQ-----------------------------------------

Query:  -----------------------------------------------VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI
                                                       VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI
Subjt:  -----------------------------------------------VLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELI

Query:  RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
        RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
Subjt:  RRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG

Query:  FESGGLLAEFV
        FESGGLLAEFV
Subjt:  FESGGLLAEFV

A0A6J1FW82 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X1 | e-value 0.0e+00 | identity 98.34%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVK SWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLF ANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANY LNGVALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQY SEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE LFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

A0A6J1FXS7 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 | e-value 0.0e+00 | identity 100%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

A0A6J1FYI3 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 | e-value 0.0e+00 | identity 98.89%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
        AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIH MGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE LFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGGLLAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

A0A6J1JDW1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 | e-value 0.0e+00 | identity 96.82%
Query:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
        MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFN+QSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT
Subjt:  MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT

Query:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE
         FLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLN AKKVFDEMSE
Subjt:  AFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSE

Query:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI
        RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Subjt:  RDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI

Query:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN
        IEKNEVSWNSIINGLAFKGHF DAL+VFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGH TEASSIFHN
Subjt:  IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHN

Query:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
        MDGRNIVSWNAMIANY LNGVALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN
Subjt:  MDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN

Query:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG
        TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL KKPDVVSFMGV+SACANLAAVKQVLSACSHGGLVERG QY SEMLAQHLEPTEMHYTCLVDLLG
Subjt:  TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLG

Query:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ
        RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGN++LGCKAAE LFELKPQHCGYYILL+NM+AETGRWD+VNRIRELMKSRGAKKSPGCSWVQIHDQ
Subjt:  RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQ

Query:  LHAFVVDDRAEGFESGGLLAEFV
        LHAFVVDDRAEGFESGG LAEFV
Subjt:  LHAFVVDDRAEGFESGGLLAEFV

SwissProt top hits | e-value | % identity | alignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e-value 8.7e-115 | identity 34.64%
Query:  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLK
        SL++ +++  +   NGL          L+  + ++   +    +F     + +   L++T+++    A    LD  L+ + RM    V+   + F ++LK
Subjt:  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLK

Query:  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISL
        +C D  ++  G E+HG++ K GF  D++    L  +Y  C  +N+A+KVFD M ERD+VSWNT++   S NG  R A      M     ++P+ ++++S+
Subjt:  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISL

Query:  LPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS
        LP  + L    + + IH Y ++ G DSLV    ALVD Y KCGS++ + Q+FD ++E+N VSWNS+I+      + ++A+ +F+ M+D G KP  V++  
Subjt:  LPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS

Query:  ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT
         L    +L   + G+ IH  S+ +G + ++ + NSLI MY K      A+S+F  +  R +VSWNAMI  +  NG  ++A+ +   ++    +P+  T+ 
Subjt:  ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT

Query:  NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSF
        +V+ A A        K IH + +R  L  ++FVT AL DMYAKCG    AR +F+  S +   ++N +I GY        +L LF EM+    KP+ V+F
Subjt:  NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSF

Query:  MGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA
        + VI             SACSH GLVE G +    M   + +E +  HY  +VDLLGRAG + EA + I ++P+ P  N++GA+LGAC+I+ NV    KA
Subjt:  MGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA

Query:  AEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAF
        AE+LFEL P   GY++LLAN++     W++V ++R  M  +G +K+PGCS V+I +++H+F
Subjt:  AEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAF

Q9LFL5 Pentatricopeptide repeat-containing protein At5g16860 | e-value 1.1e-109 | identity 34.73%
Query:  QSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVL
        +++ Q K +H   L  G+L  +++L + LI  Y          +L  +   +    + WN+LIR  S   NG  +  L  +  M       D++TFPFV 
Subjt:  QSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVL

Query:  KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVIS
        K C +   +  G   H +    GF S+V+VGN L+ +Y  C  L+DA+KVFDEMS  DVVSWN++I   +  G  + A   +  MT   G +P+ +++++
Subjt:  KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVIS

Query:  LLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVF----------------
        +LP  A L    + +++HC+ V   +   +   N LVD Y KCG +  +  VF  +  K+ VSWN+++ G +  G F DA+ +F                
Subjt:  LLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVF----------------

Query:  -------------------RMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRM-------GTETDLFIANSLIDMYAKSGHSTEASSIFHNMD
                           R M+ +G KPN VT+ S+L     +     GKEIH ++++        G   +  + N LIDMYAK      A ++F ++ 
Subjt:  -------------------RMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRM-------GTETDLFIANSLIDMYAKSGHSTEASSIFHNMD

Query:  --GRNIVSWNAMIANYVLNGVALEAIRFV--ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTS-DLFVTNALTDMYAKCGCFRSARN
           R++V+W  MI  Y  +G A +A+  +  +  ++   RPNA T +  L ACA    L  GK+IHA  +R    +  LFV+N L DMYAKCG    AR 
Subjt:  --GRNIVSWNAMIANYVLNGVALEAIRFV--ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTS-DLFVTNALTDMYAKCGCFRSARN

Query:  VF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCL
        VF N   K+EV++  L+TGY       E+L +F EMR +  K D V+ +              VL ACSH G++++G +Y + M     + P   HY CL
Subjt:  VF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCL

Query:  VDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV
        VDLLGRAG +  A  LI  +P+ P   +W A L  CRI+G VELG  AAE++ EL   H G Y LL+N++A  GRW +V RIR LM+ +G KK PGCSWV
Subjt:  VDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV

Query:  QIHDQLHAFVVDDR
        +       F V D+
Subjt:  QIHDQLHAFVVDDR

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic | e-value 6.2e-121 | identity 34.5%
Query:  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYN
        R   +  V + L+ LC   ++  +  +V++I  L+ +    V L  + +  + +F +      +F +  +  R  F WN L+  ++  G    + +  Y+
Subjt:  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYN

Query:  RMVRF-GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY
        RM+   GV+ D +TFP VL+ C    D+ +G EVH  V + G++ D+ V N L+ +Y  CG +  A+ +FD M  RD++SWN +I     NG   E    
Subjt:  RMVRF-GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY

Query:  YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL
        +F M   S + P+L+++ S++     L D  + R IH Y++  G    ++ CN+L   Y   GS + + ++F  +  K+ VSW ++I+G  +      A+
Subjt:  YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL

Query:  DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEA
        D +RMM     KP+ +T++++L     L     G E+H  +++    + + +AN+LI+MY+K     +A  IFHN+  +N++SW ++IA   LN    EA
Subjt:  DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEA

Query:  IRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLES
        + F+  ++ +  +PNA+T T  L ACAR G L  GKEIHA  +R G+  D F+ NAL DMY +CG   +A + FN+  KD  S+NIL+TGYSE       
Subjt:  IRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLES

Query:  LNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWG
        + LF  M   R +PD ++F+              +L  CS   +V +G  Y S+M    + P   HY C+VDLLGRAG ++EA + I+++P+ PD  +WG
Subjt:  LNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWG

Query:  ALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR
        ALL ACRI+  ++LG  +A+ +FEL  +  GYYILL N++A+ G+W EV ++R +MK  G     GCSWV++  ++HAF+ DD+
Subjt:  ALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | 7.1e-117 | 35.65
Query:  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQL
        ++L LC +++SL+  KEV      NG +  S +L + L L Y      +    +F +       A  WN L+  + +A +G   G +  + +M+  GV++
Subjt:  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQL

Query:  DDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGI
        D +TF  V K  S    +  G ++HG + K GF     VGN+L+  Y     ++ A+KVFDEM+ERDV+SWN++I     NG   +  + +  M L SGI
Subjt:  DDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGI

Query:  QPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG
        + +L +++S+    A      + R +H   VK         CN L+D Y KCG + ++  VF E+ +++ VS+ S+I G A +G   +A+ +F  M + G
Subjt:  QPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG

Query:  TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQE
          P+  T++++L           GK +H +        D+F++N+L+DMYAK G   EA  +F  M  ++I+SWN +I  Y  N  A EA+  F +LL+E
Subjt:  TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQE

Query:  SGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMR
            P+  T   VLPACA       G+EIH   +R G  SD  V N+L DMYAKCG    A  +F + + KD VS+ ++I GY       E++ LF++MR
Subjt:  SGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMR

Query:  LLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
            + D +SF+              +L ACSH GLV+ GW++ + M  +  +EPT  HY C+VD+L R G + +A   I  +PI PD+ IWGALL  CR
Subjt:  LLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR

Query:  IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD
        I+ +V+L  K AE++FEL+P++ GYY+L+AN++AE  +W++V R+R+ +  RG +K+PGCSW++I  +++ FV  D
Subjt:  IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300 | 2.2e-110 | 34.73
Query:  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGF
        + +SLI  Y ++   +    LF + +Q  +   +WN ++  +  A  G LD  ++ ++ M    +  +  TF  VL +C+  L I  G+++HG+V   G 
Subjt:  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGF

Query:  DSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVK-
        D +  + N+LL +Y  CG  +DA K+F  MS  D V+WN +I     +G   E+  +++ M + SG+ P+ ++  SLLP  +  E+ E  ++IHCYI++ 
Subjt:  DSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVK-

Query:  -VGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS
         + LD  +TS  AL+DAY+KC  V  +  +F +    + V + ++I+G    G + D+L++FR ++     PN +T+ SILPV   L   K G+E+HGF 
Subjt:  -VGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS

Query:  MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAM
        ++ G +    I  ++IDMYAK G    A  IF  +  R+IVSWN+MI     +     AI     +  SG   + V+ +  L ACA       GK IH  
Subjt:  MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAM

Query:  GVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEM-RLLRKKPDVVSFMGVISACANLAAVKQVLSAC
         ++  L SD++  + L DMYAKCG  ++A NVF T   K+ VS+N +I          +SL LF EM      +PD ++F+             +++S+C
Subjt:  GVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEM-RLLRKKPDVVSFMGVISACANLAAVKQVLSAC

Query:  SHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLAN
         H G V+ G ++   M   + ++P + HY C+VDL GRAG + EA E ++ +P  PD+ +WG LLGACR++ NVEL   A+ +L +L P + GYY+L++N
Subjt:  SHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLAN

Query:  MHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD
         HA    W+ V ++R LMK R  +K PG SW++I+ + H FV  D
Subjt:  MHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD

Arabidopsis top hits | e value | %identity | Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | 6.2e-116 | 34.64
Query:  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLK
        SL++ +++  +   NGL          L+  + ++   +    +F     + +   L++T+++    A    LD  L+ + RM    V+   + F ++LK
Subjt:  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLK

Query:  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISL
        +C D  ++  G E+HG++ K GF  D++    L  +Y  C  +N+A+KVFD M ERD+VSWNT++   S NG  R A      M     ++P+ ++++S+
Subjt:  ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISL

Query:  LPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS
        LP  + L    + + IH Y ++ G DSLV    ALVD Y KCGS++ + Q+FD ++E+N VSWNS+I+      + ++A+ +F+ M+D G KP  V++  
Subjt:  LPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS

Query:  ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT
         L    +L   + G+ IH  S+ +G + ++ + NSLI MY K      A+S+F  +  R +VSWNAMI  +  NG  ++A+ +   ++    +P+  T+ 
Subjt:  ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT

Query:  NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSF
        +V+ A A        K IH + +R  L  ++FVT AL DMYAKCG    AR +F+  S +   ++N +I GY        +L LF EM+    KP+ V+F
Subjt:  NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSF

Query:  MGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA
        + VI             SACSH GLVE G +    M   + +E +  HY  +VDLLGRAG + EA + I ++P+ P  N++GA+LGAC+I+ NV    KA
Subjt:  MGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKA

Query:  AEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAF
        AE+LFEL P   GY++LLAN++     W++V ++R  M  +G +K+PGCS V+I +++H+F
Subjt:  AEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAF

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.4e-122 | 34.5
Query:  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYN
        R   +  V + L+ LC   ++  +  +V++I  L+ +    V L  + +  + +F +      +F +  +  R  F WN L+  ++  G    + +  Y+
Subjt:  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYN

Query:  RMVRF-GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY
        RM+   GV+ D +TFP VL+ C    D+ +G EVH  V + G++ D+ V N L+ +Y  CG +  A+ +FD M  RD++SWN +I     NG   E    
Subjt:  RMVRF-GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY

Query:  YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL
        +F M   S + P+L+++ S++     L D  + R IH Y++  G    ++ CN+L   Y   GS + + ++F  +  K+ VSW ++I+G  +      A+
Subjt:  YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL

Query:  DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEA
        D +RMM     KP+ +T++++L     L     G E+H  +++    + + +AN+LI+MY+K     +A  IFHN+  +N++SW ++IA   LN    EA
Subjt:  DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEA

Query:  IRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLES
        + F+  ++ +  +PNA+T T  L ACAR G L  GKEIHA  +R G+  D F+ NAL DMY +CG   +A + FN+  KD  S+NIL+TGYSE       
Subjt:  IRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLES

Query:  LNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWG
        + LF  M   R +PD ++F+              +L  CS   +V +G  Y S+M    + P   HY C+VDLLGRAG ++EA + I+++P+ PD  +WG
Subjt:  LNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWG

Query:  ALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR
        ALL ACRI+  ++LG  +A+ +FEL  +  GYYILL N++A+ G+W EV ++R +MK  G     GCSWV++  ++HAF+ DD+
Subjt:  ALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | 5.1e-118 | 35.65
Query:  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQL
        ++L LC +++SL+  KEV      NG +  S +L + L L Y      +    +F +       A  WN L+  + +A +G   G +  + +M+  GV++
Subjt:  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQL

Query:  DDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGI
        D +TF  V K  S    +  G ++HG + K GF     VGN+L+  Y     ++ A+KVFDEM+ERDV+SWN++I     NG   +  + +  M L SGI
Subjt:  DDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGI

Query:  QPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG
        + +L +++S+    A      + R +H   VK         CN L+D Y KCG + ++  VF E+ +++ VS+ S+I G A +G   +A+ +F  M + G
Subjt:  QPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG

Query:  TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQE
          P+  T++++L           GK +H +        D+F++N+L+DMYAK G   EA  +F  M  ++I+SWN +I  Y  N  A EA+  F +LL+E
Subjt:  TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQE

Query:  SGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMR
            P+  T   VLPACA       G+EIH   +R G  SD  V N+L DMYAKCG    A  +F + + KD VS+ ++I GY       E++ LF++MR
Subjt:  SGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMR

Query:  LLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
            + D +SF+              +L ACSH GLV+ GW++ + M  +  +EPT  HY C+VD+L R G + +A   I  +PI PD+ IWGALL  CR
Subjt:  LLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQ-HLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR

Query:  IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD
        I+ +V+L  K AE++FEL+P++ GYY+L+AN++AE  +W++V R+R+ +  RG +K+PGCSW++I  +++ FV  D
Subjt:  IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.6e-111 | 34.73
Query:  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGF
        + +SLI  Y ++   +    LF + +Q  +   +WN ++  +  A  G LD  ++ ++ M    +  +  TF  VL +C+  L I  G+++HG+V   G 
Subjt:  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGF

Query:  DSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVK-
        D +  + N+LL +Y  CG  +DA K+F  MS  D V+WN +I     +G   E+  +++ M + SG+ P+ ++  SLLP  +  E+ E  ++IHCYI++ 
Subjt:  DSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVK-

Query:  -VGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS
         + LD  +TS  AL+DAY+KC  V  +  +F +    + V + ++I+G    G + D+L++FR ++     PN +T+ SILPV   L   K G+E+HGF 
Subjt:  -VGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS

Query:  MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAM
        ++ G +    I  ++IDMYAK G    A  IF  +  R+IVSWN+MI     +     AI     +  SG   + V+ +  L ACA       GK IH  
Subjt:  MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAM

Query:  GVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEM-RLLRKKPDVVSFMGVISACANLAAVKQVLSAC
         ++  L SD++  + L DMYAKCG  ++A NVF T   K+ VS+N +I          +SL LF EM      +PD ++F+             +++S+C
Subjt:  GVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEM-RLLRKKPDVVSFMGVISACANLAAVKQVLSAC

Query:  SHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLAN
         H G V+ G ++   M   + ++P + HY C+VDL GRAG + EA E ++ +P  PD+ +WG LLGACR++ NVEL   A+ +L +L P + GYY+L++N
Subjt:  SHGGLVERGWQYLSEMLAQH-LEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLAN

Query:  MHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD
         HA    W+ V ++R LMK R  +K PG SW++I+ + H FV  D
Subjt:  MHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 7.8e-111 | 34.73
Query:  QSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVL
        +++ Q K +H   L  G+L  +++L + LI  Y          +L  +   +    + WN+LIR  S   NG  +  L  +  M       D++TFPFV 
Subjt:  QSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVL

Query:  KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVIS
        K C +   +  G   H +    GF S+V+VGN L+ +Y  C  L+DA+KVFDEMS  DVVSWN++I   +  G  + A   +  MT   G +P+ +++++
Subjt:  KICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVIS

Query:  LLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVF----------------
        +LP  A L    + +++HC+ V   +   +   N LVD Y KCG +  +  VF  +  K+ VSWN+++ G +  G F DA+ +F                
Subjt:  LLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVF----------------

Query:  -------------------RMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRM-------GTETDLFIANSLIDMYAKSGHSTEASSIFHNMD
                           R M+ +G KPN VT+ S+L     +     GKEIH ++++        G   +  + N LIDMYAK      A ++F ++ 
Subjt:  -------------------RMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRM-------GTETDLFIANSLIDMYAKSGHSTEASSIFHNMD

Query:  --GRNIVSWNAMIANYVLNGVALEAIRFV--ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTS-DLFVTNALTDMYAKCGCFRSARN
           R++V+W  MI  Y  +G A +A+  +  +  ++   RPNA T +  L ACA    L  GK+IHA  +R    +  LFV+N L DMYAKCG    AR 
Subjt:  --GRNIVSWNAMIANYVLNGVALEAIRFV--ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTS-DLFVTNALTDMYAKCGCFRSARN

Query:  VF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCL
        VF N   K+EV++  L+TGY       E+L +F EMR +  K D V+ +              VL ACSH G++++G +Y + M     + P   HY CL
Subjt:  VF-NTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQH-LEPTEMHYTCL

Query:  VDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV
        VDLLGRAG +  A  LI  +P+ P   +W A L  CRI+G VELG  AAE++ EL   H G Y LL+N++A  GRW +V RIR LM+ +G KK PGCSWV
Subjt:  VDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV

Query:  QIHDQLHAFVVDDR
        +       F V D+
Subjt:  QIHDQLHAFVVDDR


Sequences
CDS sequence
ATGTTACAGTTTTGTATTCGAAGCTTTAGATTTCACTTCGCTCAAATCGCTCGATTCCAATTCAGAAACTTCGTTCGAAGAACAGAGCCAAATTCTTCCGTTCACATCAA
CCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCC
TTATTCTTAATTACGCCAAGTTTCAGCACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCT
CACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATG
TTCTGATTCGCTTGATATTTGCAAGGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGA
ATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGG
GAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATTAGTCTTTTACCCATTTCTGCTGGCCTTGAAGACGAGGAGAT
GACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTGGATGCGTATTGGAAATGTGGGAGTGTGAAAGCTTCAT
GGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAGCTTTTAAGGGTCATTTCCGGGATGCCTTGGATGTTTTTAGGATG
ATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTAT
GAGAATGGGAACAGAAACTGATCTTTTCATTGCAAATTCCCTGATCGATATGTATGCCAAGTCTGGTCACTCAACTGAGGCATCTAGCATATTCCACAACATGGATGGAA
GGAACATAGTTTCTTGGAACGCTATGATAGCTAATTATGTTCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGTCCCAAT
GCAGTGACTTTTACCAATGTTCTTCCTGCTTGTGCACGTTCGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTT
TGTAACCAATGCTCTGACCGACATGTATGCAAAATGTGGTTGCTTTCGTTCTGCTCGAAACGTCTTTAACACTTCCCATAAAGATGAAGTTTCTTATAACATATTAATTA
CAGGATATTCCGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTCGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCA
TGTGCAAACCTAGCTGCAGTCAAGCAAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTGAGCGAGATGTTAGCTCAACATCTTGAACC
CACTGAAATGCACTATACATGTCTGGTGGATCTACTCGGGCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTT
GGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGCAAGGCAGCAGAGCAGTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTT
CTTGCAAACATGCATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGAT
TCATGACCAGCTGCATGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATCAGGTGGTTTACTGGCAGAATTCGTTTGA
mRNA sequence
TACAGTAACCGGCGGGCCGCACAGAGCAATGCCGAAATTCGCTAATTTACAGCAAACGGCCGCAAGCTCTCTTGCTTTCATGATTCCGCCATATTCCCCGCCTTAATGTT
ACAGTTTTGTATTCGAAGCTTTAGATTTCACTTCGCTCAAATCGCTCGATTCCAATTCAGAAACTTCGTTCGAAGAACAGAGCCAAATTCTTCCGTTCACATCAACCTCC
TCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATT
CTTAATTACGCCAAGTTTCAGCACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTC
CATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTG
ATTCGCTTGATATTTGCAAGGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGT
GGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGC
TCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATTAGTCTTTTACCCATTTCTGCTGGCCTTGAAGACGAGGAGATGACAA
GACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTGGATGCGTATTGGAAATGTGGGAGTGTGAAAGCTTCATGGCAA
GTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAGCTTTTAAGGGTCATTTCCGGGATGCCTTGGATGTTTTTAGGATGATGAT
CGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAA
TGGGAACAGAAACTGATCTTTTCATTGCAAATTCCCTGATCGATATGTATGCCAAGTCTGGTCACTCAACTGAGGCATCTAGCATATTCCACAACATGGATGGAAGGAAC
ATAGTTTCTTGGAACGCTATGATAGCTAATTATGTTCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGTCCCAATGCAGT
GACTTTTACCAATGTTCTTCCTGCTTGTGCACGTTCGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAA
CCAATGCTCTGACCGACATGTATGCAAAATGTGGTTGCTTTCGTTCTGCTCGAAACGTCTTTAACACTTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGA
TATTCCGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTCGTAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCATGTGC
AAACCTAGCTGCAGTCAAGCAAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTGAGCGAGATGTTAGCTCAACATCTTGAACCCACTG
AAATGCACTATACATGTCTGGTGGATCTACTCGGGCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGA
GCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGCAAGGCAGCAGAGCAGTTATTTGAGCTAAAGCCTCAGCATTGTGGATACTATATTCTTCTTGC
AAACATGCATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATG
ACCAGCTGCATGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATCAGGTGGTTTACTGGCAGAATTCGTTTGA
Protein sequence
MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRA
HSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYR
EARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRM
MIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPN
AVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPDVVSFMGVISA
CANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYIL
LANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAEFV