; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G03180 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G03180
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr4:1972372..1975995
RNA-Seq ExpressionCSPI04G03180
SyntenyCSPI04G03180
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044580.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0084.36Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK+            G  N K          +Q      +   L L    
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR

Query:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP
              ++N   +G ++ KK      +   ++K DLSRVKHQ NTNHIK WS A       +KDCRI KLDSS CFTMKSFFSCLTPL      F P
Subjt:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP

KAE8649102.1 hypothetical protein Csa_015255 [Cucumis sativus]0.0e+0099.64Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
        FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
        NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKL LVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKD
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK+
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKD

TYK17003.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0084.22Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNT+GSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK+            G  N K          +Q      +   L L    
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR

Query:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP
              ++N   +G ++ KK      +   ++K DLSRVKHQ NTNHIK WS A       +KDCRI KLDSS CFTMKSFFSCLTPL      F P
Subjt:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP

XP_004152299.1 pentatricopeptide repeat-containing protein At4g20090 isoform X1 [Cucumis sativus]0.0e+0099.82Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
        FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
        NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKL LVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK

XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]2.7e-30996.56Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNT+GSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK

TrEMBL top hitse value%identityAlignment
A0A0A0KU61 Uncharacterized protein0.0e+0099.69Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
        FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
        NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKL LVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR

Query:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSAS
        IISNSPQDNTMAIGNKEGKKA NSSLKGKIHSKADLSRVKHQINTNHIKDWSSAS
Subjt:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSAS

A0A1S3BX34 putative pentatricopeptide repeat-containing protein At5g59900 isoform X22.8e-27888.95Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNT+GSSD                                   
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
                  LLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK

A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.3e-30996.56Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNT+GSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKK

A0A5A7TN12 Pentatricopeptide repeat-containing protein0.0e+0084.36Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK+            G  N K          +Q      +   L L    
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR

Query:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP
              ++N   +G ++ KK      +   ++K DLSRVKHQ NTNHIK WS A       +KDCRI KLDSS CFTMKSFFSCLTPL      F P
Subjt:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP

A0A5D3CYL1 Pentatricopeptide repeat-containing protein0.0e+0084.22Show/hide
Query:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD
        MLVTIRSSSALLKLLSLHFHG SSHFFSTSKTT HIAIAPRAL RRPTSRTAPTPRSPNT+GSSDVVNSVCSLLSNKNPQTPNLD++HLLKRFKDNLSSD
Subjt:  MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSD

Query:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
         VLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  FVLQILMNYKLLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTR+A NILIGNLCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRV

Query:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR
         STYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCR+GQMQEAI+VLKVVE DKLRCAEECYSVVMKALCEHRH+DEASDLFGR
Subjt:  NSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGR

Query:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL
        MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENR+WSAAYGLLKEMLSLGMSPHFHVYS+VDKLMREHGQ+DLCL
Subjt:  MLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR
        KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGL PPIYVRDAFESAFQKK+            G  N K          +Q      +   L L    
Subjt:  KLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQKKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMR

Query:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP
              ++N   +G ++ KK      +   ++K DLSRVKHQ NTNHIK WS A       +KDCRI KLDSS CFTMKSFFSCLTPL      F P
Subjt:  IISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAIEKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTP

SwissProt top hitse value%identityAlignment
Q9FIX3 Pentatricopeptide repeat-containing protein At5g397109.4e-2927.63Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIEL----PDKYSYSNVIIGLCKFGR-----------
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID   K+ R + L    P+  SY+ VI GLC+ GR           
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIEL----PDKYSYSNVIIGLCKFGR-----------

Query:  -----------YSTAIEAF-------------GEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVE---KVRVNS---TYRPFTVLVPNVNPKSGAIEPAV
                   Y+T I+ +              EM R GL P+      LI ++C       A+E   ++RV       R +T LV   + K G +  A 
Subjt:  -----------YSTAIEAF-------------GEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVE---KVRVNS---TYRPFTVLVPNVNPKSGAIEPAV

Query:  GIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGN
         +    N  G  PS      LI+  C  G+M++AI VL+ ++   L      YS V+   C    VDEA  +   M+ +G+KP    Y+ +I   C+   
Subjt:  GIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGN

Query:  LDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVD-----------------KLMRE---------HGQIDLC
           A  ++  M +    PD  TY+ALI+AY    D   A  L  EM+  G+ P    YS++                  KL  E         H  I+ C
Subjt:  LDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVD-----------------KLMRE---------HGQIDLC

Query:  LKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSP
          +E K    +++  C +G +  A +  +SML K   P
Subjt:  LKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSP

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011101.3e-3025.49Show/hide
Query:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEM
        I+  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ VI GLCK G+Y  A E F EM
Subjt:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEM

Query:  YRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVL--------VPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQE
         R+GL P  T    L+   C    K   VE  +V S  R   V+        + ++  +SG ++ A+  F +  + GL+P + +   LI   CR G +  
Subjt:  YRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVL--------VPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQE

Query:  AIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGEN
        A+ +   +           Y+ ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F  M +KR   D VTY+ L+  +G+ 
Subjt:  AIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGEN

Query:  RDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAF----QKKDS
         D   A  +  +M+S  + P    YSI                        ++  LC +G L  A+     M+ K + P + + ++    +       D 
Subjt:  RDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAF----QKKDS

Query:  NSF-STQIDPGF
         SF    I  GF
Subjt:  NSF-STQIDPGF

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.0e-3025.66Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y++VI GLCK G    A+E   +M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNI

Query:  LIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIE---------PAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKL
        LI  LC  +  E A E  RV ++      ++P+V   +  I+          A+ +F      G  P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIE---------PAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKL

Query:  RCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEM
          +   Y+ ++   C+     EA ++F  M   G+      YN +I  LCK   ++ A ++   M  +   PD  TY++L+  +    D   A  +++ M
Subjt:  RCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEM

Query:  LSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGLSPP
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+  +PP
Subjt:  LSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGLSPP

Q9LSL9 Pentatricopeptide repeat-containing protein At5g655602.5e-2924.16Show/hide
Query:  LGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELP----D
        L R  L D+MK + + +L  K   +  T++  +    + G V EA     ++  + G  PD   + +++   C+++     +D+A K+F  + L     +
Subjt:  LGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELP----D

Query:  KYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGA------VEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANK
        + +Y+++I GLC   R   A++ F +M      PT     +LI +LC    K  A      +E+  +      +TVL+ ++  +    E A  +     +
Subjt:  KYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGA------VEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANK

Query:  LGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVF
         GL+P+      LI+  C+ G +++A+ V++++E  KL      Y+ ++K  C+  +V +A  +  +ML + + P +  YN +I   C+ GN DSA R+ 
Subjt:  LGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVF

Query:  GIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQID------------LCLKLEMKWEAQILQKLCKQGQLE
         +MN +   PD  TY+++I +  +++    A  L   +   G++P+  +Y+ +     + G++D             CL   + + A ++  LC  G+L+
Subjt:  GIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQID------------LCLKLEMKWEAQILQKLCKQGQLE

Query:  AAYEKMKSMLEKGLSPPI
         A    + M++ GL P +
Subjt:  AAYEKMKSMLEKGLSPPI

Q9ZUE9 Pentatricopeptide repeat-containing protein At2g060008.0e-2825.92Show/hide
Query:  RTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIE-----LPDKYSYSNVIIGLCKFGRYSTAIEAFG
        +TF+I IR L   G+  +AL L   M   FGC+PD + +N ++   CK       ++ A ++F+ ++      PD  +Y+++I G CK G+   A     
Subjt:  RTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIE-----LPDKYSYSNVIIGLCKFGRYSTAIEAFG

Query:  EMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLK
        +M R G+ PT    N+L+                                  K+G +  A  I       G  P       LI   CR+GQ+ +  R+ +
Subjt:  EMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLK

Query:  VVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAA
         +    +      YS+++ ALC    + +A +L G++ S+ + P+  +YN VI   CK G ++ A  +   M KK+C PD +T++ LI  +        A
Subjt:  VVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAA

Query:  YGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQ
          +  +M+++G SP        DK+      +   LK  M  EA  L ++ ++GQ
Subjt:  YGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQ

Arabidopsis top hitse value%identityAlignment
AT2G06000.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2925.92Show/hide
Query:  RTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIE-----LPDKYSYSNVIIGLCKFGRYSTAIEAFG
        +TF+I IR L   G+  +AL L   M   FGC+PD + +N ++   CK       ++ A ++F+ ++      PD  +Y+++I G CK G+   A     
Subjt:  RTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIE-----LPDKYSYSNVIIGLCKFGRYSTAIEAFG

Query:  EMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLK
        +M R G+ PT    N+L+                                  K+G +  A  I       G  P       LI   CR+GQ+ +  R+ +
Subjt:  EMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLK

Query:  VVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAA
         +    +      YS+++ ALC    + +A +L G++ S+ + P+  +YN VI   CK G ++ A  +   M KK+C PD +T++ LI  +        A
Subjt:  VVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAA

Query:  YGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQ
          +  +M+++G SP        DK+      +   LK  M  EA  L ++ ++GQ
Subjt:  YGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQ

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein7.2e-3225.66Show/hide
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y++VI GLCK G    A+E   +M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNI

Query:  LIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIE---------PAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKL
        LI  LC  +  E A E  RV ++      ++P+V   +  I+          A+ +F      G  P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIE---------PAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKL

Query:  RCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEM
          +   Y+ ++   C+     EA ++F  M   G+      YN +I  LCK   ++ A ++   M  +   PD  TY++L+  +    D   A  +++ M
Subjt:  RCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEM

Query:  LSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGLSPP
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+  +PP
Subjt:  LSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAYEKMKSMLEKGLSPP

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.4e-3225.49Show/hide
Query:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEM
        I+  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ VI GLCK G+Y  A E F EM
Subjt:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEM

Query:  YRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVL--------VPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQE
         R+GL P  T    L+   C    K   VE  +V S  R   V+        + ++  +SG ++ A+  F +  + GL+P + +   LI   CR G +  
Subjt:  YRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVL--------VPNVNPKSGAIEPAVGIFWAANKLGLVPSSFVTVQLISELCRLGQMQE

Query:  AIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGEN
        A+ +   +           Y+ ++  LC+ + + EA  LF  M  + + P       +I   CKLGNL +A  +F  M +KR   D VTY+ L+  +G+ 
Subjt:  AIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKRCAPDHVTYSALIHAYGEN

Query:  RDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAF----QKKDS
         D   A  +  +M+S  + P    YSI                        ++  LC +G L  A+     M+ K + P + + ++    +       D 
Subjt:  RDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAF----QKKDS

Query:  NSF-STQIDPGF
         SF    I  GF
Subjt:  NSF-STQIDPGF

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-3027.63Show/hide
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIEL----PDKYSYSNVIIGLCKFGR-----------
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK       ID   K+ R + L    P+  SY+ VI GLC+ GR           
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIEL----PDKYSYSNVIIGLCKFGR-----------

Query:  -----------YSTAIEAF-------------GEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVE---KVRVNS---TYRPFTVLVPNVNPKSGAIEPAV
                   Y+T I+ +              EM R GL P+      LI ++C       A+E   ++RV       R +T LV   + K G +  A 
Subjt:  -----------YSTAIEAF-------------GEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVE---KVRVNS---TYRPFTVLVPNVNPKSGAIEPAV

Query:  GIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGN
         +    N  G  PS      LI+  C  G+M++AI VL+ ++   L      YS V+   C    VDEA  +   M+ +G+KP    Y+ +I   C+   
Subjt:  GIFWAANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGN

Query:  LDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVD-----------------KLMRE---------HGQIDLC
           A  ++  M +    PD  TY+ALI+AY    D   A  L  EM+  G+ P    YS++                  KL  E         H  I+ C
Subjt:  LDSAERVFGIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVD-----------------KLMRE---------HGQIDLC

Query:  LKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSP
          +E K    +++  C +G +  A +  +SML K   P
Subjt:  LKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSP

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-3024.16Show/hide
Query:  LGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELP----D
        L R  L D+MK + + +L  K   +  T++  +    + G V EA     ++  + G  PD   + +++   C+++     +D+A K+F  + L     +
Subjt:  LGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALKIFRRIELP----D

Query:  KYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGA------VEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANK
        + +Y+++I GLC   R   A++ F +M      PT     +LI +LC    K  A      +E+  +      +TVL+ ++  +    E A  +     +
Subjt:  KYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGA------VEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWAANK

Query:  LGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVF
         GL+P+      LI+  C+ G +++A+ V++++E  KL      Y+ ++K  C+  +V +A  +  +ML + + P +  YN +I   C+ GN DSA R+ 
Subjt:  LGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVF

Query:  GIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQID------------LCLKLEMKWEAQILQKLCKQGQLE
         +MN +   PD  TY+++I +  +++    A  L   +   G++P+  +Y+ +     + G++D             CL   + + A ++  LC  G+L+
Subjt:  GIMNKKRCAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQID------------LCLKLEMKWEAQILQKLCKQGQLE

Query:  AAYEKMKSMLEKGLSPPI
         A    + M++ GL P +
Subjt:  AAYEKMKSMLEKGLSPPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTAACGATTAGGAGCTCTTCGGCGTTGCTCAAACTCCTTTCTCTTCATTTCCATGGCTCCTCCTCACACTTTTTCAGCACTTCAAAGACCACTAACCACATTGC
CATAGCTCCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAACTCCTCGGTCTCCGAACACCCTCGGCTCTTCCGATGTCGTCAACTCAGTATGTTCTTTAC
TTTCAAACAAAAATCCCCAAACACCTAATCTCGATCTTGATCATTTATTGAAAAGGTTCAAAGACAACTTAAGTTCGGATTTCGTGCTTCAAATTCTGATGAATTATAAG
CTGTTAGGTCGGGCTAAAACGCTAGAATTCTTCTCTTGGTCCGGATTGCAAATGGGGTTTCGGTTTGATGCGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAGGAG
GAAATTGTTTGATGATATGAAGTGTCTTTTAGTGACTGTGTTGTCTCATAAGGGTCGGATTTCTTGTCGAACATTTTCAATTTGTATTAGATTTTTGGGTAGGCAGGGGA
GGGTTAGAGAAGCACTTTGCTTGTTTGAAGAAATGGAACCAAAATTTGGGTGTAAACCTGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCA
ACTGGGGAATTGATTGATACTGCTCTAAAGATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATGTTATAATTGGATTGTGTAAATTTGGTAGGTATAG
TACAGCTATTGAAGCGTTTGGTGAAATGTATAGGGCAGGTTTGGTACCTACTCGAACTGCTGTGAACATTCTCATTGGGAATTTGTGTTCTTTGAGTGCTAAAGAAGGGG
CTGTAGAAAAAGTTAGGGTCAATAGTACTTATAGACCTTTTACCGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCTGCAGTTGGGATTTTTTGGGCA
GCTAATAAGCTGGGTTTAGTTCCCAGTTCTTTTGTAACAGTTCAGCTCATCTCAGAGCTTTGTCGGTTAGGTCAAATGCAAGAAGCAATTAGAGTATTGAAGGTTGTTGA
GGGTGACAAGCTTAGATGTGCTGAAGAGTGTTATTCTGTTGTGATGAAAGCATTATGTGAACATCGTCACGTAGACGAAGCTAGTGATCTGTTTGGGAGGATGCTTTCTC
AGGGCATGAAGCCAAAATTGGCTATTTACAATTATGTTATTTGCATGTTATGCAAATTAGGAAATTTGGATAGTGCTGAAAGGGTGTTTGGGATTATGAACAAGAAAAGA
TGTGCACCTGATCATGTTACTTATTCGGCGTTAATCCATGCCTACGGTGAAAATAGGGATTGGTCAGCTGCCTACGGTTTATTGAAAGAAATGTTGAGTTTAGGCATGTC
TCCTCATTTTCATGTGTATAGTATAGTGGATAAACTAATGAGAGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAATGGGAAGCCCAAATTTTGCAGAAGC
TTTGTAAACAAGGACAACTGGAGGCTGCGTATGAAAAGATGAAGTCAATGCTTGAAAAGGGTTTGTCTCCTCCTATATATGTTAGAGATGCGTTTGAGAGTGCATTTCAA
AAGAAGGACTCTAATTCTTTCTCAACGCAGATTGATCCTGGATTCGGATTGAGCAATTCAAAATGTGCCCTTCCAGTGAACGTGTTCCCAATCAACCAAGATCTGCATTC
AAGTTTAGAGAGTTTACCACTTGAACTAGCTTTGATGAGGATCATATCGAATTCACCCCAGGACAACACAATGGCAATTGGCAACAAAGAGGGCAAGAAGGCAAATAATT
CAAGCTTGAAAGGAAAAATTCATTCAAAAGCTGACCTTTCACGTGTCAAGCATCAGATCAACACTAATCACATCAAAGATTGGTCCTCAGCTAGCGAAAATGTTGCCATA
GAAAAGAAAGATTGCAGAATCTGGAAGCTTGATAGTAGTGGATGCTTTACTATGAAATCTTTCTTCTCTTGTCTAACTCCACTTCTGCACCGGCACGGTGCTTTTACCCC
TATCTGTGCAGTCAACTATCATAGAGGCGATTCAGATCGTCCTAGCTTTCCTTTGTAA
mRNA sequenceShow/hide mRNA sequence
GAAATATGGAAATTTAGAAAGCTTGGGCCTTTGGGCGAAGAAGAGCACAGCGAGAACTTTCAAGGTCGAAACTATCGGCTCTGCTTCGGCACTACAAAATCGTCGATGTT
GGTAACGATTAGGAGCTCTTCGGCGTTGCTCAAACTCCTTTCTCTTCATTTCCATGGCTCCTCCTCACACTTTTTCAGCACTTCAAAGACCACTAACCACATTGCCATAG
CTCCAAGAGCTCTTGCAAGAAGACCCACTTCGCGAACTGCCCCAACTCCTCGGTCTCCGAACACCCTCGGCTCTTCCGATGTCGTCAACTCAGTATGTTCTTTACTTTCA
AACAAAAATCCCCAAACACCTAATCTCGATCTTGATCATTTATTGAAAAGGTTCAAAGACAACTTAAGTTCGGATTTCGTGCTTCAAATTCTGATGAATTATAAGCTGTT
AGGTCGGGCTAAAACGCTAGAATTCTTCTCTTGGTCCGGATTGCAAATGGGGTTTCGGTTTGATGCGTCCGTGGTTGAGTATATGGCTGATTTCTTAGGTAGGAGGAAAT
TGTTTGATGATATGAAGTGTCTTTTAGTGACTGTGTTGTCTCATAAGGGTCGGATTTCTTGTCGAACATTTTCAATTTGTATTAGATTTTTGGGTAGGCAGGGGAGGGTT
AGAGAAGCACTTTGCTTGTTTGAAGAAATGGAACCAAAATTTGGGTGTAAACCTGATAATCTGGTCTTTAACAACATGCTTTATGCACTTTGTAAGAAGGAACCAACTGG
GGAATTGATTGATACTGCTCTAAAGATTTTCAGAAGAATTGAATTGCCTGATAAATATTCATACAGTAATGTTATAATTGGATTGTGTAAATTTGGTAGGTATAGTACAG
CTATTGAAGCGTTTGGTGAAATGTATAGGGCAGGTTTGGTACCTACTCGAACTGCTGTGAACATTCTCATTGGGAATTTGTGTTCTTTGAGTGCTAAAGAAGGGGCTGTA
GAAAAAGTTAGGGTCAATAGTACTTATAGACCTTTTACCGTTCTAGTTCCAAATGTGAATCCGAAGAGCGGTGCCATTGAACCTGCAGTTGGGATTTTTTGGGCAGCTAA
TAAGCTGGGTTTAGTTCCCAGTTCTTTTGTAACAGTTCAGCTCATCTCAGAGCTTTGTCGGTTAGGTCAAATGCAAGAAGCAATTAGAGTATTGAAGGTTGTTGAGGGTG
ACAAGCTTAGATGTGCTGAAGAGTGTTATTCTGTTGTGATGAAAGCATTATGTGAACATCGTCACGTAGACGAAGCTAGTGATCTGTTTGGGAGGATGCTTTCTCAGGGC
ATGAAGCCAAAATTGGCTATTTACAATTATGTTATTTGCATGTTATGCAAATTAGGAAATTTGGATAGTGCTGAAAGGGTGTTTGGGATTATGAACAAGAAAAGATGTGC
ACCTGATCATGTTACTTATTCGGCGTTAATCCATGCCTACGGTGAAAATAGGGATTGGTCAGCTGCCTACGGTTTATTGAAAGAAATGTTGAGTTTAGGCATGTCTCCTC
ATTTTCATGTGTATAGTATAGTGGATAAACTAATGAGAGAACATGGGCAAATTGATCTGTGCTTGAAGCTGGAAATGAAATGGGAAGCCCAAATTTTGCAGAAGCTTTGT
AAACAAGGACAACTGGAGGCTGCGTATGAAAAGATGAAGTCAATGCTTGAAAAGGGTTTGTCTCCTCCTATATATGTTAGAGATGCGTTTGAGAGTGCATTTCAAAAGAA
GGACTCTAATTCTTTCTCAACGCAGATTGATCCTGGATTCGGATTGAGCAATTCAAAATGTGCCCTTCCAGTGAACGTGTTCCCAATCAACCAAGATCTGCATTCAAGTT
TAGAGAGTTTACCACTTGAACTAGCTTTGATGAGGATCATATCGAATTCACCCCAGGACAACACAATGGCAATTGGCAACAAAGAGGGCAAGAAGGCAAATAATTCAAGC
TTGAAAGGAAAAATTCATTCAAAAGCTGACCTTTCACGTGTCAAGCATCAGATCAACACTAATCACATCAAAGATTGGTCCTCAGCTAGCGAAAATGTTGCCATAGAAAA
GAAAGATTGCAGAATCTGGAAGCTTGATAGTAGTGGATGCTTTACTATGAAATCTTTCTTCTCTTGTCTAACTCCACTTCTGCACCGGCACGGTGCTTTTACCCCTATCT
GTGCAGTCAACTATCATAGAGGCGATTCAGATCGTCCTAGCTTTCCTTTGTAAATATCTTTTGGAGTCTTTATGATCCCGAAGAAGTGCCCTCTTCAGTTGGTCTGTATC
TCTTGTTGGCCTCGTTATTAACAGTAAAATTCAAAGAAGAAATCCTTCTTATGCTCTCAATTGGTGTTGGCACTCATTTTGTTGGACTTGTATTATGAACCTTGAGATTA
GAACATTTGATTCTCGCACCCATGCATTATACAATATCAAATTCAGATATAGTTAGACATCTTCATTGTGCATTAATTTTATACATTTTGCTCTCTTGTTGTTT
Protein sequenceShow/hide protein sequence
MLVTIRSSSALLKLLSLHFHGSSSHFFSTSKTTNHIAIAPRALARRPTSRTAPTPRSPNTLGSSDVVNSVCSLLSNKNPQTPNLDLDHLLKRFKDNLSSDFVLQILMNYK
LLGRAKTLEFFSWSGLQMGFRFDASVVEYMADFLGRRKLFDDMKCLLVTVLSHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEP
TGELIDTALKIFRRIELPDKYSYSNVIIGLCKFGRYSTAIEAFGEMYRAGLVPTRTAVNILIGNLCSLSAKEGAVEKVRVNSTYRPFTVLVPNVNPKSGAIEPAVGIFWA
ANKLGLVPSSFVTVQLISELCRLGQMQEAIRVLKVVEGDKLRCAEECYSVVMKALCEHRHVDEASDLFGRMLSQGMKPKLAIYNYVICMLCKLGNLDSAERVFGIMNKKR
CAPDHVTYSALIHAYGENRDWSAAYGLLKEMLSLGMSPHFHVYSIVDKLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAYEKMKSMLEKGLSPPIYVRDAFESAFQ
KKDSNSFSTQIDPGFGLSNSKCALPVNVFPINQDLHSSLESLPLELALMRIISNSPQDNTMAIGNKEGKKANNSSLKGKIHSKADLSRVKHQINTNHIKDWSSASENVAI
EKKDCRIWKLDSSGCFTMKSFFSCLTPLLHRHGAFTPICAVNYHRGDSDRPSFPL