; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015663 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015663
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr02:28710114..28712462
RNA-Seq ExpressionHG10015663
SyntenyHG10015663
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604009.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0082.99Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ ++P SSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYL+NNLMTFYAKTGS+  AHHVFDEMP+KSTFSWNTLISAYAKQGNF+ SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQLGLFDNAI MFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS+ASVATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETS AVGNALISMYAKSGGVEIAR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWN+AL+LFR M
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPEPNSYTLA+MLSV+SSLA+LEHGKQIHA AIKAG SSTASVTNALIAMYAK+GSINIAKRVFDLTSGKKETVSWTSMIMALAQHG GEEAI+LFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV MKPDHITYVGVLSACTHVGL+EQGR YYN+MTEVH+IEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMKDRGV+KEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLMDEIWE+IKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQIL++HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDG+CSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

XP_008446823.1 PREDICTED: pentatricopeptide repeat-containing protein At2g22070 [Cucumis melo]0.0e+0083.25Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ NSP S +FFAHILQTSVRI+DPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLIS YAKQGNF+VSRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQ GLFDNAIWMFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS A VATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMT+KNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TE ETSGAVGNALISMYAKSGGVEIAR I+EHNRTSNLNIIAFTSLLDGYTKLG+VKPAREIFNKLRD DVIAWTAMIVGYVQNGLWNDALDLFRLM
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPEPNSYTLAAMLSV+SSLATLEHGKQIHASAIKAGESST SVTNALI MYAK+G+I++AKRVFDLTSGKKE VSWTSMIMALAQHGLG+EAINLFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RM S+ MKPDHITYVGVLSACTHVG VEQGRKYY MMTEVHEIEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASC+VHKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAY ALANV+SACGKWE+AAKTRKLMKDRGVRKEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLM EIWEEIKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQILKHHSEKLAIAFGLL+TPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

XP_022950762.1 pentatricopeptide repeat-containing protein At2g22070 isoform X1 [Cucurbita moschata]0.0e+0083.12Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ ++P SSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYL+NNLMTFYAKTGS+  AHHVFDEMP+KSTFSWNTLISAYAKQGNFD SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQLGLFDNAI MFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS+ASVATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETS AVGNALISMYAKSGGVEIAR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWN+AL+LFR M
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPE NSYTLA+MLSV+SSLA+LEHGKQIHA AIKAGESSTASVTNALIAMYAK+GSINIAKRVFDLTSGKKETVSWTSMIMALAQHG GEEAI+LFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV MKPDHITYVGVLSACTHVGL+EQGR YYN+MTEVH+IEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMKDRGV+KEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLMDEIWE+IKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQIL++HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDG+CSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

XP_023543916.1 pentatricopeptide repeat-containing protein At2g22070 [Cucurbita pepo subsp. pepo]0.0e+0083.12Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ ++P SSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYL+NNLMTFYAKTGS+  AHHVFDEMP+KSTFSWNTLISAYAKQGNFD SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQLGLFDNAI MFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS+ASVATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETS AVGNALISMYAKSGGVEIAR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWN+AL+LFR M
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPEPNSYTLA+MLSV+SSLA+LEHGKQIHA AIKAGESSTASVTNALIAMYAK+GSINIAKRVFDLTSGKKETVSWTSMIMALAQHG GEEAI+LFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV MKPDHITYVGVLSACTHVGL+EQGR YYN+MTEVH+IEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMKDRGV+K+KGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLMDEIWE+IKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQIL++HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDG+CSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

XP_038892799.1 pentatricopeptide repeat-containing protein At2g22070 [Benincasa hispida]0.0e+0084.14Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ NSPASS+FFAHILQTSVRIRDP AGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAH VFDEMPLKSTFSWNTLISAYAKQGNFD SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPD DPVSWTAIIVGYNQLGLFDNAIWMFA MISERVPPSQFTVSNV+SSCAANQALDVG+KIHSFVVKLGLGS+ASVATS+LNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETSGAVGNALISMYAKSGGVE+AR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKL+D DVIAWTAMIVGYVQNGLWNDAL+LFRLM
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GP+PNSYTLAAMLSV+SSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSI+IAKR FDLTS KKETVSWTSMIMALAQHGLGEEAINLFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV +KPDHITYVGVLSACTHVGLVEQGR YYNMMTEVHEIEPTLSHYACM+DLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCKVHKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMK+RGVRKEKG SWIH+KNKVHAFGVEDVIHPQKDEIYKLM ++WEEIKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQILKHHSEKLA+AFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KWQ7 DYW_deaminase domain-containing protein0.0e+0088.09Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ NSP SS+FFAHILQTSVRI+DPFAGRSVH QIIKKGLHLGVYLMNNLMTFYAKTGSL FAHHVFDEMPLKSTFSWNTLIS YAKQGNF+VSRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQ GLFDNAIWMFA MISERVPPSQFTVSNV+SSCAANQ LD+GRKIHSFVVKLGLGS   VATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTW--------------------------NALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVK
        DRMT+ N  +                           +A I   ETETSGAVGNALISMYAKSGGVEIAR I+EHNRTSNLNIIAFTSLLDGYTKLG+VK
Subjt:  DRMTLKNISTW--------------------------NALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVK

Query:  PAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSI
        PAREIFNKLRDRDV+AWTAMIVGYVQNGLWNDAL+LFRLMVN GPEPNSYTLAAMLSV+SSL  LEHGKQIHASAIKAGESST SVTNALIAMYAK+G+I
Subjt:  PAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSI

Query:  NIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRA
        N+AKRVFDL +GKKE VSWTSMIMALAQHGLG+EAINLFERMLSV MKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRA
Subjt:  NIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRA

Query:  GLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVH
        GLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKVAAERLL IDPGNSGAY ALANV+SACGKWENAA+TRKLMKDRGVRKEKG SWIH+KN+VH
Subjt:  GLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVH

Query:  AFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREI
        AFGVEDVIHPQKDEIYKLM EIWEEIKKMGFIPDTESVLHDLEEEVKEQILK+HSEKLAIAFGLL+TPENT LRIMKNLRVCNDCHSAIKFISKLVGREI
Subjt:  AFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREI

Query:  IVRDATRFHHFKDGSCSCRDYW
        IVRDATRFHHFKDGSCSCRDYW
Subjt:  IVRDATRFHHFKDGSCSCRDYW

A0A1S4DWB2 pentatricopeptide repeat-containing protein At2g220700.0e+0083.25Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ NSP S +FFAHILQTSVRI+DPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLIS YAKQGNF+VSRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQ GLFDNAIWMFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS A VATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMT+KNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TE ETSGAVGNALISMYAKSGGVEIAR I+EHNRTSNLNIIAFTSLLDGYTKLG+VKPAREIFNKLRD DVIAWTAMIVGYVQNGLWNDALDLFRLM
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPEPNSYTLAAMLSV+SSLATLEHGKQIHASAIKAGESST SVTNALI MYAK+G+I++AKRVFDLTSGKKE VSWTSMIMALAQHGLG+EAINLFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RM S+ MKPDHITYVGVLSACTHVG VEQGRKYY MMTEVHEIEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASC+VHKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAY ALANV+SACGKWE+AAKTRKLMKDRGVRKEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLM EIWEEIKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQILKHHSEKLAIAFGLL+TPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

A0A5D3CDH3 Pentatricopeptide repeat-containing protein0.0e+0083.25Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ NSP S +FFAHILQTSVRI+DPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLIS YAKQGNF+VSRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQ GLFDNAIWMFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS A VATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMT+KNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TE ETSGAVGNALISMYAKSGGVEIAR I+EHNRTSNLNIIAFTSLLDGYTKLG+VKPAREIFNKLRD DVIAWTAMIVGYVQNGLWNDALDLFRLM
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPEPNSYTLAAMLSV+SSLATLEHGKQIHASAIKAGESST SVTNALI MYAK+G+I++AKRVFDLTSGKKE VSWTSMIMALAQHGLG+EAINLFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RM S+ MKPDHITYVGVLSACTHVG VEQGRKYY MMTEVHEIEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASC+VHKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAY ALANV+SACGKWE+AAKTRKLMKDRGVRKEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLM EIWEEIKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQILKHHSEKLAIAFGLL+TPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

A0A6J1GFR3 pentatricopeptide repeat-containing protein At2g22070 isoform X10.0e+0083.12Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ ++P SSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYL+NNLMTFYAKTGS+  AHHVFDEMP+KSTFSWNTLISAYAKQGNFD SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQLGLFDNAI MFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS+ASVATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETS AVGNALISMYAKSGGVEIAR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWN+AL+LFR M
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        VN GPE NSYTLA+MLSV+SSLA+LEHGKQIHA AIKAGESSTASVTNALIAMYAK+GSINIAKRVFDLTSGKKETVSWTSMIMALAQHG GEEAI+LFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV MKPDHITYVGVLSACTHVGL+EQGR YYN+MTEVH+IEPTLSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMKDRGV+KEKGFSWIH+KNKVHAFGVEDVIHPQKDEIYKLMDEIWE+IKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQIL++HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDG+CSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

A0A6J1IT29 pentatricopeptide repeat-containing protein At2g22070 isoform X10.0e+0082.1Show/hide
Query:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY
        ME+ ++P SSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYL+NNLMTFY KTGS+  AH VFDEMP+KSTFSWNTLISAYAKQGNF+ SRRLLY
Subjt:  MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLY

Query:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF
        EMPDCDPVSWTAIIVGYNQ GLFDNAI MFA MISERVPPSQFTVSNV+SSCAANQALD+GRKIHSFVVKLGLGS+ASVATSLLNMYAKCGDPV AKVVF
Subjt:  EMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVF

Query:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------
        DRMTLKNISTWNALISL                                                                                   
Subjt:  DRMTLKNISTWNALISL-----------------------------------------------------------------------------------

Query:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM
           TETETS AVGNALISMYAKSGGVEIAR IIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWN+AL+LFR M
Subjt:  ---TETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLM

Query:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE
        +N GPEPNSYTLA+MLSV+SSLA+LEHGKQIHA AIKAGESSTASVTNALIAMYAK+GSINIAKRVFDLTSGKKETVSWTSMIMALAQHG GEEAI+LFE
Subjt:  VNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFE

Query:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV
        RMLSV MKPDHITYVGVLSACTHVGL+EQGR YYN+MTEVH+I P+LSHYACMIDLYGRAGLLQEAY FIESMPIEPDNIAWGSLLASCK+HKNADLAKV
Subjt:  RMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKV

Query:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH
        AAERLL IDPGNSGAYSALANV+SACGKWENAAKTRKLMKDRGV+K+KGFSWIH+KNKVHAFGVEDVIHPQK EIYKLMDEIWE+IKKMGFIPDTESVLH
Subjt:  AAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLH

Query:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        DLEEEVKEQIL++HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDG+CSCRDYW
Subjt:  DLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

SwissProt top hitse value%identityAlignment
Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g689301.9e-13535.74Show/hide
Query:  RSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWM
        + +H  II+   +   +L NN++  YA   S ++A  VFD +P  + FSWN L+ AY+K G          ++PD D V+W  +I GY+  GL   A+  
Subjt:  RSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWM

Query:  FANMISE-RVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------------
        +  M+ +     ++ T+  ++   ++N  + +G++IH  V+KLG  SY  V + LL MYA  G    AK VF  +  +N   +N+L+             
Subjt:  FANMISE-RVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------------

Query:  --LTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNL-------------------------------------NIIAFTSLLDGYTKLGDVKPAR
              E       A+I   A++G  + A +     +   L                                     +I   ++L+D Y K   +  A+
Subjt:  --LTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNL-------------------------------------NIIAFTSLLDGYTKLGDVKPAR

Query:  EIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIA
         +F++++ ++V++WTAM+VGY Q G   +A+ +F  M   G +P+ YTL   +S  +++++LE G Q H  AI +G     +V+N+L+ +Y K G I+ +
Subjt:  EIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIA

Query:  KRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLL
         R+F+     ++ VSWT+M+ A AQ G   E I LF++M+   +KPD +T  GV+SAC+  GLVE+G++Y+ +MT  + I P++ HY+CMIDL+ R+G L
Subjt:  KRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLL

Query:  QEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFG
        +EA  FI  MP  PD I W +LL++C+   N ++ K AAE L+ +DP +   Y+ L++++++ GKW++ A+ R+ M+++ V+KE G SWI  K K+H+F 
Subjt:  QEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFG

Query:  VEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVR
         +D   P  D+IY  ++E+  +I   G+ PDT  V HD+EE VK ++L +HSE+LAIAFGL+  P    +R+ KNLRVC DCH+A K IS + GREI+VR
Subjt:  VEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVR

Query:  DATRFHHFKDGSCSCRDYW
        DA RFH FKDG+CSC D+W
Subjt:  DATRFHHFKDGSCSCRDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.3e-13536.64Show/hide
Query:  LPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEM
        LPNS      F  +L++  + +    G+ +H  ++K G  L +Y+  +L++ Y + G L  AH VFD+ P +   S+  LI  YA +G  + +++L  E+
Subjt:  LPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEM

Query:  PDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDR
        P  D VSW A+I GY + G +  A+ +F +M+   V P + T+  V+S+CA + ++++GR++H ++   G GS   +  +L+++Y+KCG+  TA  +F+R
Subjt:  PDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDR

Query:  MTLKNISTWNALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNG
        +  K                                                                                DVI+W  +I GY    
Subjt:  MTLKNISTWNALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNG

Query:  LWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIK--AGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMAL
        L+ +AL LF+ M+  G  PN  T+ ++L   + L  ++ G+ IH    K   G ++ +S+  +LI MYAK G I  A +VF+ +   K   SW +MI   
Subjt:  LWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIK--AGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMAL

Query:  AQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLL
        A HG  + + +LF RM  + ++PD IT+VG+LSAC+H G+++ GR  +  MT+ +++ P L HY CMIDL G +GL +EA   I  M +EPD + W SLL
Subjt:  AQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLL

Query:  ASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEI
         +CK+H N +L +  AE L+ I+P N G+Y  L+N++++ G+W   AKTR L+ D+G++K  G S I + + VH F + D  HP+  EIY +++E+   +
Subjt:  ASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEI

Query:  KKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        +K GF+PDT  VL ++EEE KE  L+HHSEKLAIAFGL+ST   T L I+KNLRVC +CH A K ISK+  REII RD TRFHHF+DG CSC DYW
Subjt:  KKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202303.3e-13537.43Show/hide
Query:  GRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMP----DCDPVSWTAIIVGYNQLGLFD
        G+ +H      GL +  ++  ++   Y + G +  A  VFD M  K   + + L+ AYA++G  +   R+L EM     + + VSW  I+ G+N+ G   
Subjt:  GRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMP----DCDPVSWTAIIVGYNQLGLFD

Query:  NAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALISLTETETSG
         A+ MF  +      P Q TVS+V+ S   ++ L++GR IH +V+K GL     V +++++MY K G       +F++  +      NA I+        
Subjt:  NAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALISLTETETSG

Query:  AVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDR----DVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPE
                                                 G ++ G V  A E+F   +++    +V++WT++I G  QNG   +AL+LFR M   G +
Subjt:  AVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDR----DVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPE

Query:  PNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVR
        PN  T+ +ML    ++A L HG+  H  A++        V +ALI MYAK G IN+++ VF++    K  V W S++   + HG  +E +++FE ++  R
Subjt:  PNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVR

Query:  MKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLL
        +KPD I++  +LSAC  VGL ++G KY+ MM+E + I+P L HY+CM++L GRAG LQEAY  I+ MP EPD+  WG+LL SC++  N DLA++AAE+L 
Subjt:  MKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLL

Query:  FIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEV
         ++P N G Y  L+N+++A G W      R  M+  G++K  G SWI +KN+V+     D  HPQ D+I + MDEI +E++K G  P+ +  LHD+EE+ 
Subjt:  FIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEV

Query:  KEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        +EQ+L  HSEKLA+ FGLL+TP+ T L+++KNLR+C DCH+ IKFIS   GREI +RD  RFHHFKDG CSC D+W
Subjt:  KEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220701.2e-25457.61Show/hide
Query:  HILQTSV-RIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAI
        ++LQ SV +    F  + VH ++IK GL   VYLMNNLM  Y+KTG    A  +FDEMPL++ FSWNT++SAY+K+G+ D +     ++P  D VSWT +
Subjt:  HILQTSV-RIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAI

Query:  IVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNA
        IVGY  +G +  AI +  +M+ E + P+QFT++NV++S AA + ++ G+K+HSF+VKLGL    SV+ SLLNMYAKCGDP+ AK VFDRM +++IS+WNA
Subjt:  IVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNA

Query:  LISL--------------------------------------------------------------------------------------TETETSGAVG
        +I+L                                                                                      T  + SG V 
Subjt:  LISL--------------------------------------------------------------------------------------TETETSGAVG

Query:  NALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLA
        NALISMY++ GGVE AR++IE   T +L I  FT+LLDGY KLGD+  A+ IF  L+DRDV+AWTAMIVGY Q+G + +A++LFR MV GG  PNSYTLA
Subjt:  NALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLA

Query:  AMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHIT
        AMLSV SSLA+L HGKQIH SA+K+GE  + SV+NALI MYAK+G+I  A R FDL   +++TVSWTSMI+ALAQHG  EEA+ LFE ML   ++PDHIT
Subjt:  AMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHIT

Query:  YVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNS
        YVGV SACTH GLV QGR+Y++MM +V +I PTLSHYACM+DL+GRAGLLQEA  FIE MPIEPD + WGSLL++C+VHKN DL KVAAERLL ++P NS
Subjt:  YVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNS

Query:  GAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKH
        GAYSALAN++SACGKWE AAK RK MKD  V+KE+GFSWI +K+KVH FGVED  HP+K+EIY  M +IW+EIKKMG++PDT SVLHDLEEEVKEQIL+H
Subjt:  GAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKH

Query:  HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        HSEKLAIAFGL+STP+ T LRIMKNLRVCNDCH+AIKFISKLVGREIIVRD TRFHHFKDG CSCRDYW
Subjt:  HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

Q9SY02 Pentatricopeptide repeat-containing protein At4g027502.0e-13737.26Show/hide
Query:  NNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANM-----------ISER
        N +++ Y + G    A  +FDEMP +   SWN +I  Y +  N   +R L   MP+ D  SW  ++ GY Q G  D+A  +F  M           +S  
Subjt:  NNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANM-----------ISER

Query:  VPPSQFTVSNVISSCAANQAL---------DVGRK----IHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------LT
        V  S+   + ++     N AL          V +K       F   + +    S  T ++  YA+ G    A+ +FD   ++++ TW A++S      + 
Subjt:  VPPSQFTVSNVISSCAANQAL---------DVGRK----IHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------LT

Query:  E---------TETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALD
        E          E +    NA+++ Y +   +E+A+++ +     N++   + +++ GY + G +  A+ +F+K+  RD ++W AMI GY Q+G   +AL 
Subjt:  E---------TETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALD

Query:  LFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEA
        LF  M   G   N  + ++ LS  + +  LE GKQ+H   +K G  +   V NAL+ MY K GSI  A  +F   +G K+ VSW +MI   ++HG GE A
Subjt:  LFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEA

Query:  INLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNA
        +  FE M    +KPD  T V VLSAC+H GLV++GR+Y+  MT+ + + P   HYACM+DL GRAGLL++A++ +++MP EPD   WG+LL + +VH N 
Subjt:  INLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNA

Query:  DLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDT
        +LA+ AA+++  ++P NSG Y  L+N++++ G+W +  K R  M+D+GV+K  G+SWI ++NK H F V D  HP+KDEI+  ++E+   +KK G++  T
Subjt:  DLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDT

Query:  ESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
          VLHD+EEE KE+++++HSE+LA+A+G++       +R++KNLRVC DCH+AIK+++++ GR II+RD  RFHHFKDGSCSC DYW
Subjt:  ESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-13636.64Show/hide
Query:  LPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEM
        LPNS      F  +L++  + +    G+ +H  ++K G  L +Y+  +L++ Y + G L  AH VFD+ P +   S+  LI  YA +G  + +++L  E+
Subjt:  LPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEM

Query:  PDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDR
        P  D VSW A+I GY + G +  A+ +F +M+   V P + T+  V+S+CA + ++++GR++H ++   G GS   +  +L+++Y+KCG+  TA  +F+R
Subjt:  PDCDPVSWTAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDR

Query:  MTLKNISTWNALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNG
        +  K                                                                                DVI+W  +I GY    
Subjt:  MTLKNISTWNALISLTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNG

Query:  LWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIK--AGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMAL
        L+ +AL LF+ M+  G  PN  T+ ++L   + L  ++ G+ IH    K   G ++ +S+  +LI MYAK G I  A +VF+ +   K   SW +MI   
Subjt:  LWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIK--AGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMAL

Query:  AQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLL
        A HG  + + +LF RM  + ++PD IT+VG+LSAC+H G+++ GR  +  MT+ +++ P L HY CMIDL G +GL +EA   I  M +EPD + W SLL
Subjt:  AQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLL

Query:  ASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEI
         +CK+H N +L +  AE L+ I+P N G+Y  L+N++++ G+W   AKTR L+ D+G++K  G S I + + VH F + D  HP+  EIY +++E+   +
Subjt:  ASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEI

Query:  KKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        +K GF+PDT  VL ++EEE KE  L+HHSEKLAIAFGL+ST   T L I+KNLRVC +CH A K ISK+  REII RD TRFHHF+DG CSC DYW
Subjt:  KKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-13637.43Show/hide
Query:  GRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMP----DCDPVSWTAIIVGYNQLGLFD
        G+ +H      GL +  ++  ++   Y + G +  A  VFD M  K   + + L+ AYA++G  +   R+L EM     + + VSW  I+ G+N+ G   
Subjt:  GRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMP----DCDPVSWTAIIVGYNQLGLFD

Query:  NAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALISLTETETSG
         A+ MF  +      P Q TVS+V+ S   ++ L++GR IH +V+K GL     V +++++MY K G       +F++  +      NA I+        
Subjt:  NAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALISLTETETSG

Query:  AVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDR----DVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPE
                                                 G ++ G V  A E+F   +++    +V++WT++I G  QNG   +AL+LFR M   G +
Subjt:  AVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDR----DVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPE

Query:  PNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVR
        PN  T+ +ML    ++A L HG+  H  A++        V +ALI MYAK G IN+++ VF++    K  V W S++   + HG  +E +++FE ++  R
Subjt:  PNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVR

Query:  MKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLL
        +KPD I++  +LSAC  VGL ++G KY+ MM+E + I+P L HY+CM++L GRAG LQEAY  I+ MP EPD+  WG+LL SC++  N DLA++AAE+L 
Subjt:  MKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLL

Query:  FIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEV
         ++P N G Y  L+N+++A G W      R  M+  G++K  G SWI +KN+V+     D  HPQ D+I + MDEI +E++K G  P+ +  LHD+EE+ 
Subjt:  FIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEV

Query:  KEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        +EQ+L  HSEKLA+ FGLL+TP+ T L+++KNLR+C DCH+ IKFIS   GREI +RD  RFHHFKDG CSC D+W
Subjt:  KEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein1.4e-13635.74Show/hide
Query:  RSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWM
        + +H  II+   +   +L NN++  YA   S ++A  VFD +P  + FSWN L+ AY+K G          ++PD D V+W  +I GY+  GL   A+  
Subjt:  RSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWM

Query:  FANMISE-RVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------------
        +  M+ +     ++ T+  ++   ++N  + +G++IH  V+KLG  SY  V + LL MYA  G    AK VF  +  +N   +N+L+             
Subjt:  FANMISE-RVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------------

Query:  --LTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNL-------------------------------------NIIAFTSLLDGYTKLGDVKPAR
              E       A+I   A++G  + A +     +   L                                     +I   ++L+D Y K   +  A+
Subjt:  --LTETETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNL-------------------------------------NIIAFTSLLDGYTKLGDVKPAR

Query:  EIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIA
         +F++++ ++V++WTAM+VGY Q G   +A+ +F  M   G +P+ YTL   +S  +++++LE G Q H  AI +G     +V+N+L+ +Y K G I+ +
Subjt:  EIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIA

Query:  KRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLL
         R+F+     ++ VSWT+M+ A AQ G   E I LF++M+   +KPD +T  GV+SAC+  GLVE+G++Y+ +MT  + I P++ HY+CMIDL+ R+G L
Subjt:  KRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLL

Query:  QEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFG
        +EA  FI  MP  PD I W +LL++C+   N ++ K AAE L+ +DP +   Y+ L++++++ GKW++ A+ R+ M+++ V+KE G SWI  K K+H+F 
Subjt:  QEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFG

Query:  VEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVR
         +D   P  D+IY  ++E+  +I   G+ PDT  V HD+EE VK ++L +HSE+LAIAFGL+  P    +R+ KNLRVC DCH+A K IS + GREI+VR
Subjt:  VEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVR

Query:  DATRFHHFKDGSCSCRDYW
        DA RFH FKDG+CSC D+W
Subjt:  DATRFHHFKDGSCSCRDYW

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein8.5e-25657.61Show/hide
Query:  HILQTSV-RIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAI
        ++LQ SV +    F  + VH ++IK GL   VYLMNNLM  Y+KTG    A  +FDEMPL++ FSWNT++SAY+K+G+ D +     ++P  D VSWT +
Subjt:  HILQTSV-RIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAI

Query:  IVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNA
        IVGY  +G +  AI +  +M+ E + P+QFT++NV++S AA + ++ G+K+HSF+VKLGL    SV+ SLLNMYAKCGDP+ AK VFDRM +++IS+WNA
Subjt:  IVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNA

Query:  LISL--------------------------------------------------------------------------------------TETETSGAVG
        +I+L                                                                                      T  + SG V 
Subjt:  LISL--------------------------------------------------------------------------------------TETETSGAVG

Query:  NALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLA
        NALISMY++ GGVE AR++IE   T +L I  FT+LLDGY KLGD+  A+ IF  L+DRDV+AWTAMIVGY Q+G + +A++LFR MV GG  PNSYTLA
Subjt:  NALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLA

Query:  AMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHIT
        AMLSV SSLA+L HGKQIH SA+K+GE  + SV+NALI MYAK+G+I  A R FDL   +++TVSWTSMI+ALAQHG  EEA+ LFE ML   ++PDHIT
Subjt:  AMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHIT

Query:  YVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNS
        YVGV SACTH GLV QGR+Y++MM +V +I PTLSHYACM+DL+GRAGLLQEA  FIE MPIEPD + WGSLL++C+VHKN DL KVAAERLL ++P NS
Subjt:  YVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNS

Query:  GAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKH
        GAYSALAN++SACGKWE AAK RK MKD  V+KE+GFSWI +K+KVH FGVED  HP+K+EIY  M +IW+EIKKMG++PDT SVLHDLEEEVKEQIL+H
Subjt:  GAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKH

Query:  HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
        HSEKLAIAFGL+STP+ T LRIMKNLRVCNDCH+AIKFISKLVGREIIVRD TRFHHFKDG CSCRDYW
Subjt:  HSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-13837.26Show/hide
Query:  NNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANM-----------ISER
        N +++ Y + G    A  +FDEMP +   SWN +I  Y +  N   +R L   MP+ D  SW  ++ GY Q G  D+A  +F  M           +S  
Subjt:  NNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSWTAIIVGYNQLGLFDNAIWMFANM-----------ISER

Query:  VPPSQFTVSNVISSCAANQAL---------DVGRK----IHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------LT
        V  S+   + ++     N AL          V +K       F   + +    S  T ++  YA+ G    A+ +FD   ++++ TW A++S      + 
Subjt:  VPPSQFTVSNVISSCAANQAL---------DVGRK----IHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALIS------LT

Query:  E---------TETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALD
        E          E +    NA+++ Y +   +E+A+++ +     N++   + +++ GY + G +  A+ +F+K+  RD ++W AMI GY Q+G   +AL 
Subjt:  E---------TETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALD

Query:  LFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEA
        LF  M   G   N  + ++ LS  + +  LE GKQ+H   +K G  +   V NAL+ MY K GSI  A  +F   +G K+ VSW +MI   ++HG GE A
Subjt:  LFRLMVNGGPEPNSYTLAAMLSVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEA

Query:  INLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNA
        +  FE M    +KPD  T V VLSAC+H GLV++GR+Y+  MT+ + + P   HYACM+DL GRAGLL++A++ +++MP EPD   WG+LL + +VH N 
Subjt:  INLFERMLSVRMKPDHITYVGVLSACTHVGLVEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNA

Query:  DLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDT
        +LA+ AA+++  ++P NSG Y  L+N++++ G+W +  K R  M+D+GV+K  G+SWI ++NK H F V D  HP+KDEI+  ++E+   +KK G++  T
Subjt:  DLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTRKLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDT

Query:  ESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW
          VLHD+EEE KE+++++HSE+LA+A+G++       +R++KNLRVC DCH+AIK+++++ GR II+RD  RFHHFKDGSCSC DYW
Subjt:  ESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCHSAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACTGCCTAACTCACCGGCTTCTTCCGACTTCTTTGCACATATCCTGCAAACAAGCGTTAGAATCAGAGACCCATTTGCTGGAAGGTCAGTCCACTCTCAAATAAT
CAAGAAAGGCCTTCATCTTGGTGTGTACTTGATGAATAATCTTATGACCTTCTATGCCAAAACTGGCTCACTCAGTTTTGCCCACCATGTGTTCGATGAAATGCCCCTTA
AGTCAACATTCTCATGGAACACTTTGATCTCTGCGTATGCCAAACAGGGCAATTTTGATGTGTCGCGTCGCCTTTTGTATGAAATGCCTGATTGTGATCCTGTTTCCTGG
ACTGCAATCATCGTTGGTTACAATCAATTGGGTTTATTTGACAATGCGATTTGGATGTTTGCGAATATGATATCTGAGAGAGTCCCGCCTTCTCAATTCACGGTCAGTAA
TGTTATTTCTTCCTGCGCAGCAAATCAAGCTTTGGACGTTGGTAGGAAGATTCATTCATTTGTTGTTAAACTTGGACTTGGTAGTTATGCTTCTGTTGCAACTTCCCTGC
TGAATATGTACGCAAAATGTGGAGATCCAGTCACAGCGAAAGTTGTTTTTGATAGGATGACGCTGAAGAACATTTCAACTTGGAATGCTTTGATTTCATTAACAGAAACT
GAAACTTCTGGAGCAGTTGGCAATGCATTGATATCAATGTATGCAAAGTCAGGTGGGGTAGAAATAGCTAGACAGATTATTGAGCATAACAGGACTTCAAATCTAAATAT
TATAGCGTTCACTTCCCTTCTGGATGGATATACCAAGCTAGGAGATGTTAAACCAGCCAGAGAGATATTTAACAAATTAAGAGATCGTGATGTGATAGCATGGACAGCTA
TGATTGTTGGTTATGTGCAAAACGGGTTATGGAACGATGCACTGGATCTGTTTCGGTTGATGGTTAATGGAGGCCCTGAGCCGAATAGTTATACACTGGCGGCCATGCTT
AGTGTAACTTCAAGTTTGGCTACTTTGGAGCATGGAAAACAAATCCATGCCAGTGCTATAAAAGCTGGAGAATCTTCAACAGCTTCTGTTACTAATGCCTTGATTGCCAT
GTATGCTAAATCCGGAAGCATCAATATAGCCAAAAGAGTTTTTGACCTTACAAGCGGGAAGAAGGAAACTGTGTCCTGGACATCAATGATCATGGCTCTAGCGCAGCATG
GCCTTGGAGAAGAAGCCATCAACCTGTTTGAGAGGATGCTCTCAGTTCGTATGAAACCTGACCATATCACTTACGTTGGAGTGCTCTCTGCTTGTACACACGTGGGATTA
GTAGAACAAGGTCGAAAATACTATAATATGATGACTGAAGTCCATGAAATTGAACCCACTCTAAGCCATTATGCATGTATGATTGACCTTTATGGACGGGCTGGATTACT
TCAAGAAGCATACCACTTCATTGAAAGCATGCCTATAGAACCAGATAATATAGCTTGGGGATCTTTACTAGCTTCTTGTAAAGTTCATAAAAATGCGGATCTGGCAAAAG
TTGCAGCAGAAAGACTGCTTTTTATTGATCCTGGAAATAGCGGCGCCTACTCGGCCCTTGCTAATGTGTTTTCGGCTTGTGGGAAATGGGAAAATGCTGCCAAAACTAGA
AAGCTAATGAAGGATAGAGGAGTGAGGAAAGAGAAAGGATTTAGTTGGATTCATATGAAGAATAAAGTTCACGCATTCGGAGTTGAAGACGTTATTCATCCACAGAAAGA
TGAAATCTACAAGTTGATGGATGAGATATGGGAGGAGATCAAAAAGATGGGTTTCATCCCAGACACTGAATCAGTACTTCATGACCTTGAAGAAGAGGTGAAGGAGCAGA
TTCTTAAACATCACAGTGAAAAACTTGCTATAGCATTTGGGCTTTTAAGTACTCCAGAGAACACTGTACTGAGAATTATGAAAAACCTTAGAGTTTGTAATGACTGCCAT
TCTGCTATAAAGTTCATCTCCAAGCTTGTAGGAAGGGAAATCATTGTGAGAGATGCTACTCGGTTTCACCATTTTAAGGATGGCTCCTGTTCTTGTCGAGACTATTGGTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGAACTGCCTAACTCACCGGCTTCTTCCGACTTCTTTGCACATATCCTGCAAACAAGCGTTAGAATCAGAGACCCATTTGCTGGAAGGTCAGTCCACTCTCAAATAAT
CAAGAAAGGCCTTCATCTTGGTGTGTACTTGATGAATAATCTTATGACCTTCTATGCCAAAACTGGCTCACTCAGTTTTGCCCACCATGTGTTCGATGAAATGCCCCTTA
AGTCAACATTCTCATGGAACACTTTGATCTCTGCGTATGCCAAACAGGGCAATTTTGATGTGTCGCGTCGCCTTTTGTATGAAATGCCTGATTGTGATCCTGTTTCCTGG
ACTGCAATCATCGTTGGTTACAATCAATTGGGTTTATTTGACAATGCGATTTGGATGTTTGCGAATATGATATCTGAGAGAGTCCCGCCTTCTCAATTCACGGTCAGTAA
TGTTATTTCTTCCTGCGCAGCAAATCAAGCTTTGGACGTTGGTAGGAAGATTCATTCATTTGTTGTTAAACTTGGACTTGGTAGTTATGCTTCTGTTGCAACTTCCCTGC
TGAATATGTACGCAAAATGTGGAGATCCAGTCACAGCGAAAGTTGTTTTTGATAGGATGACGCTGAAGAACATTTCAACTTGGAATGCTTTGATTTCATTAACAGAAACT
GAAACTTCTGGAGCAGTTGGCAATGCATTGATATCAATGTATGCAAAGTCAGGTGGGGTAGAAATAGCTAGACAGATTATTGAGCATAACAGGACTTCAAATCTAAATAT
TATAGCGTTCACTTCCCTTCTGGATGGATATACCAAGCTAGGAGATGTTAAACCAGCCAGAGAGATATTTAACAAATTAAGAGATCGTGATGTGATAGCATGGACAGCTA
TGATTGTTGGTTATGTGCAAAACGGGTTATGGAACGATGCACTGGATCTGTTTCGGTTGATGGTTAATGGAGGCCCTGAGCCGAATAGTTATACACTGGCGGCCATGCTT
AGTGTAACTTCAAGTTTGGCTACTTTGGAGCATGGAAAACAAATCCATGCCAGTGCTATAAAAGCTGGAGAATCTTCAACAGCTTCTGTTACTAATGCCTTGATTGCCAT
GTATGCTAAATCCGGAAGCATCAATATAGCCAAAAGAGTTTTTGACCTTACAAGCGGGAAGAAGGAAACTGTGTCCTGGACATCAATGATCATGGCTCTAGCGCAGCATG
GCCTTGGAGAAGAAGCCATCAACCTGTTTGAGAGGATGCTCTCAGTTCGTATGAAACCTGACCATATCACTTACGTTGGAGTGCTCTCTGCTTGTACACACGTGGGATTA
GTAGAACAAGGTCGAAAATACTATAATATGATGACTGAAGTCCATGAAATTGAACCCACTCTAAGCCATTATGCATGTATGATTGACCTTTATGGACGGGCTGGATTACT
TCAAGAAGCATACCACTTCATTGAAAGCATGCCTATAGAACCAGATAATATAGCTTGGGGATCTTTACTAGCTTCTTGTAAAGTTCATAAAAATGCGGATCTGGCAAAAG
TTGCAGCAGAAAGACTGCTTTTTATTGATCCTGGAAATAGCGGCGCCTACTCGGCCCTTGCTAATGTGTTTTCGGCTTGTGGGAAATGGGAAAATGCTGCCAAAACTAGA
AAGCTAATGAAGGATAGAGGAGTGAGGAAAGAGAAAGGATTTAGTTGGATTCATATGAAGAATAAAGTTCACGCATTCGGAGTTGAAGACGTTATTCATCCACAGAAAGA
TGAAATCTACAAGTTGATGGATGAGATATGGGAGGAGATCAAAAAGATGGGTTTCATCCCAGACACTGAATCAGTACTTCATGACCTTGAAGAAGAGGTGAAGGAGCAGA
TTCTTAAACATCACAGTGAAAAACTTGCTATAGCATTTGGGCTTTTAAGTACTCCAGAGAACACTGTACTGAGAATTATGAAAAACCTTAGAGTTTGTAATGACTGCCAT
TCTGCTATAAAGTTCATCTCCAAGCTTGTAGGAAGGGAAATCATTGTGAGAGATGCTACTCGGTTTCACCATTTTAAGGATGGCTCCTGTTCTTGTCGAGACTATTGGTA
G
Protein sequenceShow/hide protein sequence
MELPNSPASSDFFAHILQTSVRIRDPFAGRSVHSQIIKKGLHLGVYLMNNLMTFYAKTGSLSFAHHVFDEMPLKSTFSWNTLISAYAKQGNFDVSRRLLYEMPDCDPVSW
TAIIVGYNQLGLFDNAIWMFANMISERVPPSQFTVSNVISSCAANQALDVGRKIHSFVVKLGLGSYASVATSLLNMYAKCGDPVTAKVVFDRMTLKNISTWNALISLTET
ETSGAVGNALISMYAKSGGVEIARQIIEHNRTSNLNIIAFTSLLDGYTKLGDVKPAREIFNKLRDRDVIAWTAMIVGYVQNGLWNDALDLFRLMVNGGPEPNSYTLAAML
SVTSSLATLEHGKQIHASAIKAGESSTASVTNALIAMYAKSGSINIAKRVFDLTSGKKETVSWTSMIMALAQHGLGEEAINLFERMLSVRMKPDHITYVGVLSACTHVGL
VEQGRKYYNMMTEVHEIEPTLSHYACMIDLYGRAGLLQEAYHFIESMPIEPDNIAWGSLLASCKVHKNADLAKVAAERLLFIDPGNSGAYSALANVFSACGKWENAAKTR
KLMKDRGVRKEKGFSWIHMKNKVHAFGVEDVIHPQKDEIYKLMDEIWEEIKKMGFIPDTESVLHDLEEEVKEQILKHHSEKLAIAFGLLSTPENTVLRIMKNLRVCNDCH
SAIKFISKLVGREIIVRDATRFHHFKDGSCSCRDYW