; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016271 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016271
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr03:3894807..3898022
RNA-Seq ExpressionHG10016271
SyntenyHG10016271
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603840.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.4e-27791.59Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LS ST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFD+FLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

XP_022950030.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita moschata]1.7e-27891.78Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LF QGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

XP_022977696.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima]7.0e-27790.84Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENA+AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSG++HD F+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEES+N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCI IAE IYMSKLQKNLVALTAIIS YAEK DMG+VVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+IS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

XP_023543683.1 pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo]9.7e-27991.78Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LS ST HSAFK YVEGK  TPPLL+FRQLLRYR+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENAK EANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLE S+N NSVTMVSILSANANP SIHCYATKTGL+ENVSVV SL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFD+IDAVF+LFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AE VFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

XP_038882792.1 pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida]2.8e-28694.39Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLATLSLST HSAFK YVEGKNFTPPLLLFRQLLRY +KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+ LFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN LASMYGKCADLE VELLFGE IEKNVVSWNTMIGAF QNGFF EAMLVFKQMLEE VNANSVTMVSILSANANPG IHCYATKTGLVEN+SVV SL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV CGCIQIAELIYMSKLQKNLVALTAIISSYAEK DMGSVVKLYSR+QHLDMKLDAVAMVGIIQGI YPDHFGIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLF EM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL+LE FVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGRIDLAEKVFKSMK+PCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

TrEMBL top hitse value%identityAlignment
A0A0A0KMV8 Uncharacterized protein6.0e-27490.09Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH ATLSL+T HSAFK+YVEGK FTPPLLLFR+LLR+R+KPNDSTFS+LIKAFVVSSS+SSFAPS CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+RLFD+FPEKDVVSWNALISGY+R G SHDAFKLFVEMRRR FDPCQRTLVSL+PSCGTQQLFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KNAL SMYGKCADL+GV+LLFGEI EK+VVSWNTMIGAFGQNG F EAMLVFKQMLEESVNANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I++AELIYMSKL+KNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISSC+Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        AL+DMYVKCGR+D AE VFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

A0A1S4DTH7 pentatricopeptide repeat-containing protein At2g048601.8e-27089.35Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGH ATLSL+T HSAFK+YVEGKNFTPPLLLFRQLLR++++PNDSTFS+LIKAFVVSSS      S CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFD+FPEKDVVSWNALISGY+R GYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQ+LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEESV+ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I+IAELIYMSKLQKNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISS +Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN+++LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        AL+DMYVKCGRID AE VFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

A0A5D3CM04 Pentatricopeptide repeat-containing protein1.8e-27089.35Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGH ATLSL+T HSAFK+YVEGKNFTPPLLLFRQLLR++++PNDSTFS+LIKAFVVSSS      S CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFD+FPEKDVVSWNALISGY+R GYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQ+LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEESV+ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I+IAELIYMSKLQKNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISS +Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN+++LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        AL+DMYVKCGRID AE VFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

A0A6J1GEF9 pentatricopeptide repeat-containing protein At2g04860 isoform X18.0e-27991.78Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LF QGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLIS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

A0A6J1IKP6 pentatricopeptide repeat-containing protein At2g04860 isoform X13.4e-27790.84Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENA+AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSG++HD F+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEES+N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCI IAE IYMSKLQKNLVALTAIIS YAEK DMG+VVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+IS
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLIS

SwissProt top hitse value%identityAlignment
Q0WN60 Pentatricopeptide repeat-containing protein At1g184856.5e-6032.25Show/hide
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  +S +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC

Query:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN
        +H   VK  LD +  + NAL  MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY
           +HCY+ K   V N  V  + V SY +CG +  A+ ++     K + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY

Query:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
             +G   HG+ +++ L  D  V    +S+Y     +  V +LF  M  K+L SWN+VI+   Q+G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI
           +L  G   H Y L++ L  + F+  +LIDMY K G I  + KVF  +KE   ASWN++I
Subjt:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic6.5e-6030.22Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+++      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF

Query:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---
          GK IH L VK+G  LD      L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---

Query:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD
              IH YA ++G    V++ T+LV  Y +CG ++ A  ++   L++N+V+  ++I +Y +  +    + ++ ++    +K   V+++G +   A   
Subjt:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD

Query:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
            G   H   ++ GL  +  V N  ISMY K   +D   S+F ++  +TL SWN++I   +Q+GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI
           H  + +H  ++R+ L+   FV TAL+DMY KCG I +A  +F  M E  + +WN++I
Subjt:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic3.8e-6030.19Show/hide
Query:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD
        +  AFL ++ + G +  A  +F +  E+++ SWN L+ GY++ GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD

Query:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE
          + NAL +MY KC D++   LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y   TG   
Subjt:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y  +    +K D + +  ++   A       G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VAN  I+MYSK   ID    +F  +P+K + SW S+I+    + R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENYELLVLTVIKLL
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN L++      +   +V  +  +++       +T I LL
Subjt:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENYELLVLTVIKLL

Q9SJ73 Pentatricopeptide repeat-containing protein At2g048606.0e-13847.77Show/hide
Query:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        LS  HS  K  + G+  + P+ +FR LLR  L PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD
         +A+ LFDE PE+D V WNALI GYSR+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+ +H +  K+GL+LDSQ+KNAL S Y KCA+
Subjt:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD

Query:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE
        L   E+LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCGC+  AE
Subjt:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEK DM   V  +S+ + L MK+DAVA+VGI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV

Query:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID
          LF+++ +  L SWNSVIS C QSGR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN   E FV TALIDMY KCG   
Subjt:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID

Query:  LAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENY---ELLVLTVIKLLDH
         AE VFKS+K PC A+WNS+IS   ++  +   +  YLE+  +     E+  L V+   +H
Subjt:  LAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENY---ELLVLTVIKLLDH

Q9STE1 Pentatricopeptide repeat-containing protein At4g213001.1e-5926.7Show/hide
Query:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+   +S    +V        L  + ++L + + P+ STF  L+KA V   +                 + L       G D   +V+++ +  Y + G 
Subjt:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA
        +    +LFD   +KD V WN +++GY++ G      K F  MR     P   T   ++  C ++ L   G  +H L V +G+D +  +KN+L SMY KC 
Subjt:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA

Query:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +    LF  +   + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      ++V  TA+IS Y         ++++  +  + +  + + +V I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV
           ++  + +F+ + K+ + SWNS+I+ C+QS     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + LIDMY 
Subjt:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV

Query:  KCGRIDLAEKVFKSMKEPCLASWNSLIS
        KCG +  A  VFK+MKE  + SWNS+I+
Subjt:  KCGRIDLAEKVFKSMKEPCLASWNSLIS

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-6130.22Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+++      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF

Query:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---
          GK IH L VK+G  LD      L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---

Query:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD
              IH YA ++G    V++ T+LV  Y +CG ++ A  ++   L++N+V+  ++I +Y +  +    + ++ ++    +K   V+++G +   A   
Subjt:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD

Query:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
            G   H   ++ GL  +  V N  ISMY K   +D   S+F ++  +TL SWN++I   +Q+GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI
           H  + +H  ++R+ L+   FV TAL+DMY KCG I +A  +F  M E  + +WN++I
Subjt:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-6130.19Show/hide
Query:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD
        +  AFL ++ + G +  A  +F +  E+++ SWN L+ GY++ GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD

Query:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE
          + NAL +MY KC D++   LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y   TG   
Subjt:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y  +    +K D + +  ++   A       G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VAN  I+MYSK   ID    +F  +P+K + SW S+I+    + R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENYELLVLTVIKLL
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN L++      +   +V  +  +++       +T I LL
Subjt:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENYELLVLTVIKLL

AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-6132.25Show/hide
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  +S +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC

Query:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN
        +H   VK  LD +  + NAL  MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY
           +HCY+ K   V N  V  + V SY +CG +  A+ ++     K + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY

Query:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
             +G   HG+ +++ L  D  V    +S+Y     +  V +LF  M  K+L SWN+VI+   Q+G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI
           +L  G   H Y L++ L  + F+  +LIDMY K G I  + KVF  +KE   ASWN++I
Subjt:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLI

AT2G04860.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-13947.77Show/hide
Query:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        LS  HS  K  + G+  + P+ +FR LLR  L PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD
         +A+ LFDE PE+D V WNALI GYSR+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+ +H +  K+GL+LDSQ+KNAL S Y KCA+
Subjt:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD

Query:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE
        L   E+LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCGC+  AE
Subjt:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEK DM   V  +S+ + L MK+DAVA+VGI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV

Query:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID
          LF+++ +  L SWNSVIS C QSGR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN   E FV TALIDMY KCG   
Subjt:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID

Query:  LAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENY---ELLVLTVIKLLDH
         AE VFKS+K PC A+WNS+IS   ++  +   +  YLE+  +     E+  L V+   +H
Subjt:  LAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSYLEVIRENY---ELLVLTVIKLLDH

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.9e-6126.7Show/hide
Query:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+   +S    +V        L  + ++L + + P+ STF  L+KA V   +                 + L       G D   +V+++ +  Y + G 
Subjt:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA
        +    +LFD   +KD V WN +++GY++ G      K F  MR     P   T   ++  C ++ L   G  +H L V +G+D +  +KN+L SMY KC 
Subjt:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA

Query:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +    LF  +   + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      ++V  TA+IS Y         ++++  +  + +  + + +V I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV
           ++  + +F+ + K+ + SWNS+I+ C+QS     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + LIDMY 
Subjt:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV

Query:  KCGRIDLAEKVFKSMKEPCLASWNSLIS
        KCG +  A  VFK+MKE  + SWNS+I+
Subjt:  KCGRIDLAEKVFKSMKEPCLASWNSLIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTTACATCTTCGGTAGGTCACCTAGCAACGCTCTCGCTCTCTACCTGCCATTCTGCATTCAAATATTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGCT
TTTCCGTCAGCTGCTAAGGTATCGGCTTAAACCTAATGATTCTACCTTCTCCGTCCTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTTGCACCATCATCCT
GTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTCCTGAAAAAGATGTAGTCTCGTGGAATGCATTGATTTCTGGGTACTCACGAAGTGGATATAGCCATGA
TGCGTTCAAGCTATTTGTCGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATTCCTTCTTGTGGTACCCAACAATTATTTGTCCAAGGAA
AATGCATCCATGCGTTAGGTGTTAAAGCTGGACTTGATTTGGACTCCCAAATGAAAAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTGGAAGGGGTGGAACTC
CTATTTGGAGAGATTATTGAAAAAAATGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTTGAGGCAATGCTTGTTTTCAAGCAAATGCT
TGAGGAAAGTGTCAACGCTAACTCGGTGACGATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACGGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGATGCATACAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTAGCTATGCTGAGAAATGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTAGATGCAGTTGCAATGGTTGGAATAAT
CCAAGGTATTGCATATCCTGATCACTTTGGCATTGGACTTGCTTTCCACGGTTATGGGCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCCTAAAAAGACACTGAGCAGCTGGAATTCTGTGATATCTAGCTGTTCACAGTCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGAACTTGGAGGGTTTTGTCGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACT
TAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCACTGATCTCTGTAAAAATGATGAACACAAAGAAAATCTGTATAGTTGTATCTTAC
CTTGAAGTTATCAGAGAAAATTATGAGCTGTTAGTCTTGACAGTTATCAAGTTACTGGATCACCAAGACCCACAAATCCAGCAAGCTGCAGCTTTTTACGATTCCTTCCT
CGCCTTGAAGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTTACATCTTCGGTAGGTCACCTAGCAACGCTCTCGCTCTCTACCTGCCATTCTGCATTCAAATATTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGCT
TTTCCGTCAGCTGCTAAGGTATCGGCTTAAACCTAATGATTCTACCTTCTCCGTCCTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTTGCACCATCATCCT
GTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTCCTGAAAAAGATGTAGTCTCGTGGAATGCATTGATTTCTGGGTACTCACGAAGTGGATATAGCCATGA
TGCGTTCAAGCTATTTGTCGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATTCCTTCTTGTGGTACCCAACAATTATTTGTCCAAGGAA
AATGCATCCATGCGTTAGGTGTTAAAGCTGGACTTGATTTGGACTCCCAAATGAAAAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTGGAAGGGGTGGAACTC
CTATTTGGAGAGATTATTGAAAAAAATGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTTGAGGCAATGCTTGTTTTCAAGCAAATGCT
TGAGGAAAGTGTCAACGCTAACTCGGTGACGATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACGGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGATGCATACAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTAGCTATGCTGAGAAATGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTAGATGCAGTTGCAATGGTTGGAATAAT
CCAAGGTATTGCATATCCTGATCACTTTGGCATTGGACTTGCTTTCCACGGTTATGGGCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCCTAAAAAGACACTGAGCAGCTGGAATTCTGTGATATCTAGCTGTTCACAGTCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGAACTTGGAGGGTTTTGTCGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACT
TAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCACTGATCTCTGTAAAAATGATGAACACAAAGAAAATCTGTATAGTTGTATCTTAC
CTTGAAGTTATCAGAGAAAATTATGAGCTGTTAGTCTTGACAGTTATCAAGTTACTGGATCACCAAGACCCACAAATCCAGCAAGCTGCAGCTTTTTACGATTCCTTCCT
CGCCTTGAAGGGTTGA
Protein sequenceShow/hide protein sequence
MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSK
LGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVEL
LFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTA
IISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSG
RSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISVKMMNTKKICIVVSY
LEVIRENYELLVLTVIKLLDHQDPQIQQAAAFYDSFLALKG