; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G005080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G005080
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr01:4146871..4149024
RNA-Seq ExpressionLsi01G005080
SyntenyLsi01G005080
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603840.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0092.09Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LS ST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFD+FLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

XP_022950030.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita moschata]0.0e+0092.23Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LF QGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

XP_022977696.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima]0.0e+0091.81Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENA+AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSG++HD F+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEES+N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCI IAE IYMSKLQKNLVALTAIIS YAEK DMG+VVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+ISGYGLFGFDNH  LCYTKMMEKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVA+VRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

XP_023543683.1 pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo]0.0e+0092.8Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LS ST HSAFK YVEGK  TPPLL+FRQLLRYR+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENAK EANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLE S+N NSVTMVSILSANANP SIHCYATKTGL+ENVSVV SL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFD+IDAVF+LFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AE VFKSMKEPCLASWNSLISGYGLFGFDNHA LCYT MMEKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

XP_038882792.1 pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida]0.0e+0094.56Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLATLSLST HSAFK YVEGKNFTPPLLLFRQLLRY +KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+ LFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN LASMYGKCADLE VELLFGE IEKNVVSWNTMIGAF QNGFF EAMLVFKQMLEE VNANSVTMVSILSANANPG IHCYATKTGLVEN+SVV SL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV CGCIQIAELIYMSKLQKNLVALTAIISSYAEK DMGSVVKLYSR+QHLDMKLDAVAMVGIIQGI YPDHFGIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLF EM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL+LE FVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGRIDLAEKVFKSMK+PCLASWNSLISGYGLFGFDNHAL CYTKMMEKGIKPNKITFSG+LAACTHGG VEEGRTYFKIMKKEFGIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVA+VRKMMREMG+DG SG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWISLEDIDRDLY
        VSLMEWISLED +RDLY
Subjt:  VSLMEWISLEDIDRDLY

TrEMBL top hitse value%identityAlignment
A0A0A0KMV8 Uncharacterized protein0.0e+0090.77Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH ATLSL+T HSAFK+YVEGK FTPPLLLFR+LLR+R+KPNDSTFS+LIKAFVVSSS+SSFAPS CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+RLFD+FPEKDVVSWNALISGY+R G SHDAFKLFVEMRRR FDPCQRTLVSL+PSCGTQQLFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KNAL SMYGKCADL+GV+LLFGEI EK+VVSWNTMIGAFGQNG F EAMLVFKQMLEESVNANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I++AELIYMSKL+KNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISSC+Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        AL+DMYVKCGR+D AE VFKSMKEPCLASWNSLISGYGLFGF NHALLCYT+MMEKGIKPNKITFSGILAACTHGG VEEGR YFKIMKK+FGIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NME NPDSAVWGALLSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAAS RWNDVAR+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

A0A1S4DTH7 pentatricopeptide repeat-containing protein At2g048600.0e+0090.5Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGH ATLSL+T HSAFK+YVEGKNFTPPLLLFRQLLR++++PNDSTFS+LIKAFVVSSS      S CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFD+FPEKDVVSWNALISGY+R GYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQ+LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEESV+ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I+IAELIYMSKLQKNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISS +Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN+++LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        AL+DMYVKCGRID AE VFKSMKEPCLASWNSLISGYGLFGF N ALLCYTKMMEKGIKPNKITFSGILAACTHGG VEEGR YFK MKKEFGIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNME NPDSAVWGALLSACCIHQE+KLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

A0A5D3CM04 Pentatricopeptide repeat-containing protein0.0e+0090.5Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGH ATLSL+T HSAFK+YVEGKNFTPPLLLFRQLLR++++PNDSTFS+LIKAFVVSSS      S CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFD+FPEKDVVSWNALISGY+R GYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQ+LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEESV+ANSVTMVSILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        VCSYV+CG I+IAELIYMSKLQKNLVALTAIIS YAEK DMGSVV+LYS VQHLDMKLDAVAMVGIIQG  YPDH GIGLAFHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYSKFDNIDAVFSLFQEM KKTLSSWNSVISS +Q+GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN+++LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        AL+DMYVKCGRID AE VFKSMKEPCLASWNSLISGYGLFGF N ALLCYTKMMEKGIKPNKITFSGILAACTHGG VEEGR YFK MKKEFGIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNME NPDSAVWGALLSACCIHQE+KLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

A0A6J1GEF9 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0092.23Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAP SCSENAKAEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSGY+HDAF+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LF QGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEE ++ NSVTMVSILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYVRCGCIQIAELIYMSKLQKNLVALTAIIS YAEK DMGSVVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGLAFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNSLISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

A0A6J1IKP6 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0091.81Show/hide
Query:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHLA+LSLST HSAFK YVEGK  TPPLL+FRQLLR R+KPNDSTFS+LIKAFVVSSSSSSFAPSSCSENA+AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFD+ PEKDVVSWNALISGYSRSG++HD F+LFVEMRRRGF+PCQRTLVSLIPSCGTQ LFVQGKCIHALGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQ

Query:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL
        +KN+LASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFF EAMLVFKQMLEES+N +SVTMVSILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  MKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG
        +CSYV+CGCI IAE IYMSKLQKNLVALTAIIS YAEK DMG+VVKLYSRVQHL+MKLDAVAMVGIIQGI YPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT
        FISMYS+FD+IDAVFSLFQEM +KTLSSWNSVISSC+Q+GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNL+LEGFVGT
Subjt:  FISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGT

Query:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH
        ALIDMYVKCGR+D AEKVFKSMKEPCLASWNS+ISGYGLFGFDNH  LCYTKMMEKGIKPNKITFSGILAACTHGG VEEGRTYF+IMKKE GIVPESQH
Subjt:  ALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVA+VRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSG

Query:  VSLMEWIS
        VSLMEWIS
Subjt:  VSLMEWIS

SwissProt top hitse value%identityAlignment
Q0WN60 Pentatricopeptide repeat-containing protein At1g184851.4e-10134.33Show/hide
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  +S +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC

Query:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN
        +H   VK  LD +  + NAL  MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY
           +HCY+ K   V N  V  + V SY +CG +  A+ ++     K + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY

Query:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
             +G   HG+ +++ L  D  V    +S+Y     +  V +LF  M  K+L SWN+VI+   Q+G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L  + F+  +LIDMY K G I  + KVF  +KE   ASWN++I GYG+ G    A+  + +M   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC

Query:  THGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS
         H G + EG  Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+S
Subjt:  THGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS

Query:  NLYAASGRWNDVARVRKMMREMG---EDGCSGVSL
        NLYA  G+W DV +VR+ M EM    + GCS + L
Subjt:  NLYAASGRWNDVARVRKMMREMG---EDGCSGVSL

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.7e-10032.96Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+++      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF

Query:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---
          GK IH L VK+G  LD      L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---

Query:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD
              IH YA ++G    V++ T+LV  Y +CG ++ A  ++   L++N+V+  ++I +Y +  +    + ++ ++    +K   V+++G +   A   
Subjt:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD

Query:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
            G   H   ++ GL  +  V N  ISMY K   +D   S+F ++  +TL SWN++I   +Q+GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTH
           H  + +H  ++R+ L+   FV TAL+DMY KCG I +A  +F  M E  + +WN++I GYG  GF   AL  + +M +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTH

Query:  GGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY
         G VE G   F +MK+ + I     H  +MV LLGRAG   EA  FI  M + P   V+GA+L AC IH+ V   E  A++L   N  +GG+ VL++N+Y
Subjt:  GGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY

Query:  AASGRWNDVARVRKMMREMGEDGCSGVSLME
         A+  W  V +VR  M   G     G S++E
Subjt:  AASGRWNDVARVRKMMREMGEDGCSGVSLME

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic5.8e-10333.71Show/hide
Query:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD
        +  AFL ++ + G +  A  +F +  E+++ SWN L+ GY++ GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD

Query:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE
          + NAL +MY KC D++   LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y   TG   
Subjt:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y  +    +K D + +  ++   A       G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VAN  I+MYSK   ID    +F  +P+K + SW S+I+    + R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEF
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C+    V +G  YF  M +++
Subjt:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEF

Query:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M + PD AVWGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +VA+VR+MM+E
Subjt:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

Q9SJ73 Pentatricopeptide repeat-containing protein At2g048601.0e-20052.25Show/hide
Query:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        LS  HS  K  + G+  + P+ +FR LLR  L PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD
         +A+ LFDE PE+D V WNALI GYSR+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+ +H +  K+GL+LDSQ+KNAL S Y KCA+
Subjt:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD

Query:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE
        L   E+LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCGC+  AE
Subjt:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEK DM   V  +S+ + L MK+DAVA+VGI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV

Query:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID
          LF+++ +  L SWNSVIS C QSGR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN   E FV TALIDMY KCG   
Subjt:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID

Query:  LAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGL
         AE VFKS+K PC A+WNS+ISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVGLLGRA L
Subjt:  LAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVS
        F EA+  I  M+I PDSAVWGALLSAC IH+E+++GE VA+K+   + +NGG +VLMSNLYA    W+DV RVR MM++ G DG  GVS
Subjt:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVS

Q9STE1 Pentatricopeptide repeat-containing protein At4g213007.8e-10029.76Show/hide
Query:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+   +S    +V        L  + ++L + + P+ STF  L+KA V   +                 + L       G D   +V+++ +  Y + G 
Subjt:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA
        +    +LFD   +KD V WN +++GY++ G      K F  MR     P   T   ++  C ++ L   G  +H L V +G+D +  +KN+L SMY KC 
Subjt:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA

Query:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +    LF  +   + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      ++V  TA+IS Y         ++++  +  + +  + + +V I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV
           ++  + +F+ + K+ + SWNS+I+ C+QS     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + LIDMY 
Subjt:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV

Query:  KCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVG
        KCG +  A  VFK+MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V+EG  +F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL+  +  N G++VL+SN +A +  W  V +VR +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVSLME

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-10132.96Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+++      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLF

Query:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---
          GK IH L VK+G  LD      L +MY KC  +     +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN---

Query:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD
              IH YA ++G    V++ T+LV  Y +CG ++ A  ++   L++N+V+  ++I +Y +  +    + ++ ++    +K   V+++G +   A   
Subjt:  ---PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPD

Query:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
            G   H   ++ GL  +  V N  ISMY K   +D   S+F ++  +TL SWN++I   +Q+GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTH
           H  + +H  ++R+ L+   FV TAL+DMY KCG I +A  +F  M E  + +WN++I GYG  GF   AL  + +M +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTH

Query:  GGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY
         G VE G   F +MK+ + I     H  +MV LLGRAG   EA  FI  M + P   V+GA+L AC IH+ V   E  A++L   N  +GG+ VL++N+Y
Subjt:  GGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLY

Query:  AASGRWNDVARVRKMMREMGEDGCSGVSLME
         A+  W  V +VR  M   G     G S++E
Subjt:  AASGRWNDVARVRKMMREMGEDGCSGVSLME

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-10433.71Show/hide
Query:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD
        +  AFL ++ + G +  A  +F +  E+++ SWN L+ GY++ GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLD

Query:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE
          + NAL +MY KC D++   LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y   TG   
Subjt:  SQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y  +    +K D + +  ++   A       G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VAN  I+MYSK   ID    +F  +P+K + SW S+I+    + R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEF
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C+    V +G  YF  M +++
Subjt:  NLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEF

Query:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M + PD AVWGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +VA+VR+MM+E
Subjt:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-10234.33Show/hide
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  +S +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLIPSCGTQQLFVQGKC

Query:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN
        +H   VK  LD +  + NAL  MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHALGVKAGLDLDSQMKNALASMYGKCADLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY
           +HCY+ K   V N  V  + V SY +CG +  A+ ++     K + +  A+I  +A+  D        S   HL MK+     D+  +  ++   + 
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKL-----DAVAMVGIIQGIAY

Query:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
             +G   HG+ +++ L  D  V    +S+Y     +  V +LF  M  K+L SWN+VI+   Q+G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L  + F+  +LIDMY K G I  + KVF  +KE   ASWN++I GYG+ G    A+  + +M   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAAC

Query:  THGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS
         H G + EG  Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+S
Subjt:  THGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMS

Query:  NLYAASGRWNDVARVRKMMREMG---EDGCSGVSL
        NLYA  G+W DV +VR+ M EM    + GCS + L
Subjt:  NLYAASGRWNDVARVRKMMREMG---EDGCSGVSL

AT2G04860.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.3e-20252.25Show/hide
Query:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        LS  HS  K  + G+  + P+ +FR LLR  L PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD
         +A+ LFDE PE+D V WNALI GYSR+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+ +H +  K+GL+LDSQ+KNAL S Y KCA+
Subjt:  KAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCAD

Query:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE
        L   E+LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCGC+  AE
Subjt:  LEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEK DM   V  +S+ + L MK+DAVA+VGI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAV

Query:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID
          LF+++ +  L SWNSVIS C QSGR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN   E FV TALIDMY KCG   
Subjt:  FSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRID

Query:  LAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGL
         AE VFKS+K PC A+WNS+ISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVGLLGRA L
Subjt:  LAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVS
        F EA+  I  M+I PDSAVWGALLSAC IH+E+++GE VA+K+   + +NGG +VLMSNLYA    W+DV RVR MM++ G DG  GVS
Subjt:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVS

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.6e-10129.76Show/hide
Query:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+   +S    +V        L  + ++L + + P+ STF  L+KA V   +                 + L       G D   +V+++ +  Y + G 
Subjt:  SLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA
        +    +LFD   +KD V WN +++GY++ G      K F  MR     P   T   ++  C ++ L   G  +H L V +G+D +  +KN+L SMY KC 
Subjt:  VKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCA

Query:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +    LF  +   + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DLEGVELLFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      ++V  TA+IS Y         ++++  +  + +  + + +V I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GCIQIAELIYMSKLQKNLVALTAIISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV
           ++  + +F+ + K+ + SWNS+I+ C+QS     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + LIDMY 
Subjt:  FDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYV

Query:  KCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVG
        KCG +  A  VFK+MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V+EG  +F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL+  +  N G++VL+SN +A +  W  V +VR +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVSLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTTACATCTTCGGTAGGTCACCTAGCAACGCTCTCGCTCTCTACCTGCCATTCTGCATTCAAATATTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGCT
TTTCCGTCAGCTGCTAAGGTATCGGCTTAAACCTAATGATTCTACCTTCTCCGTCCTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTTGCACCATCATCCT
GTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTCCTGAAAAAGATGTAGTCTCGTGGAATGCATTGATTTCTGGGTACTCACGAAGTGGATATAGCCATGA
TGCGTTCAAGCTATTTGTCGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATTCCTTCTTGTGGTACCCAACAATTATTTGTCCAAGGAA
AATGCATCCATGCGTTAGGTGTTAAAGCTGGACTTGATTTGGACTCCCAAATGAAAAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTGGAAGGGGTGGAACTC
CTATTTGGAGAGATTATTGAAAAAAATGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTTGAGGCAATGCTTGTTTTCAAGCAAATGCT
TGAGGAAAGTGTCAACGCTAACTCGGTGACGATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACGGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGATGCATACAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTAGCTATGCTGAGAAATGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTAGATGCAGTTGCAATGGTTGGAATAAT
CCAAGGTATTGCATATCCTGATCACTTTGGCATTGGACTTGCTTTCCACGGTTATGGGCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCCTAAAAAGACACTGAGCAGCTGGAATTCTGTGATATCTAGCTGTTCACAGTCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGAACTTGGAGGGTTTTGTCGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACT
TAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCACTGATCTCTGGTTATGGTTTATTTGGGTTTGACAATCATGCTCTCCTCTGTTAC
ACTAAAATGATGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGCATTTTAGCTGCTTGTACTCATGGAGGACGTGTCGAAGAAGGTAGAACATACTTCAAAAT
CATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAACATTGTGCATCCATGGTTGGCCTGCTTGGTCGAGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACA
TGGAAATCAATCCAGATTCTGCAGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCGAAAAAGTTGCTTTTCTCTAAC
TGTAGAAATGGAGGGTTTTTTGTTTTGATGTCAAATCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGCAAGAGTTAGAAAGATGATGCGAGAAATGGGAGAAGATGG
TTGTTCAGGTGTTAGCCTTATGGAATGGATTTCTTTGGAAGACATAGACAGAGATTTATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTTACATCTTCGGTAGGTCACCTAGCAACGCTCTCGCTCTCTACCTGCCATTCTGCATTCAAATATTACGTTGAAGGCAAAAATTTTACTCCCCCCTTGTTGCT
TTTCCGTCAGCTGCTAAGGTATCGGCTTAAACCTAATGATTCTACCTTCTCCGTCCTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTTGCACCATCATCCT
GTTCTGAGAATGCAAAAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGACTTGTACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGAATTTCCTGAAAAAGATGTAGTCTCGTGGAATGCATTGATTTCTGGGTACTCACGAAGTGGATATAGCCATGA
TGCGTTCAAGCTATTTGTCGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATTCCTTCTTGTGGTACCCAACAATTATTTGTCCAAGGAA
AATGCATCCATGCGTTAGGTGTTAAAGCTGGACTTGATTTGGACTCCCAAATGAAAAATGCTCTTGCATCGATGTATGGTAAATGTGCAGATTTGGAAGGGGTGGAACTC
CTATTTGGAGAGATTATTGAAAAAAATGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTTGAGGCAATGCTTGTTTTCAAGCAAATGCT
TGAGGAAAGTGTCAACGCTAACTCGGTGACGATGGTGAGTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACGGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAGATGTGGATGCATACAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTAGCTATGCTGAGAAATGTGACATGGGATCTGTGGTGAAGCTATATTCCCGAGTACAGCATTTAGATATGAAACTAGATGCAGTTGCAATGGTTGGAATAAT
CCAAGGTATTGCATATCCTGATCACTTTGGCATTGGACTTGCTTTCCACGGTTATGGGCTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATGCCTAAAAAGACACTGAGCAGCTGGAATTCTGTGATATCTAGCTGTTCACAGTCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACGTTGTCAGGTTATGGGCCAGATTCAATTACACTTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGAACTTGGAGGGTTTTGTCGGGACTGCTCTTATAGACATGTACGTCAAGTGTGGAAGAATAGACT
TAGCTGAAAAGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCACTGATCTCTGGTTATGGTTTATTTGGGTTTGACAATCATGCTCTCCTCTGTTAC
ACTAAAATGATGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGCATTTTAGCTGCTTGTACTCATGGAGGACGTGTCGAAGAAGGTAGAACATACTTCAAAAT
CATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAACATTGTGCATCCATGGTTGGCCTGCTTGGTCGAGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACA
TGGAAATCAATCCAGATTCTGCAGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCGAAAAAGTTGCTTTTCTCTAAC
TGTAGAAATGGAGGGTTTTTTGTTTTGATGTCAAATCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGCAAGAGTTAGAAAGATGATGCGAGAAATGGGAGAAGATGG
TTGTTCAGGTGTTAGCCTTATGGAATGGATTTCTTTGGAAGACATAGACAGAGATTTATACTAG
Protein sequenceShow/hide protein sequence
MQFTSSVGHLATLSLSTCHSAFKYYVEGKNFTPPLLLFRQLLRYRLKPNDSTFSVLIKAFVVSSSSSSFAPSSCSENAKAEANQLQTHFIKWGFDQFLYVSTAFLDLYSK
LGFVKAARRLFDEFPEKDVVSWNALISGYSRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLIPSCGTQQLFVQGKCIHALGVKAGLDLDSQMKNALASMYGKCADLEGVEL
LFGEIIEKNVVSWNTMIGAFGQNGFFFEAMLVFKQMLEESVNANSVTMVSILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGCIQIAELIYMSKLQKNLVALTA
IISSYAEKCDMGSVVKLYSRVQHLDMKLDAVAMVGIIQGIAYPDHFGIGLAFHGYGLKSGLIIDCLVANGFISMYSKFDNIDAVFSLFQEMPKKTLSSWNSVISSCSQSG
RSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLNLEGFVGTALIDMYVKCGRIDLAEKVFKSMKEPCLASWNSLISGYGLFGFDNHALLCY
TKMMEKGIKPNKITFSGILAACTHGGRVEEGRTYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN
CRNGGFFVLMSNLYAASGRWNDVARVRKMMREMGEDGCSGVSLMEWISLEDIDRDLY