; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G027450 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G027450
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchrH02:2934555..2936669
RNA-Seq ExpressionChy2G027450
SyntenyChy2G027450
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134990.1 pentatricopeptide repeat-containing protein At2g04860 [Cucumis sativus]0.096.73Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHPATLSLTTFHSAFKFYVEGK FTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSS+SSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCG SHDAFKLFVEMRRR FDPCQRTLVSLMPSCGTQQLFVQGKSIHGLG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKNALVSMYGKCADL+GVKLLFGEITEK+VVSWNTMIGAFGQNGLFSEAM+VFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYVKCGYI+LAELIYMSKLKKNLVALTAIISHYAEKGD GSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLFQE+HKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        ALVDMYVKCGR+DFAENVFKSMK PCLASWNSLISGYGLFGFHNHALLCYTEM+EKGIKPNKITFSGILAAC HGGLVEEGRKYFKIMKK+FGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

XP_016899294.1 PREDICTED: pentatricopeptide repeat-containing protein At2g04860 [Cucumis melo]0.093.47Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPATLSLTTFHSAFKFYVEGK+FTPPLLLFR+LLRH+V+PNDSTFSLLIKAFVVSSSS      FCSENEKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+S+LGFVKAA+RLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEESV+ANSVTMVSILSANAN GCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYVKCGYI++AELIYMSKL+KNLVALTAIIS YAEKGD GSVVRLYS+VQHLDMKLDAVAMVGIIQGFTYPDH GIGLAFHGYGVKSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLFQE+HKKTLSSWNSVISS AQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++DLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        ALVDMYVKCGRIDFAENVFKSMK PCLASWNSLISGYGLFGFHN ALLCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGRKYFK MKKEFGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQE+KLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

XP_022977696.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima]0.087.5Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+LSL+TFHSAFK YVEGK  TPPLL+FR+LLR RVKPNDSTFSLLIKAFVVSSSSSSFAPS CSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAA+RLFDD PEKDVVSWNALISGY+R G++HD F+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEES+N +SVTMVSILSANAN   IHCYATK GL+ENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        +CSYVKCG I +AE IYMSKL+KNLVALTAIIS YAEKGD G+VV+LYS VQHL+MKLDAVAMVGIIQG TYPDH GIGL+FHGYG+KSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YS+FD+IDAVFSLFQE+H+KTLSSWNSVISSCAQAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        AL+DMYVKCGR+DFAE VFKSMK PCLASWNS+ISGYGLFGF NH  LCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGR YF+IMKKE GIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGA LSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAAS RWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

XP_023543683.1 pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo]0.088.64Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+LS +TFHSAFK YVEGK  TPPLL+FR+LLR+RVKPNDSTFSLLIKAFVVSSSSSSFAPS CSEN K EANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAA+RLFDD PEKDVVSWNALISGY+R GY+HDAF+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLE S+N NSVTMVSILSANAN   IHCYATK GL+ENVSVV SL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        +CSYVKCG I++AELIYMSKL+KNLVALTAIIS YAEKGD GSVV+LYS VQHL+MKLDAVAMVGIIQG TYPDH GIGLAFHGYG+KSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFD+IDAVF+LFQE+H+KTLSSWNSVISSCAQAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        AL+DMYVKCGR+DFAENVFKSMK PCLASWNSLISGYGLFGF NHA LCYT M+EKGIKPNKITFSGILAAC HGGLVEEGR YF+IMKKE GIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGALLSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAAS RWNDVAR+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

XP_038882792.1 pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida]0.089.77Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH ATLSL+TFHSAFK YVEGK+FTPPLLLFR+LLR+ +KPNDSTFSLLIKAFVVSSSSSSFAPS CSEN KAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAAQ LFD+FPEKDVVSWNALISGY+R GYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQQLFVQGK IH LG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN L SMYGKCADLE V+LLFGE  EKNVVSWNTMIGAF QNG F EAM+VFKQMLEE VNANSVTMVSILSANAN GCIHCYATK GLVEN+SVV SL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYV CG I++AELIYMSKL+KNLVALTAIIS YAEKGD GSVV+LYS +QHLDMKLDAVAMVGIIQG TYPDH GIGLAFHGYG+KSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLF E+H+KTLSSWNSVISSCAQAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLE FVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        AL+DMYVKCGRID AE VFKSMK PCLASWNSLISGYGLFGF NHAL CYT+M+EKGIKPNKITFSG+LAAC HGGLVEEGR YFKIMKKEFGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNME NPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVA++RKMMREMG+DG SG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

TrEMBL top hitse value%identityAlignment
A0A0A0KMV8 Uncharacterized protein0.0e+0096.73Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHPATLSLTTFHSAFKFYVEGK FTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSS+SSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCG SHDAFKLFVEMRRR FDPCQRTLVSLMPSCGTQQLFVQGKSIHGLG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKNALVSMYGKCADL+GVKLLFGEITEK+VVSWNTMIGAFGQNGLFSEAM+VFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYVKCGYI+LAELIYMSKLKKNLVALTAIISHYAEKGD GSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLFQE+HKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        ALVDMYVKCGR+DFAENVFKSMK PCLASWNSLISGYGLFGFHNHALLCYTEM+EKGIKPNKITFSGILAAC HGGLVEEGRKYFKIMKK+FGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

A0A1S4DTH7 pentatricopeptide repeat-containing protein At2g048600.0e+0093.47Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPATLSLTTFHSAFKFYVEGK+FTPPLLLFR+LLRH+V+PNDSTFSLLIKAFVVSSS      SFCSENEKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+S+LGFVKAA+RLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEESV+ANSVTMVSILSANAN GCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYVKCGYI++AELIYMSKL+KNLVALTAIIS YAEKGD GSVVRLYS+VQHLDMKLDAVAMVGIIQGFTYPDH GIGLAFHGYGVKSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLFQE+HKKTLSSWNSVISS AQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++DLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        ALVDMYVKCGRIDFAENVFKSMK PCLASWNSLISGYGLFGFHN ALLCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGRKYFK MKKEFGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQE+KLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

A0A5D3CM04 Pentatricopeptide repeat-containing protein0.0e+0093.47Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPATLSLTTFHSAFKFYVEGK+FTPPLLLFR+LLRH+V+PNDSTFSLLIKAFVVSSS      SFCSENEKAEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+S+LGFVKAA+RLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEESV+ANSVTMVSILSANAN GCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        VCSYVKCGYI++AELIYMSKL+KNLVALTAIIS YAEKGD GSVVRLYS+VQHLDMKLDAVAMVGIIQGFTYPDH GIGLAFHGYGVKSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YSKFDNIDAVFSLFQE+HKKTLSSWNSVISS AQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++DLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        ALVDMYVKCGRIDFAENVFKSMK PCLASWNSLISGYGLFGFHN ALLCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGRKYFK MKKEFGIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQE+KLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

A0A6J1GEF9 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0087.64Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+LSL+TFHSAFK YVEGK  TPPLL+FR+LLR RVKPNDSTFSLLIKAFVVSSSSSSFAP  CSEN KAEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAA+RLFDD PEKDVVSWNALISGY+R GY+HDAF+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LF QGK IH LG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEE ++ NSVTMVSILSANAN   IHCYATK GLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        +CSYV+CG I++AELIYMSKL+KNLVALTAIIS YAEKGD GSVV+LYS VQHL+MKLDAVAMVGIIQG TYPDH GIGLAFHGYG+KSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YS+FD+IDAVFSLFQE+ +KTLSSWNSVISSCAQAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        AL+DMYVKCGR+DFAE VFKSMK PCLASWNSLISGYGLFGF+NHA LCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGR YF+IMKKE GIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGALL+ACCIHQEVKLGESVAK+L FSN RNGGFFVLMSNLYAAS RWNDVAR+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

A0A6J1IKP6 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0087.5Show/hide
Query:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+LSL+TFHSAFK YVEGK  TPPLL+FR+LLR RVKPNDSTFSLLIKAFVVSSSSSSFAPS CSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ
        STAFLDL+SKLGFVKAA+RLFDD PEKDVVSWNALISGY+R G++HD F+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LG+KAGLDLDSQ
Subjt:  STAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNG F EAM+VFKQMLEES+N +SVTMVSILSANAN   IHCYATK GL+ENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG
        +CSYVKCG I +AE IYMSKL+KNLVALTAIIS YAEKGD G+VV+LYS VQHL+MKLDAVAMVGIIQG TYPDH GIGL+FHGYG+KSGLIIDCLVA+G
Subjt:  VCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHG

Query:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT
        FIS+YS+FD+IDAVFSLFQE+H+KTLSSWNSVISSCAQAGRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLDLEGFVGT
Subjt:  FISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGT

Query:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH
        AL+DMYVKCGR+DFAE VFKSMK PCLASWNS+ISGYGLFGF NH  LCYT+M+EKGIKPNKITFSGILAAC HGGLVEEGR YF+IMKKE GIVP+SQH
Subjt:  ALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGA LSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAAS RWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSLM
Subjt:  VSLM

SwissProt top hitse value%identityAlignment
Q0WN60 Pentatricopeptide repeat-containing protein At1g184851.1e-10134.17Show/hide
Query:  IKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS
        +K G  + ++V  A +  +   GFV  A +LFD  PE+++VSWN++I  ++  G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS

Query:  IHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLE--ESVNANSVTMVSIL------SANAN
        +HG  +K  LD +  + NAL+ MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  TGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKL-----DAVAMVGIIQGFTY
           +HCY+ K   V N  V  + V SY KCG +  A+ ++     K + +  A+I  +A+  D     RL S+  HL MK+     D+  +  ++   + 
Subjt:  TGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKL-----DAVAMVGIIQGFTY

Query:  PDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
           + +G   HG+ +++ L  D  V    +SLY     +  V +LF  +  K+L SWN+VI+   Q G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L+ + F+  +L+DMY K G I  +  VF  +K    ASWN++I GYG+ G    A+  + EM   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAAC

Query:  AHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMS
         H GL+ EG +Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KLF         +VL+S
Subjt:  AHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMS

Query:  NLYAASRRWNDVARIRKMMREMG---EDGCSGVSL
        NLYA   +W DV ++R+ M EM    + GCS + L
Subjt:  NLYAASRRWNDVARIRKMMREMG---EDGCSGVSL

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.8e-10133.7Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF
        E  Q+     K G  Q  +  T  + L  + G V  A R+F+    K  V ++ ++ G+ +      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF

Query:  VQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGC
          GK IHGL +K+G  LD      L +MY KC  +   + +F  + E+++VSWNT++  + QNG+   A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGC

Query:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPD
              IH YA + G    V++ T+LV  Y KCG ++ A  ++   L++N+V+  ++I  Y +  +    + ++  +    +K   V+++G +       
Subjt:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPD

Query:  HIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
         +  G   H   V+ GL  +  V +  IS+Y K   +D   S+F ++  +TL SWN++I   AQ GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAH
           H  + +H  ++R+ LD   FV TALVDMY KCG I  A  +F  M    + +WN++I GYG  GF   AL  + EM +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAH

Query:  GGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLY
         GLVE G K F +MK+ + I     H  +MV LLGRAG   EA  FI  M   P   V+GA+L AC IH+ V   E  A++LF  N  +GG+ VL++N+Y
Subjt:  GGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLY

Query:  AASRRWNDVARIRKMMREMG---EDGCSGVSL
         A+  W  V ++R  M   G     GCS V +
Subjt:  AASRRWNDVARIRKMMREMG---EDGCSGVSL

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic7.7e-10033.06Show/hide
Query:  VSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLD
        +  AFL +  + G +  A  +F    E+++ SWN L+ GY + GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   ++ G +LD
Subjt:  VSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLD

Query:  SQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTG------CIHCYATKIGLVE
          V NAL++MY KC D++  +LLF  +  ++++SWN MI  + +NG+  E + +F  M   SV+ + +T+ S++SA    G       IH Y    G   
Subjt:  SQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTG------CIHCYATKIGLVE

Query:  NVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y ++    +K D + +  ++        +  G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLI

Query:  IDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VA+  I++YSK   ID    +F  I +K + SW S+I+      R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  DLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEF
         L+ F+  AL+DMYV+CGR++ A + F S K   + SWN L++GY   G  +  +  +  MV+  ++P++ITF  +L  C+   +V +G  YF  M +++
Subjt:  DLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEF

Query:  GIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M   PD AVWGALL+AC IH ++ LGE  A+ +F  + ++ G+++L+ NLYA   +W +VA++R+MM+E
Subjt:  GIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

Q9SJ73 Pentatricopeptide repeat-containing protein At2g048602.9e-20051.29Show/hide
Query:  PATL--SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDL
        P TL   L+ FHS  K  + G+  + P+ +FR+LLR  + PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+L
Subjt:  PATL--SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDL

Query:  HSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVS
        + K G V +AQ LFD+ PE+D V WNALI GY+R GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+S+HG+  K+GL+LDSQVKNAL+S
Subjt:  HSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVS

Query:  MYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSLVCSYVKC
         Y KCA+L   ++LF E+ +K+ VSWNTMIGA+ Q+GL  EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y +C
Subjt:  MYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK
        G +  AE +Y S  + ++V LT+I+S YAEKGD    V  +S  + L MK+DAVA+VGI+ G     HI IG++ HGY +KSGL    LV +G I++YSK
Subjt:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK

Query:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMY
        FD+++ V  LF+++ +  L SWNSVIS C Q+GR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN + E FV TAL+DMY
Subjt:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMY

Query:  VKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG
         KCG    AE+VFKS+K PC A+WNS+ISGY L G  + AL CY EM EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVG
Subjt:  VKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGVS
        LLGRA LF EA+  I  M+  PDSAVWGALLSAC IH+E+++GE VA+K+F  + +NGG +VLMSNLYA    W+DV R+R MM++ G DG  GVS
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGVS

Q9STE1 Pentatricopeptide repeat-containing protein At4g213005.2e-10430.47Show/hide
Query:  SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGF
        S+  ++S    +V        L  + ++L   V P+ STF  L+KA V   +       F S+   +            G D   +V+++ +  + + G 
Subjt:  SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGF

Query:  VKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCA
        +    +LFD   +KD V WN +++GY +CG      K F  MR     P   T   ++  C ++ L   G  +HGL + +G+D +  +KN+L+SMY KC 
Subjt:  VKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCA

Query:  DLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANAN------TGCIHCYATKIGLVENVSVVTSLVCSYVKC
          +    LF  ++  + V+WN MI  + Q+GL  E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y KC
Subjt:  DLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANAN------TGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK
          + +A+ I+      ++V  TA+IS Y   G     + ++  +  + +  + + +V I+        + +G   HG+ +K G    C +    I +Y+K
Subjt:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK

Query:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYV
           ++  + +F+ + K+ + SWNS+I+ CAQ+     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + L+DMY 
Subjt:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYV

Query:  KCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEK-GIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG
        KCG +  A NVFK+MK   + SWNS+I+  G  G    +L  + EMVEK GI+P++ITF  I+++C H G V+EG ++F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEK-GIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL   +  N G++VL+SN +A +R W  V ++R +M+E
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-10233.7Show/hide
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF
        E  Q+     K G  Q  +  T  + L  + G V  A R+F+    K  V ++ ++ G+ +      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF

Query:  VQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGC
          GK IHGL +K+G  LD      L +MY KC  +   + +F  + E+++VSWNT++  + QNG+   A+ + K M EE++  + +T+VS+L A +    
Subjt:  VQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGC

Query:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPD
              IH YA + G    V++ T+LV  Y KCG ++ A  ++   L++N+V+  ++I  Y +  +    + ++  +    +K   V+++G +       
Subjt:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPD

Query:  HIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
         +  G   H   V+ GL  +  V +  IS+Y K   +D   S+F ++  +TL SWN++I   AQ GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAH
           H  + +H  ++R+ LD   FV TALVDMY KCG I  A  +F  M    + +WN++I GYG  GF   AL  + EM +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAH

Query:  GGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLY
         GLVE G K F +MK+ + I     H  +MV LLGRAG   EA  FI  M   P   V+GA+L AC IH+ V   E  A++LF  N  +GG+ VL++N+Y
Subjt:  GGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLY

Query:  AASRRWNDVARIRKMMREMG---EDGCSGVSL
         A+  W  V ++R  M   G     GCS V +
Subjt:  AASRRWNDVARIRKMMREMG---EDGCSGVSL

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.5e-10133.06Show/hide
Query:  VSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLD
        +  AFL +  + G +  A  +F    E+++ SWN L+ GY + GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   ++ G +LD
Subjt:  VSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLD

Query:  SQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTG------CIHCYATKIGLVE
          V NAL++MY KC D++  +LLF  +  ++++SWN MI  + +NG+  E + +F  M   SV+ + +T+ S++SA    G       IH Y    G   
Subjt:  SQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTG------CIHCYATKIGLVE

Query:  NVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLI
        ++SV  SL   Y+  G  + AE ++    +K++V+ T +IS Y         +  Y ++    +K D + +  ++        +  G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLI

Query:  IDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +VA+  I++YSK   ID    +F  I +K + SW S+I+      R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  DLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEF
         L+ F+  AL+DMYV+CGR++ A + F S K   + SWN L++GY   G  +  +  +  MV+  ++P++ITF  +L  C+   +V +G  YF  M +++
Subjt:  DLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEF

Query:  GIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M   PD AVWGALL+AC IH ++ LGE  A+ +F  + ++ G+++L+ NLYA   +W +VA++R+MM+E
Subjt:  GIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein7.6e-10334.17Show/hide
Query:  IKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS
        +K G  + ++V  A +  +   GFV  A +LFD  PE+++VSWN++I  ++  G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLHSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS

Query:  IHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLE--ESVNANSVTMVSIL------SANAN
        +HG  +K  LD +  + NAL+ MY KC  +   +++F     KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   +
Subjt:  IHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLE--ESVNANSVTMVSIL------SANAN

Query:  TGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKL-----DAVAMVGIIQGFTY
           +HCY+ K   V N  V  + V SY KCG +  A+ ++     K + +  A+I  +A+  D     RL S+  HL MK+     D+  +  ++   + 
Subjt:  TGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKL-----DAVAMVGIIQGFTY

Query:  PDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
           + +G   HG+ +++ L  D  V    +SLY     +  V +LF  +  K+L SWN+VI+   Q G    A+ +F QM L G     I++  +  AC 
Subjt:  PDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L+ + F+  +L+DMY K G I  +  VF  +K    ASWN++I GYG+ G    A+  + EM   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAAC

Query:  AHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMS
         H GL+ EG +Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KLF         +VL+S
Subjt:  AHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMS

Query:  NLYAASRRWNDVARIRKMMREMG---EDGCSGVSL
        NLYA   +W DV ++R+ M EM    + GCS + L
Subjt:  NLYAASRRWNDVARIRKMMREMG---EDGCSGVSL

AT2G04860.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-20151.29Show/hide
Query:  PATL--SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDL
        P TL   L+ FHS  K  + G+  + P+ +FR+LLR  + PN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ L+L
Subjt:  PATL--SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDL

Query:  HSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVS
        + K G V +AQ LFD+ PE+D V WNALI GY+R GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+S+HG+  K+GL+LDSQVKNAL+S
Subjt:  HSKLGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVS

Query:  MYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSLVCSYVKC
         Y KCA+L   ++LF E+ +K+ VSWNTMIGA+ Q+GL  EA+ VFK M E++V  + VT++++LSA+ +   +HC   K G+V ++SVVTSLVC+Y +C
Subjt:  MYGKCADLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK
        G +  AE +Y S  + ++V LT+I+S YAEKGD    V  +S  + L MK+DAVA+VGI+ G     HI IG++ HGY +KSGL    LV +G I++YSK
Subjt:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK

Query:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMY
        FD+++ V  LF+++ +  L SWNSVIS C Q+GR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN + E FV TAL+DMY
Subjt:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMY

Query:  VKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG
         KCG    AE+VFKS+K PC A+WNS+ISGY L G  + AL CY EM EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVG
Subjt:  VKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGVS
        LLGRA LF EA+  I  M+  PDSAVWGALLSAC IH+E+++GE VA+K+F  + +NGG +VLMSNLYA    W+DV R+R MM++ G DG  GVS
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGVS

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-10530.47Show/hide
Query:  SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGF
        S+  ++S    +V        L  + ++L   V P+ STF  L+KA V   +       F S+   +            G D   +V+++ +  + + G 
Subjt:  SLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLHSKLGF

Query:  VKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCA
        +    +LFD   +KD V WN +++GY +CG      K F  MR     P   T   ++  C ++ L   G  +HGL + +G+D +  +KN+L+SMY KC 
Subjt:  VKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCA

Query:  DLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANAN------TGCIHCYATKIGLVENVSVVTSLVCSYVKC
          +    LF  ++  + V+WN MI  + Q+GL  E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y KC
Subjt:  DLEGVKLLFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANAN------TGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK
          + +A+ I+      ++V  TA+IS Y   G     + ++  +  + +  + + +V I+        + +G   HG+ +K G    C +    I +Y+K
Subjt:  GYIKLAELIYMSKLKKNLVALTAIISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSK

Query:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYV
           ++  + +F+ + K+ + SWNS+I+ CAQ+     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L  + +  + L+DMY 
Subjt:  FDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYV

Query:  KCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEK-GIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG
        KCG +  A NVFK+MK   + SWNS+I+  G  G    +L  + EMVEK GI+P++ITF  I+++C H G V+EG ++F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCYTEMVEK-GIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL   +  N G++VL+SN +A +R W  V ++R +M+E
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASRRWNDVARIRKMMRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATTTACATCTTCGGTCGGTCACCCAGCAACTCTCTCCCTAACTACCTTCCATTCTGCATTCAAATTTTACGTCGAAGGAAAAAGTTTTACTCCCCCCTTGTTGCT
TTTCCGTGAGCTCCTAAGACATCGGGTTAAACCTAATGATTCTACCTTCTCCTTACTCATCAAAGCCTTCGTTGTATCGTCTTCTTCTTCTTCTTTTGCACCATCGTTCT
GTTCTGAGAATGAAAAAGCGGAGGCTAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCATTTCTCGATTTGCACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCAACGTCTGTTTGATGATTTTCCTGAAAAAGATGTTGTATCGTGGAATGCGTTGATTTCTGGGTACACACGATGTGGATATAGCCATGA
CGCGTTTAAGCTATTTGTGGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATGCCTTCCTGTGGTACCCAACAATTATTTGTCCAAGGTA
AATCCATCCATGGATTAGGTCTTAAGGCTGGCCTTGATTTGGATTCCCAAGTGAAAAATGCTCTTGTATCGATGTATGGTAAATGTGCAGATTTAGAAGGGGTGAAACTC
TTATTTGGAGAGATTACTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGCTCTTTTCGGAGGCAATGATTGTTTTCAAGCAAATGCT
TGAAGAAAGTGTCAATGCTAACTCGGTGACTATGGTGAGTATCTTGTCTGCAAATGCAAATACAGGATGTATTCATTGTTATGCTACCAAAATTGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCCCTAGTTTGCTCCTACGTAAAATGTGGATATATAAAACTAGCGGAACTGATTTATATGTCAAAACTCAAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTCACTATGCTGAGAAAGGTGACACGGGATCTGTGGTAAGGCTATATTCCATTGTACAGCATTTAGATATGAAATTAGATGCTGTTGCAATGGTTGGCATAAT
CCAAGGTTTTACATATCCTGATCACATTGGCATTGGACTTGCTTTCCACGGTTATGGTGTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCTCATGGCTTCATAAGCC
TGTATTCAAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATTCACAAAAAGACACTGAGCAGCTGGAACTCTGTGATATCTAGCTGTGCACAGGCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACATTGTCAGGTTATGGGCCAGATTCAATTACACTAGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTGGAGGGTTTTGTTGGGACTGCTCTTGTAGACATGTACGTCAAGTGTGGAAGAATAGACT
TTGCTGAAAATGTGTTTAAGAGCATGAAAGGGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTCACAATCATGCTCTCCTCTGTTAC
ACCGAAATGGTGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTGCTCATGGAGGACTTGTTGAAGAAGGTAGAAAATACTTCAAAAT
CATGAAGAAAGAATTTGGTATCGTGCCCAAATCACAGCATTGTGCATCCATGGTTGGCCTTCTTGGTAGGGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACA
TGGAAACCAATCCAGATTCTGCCGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGTGAATCTGTGGCCAAAAAGTTATTTTTCTCTAAC
TGTAGAAATGGGGGGTTTTTTGTGTTGATGTCTAATCTTTATGCAGCATCAAGAAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGG
TTGTTCAGGCGTTAGCCTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATTTACATCTTCGGTCGGTCACCCAGCAACTCTCTCCCTAACTACCTTCCATTCTGCATTCAAATTTTACGTCGAAGGAAAAAGTTTTACTCCCCCCTTGTTGCT
TTTCCGTGAGCTCCTAAGACATCGGGTTAAACCTAATGATTCTACCTTCTCCTTACTCATCAAAGCCTTCGTTGTATCGTCTTCTTCTTCTTCTTTTGCACCATCGTTCT
GTTCTGAGAATGAAAAAGCGGAGGCTAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCATTTCTCGATTTGCACTCAAAA
TTGGGTTTTGTTAAAGCTGCTCAACGTCTGTTTGATGATTTTCCTGAAAAAGATGTTGTATCGTGGAATGCGTTGATTTCTGGGTACACACGATGTGGATATAGCCATGA
CGCGTTTAAGCTATTTGTGGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATGCCTTCCTGTGGTACCCAACAATTATTTGTCCAAGGTA
AATCCATCCATGGATTAGGTCTTAAGGCTGGCCTTGATTTGGATTCCCAAGTGAAAAATGCTCTTGTATCGATGTATGGTAAATGTGCAGATTTAGAAGGGGTGAAACTC
TTATTTGGAGAGATTACTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGCTCTTTTCGGAGGCAATGATTGTTTTCAAGCAAATGCT
TGAAGAAAGTGTCAATGCTAACTCGGTGACTATGGTGAGTATCTTGTCTGCAAATGCAAATACAGGATGTATTCATTGTTATGCTACCAAAATTGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCCCTAGTTTGCTCCTACGTAAAATGTGGATATATAAAACTAGCGGAACTGATTTATATGTCAAAACTCAAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTCACTATGCTGAGAAAGGTGACACGGGATCTGTGGTAAGGCTATATTCCATTGTACAGCATTTAGATATGAAATTAGATGCTGTTGCAATGGTTGGCATAAT
CCAAGGTTTTACATATCCTGATCACATTGGCATTGGACTTGCTTTCCACGGTTATGGTGTAAAGAGTGGGCTAATTATTGATTGTTTGGTTGCTCATGGCTTCATAAGCC
TGTATTCAAAGTTCGATAATATTGATGCAGTGTTTTCTTTATTTCAAGAGATTCACAAAAAGACACTGAGCAGCTGGAACTCTGTGATATCTAGCTGTGCACAGGCAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACATTGTCAGGTTATGGGCCAGATTCAATTACACTAGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTGGAGGGTTTTGTTGGGACTGCTCTTGTAGACATGTACGTCAAGTGTGGAAGAATAGACT
TTGCTGAAAATGTGTTTAAGAGCATGAAAGGGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTCACAATCATGCTCTCCTCTGTTAC
ACCGAAATGGTGGAGAAGGGGATAAAACCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTGCTCATGGAGGACTTGTTGAAGAAGGTAGAAAATACTTCAAAAT
CATGAAGAAAGAATTTGGTATCGTGCCCAAATCACAGCATTGTGCATCCATGGTTGGCCTTCTTGGTAGGGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACA
TGGAAACCAATCCAGATTCTGCCGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCAGGAAGTTAAGCTTGGTGAATCTGTGGCCAAAAAGTTATTTTTCTCTAAC
TGTAGAAATGGGGGGTTTTTTGTGTTGATGTCTAATCTTTATGCAGCATCAAGAAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGG
TTGTTCAGGCGTTAGCCTTATGTAA
Protein sequenceShow/hide protein sequence
MQFTSSVGHPATLSLTTFHSAFKFYVEGKSFTPPLLLFRELLRHRVKPNDSTFSLLIKAFVVSSSSSSFAPSFCSENEKAEANQLQTHFIKWGFDQFLYVSTAFLDLHSK
LGFVKAAQRLFDDFPEKDVVSWNALISGYTRCGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGLKAGLDLDSQVKNALVSMYGKCADLEGVKL
LFGEITEKNVVSWNTMIGAFGQNGLFSEAMIVFKQMLEESVNANSVTMVSILSANANTGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIKLAELIYMSKLKKNLVALTA
IISHYAEKGDTGSVVRLYSIVQHLDMKLDAVAMVGIIQGFTYPDHIGIGLAFHGYGVKSGLIIDCLVAHGFISLYSKFDNIDAVFSLFQEIHKKTLSSWNSVISSCAQAG
RSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDLEGFVGTALVDMYVKCGRIDFAENVFKSMKGPCLASWNSLISGYGLFGFHNHALLCY
TEMVEKGIKPNKITFSGILAACAHGGLVEEGRKYFKIMKKEFGIVPKSQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHQEVKLGESVAKKLFFSN
CRNGGFFVLMSNLYAASRRWNDVARIRKMMREMGEDGCSGVSLM