CuGenDBv2

PI0028676 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0028676
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr04:3183296..3185504
RNA-Seq Expression: PI0028676
Synteny: PI0028676
Gene Ontology terms:
    GO:0009451 - RNA modification (biological process)
    GO:0005739 - mitochondrion (cellular component)
    GO:0003723 - RNA binding (molecular function)
    GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
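
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses such a string and reports the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr04:3183296..3185504")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2209 bp on chr04.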


Homology
GenBank top hits
XP_004134990.1 pentatricopeptide repeat-containing protein At2g04860 [Cucumis sativus] (e value: 0.0e+00; % identity: 94.32)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHPAT SLTTFHSAFKFYVEGK FTPPLLLFR+LLR+RVKPND TFSLLIKAFVVSSS+SSFAPSFCSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+RLFDDFPEKDVVSWNALISGYTR G SHDAFKLFVEMRRR FDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKNALVSMYGKCADL+GVKLLFGEI+EK+VVSWNTMIGAFGQNG F EAMLVFKQMLEESVNANSVTMVSILSANA+ GCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYVKCGYIE+AELIYMSKL+KNLVALTAIIS YAEKGDM SVVRLYS+VQHLDMKLDAVAMVGIIQGFTYP HIGIGLAFHGYGVKSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLFQEMHKKTLSSWNSVISSCAQ GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        ALVDMYVKCGR+DFAENVFK MKEPCLASWNSLISGYGLFGFHNHALLCYT+MMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKK+FGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NMETNPDSAVWGALLSACCIH EVKLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVARIRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

XP_016899294.1 PREDICTED: pentatricopeptide repeat-containing protein At2g04860 [Cucumis melo] (e value: 0.0e+00; % identity: 93.98)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPAT SLTTFHSAFKFYVEGKNFTPPLLLFRQLLR++V+PND TFSLLIKAFVVSSS      SFCSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFDDFPEKDVVSWNALISGYTR GYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESV+ANSVTMVSILSANA+AGCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYVKCGYIEIAELIYMSKLQKNLVALTAIIS YAEKGDM SVVRLYSLVQHLDMKLDAVAMVGIIQGFTYP H GIGLAFHGYGVKSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLFQEMHKKTLSSWNSVISS AQ GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++D +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        ALVDMYVKCGRIDFAENVFK MKEPCLASWNSLISGYGLFGFHN ALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFK MKKEFGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIH E+KLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGIHTCENFI
        VSLME  HT ENFI
Subjt:  VSLMEGIHTCENFI

XP_022977696.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima] (e value: 0.0e+00; % identity: 88.4)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+ SL+TFHSAFK YVEGK  TPPLL+FRQLLR RVKPND TFSLLIKAFVVSSSSSSFAPS CSENA AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFDD PEKDVVSWNALISGY+RSG++HD F+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF+EAMLVFKQMLEES+N +SVTMVSILSANA+   IHCYATK GL+ENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        +CSYVKCG I IAE IYMSKLQKNLVALTAIIS YAEKGDM +VV+LYS VQHL+MKLDAVAMVGIIQG TYP H GIGL+FHGYG+KSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYS+FD+IDAV+SLFQEMH+KTLSSWNSVISSCAQ GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        AL+DMYVKCGR+DFAE VFK MKEPCLASWNS+ISGYGLFGF NH  LCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGR YF+IMKKE GIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGA LSACCIH EVKLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGI
        VSLME I
Subjt:  VSLMEGI

XP_023543683.1 pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 89.39)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+ S +TFHSAFK YVEGK  TPPLL+FRQLLRYRVKPND TFSLLIKAFVVSSSSSSFAPS CSENA+ EANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFDD PEKDVVSWNALISGY+RSGY+HDAF+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF+EAMLVFKQMLE S+N NSVTMVSILSANA+   IHCYATK GL+ENVSVV SL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        +CSYVKCG I+IAELIYMSKLQKNLVALTAIIS YAEKGDM SVV+LYS VQHL+MKLDAVAMVGIIQG TYP H GIGLAFHGYG+KSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFD+IDAV++LFQEMH+KTLSSWNSVISSCAQ GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        AL+DMYVKCGR+DFAENVFK MKEPCLASWNSLISGYGLFGF NHA LCYT MMEKGIKPNKITFSGILAACTHGGLVEEGR YF+IMKKE GIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGALLSACCIH EVKLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGI
        VSLME I
Subjt:  VSLMEGI

XP_038882792.1 pentatricopeptide repeat-containing protein At2g04860 [Benincasa hispida] (e value: 0.0e+00; % identity: 90.66)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH AT SL+TFHSAFK YVEGKNFTPPLLLFRQLLRY +KPND TFSLLIKAFVVSSSSSSFAPS CSENA+AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+ LFD+FPEKDVVSWNALISGY+RSGYSHDAFKLFVEMRRRGFDPCQRTLVSL+PSCGTQQLFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN L SMYGKCADLE V+LLFGE  EKNVVSWNTMIGAF QNGFFLEAMLVFKQMLEE VNANSVTMVSILSANA+ GCIHCYATK GLVEN+SVV SL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYV CG I+IAELIYMSKLQKNLVALTAIISSYAEKGDM SVV+LYS +QHLDMKLDAVAMVGIIQG TYP H GIGLAFHGYG+KSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLF EMH+KTLSSWNSVISSCAQ GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLD + FVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        AL+DMYVKCGRID AE VFK MK+PCLASWNSLISGYGLFGF NHAL CYTKMMEKGIKPNKITFSG+LAACTHGGLVEEGR YFKIMKKEFGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNME NPDSAVWGALLSACCIH EVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVA++RKMMREMG+DG SG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGI
        VSLME I
Subjt:  VSLMEGI
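
In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches with a space. The sketch below counts these per block; summing over all blocks of a hit and dividing identical columns by total columns gives a percent identity comparable to the reported figure. The function name `block_identity` is hypothetical, and the code assumes the `Query:` prefix and the matching leading indentation have already been removed from both lines.

```python
def block_identity(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment block.

    query   -- the aligned query segment (may contain '-' gap characters)
    midline -- the BLAST match line for the same columns
    """
    # Scraped pages often lose trailing spaces, so pad to the query width.
    midline = midline.ljust(len(query))
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identical + sum(1 for m in midline[:len(query)] if m == "+")
    return identical, positives, len(query)


# Final block of the Benincasa hispida hit and of the Cucumis sativus hit.
for query, midline in [("VSLMEGI", "VSLME I"), ("VSLM", "VSL+")]:
    ident, pos, cols = block_identity(query, midline)
    print(f"{query}: {ident}/{cols} identical, {pos}/{cols} positive")
```

A single short block is only an illustration. The percent identity listed for each hit refers to the full alignment, so the counts need to be accumulated over every block before dividing.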

TrEMBL top hits
A0A0A0KMV8 Uncharacterized protein (e value: 0.0e+00; % identity: 94.32)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGHPAT SLTTFHSAFKFYVEGK FTPPLLLFR+LLR+RVKPND TFSLLIKAFVVSSS+SSFAPSFCSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAA+RLFDDFPEKDVVSWNALISGYTR G SHDAFKLFVEMRRR FDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKNALVSMYGKCADL+GVKLLFGEI+EK+VVSWNTMIGAFGQNG F EAMLVFKQMLEESVNANSVTMVSILSANA+ GCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYVKCGYIE+AELIYMSKL+KNLVALTAIIS YAEKGDM SVVRLYS+VQHLDMKLDAVAMVGIIQGFTYP HIGIGLAFHGYGVKSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLFQEMHKKTLSSWNSVISSCAQ GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        ALVDMYVKCGR+DFAENVFK MKEPCLASWNSLISGYGLFGFHNHALLCYT+MMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKK+FGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NMETNPDSAVWGALLSACCIH EVKLGESVAKKLFFSNCRNGGFFVLMSNLYAAS RWNDVARIRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

A0A1S4DTH7 pentatricopeptide repeat-containing protein At2g04860 (e value: 0.0e+00; % identity: 93.98)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPAT SLTTFHSAFKFYVEGKNFTPPLLLFRQLLR++V+PND TFSLLIKAFVVSSS      SFCSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFDDFPEKDVVSWNALISGYTR GYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESV+ANSVTMVSILSANA+AGCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYVKCGYIEIAELIYMSKLQKNLVALTAIIS YAEKGDM SVVRLYSLVQHLDMKLDAVAMVGIIQGFTYP H GIGLAFHGYGVKSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLFQEMHKKTLSSWNSVISS AQ GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++D +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        ALVDMYVKCGRIDFAENVFK MKEPCLASWNSLISGYGLFGFHN ALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFK MKKEFGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIH E+KLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGIHTCENFI
        VSLME  HT ENFI
Subjt:  VSLMEGIHTCENFI

A0A5D3CM04 Pentatricopeptide repeat-containing protein (e value: 0.0e+00; % identity: 93.98)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFT SVGHPAT SLTTFHSAFKFYVEGKNFTPPLLLFRQLLR++V+PND TFSLLIKAFVVSSS      SFCSEN +AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYS+LGFVKAARRLFDDFPEKDVVSWNALISGYTR GYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQ+LFVQGKSIHGLGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN LVSMYGKCADLEGVKLLFGEI EKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESV+ANSVTMVSILSANA+AGCIHCYATKIGLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        VCSYVKCGYIEIAELIYMSKLQKNLVALTAIIS YAEKGDM SVVRLYSLVQHLDMKLDAVAMVGIIQGFTYP H GIGLAFHGYGVKSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYSKFDNIDAV+SLFQEMHKKTLSSWNSVISS AQ GRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRN++D +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        ALVDMYVKCGRIDFAENVFK MKEPCLASWNSLISGYGLFGFHN ALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFK MKKEFGIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIH E+KLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVA+IRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGIHTCENFI
        VSLME  HT ENFI
Subjt:  VSLMEGIHTCENFI

A0A6J1GEF9 pentatricopeptide repeat-containing protein At2g04860 isoform X1 (e value: 0.0e+00; % identity: 88.26)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+ SL+TFHSAFK YVEGK  TPPLL+FRQLLR RVKPND TFSLLIKAFVVSSSSSSFAP  CSENA+AEANQLQ HFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFDD PEKDVVSWNALISGY+RSGY+HDAF+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LF QGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF+EAMLVFKQMLEE ++ NSVTMVSILSANA+   IHCYATK GLVENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        +CSYV+CG I+IAELIYMSKLQKNLVALTAIIS YAEKGDM SVV+LYS VQHL+MKLDAVAMVGIIQG TYP H GIGLAFHGYG+KSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYS+FD+IDAV+SLFQEM +KTLSSWNSVISSCAQ GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEI+H YILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        AL+DMYVKCGR+DFAE VFK MKEPCLASWNSLISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLVEEGR YF+IMKKE GIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGALL+ACCIH EVKLGESVAK+L FSN RNGGFFVLMSNLYAASGRWNDVAR+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGI
        VSLME I
Subjt:  VSLMEGI

A0A6J1IKP6 pentatricopeptide repeat-containing protein At2g04860 isoform X1 (e value: 0.0e+00; % identity: 88.4)
Query:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV
        MQFTSSVGH A+ SL+TFHSAFK YVEGK  TPPLL+FRQLLR RVKPND TFSLLIKAFVVSSSSSSFAPS CSENA AEANQLQTHFIKWGFDQFLYV
Subjt:  MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYV

Query:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ
        STAFLDLYSKLGFVKAARRLFDD PEKDVVSWNALISGY+RSG++HD F+LFVEMRRRGF+PCQRTLVSL+PSCGTQ LFVQGK IH LGVKAGLDLDSQ
Subjt:  STAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQ

Query:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL
        VKN+L SMYGKCADLEGV+LLFGEI EKNVVSWNTMIGAFGQNGFF+EAMLVFKQMLEES+N +SVTMVSILSANA+   IHCYATK GL+ENVSVVTSL
Subjt:  VKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSL

Query:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG
        +CSYVKCG I IAE IYMSKLQKNLVALTAIIS YAEKGDM +VV+LYS VQHL+MKLDAVAMVGIIQG TYP H GIGL+FHGYG+KSGLIIDCLV NG
Subjt:  VCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNG

Query:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT
         ISMYS+FD+IDAV+SLFQEMH+KTLSSWNSVISSCAQ GRSIDAMALFSQM LSGYGPDSITLASLLSACCQNGNLHFGEILH YILRNNLD +GFVGT
Subjt:  LISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGT

Query:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH
        AL+DMYVKCGR+DFAE VFK MKEPCLASWNS+ISGYGLFGF NH  LCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGR YF+IMKKE GIVPESQH
Subjt:  ALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME NPDSAVWGA LSACCIH EVKLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDVA++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSG

Query:  VSLMEGI
        VSLME I
Subjt:  VSLMEGI

SwissProt top hits
Q0WN60 Pentatricopeptide repeat-containing protein At1g18485 (e value: 1.6e-105; % identity: 34.65)
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  ++ +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS

Query:  IHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLE--ESVNANSVTMVSIL------SANAS
        +HG  VK  LD +  + NAL+ MY KC  +   +++F   + KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   S
Subjt:  IHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLE--ESVNANSVTMVSIL------SANAS

Query:  AGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKL-----DAVAMVGIIQGFTY
           +HCY+ K   V N  V  + V SY KCG +  A+ ++     K + +  A+I  +A+  D        SL  HL MK+     D+  +  ++   + 
Subjt:  AGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKL-----DAVAMVGIIQGFTY

Query:  PYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
           + +G   HG+ +++ L  D  V   ++S+Y     +  V +LF  M  K+L SWN+VI+   Q G    A+ +F QM L G     I++  +  AC 
Subjt:  PYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L+   F+  +L+DMY K G I  +  VF G+KE   ASWN++I GYG+ G    A+  + +M   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAAC

Query:  THGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMS
         H GL+ EG +Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IH  +++GE VA KLF         +VL+S
Subjt:  THGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMS

Query:  NLYAASGRWNDVARIRKMMREMG---EDGCSGVSL
        NLYA  G+W DV ++R+ M EM    + GCS + L
Subjt:  NLYAASGRWNDVARIRKMMREMG---EDGCSGVSL

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 1.6e-105; % identity: 34.48)
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+ +      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF

Query:  VQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGC
          GK IHGL VK+G  LD      L +MY KC  +   + +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A ++   
Subjt:  VQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGC

Query:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPY
              IH YA + G    V++ T+LV  Y KCG +E A  ++   L++N+V+  ++I +Y +  +    + ++  +    +K   V+++G +       
Subjt:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPY

Query:  HIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
         +  G   H   V+ GL  +  VVN LISMY K   +D   S+F ++  +TL SWN++I   AQ GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTH
           H  + +H  ++R+ LD   FV TALVDMY KCG I  A  +F  M E  + +WN++I GYG  GF   AL  + +M +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTH

Query:  GGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLY
         GLVE G K F +MK+ + I     H  +MV LLGRAG   EA  FI  M   P   V+GA+L AC IH  V   E  A++LF  N  +GG+ VL++N+Y
Subjt:  GGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLY

Query:  AASGRWNDVARIRKMMREMG---EDGCSGVSLMEGIHT
         A+  W  V ++R  M   G     GCS V +   +H+
Subjt:  AASGRWNDVARIRKMMREMG---EDGCSGVSLMEGIHT

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic (e value: 1.4e-101; % identity: 33.12)
Query:  VSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLD
        +  AFL ++ + G +  A  +F    E+++ SWN L+ GY + GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLD

Query:  SQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAG------CIHCYATKIGLVE
          V NAL++MY KC D++  +LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y    G   
Subjt:  SQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAG------CIHCYATKIGLVE

Query:  NVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLI
        ++SV  SL   Y+  G    AE ++    +K++V+ T +IS Y         +  Y ++    +K D + +  ++        +  G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLI

Query:  IDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +V N LI+MYSK   ID    +F  + +K + SW S+I+      R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  DFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEF
            F+  AL+DMYV+CGR++ A + F   K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C+   +V +G  YF  M +++
Subjt:  DFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEF

Query:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M   PD AVWGALL+AC IHH++ LGE  A+ +F  + ++ G+++L+ NLYA  G+W +VA++R+MM+E
Subjt:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE

Query:  MG---EDGCSGVSLMEGIH
         G   + GCS V +   +H
Subjt:  MG---EDGCSGVSLMEGIH

Q9SJ73 Pentatricopeptide repeat-containing protein At2g04860 (e value: 7.8e-201; % identity: 51.67)
Query:  LTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        L+ FHS  K  + G+  + P+ +FR LLR  + PN  T S+ ++A   ++S +SF         + +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCAD
         +A+ LFD+ PE+D V WNALI GY+R+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+S+HG+  K+GL+LDSQVKNAL+S Y KCA+
Subjt:  KAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCAD

Query:  LEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAE
        L   ++LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ S   +HC   K G+V ++SVVTSLVC+Y +CG +  AE
Subjt:  LEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEKGDM   V  +S  + L MK+DAVA+VGI+ G     HI IG++ HGY +KSGL    LVVNGLI+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAV

Query:  YSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRID
          LF+++ +  L SWNSVIS C Q+GR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN + + FV TAL+DMY KCG   
Subjt:  YSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRID

Query:  FAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGL
         AE+VFK +K PC A+WNS+ISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVGLLGRA L
Subjt:  FAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVS
        F EA+  I  M+  PDSAVWGALLSAC IH E+++GE VA+K+F  + +NGG +VLMSNLYA    W+DV R+R MM++ G DG  GVS
Subjt:  FEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVS

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300    3.4e-103    30.33
Query:  SLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+  ++S    +V        L  + ++L + V P+  TF  L+KA V   +       F S+   +            G D   +V+++ +  Y + G 
Subjt:  SLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCA
        +    +LFD   +KD V WN +++GY + G      K F  MR     P   T   ++  C ++ L   G  +HGL V +G+D +  +KN+L+SMY KC 
Subjt:  VKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCA

Query:  DLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANAS------AGCIHCYATKIGLVENVSVVTSLVCSYVKC
          +    LF  +S  + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y KC
Subjt:  DLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANAS------AGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSK
          + +A+ I+      ++V  TA+IS Y   G     + ++  +  + +  + + +V I+        + +G   HG+ +K G    C +   +I MY+K
Subjt:  GYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSK

Query:  FDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYV
           ++  Y +F+ + K+ + SWNS+I+ CAQ+     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L    +  + L+DMY 
Subjt:  FDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYV

Query:  KCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVG
        KCG +  A NVFK MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V+EG ++F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H  V+L E  + KL   +  N G++VL+SN +A +  W  V ++R +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVSLME

Arabidopsis top hits    e value    % identity    Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein    1.2e-106    34.48
Query:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF
        E  Q+     K G  Q  +  T  + L+ + G V  A R+F+    K  V ++ ++ G+ +      A + FV MR    +P       L+  CG +   
Subjt:  EANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLF

Query:  VQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGC
          GK IHGL VK+G  LD      L +MY KC  +   + +F  + E+++VSWNT++  + QNG    A+ + K M EE++  + +T+VS+L A ++   
Subjt:  VQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGC

Query:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPY
              IH YA + G    V++ T+LV  Y KCG +E A  ++   L++N+V+  ++I +Y +  +    + ++  +    +K   V+++G +       
Subjt:  ------IHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPY

Query:  HIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN
         +  G   H   V+ GL  +  VVN LISMY K   +D   S+F ++  +TL SWN++I   AQ GR IDA+  FSQM      PD+ T  S+++A  + 
Subjt:  HIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQN

Query:  GNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTH
           H  + +H  ++R+ LD   FV TALVDMY KCG I  A  +F  M E  + +WN++I GYG  GF   AL  + +M +  IKPN +TF  +++AC+H
Subjt:  GNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTH

Query:  GGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLY
         GLVE G K F +MK+ + I     H  +MV LLGRAG   EA  FI  M   P   V+GA+L AC IH  V   E  A++LF  N  +GG+ VL++N+Y
Subjt:  GGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLY

Query:  AASGRWNDVARIRKMMREMG---EDGCSGVSLMEGIHT
         A+  W  V ++R  M   G     GCS V +   +H+
Subjt:  AASGRWNDVARIRKMMREMG---EDGCSGVSLMEGIHT

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.0e-102    33.12
Query:  VSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLD
        +  AFL ++ + G +  A  +F    E+++ SWN L+ GY + GY  +A  L+  M    G  P   T   ++ +CG      +GK +H   V+ G +LD
Subjt:  VSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMR-RRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLD

Query:  SQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAG------CIHCYATKIGLVE
          V NAL++MY KC D++  +LLF  +  ++++SWN MI  + +NG   E + +F  M   SV+ + +T+ S++SA    G       IH Y    G   
Subjt:  SQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAG------CIHCYATKIGLVE

Query:  NVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLI
        ++SV  SL   Y+  G    AE ++    +K++V+ T +IS Y         +  Y ++    +K D + +  ++        +  G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLI

Query:  IDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL
           +V N LI+MYSK   ID    +F  + +K + SW S+I+      R  +A+    QM ++   P++ITL + L+AC + G L  G+ +H ++LR  +
Subjt:  IDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNL

Query:  DFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEF
            F+  AL+DMYV+CGR++ A + F   K+  + SWN L++GY   G  +  +  + +M++  ++P++ITF  +L  C+   +V +G  YF  M +++
Subjt:  DFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEF

Query:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE
        G+ P  +H A +V LLGRAG  +EA  FI+ M   PD AVWGALL+AC IHH++ LGE  A+ +F  + ++ G+++L+ NLYA  G+W +VA++R+MM+E
Subjt:  GIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMRE

Query:  MG---EDGCSGVSLMEGIH
         G   + GCS V +   +H
Subjt:  MG---EDGCSGVSLMEGIH

AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein    1.2e-106    34.65
Query:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS
        +K G  + ++V  A +  Y   GFV  A +LFD  PE+++VSWN++I  ++ +G+S ++F L  EM        F P   TLV+++P C  ++    GK 
Subjt:  IKWGFDQFLYVSTAFLDLYSKLGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRR----GFDPCQRTLVSLMPSCGTQQLFVQGKS

Query:  IHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLE--ESVNANSVTMVSIL------SANAS
        +HG  VK  LD +  + NAL+ MY KC  +   +++F   + KNVVSWNTM+G F   G       V +QML   E V A+ VT+++ +      S   S
Subjt:  IHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLE--ESVNANSVTMVSIL------SANAS

Query:  AGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKL-----DAVAMVGIIQGFTY
           +HCY+ K   V N  V  + V SY KCG +  A+ ++     K + +  A+I  +A+  D        SL  HL MK+     D+  +  ++   + 
Subjt:  AGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKL-----DAVAMVGIIQGFTY

Query:  PYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACC
           + +G   HG+ +++ L  D  V   ++S+Y     +  V +LF  M  K+L SWN+VI+   Q G    A+ +F QM L G     I++  +  AC 
Subjt:  PYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACC

Query:  QNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAAC
           +L  G   H Y L++ L+   F+  +L+DMY K G I  +  VF G+KE   ASWN++I GYG+ G    A+  + +M   G  P+ +TF G+L AC
Subjt:  QNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAAC

Query:  THGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMS
         H GL+ EG +Y   MK  FG+ P  +H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IH  +++GE VA KLF         +VL+S
Subjt:  THGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAI-VFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMS

Query:  NLYAASGRWNDVARIRKMMREMG---EDGCSGVSL
        NLYA  G+W DV ++R+ M EM    + GCS + L
Subjt:  NLYAASGRWNDVARIRKMMREMG---EDGCSGVSL

AT2G04860.1 Tetratricopeptide repeat (TPR)-like superfamily protein    5.6e-202    51.67
Query:  LTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV
        L+ FHS  K  + G+  + P+ +FR LLR  + PN  T S+ ++A   ++S +SF         + +  Q+QTH  K G D+F+YV T+ L+LY K G V
Subjt:  LTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGFV

Query:  KAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCAD
         +A+ LFD+ PE+D V WNALI GY+R+GY  DA+KLF+ M ++GF P   TLV+L+P CG      QG+S+HG+  K+GL+LDSQVKNAL+S Y KCA+
Subjt:  KAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCAD

Query:  LEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAE
        L   ++LF E+ +K+ VSWNTMIGA+ Q+G   EA+ VFK M E++V  + VT++++LSA+ S   +HC   K G+V ++SVVTSLVC+Y +CG +  AE
Subjt:  LEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAE

Query:  LIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAV
         +Y S  Q ++V LT+I+S YAEKGDM   V  +S  + L MK+DAVA+VGI+ G     HI IG++ HGY +KSGL    LVVNGLI+MYSKFD+++ V
Subjt:  LIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAV

Query:  YSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRID
          LF+++ +  L SWNSVIS C Q+GR+  A  +F QM L+ G  PD+IT+ASLL+ C Q   L+ G+ LH Y LRNN + + FV TAL+DMY KCG   
Subjt:  YSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLS-GYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRID

Query:  FAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGL
         AE+VFK +K PC A+WNS+ISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V+EG+  F+ M KEFGI P  QH A MVGLLGRA L
Subjt:  FAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVS
        F EA+  I  M+  PDSAVWGALLSAC IH E+++GE VA+K+F  + +NGG +VLMSNLYA    W+DV R+R MM++ G DG  GVS
Subjt:  FEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVS

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.4e-104    30.33
Query:  SLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF
        S+  ++S    +V        L  + ++L + V P+  TF  L+KA V   +       F S+   +            G D   +V+++ +  Y + G 
Subjt:  SLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSKLGF

Query:  VKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCA
        +    +LFD   +KD V WN +++GY + G      K F  MR     P   T   ++  C ++ L   G  +HGL V +G+D +  +KN+L+SMY KC 
Subjt:  VKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCA

Query:  DLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANAS------AGCIHCYATKIGLVENVSVVTSLVCSYVKC
          +    LF  +S  + V+WN MI  + Q+G   E++  F +M+   V  +++T  S+L + +          IHCY  +  +  ++ + ++L+ +Y KC
Subjt:  DLEGVKLLFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANAS------AGCIHCYATKIGLVENVSVVTSLVCSYVKC

Query:  GYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSK
          + +A+ I+      ++V  TA+IS Y   G     + ++  +  + +  + + +V I+        + +G   HG+ +K G    C +   +I MY+K
Subjt:  GYIEIAELIYMSKLQKNLVALTAIISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSK

Query:  FDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYV
           ++  Y +F+ + K+ + SWNS+I+ CAQ+     A+ +F QM +SG   D +++++ LSAC    +  FG+ +H ++++++L    +  + L+DMY 
Subjt:  FDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTGRSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYV

Query:  KCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVG
        KCG +  A NVFK MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V+EG ++F+ M +++GI P+ +H A +V 
Subjt:  KCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCYTKMMEK-GIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H  V+L E  + KL   +  N G++VL+SN +A +  W  V ++R +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSNCRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVSLME


Sequences
CDS sequence
ATGCAATTTACATCTTCGGTAGGTCACCCAGCAACTCCGTCCCTAACTACCTTCCATTCTGCATTCAAATTTTACGTCGAAGGAAAAAATTTTACTCCCCCCTTGTTGCT
TTTCCGTCAGCTGCTAAGATATCGGGTTAAACCTAATGATTGTACCTTCTCCTTACTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTCGCACCATCATTCT
GTTCTGAGAATGCAAGAGCGGAGGCGAATCAGCTCCAAACCCACTTCATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGATTTGTATTCAAAA
TTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGATTTTCCTGAAAAAGACGTTGTATCGTGGAATGCGTTGATTTCTGGGTACACACGAAGTGGATATAGCCATGA
CGCGTTTAAGCTATTTGTGGAAATGCGCAGAAGGGGGTTCGACCCTTGTCAGAGAACGTTGGTAAGTTTAATGCCTTCCTGTGGTACCCAACAATTATTCGTCCAAGGAA
AATCCATCCATGGATTAGGTGTTAAGGCTGGCCTTGATTTGGATTCCCAAGTGAAAAATGCTCTTGTATCGATGTATGGTAAATGTGCAGATTTAGAAGGGGTGAAACTC
TTATTTGGAGAGATTAGTGAAAAAAACGTAGTTTCTTGGAATACCATGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTGGAGGCAATGCTTGTTTTCAAGCAAATGCT
TGAAGAAAGTGTCAATGCTAACTCGGTGACTATGGTGAGTATCTTGTCTGCAAATGCAAGTGCAGGATGTATTCATTGTTATGCTACCAAAATTGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAAATGTGGATATATAGAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCG
ATTATTTCTAGCTATGCTGAGAAAGGTGACATGGCATCTGTGGTGAGGCTATATTCCCTTGTACAGCATTTAGATATGAAATTAGATGCTGTTGCAATGGTTGGCATAAT
CCAAGGTTTTACATATCCTTATCACATTGGCATTGGACTTGCTTTCCACGGTTATGGTGTCAAGAGTGGGCTAATTATTGATTGTTTGGTTGTTAATGGTCTCATAAGCA
TGTATTCAAAGTTCGATAATATTGATGCAGTGTATTCTTTATTTCAAGAGATGCATAAAAAGACACTGAGCAGCTGGAACTCTGTGATATCTAGCTGTGCACAGACAGGA
AGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACATTGTCAGGTTATGGGCCAGATTCAATTACACTAGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
ACATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTGGACTTCAAGGGTTTTGTTGGGACTGCTCTTGTAGACATGTACGTCAAGTGTGGAAGAATAGACT
TTGCTGAAAATGTGTTTAAGGGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTCACAATCATGCTCTCCTCTGTTAC
ACCAAAATGATGGAGAAGGGGATAAAGCCCAATAAAATCACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTGAAGAAGGTAGAAAATACTTCAAAAT
CATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAGCATTGTGCATCCATGGTTGGCCTGCTTGGTCGGGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACA
TGGAAACCAATCCAGATTCTGCTGTGTGGGGAGCATTGCTCAGTGCTTGTTGCATTCACCACGAAGTTAAGCTTGGTGAATCTGTGGCCAAAAAGTTGTTTTTCTCTAAC
TGTAGAAATGGGGGGTTTTTTGTGTTGATGTCTAATCTTTATGCAGCATCAGGAAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGG
TTGTTCAGGCGTTAGCCTTATGGAAGGGATTCATACTTGTGAGAATTTCATTTGA
mRNA sequence
AAAAAACATGCACATTCCATAAGCTTCCATAGGTGCGGAACCGCCAAAACGGGCGCCCAGTAATATGCAATTTACATCTTCGGTAGGTCACCCAGCAACTCCGTCCCTAA
CTACCTTCCATTCTGCATTCAAATTTTACGTCGAAGGAAAAAATTTTACTCCCCCCTTGTTGCTTTTCCGTCAGCTGCTAAGATATCGGGTTAAACCTAATGATTGTACC
TTCTCCTTACTCATCAAAGCCTTCGTTGTATCGTCTTCATCTTCTTCTTTCGCACCATCATTCTGTTCTGAGAATGCAAGAGCGGAGGCGAATCAGCTCCAAACCCACTT
CATTAAATGGGGATTTGACCAATTTTTGTATGTTAGTACTGCCTTTCTCGATTTGTATTCAAAATTGGGTTTTGTTAAAGCTGCTCGACGTTTGTTTGATGATTTTCCTG
AAAAAGACGTTGTATCGTGGAATGCGTTGATTTCTGGGTACACACGAAGTGGATATAGCCATGACGCGTTTAAGCTATTTGTGGAAATGCGCAGAAGGGGGTTCGACCCT
TGTCAGAGAACGTTGGTAAGTTTAATGCCTTCCTGTGGTACCCAACAATTATTCGTCCAAGGAAAATCCATCCATGGATTAGGTGTTAAGGCTGGCCTTGATTTGGATTC
CCAAGTGAAAAATGCTCTTGTATCGATGTATGGTAAATGTGCAGATTTAGAAGGGGTGAAACTCTTATTTGGAGAGATTAGTGAAAAAAACGTAGTTTCTTGGAATACCA
TGATTGGGGCATTCGGCCAAAATGGGTTCTTTTTGGAGGCAATGCTTGTTTTCAAGCAAATGCTTGAAGAAAGTGTCAATGCTAACTCGGTGACTATGGTGAGTATCTTG
TCTGCAAATGCAAGTGCAGGATGTATTCATTGTTATGCTACCAAAATTGGTCTTGTGGAAAATGTTTCCGTGGTTACCTCTCTAGTTTGCTCCTACGTAAAATGTGGATA
TATAGAAATAGCAGAACTGATTTATATGTCAAAACTCCAGAAAAACTTGGTTGCATTAACTGCGATTATTTCTAGCTATGCTGAGAAAGGTGACATGGCATCTGTGGTGA
GGCTATATTCCCTTGTACAGCATTTAGATATGAAATTAGATGCTGTTGCAATGGTTGGCATAATCCAAGGTTTTACATATCCTTATCACATTGGCATTGGACTTGCTTTC
CACGGTTATGGTGTCAAGAGTGGGCTAATTATTGATTGTTTGGTTGTTAATGGTCTCATAAGCATGTATTCAAAGTTCGATAATATTGATGCAGTGTATTCTTTATTTCA
AGAGATGCATAAAAAGACACTGAGCAGCTGGAACTCTGTGATATCTAGCTGTGCACAGACAGGAAGGTCAATTGATGCCATGGCTTTGTTTTCCCAAATGACATTGTCAG
GTTATGGGCCAGATTCAATTACACTAGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTTACATTTTGGGGAGATACTTCATTGCTATATTCTAAGAAACAATCTG
GACTTCAAGGGTTTTGTTGGGACTGCTCTTGTAGACATGTACGTCAAGTGTGGAAGAATAGACTTTGCTGAAAATGTGTTTAAGGGCATGAAAGAGCCATGTTTAGCTTC
ATGGAACTCGCTGATCTCTGGTTATGGTTTATTTGGGTTTCACAATCATGCTCTCCTCTGTTACACCAAAATGATGGAGAAGGGGATAAAGCCCAATAAAATCACTTTCT
CAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTGAAGAAGGTAGAAAATACTTCAAAATCATGAAGAAAGAATTTGGTATCGTGCCCGAATCACAGCATTGTGCA
TCCATGGTTGGCCTGCTTGGTCGGGCAGGATTATTTGAAGAGGCAATTGTATTTATCAAGAACATGGAAACCAATCCAGATTCTGCTGTGTGGGGAGCATTGCTCAGTGC
TTGTTGCATTCACCACGAAGTTAAGCTTGGTGAATCTGTGGCCAAAAAGTTGTTTTTCTCTAACTGTAGAAATGGGGGGTTTTTTGTGTTGATGTCTAATCTTTATGCAG
CATCAGGAAGGTGGAATGATGTAGCAAGAATCAGAAAGATGATGCGAGAAATGGGAGAAGATGGTTGTTCAGGCGTTAGCCTTATGGAAGGGATTCATACTTGTGAGAAT
TTCATTTGA
Protein sequence
MQFTSSVGHPATPSLTTFHSAFKFYVEGKNFTPPLLLFRQLLRYRVKPNDCTFSLLIKAFVVSSSSSSFAPSFCSENARAEANQLQTHFIKWGFDQFLYVSTAFLDLYSK
LGFVKAARRLFDDFPEKDVVSWNALISGYTRSGYSHDAFKLFVEMRRRGFDPCQRTLVSLMPSCGTQQLFVQGKSIHGLGVKAGLDLDSQVKNALVSMYGKCADLEGVKL
LFGEISEKNVVSWNTMIGAFGQNGFFLEAMLVFKQMLEESVNANSVTMVSILSANASAGCIHCYATKIGLVENVSVVTSLVCSYVKCGYIEIAELIYMSKLQKNLVALTA
IISSYAEKGDMASVVRLYSLVQHLDMKLDAVAMVGIIQGFTYPYHIGIGLAFHGYGVKSGLIIDCLVVNGLISMYSKFDNIDAVYSLFQEMHKKTLSSWNSVISSCAQTG
RSIDAMALFSQMTLSGYGPDSITLASLLSACCQNGNLHFGEILHCYILRNNLDFKGFVGTALVDMYVKCGRIDFAENVFKGMKEPCLASWNSLISGYGLFGFHNHALLCY
TKMMEKGIKPNKITFSGILAACTHGGLVEEGRKYFKIMKKEFGIVPESQHCASMVGLLGRAGLFEEAIVFIKNMETNPDSAVWGALLSACCIHHEVKLGESVAKKLFFSN
CRNGGFFVLMSNLYAASGRWNDVARIRKMMREMGEDGCSGVSLMEGIHTCENFI