CuGenDBv2

Lsi10G006490 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi10G006490
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr10:9058191..9079912
RNA-Seq Expression: Lsi10G006490
Synteny: Lsi10G006490
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0004134 - 4-alpha-glucanotransferase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR003385 - Glycoside hydrolase, family 77
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR017853 - Glycoside hydrolase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0045882.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 0.0e+00 | 91.99
Query:  MPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVF
        MPNRNLVSWSSVVSMYTQL YNEKAL YFL+F+R CDDK NEYILASIIRACVQRDGGEPGSQVHSYV KAGFDEDVYVGTSLVDLYAKHGEIDKARLVF
Subjt:  MPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVF

Query:  DGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTL
        DGLV KT VTWTAIITGYTKSGRSEVSLQLFNLM+ESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVK+GK L
Subjt:  DGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTL

Query:  FDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKR
        FDRM+VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGS+DALQHGRQIHSY IKV LEHDNFV NALIDMYSKCNSLDDAKR
Subjt:  FDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKR

Query:  VFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDAR
        VFDVVTC SVV YNAMIEGYSRQEYLCGALEVF+EMRLK VSPSFLTFVSLLGLSA L+CLQLSK+IHGL IKYG SLDKFTSSALIDVYSKCSCIRDAR
Subjt:  VFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDAR

Query:  HVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEA
        +VFEGTT  DIVVWNALFSGYNLQLKSEEAFKLYSDLQ S+ERPNEFTFAALITAAS LASLQHGQQFHNQVMKMGL  DPFITNALVDMYAKCGSVEEA
Subjt:  HVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEA

Query:  EKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSE
        EK FSSSV KDTACWNSMISMYAQHGK E ALRMFE+M+SNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM+RYGIEPG+EHYASVVTLLGRAG+L+E
Subjt:  EKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSE

Query:  AREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSR
        A EFIEKMTI+PAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLR KMDVNGVVKEPGQSWIE+NGEVH FVSR
Subjt:  AREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSR

Query:  DKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        DKVHD+TDLIYLALDELTMQMKDAGSV DTTILE+ID
Subjt:  DKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

XP_016902172.1 PREDICTED: pentatricopeptide repeat-containing protein At4g39530 [Cucumis melo] | 0.0e+00 | 92.07
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYTQL YNEKAL YFL+F+R CDDK NEYILASIIRACVQRDGGEPGSQVHSYV KAGFDEDVYVGTSLVDLYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        DKARLVFDGLV KT VTWTAIITGYTKSGRSEVSLQLFNLM+ESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VK+GK LFDRM+VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGS+DALQHGRQIHSY IKV LEHDNFV NALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDAKRVFDVVTC SVV YNAMIEGYSRQEYLCGALEVF+EMRLK VSPSFLTFVSLLGLSA L+CLQLSK+IHGL IKYG SLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFEGTT  DIVVWNALFSGYNLQLKSEEAFKLYSDLQ S+ERPNEFTFAALITAAS LASLQHGQQFHNQVMKMGL  DPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEK FSSSV KDTACWNSMISMYAQHGK E ALRMFE+M+SNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM+RYGIEPG+EHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAG+L+EA EFIEKMTI+PAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLR KMDVNGVVKEPGQSWIE+NGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VH FVSRDKVHD+TDLIYLALDELTMQMKDAGSV DTTILE+ID
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

XP_023512716.1 pentatricopeptide repeat-containing protein At4g39530 [Cucurbita pepo subsp. pepo] | 0.0e+00 | 89.92
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYT+LGYNEKAL YFLEFRR  D+ +NEYILAS IRACVQRDGGEPGSQVHSY++KAGFDEDVYVGTSL+DLYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        +KARL+FDGLVMK+AVTWTAIITGYTKSGRSEVSLQLFNLM ESNV+PDKYVLSS+LNACS+LGFLEGGKQIHAYV+RRE KMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDRM++KN+ISWTTMI+GYMQNSYDWEAVEL  EMFR GWK DEY CSSILTSCGSVDALQHGRQIHSYIIKV LEHDNFV+NALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDA++VFD    HSVVSYNAMIEGYSRQEYL  ALE+FREMR+KHVSPSFLTFVSLLG+SA L CLQLSKQIHGL IKYGVSLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFE TTN DIVVWNALFSGYNLQ +SEEAF+LY+DLQFSRERPNEFTFAALITAAS LASLQHGQQFHNQV+KMGLGLD FITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEKTFSSSVWKDT CWNSMISMYAQHGKAE ALRMFE+MMSND+ PNYVTFVSVL+ACSHVGFVEDGLQHFNSM+RYGIEPGMEHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGN+ELAKHA+EMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDV+GVVKEPGQSWIEVNGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VH+FVSRD+VH+++DLIYLALDELT+QMKDAG VLDTTILE  D
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

XP_031737016.1 pentatricopeptide repeat-containing protein At4g39530 isoform X2 [Cucumis sativus] | 0.0e+00 | 92.47
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKAL YFLEF+R C DKLNEYILASIIRACVQRDGGEPGSQVHSYVIK+GF EDVYVGTSLV LYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        DKARLVFDGLV+KT VTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLR ETKMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDR++VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGSVDALQHGRQIHSY+IKV LEHDNFV NALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        +LDDAKRVFDVVTCHSVV YNAMIEGYSRQ YLCGALEVF+EMRLKHVSPSFLTFVSLLGLSA L+CLQLSKQIHGL IKYG SLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFEGTTN DIVVWN+LFSGYNLQLKSEEAFKLYSDLQ SRERPNEFTFAAL TAAS LASL HGQQFHNQVMKMGL  DPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEK FSSSVWKDTACWNSMISMYAQHGK E ALRMFE M+SN+INPNYVTFVSVLSACSHVGFVEDGLQH+NSM+RYGIEPG+EHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRL+EAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VHIFVSRDKVHD+TDLIYLALDELT QMKD G V DTTILE+ID
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

XP_038900776.1 pentatricopeptide repeat-containing protein At4g39530 [Benincasa hispida] | 0.0e+00 | 95.16
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKAL YFLEFRR CDDKLNEYILASIIRACVQRD GEPGSQVHSYVIKAGFDEDVYVGTSLV LYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        DKARLVFDGLVMKTA TWTAII+GYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETK+DVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDRM+VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDE+ACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSA L+ LQLSKQIHGL IKYG SLDKFTSSAL+DVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFEGTTN DIVVWNALFSGYNLQLKSEEAFKLYSDLQ SRERPNEFTFAALITAAS LASLQHGQQFHNQVMK+GLGLDPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAE ALRMFE+MM NDINPNYVTFVSVLSACSHVGFVEDGLQHF+SM+RYGIEPGMEHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRLSEA+EFIEKMTIRPAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIE+NGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        V+IFVSRDKVHD+TDLIYLALDELTM MKDAGS+LDTTILEVID
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LM26 Uncharacterized protein | 0.0e+00 | 92.47
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKAL YFLEF+R C DKLNEYILASIIRACVQRDGGEPGSQVHSYVIK+GF EDVYVGTSLV LYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        DKARLVFDGLV+KT VTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLR ETKMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDR++VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGSVDALQHGRQIHSY+IKV LEHDNFV NALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        +LDDAKRVFDVVTCHSVV YNAMIEGYSRQ YLCGALEVF+EMRLKHVSPSFLTFVSLLGLSA L+CLQLSKQIHGL IKYG SLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFEGTTN DIVVWN+LFSGYNLQLKSEEAFKLYSDLQ SRERPNEFTFAAL TAAS LASL HGQQFHNQVMKMGL  DPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEK FSSSVWKDTACWNSMISMYAQHGK E ALRMFE M+SN+INPNYVTFVSVLSACSHVGFVEDGLQH+NSM+RYGIEPG+EHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRL+EAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VHIFVSRDKVHD+TDLIYLALDELT QMKD G V DTTILE+ID
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

A0A1S4E1S3 pentatricopeptide repeat-containing protein At4g39530 | 0.0e+00 | 92.07
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYTQL YNEKAL YFL+F+R CDDK NEYILASIIRACVQRDGGEPGSQVHSYV KAGFDEDVYVGTSLVDLYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        DKARLVFDGLV KT VTWTAIITGYTKSGRSEVSLQLFNLM+ESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VK+GK LFDRM+VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGS+DALQHGRQIHSY IKV LEHDNFV NALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDAKRVFDVVTC SVV YNAMIEGYSRQEYLCGALEVF+EMRLK VSPSFLTFVSLLGLSA L+CLQLSK+IHGL IKYG SLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFEGTT  DIVVWNALFSGYNLQLKSEEAFKLYSDLQ S+ERPNEFTFAALITAAS LASLQHGQQFHNQVMKMGL  DPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEK FSSSV KDTACWNSMISMYAQHGK E ALRMFE+M+SNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM+RYGIEPG+EHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAG+L+EA EFIEKMTI+PAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLR KMDVNGVVKEPGQSWIE+NGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VH FVSRDKVHD+TDLIYLALDELTMQMKDAGSV DTTILE+ID
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

A0A5D3CQ85 Pentatricopeptide repeat-containing protein | 0.0e+00 | 91.99
Query:  MPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVF
        MPNRNLVSWSSVVSMYTQL YNEKAL YFL+F+R CDDK NEYILASIIRACVQRDGGEPGSQVHSYV KAGFDEDVYVGTSLVDLYAKHGEIDKARLVF
Subjt:  MPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVF

Query:  DGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTL
        DGLV KT VTWTAIITGYTKSGRSEVSLQLFNLM+ESNVIPDKYVLSSILNACSVLG+L+GGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVK+GK L
Subjt:  DGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTL

Query:  FDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKR
        FDRM+VKN+ISWTTMIAGYMQNSYDWEAVELVGEMFR GWKPDEYACSS+LTSCGS+DALQHGRQIHSY IKV LEHDNFV NALIDMYSKCNSLDDAKR
Subjt:  FDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKR

Query:  VFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDAR
        VFDVVTC SVV YNAMIEGYSRQEYLCGALEVF+EMRLK VSPSFLTFVSLLGLSA L+CLQLSK+IHGL IKYG SLDKFTSSALIDVYSKCSCIRDAR
Subjt:  VFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDAR

Query:  HVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEA
        +VFEGTT  DIVVWNALFSGYNLQLKSEEAFKLYSDLQ S+ERPNEFTFAALITAAS LASLQHGQQFHNQVMKMGL  DPFITNALVDMYAKCGSVEEA
Subjt:  HVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEA

Query:  EKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSE
        EK FSSSV KDTACWNSMISMYAQHGK E ALRMFE+M+SNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM+RYGIEPG+EHYASVVTLLGRAG+L+E
Subjt:  EKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSE

Query:  AREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSR
        A EFIEKMTI+PAALVWRSLLSACRVFGNVELAKHA+EMAISIDPMDSGSY+MLSNIFASKGMWGDVKRLR KMDVNGVVKEPGQSWIE+NGEVH FVSR
Subjt:  AREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSR

Query:  DKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        DKVHD+TDLIYLALDELTMQMKDAGSV DTTILE+ID
Subjt:  DKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

A0A6J1EQJ5 pentatricopeptide repeat-containing protein At4g39530 | 0.0e+00 | 89.92
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYT+LGYNEKAL YFLEFRR  D  LNEYILAS IRACVQRDGGEPGSQVHSY++KAGFDEDVYVGTSL+DLYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        +KARL+FDGLVMK+AVTWTAIITGYTKSGRSEVSLQLFNLM ESNV+PDKYVLSS+LNACS+LGFLEGGKQIHA+VLRRETKMDVSTYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDRM+ KN+ISWTTMI+GYMQNSYDWEAVEL  EMFR GWKPDEY CSSILTSCGSVDALQHGRQIHSYIIKV LEHDNFVINALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDA++VFD  T HSVVSYNAMIEGYSRQEYL  ALE+FREMR+KHVSPSFLTFVSLLG+SA L CLQLSKQIHGL IKYGVSLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFE TTN DIVVWNALFSGYNLQ +SEEAF+LY+DLQFSRERPNEFTFAALITAAS LASLQHGQQFHNQV+KMGLGLD FITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEKTFSSSVWKDT CWNSMISMYAQHGKA+ ALRMFE MM+NDI PNYVTFVSVL+ACSHVGFVEDGLQHFNSM+RY IEPGMEHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGN+ELAKHA+ MAISIDPMDSGSYIMLSNIFASK MWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VH+FVSRD+VH+++DLIYLAL+ELT+QMK+AG VLDTTILE  D
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

A0A6J1I6J1 pentatricopeptide repeat-containing protein At4g39530 | 0.0e+00 | 90.32
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        AGTLFDKMPNRNLVSWSSVVSMYT+LGYNEKAL YFLEFRR  D  LNEYILAS IRACVQRDGGEPGSQVHSY++KAGFDEDVYVGTSLVDLYAKHGEI
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        +KARL+FDGLVMK+AVTWTAIITGYTKSGRSEVSLQLFNLM ESNV+PDKYVLSS+LNACS+LGFLEGGKQIHAYVLRRETKMDV TYNVLIDFYTKCGR
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        VKAGK LFDRM++KN+ISWTTMI+GYMQNSYDWEAVEL  EMFR GWKPDEY CSSILTSCGSVDALQHGRQIHSYIIKV LEHDNFVINALIDMYSKCN
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        SLDDA++VFD  T HSVVSYNAMIEGYSRQEYL  ALE+FREMR+KHVSPSFLTFVSLLG+SA L CLQLSKQIHGL IKYGVSLDKFTSSALIDVYSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK
        SCIRDAR+VFE TTN DIVVWNALFSGYNLQ +SEE F+LY+DLQFSRERPNEFTFAALITAAS LASLQHGQQFHNQV+KMGLGLDPFITNALVDMYAK
Subjt:  SCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAK

Query:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG
        CGSVEEAEKTF SSVWKDT CWNSMISMYAQHGKAE AL MFE MM+NDI+PNYVTFVSVL+ACSHVGFVEDGLQHFNSM+RYGIEPGMEHYASVVTLLG
Subjt:  CGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLG

Query:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE
        RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNV+LAKHA+EMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDV+GVVKEPGQSWIEVNGE
Subjt:  RAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGE

Query:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID
        VH+FVSRD+ H+++DLIYLALDELT+QMKDAG VLDTTILE  D
Subjt:  VHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID

SwissProt top hits | e value | % identity | Alignment
Q06801 4-alpha-glucanotransferase, chloroplastic/amyloplastic | 1.6e-161 | 66.67
Query:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL
        VLPLVPPG++ NE+GSPYSGQDANCGNTLLISL+ELV DGLL  EELP+P+  D V +S +++IKDPLI KAA+RL+ S+GELK QLE F RDP+ISSWL
Subjt:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL

Query:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG
        EDAAYFAAIDN +N+ SWY+WPEPLKNRHL+ALE+VYQ E+DFI+IFIAQQFLFQRQW++VR YA  KGI+IMGDMPIYVGY SADVWANKKQFLLN+KG
Subjt:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG

Query:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTV-----------------------------EGQTCNL----AGPGKSLFDAISRAVGKINIF
        FP++VSGVPPDAFSETGQLWG          K+ +SW V                             E +   L     GPGK LFDAI +AVGKINI 
Subjt:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTV-----------------------------EGQTCNL----AGPGKSLFDAISRAVGKINIF

Query:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK
        AEDLGVITEDVVQLRKSI APGMAVLQF FGSD+ NPHLPHNHE NQVVYTGTHDNDTIRGWWD L + EKSN                 VLKYLS  E+
Subjt:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK

Query:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQ
        ++I   LI  A+SSVA+ AIIP+QDVLGLGS +RMNIPATQ
Subjt:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQ

Q8LI30 4-alpha-glucanotransferase DPE1, chloroplastic/amyloplastic | 3.8e-147 | 60.77
Query:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL
        VLPLVPPGRK+ E+GSPYSGQDANCGNTLLISL+ELVKDGLL + ELP P+D ++V+F  VA++K+PLIAKAAERL+ S GEL+ Q + F ++P+IS WL
Subjt:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL

Query:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG
        EDAA FAAID  +++ SWYEWPEPLKNRHL ALED+YQ+++DFI IF+AQQFLFQRQWQR+R YA   GI+IMGDMPIYVGY SADVWAN+K FLL+K G
Subjt:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG

Query:  FPVLVSGVPPDAFSETGQLW----------------------GRGKNCYS-------------WTVEGQT------CNLAGPGKSLFDAISRAVGKINIF
        FP  VSGVPPDAFSETGQLW                       R  + Y              W V  ++         AGP  + FDA+ +AVG+INI 
Subjt:  FPVLVSGVPPDAFSETGQLW----------------------GRGKNCYS-------------WTVEGQT------CNLAGPGKSLFDAISRAVGKINIF

Query:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK
        AEDLGVITEDVV LRKSI APGMAVLQF FG  S NPHLPHNHE +QVVYTGTHDNDT+ GWW  L E EK                  TV KYL    +
Subjt:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK

Query:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQ
         +I WALI AALSSVA+T+++ +QD+LGL SSARMN PATQ
Subjt:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQ

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial | 7.1e-130 | 34.22
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        A + F+ MP R++VSW+S++S Y Q G + K++  F++  R    + +    A I++ C   +    G Q+H  V++ G D DV   ++L+D+YAK    
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
         ++  VF G+  K +V+W+AII G  ++    ++L+ F  M + N    + + +S+L +C+ L  L  G Q+HA+ L+ +   D       +D Y KC  
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        ++  + LFD     N  S+  MI GY Q  + ++A+ L   +  +G   DE + S +  +C  V  L  G QI+   IK  L  D  V NA IDMY KC 
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        +L +A RVFD +     VS+NA+I  + +       L +F  M    + P   TF S+L  + T   L    +IH   +K G++ +     +LID+YSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHV---FEGTTNID-----------------IVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVM
          I +A  +   F    N+                   V WN++ SGY ++ +SE+A  L++ +      P++FT+A ++   + LAS   G+Q H QV+
Subjt:  SCIRDARHV---FEGTTNID-----------------IVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVM

Query:  KMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM
        K  L  D +I + LVDMY+KCG + ++   F  S+ +D   WN+MI  YA HGK E A+++FE M+  +I PN+VTF+S+L AC+H+G ++ GL++F  M
Subjt:  KMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM

Query:  SR-YGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVF-GNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLR
         R YG++P + HY+++V +LG++G++  A E I +M      ++WR+LL  C +   NVE+A+ A+   + +DP DS +Y +LSN++A  GMW  V  LR
Subjt:  SR-YGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVF-GNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLR

Query:  LKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK
          M    + KEPG SW+E+  E+H+F+  DK H   + IY  L  +  +MK
Subjt:  LKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK

Q9LV91 4-alpha-glucanotransferase DPE1, chloroplastic/amyloplastic | e value: 1.3e-152 | %identity: 62.11
Query:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL
        VLPLVPP    +E GSPY+GQDANCGNTLLISLDELVKDGLL K+ELP+P+D D V +     +K PLI KAA+RLI  +GELK +L +F  DP IS WL
Subjt:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL

Query:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG
        EDAAYFAAIDN LN++SW+EWPEPLKNRHLSALE +Y+ +++FI++FIA+QFLFQRQWQ+VR YA  +G+ IMGDMPIYVGY SADVWANKK FLLNKKG
Subjt:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG

Query:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTVEG--------QTCNL-------------------------AGPGKSLFDAISRAVGKINIF
        FP+LVSGVPPD FSETGQLWG           + YSW V            C +                          GPGKSLFDAIS+ VGKI I 
Subjt:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTVEG--------QTCNL-------------------------AGPGKSLFDAISRAVGKINIF

Query:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK
        AEDLGVIT+DVV+LRKSIGAPGMAVLQF FG  + NPHLPHNHE NQVVY+GTHDNDTIRGWWD L++ EKS                   +KYLS+  +
Subjt:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK

Query:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQSSGIG
        DDI W++I+AA SS AQTAIIP+QD+LGLGSSARMN PAT+    G
Subjt:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQSSGIG

Q9SVA5 Pentatricopeptide repeat-containing protein At4g39530 | e value: 2.3e-245 | %identity: 56.46
Query:  DVQHASILVFLSSVSTAEDKIALFPVGAAAGEQAPVVEIGAGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACV
        +V H  I+V+   + T    I +     A G       + A  +F+KMP RNLVSWS++VS     G  E++L  FLEF R   D  NEYIL+S I+AC 
Subjt:  DVQHASILVFLSSVSTAEDKIALFPVGAAAGEQAPVVEIGAGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACV

Query:  QRDGGEPGS--QVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILN
          DG       Q+ S+++K+GFD DVYVGT L+D Y K G ID ARLVFD L  K+ VTWT +I+G  K GRS VSLQLF  +ME NV+PD Y+LS++L+
Subjt:  QRDGGEPGS--QVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILN

Query:  ACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSIL
        ACS+L FLEGGKQIHA++LR   +MD S  NVLID Y KCGRV A   LF+ M  KN+ISWTT+++GY QN+   EA+EL   M + G KPD YACSSIL
Subjt:  ACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSIL

Query:  TSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSR---QEYLCGALEVFREMRLKHVSPSFLTF
        TSC S+ AL  G Q+H+Y IK  L +D++V N+LIDMY+KC+ L DA++VFD+     VV +NAMIEGYSR   Q  L  AL +FR+MR + + PS LTF
Subjt:  TSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSR---QEYLCGALEVFREMRLKHVSPSFLTF

Query:  VSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFT
        VSLL  SA+L  L LSKQIHGL  KYG++LD F  SALIDVYS C C++D+R VF+     D+V+WN++F+GY  Q ++EEA  L+ +LQ SRERP+EFT
Subjt:  VSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFT

Query:  FAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYV
        FA ++TAA  LAS+Q GQ+FH Q++K GL  +P+ITNAL+DMYAKCGS E+A K F S+  +D  CWNS+IS YA HG+ + AL+M E MMS  I PNY+
Subjt:  FAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYV

Query:  TFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDS
        TFV VLSACSH G VEDGL+ F  M R+GIEP  EHY  +V+LLGRAGRL++ARE IEKM  +PAA+VWRSLLS C   GNVELA+HA+EMAI  DP DS
Subjt:  TFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDS

Query:  GSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK
        GS+ MLSNI+ASKGMW + K++R +M V GVVKEPG+SWI +N EVHIF+S+DK H   + IY  LD+L +Q++
Subjt:  GSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK

Arabidopsis top hits
AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 5.1e-131 | %identity: 34.22
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        A + F+ MP R++VSW+S++S Y Q G + K++  F++  R    + +    A I++ C   +    G Q+H  V++ G D DV   ++L+D+YAK    
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
         ++  VF G+  K +V+W+AII G  ++    ++L+ F  M + N    + + +S+L +C+ L  L  G Q+HA+ L+ +   D       +D Y KC  
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        ++  + LFD     N  S+  MI GY Q  + ++A+ L   +  +G   DE + S +  +C  V  L  G QI+   IK  L  D  V NA IDMY KC 
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC
        +L +A RVFD +     VS+NA+I  + +       L +F  M    + P   TF S+L  + T   L    +IH   +K G++ +     +LID+YSKC
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKC

Query:  SCIRDARHV---FEGTTNID-----------------IVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVM
          I +A  +   F    N+                   V WN++ SGY ++ +SE+A  L++ +      P++FT+A ++   + LAS   G+Q H QV+
Subjt:  SCIRDARHV---FEGTTNID-----------------IVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVM

Query:  KMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM
        K  L  D +I + LVDMY+KCG + ++   F  S+ +D   WN+MI  YA HGK E A+++FE M+  +I PN+VTF+S+L AC+H+G ++ GL++F  M
Subjt:  KMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM

Query:  SR-YGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVF-GNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLR
         R YG++P + HY+++V +LG++G++  A E I +M      ++WR+LL  C +   NVE+A+ A+   + +DP DS +Y +LSN++A  GMW  V  LR
Subjt:  SR-YGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVF-GNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLR

Query:  LKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK
          M    + KEPG SW+E+  E+H+F+  DK H   + IY  L  +  +MK
Subjt:  LKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK

AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.8e-128 | %identity: 35.17
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI
        A  LF +M + ++V+W+ ++S + + G    A+ YF   R++   K     L S++ A       + G  VH+  IK G   ++YVG+SLV +Y+K  ++
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYAKHGEI

Query:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR
        + A  VF+ L  K  V W A+I GY  +G S   ++LF  M  S    D +  +S+L+ C+    LE G Q H+ +++++   ++   N L+D Y KCG 
Subjt:  DKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGR

Query:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN
        ++  + +F+RM  ++ ++W T+I  Y+Q+  + EA +L   M   G   D    +S L +C  V  L  G+Q+H   +K  L+ D    ++LIDMYSKC 
Subjt:  VKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCN

Query:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLD-KFTSSALIDVYSK
         + DA++VF  +   SVVS NA+I GYS Q  L  A+ +F+EM  + V+PS +TF +++        L L  Q HG   K G S + ++   +L+ +Y  
Subjt:  SLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLD-KFTSSALIDVYSK

Query:  CSCIRDARHVF-EGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMY
           + +A  +F E ++   IV+W  + SG++     EEA K Y +++     P++ TF  ++   S L+SL+ G+  H+ +  +   LD   +N L+DMY
Subjt:  CSCIRDARHVF-EGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMY

Query:  AKCGSVEEAEKTFSSSVWK-DTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM-SRYGIEPGMEHYASVV
        AKCG ++ + + F     + +   WNS+I+ YA++G AE AL++F+ M  + I P+ +TF+ VL+ACSH G V DG + F  M  +YGIE  ++H A +V
Subjt:  AKCGSVEEAEKTFSSSVWK-DTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM-SRYGIEPGMEHYASVV

Query:  TLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIE
         LLGR G L EA +FIE   ++P A +W SLL ACR+ G+    + ++E  I ++P +S +Y++LSNI+AS+G W     LR  M   GV K PG SWI+
Subjt:  TLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIE

Query:  VNGEVHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILE
        V    HIF + DK H +   I + L++L   MKD  +V++  I+E
Subjt:  VNGEVHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILE

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.8e-129 | %identity: 35.21
Query:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKL-----NEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYA
        A  +FD +  ++  SW +++S  ++     +A+  F      CD  +       Y  +S++ AC + +  E G Q+H  V+K GF  D YV  +LV LY 
Subjt:  AGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKL-----NEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLYA

Query:  KHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFY
          G +  A  +F  +  + AVT+  +I G ++ G  E +++LF  M    + PD   L+S++ ACS  G L  G+Q+HAY  +     +      L++ Y
Subjt:  KHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFY

Query:  TKCGRVKAGKTLFDRMNVKNVISWTTMIAGY-----MQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVIN
         KC  ++     F    V+NV+ W  M+  Y     ++NS+      +  +M      P++Y   SIL +C  +  L+ G QIHS IIK   + + +V +
Subjt:  TKCGRVKAGKTLFDRMNVKNVISWTTMIAGY-----MQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVIN

Query:  ALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTS
         LIDMY+K   LD A  +        VVS+  MI GY++  +   AL  FR+M  + +    +   + +   A L  L+  +QIH  A   G S D    
Subjt:  ALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTS

Query:  SALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFI
        +AL+ +YS+C  I ++   FE T   D + WNAL SG+     +EEA +++  +       N FTF + + AAS  A+++ G+Q H  + K G   +  +
Subjt:  SALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFI

Query:  TNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM-SRYGIEPGM
         NAL+ MYAKCGS+ +AEK F     K+   WN++I+ Y++HG    AL  F+ M+ +++ PN+VT V VLSACSH+G V+ G+ +F SM S YG+ P  
Subjt:  TNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSM-SRYGIEPGM

Query:  EHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKE
        EHY  VV +L RAG LS A+EFI++M I+P ALVWR+LLSAC V  N+E+ + A+   + ++P DS +Y++LSN++A    W      R KM   GV KE
Subjt:  EHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKE

Query:  PGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLD
        PGQSWIEV   +H F   D+ H   D I+    +LT +  + G V D
Subjt:  PGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLD

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.6e-246 | %identity: 56.46
Query:  DVQHASILVFLSSVSTAEDKIALFPVGAAAGEQAPVVEIGAGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACV
        +V H  I+V+   + T    I +     A G       + A  +F+KMP RNLVSWS++VS     G  E++L  FLEF R   D  NEYIL+S I+AC 
Subjt:  DVQHASILVFLSSVSTAEDKIALFPVGAAAGEQAPVVEIGAGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACV

Query:  QRDGGEPGS--QVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILN
          DG       Q+ S+++K+GFD DVYVGT L+D Y K G ID ARLVFD L  K+ VTWT +I+G  K GRS VSLQLF  +ME NV+PD Y+LS++L+
Subjt:  QRDGGEPGS--QVHSYVIKAGFDEDVYVGTSLVDLYAKHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILN

Query:  ACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSIL
        ACS+L FLEGGKQIHA++LR   +MD S  NVLID Y KCGRV A   LF+ M  KN+ISWTT+++GY QN+   EA+EL   M + G KPD YACSSIL
Subjt:  ACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAGKTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSIL

Query:  TSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSR---QEYLCGALEVFREMRLKHVSPSFLTF
        TSC S+ AL  G Q+H+Y IK  L +D++V N+LIDMY+KC+ L DA++VFD+     VV +NAMIEGYSR   Q  L  AL +FR+MR + + PS LTF
Subjt:  TSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKRVFDVVTCHSVVSYNAMIEGYSR---QEYLCGALEVFREMRLKHVSPSFLTF

Query:  VSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFT
        VSLL  SA+L  L LSKQIHGL  KYG++LD F  SALIDVYS C C++D+R VF+     D+V+WN++F+GY  Q ++EEA  L+ +LQ SRERP+EFT
Subjt:  VSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDARHVFEGTTNIDIVVWNALFSGYNLQLKSEEAFKLYSDLQFSRERPNEFT

Query:  FAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYV
        FA ++TAA  LAS+Q GQ+FH Q++K GL  +P+ITNAL+DMYAKCGS E+A K F S+  +D  CWNS+IS YA HG+ + AL+M E MMS  I PNY+
Subjt:  FAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGKAEAALRMFELMMSNDINPNYV

Query:  TFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDS
        TFV VLSACSH G VEDGL+ F  M R+GIEP  EHY  +V+LLGRAGRL++ARE IEKM  +PAA+VWRSLLS C   GNVELA+HA+EMAI  DP DS
Subjt:  TFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHASEMAISIDPMDS

Query:  GSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK
        GS+ MLSNI+ASKGMW + K++R +M V GVVKEPG+SWI +N EVHIF+S+DK H   + IY  LD+L +Q++
Subjt:  GSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMK

AT5G64860.1 disproportionating enzyme | e value: 9.5e-154 | %identity: 62.11
Query:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL
        VLPLVPP    +E GSPY+GQDANCGNTLLISLDELVKDGLL K+ELP+P+D D V +     +K PLI KAA+RLI  +GELK +L +F  DP IS WL
Subjt:  VLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWL

Query:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG
        EDAAYFAAIDN LN++SW+EWPEPLKNRHLSALE +Y+ +++FI++FIA+QFLFQRQWQ+VR YA  +G+ IMGDMPIYVGY SADVWANKK FLLNKKG
Subjt:  EDAAYFAAIDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKG

Query:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTVEG--------QTCNL-------------------------AGPGKSLFDAISRAVGKINIF
        FP+LVSGVPPD FSETGQLWG           + YSW V            C +                          GPGKSLFDAIS+ VGKI I 
Subjt:  FPVLVSGVPPDAFSETGQLWG--------RGKNCYSWTVEG--------QTCNL-------------------------AGPGKSLFDAISRAVGKINIF

Query:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK
        AEDLGVIT+DVV+LRKSIGAPGMAVLQF FG  + NPHLPHNHE NQVVY+GTHDNDTIRGWWD L++ EKS                   +KYLS+  +
Subjt:  AEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEGEKSNVRNTFSEPNTFTRAKTTVLKYLSVTEK

Query:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQSSGIG
        DDI W++I+AA SS AQTAIIP+QD+LGLGSSARMN PAT+    G
Subjt:  DDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQSSGIG


Sequences
CDS sequence
ATGAATGTTCTTCCTCTTGTCCCTCCAGGAAGGAAGGCCAACGAAGAGGGATCACCCTACTCAGGCCAGGATGCAAATTGTGGAAACACTCTCTTGATCTCTCTTGATGA
GCTTGTCAAGGATGGTTTGCTCACAAAAGAGGAGTTGCCAAAACCAGTCGACCATGACCATGTAAAATTCTCTGCTGTTGCTGATATAAAGGATCCCTTGATAGCAAAGG
CTGCAGAAAGGTTAATTCAAAGTGATGGAGAACTTAAAAGACAGCTTGAAGAATTTTGTAGAGACCCTGATATATCAAGCTGGCTCGAAGATGCAGCTTATTTTGCTGCT
ATTGATAACAGACTAAATTCTTTCAGTTGGTATGAATGGCCAGAACCCTTAAAAAATCGGCATCTTTCAGCTTTAGAAGATGTTTACCAAAGGGAAAGAGATTTTATAAA
TATATTCATTGCTCAACAGTTTTTATTCCAGAGACAATGGCAAAGAGTTCGCAGCTATGCAAATATGAAGGGTATTACTATAATGGGGGACATGCCTATATATGTTGGCT
ATCAGAGTGCAGATGTTTGGGCTAATAAGAAGCAATTTTTGCTGAACAAGAAAGGTTTTCCCGTGTTAGTGAGTGGTGTGCCTCCAGATGCATTTAGTGAAACTGGTCAA
CTATGGGGAAGAGGCAAAAATTGCTACAGTTGGACGGTGGAAGGTCAGACTTGTAATTTGGCAGGGCCTGGAAAATCTTTATTTGATGCCATATCCAGAGCGGTTGGGAA
GATCAACATATTTGCAGAAGATTTGGGGGTTATTACTGAAGATGTGGTTCAGCTCAGAAAATCTATAGGGGCACCTGGAATGGCTGTTCTCCAATTTGGTTTCGGAAGTG
ATTCTGCTAATCCGCATTTGCCCCACAATCATGAATCTAACCAAGTTGTCTACACGGGAACTCATGATAATGACACGATTCGTGGCTGGTGGGACAACTTGAATGAAGGA
GAAAAATCCAATGTTCGCAACACCTTCTCTGAACCCAACACATTCACACGTGCAAAAACTACAGTACTGAAGTATCTTTCAGTTACTGAGAAAGATGATATCCCCTGGGC
TCTCATCCGGGCAGCATTGTCTTCTGTGGCACAAACTGCCATAATACCTTTGCAAGATGTTCTTGGGTTAGGGAGTTCTGCAAGGATGAACATTCCAGCAACACAGAGCA
GCGGAATAGGTGGAGCCAGTGGTGGTGGTGGCGGCGGCGACGTCCAGCATGCTTCAATTTTGGTGTTTCTGAGTTCGGTTTCGACGGCGGAGGATAAGATTGCTCTTTTT
CCTGTCGGTGCGGCCGCCGGCGAACAGGCTCCAGTCGTCGAAATTGGGGCAGGAACATTGTTTGACAAAATGCCTAATAGGAACCTAGTATCTTGGTCTTCAGTGGTTTC
TATGTATACCCAATTAGGCTATAATGAAAAAGCATTGCATTATTTCTTGGAATTCCGGCGGAACTGTGACGATAAACTGAATGAATATATTTTAGCTAGCATTATCAGAG
CTTGTGTTCAACGTGATGGTGGTGAACCTGGTTCCCAAGTCCATAGCTACGTTATTAAAGCAGGTTTTGATGAGGATGTTTATGTAGGAACCTCTTTGGTGGACTTGTAT
GCAAAACATGGTGAGATAGATAAAGCAAGATTGGTATTTGATGGTTTAGTCATGAAGACTGCTGTCACTTGGACTGCTATTATTACGGGGTACACGAAAAGTGGAAGGAG
TGAGGTTTCATTGCAATTGTTTAACTTAATGATGGAAAGTAATGTTATACCTGATAAATATGTGCTCTCGAGCATTCTAAATGCGTGTTCAGTGCTTGGTTTTCTGGAAG
GCGGTAAGCAAATTCATGCTTATGTTCTGAGGAGGGAAACAAAGATGGATGTGTCAACATATAACGTTCTTATAGACTTCTATACGAAATGTGGTAGAGTGAAAGCTGGG
AAAACACTTTTTGATAGAATGAATGTTAAGAATGTTATTTCTTGGACTACCATGATTGCTGGGTACATGCAAAATTCATATGATTGGGAAGCTGTGGAACTAGTTGGTGA
AATGTTCAGAACGGGATGGAAGCCTGACGAATATGCTTGCTCAAGCATTCTTACTTCATGTGGTTCAGTTGATGCTTTACAGCATGGAAGACAAATACATTCTTATATTA
TCAAGGTTTATCTTGAACATGATAACTTTGTGATAAATGCTTTAATTGACATGTATTCAAAATGTAATTCATTGGATGATGCAAAACGAGTCTTTGATGTCGTGACTTGT
CACAGTGTGGTCTCTTACAATGCAATGATTGAAGGCTATTCGAGACAAGAGTACTTGTGTGGAGCACTGGAAGTTTTCCGTGAGATGAGGCTAAAACATGTTTCACCAAG
CTTTTTAACATTTGTAAGCCTTCTGGGTTTATCAGCTACATTAGTCTGCTTGCAACTGAGCAAGCAAATCCATGGCCTTGCCATCAAATATGGGGTCTCTTTAGATAAGT
TCACTAGCAGTGCTCTTATAGATGTTTATTCTAAATGTTCATGCATTAGAGATGCAAGGCATGTGTTTGAAGGGACAACTAATATAGATATAGTTGTTTGGAATGCACTA
TTTTCTGGATATAACCTACAATTAAAAAGTGAGGAGGCTTTCAAACTATACTCAGATTTACAGTTCTCAAGAGAGAGGCCAAATGAGTTCACTTTTGCAGCTTTGATTAC
AGCAGCGAGCACCCTGGCAAGTCTCCAGCATGGTCAACAGTTCCATAATCAAGTAATGAAGATGGGTCTAGGATTAGACCCTTTCATCACTAATGCCCTTGTGGATATGT
ATGCCAAATGTGGGAGTGTGGAAGAAGCTGAAAAAACATTTTCCTCCTCAGTATGGAAAGATACTGCATGCTGGAACTCCATGATTTCAATGTATGCACAACATGGAAAA
GCAGAAGCCGCTCTTAGGATGTTTGAATTAATGATGAGCAATGACATAAATCCCAATTATGTCACTTTTGTGAGTGTGCTATCAGCTTGTAGTCATGTGGGGTTTGTTGA
AGATGGACTTCAACATTTCAATTCAATGTCCAGATATGGAATTGAACCAGGAATGGAACATTATGCTTCAGTTGTTACTCTTTTGGGTAGGGCTGGGAGGTTATCTGAAG
CTCGAGAATTCATTGAGAAGATGACAATAAGACCAGCAGCATTAGTATGGAGGAGCTTGCTTAGTGCATGTAGAGTTTTTGGCAATGTTGAGTTAGCAAAACATGCTTCA
GAGATGGCAATTTCGATTGACCCCATGGACAGTGGATCATATATTATGCTTTCAAATATTTTTGCATCTAAAGGTATGTGGGGAGATGTTAAAAGGCTGAGGCTGAAAAT
GGATGTTAACGGTGTAGTTAAAGAACCTGGACAGAGCTGGATTGAGGTCAACGGTGAAGTTCATATATTTGTTTCAAGAGACAAAGTCCATGATGATACCGATTTAATTT
ATCTAGCTTTGGATGAACTAACTATGCAGATGAAAGATGCAGGTTCTGTACTTGATACCACAATTCTTGAAGTGATTGATTGA
mRNA sequence
ATGAATGTTCTTCCTCTTGTCCCTCCAGGAAGGAAGGCCAACGAAGAGGGATCACCCTACTCAGGCCAGGATGCAAATTGTGGAAACACTCTCTTGATCTCTCTTGATGA
GCTTGTCAAGGATGGTTTGCTCACAAAAGAGGAGTTGCCAAAACCAGTCGACCATGACCATGTAAAATTCTCTGCTGTTGCTGATATAAAGGATCCCTTGATAGCAAAGG
CTGCAGAAAGGTTAATTCAAAGTGATGGAGAACTTAAAAGACAGCTTGAAGAATTTTGTAGAGACCCTGATATATCAAGCTGGCTCGAAGATGCAGCTTATTTTGCTGCT
ATTGATAACAGACTAAATTCTTTCAGTTGGTATGAATGGCCAGAACCCTTAAAAAATCGGCATCTTTCAGCTTTAGAAGATGTTTACCAAAGGGAAAGAGATTTTATAAA
TATATTCATTGCTCAACAGTTTTTATTCCAGAGACAATGGCAAAGAGTTCGCAGCTATGCAAATATGAAGGGTATTACTATAATGGGGGACATGCCTATATATGTTGGCT
ATCAGAGTGCAGATGTTTGGGCTAATAAGAAGCAATTTTTGCTGAACAAGAAAGGTTTTCCCGTGTTAGTGAGTGGTGTGCCTCCAGATGCATTTAGTGAAACTGGTCAA
CTATGGGGAAGAGGCAAAAATTGCTACAGTTGGACGGTGGAAGGTCAGACTTGTAATTTGGCAGGGCCTGGAAAATCTTTATTTGATGCCATATCCAGAGCGGTTGGGAA
GATCAACATATTTGCAGAAGATTTGGGGGTTATTACTGAAGATGTGGTTCAGCTCAGAAAATCTATAGGGGCACCTGGAATGGCTGTTCTCCAATTTGGTTTCGGAAGTG
ATTCTGCTAATCCGCATTTGCCCCACAATCATGAATCTAACCAAGTTGTCTACACGGGAACTCATGATAATGACACGATTCGTGGCTGGTGGGACAACTTGAATGAAGGA
GAAAAATCCAATGTTCGCAACACCTTCTCTGAACCCAACACATTCACACGTGCAAAAACTACAGTACTGAAGTATCTTTCAGTTACTGAGAAAGATGATATCCCCTGGGC
TCTCATCCGGGCAGCATTGTCTTCTGTGGCACAAACTGCCATAATACCTTTGCAAGATGTTCTTGGGTTAGGGAGTTCTGCAAGGATGAACATTCCAGCAACACAGAGCA
GCGGAATAGGTGGAGCCAGTGGTGGTGGTGGCGGCGGCGACGTCCAGCATGCTTCAATTTTGGTGTTTCTGAGTTCGGTTTCGACGGCGGAGGATAAGATTGCTCTTTTT
CCTGTCGGTGCGGCCGCCGGCGAACAGGCTCCAGTCGTCGAAATTGGGGCAGGAACATTGTTTGACAAAATGCCTAATAGGAACCTAGTATCTTGGTCTTCAGTGGTTTC
TATGTATACCCAATTAGGCTATAATGAAAAAGCATTGCATTATTTCTTGGAATTCCGGCGGAACTGTGACGATAAACTGAATGAATATATTTTAGCTAGCATTATCAGAG
CTTGTGTTCAACGTGATGGTGGTGAACCTGGTTCCCAAGTCCATAGCTACGTTATTAAAGCAGGTTTTGATGAGGATGTTTATGTAGGAACCTCTTTGGTGGACTTGTAT
GCAAAACATGGTGAGATAGATAAAGCAAGATTGGTATTTGATGGTTTAGTCATGAAGACTGCTGTCACTTGGACTGCTATTATTACGGGGTACACGAAAAGTGGAAGGAG
TGAGGTTTCATTGCAATTGTTTAACTTAATGATGGAAAGTAATGTTATACCTGATAAATATGTGCTCTCGAGCATTCTAAATGCGTGTTCAGTGCTTGGTTTTCTGGAAG
GCGGTAAGCAAATTCATGCTTATGTTCTGAGGAGGGAAACAAAGATGGATGTGTCAACATATAACGTTCTTATAGACTTCTATACGAAATGTGGTAGAGTGAAAGCTGGG
AAAACACTTTTTGATAGAATGAATGTTAAGAATGTTATTTCTTGGACTACCATGATTGCTGGGTACATGCAAAATTCATATGATTGGGAAGCTGTGGAACTAGTTGGTGA
AATGTTCAGAACGGGATGGAAGCCTGACGAATATGCTTGCTCAAGCATTCTTACTTCATGTGGTTCAGTTGATGCTTTACAGCATGGAAGACAAATACATTCTTATATTA
TCAAGGTTTATCTTGAACATGATAACTTTGTGATAAATGCTTTAATTGACATGTATTCAAAATGTAATTCATTGGATGATGCAAAACGAGTCTTTGATGTCGTGACTTGT
CACAGTGTGGTCTCTTACAATGCAATGATTGAAGGCTATTCGAGACAAGAGTACTTGTGTGGAGCACTGGAAGTTTTCCGTGAGATGAGGCTAAAACATGTTTCACCAAG
CTTTTTAACATTTGTAAGCCTTCTGGGTTTATCAGCTACATTAGTCTGCTTGCAACTGAGCAAGCAAATCCATGGCCTTGCCATCAAATATGGGGTCTCTTTAGATAAGT
TCACTAGCAGTGCTCTTATAGATGTTTATTCTAAATGTTCATGCATTAGAGATGCAAGGCATGTGTTTGAAGGGACAACTAATATAGATATAGTTGTTTGGAATGCACTA
TTTTCTGGATATAACCTACAATTAAAAAGTGAGGAGGCTTTCAAACTATACTCAGATTTACAGTTCTCAAGAGAGAGGCCAAATGAGTTCACTTTTGCAGCTTTGATTAC
AGCAGCGAGCACCCTGGCAAGTCTCCAGCATGGTCAACAGTTCCATAATCAAGTAATGAAGATGGGTCTAGGATTAGACCCTTTCATCACTAATGCCCTTGTGGATATGT
ATGCCAAATGTGGGAGTGTGGAAGAAGCTGAAAAAACATTTTCCTCCTCAGTATGGAAAGATACTGCATGCTGGAACTCCATGATTTCAATGTATGCACAACATGGAAAA
GCAGAAGCCGCTCTTAGGATGTTTGAATTAATGATGAGCAATGACATAAATCCCAATTATGTCACTTTTGTGAGTGTGCTATCAGCTTGTAGTCATGTGGGGTTTGTTGA
AGATGGACTTCAACATTTCAATTCAATGTCCAGATATGGAATTGAACCAGGAATGGAACATTATGCTTCAGTTGTTACTCTTTTGGGTAGGGCTGGGAGGTTATCTGAAG
CTCGAGAATTCATTGAGAAGATGACAATAAGACCAGCAGCATTAGTATGGAGGAGCTTGCTTAGTGCATGTAGAGTTTTTGGCAATGTTGAGTTAGCAAAACATGCTTCA
GAGATGGCAATTTCGATTGACCCCATGGACAGTGGATCATATATTATGCTTTCAAATATTTTTGCATCTAAAGGTATGTGGGGAGATGTTAAAAGGCTGAGGCTGAAAAT
GGATGTTAACGGTGTAGTTAAAGAACCTGGACAGAGCTGGATTGAGGTCAACGGTGAAGTTCATATATTTGTTTCAAGAGACAAAGTCCATGATGATACCGATTTAATTT
ATCTAGCTTTGGATGAACTAACTATGCAGATGAAAGATGCAGGTTCTGTACTTGATACCACAATTCTTGAAGTGATTGATTGA
Protein sequence
MNVLPLVPPGRKANEEGSPYSGQDANCGNTLLISLDELVKDGLLTKEELPKPVDHDHVKFSAVADIKDPLIAKAAERLIQSDGELKRQLEEFCRDPDISSWLEDAAYFAA
IDNRLNSFSWYEWPEPLKNRHLSALEDVYQRERDFINIFIAQQFLFQRQWQRVRSYANMKGITIMGDMPIYVGYQSADVWANKKQFLLNKKGFPVLVSGVPPDAFSETGQ
LWGRGKNCYSWTVEGQTCNLAGPGKSLFDAISRAVGKINIFAEDLGVITEDVVQLRKSIGAPGMAVLQFGFGSDSANPHLPHNHESNQVVYTGTHDNDTIRGWWDNLNEG
EKSNVRNTFSEPNTFTRAKTTVLKYLSVTEKDDIPWALIRAALSSVAQTAIIPLQDVLGLGSSARMNIPATQSSGIGGASGGGGGGDVQHASILVFLSSVSTAEDKIALF
PVGAAAGEQAPVVEIGAGTLFDKMPNRNLVSWSSVVSMYTQLGYNEKALHYFLEFRRNCDDKLNEYILASIIRACVQRDGGEPGSQVHSYVIKAGFDEDVYVGTSLVDLY
AKHGEIDKARLVFDGLVMKTAVTWTAIITGYTKSGRSEVSLQLFNLMMESNVIPDKYVLSSILNACSVLGFLEGGKQIHAYVLRRETKMDVSTYNVLIDFYTKCGRVKAG
KTLFDRMNVKNVISWTTMIAGYMQNSYDWEAVELVGEMFRTGWKPDEYACSSILTSCGSVDALQHGRQIHSYIIKVYLEHDNFVINALIDMYSKCNSLDDAKRVFDVVTC
HSVVSYNAMIEGYSRQEYLCGALEVFREMRLKHVSPSFLTFVSLLGLSATLVCLQLSKQIHGLAIKYGVSLDKFTSSALIDVYSKCSCIRDARHVFEGTTNIDIVVWNAL
FSGYNLQLKSEEAFKLYSDLQFSRERPNEFTFAALITAASTLASLQHGQQFHNQVMKMGLGLDPFITNALVDMYAKCGSVEEAEKTFSSSVWKDTACWNSMISMYAQHGK
AEAALRMFELMMSNDINPNYVTFVSVLSACSHVGFVEDGLQHFNSMSRYGIEPGMEHYASVVTLLGRAGRLSEAREFIEKMTIRPAALVWRSLLSACRVFGNVELAKHAS
EMAISIDPMDSGSYIMLSNIFASKGMWGDVKRLRLKMDVNGVVKEPGQSWIEVNGEVHIFVSRDKVHDDTDLIYLALDELTMQMKDAGSVLDTTILEVID