; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G006560 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G006560
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr08:4258204..4266268
RNA-Seq ExpressionCmoCh08G006560
SyntenyCmoCh08G006560
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013766 - Thioredoxin domain
IPR018108 - Mitochondrial substrate/solute carrier
IPR023395 - Mitochondrial carrier domain superfamily
IPR032867 - DYW domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059294.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.9e-30582.99Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        RISSC   S FK +F     P  SS F   SSRNQSTKTHSVR SPRPALGP   +SP SYYASLLQSC V+KAIEPGKQLHARIWQMG+ FNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        +NLYC+CNSL NA LLFDRISKR+ FLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPDKFTFPFVLKACSALSAMEEGKKIH+ VI  GLE+DVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA  G+ PTEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK GSV+VAR LFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL+ELEPD+GGNYVILSNMYAQAGKW+GVARLRD MMD+GLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWIEV NKVHAF SEDTSHP+SEAIYAELKR+GKLMKEAGYAPQ+GSVFHDVEDDEK DMV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCG
        FISKIT+REI+IRDVNRYHHFKDG+CSCG
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCG

XP_008462155.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo]5.4e-30782.75Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        RISSC   S FK +F     P  SS F   SSRNQSTKTHSVR SPRPALGP   +SP SYYASLLQSC V+KAIEPGKQLHARIWQMG+ FNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        +NLYC+CNSL NA LLFDRISKR+ FLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPDKFTFPFVLKACSALSAMEEGKKIH+ VI  GLE+DVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA  G+ PTEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK GSV+VAR LFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL+ELEPD+GGNYVILSNMYAQAGKW+GVARLRD+MMD+GLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWI+V NKVHAF SED SHP+SEAIYAELKR+GKLMKEAGYAPQ+GSVFHDVEDDEK DMV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW
        FISKIT+REI+IRDVNRYHHFKDG+CSCGDFW
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW

XP_022964400.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]0.0e+00100Show/hide
Query:  MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN
        MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN
Subjt:  MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN

Query:  LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI
        LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI
Subjt:  LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI

Query:  DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL
        DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL
Subjt:  DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL

Query:  VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC
        VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC
Subjt:  VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC

Query:  MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW
        MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW
Subjt:  MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW

Query:  IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI
        IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI
Subjt:  IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI

Query:  SKITKREISIRDVNRYHHFKDGLCSCGDFW
        SKITKREISIRDVNRYHHFKDGLCSCGDFW
Subjt:  SKITKREISIRDVNRYHHFKDGLCSCGDFW

XP_031744701.1 pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus]1.1e-30282.19Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINL
        R  SC   S  K +F     P  SS FY  SSRNQSTKTHSVRP+    L  +SP SYYASLLQSC V+KAIEPGKQLHARI Q+G+ FNPLLATKL+NL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINL

Query:  YCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALID
        YC+CNSL NAHLLFDRISKR+LFLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPDKFTFPFVLKACSALSAMEEGKKIH+ VI +GLE+DVFVGAALID
Subjt:  YCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALID

Query:  MYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALV
        MYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA+ G+ PTEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKTAL+
Subjt:  MYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALV

Query:  DMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCM
        DMYAK GSV+VARSLFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHYTCM
Subjt:  DMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCM

Query:  IDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWI
        IDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KLVELEPD+GGNYVILSNMYAQAGKW+GVARLRD+MM++GLKKSIACSWI
Subjt:  IDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWI

Query:  EVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFIS
        EV NKVHAF SEDTSHP+SEAIYAELKR GKLMKEAGYAPQ+GSVFHDVEDDEK DMV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIKFIS
Subjt:  EVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFIS

Query:  KITKREISIRDVNRYHHFKDGLCSCGDFW
        KIT+REI+IRDVNRYHHFKDG+CSCGDFW
Subjt:  KITKREISIRDVNRYHHFKDGLCSCGDFW

XP_038898350.1 pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]0.0e+0084.34Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        R+SSC   SY KS+F   SS S +S FY  SSRNQSTK HSVRPSPRPAL P   +S  SYYASLLQSC ++KAIEPGKQLHAR+WQMG+ FNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        +NLYC+CNSL NAHLLFDRISKR+LFLWNVMIRGYAWNGPYEVAISLYYQMQD+G VPDK+TFPFVLKACSALSAMEEGKKIH+ V+  GLE+DVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV  A+QVFDK+ ERD VCWNSMLA YSQNGQPD+ L+LCR MA+ GV PTEGTLVISISASAD+ LLPQGKELHGYSWRHGF  ND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK GSV+VARSLFE+L+EKRVVSWNAMI+GYAMHGHA +ALDLF++MK  AL DHITFVGVLAACS  G L +GKMYFRSM+SDYII PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL+ELEPD+GGNYVILSNMYAQAG WEGVARLRD+MMD+GLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWIEVRNKVHAF SEDTSHPESEAIYAELK IGKLMKEAGYAPQIGSVFHDVEDDEK DMVC HSERLAIA+GLIST +G++LLIIKNLRVCEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW
        FISKIT+REI+IRDVNRYHHFKDG CSCGDFW
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW

TrEMBL top hitse value%identityAlignment
A0A0A0KAF0 DYW_deaminase domain-containing protein3.3e-29485.34Show/hide
Query:  SYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPD
        SYYASLLQSC V+KAIEPGKQLHARI Q+G+ FNPLLATKL+NLYC+CNSL NAHLLFDRISKR+LFLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPD
Subjt:  SYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPD

Query:  KFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNP
        KFTFPFVLKACSALSAMEEGKKIH+ VI +GLE+DVFVGAALIDMYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA+ G+ P
Subjt:  KFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNP

Query:  TEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL
        TEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKTAL+DMYAK GSV+VARSLFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L 
Subjt:  TEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL

Query:  DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELE
        DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHYTCMIDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KLVELE
Subjt:  DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELE

Query:  PDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKAD
        PD+GGNYVILSNMYAQAGKW+GVARLRD+MM++GLKKSIACSWIEV NKVHAF SEDTSHP+SEAIYAELKR GKLMKEAGYAPQ+GSVFHDVEDDEK D
Subjt:  PDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKAD

Query:  MVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW
        MV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIKFISKIT+REI+IRDVNRYHHFKDG+CSCGDFW
Subjt:  MVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW

A0A1S3CG77 pentatricopeptide repeat-containing protein At4g21065-like2.6e-30782.75Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        RISSC   S FK +F     P  SS F   SSRNQSTKTHSVR SPRPALGP   +SP SYYASLLQSC V+KAIEPGKQLHARIWQMG+ FNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        +NLYC+CNSL NA LLFDRISKR+ FLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPDKFTFPFVLKACSALSAMEEGKKIH+ VI  GLE+DVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA  G+ PTEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK GSV+VAR LFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL+ELEPD+GGNYVILSNMYAQAGKW+GVARLRD+MMD+GLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWI+V NKVHAF SED SHP+SEAIYAELKR+GKLMKEAGYAPQ+GSVFHDVEDDEK DMV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW
        FISKIT+REI+IRDVNRYHHFKDG+CSCGDFW
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW

A0A5A7UW41 Pentatricopeptide repeat-containing protein1.9e-30582.99Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        RISSC   S FK +F     P  SS F   SSRNQSTKTHSVR SPRPALGP   +SP SYYASLLQSC V+KAIEPGKQLHARIWQMG+ FNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        +NLYC+CNSL NA LLFDRISKR+ FLWNVMIRGYAWNGPYE+AISLYYQM+DYG VPDKFTFPFVLKACSALSAMEEGKKIH+ VI  GLE+DVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV SA+QVFDK+ ERDVVCWNSMLA YSQNGQPD+ L+LCR MA  G+ PTEGT VISI+ASAD+ LLPQGKELHGYSWRHGF  ND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK GSV+VAR LFELL+EKRVVSWNAMI+GYAMHGHAN+ALDLFKEMK   L DHITFVGVLAACS  G L +GKM+FRSM+SD+ I PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCG LEEAY LIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL+ELEPD+GGNYVILSNMYAQAGKW+GVARLRD MMD+GLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWIEV NKVHAF SEDTSHP+SEAIYAELKR+GKLMKEAGYAPQ+GSVFHDVEDDEK DMV  HSERLAIA+GLIST+ G++LLIIKNLR+CEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCG
        FISKIT+REI+IRDVNRYHHFKDG+CSCG
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCG

A0A6J1DBD3 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like2.4e-30081.96Show/hide
Query:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL
        RIS C   S  KS+F T SS S+ S FYS              PS RPAL P   ++  SYYASLLQSC VQKAIEPGKQLHARI+Q+GLGFNPLLATKL
Subjt:  RISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGP---KSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKL

Query:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA
        ++LY  CNSL NAHLLFDRISKR+LFLWNVMIRGYAWNGPYE AISLYYQMQD+GF PDKFTFPFVLKACSALS MEEGKKIH+HVI  GLETDVFVGAA
Subjt:  INLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAA

Query:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT
        LIDMYAKCGCV SA+QVFDK++ERD VCWNSMLAAYSQNGQP D LSLC EMA+A V PTEGTLVISISASADS  LPQGKELHGYSWRHGFGLND+VKT
Subjt:  LIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKT

Query:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY
        ALVDMYAK G V VARSLFE+L +KRVVSWN MI+GYAMHGHA +ALDLF+EMK+ A+ D+ITFVGVLAACS  G L +GKMYFRSM+ D +I PTVQHY
Subjt:  ALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHY

Query:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC
        TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVE+GELAL KL ELEPD+GGN+VILSNMYAQAGKWEGVAR+RD+MMDRGLKKSIAC
Subjt:  TCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIAC

Query:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK
        SWIEV+NKVHAF SEDTSHP+ EAIYAELKRIGKLMKEAGYAPQ  SVFHDVEDDEK DMVC HSERLAIA+GLISTA G+RLLI KNLRVCEDCHVAIK
Subjt:  SWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIK

Query:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW
        FISKI  REI+IRDVNRYHHFKDGLCSCGDFW
Subjt:  FISKITKREISIRDVNRYHHFKDGLCSCGDFW

A0A6J1HKP7 pentatricopeptide repeat-containing protein At4g21065-like0.0e+00100Show/hide
Query:  MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN
        MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN
Subjt:  MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLIN

Query:  LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI
        LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI
Subjt:  LYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALI

Query:  DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL
        DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL
Subjt:  DMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTAL

Query:  VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC
        VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC
Subjt:  VDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTC

Query:  MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW
        MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW
Subjt:  MIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSW

Query:  IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI
        IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI
Subjt:  IEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFI

Query:  SKITKREISIRDVNRYHHFKDGLCSCGDFW
        SKITKREISIRDVNRYHHFKDGLCSCGDFW
Subjt:  SKITKREISIRDVNRYHHFKDGLCSCGDFW

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210652.7e-13141.94Show/hide
Query:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVC----NSLINAHLLFDRISKR-SLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFV-
        +LLQ+ GV  +I   +Q+HA   + G+  +     K +  Y V       +  AH +F +I K  ++F+WN +IRGYA  G    A SLY +M+  G V 
Subjt:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVC----NSLINAHLLFDRISKR-SLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFV-

Query:  PDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGV
        PD  T+PF++KA + ++ +  G+ IH  VI +G  + ++V  +L+ +YA CG V SA +VFDKM E+D+V WNS++  +++NG+P++ L+L  EM   G+
Subjt:  PDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGV

Query:  NPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMK--E
         P   T+V  +SA A    L  GK +H Y  + G   N      L+D+YA+CG V  A++LF+ + +K  VSW ++I G A++G   +A++LFK M+  E
Subjt:  NPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMK--E

Query:  IALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKL
          L   ITFVG+L ACS  G + +G  YFR M  +Y I P ++H+ CM+DLL   G +++AY  I  M ++P+  +W  LL +C +HG+ +L E A  ++
Subjt:  IALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKL

Query:  VELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDD
        ++LEP++ G+YV+LSNMYA   +W  V ++R  M+  G+KK    S +EV N+VH F   D SHP+S+AIYA+LK +   ++  GY PQI +V+ DVE++
Subjt:  VELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDD

Query:  EKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW
        EK + V  HSE++AIA+ LIST   S + ++KNLRVC DCH+AIK +SK+  REI +RD +R+HHFK+G CSC D+W
Subjt:  EKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.5e-13139.34Show/hide
Query:  PASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAH-------------------------------LLFDRISKRSL
        P SY +  +L+SC   KA + G+Q+H  + ++G   +  + T LI++Y     L +AH                                LFD I  + +
Subjt:  PASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAH-------------------------------LLFDRISKRSL

Query:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD
          WN MI GYA  G Y+ A+ L+  M      PD+ T   V+ AC+   ++E G+++H  +   G  +++ +  ALID+Y+KCG + +A  +F+++  +D
Subjt:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD

Query:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLND--RVKTALVDMYAKCGSVHVARSLFELLK
        V+ WN+++  Y+      + L L +EM  +G  P + T++  + A A    +  G+ +H Y  +   G+ +   ++T+L+DMYAKCG +  A  +F  + 
Subjt:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLND--RVKTALVDMYAKCGSVHVARSLFELLK

Query:  EKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIME
         K + SWNAMI G+AMHG A+ + DLF  M++I +  D ITFVG+L+ACS  G L  G+  FR+M  DY + P ++HY CMIDLLGH G  +EA  +I  
Subjt:  EKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIME

Query:  MRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPES
        M +EPD  +W +LL +CK+HGNVELGE     L+++EP+N G+YV+LSN+YA AG+W  VA+ R ++ D+G+KK   CS IE+ + VH F   D  HP +
Subjt:  MRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPES

Query:  EAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFK
          IY  L+ +  L+++AG+ P    V  ++E++ K   +  HSE+LAIA+GLIST  G++L I+KNLRVC +CH A K ISKI KREI  RD  R+HHF+
Subjt:  EAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFK

Query:  DGLCSCGDFW
        DG+CSC D+W
Subjt:  DGLCSCGDFW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233302.2e-13339.97Show/hide
Query:  YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCV---CNSLINAHLLFDRISKRS---------------------------------L
        + S+L+SC +   +  G+ +H  I ++G+  +      L+N+Y       S I+   +FD + +R+                                 +
Subjt:  YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCV---CNSLINAHLLFDRISKRS---------------------------------L

Query:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD
          +N +I GYA +G YE A+ +  +M      PD FT   VL   S    + +GK+IH +VI  G+++DV++G++L+DMYAK   +  +++VF ++  RD
Subjt:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD

Query:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEK
         + WNS++A Y QNG+ ++ L L R+M  A V P        I A A    L  GK+LHGY  R GFG N  + +ALVDMY+KCG++  AR +F+ +   
Subjt:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEK

Query:  RVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMR
          VSW A+I G+A+HGH ++A+ LF+EMK   +  + + FV VL ACS VG + +   YF SM   Y +   ++HY  + DLLG  G LEEAYN I +M 
Subjt:  RVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMR

Query:  VEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEA
        VEP   VW  LL SC +H N+EL E    K+  ++ +N G YV++ NMYA  G+W+ +A+LR  M  +GL+K  ACSWIE++NK H F S D SHP  + 
Subjt:  VEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEA

Query:  IYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDG
        I   LK + + M++ GY      V HDV+++ K +++ GHSERLA+A+G+I+T  G+ + + KN+R+C DCHVAIKFISKIT+REI +RD +R+HHF  G
Subjt:  IYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDG

Query:  LCSCGDFW
         CSCGD+W
Subjt:  LCSCGDFW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic3.5e-13140.81Show/hide
Query:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTF
        S+   C   + I  G+ +H+   +            L+++Y  C  L +A  +F  +S RS+  +  MI GYA  G    A+ L+ +M++ G  PD +T 
Subjt:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTF

Query:  PFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCR-EMAYAGVNPTEG
          VL  C+    ++EGK++HE +    L  D+FV  AL+DMYAKCG +  A+ VF +M  +D++ WN+++  YS+N   ++ LSL    +     +P E 
Subjt:  PFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCR-EMAYAGVNPTEG

Query:  TLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIAL-LDH
        T+   + A A      +G+E+HGY  R+G+  +  V  +LVDMYAKCG++ +A  LF+ +  K +VSW  MI+GY MHG   +A+ LF +M++  +  D 
Subjt:  TLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIAL-LDH

Query:  ITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPD
        I+FV +L ACS  G + +G  +F  M  +  I PTV+HY C++D+L   G L +AY  I  M + PDA +WGALL  C+IH +V+L E    K+ ELEP+
Subjt:  ITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPD

Query:  NGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMV
        N G YV+++N+YA+A KWE V RLR  +  RGL+K+  CSWIE++ +V+ F + D+S+PE+E I A L+++   M E GY+P       D E+ EK + +
Subjt:  NGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMV

Query:  CGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW
        CGHSE+LA+A G+IS+  G  + + KNLRVC DCH   KF+SK+T+REI +RD NR+H FKDG CSC  FW
Subjt:  CGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic1.7e-13841.51Show/hide
Query:  LSSPSRSSRFYSSSSRNQ----STKTHSVRPSPRPALGPKSPASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHL
        L++PS SS   +  S NQ      K   ++ + R      SP+   Y  L+  CG + ++    ++H  I   G   +P LATKLI +Y    S+  A  
Subjt:  LSSPSRSSRFYSSSSRNQ----STKTHSVRPSPRPALGPKSPASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHL

Query:  LFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSA----LSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCV
        +FD+  KR++++WN + R     G  E  + LY++M   G   D+FT+ +VLKAC A    ++ + +GK+IH H+   G  + V++   L+DMYA+ GCV
Subjt:  LFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSA----LSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCV

Query:  VSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREM--AYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKC
          A  VF  M  R+VV W++M+A Y++NG+  + L   REM       +P   T+V  + A A    L QGK +HGY  R G      V +ALV MY +C
Subjt:  VSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREM--AYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKC

Query:  GSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEM-KEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLG
        G + V + +F+ + ++ VVSWN++IS Y +HG+  +A+ +F+EM    A    +TFV VL ACS  G + +GK  F +M  D+ I+P ++HY CM+DLLG
Subjt:  GSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEM-KEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLG

Query:  HCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNK
            L+EA  ++ +MR EP   VWG+LL SC+IHGNVEL E A R+L  LEP N GNYV+L+++YA+A  W+ V R++ ++  RGL+K     W+EVR K
Subjt:  HCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNK

Query:  VHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKR
        +++F S D  +P  E I+A L ++ + MKE GY PQ   V +++E +EK  +V GHSE+LA+A+GLI+T+ G  + I KNLR+CEDCH+  KFISK  ++
Subjt:  VHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKR

Query:  EISIRDVNRYHHFKDGLCSCGDFW
        EI +RDVNR+H FK+G+CSCGD+W
Subjt:  EISIRDVNRYHHFKDGLCSCGDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-13239.34Show/hide
Query:  PASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAH-------------------------------LLFDRISKRSL
        P SY +  +L+SC   KA + G+Q+H  + ++G   +  + T LI++Y     L +AH                                LFD I  + +
Subjt:  PASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAH-------------------------------LLFDRISKRSL

Query:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD
          WN MI GYA  G Y+ A+ L+  M      PD+ T   V+ AC+   ++E G+++H  +   G  +++ +  ALID+Y+KCG + +A  +F+++  +D
Subjt:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD

Query:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLND--RVKTALVDMYAKCGSVHVARSLFELLK
        V+ WN+++  Y+      + L L +EM  +G  P + T++  + A A    +  G+ +H Y  +   G+ +   ++T+L+DMYAKCG +  A  +F  + 
Subjt:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLND--RVKTALVDMYAKCGSVHVARSLFELLK

Query:  EKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIME
         K + SWNAMI G+AMHG A+ + DLF  M++I +  D ITFVG+L+ACS  G L  G+  FR+M  DY + P ++HY CMIDLLGH G  +EA  +I  
Subjt:  EKRVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIME

Query:  MRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPES
        M +EPD  +W +LL +CK+HGNVELGE     L+++EP+N G+YV+LSN+YA AG+W  VA+ R ++ D+G+KK   CS IE+ + VH F   D  HP +
Subjt:  MRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPES

Query:  EAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFK
          IY  L+ +  L+++AG+ P    V  ++E++ K   +  HSE+LAIA+GLIST  G++L I+KNLRVC +CH A K ISKI KREI  RD  R+HHF+
Subjt:  EAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFK

Query:  DGLCSCGDFW
        DG+CSC D+W
Subjt:  DGLCSCGDFW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-13439.97Show/hide
Query:  YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCV---CNSLINAHLLFDRISKRS---------------------------------L
        + S+L+SC +   +  G+ +H  I ++G+  +      L+N+Y       S I+   +FD + +R+                                 +
Subjt:  YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCV---CNSLINAHLLFDRISKRS---------------------------------L

Query:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD
          +N +I GYA +G YE A+ +  +M      PD FT   VL   S    + +GK+IH +VI  G+++DV++G++L+DMYAK   +  +++VF ++  RD
Subjt:  FLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERD

Query:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEK
         + WNS++A Y QNG+ ++ L L R+M  A V P        I A A    L  GK+LHGY  R GFG N  + +ALVDMY+KCG++  AR +F+ +   
Subjt:  VVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEK

Query:  RVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMR
          VSW A+I G+A+HGH ++A+ LF+EMK   +  + + FV VL ACS VG + +   YF SM   Y +   ++HY  + DLLG  G LEEAYN I +M 
Subjt:  RVVSWNAMISGYAMHGHANQALDLFKEMKEIALL-DHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMR

Query:  VEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEA
        VEP   VW  LL SC +H N+EL E    K+  ++ +N G YV++ NMYA  G+W+ +A+LR  M  +GL+K  ACSWIE++NK H F S D SHP  + 
Subjt:  VEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEA

Query:  IYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDG
        I   LK + + M++ GY      V HDV+++ K +++ GHSERLA+A+G+I+T  G+ + + KN+R+C DCHVAIKFISKIT+REI +RD +R+HHF  G
Subjt:  IYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDG

Query:  LCSCGDFW
         CSCGD+W
Subjt:  LCSCGDFW

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-13941.51Show/hide
Query:  LSSPSRSSRFYSSSSRNQ----STKTHSVRPSPRPALGPKSPASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHL
        L++PS SS   +  S NQ      K   ++ + R      SP+   Y  L+  CG + ++    ++H  I   G   +P LATKLI +Y    S+  A  
Subjt:  LSSPSRSSRFYSSSSRNQ----STKTHSVRPSPRPALGPKSPASY-YASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHL

Query:  LFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSA----LSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCV
        +FD+  KR++++WN + R     G  E  + LY++M   G   D+FT+ +VLKAC A    ++ + +GK+IH H+   G  + V++   L+DMYA+ GCV
Subjt:  LFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSA----LSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCV

Query:  VSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREM--AYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKC
          A  VF  M  R+VV W++M+A Y++NG+  + L   REM       +P   T+V  + A A    L QGK +HGY  R G      V +ALV MY +C
Subjt:  VSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREM--AYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKC

Query:  GSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEM-KEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLG
        G + V + +F+ + ++ VVSWN++IS Y +HG+  +A+ +F+EM    A    +TFV VL ACS  G + +GK  F +M  D+ I+P ++HY CM+DLLG
Subjt:  GSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEM-KEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLG

Query:  HCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNK
            L+EA  ++ +MR EP   VWG+LL SC+IHGNVEL E A R+L  LEP N GNYV+L+++YA+A  W+ V R++ ++  RGL+K     W+EVR K
Subjt:  HCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNK

Query:  VHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKR
        +++F S D  +P  E I+A L ++ + MKE GY PQ   V +++E +EK  +V GHSE+LA+A+GLI+T+ G  + I KNLR+CEDCH+  KFISK  ++
Subjt:  VHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKR

Query:  EISIRDVNRYHHFKDGLCSCGDFW
        EI +RDVNR+H FK+G+CSCGD+W
Subjt:  EISIRDVNRYHHFKDGLCSCGDFW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-13240.81Show/hide
Query:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTF
        S+   C   + I  G+ +H+   +            L+++Y  C  L +A  +F  +S RS+  +  MI GYA  G    A+ L+ +M++ G  PD +T 
Subjt:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLINAHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTF

Query:  PFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCR-EMAYAGVNPTEG
          VL  C+    ++EGK++HE +    L  D+FV  AL+DMYAKCG +  A+ VF +M  +D++ WN+++  YS+N   ++ LSL    +     +P E 
Subjt:  PFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCR-EMAYAGVNPTEG

Query:  TLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIAL-LDH
        T+   + A A      +G+E+HGY  R+G+  +  V  +LVDMYAKCG++ +A  LF+ +  K +VSW  MI+GY MHG   +A+ LF +M++  +  D 
Subjt:  TLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMKEIAL-LDH

Query:  ITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPD
        I+FV +L ACS  G + +G  +F  M  +  I PTV+HY C++D+L   G L +AY  I  M + PDA +WGALL  C+IH +V+L E    K+ ELEP+
Subjt:  ITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKLVELEPD

Query:  NGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMV
        N G YV+++N+YA+A KWE V RLR  +  RGL+K+  CSWIE++ +V+ F + D+S+PE+E I A L+++   M E GY+P       D E+ EK + +
Subjt:  NGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDDEKADMV

Query:  CGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW
        CGHSE+LA+A G+IS+  G  + + KNLRVC DCH   KF+SK+T+REI +RD NR+H FKDG CSC  FW
Subjt:  CGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-13241.94Show/hide
Query:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVC----NSLINAHLLFDRISKR-SLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFV-
        +LLQ+ GV  +I   +Q+HA   + G+  +     K +  Y V       +  AH +F +I K  ++F+WN +IRGYA  G    A SLY +M+  G V 
Subjt:  SLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVC----NSLINAHLLFDRISKR-SLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFV-

Query:  PDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGV
        PD  T+PF++KA + ++ +  G+ IH  VI +G  + ++V  +L+ +YA CG V SA +VFDKM E+D+V WNS++  +++NG+P++ L+L  EM   G+
Subjt:  PDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMVERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGV

Query:  NPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMK--E
         P   T+V  +SA A    L  GK +H Y  + G   N      L+D+YA+CG V  A++LF+ + +K  VSW ++I G A++G   +A++LFK M+  E
Subjt:  NPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNAMISGYAMHGHANQALDLFKEMK--E

Query:  IALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKL
          L   ITFVG+L ACS  G + +G  YFR M  +Y I P ++H+ CM+DLL   G +++AY  I  M ++P+  +W  LL +C +HG+ +L E A  ++
Subjt:  IALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIHGNVELGELALRKL

Query:  VELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDD
        ++LEP++ G+YV+LSNMYA   +W  V ++R  M+  G+KK    S +EV N+VH F   D SHP+S+AIYA+LK +   ++  GY PQI +V+ DVE++
Subjt:  VELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDVEDD

Query:  EKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW
        EK + V  HSE++AIA+ LIST   S + ++KNLRVC DCH+AIK +SK+  REI +RD +R+HHFK+G CSC D+W
Subjt:  EKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAATTTCTTCGTGTTTGTCTTTCTCCTACTTCAAATCTTTATTCTCTACTTTATCTTCTCCGTCTCGATCGTCCCGTTTCTACTCTTCTTCATCTCGGAACCAGAG
CACTAAAACTCATTCAGTCCGGCCATCTCCAAGACCAGCTCTCGGGCCTAAGTCGCCGGCGTCTTATTATGCCTCTCTCCTCCAATCTTGCGGCGTTCAAAAGGCGATAG
AGCCTGGAAAGCAGCTTCACGCTCGGATTTGGCAAATGGGTCTTGGATTCAACCCTCTTTTGGCTACTAAACTAATCAATCTTTACTGCGTCTGCAACTCTTTGATAAAT
GCTCATCTCTTGTTCGATAGAATTTCCAAACGGAGTCTCTTCCTTTGGAATGTTATGATACGAGGTTATGCTTGGAATGGACCATATGAGGTTGCAATTTCACTTTACTA
TCAAATGCAGGATTATGGATTTGTGCCCGATAAATTTACCTTTCCATTTGTGCTCAAAGCTTGCTCGGCTCTTTCTGCAATGGAAGAGGGAAAGAAGATTCACGAACATG
TCATAGGCGCTGGATTGGAGACTGATGTGTTTGTGGGTGCAGCCCTGATTGATATGTATGCTAAGTGTGGTTGTGTAGTGAGTGCCCAACAGGTGTTTGATAAAATGGTT
GAGAGAGATGTTGTGTGTTGGAACTCCATGCTTGCAGCTTATTCCCAAAATGGGCAACCAGATGATTGTCTTTCTCTTTGTAGGGAGATGGCGTATGCTGGAGTGAACCC
CACCGAGGGGACGCTTGTCATCTCCATCTCAGCTTCAGCTGATAGTGTCCTTCTTCCTCAAGGGAAAGAGCTTCATGGTTATAGTTGGAGACATGGATTTGGGTTGAATG
ACAGGGTGAAAACTGCATTGGTGGATATGTATGCCAAGTGTGGCTCTGTACATGTTGCTAGGAGTCTGTTTGAGTTGTTGAAGGAGAAGAGGGTAGTCTCTTGGAACGCC
ATGATCTCTGGGTATGCAATGCATGGCCATGCTAATCAAGCTTTGGACTTGTTTAAGGAGATGAAAGAGATAGCTCTCCTTGATCATATAACTTTTGTTGGAGTTCTGGC
TGCTTGCAGCCAGGTGGGTTGCTTAATCAAGGGGAAGATGTACTTCAGATCAATGTTAAGTGATTACATTATACGTCCTACTGTTCAACATTACACTTGTATGATTGATC
TACTTGGTCATTGTGGTCACTTGGAAGAAGCTTACAACCTCATAATGGAAATGAGAGTAGAGCCAGATGCTGGTGTGTGGGGTGCCTTGCTTCACTCGTGCAAAATCCAT
GGTAATGTGGAGTTGGGCGAGCTAGCGTTAAGGAAGTTGGTTGAGCTCGAACCTGACAATGGCGGGAACTATGTGATTTTGTCGAACATGTACGCGCAAGCAGGTAAATG
GGAAGGAGTTGCAAGACTAAGGGATGTCATGATGGATAGGGGGTTGAAGAAAAGTATAGCTTGTAGTTGGATAGAAGTGAGGAACAAAGTCCATGCCTTCTCGTCAGAAG
ATACTTCGCATCCCGAGTCTGAAGCAATCTATGCCGAGCTAAAACGGATAGGGAAGTTGATGAAAGAGGCTGGCTATGCACCACAAATCGGGTCGGTTTTCCACGATGTC
GAAGACGATGAAAAGGCTGATATGGTGTGTGGTCACAGTGAAAGACTGGCCATTGCTTATGGACTCATCAGCACAGCTTTGGGAAGTAGGCTCTTGATTATCAAGAACCT
TAGAGTTTGTGAGGACTGTCATGTTGCGATTAAGTTCATATCGAAGATCACGAAGAGAGAAATAAGTATTAGAGATGTTAATCGTTATCATCACTTCAAAGATGGACTTT
GCTCCTGTGGTGATTTCTGGTTGATTTTGCTTCAAAACTTTCGAGACGGTTTTTCTGCAACATCTTTACAGTTTGAACATTTTGCGAGGGGGATCATATTGGAGAAGCAG
GTGTTGACGGTGGCGAAGGCCGTGGAGGACAAGATTGATGACGATATTGCAGCGCTAGATCGTCTGGACCTTGACGATTTGGAGGCTTTGAGGGAGCGAAGATTGCAGCA
GATGAAGAGGATGGCGGAGAAGCGTAGTCGATGGATCTCCCTTGGCCATGGCGAATACTCTGAGATTCCCGTCGAGAAGGACTTCTTCTCCGTAGTCAAGGCTAGTGATC
GCGTGGTCTGCCACTTTTACCGTGAAAATTGGCCCTGCAAGGTTATGGACAAGCACTTGAATATCTTGGCAAAGCAACATATTGAGACTCGTTTCGTAAAGATCAATGCC
GAGAAAAGTCCATTTCTGACTGAGAAGTTGAAGATCATTGTTCTTCCAACCCTTGCTCTTGTCAAAAATGCAAAAGTTGAAGACTATGTGGTGGGATTTGATGAGCTTGG
TGGAACTGACGAATTCAGCACAGAGGAATTGGAAGAGAGTTGGGGTAAAGAATTTGTCGCCGGTGGCTTTGGTGGCATTGCTGGTATAGTCTCCGGCTACCCACTTGATA
CTCTTCGAATCAGGCAACAGCGATCCATTTCTGGTTCTGCTTTCAAAATCTTTCGTGATATTATTGGAAACCATGGCCCTGCTGGTCTTTACAGAGGCATGGCTGCTCCC
TTGGCCTCTGTCACTTTCCAGAACGCTGCTGTGTTTCAAATCTACGCCGTTCTTTCTCGGGCATTCAACTTCTCCCAATCCAACGGTGACCCTCCGAGCTATAAGGCTGT
GGCTCTTGGAGGATTTGTCACCGGTGCTCTTCAGAGCTTCATTCTGTCACCGGTGGAGCTGGTGAAAATCCACCTGCAATTGCAAAGTTCTTCTTCTAACAAAGGCCCTG
TGAGCGTAGCCAAACGCATCTGCAAAACAGAAGGATTGAGAGGGGTTTACAGAGGACTTACCATAACCATGCTCAGAGATGCACCGGCACATGGCATCTATTTCTGGACA
TATGAGTTCATGAGAGAGCAGTTTCATCCTGGCTGCCGGAAGACCGGCCAAGAGAGTGTCAGCACAATGCTGTTCGCCGGAGGGCTCGCCGGAGTTGGCAGCTGGGTTGT
CTGTTATCCCTTGGATGTGTTGAAGACAAGAATTCAGGGTGAGACACAAAGCTCATCTGGAAAGTATAATGGAATTGTTGATTGTCTTGGCAAGAGTGTGAGAGAAGAGG
GTTACAGAGTGCTGTGGCGAGGGCTGGGGACTGCGGTGGCTAGAGCATTTGTGGTTAATGGGGCCGTCTTTGCTGCATATGAGATTACTTTGAGGTGTTTGTTTAGCAAT
CAAGCTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGAATTTCTTCGTGTTTGTCTTTCTCCTACTTCAAATCTTTATTCTCTACTTTATCTTCTCCGTCTCGATCGTCCCGTTTCTACTCTTCTTCATCTCGGAACCAGAG
CACTAAAACTCATTCAGTCCGGCCATCTCCAAGACCAGCTCTCGGGCCTAAGTCGCCGGCGTCTTATTATGCCTCTCTCCTCCAATCTTGCGGCGTTCAAAAGGCGATAG
AGCCTGGAAAGCAGCTTCACGCTCGGATTTGGCAAATGGGTCTTGGATTCAACCCTCTTTTGGCTACTAAACTAATCAATCTTTACTGCGTCTGCAACTCTTTGATAAAT
GCTCATCTCTTGTTCGATAGAATTTCCAAACGGAGTCTCTTCCTTTGGAATGTTATGATACGAGGTTATGCTTGGAATGGACCATATGAGGTTGCAATTTCACTTTACTA
TCAAATGCAGGATTATGGATTTGTGCCCGATAAATTTACCTTTCCATTTGTGCTCAAAGCTTGCTCGGCTCTTTCTGCAATGGAAGAGGGAAAGAAGATTCACGAACATG
TCATAGGCGCTGGATTGGAGACTGATGTGTTTGTGGGTGCAGCCCTGATTGATATGTATGCTAAGTGTGGTTGTGTAGTGAGTGCCCAACAGGTGTTTGATAAAATGGTT
GAGAGAGATGTTGTGTGTTGGAACTCCATGCTTGCAGCTTATTCCCAAAATGGGCAACCAGATGATTGTCTTTCTCTTTGTAGGGAGATGGCGTATGCTGGAGTGAACCC
CACCGAGGGGACGCTTGTCATCTCCATCTCAGCTTCAGCTGATAGTGTCCTTCTTCCTCAAGGGAAAGAGCTTCATGGTTATAGTTGGAGACATGGATTTGGGTTGAATG
ACAGGGTGAAAACTGCATTGGTGGATATGTATGCCAAGTGTGGCTCTGTACATGTTGCTAGGAGTCTGTTTGAGTTGTTGAAGGAGAAGAGGGTAGTCTCTTGGAACGCC
ATGATCTCTGGGTATGCAATGCATGGCCATGCTAATCAAGCTTTGGACTTGTTTAAGGAGATGAAAGAGATAGCTCTCCTTGATCATATAACTTTTGTTGGAGTTCTGGC
TGCTTGCAGCCAGGTGGGTTGCTTAATCAAGGGGAAGATGTACTTCAGATCAATGTTAAGTGATTACATTATACGTCCTACTGTTCAACATTACACTTGTATGATTGATC
TACTTGGTCATTGTGGTCACTTGGAAGAAGCTTACAACCTCATAATGGAAATGAGAGTAGAGCCAGATGCTGGTGTGTGGGGTGCCTTGCTTCACTCGTGCAAAATCCAT
GGTAATGTGGAGTTGGGCGAGCTAGCGTTAAGGAAGTTGGTTGAGCTCGAACCTGACAATGGCGGGAACTATGTGATTTTGTCGAACATGTACGCGCAAGCAGGTAAATG
GGAAGGAGTTGCAAGACTAAGGGATGTCATGATGGATAGGGGGTTGAAGAAAAGTATAGCTTGTAGTTGGATAGAAGTGAGGAACAAAGTCCATGCCTTCTCGTCAGAAG
ATACTTCGCATCCCGAGTCTGAAGCAATCTATGCCGAGCTAAAACGGATAGGGAAGTTGATGAAAGAGGCTGGCTATGCACCACAAATCGGGTCGGTTTTCCACGATGTC
GAAGACGATGAAAAGGCTGATATGGTGTGTGGTCACAGTGAAAGACTGGCCATTGCTTATGGACTCATCAGCACAGCTTTGGGAAGTAGGCTCTTGATTATCAAGAACCT
TAGAGTTTGTGAGGACTGTCATGTTGCGATTAAGTTCATATCGAAGATCACGAAGAGAGAAATAAGTATTAGAGATGTTAATCGTTATCATCACTTCAAAGATGGACTTT
GCTCCTGTGGTGATTTCTGGTTGATTTTGCTTCAAAACTTTCGAGACGGTTTTTCTGCAACATCTTTACAGTTTGAACATTTTGCGAGGGGGATCATATTGGAGAAGCAG
GTGTTGACGGTGGCGAAGGCCGTGGAGGACAAGATTGATGACGATATTGCAGCGCTAGATCGTCTGGACCTTGACGATTTGGAGGCTTTGAGGGAGCGAAGATTGCAGCA
GATGAAGAGGATGGCGGAGAAGCGTAGTCGATGGATCTCCCTTGGCCATGGCGAATACTCTGAGATTCCCGTCGAGAAGGACTTCTTCTCCGTAGTCAAGGCTAGTGATC
GCGTGGTCTGCCACTTTTACCGTGAAAATTGGCCCTGCAAGGTTATGGACAAGCACTTGAATATCTTGGCAAAGCAACATATTGAGACTCGTTTCGTAAAGATCAATGCC
GAGAAAAGTCCATTTCTGACTGAGAAGTTGAAGATCATTGTTCTTCCAACCCTTGCTCTTGTCAAAAATGCAAAAGTTGAAGACTATGTGGTGGGATTTGATGAGCTTGG
TGGAACTGACGAATTCAGCACAGAGGAATTGGAAGAGAGTTGGGGTAAAGAATTTGTCGCCGGTGGCTTTGGTGGCATTGCTGGTATAGTCTCCGGCTACCCACTTGATA
CTCTTCGAATCAGGCAACAGCGATCCATTTCTGGTTCTGCTTTCAAAATCTTTCGTGATATTATTGGAAACCATGGCCCTGCTGGTCTTTACAGAGGCATGGCTGCTCCC
TTGGCCTCTGTCACTTTCCAGAACGCTGCTGTGTTTCAAATCTACGCCGTTCTTTCTCGGGCATTCAACTTCTCCCAATCCAACGGTGACCCTCCGAGCTATAAGGCTGT
GGCTCTTGGAGGATTTGTCACCGGTGCTCTTCAGAGCTTCATTCTGTCACCGGTGGAGCTGGTGAAAATCCACCTGCAATTGCAAAGTTCTTCTTCTAACAAAGGCCCTG
TGAGCGTAGCCAAACGCATCTGCAAAACAGAAGGATTGAGAGGGGTTTACAGAGGACTTACCATAACCATGCTCAGAGATGCACCGGCACATGGCATCTATTTCTGGACA
TATGAGTTCATGAGAGAGCAGTTTCATCCTGGCTGCCGGAAGACCGGCCAAGAGAGTGTCAGCACAATGCTGTTCGCCGGAGGGCTCGCCGGAGTTGGCAGCTGGGTTGT
CTGTTATCCCTTGGATGTGTTGAAGACAAGAATTCAGGGTGAGACACAAAGCTCATCTGGAAAGTATAATGGAATTGTTGATTGTCTTGGCAAGAGTGTGAGAGAAGAGG
GTTACAGAGTGCTGTGGCGAGGGCTGGGGACTGCGGTGGCTAGAGCATTTGTGGTTAATGGGGCCGTCTTTGCTGCATATGAGATTACTTTGAGGTGTTTGTTTAGCAAT
CAAGCTCTCTGAATTGCCCACATTTCCCACGGTCACTAAAATGAGAATGAATTCGTCTCTTTTTTACTTTTGATATTGTTAGTTTTAAAGTCAAATATCTTGGTCAATAG
TTATAATAGTAATCATACATAGTAAATGATAATGTATAGAAAAGAGTGGTGTCTGTTGTTGTGCAATTCTCTT
Protein sequenceShow/hide protein sequence
MRISSCLSFSYFKSLFSTLSSPSRSSRFYSSSSRNQSTKTHSVRPSPRPALGPKSPASYYASLLQSCGVQKAIEPGKQLHARIWQMGLGFNPLLATKLINLYCVCNSLIN
AHLLFDRISKRSLFLWNVMIRGYAWNGPYEVAISLYYQMQDYGFVPDKFTFPFVLKACSALSAMEEGKKIHEHVIGAGLETDVFVGAALIDMYAKCGCVVSAQQVFDKMV
ERDVVCWNSMLAAYSQNGQPDDCLSLCREMAYAGVNPTEGTLVISISASADSVLLPQGKELHGYSWRHGFGLNDRVKTALVDMYAKCGSVHVARSLFELLKEKRVVSWNA
MISGYAMHGHANQALDLFKEMKEIALLDHITFVGVLAACSQVGCLIKGKMYFRSMLSDYIIRPTVQHYTCMIDLLGHCGHLEEAYNLIMEMRVEPDAGVWGALLHSCKIH
GNVELGELALRKLVELEPDNGGNYVILSNMYAQAGKWEGVARLRDVMMDRGLKKSIACSWIEVRNKVHAFSSEDTSHPESEAIYAELKRIGKLMKEAGYAPQIGSVFHDV
EDDEKADMVCGHSERLAIAYGLISTALGSRLLIIKNLRVCEDCHVAIKFISKITKREISIRDVNRYHHFKDGLCSCGDFWLILLQNFRDGFSATSLQFEHFARGIILEKQ
VLTVAKAVEDKIDDDIAALDRLDLDDLEALRERRLQQMKRMAEKRSRWISLGHGEYSEIPVEKDFFSVVKASDRVVCHFYRENWPCKVMDKHLNILAKQHIETRFVKINA
EKSPFLTEKLKIIVLPTLALVKNAKVEDYVVGFDELGGTDEFSTEELEESWGKEFVAGGFGGIAGIVSGYPLDTLRIRQQRSISGSAFKIFRDIIGNHGPAGLYRGMAAP
LASVTFQNAAVFQIYAVLSRAFNFSQSNGDPPSYKAVALGGFVTGALQSFILSPVELVKIHLQLQSSSSNKGPVSVAKRICKTEGLRGVYRGLTITMLRDAPAHGIYFWT
YEFMREQFHPGCRKTGQESVSTMLFAGGLAGVGSWVVCYPLDVLKTRIQGETQSSSGKYNGIVDCLGKSVREEGYRVLWRGLGTAVARAFVVNGAVFAAYEITLRCLFSN
QAL