; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033668 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033668
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:984218..986314
RNA-Seq ExpressionLag0033668
SyntenyLag0033668
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589416.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.84Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS VACLP  SVTSIT + QFPENPK+LILQRCKTPKDL Q+HAHLLKTRR  DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR  EQVHALILKSG K NEFVENTLIHMYANCG++GVARQVFDGMS+R  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYI+SKG+  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL LFHEMQKAKVD NEVTMVS LYSCA+LGAYETGKWVH Y+K+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVFR MPF NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDE+MK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGYVPNMEDARLEA EEESKETS+SHHSEKLAIAYGL+RTPL+TTIRISKNLRMCRDCHNATKVIS+V++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

KAG7023094.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0091.13Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS VACLP  SVTSIT + QFPENPK+LILQRCKTPKDL Q+HAHLLKTRR+ DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR+ EQVHALILKSG K NEFVENTLIHMYANCG++GVARQVFDGMS+R  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYI+SKG+  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL LFHEMQKAKVD NEVTMVSVLYSCA+LGAYETGKWVH Y+K+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVFR MPF NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGYVPNMEDARLEA EEESKETS+SHHSEKLAIAYGL+RTPL+TTIRISKNLRMCRDCHNATKVIS+V++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_022135298.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Momordica charantia]0.0e+0091.69Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS +AC PA SVT+IT I QFPENPKTLILQ+CKTPKDLHQ+HAHL+KTRR+LDPTITEAVLESAALLLPNTIDYALSIFNHID+PESSAYNVMIRGL+
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNAFLLFKKMHENSVEHD FTFSCVLKACSRMRALR+ EQVHA ILKSG KPNEFVENTLIHMYANCGE+G+AR+VFDGMSERG++AWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNG+W EVVKLF+ MLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT N +LTTSLIDMYAKCGRVDTA KLFD+M KRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFY+KKKKMKLTVTLGTQLIDFYAKCGY D SVEVFR+MP +NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGK+ALDFFSLMREN+VKPNDVTFI +LSACSHACLVDQGRHLFNSM RDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYA AGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIH+ALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RI+ LGYVPN+EDARLEAEE+SKETS+SHHSEKLAIAYGL+RTP RT IRISKNLRMCRDCHNATKVISRVF+R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_022921781.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]0.0e+0091.27Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS VACLP  SVTSIT + QFPENPK+LILQRCKTPKDL Q+HAHLLKTRR+ DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR+ EQVHALILKSG KPNEFVENTLIHMYANCG++GVARQVFDGMS+R  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYI+SKG+  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL LFHEMQKAKVD NEVTMVSVLYSCA+LGAYETGKWVH Y+K+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVFR MPF NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGYVPNMEDARLEA EEESKETS+SHHSEKLAIAYGL+RTPL+TTIRISKNLRMCRDCHNATKVIS+V++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

XP_022987229.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima]0.0e+0089.99Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS V CLP TSVTSI  + QFPENPK+LILQ+CKTPKDL Q+HAHLLKTRR+ DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR+ EQVHALILKSG K NEFVENTLIHMYANCG++GVARQVFDGMSER  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGL  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQAD+CKEAL LFHEMQKAKVD NEVTMVSVLYSCA+LGAYETGKWVH Y+K+KKM+LTV+LGTQLIDFYAKCGYIDRSVEVFR M F NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEG+MALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLL+EAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGY+PNMEDARLEA EEESKETS+ HHSEKLAIAYGL+RTPL+TTIRISKNLR+CRDCHNATK+ISRV++R IIVRDRNRFHHF+DGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

TrEMBL top hitse value%identityAlignment
A0A1S4E4P2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0088.68Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS V CLP TS+TSITQI QFPENPK+LILQ+CKTPKDL Q+HAHLLKTRR+LDP ITEAVLESAALLLP+TIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FK+SP NA LLFKKMHENSV+HD+FTFS VLKACSRMR L++ EQVHALILKSG K NEFVENTLI MYANCG++GVAR VFDGM ERGIVAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+ +LEL+I FDDVTMISVLMACGRLA+LE+GELIGEYIVSKGL  N+TL TSLIDMYAKCGR+DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQK  VDPNEVTMVSVLYSCAMLGAY+TGKWVHFY+KKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVF++M FKNVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMAL+FFSLM ENDVKPNDVTFI +LSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAG LEEAYQFID+MP PPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASCRAHKN+EMAEKSLEHIT+LEP HSGDYILLSNTYAL GRVEDA+RVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEH HSKEIHDALD+MMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        +IK+LGYVPN+E ARLEAEEE+KETS+SHHSEKLAIAYGL+RT  RTTIRISKNLRMC DCHNATK IS+ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A5D3BFH8 Pentatricopeptide repeat-containing protein0.0e+0088.68Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS V CLP TS+TSITQI QFPENPK+LILQ+CKTPKDL Q+HAHLLKTRR+LDP ITEAVLESAALLLP+TIDYALSIFNHIDKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FK+SP NA LLFKKMHENSV+HD+FTFS VLKACSRMR L++ EQVHALILKSG K NEFVENTLI MYANCG++GVAR VFDGM ERGIVAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+ +LEL+I FDDVTMISVLMACGRLA+LE+GELIGEYIVSKGL  N+TL TSLIDMYAKCGR+DTARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQK  VDPNEVTMVSVLYSCAMLGAY+TGKWVHFY+KKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVF++M FKNVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMAL+FFSLM ENDVKPNDVTFI +LSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAG LEEAYQFID+MP PPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASCRAHKN+EMAEKSLEHIT+LEP HSGDYILLSNTYAL GRVEDA+RVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEH HSKEIHDALD+MMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        +IK+LGYVPN+E ARLEAEEE+KETS+SHHSEKLAIAYGL+RT  RTTIRISKNLRMC DCHNATK IS+ FERMIIVRDRNRFHHFKDGLCSC DYW
Subjt:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A6J1C4F8 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0091.69Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS +AC PA SVT+IT I QFPENPKTLILQ+CKTPKDLHQ+HAHL+KTRR+LDPTITEAVLESAALLLPNTIDYALSIFNHID+PESSAYNVMIRGL+
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNAFLLFKKMHENSVEHD FTFSCVLKACSRMRALR+ EQVHA ILKSG KPNEFVENTLIHMYANCGE+G+AR+VFDGMSERG++AWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNG+W EVVKLF+ MLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT N +LTTSLIDMYAKCGRVDTA KLFD+M KRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL+LFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFY+KKKKMKLTVTLGTQLIDFYAKCGY D SVEVFR+MP +NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGK+ALDFFSLMREN+VKPNDVTFI +LSACSHACLVDQGRHLFNSM RDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYA AGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIH+ALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RI+ LGYVPN+EDARLEAEE+SKETS+SHHSEKLAIAYGL+RTP RT IRISKNLRMCRDCHNATKVISRVF+R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A6J1E2C2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0091.27Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS VACLP  SVTSIT + QFPENPK+LILQRCKTPKDL Q+HAHLLKTRR+ DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR+ EQVHALILKSG KPNEFVENTLIHMYANCG++GVARQVFDGMS+R  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYI+SKG+  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQADRCKEAL LFHEMQKAKVD NEVTMVSVLYSCA+LGAYETGKWVH Y+K+KKMKLTV+LGTQLIDFYAKCGYIDRSVEVFR MPF NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGYVPNMEDARLEA EEESKETS+SHHSEKLAIAYGL+RTPL+TTIRISKNLRMCRDCHNATKVIS+V++R IIVRDRNRFHHFKDGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

A0A6J1JIA5 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0089.99Show/hide
Query:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA
        MAS V CLP TSVTSI  + QFPENPK+LILQ+CKTPKDL Q+HAHLLKTRR+ DPTI EAVLESAALLLPN+IDYALSIFNH+DKPESSAYNVMIRGLA
Subjt:  MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLA

Query:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY
        FKQSPHNA LLFKKMHENSV+HD+FTFS VLKACSRMRALR+ EQVHALILKSG K NEFVENTLIHMYANCG++GVARQVFDGMSER  VAWNSMLSGY
Subjt:  FKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGY

Query:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFR MLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGL  NSTLTTSLIDMYAKCG+VDTARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISG

Query:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ
        YAQAD+CKEAL LFHEMQKAKVD NEVTMVSVLYSCA+LGAYETGKWVH Y+K+KKM+LTV+LGTQLIDFYAKCGYIDRSVEVFR M F NVFTWTALIQ
Subjt:  YAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEG+MALDFF+LMREN+VKPNDVTFIA+LSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLL+EAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK
        LLASC+AHKNVEMAEKS +HIT LEPAHSGDYILLSNTYAL GRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEMMK
Subjt:  LLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMK

Query:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        RIKSLGY+PNMEDARLEA EEESKETS+ HHSEKLAIAYGL+RTPL+TTIRISKNLR+CRDCHNATK+ISRV++R IIVRDRNRFHHF+DGLCSCNDYW
Subjt:  RIKSLGYVPNMEDARLEA-EEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148203.4e-14837.11Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHI-DKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC
        L  CK+   + Q+HAH+L+T  +++  +   +   +       + YAL++F+ I   PES  +N  +R L+    P    L ++++       D+F+F  
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHI-DKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI
        +LKA S++ AL +  ++H +  K     + FVE   + MYA+CG +  AR VFD MS R +V WN+M+  Y + GL DE  KLF  M + ++  D++ + 
Subjt:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYA-------------------------------KCGRVDTARKLFDEMDKRDVVAWSAMI
        +++ ACGR  ++     I E+++   + +++ L T+L+ MYA                               KCGR+D A+ +FD+ +K+D+V W+ MI
Subjt:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYA-------------------------------KCGRVDTARKLFDEMDKRDVVAWSAMI

Query:  SGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTAL
        S Y ++D  +EAL +F EM  + + P+ V+M SV+ +CA LG  +  KWVH  +    ++  +++   LI+ YAKCG +D + +VF  MP +NV +W+++
Subjt:  SGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTAL

Query:  IQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW
        I  L+ +GE   AL  F+ M++ +V+PN+VTF+ +L  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA LL EA + I++MP+  N V+W
Subjt:  IQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW

Query:  RTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM
         +L+++CR H  +E+ + + + I +LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  H  S EI+  LDE+
Subjt:  RTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM

Query:  MKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRT------TIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLC
        + ++K  GYVP+     ++ EEE K+  +  HSEKLA+ +GL+             IRI KNLR+C DCH   K++S+V+ER IIVRDR RFH +K+GLC
Subjt:  MKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRT------TIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLC

Query:  SCNDYW
        SC DYW
Subjt:  SCNDYW

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic6.6e-16039.69Show/hide
Query:  ILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKM-HENSVEHDEFTFS
        +++RC + + L Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S A+N +IR  A    P  +   F  M  E+    +++TF 
Subjt:  ILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKM-HENSVEHDEFTFS

Query:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTM
         ++KA + + +L   + +H + +KS    + FV N+LIH Y +CG+L  A +VF  + E+ +V+WNSM++G+ + G  D+ ++LF+ M    ++   VTM
Subjt:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTM

Query:  ISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMD-------------------------------KRDVVAWSAM
        + VL AC ++ +LE G  +  YI    + +N TL  +++DMY KCG ++ A++LFD M+                               ++D+VAW+A+
Subjt:  ISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMD-------------------------------KRDVVAWSAM

Query:  ISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWT
        IS Y Q  +  EAL +FHE+Q  K +  N++T+VS L +CA +GA E G+W+H Y+KK  +++   + + LI  Y+KCG +++S EVF  +  ++VF W+
Subjt:  ISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWT

Query:  ALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAV
        A+I GLA +G G  A+D F  M+E +VKPN VTF  +  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+G LE+A +FI+ MPIPP+  
Subjt:  ALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAV

Query:  VWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALD
        VW  LL +C+ H N+ +AE +   + +LEP + G ++LLSN YA  G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Subjt:  VWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALD

Query:  EMMKRIKSLGYVPNMEDA-RLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCN
        E+M+++KS GY P +    ++  EEE KE S++ HSEKLAI YGL+ T     IR+ KNLR+C DCH+  K+IS++++R IIVRDR RFHHF++G CSCN
Subjt:  EMMKRIKSLGYVPNMEDA-RLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCN

Query:  DYW
        D+W
Subjt:  DYW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665206.8e-14939.25Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILDP-TITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC
        LQRC   ++L QIHA +LKT  + D   IT+ +    +    + + YA  +F+  D+P++  +N+MIRG +    P  + LL+++M  +S  H+ +TF  
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILDP-TITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI
        +LKACS + A  +  Q+HA I K G + + +  N+LI+ YA  G   +A  +FD + E   V+WNS++ G                              
Subjt:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTM
                                                Y K G++D A  LF +M +++ ++W+ MISGY QAD  KEAL LFHEMQ + V+P+ V++
Subjt:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTM

Query:  VSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVF+++  K+V  WTALI G A +G G+ A+  F  M++  +KPN +T
Subjt:  VSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVT

Query:  FIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHS
        F A+L+ACS+  LV++G+ +F SM RD++++P IEHYGC+VD+LGRAGLL+EA +FI  MP+ PNAV+W  LL +CR HKN+E+ E+  E +  ++P H 
Subjt:  FIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHS

Query:  GDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLE-AEEESKETSMS
        G Y+  +N +A+  + + A   R L+KE+ + K PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP +E+  L+  +++ +E  + 
Subjt:  GDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLE-AEEESKETSMS

Query:  HHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
         HSEKLAI YGL++T   T IRI KNLR+C+DCH  TK+IS++++R I++RDR RFHHF+DG CSC DYW
Subjt:  HHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.0e-16439.57Show/hide
Query:  VACLPATSVTSITQIHQFP-----------ENPKTLILQRCKTPKDLHQIHAHLLK-----TRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPE
        ++C P T  +S    H  P            +P   +L  CKT + L  IHA ++K     T   L   I   +L      LP    YA+S+F  I +P 
Subjt:  VACLPATSVTSITQIHQFP-----------ENPKTLILQRCKTPKDLHQIHAHLLK-----TRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPE

Query:  SSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFD-----
           +N M RG A    P +A  L+  M    +  + +TF  VLK+C++ +A ++ +Q+H  +LK GC  + +V  +LI MY   G L  A +VFD     
Subjt:  SSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFD-----

Query:  ----------GMSERG----------------IVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT
                  G + RG                +V+WN+M+SGY + G + E ++LF+ M++ ++  D+ TM++V+ AC +   +ELG  +  +I   G  
Subjt:  ----------GMSERG----------------IVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT

Query:  INSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKM
         N  +  +LID+Y+KCG ++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM ++   PN+VTM+S+L +CA LGA + G+W+H Y+ K+  
Subjt:  INSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKM

Query:  KLT--VTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRR
         +T   +L T LID YAKCG I+ + +VF  +  K++ +W A+I G A +G    + D FS MR+  ++P+D+TF+ +LSACSH+ ++D GRH+F +M +
Subjt:  KLT--VTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRR

Query:  DFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLI
        D+ + P++EHYGCM+D+LG +GL +EA + I+ M + P+ V+W +LL +C+ H NVE+ E   E++ ++EP + G Y+LLSN YA AGR  +  + R+L+
Subjt:  DFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLI

Query:  KEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNL
         +K +KK PGCS IE+D VVHEF   D  H  ++EI+  L+EM   ++  G+VP+  +   E EEE KE ++ HHSEKLAIA+GL+ T   T + I KNL
Subjt:  KEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNL

Query:  RMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        R+CR+CH ATK+IS++++R II RDR RFHHF+DG+CSCNDYW
Subjt:  RMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226902.1e-14235.2Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILD-PTITEAVLESAALLLPNTIDYALSIFNHIDKPESS-AYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFS
        L+ CKT  +L   H  L K     D  TIT+ V  S  L    ++ +A  +F + +   +   YN +IRG A     + A LLF +M  + +  D++TF 
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILD-PTITEAVLESAALLLPNTIDYALSIFNHIDKPESS-AYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFS

Query:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKL-FRAMLELHIEFDDVT
          L AC++ RA  +  Q+H LI+K G   + FV+N+L+H YA CGEL  AR+VFD MSER +V+W SM+ GY +     + V L FR + +  +  + VT
Subjt:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKL-FRAMLELHIEFDDVT

Query:  MISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDE------------------------------------------
        M+ V+ AC +L DLE GE +  +I + G+ +N  + ++L+DMY KC  +D A++LFDE                                          
Subjt:  MISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDE------------------------------------------

Query:  ------------------------------------------------------------------------------------------MDKRDVVAWS
                                                                                                  M ++++V+W+
Subjt:  ------------------------------------------------------------------------------------------MDKRDVVAWS

Query:  AMISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFT
         +ISG  Q    +EA+ +F  MQ  + V+ + VTM+S+  +C  LGA +  KW+++Y++K  ++L V LGT L+D +++CG  + ++ +F  +  ++V  
Subjt:  AMISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFT

Query:  WTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPN
        WTA I  +A  G  + A++ F  M E  +KP+ V F+  L+ACSH  LV QG+ +F SM +   + P   HYGCMVD+LGRAGLLEEA Q I++MP+ PN
Subjt:  WTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPN

Query:  AVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDA
         V+W +LLA+CR   NVEMA  + E I  L P  +G Y+LLSN YA AGR  D  +VR  +KEK ++K PG S I++ G  HEF S D  H     I   
Subjt:  AVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDA

Query:  LDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSC
        LDE+ +R   LG+VP++ +  ++ +E+ K   +S HSEKLA+AYGL+ +   TTIRI KNLR+C DCH+  K  S+V+ R II+RD NRFH+ + G CSC
Subjt:  LDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSC

Query:  NDYW
         D+W
Subjt:  NDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-16539.57Show/hide
Query:  VACLPATSVTSITQIHQFP-----------ENPKTLILQRCKTPKDLHQIHAHLLK-----TRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPE
        ++C P T  +S    H  P            +P   +L  CKT + L  IHA ++K     T   L   I   +L      LP    YA+S+F  I +P 
Subjt:  VACLPATSVTSITQIHQFP-----------ENPKTLILQRCKTPKDLHQIHAHLLK-----TRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPE

Query:  SSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFD-----
           +N M RG A    P +A  L+  M    +  + +TF  VLK+C++ +A ++ +Q+H  +LK GC  + +V  +LI MY   G L  A +VFD     
Subjt:  SSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFD-----

Query:  ----------GMSERG----------------IVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT
                  G + RG                +V+WN+M+SGY + G + E ++LF+ M++ ++  D+ TM++V+ AC +   +ELG  +  +I   G  
Subjt:  ----------GMSERG----------------IVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLT

Query:  INSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKM
         N  +  +LID+Y+KCG ++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM ++   PN+VTM+S+L +CA LGA + G+W+H Y+ K+  
Subjt:  INSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKM

Query:  KLT--VTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRR
         +T   +L T LID YAKCG I+ + +VF  +  K++ +W A+I G A +G    + D FS MR+  ++P+D+TF+ +LSACSH+ ++D GRH+F +M +
Subjt:  KLT--VTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRR

Query:  DFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLI
        D+ + P++EHYGCM+D+LG +GL +EA + I+ M + P+ V+W +LL +C+ H NVE+ E   E++ ++EP + G Y+LLSN YA AGR  +  + R+L+
Subjt:  DFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLI

Query:  KEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNL
         +K +KK PGCS IE+D VVHEF   D  H  ++EI+  L+EM   ++  G+VP+  +   E EEE KE ++ HHSEKLAIA+GL+ T   T + I KNL
Subjt:  KEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNL

Query:  RMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
        R+CR+CH ATK+IS++++R II RDR RFHHF+DG+CSCNDYW
Subjt:  RMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.7e-16139.69Show/hide
Query:  ILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKM-HENSVEHDEFTFS
        +++RC + + L Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S A+N +IR  A    P  +   F  M  E+    +++TF 
Subjt:  ILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKM-HENSVEHDEFTFS

Query:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTM
         ++KA + + +L   + +H + +KS    + FV N+LIH Y +CG+L  A +VF  + E+ +V+WNSM++G+ + G  D+ ++LF+ M    ++   VTM
Subjt:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTM

Query:  ISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMD-------------------------------KRDVVAWSAM
        + VL AC ++ +LE G  +  YI    + +N TL  +++DMY KCG ++ A++LFD M+                               ++D+VAW+A+
Subjt:  ISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMD-------------------------------KRDVVAWSAM

Query:  ISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWT
        IS Y Q  +  EAL +FHE+Q  K +  N++T+VS L +CA +GA E G+W+H Y+KK  +++   + + LI  Y+KCG +++S EVF  +  ++VF W+
Subjt:  ISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWT

Query:  ALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAV
        A+I GLA +G G  A+D F  M+E +VKPN VTF  +  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+G LE+A +FI+ MPIPP+  
Subjt:  ALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAV

Query:  VWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALD
        VW  LL +C+ H N+ +AE +   + +LEP + G ++LLSN YA  G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Subjt:  VWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALD

Query:  EMMKRIKSLGYVPNMEDA-RLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCN
        E+M+++KS GY P +    ++  EEE KE S++ HSEKLAI YGL+ T     IR+ KNLR+C DCH+  K+IS++++R IIVRDR RFHHF++G CSCN
Subjt:  EMMKRIKSLGYVPNMEDA-RLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCN

Query:  DYW
        D+W
Subjt:  DYW

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification1.5e-14335.2Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILD-PTITEAVLESAALLLPNTIDYALSIFNHIDKPESS-AYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFS
        L+ CKT  +L   H  L K     D  TIT+ V  S  L    ++ +A  +F + +   +   YN +IRG A     + A LLF +M  + +  D++TF 
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILD-PTITEAVLESAALLLPNTIDYALSIFNHIDKPESS-AYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFS

Query:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKL-FRAMLELHIEFDDVT
          L AC++ RA  +  Q+H LI+K G   + FV+N+L+H YA CGEL  AR+VFD MSER +V+W SM+ GY +     + V L FR + +  +  + VT
Subjt:  CVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKL-FRAMLELHIEFDDVT

Query:  MISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDE------------------------------------------
        M+ V+ AC +L DLE GE +  +I + G+ +N  + ++L+DMY KC  +D A++LFDE                                          
Subjt:  MISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDE------------------------------------------

Query:  ------------------------------------------------------------------------------------------MDKRDVVAWS
                                                                                                  M ++++V+W+
Subjt:  ------------------------------------------------------------------------------------------MDKRDVVAWS

Query:  AMISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFT
         +ISG  Q    +EA+ +F  MQ  + V+ + VTM+S+  +C  LGA +  KW+++Y++K  ++L V LGT L+D +++CG  + ++ +F  +  ++V  
Subjt:  AMISGYAQADRCKEALSLFHEMQKAK-VDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFT

Query:  WTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPN
        WTA I  +A  G  + A++ F  M E  +KP+ V F+  L+ACSH  LV QG+ +F SM +   + P   HYGCMVD+LGRAGLLEEA Q I++MP+ PN
Subjt:  WTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPN

Query:  AVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDA
         V+W +LLA+CR   NVEMA  + E I  L P  +G Y+LLSN YA AGR  D  +VR  +KEK ++K PG S I++ G  HEF S D  H     I   
Subjt:  AVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDA

Query:  LDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSC
        LDE+ +R   LG+VP++ +  ++ +E+ K   +S HSEKLA+AYGL+ +   TTIRI KNLR+C DCH+  K  S+V+ R II+RD NRFH+ + G CSC
Subjt:  LDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSC

Query:  NDYW
         D+W
Subjt:  NDYW

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-14937.11Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHI-DKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC
        L  CK+   + Q+HAH+L+T  +++  +   +   +       + YAL++F+ I   PES  +N  +R L+    P    L ++++       D+F+F  
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHI-DKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI
        +LKA S++ AL +  ++H +  K     + FVE   + MYA+CG +  AR VFD MS R +V WN+M+  Y + GL DE  KLF  M + ++  D++ + 
Subjt:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYA-------------------------------KCGRVDTARKLFDEMDKRDVVAWSAMI
        +++ ACGR  ++     I E+++   + +++ L T+L+ MYA                               KCGR+D A+ +FD+ +K+D+V W+ MI
Subjt:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYA-------------------------------KCGRVDTARKLFDEMDKRDVVAWSAMI

Query:  SGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTAL
        S Y ++D  +EAL +F EM  + + P+ V+M SV+ +CA LG  +  KWVH  +    ++  +++   LI+ YAKCG +D + +VF  MP +NV +W+++
Subjt:  SGYAQADRCKEALSLFHEMQKAKVDPNEVTMVSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTAL

Query:  IQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW
        I  L+ +GE   AL  F+ M++ +V+PN+VTF+ +L  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA LL EA + I++MP+  N V+W
Subjt:  IQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW

Query:  RTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM
         +L+++CR H  +E+ + + + I +LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  H  S EI+  LDE+
Subjt:  RTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM

Query:  MKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRT------TIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLC
        + ++K  GYVP+     ++ EEE K+  +  HSEKLA+ +GL+             IRI KNLR+C DCH   K++S+V+ER IIVRDR RFH +K+GLC
Subjt:  MKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRT------TIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLC

Query:  SCNDYW
        SC DYW
Subjt:  SCNDYW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-15039.25Show/hide
Query:  LQRCKTPKDLHQIHAHLLKTRRILDP-TITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC
        LQRC   ++L QIHA +LKT  + D   IT+ +    +    + + YA  +F+  D+P++  +N+MIRG +    P  + LL+++M  +S  H+ +TF  
Subjt:  LQRCKTPKDLHQIHAHLLKTRRILDP-TITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI
        +LKACS + A  +  Q+HA I K G + + +  N+LI+ YA  G   +A  +FD + E   V+WNS++ G                              
Subjt:  VLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTM
                                                Y K G++D A  LF +M +++ ++W+ MISGY QAD  KEAL LFHEMQ + V+P+ V++
Subjt:  SVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTM

Query:  VSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVF+++  K+V  WTALI G A +G G+ A+  F  M++  +KPN +T
Subjt:  VSVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVT

Query:  FIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHS
        F A+L+ACS+  LV++G+ +F SM RD++++P IEHYGC+VD+LGRAGLL+EA +FI  MP+ PNAV+W  LL +CR HKN+E+ E+  E +  ++P H 
Subjt:  FIAILSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHS

Query:  GDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLE-AEEESKETSMS
        G Y+  +N +A+  + + A   R L+KE+ + K PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP +E+  L+  +++ +E  + 
Subjt:  GDYILLSNTYALAGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLE-AEEESKETSMS

Query:  HHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW
         HSEKLAI YGL++T   T IRI KNLR+C+DCH  TK+IS++++R I++RDR RFHHF+DG CSC DYW
Subjt:  HHSEKLAIAYGLLRTPLRTTIRISKNLRMCRDCHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGACAGTTGCTTGCCTTCCCGCTACGTCTGTAACTTCCATAACCCAGATTCACCAATTCCCTGAAAATCCCAAAACTTTGATTCTTCAGAGATGCAAGACTCC
CAAAGATCTCCACCAAATTCACGCTCACCTTCTCAAAACTCGTCGTATTCTCGATCCCACCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCAACACCA
TAGACTATGCCCTTTCCATTTTCAATCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCCTTCAAGCAATCGCCTCATAATGCCTTTCTC
CTGTTCAAGAAAATGCATGAAAACTCGGTTGAACACGACGAATTCACTTTCTCCTGTGTCTTAAAGGCCTGCTCGAGAATGAGAGCGTTGAGGGATAGGGAACAGGTCCA
CGCCCTGATTTTGAAATCTGGGTGCAAGCCAAATGAGTTTGTCGAGAACACTTTGATTCATATGTACGCGAATTGTGGAGAACTTGGGGTTGCACGTCAGGTGTTTGATG
GAATGTCGGAACGAGGCATTGTTGCGTGGAATTCAATGTTGTCTGGTTACACGAAGAATGGGCTTTGGGATGAGGTCGTGAAACTTTTTCGGGCAATGCTGGAACTGCAT
ATTGAATTTGATGATGTTACAATGATTAGCGTGTTGATGGCTTGTGGAAGATTAGCTGATCTTGAATTGGGTGAGTTGATTGGTGAGTATATTGTGTCCAAGGGGCTAAC
AATAAATAGTACTTTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCGAGTCGATACTGCTCGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTCGTTGCGT
GGAGTGCAATGATCTCGGGTTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAGTCTGTTCCATGAGATGCAAAAGGCAAAAGTGGATCCAAACGAGGTTACAATGGTC
AGTGTTCTTTATTCGTGCGCCATGCTCGGAGCATACGAAACCGGTAAATGGGTTCATTTCTATATGAAAAAGAAGAAGATGAAGCTCACTGTTACTCTTGGAACTCAGCT
GATAGATTTCTACGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTTAGGGACATGCCTTTCAAAAATGTCTTCACATGGACAGCACTGATTCAGGGTCTTGCCA
ATAATGGAGAGGGGAAAATGGCTCTTGACTTCTTTTCTTTGATGCGAGAGAACGATGTAAAGCCAAATGATGTAACTTTCATTGCTATTCTTTCTGCTTGTAGCCATGCT
TGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATTCTTGGTCGAGCTGG
GCTACTTGAAGAAGCATATCAGTTCATAGATAACATGCCCATTCCACCCAATGCTGTTGTTTGGAGGACACTTTTGGCTTCATGCAGAGCTCATAAAAATGTTGAAATGG
CAGAGAAATCATTGGAACACATAACTCAATTGGAGCCTGCTCATAGTGGAGATTACATTCTTCTGTCAAATACTTATGCTTTGGCTGGTAGGGTTGAGGATGCACTGAGG
GTGAGATCTCTGATAAAAGAGAAGGAAATTAAGAAGACACCAGGTTGTAGTTTGATTGAGCTTGATGGTGTAGTACATGAGTTTTTTTCTGAAGATGGAGAGCACACTCA
CTCTAAGGAAATACACGATGCGTTAGATGAAATGATGAAGCGGATCAAGTCACTCGGATACGTGCCCAACATGGAGGATGCGAGACTAGAGGCGGAGGAAGAGAGCAAGG
AAACTTCAATGTCGCATCATAGTGAGAAGCTAGCTATTGCATATGGTCTTCTTCGGACACCTCTTCGAACCACCATTAGAATTTCAAAAAATCTAAGGATGTGCAGGGAC
TGTCACAATGCGACAAAGGTTATATCGCGAGTCTTTGAAAGAATGATCATTGTTAGGGATCGGAATCGCTTTCATCATTTCAAAGATGGTCTTTGCTCCTGTAATGACTA
TTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCGACAGTTGCTTGCCTTCCCGCTACGTCTGTAACTTCCATAACCCAGATTCACCAATTCCCTGAAAATCCCAAAACTTTGATTCTTCAGAGATGCAAGACTCC
CAAAGATCTCCACCAAATTCACGCTCACCTTCTCAAAACTCGTCGTATTCTCGATCCCACCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCAACACCA
TAGACTATGCCCTTTCCATTTTCAATCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCCTTCAAGCAATCGCCTCATAATGCCTTTCTC
CTGTTCAAGAAAATGCATGAAAACTCGGTTGAACACGACGAATTCACTTTCTCCTGTGTCTTAAAGGCCTGCTCGAGAATGAGAGCGTTGAGGGATAGGGAACAGGTCCA
CGCCCTGATTTTGAAATCTGGGTGCAAGCCAAATGAGTTTGTCGAGAACACTTTGATTCATATGTACGCGAATTGTGGAGAACTTGGGGTTGCACGTCAGGTGTTTGATG
GAATGTCGGAACGAGGCATTGTTGCGTGGAATTCAATGTTGTCTGGTTACACGAAGAATGGGCTTTGGGATGAGGTCGTGAAACTTTTTCGGGCAATGCTGGAACTGCAT
ATTGAATTTGATGATGTTACAATGATTAGCGTGTTGATGGCTTGTGGAAGATTAGCTGATCTTGAATTGGGTGAGTTGATTGGTGAGTATATTGTGTCCAAGGGGCTAAC
AATAAATAGTACTTTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCGAGTCGATACTGCTCGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTCGTTGCGT
GGAGTGCAATGATCTCGGGTTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAGTCTGTTCCATGAGATGCAAAAGGCAAAAGTGGATCCAAACGAGGTTACAATGGTC
AGTGTTCTTTATTCGTGCGCCATGCTCGGAGCATACGAAACCGGTAAATGGGTTCATTTCTATATGAAAAAGAAGAAGATGAAGCTCACTGTTACTCTTGGAACTCAGCT
GATAGATTTCTACGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTTAGGGACATGCCTTTCAAAAATGTCTTCACATGGACAGCACTGATTCAGGGTCTTGCCA
ATAATGGAGAGGGGAAAATGGCTCTTGACTTCTTTTCTTTGATGCGAGAGAACGATGTAAAGCCAAATGATGTAACTTTCATTGCTATTCTTTCTGCTTGTAGCCATGCT
TGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATTCTTGGTCGAGCTGG
GCTACTTGAAGAAGCATATCAGTTCATAGATAACATGCCCATTCCACCCAATGCTGTTGTTTGGAGGACACTTTTGGCTTCATGCAGAGCTCATAAAAATGTTGAAATGG
CAGAGAAATCATTGGAACACATAACTCAATTGGAGCCTGCTCATAGTGGAGATTACATTCTTCTGTCAAATACTTATGCTTTGGCTGGTAGGGTTGAGGATGCACTGAGG
GTGAGATCTCTGATAAAAGAGAAGGAAATTAAGAAGACACCAGGTTGTAGTTTGATTGAGCTTGATGGTGTAGTACATGAGTTTTTTTCTGAAGATGGAGAGCACACTCA
CTCTAAGGAAATACACGATGCGTTAGATGAAATGATGAAGCGGATCAAGTCACTCGGATACGTGCCCAACATGGAGGATGCGAGACTAGAGGCGGAGGAAGAGAGCAAGG
AAACTTCAATGTCGCATCATAGTGAGAAGCTAGCTATTGCATATGGTCTTCTTCGGACACCTCTTCGAACCACCATTAGAATTTCAAAAAATCTAAGGATGTGCAGGGAC
TGTCACAATGCGACAAAGGTTATATCGCGAGTCTTTGAAAGAATGATCATTGTTAGGGATCGGAATCGCTTTCATCATTTCAAAGATGGTCTTTGCTCCTGTAATGACTA
TTGGTGA
Protein sequenceShow/hide protein sequence
MASTVACLPATSVTSITQIHQFPENPKTLILQRCKTPKDLHQIHAHLLKTRRILDPTITEAVLESAALLLPNTIDYALSIFNHIDKPESSAYNVMIRGLAFKQSPHNAFL
LFKKMHENSVEHDEFTFSCVLKACSRMRALRDREQVHALILKSGCKPNEFVENTLIHMYANCGELGVARQVFDGMSERGIVAWNSMLSGYTKNGLWDEVVKLFRAMLELH
IEFDDVTMISVLMACGRLADLELGELIGEYIVSKGLTINSTLTTSLIDMYAKCGRVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALSLFHEMQKAKVDPNEVTMV
SVLYSCAMLGAYETGKWVHFYMKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFRDMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENDVKPNDVTFIAILSACSHA
CLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVEMAEKSLEHITQLEPAHSGDYILLSNTYALAGRVEDALR
VRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMMKRIKSLGYVPNMEDARLEAEEESKETSMSHHSEKLAIAYGLLRTPLRTTIRISKNLRMCRD
CHNATKVISRVFERMIIVRDRNRFHHFKDGLCSCNDYW