; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017426 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017426
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153047:782532..788438
RNA-Seq ExpressionSgr017426
SyntenySgr017426
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589416.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.4Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS +AC P  SVT+ITH+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRR  DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALR GEQVHA ILKSG K NEFVENTLIHMYANCG+VGVARQVFDGMS+R  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+I+SKG+ RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEAL+LFHEMQKAKVD NEVTMVS LYSCA+LGAYETGKWVH YIK++KMKLTV+LGTQLIDFYAKCGYI+ SVEVFR MPF NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDE++K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGYVPN+EDARLEA +E+SKETSVSHHSEKLAIAYGLIRT  +TT+RISKNLRMCRDCHNA KVIS+V+ RTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

KAG7023094.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.11Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS +AC P  SVT+ITH+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRRL DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALREGEQVHA ILKSG K NEFVENTLIHMYANCG+VGVARQVFDGMS+R  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+I+SKG+ RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEAL+LFHEMQKAKVD NEVTMVSVLYSCAVLGAYETGKWVH YIK++KMKLTV+LGTQLIDFYAKCGYI+ SVEVFR MPF NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGYVPN+EDARLEA +E+SKETSVSHHSEKLAIAYGLIRT  +TT+RISKNLRMCRDCHNA KVIS+V+ RTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

XP_022135298.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Momordica charantia]0.0e+0092.11Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS LACFPA SVTTITHISQFPENPKTLILQQCKT KDLHQVHAHL+KTRRLLDP ITEA+LESAALLLPNTIDYALSIFNHID+PE SAYNVMIRGL+
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NA LLFKKMHENSVEHD FTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVG+AR+VFDGMSERG+IAWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNG+W EVVKLF++MLELHIEFDDVTMISVLMACGRLADLELGELIGE+IVSKGLT N +LTTSL+DMYAKCG+V++A KLFD+M KRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEALNLFHEMQKAKVDPNEVTMVSVLYSCA+LGAYETGKWVHFYIKK+KMKLTVTLGTQLIDFYAKCGY +SSVEVFREMP +NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGK+ALDFFSLMRENNVKPNDVTFIG+LSACSHACLV QGRHLFNSM RDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASCRAHKNV MAEKSLEHIT+LEPAHSGDYILLSNTYA  GRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIH+ALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RI+ LGYVPNIEDARLEA+EDSKETSVSHHSEKLAIAYGLIRT PRT +RISKNLRMCRDCHNA KVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

XP_022921781.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]0.0e+0089.26Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS +AC P  SVT+ITH+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRRL DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALREGEQVHA ILKSG KPNEFVENTLIHMYANCG+VGVARQVFDGMS+R  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+I+SKG+ RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEAL+LFHEMQKAKVD NEVTMVSVLYSCAVLGAYETGKWVH YIK++KMKLTV+LGTQLIDFYAKCGYI+ SVEVFR MPF NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGYVPN+EDARLEA +E+SKETSVSHHSEKLAIAYGLIRT  +TT+RISKNLRMCRDCHNA KVIS+V+ RTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

XP_022987229.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima]0.0e+0088.4Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS + C P TSVT+I H+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRRL DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALREGEQVHA ILKSG K NEFVENTLIHMYANCG+VGVARQVFDGMSER  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+IVSKGL RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQADQCKEAL+LFHEMQKAKVD NEVTMVSVLYSCAVLGAYETGKWVH YIK++KM+LTV+LGTQLIDFYAKCGYI+ SVEVFR M F NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEG+MALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLL+EAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGY+PN+EDARLEA +E+SKETSV HHSEKLAIAYGLIRT  +TT+RISKNLR+CRDCHNA K+ISRV+ RTIIVRDRNRFHHF+DGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

TrEMBL top hitse value%identityAlignment
A0A0A0LRD6 DYW_deaminase domain-containing protein0.0e+0087.95Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS + C P  S+T+IT   QFPENPK+LILQQCKT KDL QVHAHLLKTRRLLDP ITEA+LESAALLLP+TIDYALSIFNHIDKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FK+SP NA+LLFKKMHE SV+HD+FTFS VLKACSRM+ALREGEQVHA ILKSG K NEFVENTLI MYANCG++GVAR VFDGM ER I+AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+LE+GELIGE+IVSKGL RN+ LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEALNLFHEMQK  V PNEVTMVSVLYSCA+LGAYETGKWVHFYIKK+KMKLTVTLGTQLIDFYAKCGYI+ SVEVF+EM FKNVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMAL+FFS M EN+VKPNDVTFIG+LSACSHACLV QGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAG LEEAYQFIDNMP PPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASCRAHKN+ MAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDA+RVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEH HSKEIHDALD+M+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        +IK LGYVPN +DARLEA+E+SKETSVSHHSEKLAIAYGLIRTSPRTT+RISKNLRMCRDCHNA K IS+VF+R IIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

A0A5D3BFH8 Pentatricopeptide repeat-containing protein0.0e+0087.09Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS + C P TS+T+IT ISQFPENPK+LILQQCKT KDL QVHAHLLKTRRLLDP ITEA+LESAALLLP+TIDYALSIFNHIDKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FK+SP NA+LLFKKMHENSV+HD+FTFS VLKACSRMR L+EGEQVHA ILKSG K NEFVENTLI MYANCG++GVAR VFDGM ERGI+AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLF+K+LEL+I FDDVTMISVLMACGRLA+LE+GELIGE+IVSKGL RN+ L TSL+DMYAKCG++++ARKLF+EMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEALNLFHEMQK  VDPNEVTMVSVLYSCA+LGAY+TGKWVHFYIKK+KMKLTVTLGTQLIDFYAKCGYI+ SVEVF+EM FKNVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMAL+FFSLM EN+VKPNDVTFIG+LSACSHACLV QGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAG LEEAYQFID+MP PPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASCRAHKN+ MAEKSLEHITRLEP HSGDYILLSNTYALVGRVEDA+RVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEH HSKEIHDALD+M+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        +IK+LGYVPNIE ARLEA+E++KETSVSHHSEKLAIAYGLIRTSPRTT+RISKNLRMC DCHNA K IS+ F+R IIVRDRNRFHHFKDGLCSC DY
Subjt:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

A0A6J1C4F8 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0092.11Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS LACFPA SVTTITHISQFPENPKTLILQQCKT KDLHQVHAHL+KTRRLLDP ITEA+LESAALLLPNTIDYALSIFNHID+PE SAYNVMIRGL+
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NA LLFKKMHENSVEHD FTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVG+AR+VFDGMSERG+IAWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNG+W EVVKLF++MLELHIEFDDVTMISVLMACGRLADLELGELIGE+IVSKGLT N +LTTSL+DMYAKCG+V++A KLFD+M KRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEALNLFHEMQKAKVDPNEVTMVSVLYSCA+LGAYETGKWVHFYIKK+KMKLTVTLGTQLIDFYAKCGY +SSVEVFREMP +NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGK+ALDFFSLMRENNVKPNDVTFIG+LSACSHACLV QGRHLFNSM RDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASCRAHKNV MAEKSLEHIT+LEPAHSGDYILLSNTYA  GRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIH+ALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RI+ LGYVPNIEDARLEA+EDSKETSVSHHSEKLAIAYGLIRT PRT +RISKNLRMCRDCHNA KVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

A0A6J1E2C2 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0089.26Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS +AC P  SVT+ITH+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRRL DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALREGEQVHA ILKSG KPNEFVENTLIHMYANCG+VGVARQVFDGMS+R  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+I+SKG+ RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQAD+CKEAL+LFHEMQKAKVD NEVTMVSVLYSCAVLGAYETGKWVH YIK++KMKLTV+LGTQLIDFYAKCGYI+ SVEVFR MPF NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEGKMALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLLEEAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGYVPN+EDARLEA +E+SKETSVSHHSEKLAIAYGLIRT  +TT+RISKNLRMCRDCHNA KVIS+V+ RTIIVRDRNRFHHFKDGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

A0A6J1JIA5 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like0.0e+0088.4Show/hide
Query:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA
        MAS + C P TSVT+I H+SQFPENPK+LILQ+CKT KDL QVHAHLLKTRRL DP I EA+LESAALLLPN+IDYALSIFNH+DKPE SAYNVMIRGLA
Subjt:  MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLA

Query:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY
        FKQSP NAVLLFKKMHENSV+HD+FTFS VLKACSRMRALREGEQVHA ILKSG K NEFVENTLIHMYANCG+VGVARQVFDGMSER  +AWNSMLSGY
Subjt:  FKQSPLNAVLLFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGY

Query:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG
        TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGE+IVSKGL RN  LTTSL+DMYAKCGQV++ARKLFDEMDKRDVVAWSAMISG
Subjt:  TKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISG

Query:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ
        YAQADQCKEAL+LFHEMQKAKVD NEVTMVSVLYSCAVLGAYETGKWVH YIK++KM+LTV+LGTQLIDFYAKCGYI+ SVEVFR M F NVFTWTALIQ
Subjt:  YAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQ

Query:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT
        GLANNGEG+MALDFF+LMRENNVKPNDVTFI +LSACSHACLV QGRHLFNSMRR FDIEPRIEHYGCMVDILGRAGLL+EAYQFI NMPIPPNAVVWRT
Subjt:  GLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRT

Query:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK
        LLASC+AHKNV MAEKS +HIT LEPAHSGDYILLSNTYALVGRVEDALRVRSLIK+KEIKKTPGCSLIELDGVVHEFFSEDG+HTHSKEIHDALDEM+K
Subjt:  LLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLK

Query:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        RIKSLGY+PN+EDARLEA +E+SKETSV HHSEKLAIAYGLIRT  +TT+RISKNLR+CRDCHNA K+ISRV+ RTIIVRDRNRFHHF+DGLCSCNDY
Subjt:  RIKSLGYVPNIEDARLEA-DEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

SwissProt top hitse value%identityAlignment
O23337 Pentatricopeptide repeat-containing protein At4g148201.2e-14636.74Show/hide
Query:  LQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHI-DKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC
        L  CK+   + Q+HAH+L+T  +++  +   L   +       + YAL++F+ I   PE   +N  +R L+    P   +L ++++       D+F+F  
Subjt:  LQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHI-DKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI
        +LKA S++ AL EG ++H    K     + FVE   + MYA+CG +  AR VFD MS R ++ WN+M+  Y + GL DE  KLF +M + ++  D++ + 
Subjt:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYA-------------------------------KCGQVNSARKLFDEMDKRDVVAWSAMI
        +++ ACGR  ++     I E ++   +  +  L T+LV MYA                               KCG+++ A+ +FD+ +K+D+V W+ MI
Subjt:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYA-------------------------------KCGQVNSARKLFDEMDKRDVVAWSAMI

Query:  SGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTAL
        S Y ++D  +EAL +F EM  + + P+ V+M SV+ +CA LG  +  KWVH  I    ++  +++   LI+ YAKCG ++++ +VF +MP +NV +W+++
Subjt:  SGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTAL

Query:  IQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW
        I  L+ +GE   AL  F+ M++ NV+PN+VTF+G+L  CSH+ LV +G+ +F SM  +++I P++EHYGCMVD+ GRA LL EA + I++MP+  N V+W
Subjt:  IQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW

Query:  RTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM
         +L+++CR H  + + + + + I  LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  H  S EI+  LDE+
Subjt:  RTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM

Query:  LKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRT------TLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC
        + ++K  GYVP+     ++ +E+ K+  V  HSEKLA+ +GL+             +RI KNLR+C DCH   K++S+V++R IIVRDR RFH +K+GLC
Subjt:  LKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRT------TLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC

Query:  SCNDY
        SC DY
Subjt:  SCNDY

O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.2e-15738.87Show/hide
Query:  PATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNA
        P T+     HIS         ++++C + + L Q H H+++T    DP     L   AAL    +++YA  +F+ I KP   A+N +IR  A    P+ +
Subjt:  PATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNA

Query:  VLLFKKM-HENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWD
        +  F  M  E+    +++TF  ++KA + + +L  G+ +H   +KS    + FV N+LIH Y +CG++  A +VF  + E+ +++WNSM++G+ + G  D
Subjt:  VLLFKKM-HENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWD

Query:  EVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMD--------------------
        + ++LF+KM    ++   VTM+ VL AC ++ +LE G  +  +I    +  N  L  +++DMY KCG +  A++LFD M+                    
Subjt:  EVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMD--------------------

Query:  -----------KRDVVAWSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCG
                   ++D+VAW+A+IS Y Q  +  EAL +FHE+Q  K +  N++T+VS L +CA +GA E G+W+H YIKK  +++   + + LI  Y+KCG
Subjt:  -----------KRDVVAWSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCG

Query:  YIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR
         +E S EVF  +  ++VF W+A+I GLA +G G  A+D F  M+E NVKPN VTF  +  ACSH  LV +   LF+ M  ++ I P  +HY C+VD+LGR
Subjt:  YIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR

Query:  AGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVV
        +G LE+A +FI+ MPIPP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LLSN YA +G+ E+   +R  ++   +KK PGCS IE+DG++
Subjt:  AGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVV

Query:  HEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDA-RLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDR
        HEF S D  H  S++++  L E+++++KS GY P I    ++  +E+ KE S++ HSEKLAI YGLI T     +R+ KNLR+C DCH+  K+IS+++DR
Subjt:  HEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDA-RLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDR

Query:  TIIVRDRNRFHHFKDGLCSCNDY
         IIVRDR RFHHF++G CSCND+
Subjt:  TIIVRDRNRFHHFKDGLCSCNDY

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665209.8e-14939.76Show/hide
Query:  LQQCKTTKDLHQVHAHLLKTRRLLDP-NITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC
        LQ+C   ++L Q+HA +LKT  + D   IT+ L    +    + + YA  +F+  D+P+   +N+MIRG +    P  ++LL+++M  +S  H+ +TF  
Subjt:  LQQCKTTKDLHQVHAHLLKTRRLLDP-NITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI
        +LKACS + A  E  Q+HAQI K G + + +  N+LI+ YA  G   +A  +FD + E   ++WNS++ GY K G  D  + LFRKM E           
Subjt:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTM
                                                                   ++ ++W+ MISGY QAD  KEAL LFHEMQ + V+P+ V++
Subjt:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTM

Query:  VSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG +E ++EVF+ +  K+V  WTALI G A +G G+ A+  F  M++  +KPN +T
Subjt:  VSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVT

Query:  FIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHS
        F  +L+ACS+  LV +G+ +F SM RD++++P IEHYGC+VD+LGRAGLL+EA +FI  MP+ PNAV+W  LL +CR HKN+ + E+  E +  ++P H 
Subjt:  FIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHS

Query:  GDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDARLE-ADEDSKETSVS
        G Y+  +N +A+  + + A   R L+KE+ + K PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP +E+  L+  D+D +E  V 
Subjt:  GDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDARLE-ADEDSKETSVS

Query:  HHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
         HSEKLAI YGLI+T P T +RI KNLR+C+DCH   K+IS+++ R I++RDR RFHHF+DG CSC DY
Subjt:  HHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic9.2e-16339.83Show/hide
Query:  NPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPN--TIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEH
        +P   +L  CKT + L  +HA ++K   L + N   + L    +L P+   + YA+S+F  I +P    +N M RG A    P++A+ L+  M    +  
Subjt:  NPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPN--TIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEH

Query:  DEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFD---------------GMSERG----------------II
        + +TF  VLK+C++ +A +EG+Q+H  +LK G   + +V  +LI MY   G +  A +VFD               G + RG                ++
Subjt:  DEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFD---------------GMSERG----------------II

Query:  AWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDV
        +WN+M+SGY + G + E ++LF+ M++ ++  D+ TM++V+ AC +   +ELG  +   I   G   N  +  +L+D+Y+KCG++ +A  LF+ +  +DV
Subjt:  AWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDV

Query:  VAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLT--VTLGTQLIDFYAKCGYIESSVEVFREMPF
        ++W+ +I GY   +  KEAL LF EM ++   PN+VTM+S+L +CA LGA + G+W+H YI K    +T   +L T LID YAKCG IE++ +VF  +  
Subjt:  VAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLT--VTLGTQLIDFYAKCGYIESSVEVFREMPF

Query:  KNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNM
        K++ +W A+I G A +G    + D FS MR+  ++P+D+TF+G+LSACSH+ ++  GRH+F +M +D+ + P++EHYGCM+D+LG +GL +EA + I+ M
Subjt:  KNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNM

Query:  PIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSK
         + P+ V+W +LL +C+ H NV + E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK PGCS IE+D VVHEF   D  H  ++
Subjt:  PIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSK

Query:  EIHDALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKD
        EI+  L+EM   ++  G+VP+  +   E +E+ KE ++ HHSEKLAIA+GLI T P T L I KNLR+CR+CH A K+IS+++ R II RDR RFHHF+D
Subjt:  EIHDALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKD

Query:  GLCSCNDY
        G+CSCNDY
Subjt:  GLCSCNDY

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088204.4e-14137.17Show/hide
Query:  SVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLL
        S+ T+   +   +  KTLI   C T   L Q+H  L+      D  +   LL+    L      Y+  +F+H   P    YN +I G          + L
Subjt:  SVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLL

Query:  FKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVK
        F  + ++ +    FTF  VLKAC+R  + + G  +H+ ++K G   +     +L+ +Y+  G +  A ++FD + +R ++ W ++ SGYT +G   E + 
Subjt:  FKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVK

Query:  LFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEAL
        LF+KM+E+ ++ D   ++ VL AC  + DL+ GE I +++    + +N  + T+LV++YAKCG++  AR +FD M ++D+V WS MI GYA     KE +
Subjt:  LFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEAL

Query:  NLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMA
         LF +M +  + P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAKCG +    EVF+EM  K++    A I GLA NG  K++
Subjt:  NLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMA

Query:  LDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNV
           F    +  + P+  TF+G+L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG+L++AY+ I +MP+ PNA+VW  LL+ CR  K+ 
Subjt:  LDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNV

Query:  AMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNI
         +AE  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR ++ +K +KK PG S IEL+G VHEF ++D  H  S +I+  L+++   ++ +G+VP  
Subjt:  AMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNI

Query:  EDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
        E    + +E+ KE  + +HSEKLA+A GLI T     +R+ KNLR+C DCH  MK+IS++  R I+VRD NRFH F +G CSCNDY
Subjt:  EDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-16439.83Show/hide
Query:  NPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPN--TIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEH
        +P   +L  CKT + L  +HA ++K   L + N   + L    +L P+   + YA+S+F  I +P    +N M RG A    P++A+ L+  M    +  
Subjt:  NPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPN--TIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEH

Query:  DEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFD---------------GMSERG----------------II
        + +TF  VLK+C++ +A +EG+Q+H  +LK G   + +V  +LI MY   G +  A +VFD               G + RG                ++
Subjt:  DEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFD---------------GMSERG----------------II

Query:  AWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDV
        +WN+M+SGY + G + E ++LF+ M++ ++  D+ TM++V+ AC +   +ELG  +   I   G   N  +  +L+D+Y+KCG++ +A  LF+ +  +DV
Subjt:  AWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDV

Query:  VAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLT--VTLGTQLIDFYAKCGYIESSVEVFREMPF
        ++W+ +I GY   +  KEAL LF EM ++   PN+VTM+S+L +CA LGA + G+W+H YI K    +T   +L T LID YAKCG IE++ +VF  +  
Subjt:  VAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLT--VTLGTQLIDFYAKCGYIESSVEVFREMPF

Query:  KNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNM
        K++ +W A+I G A +G    + D FS MR+  ++P+D+TF+G+LSACSH+ ++  GRH+F +M +D+ + P++EHYGCM+D+LG +GL +EA + I+ M
Subjt:  KNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNM

Query:  PIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSK
         + P+ V+W +LL +C+ H NV + E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK PGCS IE+D VVHEF   D  H  ++
Subjt:  PIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSK

Query:  EIHDALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKD
        EI+  L+EM   ++  G+VP+  +   E +E+ KE ++ HHSEKLAIA+GLI T P T L I KNLR+CR+CH A K+IS+++ R II RDR RFHHF+D
Subjt:  EIHDALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKD

Query:  GLCSCNDY
        G+CSCNDY
Subjt:  GLCSCNDY

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-15938.87Show/hide
Query:  PATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNA
        P T+     HIS         ++++C + + L Q H H+++T    DP     L   AAL    +++YA  +F+ I KP   A+N +IR  A    P+ +
Subjt:  PATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNA

Query:  VLLFKKM-HENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWD
        +  F  M  E+    +++TF  ++KA + + +L  G+ +H   +KS    + FV N+LIH Y +CG++  A +VF  + E+ +++WNSM++G+ + G  D
Subjt:  VLLFKKM-HENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWD

Query:  EVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMD--------------------
        + ++LF+KM    ++   VTM+ VL AC ++ +LE G  +  +I    +  N  L  +++DMY KCG +  A++LFD M+                    
Subjt:  EVVKLFRKMLELHIEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMD--------------------

Query:  -----------KRDVVAWSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCG
                   ++D+VAW+A+IS Y Q  +  EAL +FHE+Q  K +  N++T+VS L +CA +GA E G+W+H YIKK  +++   + + LI  Y+KCG
Subjt:  -----------KRDVVAWSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCG

Query:  YIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR
         +E S EVF  +  ++VF W+A+I GLA +G G  A+D F  M+E NVKPN VTF  +  ACSH  LV +   LF+ M  ++ I P  +HY C+VD+LGR
Subjt:  YIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR

Query:  AGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVV
        +G LE+A +FI+ MPIPP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LLSN YA +G+ E+   +R  ++   +KK PGCS IE+DG++
Subjt:  AGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVV

Query:  HEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDA-RLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDR
        HEF S D  H  S++++  L E+++++KS GY P I    ++  +E+ KE S++ HSEKLAI YGLI T     +R+ KNLR+C DCH+  K+IS+++DR
Subjt:  HEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDA-RLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDR

Query:  TIIVRDRNRFHHFKDGLCSCNDY
         IIVRDR RFHHF++G CSCND+
Subjt:  TIIVRDRNRFHHFKDGLCSCNDY

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)1.8e-14234.6Show/hide
Query:  LQQCKTTKDLHQVHAHLLKTRRLLD---PNITEALLESAALLLPNTIDYALSIFNHIDKPEPS-AYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFT
        L+ CKT  +L   H  L  T++ LD     IT+ +  S  L    ++ +A  +F + +       YN +IRG A       A+LLF +M  + +  D++T
Subjt:  LQQCKTTKDLHQVHAHLLKTRRLLD---PNITEALLESAALLLPNTIDYALSIFNHIDKPEPS-AYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFT

Query:  FSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKML-ELHIEFDD
        F   L AC++ RA   G Q+H  I+K G   + FV+N+L+H YA CGE+  AR+VFD MSER +++W SM+ GY +     + V LF +M+ +  +  + 
Subjt:  FSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKML-ELHIEFDD

Query:  VTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDE----------------------------------------
        VTM+ V+ AC +L DLE GE +   I + G+  N  + ++LVDMY KC  ++ A++LFDE                                        
Subjt:  VTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDE----------------------------------------

Query:  --------------------------------------------------------------------------------------------MDKRDVVA
                                                                                                    M ++++V+
Subjt:  --------------------------------------------------------------------------------------------MDKRDVVA

Query:  WSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNV
        W+ +ISG  Q    +EA+ +F  MQ  + V+ + VTM+S+  +C  LGA +  KW+++YI+K  ++L V LGT L+D +++CG  ES++ +F  +  ++V
Subjt:  WSAMISGYAQADQCKEALNLFHEMQKAK-VDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNV

Query:  FTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIP
          WTA I  +A  G  + A++ F  M E  +KP+ V F+G L+ACSH  LV QG+ +F SM +   + P   HYGCMVD+LGRAGLLEEA Q I++MP+ 
Subjt:  FTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIP

Query:  PNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIH
        PN V+W +LLA+CR   NV MA  + E I  L P  +G Y+LLSN YA  GR  D  +VR  +KEK ++K PG S I++ G  HEF S D  H     I 
Subjt:  PNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIH

Query:  DALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC
          LDE+ +R   LG+VP++ +  ++ DE  K   +S HSEKLA+AYGLI ++  TT+RI KNLR+C DCH+  K  S+V++R II+RD NRFH+ + G C
Subjt:  DALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC

Query:  SCNDYC------NDLKMDLQHLTSTHQDDLPVYYHDDVRLRLCY
        SC D+C        LK  L      + D++P        L LCY
Subjt:  SCNDYC------NDLKMDLQHLTSTHQDDLPVYYHDDVRLRLCY

AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein8.5e-14836.74Show/hide
Query:  LQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHI-DKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC
        L  CK+   + Q+HAH+L+T  +++  +   L   +       + YAL++F+ I   PE   +N  +R L+    P   +L ++++       D+F+F  
Subjt:  LQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHI-DKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI
        +LKA S++ AL EG ++H    K     + FVE   + MYA+CG +  AR VFD MS R ++ WN+M+  Y + GL DE  KLF +M + ++  D++ + 
Subjt:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYA-------------------------------KCGQVNSARKLFDEMDKRDVVAWSAMI
        +++ ACGR  ++     I E ++   +  +  L T+LV MYA                               KCG+++ A+ +FD+ +K+D+V W+ MI
Subjt:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYA-------------------------------KCGQVNSARKLFDEMDKRDVVAWSAMI

Query:  SGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTAL
        S Y ++D  +EAL +F EM  + + P+ V+M SV+ +CA LG  +  KWVH  I    ++  +++   LI+ YAKCG ++++ +VF +MP +NV +W+++
Subjt:  SGYAQADQCKEALNLFHEMQKAKVDPNEVTMVSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTAL

Query:  IQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW
        I  L+ +GE   AL  F+ M++ NV+PN+VTF+G+L  CSH+ LV +G+ +F SM  +++I P++EHYGCMVD+ GRA LL EA + I++MP+  N V+W
Subjt:  IQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVW

Query:  RTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM
         +L+++CR H  + + + + + I  LEP H G  +L+SN YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  H  S EI+  LDE+
Subjt:  RTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEM

Query:  LKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRT------TLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC
        + ++K  GYVP+     ++ +E+ K+  V  HSEKLA+ +GL+             +RI KNLR+C DCH   K++S+V++R IIVRDR RFH +K+GLC
Subjt:  LKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRT------TLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLC

Query:  SCNDY
        SC DY
Subjt:  SCNDY

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-15039.76Show/hide
Query:  LQQCKTTKDLHQVHAHLLKTRRLLDP-NITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC
        LQ+C   ++L Q+HA +LKT  + D   IT+ L    +    + + YA  +F+  D+P+   +N+MIRG +    P  ++LL+++M  +S  H+ +TF  
Subjt:  LQQCKTTKDLHQVHAHLLKTRRLLDP-NITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVLLFKKMHENSVEHDEFTFSC

Query:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI
        +LKACS + A  E  Q+HAQI K G + + +  N+LI+ YA  G   +A  +FD + E   ++WNS++ GY K G  D  + LFRKM E           
Subjt:  VLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMI

Query:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTM
                                                                   ++ ++W+ MISGY QAD  KEAL LFHEMQ + V+P+ V++
Subjt:  SVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTM

Query:  VSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVT
         + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG +E ++EVF+ +  K+V  WTALI G A +G G+ A+  F  M++  +KPN +T
Subjt:  VSVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVT

Query:  FIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHS
        F  +L+ACS+  LV +G+ +F SM RD++++P IEHYGC+VD+LGRAGLL+EA +FI  MP+ PNAV+W  LL +CR HKN+ + E+  E +  ++P H 
Subjt:  FIGILSACSHACLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHS

Query:  GDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDARLE-ADEDSKETSVS
        G Y+  +N +A+  + + A   R L+KE+ + K PGCS I L+G  HEF + D  H   ++I      M ++++  GYVP +E+  L+  D+D +E  V 
Subjt:  GDYILLSNTYALVGRVEDALRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDARLE-ADEDSKETSVS

Query:  HHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY
         HSEKLAI YGLI+T P T +RI KNLR+C+DCH   K+IS+++ R I++RDR RFHHF+DG CSC DY
Subjt:  HHSEKLAIAYGLIRTSPRTTLRISKNLRMCRDCHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGACGCTGGCTTGCTTTCCCGCTACATCTGTAACTACCATAACCCACATTTCCCAATTCCCTGAAAATCCCAAAACTTTGATACTACAACAATGCAAAACTAC
CAAAGACCTCCACCAAGTTCACGCTCACCTCCTCAAAACTCGCCGTCTTCTCGACCCCAACATTACGGAAGCTCTTCTCGAGTCCGCGGCTCTACTCCTTCCCAACACCA
TAGACTATGCTCTTTCAATTTTCAATCATATCGATAAGCCCGAACCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCCTTCAAGCAATCGCCTCTTAATGCCGTCCTT
CTGTTTAAGAAAATGCATGAAAACTCGGTTGAACACGACGAATTCACTTTCTCTTGTGTCTTGAAGGCTTGCTCCAGAATGAGAGCGCTGAGGGAAGGCGAACAGGTCCA
CGCACAGATTTTGAAATCTGGGCGCAAGCCAAATGAGTTTGTCGAGAACACTTTGATTCACATGTACGCTAATTGCGGAGAAGTTGGGGTCGCACGTCAGGTGTTTGATG
GAATGTCGGAACGCGGCATAATTGCCTGGAATTCGATGTTGTCTGGTTACACGAAGAATGGGCTTTGGGATGAGGTCGTGAAACTTTTTCGAAAAATGCTGGAACTGCAT
ATTGAATTTGATGATGTTACAATGATCAGCGTGTTGATGGCTTGTGGAAGATTAGCTGATCTAGAATTGGGTGAGTTGATTGGTGAGCATATTGTGTCAAAAGGGCTAAC
AAGAAACCATGCTTTAACAACTTCGCTAGTTGATATGTATGCCAAATGTGGTCAAGTCAATAGTGCCAGAAAACTGTTCGACGAAATGGATAAAAGAGATGTTGTTGCAT
GGAGTGCAATGATCTCGGGTTACGCTCAAGCCGATCAATGTAAAGAAGCGCTTAATCTGTTCCATGAGATGCAGAAGGCAAAAGTGGATCCAAACGAGGTAACAATGGTC
AGTGTACTATATTCATGCGCCGTGCTCGGAGCATACGAAACCGGTAAGTGGGTTCATTTCTACATCAAAAAGGAGAAGATGAAGCTCACTGTTACTCTTGGAACTCAGCT
GATAGATTTCTATGCTAAATGCGGGTATATAGAAAGCTCAGTTGAAGTTTTCAGGGAAATGCCTTTCAAAAATGTCTTTACATGGACAGCACTGATTCAGGGCCTTGCCA
ATAATGGAGAGGGGAAAATGGCACTCGACTTCTTCTCTTTGATGCGAGAGAACAATGTAAAGCCAAATGATGTAACTTTCATTGGTATTCTTTCTGCTTGTAGTCATGCT
TGTCTGGTTCATCAGGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCTAGGATTGAGCATTATGGTTGCATGGTTGATATTCTTGGGCGAGCTGG
GTTACTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCATCCCACCCAATGCTGTTGTTTGGAGGACACTTTTGGCTTCATGCAGAGCTCACAAAAACGTAGCAATGG
CAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCATAGTGGAGATTACATTCTACTATCAAATACTTATGCTTTGGTTGGAAGGGTTGAGGATGCACTGAGG
GTGAGATCTCTGATAAAAGAAAAGGAGATTAAGAAGACCCCAGGTTGTAGTTTGATTGAGCTTGATGGTGTGGTACATGAGTTTTTTTCTGAAGATGGTGAGCATACTCA
CTCTAAGGAAATACACGATGCATTAGACGAAATGCTGAAACGGATTAAATCACTGGGGTATGTGCCCAACATAGAGGATGCTAGACTAGAGGCCGACGAAGACAGTAAGG
AAACTTCAGTGTCACATCATAGTGAGAAGCTAGCTATTGCTTATGGTCTTATCCGAACGTCTCCTCGAACTACTCTTAGAATTTCAAAAAATTTAAGGATGTGCAGGGAC
TGTCACAATGCAATGAAGGTGATATCGAGGGTCTTTGATAGAACAATCATTGTTAGGGATCGGAATCGTTTTCATCATTTCAAAGACGGTCTTTGCTCTTGCAATGACTA
CTGCAATGATTTGAAGATGGATCTTCAACATCTTACATCCACGCATCAGGATGATCTTCCAGTCTACTACCATGATGATGTAAGGCTTAGGCTGTGTTACAAAACTAAAA
CCACCTACACATTGATACATGGTGGGGAAGAAAGGGAAGCATTATCATTGGAGCAGAAGAAACAACCTCTGGGCTTATTTTCCATATTTGCTTTTGGGGTGTCCCCTGAA
TCGTCACGGACTGCCATCAGTGAGCTTGTTCCGGGAGACTCGGGAGGAAACACTGTGTTCTCAGAAGCAAAAACTCGAGTAGGAATCATGGGCTCAGGGGGTGAAATGGA
CCATAGGCAACTGCGAACATGGTGTTATATGGACCAAGAACCAGCAACGAAGGAGAAGAATGGAGTACAGGAGCATTCAGAATGGCATAAAGAATCATGGATTATGAGGT
GGATGAAAAGTACTACCAGGTTTCAGATCACGGTGAATGATCCCATGAGAGTGCAGACATTCCATAGCACGAGCAATATCGAGGGCAAAACCAACTGCCACATGCGTGTC
CAAGCACCGTGGGCGCATATTCAGCAAGTACTTTCGTCCGACGAAGAGATGTTTGGGATCAATCAGCCATTTCGCTTCCAATCTGAACTCATCCGTAGCCGAGTAGAATC
TACTCCCAGCTTCCATGTTTCTAGAAAAGCCCAGAAATCAATCGAAAAGTTGGATGACCGATATGTCGCGGAGGAGGCAAAAGGGAGAAGTCCCTCATATTCTACTACAA
CACCCACACAGACGCAGACGGATTGCTTAAAAATTCGTGGGGAAAGGAGAAAGATCCCCAAAAAGGAAGCAGGGGGGGAGAACCAATTGCAAAGCAAAGCCCCCAAAGTT
AAGAATCAGAAATCATTAAATTTATGCCCCCATAATGGAGCTGAACTGAAATGGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCGACGCTGGCTTGCTTTCCCGCTACATCTGTAACTACCATAACCCACATTTCCCAATTCCCTGAAAATCCCAAAACTTTGATACTACAACAATGCAAAACTAC
CAAAGACCTCCACCAAGTTCACGCTCACCTCCTCAAAACTCGCCGTCTTCTCGACCCCAACATTACGGAAGCTCTTCTCGAGTCCGCGGCTCTACTCCTTCCCAACACCA
TAGACTATGCTCTTTCAATTTTCAATCATATCGATAAGCCCGAACCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCCTTCAAGCAATCGCCTCTTAATGCCGTCCTT
CTGTTTAAGAAAATGCATGAAAACTCGGTTGAACACGACGAATTCACTTTCTCTTGTGTCTTGAAGGCTTGCTCCAGAATGAGAGCGCTGAGGGAAGGCGAACAGGTCCA
CGCACAGATTTTGAAATCTGGGCGCAAGCCAAATGAGTTTGTCGAGAACACTTTGATTCACATGTACGCTAATTGCGGAGAAGTTGGGGTCGCACGTCAGGTGTTTGATG
GAATGTCGGAACGCGGCATAATTGCCTGGAATTCGATGTTGTCTGGTTACACGAAGAATGGGCTTTGGGATGAGGTCGTGAAACTTTTTCGAAAAATGCTGGAACTGCAT
ATTGAATTTGATGATGTTACAATGATCAGCGTGTTGATGGCTTGTGGAAGATTAGCTGATCTAGAATTGGGTGAGTTGATTGGTGAGCATATTGTGTCAAAAGGGCTAAC
AAGAAACCATGCTTTAACAACTTCGCTAGTTGATATGTATGCCAAATGTGGTCAAGTCAATAGTGCCAGAAAACTGTTCGACGAAATGGATAAAAGAGATGTTGTTGCAT
GGAGTGCAATGATCTCGGGTTACGCTCAAGCCGATCAATGTAAAGAAGCGCTTAATCTGTTCCATGAGATGCAGAAGGCAAAAGTGGATCCAAACGAGGTAACAATGGTC
AGTGTACTATATTCATGCGCCGTGCTCGGAGCATACGAAACCGGTAAGTGGGTTCATTTCTACATCAAAAAGGAGAAGATGAAGCTCACTGTTACTCTTGGAACTCAGCT
GATAGATTTCTATGCTAAATGCGGGTATATAGAAAGCTCAGTTGAAGTTTTCAGGGAAATGCCTTTCAAAAATGTCTTTACATGGACAGCACTGATTCAGGGCCTTGCCA
ATAATGGAGAGGGGAAAATGGCACTCGACTTCTTCTCTTTGATGCGAGAGAACAATGTAAAGCCAAATGATGTAACTTTCATTGGTATTCTTTCTGCTTGTAGTCATGCT
TGTCTGGTTCATCAGGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCTAGGATTGAGCATTATGGTTGCATGGTTGATATTCTTGGGCGAGCTGG
GTTACTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCATCCCACCCAATGCTGTTGTTTGGAGGACACTTTTGGCTTCATGCAGAGCTCACAAAAACGTAGCAATGG
CAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCATAGTGGAGATTACATTCTACTATCAAATACTTATGCTTTGGTTGGAAGGGTTGAGGATGCACTGAGG
GTGAGATCTCTGATAAAAGAAAAGGAGATTAAGAAGACCCCAGGTTGTAGTTTGATTGAGCTTGATGGTGTGGTACATGAGTTTTTTTCTGAAGATGGTGAGCATACTCA
CTCTAAGGAAATACACGATGCATTAGACGAAATGCTGAAACGGATTAAATCACTGGGGTATGTGCCCAACATAGAGGATGCTAGACTAGAGGCCGACGAAGACAGTAAGG
AAACTTCAGTGTCACATCATAGTGAGAAGCTAGCTATTGCTTATGGTCTTATCCGAACGTCTCCTCGAACTACTCTTAGAATTTCAAAAAATTTAAGGATGTGCAGGGAC
TGTCACAATGCAATGAAGGTGATATCGAGGGTCTTTGATAGAACAATCATTGTTAGGGATCGGAATCGTTTTCATCATTTCAAAGACGGTCTTTGCTCTTGCAATGACTA
CTGCAATGATTTGAAGATGGATCTTCAACATCTTACATCCACGCATCAGGATGATCTTCCAGTCTACTACCATGATGATGTAAGGCTTAGGCTGTGTTACAAAACTAAAA
CCACCTACACATTGATACATGGTGGGGAAGAAAGGGAAGCATTATCATTGGAGCAGAAGAAACAACCTCTGGGCTTATTTTCCATATTTGCTTTTGGGGTGTCCCCTGAA
TCGTCACGGACTGCCATCAGTGAGCTTGTTCCGGGAGACTCGGGAGGAAACACTGTGTTCTCAGAAGCAAAAACTCGAGTAGGAATCATGGGCTCAGGGGGTGAAATGGA
CCATAGGCAACTGCGAACATGGTGTTATATGGACCAAGAACCAGCAACGAAGGAGAAGAATGGAGTACAGGAGCATTCAGAATGGCATAAAGAATCATGGATTATGAGGT
GGATGAAAAGTACTACCAGGTTTCAGATCACGGTGAATGATCCCATGAGAGTGCAGACATTCCATAGCACGAGCAATATCGAGGGCAAAACCAACTGCCACATGCGTGTC
CAAGCACCGTGGGCGCATATTCAGCAAGTACTTTCGTCCGACGAAGAGATGTTTGGGATCAATCAGCCATTTCGCTTCCAATCTGAACTCATCCGTAGCCGAGTAGAATC
TACTCCCAGCTTCCATGTTTCTAGAAAAGCCCAGAAATCAATCGAAAAGTTGGATGACCGATATGTCGCGGAGGAGGCAAAAGGGAGAAGTCCCTCATATTCTACTACAA
CACCCACACAGACGCAGACGGATTGCTTAAAAATTCGTGGGGAAAGGAGAAAGATCCCCAAAAAGGAAGCAGGGGGGGAGAACCAATTGCAAAGCAAAGCCCCCAAAGTT
AAGAATCAGAAATCATTAAATTTATGCCCCCATAATGGAGCTGAACTGAAATGGGGCTGA
Protein sequenceShow/hide protein sequence
MASTLACFPATSVTTITHISQFPENPKTLILQQCKTTKDLHQVHAHLLKTRRLLDPNITEALLESAALLLPNTIDYALSIFNHIDKPEPSAYNVMIRGLAFKQSPLNAVL
LFKKMHENSVEHDEFTFSCVLKACSRMRALREGEQVHAQILKSGRKPNEFVENTLIHMYANCGEVGVARQVFDGMSERGIIAWNSMLSGYTKNGLWDEVVKLFRKMLELH
IEFDDVTMISVLMACGRLADLELGELIGEHIVSKGLTRNHALTTSLVDMYAKCGQVNSARKLFDEMDKRDVVAWSAMISGYAQADQCKEALNLFHEMQKAKVDPNEVTMV
SVLYSCAVLGAYETGKWVHFYIKKEKMKLTVTLGTQLIDFYAKCGYIESSVEVFREMPFKNVFTWTALIQGLANNGEGKMALDFFSLMRENNVKPNDVTFIGILSACSHA
CLVHQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGLLEEAYQFIDNMPIPPNAVVWRTLLASCRAHKNVAMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDALR
VRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHTHSKEIHDALDEMLKRIKSLGYVPNIEDARLEADEDSKETSVSHHSEKLAIAYGLIRTSPRTTLRISKNLRMCRD
CHNAMKVISRVFDRTIIVRDRNRFHHFKDGLCSCNDYCNDLKMDLQHLTSTHQDDLPVYYHDDVRLRLCYKTKTTYTLIHGGEEREALSLEQKKQPLGLFSIFAFGVSPE
SSRTAISELVPGDSGGNTVFSEAKTRVGIMGSGGEMDHRQLRTWCYMDQEPATKEKNGVQEHSEWHKESWIMRWMKSTTRFQITVNDPMRVQTFHSTSNIEGKTNCHMRV
QAPWAHIQQVLSSDEEMFGINQPFRFQSELIRSRVESTPSFHVSRKAQKSIEKLDDRYVAEEAKGRSPSYSTTTPTQTQTDCLKIRGERRKIPKKEAGGENQLQSKAPKV
KNQKSLNLCPHNGAELKWG