; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020279 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020279
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr04:30503824..30505824
RNA-Seq ExpressionHG10020279
SyntenyHG10020279
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580575.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia]5.6e-26573.84Show/hide
Query:  ALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF-----------------------------
        ALQ +RR+DGMNYGAYGRLIQHCTD  F RLGKQLHARL+LSSVAPDNFLGSKLIA YS+S SLRDAYNVF                             
Subjt:  ALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF-----------------------------

Query:  -----------------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFY
                                                                                RIMFDR PERDIVSWNAMVAGYSQGGFY
Subjt:  -----------------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFY

Query:  EECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHG
        E+CKELFKAML S E KPNALTAVSVLQACAQSNDLIFGMEVH+FVNES +EMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHG
Subjt:  EECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHG

Query:  FVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-
        FVNQAMDLFRELERPALSTWNAVISGLVQNNQ D VVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYA+RN Y+GNIYVATAI+DSYAK 
Subjt:  FVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-

Query:  -------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLS
                                 SAYAAHGDAN  L LFYEMLTNGI+PDPVTFTSVLVACAHSGEL+EAWKIFNVLLPE+GIQPLVEHYACMVGVLS
Subjt:  -------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLS

Query:  RAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGG
        RAGKLSDAVEFISKMP+EPTAKVWGALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD+VRDLMKEVGLKKIPGNSWIETRGG
Subjt:  RAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGG

Query:  LRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        L+SFVARDTSNDRTPEIYG LEGL+ LMKEEG+I QHEIDDDCGSG
Subjt:  LRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

KAG7017327.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-26573.84Show/hide
Query:  ALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF-----------------------------
        ALQ +RR+DGMNYGAYGRLIQHCTD  F RLGKQLHARL+LSSVAPDNFLGSKLIA YS+S SLRDAYNVF                             
Subjt:  ALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF-----------------------------

Query:  -----------------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFY
                                                                                RIMFDR PERDIVSWNAMVAGYSQGGFY
Subjt:  -----------------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFY

Query:  EECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHG
        E+CKELFKAML S E KPNALTAVSVLQACAQSNDLIFGMEVH+FVNES +EMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHG
Subjt:  EECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHG

Query:  FVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-
        FVNQAMDLFRELERPALSTWNAVISGLVQNNQ D VVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYA+RN Y+GNIYVATAI+DSYAK 
Subjt:  FVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-

Query:  -------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLS
                                 SAYAAHGDAN  L LFYEMLTNGI+PDPVTFTSVLVACAHSGEL+EAWKIFNVLLPE+GIQPLVEHYACMVGVLS
Subjt:  -------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLS

Query:  RAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGG
        RAGKLSDAVEFISKMP+EPTAKVWGALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD+VRDLMKEVGLKKIPGNSWIETRGG
Subjt:  RAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGG

Query:  LRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        L+SFVARDTSNDRTPEIYG LEGL+ LMKEEG+I QHEIDDDCGSG
Subjt:  LRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

XP_022145703.1 pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia]2.1e-26772.49Show/hide
Query:  QTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG----------------
        Q S+PA   +PWALQA+RR DGMNY AYGRLIQHC D  F+RLGKQLHARL+L SV PDNFLGSKLIAFYS+S SLRDAYNVFG                
Subjt:  QTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG----------------

Query:  ------------------------------------------------------------------------------------RIMFDRMPERDIVSWN
                                                                                            RI+F RMPERDIVSWN
Subjt:  ------------------------------------------------------------------------------------RIMFDRMPERDIVSWN

Query:  AMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
        AMVAG+SQGGFYEECKELFK MLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ+EMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
Subjt:  AMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT

Query:  YGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIY
        YGSMISGYMVHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQ D V+DIFRAMQ HGCRPN VTLASVLP+FSHFSTLKGGKEIHAYA+RNGYNGNIY
Subjt:  YGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIY

Query:  VATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPL
        VATAI+DSYAK                          SAYAAHGDANVAL LFYEML NGIQPDPVTFTSVLVACAHSGEL+EAWKIFN++LPEYGIQPL
Subjt:  VATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPL

Query:  VEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKK
        VEHYACMVGVLSRAGKLSDAV+FISKMP+EP+AKVWGALLNGASVAGDVELGKYVFDRL EIEPENTG YIIMANLYSQ GRWKEADKVRDLMKEVGL+K
Subjt:  VEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKK

Query:  IPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        IPG+SWIET GGL SFVARDTSND TPEIY MLEGLLGLMKEEG ILQ+EID+DCGSG
Subjt:  IPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

XP_022934145.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata]4.6e-25973.7Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------
        MNYGAYGRLIQHCTD  F RLGKQLHARL+LSSVAPDNFLGSKLIA YS+S SLRDAYNVF                                       
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------

Query:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
                                                                      RIMFDR PERDIVSWNAMVAGYSQGGFYE+CKELFKAM
Subjt:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR
        L S E KPNALTAVSVLQACA SNDLIFGMEVH+FVNES +EMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHGFVNQAMDLFR
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR

Query:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------
        ELERPALSTWNAVISGLVQNNQ D VVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYA+RN Y+GNIYVATAI+DSYAK           
Subjt:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------

Query:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
                       SAYAAHGDAN  L LFYEMLTNGI+PDPVTFTSVLVACAHSGEL+EAWKIFNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS
        FISKMP+EPTAKVWGALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD VRDLMKEVGLKKIPGNSWIETR GL+SFVARDTS
Subjt:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS
        NDRTPEIYG LEGL+GLMKEEG+I QHEIDDDCGS
Subjt:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS

XP_038905794.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida]1.1e-27674.62Show/hide
Query:  PKNLQTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG------------
        PK  +  VPA+V L WALQALRRTD MNYGAYGRLIQHCTD LFVRLGKQLHARL+LSSVAPDNFLGSKLIAFYS+S SLRDAYNVFG            
Subjt:  PKNLQTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG------------

Query:  ----------------------------------------------------------------------------------------RIMFDRMPERDI
                                                                                                RI+FDRMPE+DI
Subjt:  ----------------------------------------------------------------------------------------RIMFDRMPERDI

Query:  VSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEK
        VSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALT VSVLQACAQSNDLIFGMEVHRFV+ESQ+EMDVSLCNAVIGLYAKCGSLDYARELFEEMP+K
Subjt:  VSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEK

Query:  DEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYN
        DEVTYGSMISGYMV+GFVNQAMDLFRELERP LSTWNAVISGLVQNNQ D V+DIFRAMQ HGCRPNTVTLASVLPIFSHFST+KGGKEIHAYAIR  Y+
Subjt:  DEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYN

Query:  GNIYVATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYG
        GNIYVAT I++SYAK                          SAYAAHGDANVAL LFYEML NGIQPDPVTFTSVLVACAHSGEL+EAWKIFNVLLP+YG
Subjt:  GNIYVATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYG

Query:  IQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEV
        IQP VEHYACMVGVLSRAGKLSDAVEFISKMP EPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNY+IMANLYSQFGRWKEADKVRDLMKEV
Subjt:  IQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEV

Query:  GLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        GLKKIPGNSWIETRGGL+SF+ARDTSN+RTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
Subjt:  GLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

TrEMBL top hitse value%identityAlignment
A0A1S4DUQ6 pentatricopeptide repeat-containing protein At2g373101.7e-23070.37Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG--------------------------------------
        MNYGAYGRLIQHCTDHLF R+GKQLHARL+LSSVAPDNFLGSKLI+FYS+S SLRDAYNVFG                                      
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG--------------------------------------

Query:  --------------------------------------------------------------RIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
                                                                      RIMFDRMPERDIVSWNAM+AGYSQGG YE+CKELF+ M
Subjt:  --------------------------------------------------------------RIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR
         SS+E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ++MDVSL NAVIGLYAKCGSLDYARELFEEMPEKD +TY SMISGYMVHGFVNQAMDLFR
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR

Query:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------
        ELERP L TWNAVISGLVQNN+ D  +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKGGKEIH YAIRN Y+GNI+VATAI+DSYAK           
Subjt:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------

Query:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
                       SAYA HGDANVAL LFYEMLT GIQPD VTFTSVL ACAHSGEL+EAWKIFN+LLP+YGIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSF
        FISKMPLEP AKVWGALLNGASVAGDVELGKYVFDRLFEIEP NTGNY+IMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETRGGL+SF
Subjt:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSF

A0A5A7TRM4 Pentatricopeptide repeat-containing protein1.7e-23070.37Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG--------------------------------------
        MNYGAYGRLIQHCTDHLF R+GKQLHARL+LSSVAPDNFLGSKLI+FYS+S SLRDAYNVFG                                      
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG--------------------------------------

Query:  --------------------------------------------------------------RIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
                                                                      RIMFDRMPERDIVSWNAM+AGYSQGG YE+CKELF+ M
Subjt:  --------------------------------------------------------------RIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR
         SS+E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ++MDVSL NAVIGLYAKCGSLDYARELFEEMPEKD +TY SMISGYMVHGFVNQAMDLFR
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR

Query:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------
        ELERP L TWNAVISGLVQNN+ D  +DIFRAMQ HGCRPNTVTLAS+LPIFSHFSTLKGGKEIH YAIRN Y+GNI+VATAI+DSYAK           
Subjt:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------

Query:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
                       SAYA HGDANVAL LFYEMLT GIQPD VTFTSVL ACAHSGEL+EAWKIFN+LLP+YGIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSF
        FISKMPLEP AKVWGALLNGASVAGDVELGKYVFDRLFEIEP NTGNY+IMANLYSQ GRWKEAD +RDLMKEV LKKIPGNSWIETRGGL+SF
Subjt:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSF

A0A6J1CWN9 pentatricopeptide repeat-containing protein At2g373101.0e-26772.49Show/hide
Query:  QTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG----------------
        Q S+PA   +PWALQA+RR DGMNY AYGRLIQHC D  F+RLGKQLHARL+L SV PDNFLGSKLIAFYS+S SLRDAYNVFG                
Subjt:  QTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFG----------------

Query:  ------------------------------------------------------------------------------------RIMFDRMPERDIVSWN
                                                                                            RI+F RMPERDIVSWN
Subjt:  ------------------------------------------------------------------------------------RIMFDRMPERDIVSWN

Query:  AMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
        AMVAG+SQGGFYEECKELFK MLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ+EMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
Subjt:  AMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT

Query:  YGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIY
        YGSMISGYMVHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQ D V+DIFRAMQ HGCRPN VTLASVLP+FSHFSTLKGGKEIHAYA+RNGYNGNIY
Subjt:  YGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIY

Query:  VATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPL
        VATAI+DSYAK                          SAYAAHGDANVAL LFYEML NGIQPDPVTFTSVLVACAHSGEL+EAWKIFN++LPEYGIQPL
Subjt:  VATAIVDSYAK--------------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPL

Query:  VEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKK
        VEHYACMVGVLSRAGKLSDAV+FISKMP+EP+AKVWGALLNGASVAGDVELGKYVFDRL EIEPENTG YIIMANLYSQ GRWKEADKVRDLMKEVGL+K
Subjt:  VEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKK

Query:  IPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        IPG+SWIET GGL SFVARDTSND TPEIY MLEGLLGLMKEEG ILQ+EID+DCGSG
Subjt:  IPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

A0A6J1F110 pentatricopeptide repeat-containing protein At2g373102.2e-25973.7Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------
        MNYGAYGRLIQHCTD  F RLGKQLHARL+LSSVAPDNFLGSKLIA YS+S SLRDAYNVF                                       
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------

Query:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
                                                                      RIMFDR PERDIVSWNAMVAGYSQGGFYE+CKELFKAM
Subjt:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR
        L S E KPNALTAVSVLQACA SNDLIFGMEVH+FVNES +EMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHGFVNQAMDLFR
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR

Query:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------
        ELERPALSTWNAVISGLVQNNQ D VVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYA+RN Y+GNIYVATAI+DSYAK           
Subjt:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------

Query:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
                       SAYAAHGDAN  L LFYEMLTNGI+PDPVTFTSVLVACAHSGEL+EAWKIFNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS
        FISKMP+EPTAKVWGALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD VRDLMKEVGLKKIPGNSWIETR GL+SFVARDTS
Subjt:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS
        NDRTPEIYG LEGL+GLMKEEG+I QHEIDDDCGS
Subjt:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS

A0A6J1J0S5 pentatricopeptide repeat-containing protein At2g373107.2e-25873.58Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------
        MNYGAYGRLIQHCTD  F RLGKQLHARL+LSSVAPDNFLGSKLIA YS+S SLRDAYNVF                                       
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVF---------------------------------------

Query:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
                                                                      RIMF R PERDIVSWNAMVAGYSQGGFYE+CKELFKAM
Subjt:  -------------------------------------------------------------GRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR
        L S E KPNALTAVSVLQACAQSNDLIFGMEVH+FVNES +EMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHGFVNQAMDLFR
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFR

Query:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------
        ELERPALSTWNAVISGLVQNNQ D VVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYA+RN Y+GNIYVATAI+DSYAK           
Subjt:  ELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------

Query:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE
                       SAYAAHGDAN  L LFYEMLTNGI+PDPVTFTSVLVACAHSGELNEAWKIFNVLLPE+GIQPLVEHYACMVGVLSRAGKLSDAVE
Subjt:  ---------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVE

Query:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS
        FISKMP+EPTAKVWGALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFG WKEAD VRDLMKEVGLKKIPGNSWIETRGGL+SFVARDTS
Subjt:  FISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTS

Query:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG
        NDRTPEIYG LEGL+GLMK EG+I QHEIDD+CGSG
Subjt:  NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic8.6e-9133.79Show/hide
Query:  LIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKP
        LI+   +   + LG+ LH   + S+V  D F+ + LI  Y     L  A  VF  I      E+D+VSWN+M+ G+ Q G  ++  ELFK M  S ++K 
Subjt:  LIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKP

Query:  NALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALS
        + +T V VL ACA+  +L FG +V  ++ E++V ++++L NA++ +Y KCGS++ A+ LF+ M EKD VT+ +M+ GY +      A ++   + +  + 
Subjt:  NALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALS

Query:  TWNAVISGLVQNNQHDRVVDIFRAMQLH-GCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK------------------
         WNA+IS   QN + +  + +F  +QL    + N +TL S L   +    L+ G+ IH+Y  ++G   N +V +A++  Y+K                  
Subjt:  TWNAVISGLVQNNQHDRVVDIFRAMQLH-GCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK------------------

Query:  --------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPL
                   A HG  N A+ +FY+M    ++P+ VTFT+V  AC+H+G ++EA  +F+ +   YGI P  +HYAC+V VL R+G L  AV+FI  MP+
Subjt:  --------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPL

Query:  EPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEI
         P+  VWGALL    +  ++ L +    RL E+EP N G +++++N+Y++ G+W+   ++R  M+  GLKK PG S IE  G +  F++ D ++  + ++
Subjt:  EPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEI

Query:  YGMLEGLLGLMKEEG
        YG L  ++  +K  G
Subjt:  YGMLEGLLGLMKEEG

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220702.4e-8534.74Show/hide
Query:  FDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYAR
        F++M ERDIV+WN+M++G++Q G+     ++F  ML    L P+  T  SVL ACA    L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR
Subjt:  FDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYAR

Query:  ELFEEMPEKDEVTYG--SMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKE
         L E+   KD    G  +++ GY+  G +NQA ++F  L+   +  W A+I G  Q+  +   +++FR+M   G RPN+ TLA++L + S  ++L  GK+
Subjt:  ELFEEMPEKDEVTYG--SMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKE

Query:  IHAYAIRNGYNGNIYVATAIVDSYAKS---------------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEA
        IH  A+++G   ++ V+ A++  YAK+                           A A HG A  AL LF  ML  G++PD +T+  V  AC H+G +N+ 
Subjt:  IHAYAIRNGYNGNIYVATAIVDSYAKS---------------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEA

Query:  WKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWK
         + F+++     I P + HYACMV +  RAG L +A EFI KMP+EP    WG+LL+   V  +++LGK   +RL  +EPEN+G Y  +ANLYS  G+W+
Subjt:  WKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWK

Query:  EADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII-----LQHEIDDD
        EA K+R  MK+  +KK  G SWIE +  +  F   D ++    EIY  ++ +   +K+ G +     + H+++++
Subjt:  EADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII-----LQHEIDDD

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136005.2e-8831.25Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
        +N  ++  ++  C+    +  G Q+H+ +  S    D ++GS L+  YS+  ++ DA  V     FD M +R++VSWN+++  + Q G   E  ++F+ M
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVH-RFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLF
        L S  ++P+ +T  SV+ ACA  + +  G EVH R V   ++  D+ L NA + +YAKC  +  AR +F+ MP ++ +   SMISGY +      A  +F
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVH-RFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLF

Query:  RELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGY------NGNIYVATAIVDSYAKS---
         ++    + +WNA+I+G  QN +++  + +F  ++     P   + A++L   +  + L  G + H + +++G+        +I+V  +++D Y K    
Subjt:  RELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGY------NGNIYVATAIVDSYAKS---

Query:  -----------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAG
                                +A +G  N AL LF EML +G +PD +T   VL AC H+G + E    F+ +  ++G+ PL +HY CMV +L RAG
Subjt:  -----------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAG

Query:  KLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRS
         L +A   I +MP++P + +WG+LL    V  ++ LGKYV ++L E+EP N+G Y++++N+Y++ G+W++   VR  M++ G+ K PG SWI+ +G    
Subjt:  KLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRS

Query:  FVARDTSNDRTPEIYGMLEGLLGLMKEE
        F+ +D S+ R  +I+ +L+ L+  M+ E
Subjt:  FVARDTSNDRTPEIYGMLEGLLGLMKEE

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic3.7e-8633.73Show/hide
Query:  CTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALT
        C D   + LG+ +H+  + +  + ++   + L+  YS+   L  A     + +F  M +R +VS+ +M+AGY++ G   E  +LF+ M     + P+  T
Subjt:  CTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALT

Query:  AVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNA
          +VL  CA+   L  G  VH ++ E+ +  D+ + NA++ +YAKCGS+  A  +F EM  KD +++ ++I GY  + + N+A+ LF  L          
Subjt:  AVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNA

Query:  VISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------------------
                            ++     P+  T+A VLP  +  S    G+EIH Y +RNGY  + +VA ++VD YAK                       
Subjt:  VISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------------------

Query:  ---SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAK
           + Y  HG    A+ LF +M   GI+ D ++F S+L AC+HSG ++E W+ FN++  E  I+P VEHYAC+V +L+R G L  A  FI  MP+ P A 
Subjt:  ---SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAK

Query:  VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLE
        +WGALL G  +  DV+L + V +++FE+EPENTG Y++MAN+Y++  +W++  ++R  + + GL+K PG SWIE +G +  FVA D+SN  T  I   L 
Subjt:  VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLE

Query:  GLLGLMKEEG
         +   M EEG
Subjt:  GLLGLMKEEG

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g373102.1e-16954.21Show/hide
Query:  RTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKEL
        R D ++     + +  C D     L +Q+H  +I      D F+G+ +I +Y++  ++  A     R +FD M ERD+VSWN+M++GYSQ G +E+CK++
Subjt:  RTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKEL

Query:  FKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAM
        +KAML+  + KPN +T +SV QAC QS+DLIFG+EVH+ + E+ ++MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTYG++ISGYM HG V +AM
Subjt:  FKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAM

Query:  DLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-------
         LF E+E   LSTWNA+ISGL+QNN H+ V++ FR M   G RPNTVTL+S+LP  ++ S LKGGKEIHA+AIRNG + NIYV T+I+D+YAK       
Subjt:  DLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-------

Query:  -------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLS
                           +AYA HGD++ A  LF +M   G +PD VT T+VL A AHSG+ + A  IF+ +L +Y I+P VEHYACMV VLSRAGKLS
Subjt:  -------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLS

Query:  DAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVA
        DA+EFISKMP++P AKVWGALLNGASV GD+E+ ++  DRLFE+EPENTGNY IMANLY+Q GRW+EA+ VR+ MK +GLKKIPG SWIET  GLRSF+A
Subjt:  DAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVA

Query:  RDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD
        +D+S +R+ E+Y ++EGL+  M ++  I + E+D+
Subjt:  RDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-8931.25Show/hide
Query:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM
        +N  ++  ++  C+    +  G Q+H+ +  S    D ++GS L+  YS+  ++ DA  V     FD M +R++VSWN+++  + Q G   E  ++F+ M
Subjt:  MNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAM

Query:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVH-RFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLF
        L S  ++P+ +T  SV+ ACA  + +  G EVH R V   ++  D+ L NA + +YAKC  +  AR +F+ MP ++ +   SMISGY +      A  +F
Subjt:  LSSVELKPNALTAVSVLQACAQSNDLIFGMEVH-RFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLF

Query:  RELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGY------NGNIYVATAIVDSYAKS---
         ++    + +WNA+I+G  QN +++  + +F  ++     P   + A++L   +  + L  G + H + +++G+        +I+V  +++D Y K    
Subjt:  RELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGY------NGNIYVATAIVDSYAKS---

Query:  -----------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAG
                                +A +G  N AL LF EML +G +PD +T   VL AC H+G + E    F+ +  ++G+ PL +HY CMV +L RAG
Subjt:  -----------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAG

Query:  KLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRS
         L +A   I +MP++P + +WG+LL    V  ++ LGKYV ++L E+EP N+G Y++++N+Y++ G+W++   VR  M++ G+ K PG SWI+ +G    
Subjt:  KLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRS

Query:  FVARDTSNDRTPEIYGMLEGLLGLMKEE
        F+ +D S+ R  +I+ +L+ L+  M+ E
Subjt:  FVARDTSNDRTPEIYGMLEGLLGLMKEE

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein1.7e-8634.74Show/hide
Query:  FDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYAR
        F++M ERDIV+WN+M++G++Q G+     ++F  ML    L P+  T  SVL ACA    L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR
Subjt:  FDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYAR

Query:  ELFEEMPEKDEVTYG--SMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKE
         L E+   KD    G  +++ GY+  G +NQA ++F  L+   +  W A+I G  Q+  +   +++FR+M   G RPN+ TLA++L + S  ++L  GK+
Subjt:  ELFEEMPEKDEVTYG--SMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKE

Query:  IHAYAIRNGYNGNIYVATAIVDSYAKS---------------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEA
        IH  A+++G   ++ V+ A++  YAK+                           A A HG A  AL LF  ML  G++PD +T+  V  AC H+G +N+ 
Subjt:  IHAYAIRNGYNGNIYVATAIVDSYAKS---------------------------AYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEA

Query:  WKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWK
         + F+++     I P + HYACMV +  RAG L +A EFI KMP+EP    WG+LL+   V  +++LGK   +RL  +EPEN+G Y  +ANLYS  G+W+
Subjt:  WKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWK

Query:  EADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII-----LQHEIDDD
        EA K+R  MK+  +KK  G SWIE +  +  F   D ++    EIY  ++ +   +K+ G +     + H+++++
Subjt:  EADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII-----LQHEIDDD

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.1e-9233.79Show/hide
Query:  LIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKP
        LI+   +   + LG+ LH   + S+V  D F+ + LI  Y     L  A  VF  I      E+D+VSWN+M+ G+ Q G  ++  ELFK M  S ++K 
Subjt:  LIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKP

Query:  NALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALS
        + +T V VL ACA+  +L FG +V  ++ E++V ++++L NA++ +Y KCGS++ A+ LF+ M EKD VT+ +M+ GY +      A ++   + +  + 
Subjt:  NALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALS

Query:  TWNAVISGLVQNNQHDRVVDIFRAMQLH-GCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK------------------
         WNA+IS   QN + +  + +F  +QL    + N +TL S L   +    L+ G+ IH+Y  ++G   N +V +A++  Y+K                  
Subjt:  TWNAVISGLVQNNQHDRVVDIFRAMQLH-GCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK------------------

Query:  --------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPL
                   A HG  N A+ +FY+M    ++P+ VTFT+V  AC+H+G ++EA  +F+ +   YGI P  +HYAC+V VL R+G L  AV+FI  MP+
Subjt:  --------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPL

Query:  EPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEI
         P+  VWGALL    +  ++ L +    RL E+EP N G +++++N+Y++ G+W+   ++R  M+  GLKK PG S IE  G +  F++ D ++  + ++
Subjt:  EPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEI

Query:  YGMLEGLLGLMKEEG
        YG L  ++  +K  G
Subjt:  YGMLEGLLGLMKEEG

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-17054.21Show/hide
Query:  RTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKEL
        R D ++     + +  C D     L +Q+H  +I      D F+G+ +I +Y++  ++  A     R +FD M ERD+VSWN+M++GYSQ G +E+CK++
Subjt:  RTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKEL

Query:  FKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAM
        +KAML+  + KPN +T +SV QAC QS+DLIFG+EVH+ + E+ ++MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTYG++ISGYM HG V +AM
Subjt:  FKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAM

Query:  DLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-------
         LF E+E   LSTWNA+ISGL+QNN H+ V++ FR M   G RPNTVTL+S+LP  ++ S LKGGKEIHA+AIRNG + NIYV T+I+D+YAK       
Subjt:  DLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-------

Query:  -------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLS
                           +AYA HGD++ A  LF +M   G +PD VT T+VL A AHSG+ + A  IF+ +L +Y I+P VEHYACMV VLSRAGKLS
Subjt:  -------------------SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLS

Query:  DAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVA
        DA+EFISKMP++P AKVWGALLNGASV GD+E+ ++  DRLFE+EPENTGNY IMANLY+Q GRW+EA+ VR+ MK +GLKKIPG SWIET  GLRSF+A
Subjt:  DAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVA

Query:  RDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD
        +D+S +R+ E+Y ++EGL+  M ++  I + E+D+
Subjt:  RDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-8733.73Show/hide
Query:  CTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALT
        C D   + LG+ +H+  + +  + ++   + L+  YS+   L  A     + +F  M +R +VS+ +M+AGY++ G   E  +LF+ M     + P+  T
Subjt:  CTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALT

Query:  AVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNA
          +VL  CA+   L  G  VH ++ E+ +  D+ + NA++ +YAKCGS+  A  +F EM  KD +++ ++I GY  + + N+A+ LF  L          
Subjt:  AVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNA

Query:  VISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------------------
                            ++     P+  T+A VLP  +  S    G+EIH Y +RNGY  + +VA ++VD YAK                       
Subjt:  VISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAK-----------------------

Query:  ---SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAK
           + Y  HG    A+ LF +M   GI+ D ++F S+L AC+HSG ++E W+ FN++  E  I+P VEHYAC+V +L+R G L  A  FI  MP+ P A 
Subjt:  ---SAYAAHGDANVALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAK

Query:  VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLE
        +WGALL G  +  DV+L + V +++FE+EPENTG Y++MAN+Y++  +W++  ++R  + + GL+K PG SWIE +G +  FVA D+SN  T  I   L 
Subjt:  VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLE

Query:  GLLGLMKEEG
         +   M EEG
Subjt:  GLLGLMKEEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAATGCGAAGCCCAAAAACCTTCAAACCTCAGTTCCCGCCAGCGTCTCTCTTCCATGGGCTCTACAGGCGCTCCGCCGCACCGACGGGATGAACTACGGCGCTTA
TGGTCGCCTTATCCAGCACTGCACCGACCACCTCTTCGTCCGCCTCGGTAAGCAGCTTCACGCTCGTCTTATTCTATCATCCGTCGCTCCCGATAACTTCCTCGGATCGA
AACTCATCGCCTTCTACTCTAGATCCAGCAGCCTTAGAGATGCCTACAATGTGTTCGGGAGAATTATGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCG
ATGGTGGCTGGGTACTCTCAGGGTGGGTTCTATGAGGAGTGCAAGGAACTATTTAAAGCGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCGTTAACCGCAGTCAGTGT
TTTGCAAGCTTGTGCTCAGTCAAACGATCTCATTTTTGGAATGGAAGTTCATAGATTCGTTAATGAAAGCCAGGTTGAAATGGATGTTTCACTGTGCAATGCTGTTATTG
GATTATATGCAAAGTGTGGTAGCTTGGATTATGCTCGGGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCTCGATGATATCAGGCTACATGGTCCAT
GGTTTTGTAAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTTTGGTCCAGAACAACCAGCATGATAGAGT
TGTAGATATATTCCGAGCAATGCAGTTGCATGGCTGCAGACCAAATACTGTGACACTTGCGAGCGTTCTTCCCATTTTCTCACATTTTTCAACCCTAAAAGGTGGGAAAG
AAATTCATGCTTATGCCATTAGAAACGGTTACAATGGGAATATATATGTTGCTACTGCCATCGTTGATTCTTATGCTAAATCTGCATATGCTGCACATGGAGATGCCAAT
GTGGCTCTTCGTCTTTTCTATGAGATGCTGACAAACGGGATTCAGCCTGACCCGGTAACCTTTACATCAGTATTGGTTGCCTGTGCCCATTCAGGAGAGTTAAATGAAGC
CTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGGATTCAACCACTAGTCGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTGATGCTG
TTGAATTTATTTCTAAAATGCCACTTGAACCCACCGCAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCGGTTGCTGGAGATGTTGAACTTGGAAAGTATGTTTTTGAT
CGTCTCTTTGAGATTGAGCCTGAAAATACTGGTAACTATATCATCATGGCTAACTTATATTCACAATTTGGAAGGTGGAAAGAAGCTGACAAGGTTAGGGATTTGATGAA
GGAAGTTGGATTGAAGAAGATCCCGGGAAATAGCTGGATAGAAACAAGAGGAGGGTTGCGGAGTTTCGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATTTATG
GAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGAATCATTCTGCAACATGAGATAGATGACGACTGTGGCAGTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAATGCGAAGCCCAAAAACCTTCAAACCTCAGTTCCCGCCAGCGTCTCTCTTCCATGGGCTCTACAGGCGCTCCGCCGCACCGACGGGATGAACTACGGCGCTTA
TGGTCGCCTTATCCAGCACTGCACCGACCACCTCTTCGTCCGCCTCGGTAAGCAGCTTCACGCTCGTCTTATTCTATCATCCGTCGCTCCCGATAACTTCCTCGGATCGA
AACTCATCGCCTTCTACTCTAGATCCAGCAGCCTTAGAGATGCCTACAATGTGTTCGGGAGAATTATGTTTGATAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCG
ATGGTGGCTGGGTACTCTCAGGGTGGGTTCTATGAGGAGTGCAAGGAACTATTTAAAGCGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCGTTAACCGCAGTCAGTGT
TTTGCAAGCTTGTGCTCAGTCAAACGATCTCATTTTTGGAATGGAAGTTCATAGATTCGTTAATGAAAGCCAGGTTGAAATGGATGTTTCACTGTGCAATGCTGTTATTG
GATTATATGCAAAGTGTGGTAGCTTGGATTATGCTCGGGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCTCGATGATATCAGGCTACATGGTCCAT
GGTTTTGTAAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTTTGGTCCAGAACAACCAGCATGATAGAGT
TGTAGATATATTCCGAGCAATGCAGTTGCATGGCTGCAGACCAAATACTGTGACACTTGCGAGCGTTCTTCCCATTTTCTCACATTTTTCAACCCTAAAAGGTGGGAAAG
AAATTCATGCTTATGCCATTAGAAACGGTTACAATGGGAATATATATGTTGCTACTGCCATCGTTGATTCTTATGCTAAATCTGCATATGCTGCACATGGAGATGCCAAT
GTGGCTCTTCGTCTTTTCTATGAGATGCTGACAAACGGGATTCAGCCTGACCCGGTAACCTTTACATCAGTATTGGTTGCCTGTGCCCATTCAGGAGAGTTAAATGAAGC
CTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGGATTCAACCACTAGTCGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTGATGCTG
TTGAATTTATTTCTAAAATGCCACTTGAACCCACCGCAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCGGTTGCTGGAGATGTTGAACTTGGAAAGTATGTTTTTGAT
CGTCTCTTTGAGATTGAGCCTGAAAATACTGGTAACTATATCATCATGGCTAACTTATATTCACAATTTGGAAGGTGGAAAGAAGCTGACAAGGTTAGGGATTTGATGAA
GGAAGTTGGATTGAAGAAGATCCCGGGAAATAGCTGGATAGAAACAAGAGGAGGGTTGCGGAGTTTCGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATTTATG
GAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGAATCATTCTGCAACATGAGATAGATGACGACTGTGGCAGTGGTTAG
Protein sequenceShow/hide protein sequence
MRNAKPKNLQTSVPASVSLPWALQALRRTDGMNYGAYGRLIQHCTDHLFVRLGKQLHARLILSSVAPDNFLGSKLIAFYSRSSSLRDAYNVFGRIMFDRMPERDIVSWNA
MVAGYSQGGFYEECKELFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVH
GFVNQAMDLFRELERPALSTWNAVISGLVQNNQHDRVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAIRNGYNGNIYVATAIVDSYAKSAYAAHGDAN
VALRLFYEMLTNGIQPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEYGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPLEPTAKVWGALLNGASVAGDVELGKYVFD
RLFEIEPENTGNYIIMANLYSQFGRWKEADKVRDLMKEVGLKKIPGNSWIETRGGLRSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG