CuGenDBv2

Tan0015366 (gene) of Snake gourd v1 genome

Gene ID: Tan0015366
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG07:65891974..65895306
RNA-Seq Expression: Tan0015366
Synteny: Tan0015366
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017816.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.6e-280; % identity: 89.33
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW  FRCNKF PTSLH  I+ +P RFIFAV+S +QSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFH+AGKYHPG+SHNYDTYHAIIHRLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL EL   GI CGEDLFI+VIR+YGLAGRPKMAVK F+RIQTFGV+RSVRSLNTLLNALVQNKR+SLVHLLFK+S+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
        ALC+KNDVEGARKV DEMPAMGMVPNVVTYTTILGGYVARGDMV AKRVFGELFD GWLPDATTYTILM+GYI+ GRFT+AVKVMD+MEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKY+PSSALCCKVIDVLCGEGRVKE CKLW KLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFLKVGKAEEVI+V EEMLDKGCLPNE TYSILAE LLKLGK GEFLN+LSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVD KAWHLFVP FV NMDEQ NML+KILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

XP_022154201.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Momordica charantia]; e value: 8.2e-285; % identity: 91.1
Query:  NKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVES
        NK K  SLHS IAIVP RFIFAV+S LQSYTVTPPIKPWPQRLYP+RLV+MII QQNLDLALQIFHHAGK+HPGFSHNYDTYHAI+HRLSRARAFEPVES
Subjt:  NKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVES

Query:  LLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKND
        LL++L   GIKCGEDLFITVIRSYGLAGRPK+AVKTFLRIQTFGV+RSVRSLNTLLNALVQN R+SLVHLLFK+S SKFGVVPNVFTCNILIKALCKKND
Subjt:  LLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKND

Query:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAY
        +EGARKVFDEMPAMGMVPNVVTYTTILGGYV+RGDMVGAKRVFGELFD GWLPDATTYTILMDGYI+QGRFTDAVKVMD+MEENG++PND+TYGVIIEAY
Subjt:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAY

Query:  CREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTL
        CREKKSGEALNLLNDMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWE+LLKKNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI SLLTYNTL
Subjt:  CREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTL

Query:  IAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFK
        IAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFL+VGKAEEVIKVVEEMLDKGCLPNE TYSILAEGLLKLGKGGEF NILSMFISS GVVDFK
Subjt:  IAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFK

Query:  AWHLFVPIFVSNMDEQTNMLDKILIETS
         WHLFVP FV NMDEQ+N+L+KILIETS
Subjt:  AWHLFVPIFVSNMDEQTNMLDKILIETS

XP_022982687.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Cucurbita maxima]; e value: 4.2e-281; % identity: 89.51
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW  FR NKF PTSLHSPI+ +  RFIF+V+S +QSYTVTPPIKPWPQRLYPKRLVAM+IRQQNLDLALQIFH+AGK+HPGFSHNYDTYHAIIHRLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL EL   GI CGEDLFI+VIR+YGLA RPKMAVKTFLRIQTFGV+RSVRSLNTLLNALVQNKR+SLVHLLFK+S+SKFGVV NVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
        ALC+KNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMV AKRVFGELFD GWLPDATTYTILM+GYI+ GRFT+AVKVMD+MEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKY+PSSALCCKVIDVLC EGRVKE CKLWEKLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+P+EFTYNMLIKGFLKVGKAEEVIKV EEMLDKGCLPNE TYSILAEGLLKLGKGGEFLNILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVD K WHLFVP FV NMDEQ NML+KILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

XP_023528110.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Cucurbita pepo subsp. pepo]; e value: 4.2e-281; % identity: 90.07
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW   R NKF PTSLHSPI+ +P RFIF V+S +QSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFH+AGKYHPGFSHNYDTYHAIIHRLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL EL   GI CGEDLFI+VIR+YGLAGRPKMAVK FLRIQTFGV+RSVRSLNTLLNALVQNKR+SLVHLLFK+S+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
        ALC+KNDVEGARKV DEMPAMGMVPNVVTYTTILGGYVARGDMV AKRVFGELFD GWLPDATTYTILM+GYI+ GRFT+AVKVMD+MEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKY+PSSALCCKVIDVLCGEGRVKE CKLW KLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFLKVGKAEEVIKV EEMLDKGCLPNE TYSILAEGLLKLGK GEFLNIL MFISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVD KAWHLFVP FV NMDEQ NML+KILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

XP_038893383.1 pentatricopeptide repeat-containing protein At5g16420, mitochondrial [Benincasa hispida]; e value: 1.5e-291; % identity: 92.88
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW  FR NKFK  SL +PI IVP RFIFAV+SPLQSYTVTPPIKPWPQRLYPKRLV+MIIRQQNLDLALQIFH+AGKYHPGFSHNYDTYHAII+RLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL+EL   GI CGEDLFITVIRSYGLAGRPK+AVKTFLRIQTFGV+RSVRSLNTLLNALVQN R+SLVHLLFKYS+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
         LCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYV+RGDMVGAKRVFGELFD GWLPDATTYTILMDGYI+QGRFTDAVKVMD+MEENGVEPNDVTY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLC EGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PN+FTYNMLIKGFLKVGKAEEVIK+ EEMLDKGCLPNE TYSILAEGLLKLGKGGEFLNILSMFISS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVDFKAWHLFVP FVSN+DEQ NMLDKILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LDM6 Uncharacterized protein; e value: 4.7e-278; % identity: 88.32
Query:  WFRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFE
        +FR N+FK  SLH+PI+IVP RFIFAVE+PLQSYTVTPPIKPWPQRL+P RLVAMI RQQNLDLALQIFH+AGKYHP F+HNYDTYHAII+RLSRARAFE
Subjt:  WFRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFE

Query:  PVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALC
        PVESLL+EL + GI C EDLFITVIRSYGLA RPKMA+KTFLRIQTFGV+RSVRSLNTLLNALVQN R+S VHLLFKYSKSKFGVVPNVFTCNILIKALC
Subjt:  PVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALC

Query:  KKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVI
        KKNDVEGARKVFDEMP+MG+VPNVVTYTTILGGYV+RGDM+GAKRVFGELFD GWLPDATTYTILMDGY++QGRFTDAVKVMD+MEENGVEPND+TYGVI
Subjt:  KKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVI

Query:  IEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLT
        I  YC+E+KSGEALNLLNDMLEKKYIP+SALCCKVIDVLCGEGRVKEACK+WEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERG+ISSLLT
Subjt:  IEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLT

Query:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGV
        YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKA+EVIKVVEEMLDKGCL NE TY IL EGLLKLGK  E LNILSM ISS G 
Subjt:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGV

Query:  VDFKAWHLFVPIFVSNMDEQTNMLDKILIET
        VDFKAW+LFVP FVSN++EQ N+L+KILIET
Subjt:  VDFKAWHLFVPIFVSNMDEQTNMLDKILIET

A0A1S3BPE8 pentatricopeptide repeat-containing protein At5g16420, mitochondrial; e value: 5.0e-280; % identity: 88.89
Query:  WFRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFE
        +FR N+FK  SLH+PI+IVP RFIFA+E+PLQSYTVTPPIKPWPQRL+PKRLVAMI RQQNLDLALQIFH+AGK+HP FSHNYDTYHAIIHRLSRARAFE
Subjt:  WFRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFE

Query:  PVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALC
        PVESLL+EL + GI C EDLFITVIRSYGLAGRPKMA+KTFLRIQTFGV+RSVRSLNTLLNALVQN R+SLVHLLFKYSKSKFGVVPNVFTCNILIKALC
Subjt:  PVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALC

Query:  KKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVI
        KKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYV+RGDMVGAKRVFGELFD GWLPDATTYTILMDGYI++GRFTDAVKVMD+MEENGVEPNDVTYGVI
Subjt:  KKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVI

Query:  IEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLT
        I AYC+E+KSGEALNLLNDMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFE G+ISSLLT
Subjt:  IEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLT

Query:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGV
        YNTLI GMCE+GELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLK+GKAEEVIKVVEEMLDKGCL NE TYS+L EGLLKLGKG E  NILSM IS+ G 
Subjt:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGV

Query:  VDFKAWHLFVPIFVSNMDEQTNMLDKILIET
        VDFKAWH  +P FVSN++EQ NML+KILIET
Subjt:  VDFKAWHLFVPIFVSNMDEQTNMLDKILIET

A0A6J1DN12 pentatricopeptide repeat-containing protein At5g16420, mitochondrial; e value: 4.0e-285; % identity: 91.1
Query:  NKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVES
        NK K  SLHS IAIVP RFIFAV+S LQSYTVTPPIKPWPQRLYP+RLV+MII QQNLDLALQIFHHAGK+HPGFSHNYDTYHAI+HRLSRARAFEPVES
Subjt:  NKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVES

Query:  LLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKND
        LL++L   GIKCGEDLFITVIRSYGLAGRPK+AVKTFLRIQTFGV+RSVRSLNTLLNALVQN R+SLVHLLFK+S SKFGVVPNVFTCNILIKALCKKND
Subjt:  LLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKND

Query:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAY
        +EGARKVFDEMPAMGMVPNVVTYTTILGGYV+RGDMVGAKRVFGELFD GWLPDATTYTILMDGYI+QGRFTDAVKVMD+MEENG++PND+TYGVIIEAY
Subjt:  VEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAY

Query:  CREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTL
        CREKKSGEALNLLNDMLEKKYIP+SALCCKVIDVLC EGRVKEACKLWE+LLKKNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI SLLTYNTL
Subjt:  CREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTL

Query:  IAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFK
        IAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFL+VGKAEEVIKVVEEMLDKGCLPNE TYSILAEGLLKLGKGGEF NILSMFISS GVVDFK
Subjt:  IAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFK

Query:  AWHLFVPIFVSNMDEQTNMLDKILIETS
         WHLFVP FV NMDEQ+N+L+KILIETS
Subjt:  AWHLFVPIFVSNMDEQTNMLDKILIETS

A0A6J1F4B0 pentatricopeptide repeat-containing protein At5g16420, mitochondrial; e value: 6.6e-280; % identity: 89.51
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW   R NKF PTS H PI  +P RFIFAV+S +QSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFH+AGKYHPGFSHNYDTYHAIIHRLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL EL   GI CGEDLFI+VIR+YGLAGRPKMAVK FLRIQTFGV+RSVRSLNTLLNALVQNKR+SLVHLLFK+S+SKFGVVPNVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
        ALC+KNDVEGARKV DEMPAMGMVPNVVTYTTILGGYVARGDMV AKRVFGELFD GWLPDATTYTILM+GYI+ GRFT+AVKVMD+MEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKY+PSSALCCKVIDVLCGEGRVKE CKLW KLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+PNEFTYNMLIKGFLKVGKAEEVI+V EEMLDKGCLPNE TYSILAE LLKLGK GEFLNILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVD KAWHLFVP FV NMDEQ NML+KILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

A0A6J1J5I0 pentatricopeptide repeat-containing protein At5g16420, mitochondrial; e value: 2.0e-281; % identity: 89.51
Query:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR
        MW  FR NKF PTSLHSPI+ +  RFIF+V+S +QSYTVTPPIKPWPQRLYPKRLVAM+IRQQNLDLALQIFH+AGK+HPGFSHNYDTYHAIIHRLSRAR
Subjt:  MW--FRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRAR

Query:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK
        AFEPVESLL EL   GI CGEDLFI+VIR+YGLA RPKMAVKTFLRIQTFGV+RSVRSLNTLLNALVQNKR+SLVHLLFK+S+SKFGVV NVFTCNILIK
Subjt:  AFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIK

Query:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY
        ALC+KNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMV AKRVFGELFD GWLPDATTYTILM+GYI+ GRFT+AVKVMD+MEENGVEPND+TY
Subjt:  ALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTY

Query:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS
        GVIIEAYCREKKSGEALNLLNDMLEKKY+PSSALCCKVIDVLC EGRVKE CKLWEKLL KNCTPDNAITSTLIHWLCKEGNIWEAR LFNEFERGSI S
Subjt:  GVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISS

Query:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS
        LLTYNTLIAGMCEMGELCEAARLWDDMLEKGC+P+EFTYNMLIKGFLKVGKAEEVIKV EEMLDKGCLPNE TYSILAEGLLKLGKGGEFLNILSMF+SS
Subjt:  LLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISS

Query:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET
         GVVD K WHLFVP FV NMDEQ NML+KILIET
Subjt:  GGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIET

SwissProt top hits (e value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090; e value: 8.0e-57; % identity: 30.91
Query:  TYHAIIHRLSRARAFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLR-IQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKY---SK
        T  ++I   + +  F+ VE LL  +        E  FI V R+YG A  P  AV  F R +  F  +RSV+S N++LN ++    Y      + Y   S 
Subjt:  TYHAIIHRLSRARAFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLR-IQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKY---SK

Query:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVK
            + PN  + N++IKALCK   V+ A +VF  MP    +P+  TY T++ G      +  A  +  E+   G  P    Y +L+DG  ++G  T   K
Subjt:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVK

Query:  VMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI
        ++D+M   G  PN+VTY  +I   C + K  +A++LL  M+  K IP+      +I+ L  + R  +A +L   + ++    +  I S LI  L KEG  
Subjt:  VMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI

Query:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLL
         EA  L+ +  E+G   +++ Y+ L+ G+C  G+  EA  + + M+  GC+PN +TY+ L+KGF K G  EE ++V +EM   GC  N+  YS+L +GL 
Subjt:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLL

Query:  KLGKGGEFLNILSMFISSGGVVDFKAWHLFVP--IFVSNMDEQTNMLDKILIE
         +G+  E + + S  ++ G   D  A+   +     + +MD    +  ++L +
Subjt:  KLGKGGEFLNILSMFISSGGVVDFKAWHLFVP--IFVSNMDEQTNMLDKILIE

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580; e value: 2.1e-57; % identity: 26.67
Query:  LYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHE-CGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQ
        L PK + A+I  Q++   AL++F+   K   GF H   TY ++I +L     FE +E +LV++ E  G    E +++  +++YG  G+ + AV  F R+ 
Subjt:  LYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHE-CGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQ

Query:  TFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVF-------------------------------------------------------
         +  + +V S N +++ LV +  +   H ++   + + G+ P+V+                                                       
Subjt:  TFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVF-------------------------------------------------------

Query:  ---------------TCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRF
                       T N L++ LCKK DV+   K+ D++   G++PN+ TY   + G   RG++ GA R+ G L ++G  PD  TY  L+ G  +  +F
Subjt:  ---------------TCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRF

Query:  TDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLC
         +A   +  M   G+EP+  TY  +I  YC+      A  ++ D +   ++P       +ID LC EG    A  L+ + L K   P+  + +TLI  L 
Subjt:  TDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLC

Query:  KEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSIL
         +G I EA +L NE  E+G I  + T+N L+ G+C+MG + +A  L   M+ KG  P+ FT+N+LI G+    K E  +++++ MLD G  P+  TY+ L
Subjt:  KEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSIL

Query:  AEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIF--VSNMDEQTNMLDKI
          GL K  K  + +      +  G   +   +++ +        +DE   +L+++
Subjt:  AEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIF--VSNMDEQTNMLDKI

Q9FFE3 Pentatricopeptide repeat-containing protein At5g16420, mitochondrial; e value: 1.1e-194; % identity: 63.73
Query:  AVESPLQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHEC--GIKCGEDLFI
        A  + LQ Y T  PPIKPWPQRL+PKRLV+MI +QQN+DLALQIF +AGK HPGF+HNYDTYH+I+ +LSRARAF+PVESL+ +L      IKCGE+LFI
Subjt:  AVESPLQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHEC--GIKCGEDLFI

Query:  TVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVP
         ++R+YGLAGR + +++ FLRI  FGV+RSVRSLNTLLN L+QN+R+ LVH +FK SK  FG+ PN+FTCN+L+KALCKKND+E A KV DE+P+MG+VP
Subjt:  TVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVP

Query:  NVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLE
        N+VTYTTILGGYVARGDM  AKRV  E+ DRGW PDATTYT+LMDGY + GRF++A  VMDDME+N +EPN+VTYGV+I A C+EKKSGEA N+ ++MLE
Subjt:  NVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLE

Query:  KKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWD
        + ++P S+LCCKVID LC + +V EAC LW K+LK NC PDNA+ STLIHWLCKEG + EARKLF+EFE+GSI SLLTYNTLIAGMCE GEL EA RLWD
Subjt:  KKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWD

Query:  DMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFVSNMDEQTN
        DM E+ C PN FTYN+LI+G  K G  +E ++V+EEML+ GC PN+ T+ IL EGL KLGK  + + I+SM + + G VD ++W LF+  F   +D+   
Subjt:  DMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFVSNMDEQTN

Query:  MLDKILIETS
         L ++L E S
Subjt:  MLDKILIETS

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic; e value: 8.2e-62; % identity: 29.01
Query:  CNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVE
        C KF P S+   + +    F   +  P  + +   P          K L+  +  Q +   AL++F+ A K  P FS     Y  I+ RL R+ +F+ ++
Subjt:  CNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVE

Query:  SLLVELHECGIKCGEDLFITVIRSYG-LAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKK
         +L ++     + G   F+ +I SY     + ++       I  FG++      N +LN LV      LV +      S +G+ P+V T N+LIKALC+ 
Subjt:  SLLVELHECGIKCGEDLFITVIRSYG-LAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKK

Query:  NDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDM-EENGVEPNDVTYGVII
        + +  A  + ++MP+ G+VP+  T+TT++ GY+  GD+ GA R+  ++ + G      +  +++ G+ ++GR  DA+  + +M  ++G  P+  T+  ++
Subjt:  NDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDM-EENGVEPNDVTYGVII

Query:  EAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLT
           C+      A+ +++ ML++ Y P       VI  LC  G VKEA ++ ++++ ++C+P+    +TLI  LCKE  + EA +L      +G +  + T
Subjt:  EAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLT

Query:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGE
        +N+LI G+C       A  L+++M  KGC P+EFTYNMLI      GK +E + ++++M   GC  + +TY+ L +G  K  K  E
Subjt:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGE

Q9SS81 Pentatricopeptide repeat-containing protein At3g09060; e value: 2.1e-65; % identity: 32.32
Query:  WPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESL--LVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKT
        +P+ L PK ++ ++  ++N   A  +F  A + HPG++H+   YH I+ RLS  R    V  +  L+   EC  KC ED+ ++VI++YG    P  A+  
Subjt:  WPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESL--LVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKT

Query:  FLRI-QTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGD
        F R+ + FG + ++RS NTLLNA V+ K++  V  LF Y ++  GV PN+ T N+LIK  CKK + E AR   D M   G  P+V +Y+T++      G 
Subjt:  FLRI-QTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGD

Query:  MVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDD-MEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDV
        +  A  +F E+ +RG  PD T Y IL+DG++++     A+++ D  +E++ V PN  T+ ++I    +                                
Subjt:  MVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDD-MEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDV

Query:  LCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYN
         C  GRV +  K+WE++ +     D    S+LIH LC  GN+ +A  +FNE  ER +   ++TYNT++ G C  G++ E+  LW  M  K  V N  +YN
Subjt:  LCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYN

Query:  MLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFV--SNMDEQTNMLDKI
        +LIKG L+ GK +E   +   M  KG   ++ TY I   GL   G   + L ++    SSGG +D  A+   +        ++E +N++ ++
Subjt:  MLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFV--SNMDEQTNMLDKI

Arabidopsis top hits (e value, % identity, alignment)
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 1.5e-58; % identity: 26.67
Query:  LYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHE-CGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQ
        L PK + A+I  Q++   AL++F+   K   GF H   TY ++I +L     FE +E +LV++ E  G    E +++  +++YG  G+ + AV  F R+ 
Subjt:  LYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHE-CGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQ

Query:  TFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVF-------------------------------------------------------
         +  + +V S N +++ LV +  +   H ++   + + G+ P+V+                                                       
Subjt:  TFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVF-------------------------------------------------------

Query:  ---------------TCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRF
                       T N L++ LCKK DV+   K+ D++   G++PN+ TY   + G   RG++ GA R+ G L ++G  PD  TY  L+ G  +  +F
Subjt:  ---------------TCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRF

Query:  TDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLC
         +A   +  M   G+EP+  TY  +I  YC+      A  ++ D +   ++P       +ID LC EG    A  L+ + L K   P+  + +TLI  L 
Subjt:  TDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLC

Query:  KEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSIL
         +G I EA +L NE  E+G I  + T+N L+ G+C+MG + +A  L   M+ KG  P+ FT+N+LI G+    K E  +++++ MLD G  P+  TY+ L
Subjt:  KEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSIL

Query:  AEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIF--VSNMDEQTNMLDKI
          GL K  K  + +      +  G   +   +++ +        +DE   +L+++
Subjt:  AEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIF--VSNMDEQTNMLDKI

AT3G09060.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 1.5e-66; % identity: 32.32
Query:  WPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESL--LVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKT
        +P+ L PK ++ ++  ++N   A  +F  A + HPG++H+   YH I+ RLS  R    V  +  L+   EC  KC ED+ ++VI++YG    P  A+  
Subjt:  WPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESL--LVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKT

Query:  FLRI-QTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGD
        F R+ + FG + ++RS NTLLNA V+ K++  V  LF Y ++  GV PN+ T N+LIK  CKK + E AR   D M   G  P+V +Y+T++      G 
Subjt:  FLRI-QTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGD

Query:  MVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDD-MEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDV
        +  A  +F E+ +RG  PD T Y IL+DG++++     A+++ D  +E++ V PN  T+ ++I    +                                
Subjt:  MVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDD-MEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDV

Query:  LCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYN
         C  GRV +  K+WE++ +     D    S+LIH LC  GN+ +A  +FNE  ER +   ++TYNT++ G C  G++ E+  LW  M  K  V N  +YN
Subjt:  LCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYN

Query:  MLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFV--SNMDEQTNMLDKI
        +LIKG L+ GK +E   +   M  KG   ++ TY I   GL   G   + L ++    SSGG +D  A+   +        ++E +N++ ++
Subjt:  MLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFV--SNMDEQTNMLDKI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 5.9e-63; % identity: 29.01
Query:  CNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVE
        C KF P S+   + +    F   +  P  + +   P          K L+  +  Q +   AL++F+ A K  P FS     Y  I+ RL R+ +F+ ++
Subjt:  CNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVE

Query:  SLLVELHECGIKCGEDLFITVIRSYG-LAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKK
         +L ++     + G   F+ +I SY     + ++       I  FG++      N +LN LV      LV +      S +G+ P+V T N+LIKALC+ 
Subjt:  SLLVELHECGIKCGEDLFITVIRSYG-LAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKK

Query:  NDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDM-EENGVEPNDVTYGVII
        + +  A  + ++MP+ G+VP+  T+TT++ GY+  GD+ GA R+  ++ + G      +  +++ G+ ++GR  DA+  + +M  ++G  P+  T+  ++
Subjt:  NDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDM-EENGVEPNDVTYGVII

Query:  EAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLT
           C+      A+ +++ ML++ Y P       VI  LC  G VKEA ++ ++++ ++C+P+    +TLI  LCKE  + EA +L      +G +  + T
Subjt:  EAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEF-ERGSISSLLT

Query:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGE
        +N+LI G+C       A  L+++M  KGC P+EFTYNMLI      GK +E + ++++M   GC  + +TY+ L +G  K  K  E
Subjt:  YNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGE

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 5.7e-58; % identity: 30.91
Query:  TYHAIIHRLSRARAFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLR-IQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKY---SK
        T  ++I   + +  F+ VE LL  +        E  FI V R+YG A  P  AV  F R +  F  +RSV+S N++LN ++    Y      + Y   S 
Subjt:  TYHAIIHRLSRARAFEPVESLLVELHECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLR-IQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKY---SK

Query:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVK
            + PN  + N++IKALCK   V+ A +VF  MP    +P+  TY T++ G      +  A  +  E+   G  P    Y +L+DG  ++G  T   K
Subjt:  SKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVK

Query:  VMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI
        ++D+M   G  PN+VTY  +I   C + K  +A++LL  M+  K IP+      +I+ L  + R  +A +L   + ++    +  I S LI  L KEG  
Subjt:  VMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNI

Query:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLL
         EA  L+ +  E+G   +++ Y+ L+ G+C  G+  EA  + + M+  GC+PN +TY+ L+KGF K G  EE ++V +EM   GC  N+  YS+L +GL 
Subjt:  WEARKLFNEF-ERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLL

Query:  KLGKGGEFLNILSMFISSGGVVDFKAWHLFVP--IFVSNMDEQTNMLDKILIE
         +G+  E + + S  ++ G   D  A+   +     + +MD    +  ++L +
Subjt:  KLGKGGEFLNILSMFISSGGVVDFKAWHLFVP--IFVSNMDEQTNMLDKILIE

AT5G16420.1 Pentatricopeptide repeat (PPR-like) superfamily protein; e value: 7.6e-196; % identity: 63.73
Query:  AVESPLQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHEC--GIKCGEDLFI
        A  + LQ Y T  PPIKPWPQRL+PKRLV+MI +QQN+DLALQIF +AGK HPGF+HNYDTYH+I+ +LSRARAF+PVESL+ +L      IKCGE+LFI
Subjt:  AVESPLQSY-TVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVELHEC--GIKCGEDLFI

Query:  TVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVP
         ++R+YGLAGR + +++ FLRI  FGV+RSVRSLNTLLN L+QN+R+ LVH +FK SK  FG+ PN+FTCN+L+KALCKKND+E A KV DE+P+MG+VP
Subjt:  TVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMGMVP

Query:  NVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLE
        N+VTYTTILGGYVARGDM  AKRV  E+ DRGW PDATTYT+LMDGY + GRF++A  VMDDME+N +EPN+VTYGV+I A C+EKKSGEA N+ ++MLE
Subjt:  NVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLE

Query:  KKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWD
        + ++P S+LCCKVID LC + +V EAC LW K+LK NC PDNA+ STLIHWLCKEG + EARKLF+EFE+GSI SLLTYNTLIAGMCE GEL EA RLWD
Subjt:  KKYIPSSALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWD

Query:  DMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFVSNMDEQTN
        DM E+ C PN FTYN+LI+G  K G  +E ++V+EEML+ GC PN+ T+ IL EGL KLGK  + + I+SM + + G VD ++W LF+  F   +D+   
Subjt:  DMLEKGCVPNEFTYNMLIKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFVSNMDEQTN

Query:  MLDKILIETS
         L ++L E S
Subjt:  MLDKILIETS


Sequences
CDS sequence
ATGTGGTTCCGCTGCAACAAATTCAAACCCACTTCATTGCATAGCCCAATTGCCATTGTTCCTTTCCGTTTCATCTTCGCCGTTGAATCTCCCCTTCAATCCTACACCGT
CACACCGCCGATCAAACCCTGGCCGCAGCGTCTCTATCCCAAGCGCCTCGTCGCAATGATCATTCGCCAACAGAATCTCGACCTCGCCCTTCAAATCTTCCACCACGCCG
GAAAATATCATCCTGGATTTTCCCACAACTACGATACCTATCATGCGATCATTCATCGTCTCTCTCGCGCTCGAGCTTTTGAACCCGTTGAGTCTTTGCTTGTGGAATTG
CATGAATGTGGCATCAAGTGCGGTGAGGATTTATTTATTACTGTGATTAGAAGCTATGGGCTTGCGGGCCGTCCGAAAATGGCTGTGAAAACGTTTCTGCGTATTCAAAC
CTTTGGTGTTCAACGCTCGGTGAGGTCGTTGAACACGTTGCTCAATGCTTTGGTGCAGAACAAACGGTATTCTTTGGTTCATTTATTGTTTAAGTATTCCAAATCCAAAT
TTGGGGTTGTGCCTAATGTGTTTACTTGCAACATTTTGATCAAAGCGCTTTGTAAAAAGAATGATGTTGAGGGTGCACGGAAGGTGTTTGACGAAATGCCTGCCATGGGT
ATGGTTCCAAATGTTGTTACTTACACAACAATCTTAGGTGGTTATGTTGCAAGAGGCGATATGGTTGGTGCCAAGAGAGTTTTTGGTGAGCTTTTTGATCGCGGGTGGCT
TCCTGATGCAACTACTTACACAATTTTAATGGATGGGTACATTAGGCAGGGTAGATTTACTGATGCTGTAAAGGTGATGGATGATATGGAGGAAAATGGGGTCGAGCCAA
ATGATGTGACTTATGGAGTCATTATTGAAGCTTACTGTAGGGAGAAAAAGTCTGGTGAAGCACTTAACCTGCTTAATGATATGCTTGAAAAGAAGTATATACCAAGCTCA
GCACTCTGCTGTAAGGTGATCGATGTTCTGTGCGGTGAGGGAAGGGTGAAGGAAGCTTGTAAGCTGTGGGAGAAGCTTTTGAAGAAGAACTGTACTCCGGATAATGCCAT
TACAAGCACCCTTATTCATTGGCTCTGTAAGGAGGGGAATATATGGGAAGCAAGAAAGTTATTTAATGAGTTTGAGAGGGGCTCAATTTCAAGTTTATTAACTTATAACA
CACTTATTGCAGGAATGTGTGAGATGGGGGAGTTGTGTGAAGCCGCTAGGTTGTGGGATGACATGTTGGAAAAGGGCTGTGTTCCTAATGAATTTACTTATAACATGTTG
ATAAAAGGATTTCTTAAAGTAGGTAAAGCTGAGGAAGTGATTAAAGTAGTGGAGGAGATGTTGGATAAGGGATGCTTGCCAAATGAATTGACATACTCGATATTGGCTGA
AGGGCTCCTCAAGTTGGGAAAAGGAGGAGAATTCCTGAATATTCTTTCGATGTTTATCTCAAGTGGTGGGGTTGTCGACTTTAAAGCCTGGCATCTATTTGTACCCATAT
TTGTTTCTAATATGGATGAACAGACAAATATGCTTGACAAAATATTGATCGAGACTTCTGGGTAG
mRNA sequence
CTTTTTTATACCCTTTATTTGTCTCATTTTTGGCACTAACTTTGTCATTGTTCCTCTCTTACCATTGAGACCGCACTAGAACCCTTCACATTGGGCTTTAGCGGCAATGG
CAATTCTCTATCTGTAAGGAACAAACCCATATCTTTCTTCAACCTCACGCTTTCAATTTTCAACCATGTGGTTCCGCTGCAACAAATTCAAACCCACTTCATTGCATAGC
CCAATTGCCATTGTTCCTTTCCGTTTCATCTTCGCCGTTGAATCTCCCCTTCAATCCTACACCGTCACACCGCCGATCAAACCCTGGCCGCAGCGTCTCTATCCCAAGCG
CCTCGTCGCAATGATCATTCGCCAACAGAATCTCGACCTCGCCCTTCAAATCTTCCACCACGCCGGAAAATATCATCCTGGATTTTCCCACAACTACGATACCTATCATG
CGATCATTCATCGTCTCTCTCGCGCTCGAGCTTTTGAACCCGTTGAGTCTTTGCTTGTGGAATTGCATGAATGTGGCATCAAGTGCGGTGAGGATTTATTTATTACTGTG
ATTAGAAGCTATGGGCTTGCGGGCCGTCCGAAAATGGCTGTGAAAACGTTTCTGCGTATTCAAACCTTTGGTGTTCAACGCTCGGTGAGGTCGTTGAACACGTTGCTCAA
TGCTTTGGTGCAGAACAAACGGTATTCTTTGGTTCATTTATTGTTTAAGTATTCCAAATCCAAATTTGGGGTTGTGCCTAATGTGTTTACTTGCAACATTTTGATCAAAG
CGCTTTGTAAAAAGAATGATGTTGAGGGTGCACGGAAGGTGTTTGACGAAATGCCTGCCATGGGTATGGTTCCAAATGTTGTTACTTACACAACAATCTTAGGTGGTTAT
GTTGCAAGAGGCGATATGGTTGGTGCCAAGAGAGTTTTTGGTGAGCTTTTTGATCGCGGGTGGCTTCCTGATGCAACTACTTACACAATTTTAATGGATGGGTACATTAG
GCAGGGTAGATTTACTGATGCTGTAAAGGTGATGGATGATATGGAGGAAAATGGGGTCGAGCCAAATGATGTGACTTATGGAGTCATTATTGAAGCTTACTGTAGGGAGA
AAAAGTCTGGTGAAGCACTTAACCTGCTTAATGATATGCTTGAAAAGAAGTATATACCAAGCTCAGCACTCTGCTGTAAGGTGATCGATGTTCTGTGCGGTGAGGGAAGG
GTGAAGGAAGCTTGTAAGCTGTGGGAGAAGCTTTTGAAGAAGAACTGTACTCCGGATAATGCCATTACAAGCACCCTTATTCATTGGCTCTGTAAGGAGGGGAATATATG
GGAAGCAAGAAAGTTATTTAATGAGTTTGAGAGGGGCTCAATTTCAAGTTTATTAACTTATAACACACTTATTGCAGGAATGTGTGAGATGGGGGAGTTGTGTGAAGCCG
CTAGGTTGTGGGATGACATGTTGGAAAAGGGCTGTGTTCCTAATGAATTTACTTATAACATGTTGATAAAAGGATTTCTTAAAGTAGGTAAAGCTGAGGAAGTGATTAAA
GTAGTGGAGGAGATGTTGGATAAGGGATGCTTGCCAAATGAATTGACATACTCGATATTGGCTGAAGGGCTCCTCAAGTTGGGAAAAGGAGGAGAATTCCTGAATATTCT
TTCGATGTTTATCTCAAGTGGTGGGGTTGTCGACTTTAAAGCCTGGCATCTATTTGTACCCATATTTGTTTCTAATATGGATGAACAGACAAATATGCTTGACAAAATAT
TGATCGAGACTTCTGGGTAGAAATTGTTTACAGACTCTCAGATCTACCTGTTGAATCGGAGTGGAGATATTGATCAGTAGACCACCACGGACGAACAGGGATCAGGCTGT
GTTCTGTTGTGATATCTTTTAGAACATGTTTGACGAATATTTTCCAAATCTGAGGCTGTTCTGCTGTTGTGCCATTGGGGTGTAGAACAAATTGTCTCTTCAAGCTATGC
CTCATCGCAAGTTTGCAACTACCTCCAATGCGGGCATGAGGTCAGGGAAAGTACCTCTAACCCATCTTCAATTTCGAGCAACTGGACAAATCCTCCGGGCCAACCTCTAG
CACAATGTTGAAGGGTGAGGCTGAGGTCCCTCCCCTCCCGTCTTCCATACTGAAAGCATTTATTAATACTATCCTATTAATTAAATGTTTGTTATTTAGTTATGAGGAAA
AAAAAAAGTGAAGTAACTGCTTCCTCAACGAATTGGAGAGATAATGTCGCTATTTTGTTTTTCGAATTCAAAGAAAAGAACAGGAGTAAGATAGTCGTCTCAAAACAAAA
ACCATTCTATGCATCCTTGTTTAAGATCTAAGGTTGTAAACGGAGGAAATTTTTCGTTTGTGAATATTCTCCACTTATCATTATCTTACATTTTCGGTGGATTTCACTGC
AATAAGAGGTCAGTACTCTACTACATCATAAAAATTGTTTTCACGTGTTGGTTTTCTTCAAAGGCTCTGATTTCAATATAGGAATTTTTGAGAAAAGAGTACTAACATCT
TGTTGTAGTGGAATCTACAAAGATTCGAGATAACGATCAATAGGGACTTGTTTGGGGCACAGGTTATAATAACCTATGGAATACTCTGCACTCGAAACACAAACTATTAT
AACCACATACTATACTACTCTGCATTTTAAATAATATTAAATTATTTCACAGATTATAATAACCCACTGACTATTATAATCTCCTATTATTATAACTTACTCAGCCACCC
CAAACAACCTTTAGGGGATTCAGGATAATCACGAACCAAAAATTCCATCTACTTGCATCCTATGATCTTCAACATGAATGCATAAGCCAAAGGAAGGATAGCTTAACTCA
ACAATAATTAACATGTATCCTCAATCAAGAAGTCGGAGGTTCAAATCTCCTACCCCAAATTGTTTTTGATTCCCTTTGTAGTGAAAGAGAACCAGGTTGTTTCTAAATAA
ATCTGTCTCTTTCTGAATTAGAAAATAAGAGAAAGCCAGT
Protein sequence
MWFRCNKFKPTSLHSPIAIVPFRFIFAVESPLQSYTVTPPIKPWPQRLYPKRLVAMIIRQQNLDLALQIFHHAGKYHPGFSHNYDTYHAIIHRLSRARAFEPVESLLVEL
HECGIKCGEDLFITVIRSYGLAGRPKMAVKTFLRIQTFGVQRSVRSLNTLLNALVQNKRYSLVHLLFKYSKSKFGVVPNVFTCNILIKALCKKNDVEGARKVFDEMPAMG
MVPNVVTYTTILGGYVARGDMVGAKRVFGELFDRGWLPDATTYTILMDGYIRQGRFTDAVKVMDDMEENGVEPNDVTYGVIIEAYCREKKSGEALNLLNDMLEKKYIPSS
ALCCKVIDVLCGEGRVKEACKLWEKLLKKNCTPDNAITSTLIHWLCKEGNIWEARKLFNEFERGSISSLLTYNTLIAGMCEMGELCEAARLWDDMLEKGCVPNEFTYNML
IKGFLKVGKAEEVIKVVEEMLDKGCLPNELTYSILAEGLLKLGKGGEFLNILSMFISSGGVVDFKAWHLFVPIFVSNMDEQTNMLDKILIETSG