CuGenDBv2

PI0023760 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0023760
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr12:16325191..16326251
RNA-Seq Expression: PI0023760
Synteny: PI0023760
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
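
The genome location above is a `chrom:start..end` string. The following is a minimal sketch of how such a string could be split into its parts; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr12:16325191..16326251")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 1061 bp on chr12.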


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0059103.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]; e value: 2.3e-162; % identity: 93.83
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALLEACSSTKNLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKS DRHTFP VLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMY KCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M++EGQ+PDELTFTSLL+ACSHGGLIEKGKEYF  MRMEF LRPGLQHYTCMVDLLGRLGQVEEAEKLIMEME+EPDEALWGAMLSACRIHG+ DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

KAG6583697.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.7e-157; % identity: 90.55
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKPQ+IIPFYAALL+ACSSTKN  TLKQIHALTI L ISHH FIRTKLASTYAAC  LPQATTIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGK IDRHT PPV+KSCTGLSSLRLGRQVHGA++INGFS+DLP+LNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMF EVFGLFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M+EEGQKPDELTFT+LL+ACSHGGLIE+GKEYFGMM M FDLRPGL+HYTCMVDLLGR+GQVEEAEKLIMEMEIEPDEALWGA+LSACRIHGKA+VADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQ
        Q RFIKQ
Subjt:  QKRFIKQ

XP_004144516.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis sativus]; e value: 9.9e-166; % identity: 95.45
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALL+ACSST NLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFN LIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKSIDRHT PPVLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M+EEGQKPDELTFTSLL+ACSHGGLIEKGKEYFGMMRMEF LRPGLQHYTCMVDLLGR GQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGK DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

XP_008455480.1 PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucumis melo]; e value: 1.5e-161; % identity: 93.51
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALLEACSSTKNLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKS DRHTFP VLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMY KCGDLGVARKVFDGMPERN VSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M++EGQ+PDELTFTSLL+ACSHGGLIEKGKEYF  MRMEF LRPGLQHYTCMVDLLGRLGQVEEAEKLIMEME+EPDEALWGAMLSACRIHG+ DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

XP_023520705.1 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo]; e value: 1.7e-157; % identity: 90.23
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKPQ+IIPFYAALL+ACSSTKN  TLKQIHALTI L ISHH FIRTKLASTYAAC  LPQATTIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        L+SGK IDRHT PPVLKSCTGLSSLRLGRQVHGA++INGFS+DLP+LNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M+EEGQKPDELTFT+LL+ACSHGGLIE+GKEYFGMM+M FDLRPGL+HYTCMVDLLGR+GQVEEAEKLIMEMEIEPDEALWGA+L ACRIHGKA+VADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQ
        Q RF+KQ
Subjt:  QKRFIKQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K1F7 Uncharacterized protein; e value: 4.8e-166; % identity: 95.45
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALL+ACSST NLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFN LIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKSIDRHT PPVLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M+EEGQKPDELTFTSLL+ACSHGGLIEKGKEYFGMMRMEF LRPGLQHYTCMVDLLGR GQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGK DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

A0A1S3C0K0 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like; e value: 7.2e-162; % identity: 93.51
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALLEACSSTKNLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKS DRHTFP VLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMY KCGDLGVARKVFDGMPERN VSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M++EGQ+PDELTFTSLL+ACSHGGLIEKGKEYF  MRMEF LRPGLQHYTCMVDLLGRLGQVEEAEKLIMEME+EPDEALWGAMLSACRIHG+ DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

A0A5A7UT42 Pentatricopeptide repeat-containing protein; e value: 1.1e-162; % identity: 93.83
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKP EIIPFYAALLEACSSTKNLHTLKQIHALTI L ISHHHFIRTKLASTYAACAQLPQA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGKS DRHTFP VLKSCTGLSSLRLGRQVHGALLINGFS+DLPSLNALITMY KCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVF LFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M++EGQ+PDELTFTSLL+ACSHGGLIEKGKEYF  MRMEF LRPGLQHYTCMVDLLGRLGQVEEAEKLIMEME+EPDEALWGAMLSACRIHG+ DVADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQQ
        QKRFIKQQ
Subjt:  QKRFIKQQ

A0A6J1EHW3 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like; e value: 4.5e-156; % identity: 89.58
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKPQ+IIPFYAALL+ACSSTKN  TLKQIHALTI L ISHH FIRTKLASTYAAC  LPQATTIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFR M
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGK IDRHT PPV+KSCTGLSSLRLGRQVHGA++INGFS+DLP+LNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMF EVFGLFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV
        M+EEGQKPDELTFT+LL+ACSHGGLIE+GKEYFGMM+M FDLRPGL+HYTCMVDLLGR+GQVEEAEKLIMEMEIEPDEALWGA+LSACRIHGK +VADRV
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRV

Query:  QKRFIKQ
        Q RF+KQ
Subjt:  QKRFIKQ

A0A6J1I7X0 pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like; e value: 2.6e-156; % identity: 89.32
Query:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM
        MPKPQ+IIPFYAALL+ACSSTKN HTLKQIHALTI L ISHH FIRTKLASTYAAC  LPQATTIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHM
Subjt:  MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHM

Query:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
        LLSGK IDRHT PPVLKSCTGLSSLRLGRQVHG ++INGFS+DLP+LNALITMYGKCGDLG+ARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER
Subjt:  LLSGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFER

Query:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLI--MEMEIEPDEALWGAMLSACRIHGKADVAD
        M+EEGQKPDELTFT+LL+ACSHGGLIE+GKEYFGMM+M F+L+PGL+HYTCMVDLLGR+GQVEEAEKLI  MEMEIEPDEALWGA+LSACRIHGKA+VAD
Subjt:  MMEEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLI--MEMEIEPDEALWGAMLSACRIHGKADVAD

Query:  RVQKRFIKQ
        RVQ RF+KQ
Subjt:  RVQKRFIKQ

SwissProt top hits (columns: e value, % identity, alignment)
P93005 Pentatricopeptide repeat-containing protein At2g33680; e value: 7.6e-52; % identity: 38.1
Query:  KPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLL
        KP E       +L ACS    L   KQ+H+  + L    H F  T L   YA    L  A   F     R   L+ +LI  +       ++L ++R M  
Subjt:  KPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLL

Query:  SGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMM
        +G   +  T   VLK+C+ L++L LG+QVHG  + +GF  ++P  +AL TMY KCG L     VF   P ++ VSW+A+++G   +G   E   LFE M+
Subjt:  SGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMM

Query:  EEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADV
         EG +PD++TF +++SACSH G +E+G  YF MM  +  L P + HY CMVDLL R GQ++EA++ I    I+    LW  +LSAC+ HGK ++
Subjt:  EEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADV

Q9LND4 Pentatricopeptide repeat-containing protein At1g06140, mitochondrial; e value: 1.5e-52; % identity: 34.46
Query:  LLEACSSTKNLHTLKQIHALTIILQ-ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTF
        L++AC +       K +H ++I    I    +++  +   Y  C  L  A  +F  +  R   ++  LI   +      ++  +FR ML      ++ T 
Subjt:  LLEACSSTKNLHTLKQIHALTIILQ-ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTF

Query:  PPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDELT
          +L SC+ L SLR G+ VHG ++ NG   D  +  + I MY +CG++ +AR VFD MPERN +SWS+++  +G++G+F E    F +M  +   P+ +T
Subjt:  PPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDELT

Query:  FTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ
        F SLLSACSH G +++G + F  M  ++ + P  +HY CMVDLLGR G++ EA+  I  M ++P  + WGA+LSACRIH + D+A  + ++ +  +
Subjt:  FTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g01510; e value: 1.3e-51; % identity: 35.23
Query:  YAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRH
        +A +L   ++  +L   +Q+H   ++        +   L   YA C    +A  IF    +R T  + ALI  +    L    L +F  M  S    D+ 
Subjt:  YAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRH

Query:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE
        TF  VLK+    +SL LG+Q+H  ++ +G   ++ S + L+ MY KCG +  A +VF+ MP+RN VSW+AL++ +  +G      G F +M+E G +PD 
Subjt:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE

Query:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ
        ++   +L+ACSH G +E+G EYF  M   + + P  +HY CM+DLLGR G+  EAEKL+ EM  EPDE +W ++L+ACRIH    +A+R  ++    +
Subjt:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic; e value: 5.8e-52; % identity: 36.18
Query:  ALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRH
        A+L  C+  + L   K++H       +    F+   L   YA C  + +A  +FS    +    +N +I  +S     +++LS+F ++LL  K  S D  
Subjt:  ALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRH

Query:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE
        T   VL +C  LS+   GR++HG ++ NG+ SD    N+L+ MY KCG L +A  +FD +  ++ VSW+ ++AGYG+HG   E   LF +M + G + DE
Subjt:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE

Query:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR
        ++F SLL ACSH GL+++G  +F +MR E  + P ++HY C+VD+L R G + +A + I  M I PD  +WGA+L  CRIH    +A++V ++
Subjt:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic; e value: 8.7e-56; % identity: 37.79
Query:  YAALLEACSSTK----NLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK-
        Y  +L+AC +++    +L   K+IHA       S H +I T L   YA    +  A+ +F     R    ++A+I  ++      ++L  FR M+   K 
Subjt:  YAALLEACSSTK----NLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK-

Query:  -SIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEE
         S +  T   VL++C  L++L  G+ +HG +L  G  S LP ++AL+TMYG+CG L V ++VFD M +R+ VSW++L++ YGVHG   +   +FE M+  
Subjt:  -SIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEE

Query:  GQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR
        G  P  +TF S+L ACSH GL+E+GK  F  M  +  ++P ++HY CMVDLLGR  +++EA K++ +M  EP   +WG++L +CRIHG  ++A+R  +R
Subjt:  GQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G06140.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 1.1e-53; % identity: 34.46
Query:  LLEACSSTKNLHTLKQIHALTIILQ-ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTF
        L++AC +       K +H ++I    I    +++  +   Y  C  L  A  +F  +  R   ++  LI   +      ++  +FR ML      ++ T 
Subjt:  LLEACSSTKNLHTLKQIHALTIILQ-ISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTF

Query:  PPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDELT
          +L SC+ L SLR G+ VHG ++ NG   D  +  + I MY +CG++ +AR VFD MPERN +SWS+++  +G++G+F E    F +M  +   P+ +T
Subjt:  PPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDELT

Query:  FTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ
        F SLLSACSH G +++G + F  M  ++ + P  +HY CMVDLLGR G++ EA+  I  M ++P  + WGA+LSACRIH + D+A  + ++ +  +
Subjt:  FTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ

AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 5.4e-53; % identity: 38.1
Query:  KPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLL
        KP E       +L ACS    L   KQ+H+  + L    H F  T L   YA    L  A   F     R   L+ +LI  +       ++L ++R M  
Subjt:  KPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLL

Query:  SGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMM
        +G   +  T   VLK+C+ L++L LG+QVHG  + +GF  ++P  +AL TMY KCG L     VF   P ++ VSW+A+++G   +G   E   LFE M+
Subjt:  SGKSIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMM

Query:  EEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADV
         EG +PD++TF +++SACSH G +E+G  YF MM  +  L P + HY CMVDLL R GQ++EA++ I    I+    LW  +LSAC+ HGK ++
Subjt:  EEGQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADV

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 9.2e-53; % identity: 35.23
Query:  YAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRH
        +A +L   ++  +L   +Q+H   ++        +   L   YA C    +A  IF    +R T  + ALI  +    L    L +F  M  S    D+ 
Subjt:  YAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRH

Query:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE
        TF  VLK+    +SL LG+Q+H  ++ +G   ++ S + L+ MY KCG +  A +VF+ MP+RN VSW+AL++ +  +G      G F +M+E G +PD 
Subjt:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE

Query:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ
        ++   +L+ACSH G +E+G EYF  M   + + P  +HY CM+DLLGR G+  EAEKL+ EM  EPDE +W ++L+ACRIH    +A+R  ++    +
Subjt:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 6.2e-57; % identity: 37.79
Query:  YAALLEACSSTK----NLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK-
        Y  +L+AC +++    +L   K+IHA       S H +I T L   YA    +  A+ +F     R    ++A+I  ++      ++L  FR M+   K 
Subjt:  YAALLEACSSTK----NLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK-

Query:  -SIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEE
         S +  T   VL++C  L++L  G+ +HG +L  G  S LP ++AL+TMYG+CG L V ++VFD M +R+ VSW++L++ YGVHG   +   +FE M+  
Subjt:  -SIDRHTFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEE

Query:  GQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR
        G  P  +TF S+L ACSH GL+E+GK  F  M  +  ++P ++HY CMVDLLGR  +++EA K++ +M  EP   +WG++L +CRIHG  ++A+R  +R
Subjt:  GQKPDELTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 4.1e-53; % identity: 36.18
Query:  ALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRH
        A+L  C+  + L   K++H       +    F+   L   YA C  + +A  +FS    +    +N +I  +S     +++LS+F ++LL  K  S D  
Subjt:  ALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRH

Query:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE
        T   VL +C  LS+   GR++HG ++ NG+ SD    N+L+ MY KCG L +A  +FD +  ++ VSW+ ++AGYG+HG   E   LF +M + G + DE
Subjt:  TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDE

Query:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR
        ++F SLL ACSH GL+++G  +F +MR E  + P ++HY C+VD+L R G + +A + I  M I PD  +WGA+L  CRIH    +A++V ++
Subjt:  LTFTSLLSACSHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKR
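
Each alignment above has a middle line between `Query:` and `Subjt:`. The sketch below shows one way a percent-identity figure could be derived from that middle line. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), which this record does not document, and the function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from an alignment midline.

    Letters on the midline are counted as identical positions;
    '+' (conservative substitution) and ' ' (mismatch or gap) are not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Final alignment block of the KAG6583697.1 hit, copied from the record.
query = "QKRFIKQ"
midline = "Q RFIKQ"
print(round(percent_identity(midline, len(query)), 2))
```

To reproduce a hit's overall % identity, the same count would be taken over all of its alignment blocks joined together rather than over a single block.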


Sequences
CDS sequence
ATGCCCAAGCCACAAGAAATAATCCCCTTCTACGCCGCTCTCTTGGAAGCGTGCTCCTCCACCAAGAACCTCCACACTCTCAAGCAAATTCACGCTCTTACCATAATACT
TCAAATCTCTCACCACCATTTCATTCGAACCAAGCTCGCTTCAACCTACGCCGCCTGCGCCCAACTCCCACAAGCCACCACCATCTTCTCCTTCGCCACTCGCCGCCCCA
CCTACCTCTTCAATGCTCTCATCAGAGCCCACTCTTCTCTCCGTCTCTTCTCCCAATCCCTCTCCATTTTCCGCCACATGCTTCTTTCTGGAAAATCCATCGATCGCCAT
ACTTTCCCGCCGGTGCTCAAGTCATGTACCGGCCTCTCCTCCTTGCGCCTTGGCCGCCAGGTTCATGGGGCTCTTCTGATTAATGGCTTCTCTTCAGATTTGCCTAGTTT
GAATGCTTTGATTACCATGTATGGCAAATGCGGTGATTTGGGTGTCGCGCGGAAGGTGTTTGATGGAATGCCTGAGAGGAATGAGGTGTCGTGGTCGGCTTTGATGGCGG
GTTATGGTGTTCATGGGATGTTTGGGGAGGTGTTTGGGTTGTTTGAGAGGATGATGGAAGAAGGGCAAAAGCCGGATGAGCTCACTTTTACATCTCTTCTCTCGGCGTGT
AGCCATGGAGGGTTGATTGAGAAAGGGAAGGAGTATTTTGGTATGATGAGAATGGAGTTTGATTTGAGGCCTGGGTTGCAACATTATACTTGCATGGTGGATTTGCTTGG
GAGATTGGGGCAAGTGGAAGAAGCAGAGAAGTTGATAATGGAAATGGAGATCGAGCCTGATGAGGCTTTGTGGGGTGCCATGTTGAGTGCTTGTAGGATTCATGGGAAGG
CCGATGTGGCTGATAGGGTGCAAAAACGGTTTATCAAACAACAATGA
mRNA sequence
ATGCCCAAGCCACAAGAAATAATCCCCTTCTACGCCGCTCTCTTGGAAGCGTGCTCCTCCACCAAGAACCTCCACACTCTCAAGCAAATTCACGCTCTTACCATAATACT
TCAAATCTCTCACCACCATTTCATTCGAACCAAGCTCGCTTCAACCTACGCCGCCTGCGCCCAACTCCCACAAGCCACCACCATCTTCTCCTTCGCCACTCGCCGCCCCA
CCTACCTCTTCAATGCTCTCATCAGAGCCCACTCTTCTCTCCGTCTCTTCTCCCAATCCCTCTCCATTTTCCGCCACATGCTTCTTTCTGGAAAATCCATCGATCGCCAT
ACTTTCCCGCCGGTGCTCAAGTCATGTACCGGCCTCTCCTCCTTGCGCCTTGGCCGCCAGGTTCATGGGGCTCTTCTGATTAATGGCTTCTCTTCAGATTTGCCTAGTTT
GAATGCTTTGATTACCATGTATGGCAAATGCGGTGATTTGGGTGTCGCGCGGAAGGTGTTTGATGGAATGCCTGAGAGGAATGAGGTGTCGTGGTCGGCTTTGATGGCGG
GTTATGGTGTTCATGGGATGTTTGGGGAGGTGTTTGGGTTGTTTGAGAGGATGATGGAAGAAGGGCAAAAGCCGGATGAGCTCACTTTTACATCTCTTCTCTCGGCGTGT
AGCCATGGAGGGTTGATTGAGAAAGGGAAGGAGTATTTTGGTATGATGAGAATGGAGTTTGATTTGAGGCCTGGGTTGCAACATTATACTTGCATGGTGGATTTGCTTGG
GAGATTGGGGCAAGTGGAAGAAGCAGAGAAGTTGATAATGGAAATGGAGATCGAGCCTGATGAGGCTTTGTGGGGTGCCATGTTGAGTGCTTGTAGGATTCATGGGAAGG
CCGATGTGGCTGATAGGGTGCAAAAACGGTTTATCAAACAACAATGAGTCTATAAACTATGGTGTAAGTTATGTTTGATGGTTGGTTTCATAAGTCTCCATTGATTCAAT
TTTAAAACAAATTGGTCTCCAACGTCGATTTGTCCAAAGAAAACACACGAATTTGACTAGAATGTAGAATT
Protein sequence
MPKPQEIIPFYAALLEACSSTKNLHTLKQIHALTIILQISHHHFIRTKLASTYAACAQLPQATTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRH
TFPPVLKSCTGLSSLRLGRQVHGALLINGFSSDLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSALMAGYGVHGMFGEVFGLFERMMEEGQKPDELTFTSLLSAC
SHGGLIEKGKEYFGMMRMEFDLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKADVADRVQKRFIKQQ
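
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below is an illustration only: it assumes the standard nuclear genetic code, uses a hypothetical helper named `translate`, and for brevity translates only the first 36 and last 12 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 36 nt and last 12 nt of the CDS sequence listed in this record.
cds_start = "ATGCCCAAGCCACAAGAAATAATCCCCTTCTACGCC"
cds_end = "AAACAACAATGA"
print(translate(cds_start))  # MPKPQEIIPFYA
print(translate(cds_end))    # KQQ
```

Both fragments agree with the start (`MPKPQEIIPFYA`) and end (`KQQ`) of the protein sequence listed above; passing the full CDS to `translate` would extend the same check to the whole protein.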