; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021798 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021798
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold1:415446..419872
RNA-Seq ExpressionMS021798
SyntenyMS021798
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132690.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Momordica charantia]0.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

XP_022132697.1 pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Momordica charantia]0.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

XP_022132705.1 pentatricopeptide repeat-containing protein At1g76280 isoform X3 [Momordica charantia]0.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

XP_022937086.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata]1.6e-26281.5Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVAL +GAAG KLP LELDIP+P  TEFY +NF+FE+N  S+DE+Y KK+V C+ DI QFSVNGMKCG+     T  +N RS+FVMKVLRWSFNDVI AC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        A TR+CGLAE L  Q             M +LGLQPS HTFDGFVRSVVSERGFSDG+KILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        AC YPHPFNAFL ACD MDQPERAMRML KMKQ++VLP+V TYE LYSLFGNVNAPYEEGNRLSQ DA KRIRMIEMDM KHGIQHS+ SM NLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYL+VAENLFYYNNT+LGTP+YNT LHFLVESKEIHMAIELFNNMKHSG FPDAATFEMM+DCCSV+ CLKSAFALLS+M+R+GFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVL  ERFDDALNLLDQASSEGI+LDVVIMNTI+ KACEKGR+DVIEFV+E+M R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRML K
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD
        E+D SP +TEYVE+FVLAED  A+ RILEFFKCSEESLSFAL NLRWSAMLGYSLCSSPNQSPWAMRLA+SYD
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD

XP_022937087.1 pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cucurbita moschata]1.6e-26281.5Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVAL +GAAG KLP LELDIP+P  TEFY +NF+FE+N  S+DE+Y KK+V C+ DI QFSVNGMKCG+     T  +N RS+FVMKVLRWSFNDVI AC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        A TR+CGLAE L  Q             M +LGLQPS HTFDGFVRSVVSERGFSDG+KILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        AC YPHPFNAFL ACD MDQPERAMRML KMKQ++VLP+V TYE LYSLFGNVNAPYEEGNRLSQ DA KRIRMIEMDM KHGIQHS+ SM NLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYL+VAENLFYYNNT+LGTP+YNT LHFLVESKEIHMAIELFNNMKHSG FPDAATFEMM+DCCSV+ CLKSAFALLS+M+R+GFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVL  ERFDDALNLLDQASSEGI+LDVVIMNTI+ KACEKGR+DVIEFV+E+M R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRML K
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD
        E+D SP +TEYVE+FVLAED  A+ RILEFFKCSEESLSFAL NLRWSAMLGYSLCSSPNQSPWAMRLA+SYD
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD

TrEMBL top hitse value%identityAlignment
A0A6J1BT64 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

A0A6J1BT79 pentatricopeptide repeat-containing protein At1g76280 isoform X30.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

A0A6J1BUK2 pentatricopeptide repeat-containing protein At1g76280 isoform X20.0e+0096.37Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVALAVGAAGGKLP LELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        AFTRDCGLAE L  Q             MLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVLRSE FDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
        EEDASPDLTEYVENFVLAEDPGAD RILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYDANRSS

A0A6J1F9C6 pentatricopeptide repeat-containing protein At1g76280 isoform X17.9e-26381.5Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVAL +GAAG KLP LELDIP+P  TEFY +NF+FE+N  S+DE+Y KK+V C+ DI QFSVNGMKCG+     T  +N RS+FVMKVLRWSFNDVI AC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        A TR+CGLAE L  Q             M +LGLQPS HTFDGFVRSVVSERGFSDG+KILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        AC YPHPFNAFL ACD MDQPERAMRML KMKQ++VLP+V TYE LYSLFGNVNAPYEEGNRLSQ DA KRIRMIEMDM KHGIQHS+ SM NLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYL+VAENLFYYNNT+LGTP+YNT LHFLVESKEIHMAIELFNNMKHSG FPDAATFEMM+DCCSV+ CLKSAFALLS+M+R+GFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVL  ERFDDALNLLDQASSEGI+LDVVIMNTI+ KACEKGR+DVIEFV+E+M R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRML K
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD
        E+D SP +TEYVE+FVLAED  A+ RILEFFKCSEESLSFAL NLRWSAMLGYSLCSSPNQSPWAMRLA+SYD
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD

A0A6J1FA55 pentatricopeptide repeat-containing protein At1g76280 isoform X27.9e-26381.5Show/hide
Query:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC
        MVAL +GAAG KLP LELDIP+P  TEFY +NF+FE+N  S+DE+Y KK+V C+ DI QFSVNGMKCG+     T  +N RS+FVMKVLRWSFNDVI AC
Subjt:  MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHAC

Query:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
        A TR+CGLAE L  Q             M +LGLQPS HTFDGFVRSVVSERGFSDG+KILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS
Subjt:  AFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQIS

Query:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA
        AC YPHPFNAFL ACD MDQPERAMRML KMKQ++VLP+V TYE LYSLFGNVNAPYEEGNRLSQ DA KRIRMIEMDM KHGIQHS+ SM NLLKALGA
Subjt:  ACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGA

Query:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY
        EGMTKELLQYL+VAENLFYYNNT+LGTP+YNT LHFLVESKEIHMAIELFNNMKHSG FPDAATFEMM+DCCSV+ CLKSAFALLS+M+R+GFCPQILTY
Subjt:  EGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTY

Query:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK
        TSL  IVL  ERFDDALNLLDQASSEGI+LDVVIMNTI+ KACEKGR+DVIEFV+E+M R+KIQPDPSTCHSVFSAYV+LGYHSTAMEALQVLSMRML K
Subjt:  TSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSK

Query:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD
        E+D SP +TEYVE+FVLAED  A+ RILEFFKCSEESLSFAL NLRWSAMLGYSLCSSPNQSPWAMRLA+SYD
Subjt:  EEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSYD

SwissProt top hitse value%identityAlignment
Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028603.5e-1026.6Show/hide
Query:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ
        YNT++           A ++F  MK +GF  D  T+  ++D        K A  +L+ MV  GF P I+TY SLI    R    D+A+ L +Q + +G +
Subjt:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ

Query:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE
         DV    T+L      G+V+    + E M     +P+  T ++    Y N G  +  M+    +++  L      SPD+  +     +    G D  +  
Subjt:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE

Query:  FFK
         FK
Subjt:  FFK

Q9SGQ6 Pentatricopeptide repeat-containing protein At1g762801.0e-13452.56Show/hide
Query:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK
        ++VLRWSFNDVIHAC  +++  LAE L  Q                                             LK+MQQ+ LKPYDSTLA V+  CSK
Subjt:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK

Query:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ
        AL++DLAE LL+QIS C Y +PFN  L A D++DQPERA+R+L +MK+LK+ P++ TYE L+SLFGNVNAPYEEGN LSQ D  KRI  IEMDM ++G Q
Subjt:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ

Query:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL
        HS +S  N+L+ALGAEGM  E++++L  AENL  ++N YLGTP YN VLH L+E+ E  M I +F  MK  G   D AT+ +M+DCCS++   KSA AL+
Subjt:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL

Query:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST
        SMM+R GF P+ +T+T+L  I+L    F++ALNLLDQA+ E I LDV+  NTIL KA EKG +DVIE+++E+M+REK+ PDP+TCH VFS YV  GYH+T
Subjt:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST

Query:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY
        A+EAL VLS+RML++E  E       E  ENFV++EDP A+ +I+E F+ SEE L+ AL NLRW AMLG  +  S +QSPWA  L+N Y
Subjt:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY

Q9SHK2 Pentatricopeptide repeat-containing protein At1g065806.5e-1228.23Show/hide
Query:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ
        ++ +L  + +  +    I LF +++  G   D  +F  ++DC      L  A + L  M++ GF P I+T+ SL+       RF +A++L+DQ    G +
Subjt:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ

Query:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE
         +VVI NTI+   CEKG+V+    V++ M +  I+PD  T +S+ +   + G    +   L  + MRM       SPD+  +     L +  G + ++LE
Subjt:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE

Query:  FFKCSEESL
          K   E +
Subjt:  FFKCSEESL

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic4.2e-1121.39Show/hide
Query:  IMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACPY---PHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEE
        + ++R+        +A+  +  +  ++ +A+ + E   A  Y    + F+A + A       E A+ +   MK+  + PN+ TY  +    G     +  
Subjt:  IMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACPY---PHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEE

Query:  GNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGAEGMTKELLQYLSVAENLF-YYNNTYLGTPV--YNTVLHFLVESKEIHMAIELFNNMKHS
                  K++     +M ++G+Q   ++  +LL      G+ +        A NLF    N  +   V  YNT+L  + +  ++ +A E+   M   
Subjt:  GNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGAEGMTKELLQYLSVAENLF-YYNNTYLGTPV--YNTVLHFLVESKEIHMAIELFNNMKHS

Query:  GFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLIVLRSE--RFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIE
           P+  ++  ++D  +       A  L   M   G     ++Y +L+ + ++  R ++AL++L + +S GI+ DVV  N +L    ++G+ D ++ V  
Subjt:  GFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLIVLRSE--RFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIE

Query:  RMNREKIQPDPSTCHSVFSAYVNLGYHSTAME
         M RE + P+  T  ++   Y   G +  AME
Subjt:  RMNREKIQPDPSTCHSVFSAYVNLGYHSTAME

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic1.9e-1121.47Show/hide
Query:  ELYRKKLVTCDDD--IGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTF
        +L +K+++  D +  +  F    +K G +  P   +      FV+    +S+N +IH    +R C  A               +Y +M+  G +PS  T+
Subjt:  ELYRKKLVTCDDD--IGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTF

Query:  DGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQI--SAC-PYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLP
           +  +   R     M +LK M+   LKP   T         +A +++ A  +L+++    C P    +   + A  T  + + A  +  KMK  +  P
Subjt:  DGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQI--SAC-PYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLP

Query:  NVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLV
        +  TY  L   F +        NR    D+   ++    +M K G     ++ T L+ AL   G   E    L V  +     N +     YNT++  L+
Subjt:  NVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLV

Query:  ESKEIHMAIELFNNMKHSGFFPDAATFEMMVD-----------------------CCSVMECLKSAFALLSM------------MVRTGFCPQILTYTSL
            +  A+ELF NM+  G  P A T+ + +D                         +++ C  S ++L               +   G  P  +TY  +
Subjt:  ESKEIHMAIELFNNMKHSGFFPDAATFEMMVD-----------------------CCSVMECLKSAFALLSM------------MVRTGFCPQILTYTSL

Query:  IVLRSE--RFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQV
        +   S+    D+A+ LL +    G + DV+++N+++    +  RVD    +  RM   K++P   T +++ +    LG +    EA+++
Subjt:  IVLRSE--RFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQV

Arabidopsis top hitse value%identityAlignment
AT1G06580.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-1328.23Show/hide
Query:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ
        ++ +L  + +  +    I LF +++  G   D  +F  ++DC      L  A + L  M++ GF P I+T+ SL+       RF +A++L+DQ    G +
Subjt:  YNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQ

Query:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE
         +VVI NTI+   CEKG+V+    V++ M +  I+PD  T +S+ +   + G    +   L  + MRM       SPD+  +     L +  G + ++LE
Subjt:  LDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILE

Query:  FFKCSEESL
          K   E +
Subjt:  FFKCSEESL

AT1G76280.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.3e-13652.56Show/hide
Query:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK
        ++VLRWSFNDVIHAC  +++  LAE L  Q                                             LK+MQQ+ LKPYDSTLA V+  CSK
Subjt:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK

Query:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ
        AL++DLAE LL+QIS C Y +PFN  L A D++DQPERA+R+L +MK+LK+ P++ TYE L+SLFGNVNAPYEEGN LSQ D  KRI  IEMDM ++G Q
Subjt:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ

Query:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL
        HS +S  N+L+ALGAEGM  E++++L  AENL  ++N YLGTP YN VLH L+E+ E  M I +F  MK  G   D AT+ +M+DCCS++   KSA AL+
Subjt:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL

Query:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST
        SMM+R GF P+ +T+T+L  I+L    F++ALNLLDQA+ E I LDV+  NTIL KA EKG +DVIE+++E+M+REK+ PDP+TCH VFS YV  GYH+T
Subjt:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST

Query:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY
        A+EAL VLS+RML++E  E       E  ENFV++EDP A+ +I+E F+ SEE L+ AL NLRW AMLG  +  S +QSPWA  L+N Y
Subjt:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY

AT1G76280.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.9e-10855.04Show/hide
Query:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK
        ++VLRWSFNDVIHAC  +++  LAE L              L+M +LGL PS HT+DGF+R+V    G+  GM +LK+MQQ+ LKPYDSTLA V+  CSK
Subjt:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK

Query:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ
        AL++DLAE LL+QIS C Y +PFN  L A D++DQPERA+R+L +MK+LK+ P++ TYE L+SLFGNVNAPYEEGN LSQ D  KRI  IEMDM ++G Q
Subjt:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ

Query:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL
        HS +S  N+L+ALGAEGM  E++++L  AENL  ++N YLGTP YN VLH L+E+ E  M I +F  MK  G   D AT+ +M+DCCS++   KSA AL+
Subjt:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL

Query:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIE
        SMM+R GF P+ +T+T+L  I+L    F++ALNLLDQA+ E I LDV+  NTIL KA EK ++ V++
Subjt:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIE

AT1G76280.3 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-15055.83Show/hide
Query:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK
        ++VLRWSFNDVIHAC  +++  LAE L              L+M +LGL PS HT+DGF+R+V    G+  GM +LK+MQQ+ LKPYDSTLA V+  CSK
Subjt:  MKVLRWSFNDVIHACAFTRDCGLAEHLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSK

Query:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ
        AL++DLAE LL+QIS C Y +PFN  L A D++DQPERA+R+L +MK+LK+ P++ TYE L+SLFGNVNAPYEEGN LSQ D  KRI  IEMDM ++G Q
Subjt:  ALELDLAEALLEQISACPYPHPFNAFLKACDTMDQPERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQ

Query:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL
        HS +S  N+L+ALGAEGM  E++++L  AENL  ++N YLGTP YN VLH L+E+ E  M I +F  MK  G   D AT+ +M+DCCS++   KSA AL+
Subjt:  HSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVYNTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALL

Query:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST
        SMM+R GF P+ +T+T+L  I+L    F++ALNLLDQA+ E I LDV+  NTIL KA EKG +DVIE+++E+M+REK+ PDP+TCH VFS YV  GYH+T
Subjt:  SMMVRTGFCPQILTYTSL--IVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHST

Query:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY
        A+EAL VLS+RML++E  E       E  ENFV++EDP A+ +I+E F+ SEE L+ AL NLRW AMLG  +  S +QSPWA  L+N Y
Subjt:  AMEALQVLSMRMLSKE--EDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLGYSLCSSPNQSPWAMRLANSY

AT5G41170.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.9e-1226.58Show/hide
Query:  NTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQL
        N +++   +S + ++A      M   GF PD  TF  +++   +   ++ A ++++ MV  G  P ++ YT++I  + ++   + AL+L DQ  + GI+ 
Subjt:  NTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLI--VLRSERFDDALNLLDQASSEGIQL

Query:  DVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME
        DVV+  +++   C  GR    + ++  M + KI+PD  T +++  A+V  G    A E
Subjt:  DVVIMNTILLKACEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCTTTGGCTGTTGGAGCCGCAGGAGGAAAGTTACCCTGTTTAGAATTGGACATTCCTATACCTTCAAGCACTGAATTCTATCGTAACAATTTTAGTTTCGAGGA
TAATGAACATTCTTCTGATGAGTTATATCGTAAGAAATTGGTCACCTGCGATGATGACATAGGGCAGTTTTCTGTTAATGGAATGAAATGTGGAGATGAAAGTGGTCCAT
TAACTTTCCAGAACAATTGCAGAAGCAGTTTTGTTATGAAGGTTTTGAGATGGTCTTTCAATGATGTGATACACGCATGTGCGTTTACTAGGGACTGCGGTCTTGCAGAG
CATTTAGCATTTCAATTTTCACGTGTACTGACAATAGGGGCCATGTATTTGAAGATGCTTGATCTCGGATTGCAACCTTCATGCCACACATTTGATGGTTTTGTTAGATC
AGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATGAAAATTTTAAAAATAATGCAACAGAGGAAATTGAAGCCGTATGATTCAACTCTCGCTGCTGTTTCAATAAGTTGTA
GCAAGGCGCTAGAACTTGATTTGGCTGAGGCTCTACTTGAACAAATTTCTGCTTGCCCTTACCCACACCCCTTCAATGCATTTCTTAAAGCATGTGATACGATGGATCAG
CCTGAACGTGCCATGCGTATGTTGGTTAAAATGAAACAGTTGAAGGTGCTTCCAAATGTCAACACATATGAGCATTTGTATTCTTTATTTGGTAACGTGAATGCTCCATA
TGAGGAGGGCAACAGATTGTCACAGGCGGATGCTGGCAAAAGGATACGCATGATAGAGATGGATATGGCAAAACATGGGATTCAACACAGTAATTTATCTATGACGAACT
TGTTGAAAGCTCTAGGCGCAGAAGGGATGACGAAGGAGCTGCTTCAGTATTTAAGTGTTGCAGAGAACCTCTTCTATTATAATAACACTTATCTGGGAACGCCTGTTTAC
AACACAGTGTTGCATTTTTTAGTTGAATCCAAGGAAATTCACATGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGATGCTGCGACATTTGAGAT
GATGGTTGACTGTTGTAGTGTCATGGAATGCTTGAAATCAGCTTTTGCCCTTCTTTCCATGATGGTCCGCACAGGGTTTTGTCCACAGATATTAACTTATACGAGTCTAA
TTGTATTGAGATCTGAGAGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCAGAAGGGATTCAACTTGATGTAGTTATAATGAATACCATCTTGCTGAAAGCT
TGTGAAAAGGGAAGGGTTGATGTGATTGAGTTCGTCATTGAGAGGATGAATCGCGAAAAGATCCAACCTGACCCGTCAACGTGCCATAGCGTCTTCTCTGCATATGTGAA
CCTCGGCTATCACAGCACCGCTATGGAAGCACTTCAAGTACTGAGTATGCGTATGCTATCCAAAGAAGAAGATGCCTCTCCAGACTTGACAGAATATGTCGAAAACTTTG
TCCTTGCCGAAGACCCCGGAGCCGATTGGCGAATTTTGGAATTCTTCAAATGCTCTGAAGAGAGCCTGAGCTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGA
TATTCGCTATGTTCTTCCCCTAATCAGAGCCCGTGGGCAATGAGACTTGCAAATTCCTATGATGCCAACAGAAGTTCA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCTTTGGCTGTTGGAGCCGCAGGAGGAAAGTTACCCTGTTTAGAATTGGACATTCCTATACCTTCAAGCACTGAATTCTATCGTAACAATTTTAGTTTCGAGGA
TAATGAACATTCTTCTGATGAGTTATATCGTAAGAAATTGGTCACCTGCGATGATGACATAGGGCAGTTTTCTGTTAATGGAATGAAATGTGGAGATGAAAGTGGTCCAT
TAACTTTCCAGAACAATTGCAGAAGCAGTTTTGTTATGAAGGTTTTGAGATGGTCTTTCAATGATGTGATACACGCATGTGCGTTTACTAGGGACTGCGGTCTTGCAGAG
CATTTAGCATTTCAATTTTCACGTGTACTGACAATAGGGGCCATGTATTTGAAGATGCTTGATCTCGGATTGCAACCTTCATGCCACACATTTGATGGTTTTGTTAGATC
AGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATGAAAATTTTAAAAATAATGCAACAGAGGAAATTGAAGCCGTATGATTCAACTCTCGCTGCTGTTTCAATAAGTTGTA
GCAAGGCGCTAGAACTTGATTTGGCTGAGGCTCTACTTGAACAAATTTCTGCTTGCCCTTACCCACACCCCTTCAATGCATTTCTTAAAGCATGTGATACGATGGATCAG
CCTGAACGTGCCATGCGTATGTTGGTTAAAATGAAACAGTTGAAGGTGCTTCCAAATGTCAACACATATGAGCATTTGTATTCTTTATTTGGTAACGTGAATGCTCCATA
TGAGGAGGGCAACAGATTGTCACAGGCGGATGCTGGCAAAAGGATACGCATGATAGAGATGGATATGGCAAAACATGGGATTCAACACAGTAATTTATCTATGACGAACT
TGTTGAAAGCTCTAGGCGCAGAAGGGATGACGAAGGAGCTGCTTCAGTATTTAAGTGTTGCAGAGAACCTCTTCTATTATAATAACACTTATCTGGGAACGCCTGTTTAC
AACACAGTGTTGCATTTTTTAGTTGAATCCAAGGAAATTCACATGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGATGCTGCGACATTTGAGAT
GATGGTTGACTGTTGTAGTGTCATGGAATGCTTGAAATCAGCTTTTGCCCTTCTTTCCATGATGGTCCGCACAGGGTTTTGTCCACAGATATTAACTTATACGAGTCTAA
TTGTATTGAGATCTGAGAGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCAGAAGGGATTCAACTTGATGTAGTTATAATGAATACCATCTTGCTGAAAGCT
TGTGAAAAGGGAAGGGTTGATGTGATTGAGTTCGTCATTGAGAGGATGAATCGCGAAAAGATCCAACCTGACCCGTCAACGTGCCATAGCGTCTTCTCTGCATATGTGAA
CCTCGGCTATCACAGCACCGCTATGGAAGCACTTCAAGTACTGAGTATGCGTATGCTATCCAAAGAAGAAGATGCCTCTCCAGACTTGACAGAATATGTCGAAAACTTTG
TCCTTGCCGAAGACCCCGGAGCCGATTGGCGAATTTTGGAATTCTTCAAATGCTCTGAAGAGAGCCTGAGCTTTGCCCTCTTCAACTTGAGATGGTCTGCCATGCTGGGA
TATTCGCTATGTTCTTCCCCTAATCAGAGCCCGTGGGCAATGAGACTTGCAAATTCCTATGATGCCAACAGAAGTTCA
Protein sequenceShow/hide protein sequence
MVALAVGAAGGKLPCLELDIPIPSSTEFYRNNFSFEDNEHSSDELYRKKLVTCDDDIGQFSVNGMKCGDESGPLTFQNNCRSSFVMKVLRWSFNDVIHACAFTRDCGLAE
HLAFQFSRVLTIGAMYLKMLDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKIMQQRKLKPYDSTLAAVSISCSKALELDLAEALLEQISACPYPHPFNAFLKACDTMDQ
PERAMRMLVKMKQLKVLPNVNTYEHLYSLFGNVNAPYEEGNRLSQADAGKRIRMIEMDMAKHGIQHSNLSMTNLLKALGAEGMTKELLQYLSVAENLFYYNNTYLGTPVY
NTVLHFLVESKEIHMAIELFNNMKHSGFFPDAATFEMMVDCCSVMECLKSAFALLSMMVRTGFCPQILTYTSLIVLRSERFDDALNLLDQASSEGIQLDVVIMNTILLKA
CEKGRVDVIEFVIERMNREKIQPDPSTCHSVFSAYVNLGYHSTAMEALQVLSMRMLSKEEDASPDLTEYVENFVLAEDPGADWRILEFFKCSEESLSFALFNLRWSAMLG
YSLCSSPNQSPWAMRLANSYDANRSS