; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014204 (gene) of Snake gourd v1 genome

Gene IDTan0014204
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG07:14744188..14746610
RNA-Seq ExpressionTan0014204
SyntenyTan0014204
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011122.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]9.9e-25386.33Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+A+ QA+QAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMT RNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKG+E HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS+ NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VEPTVEHYACVVDLLSRFGFLQEAF LI NMK TAAAS+WGALLSGC+MHKN+EIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        N+LFKLEP NPSNFIALI IYES G+  GVSLTR KMR+ GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNCETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

XP_022932000.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X1 [Cucurbita moschata]5.8e-25386.53Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+ +AQAKQAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMT RNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKG+E HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS+ NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VEPTVEHYACVVDLLSRFGFLQEAF LI NMK TAAAS+WGALLSGC+MHKN+EIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        N+LFKLEP NPSNFIALI IYES G+  GVSLTR KMR+ GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNCETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

XP_022932001.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X2 [Cucurbita moschata]2.4e-25186.45Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+ +AQAKQAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMT RNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKG+E HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS+ NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VEPTVEHYACVVDLLSRFGFLQEAF LI NMK TAAAS+WGALLSGC+MHKN+EIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC
        N+LFKLEP NPSNFIALI IYES G+  GVSLTR KMR+ GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNC
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC

XP_022971622.1 pentatricopeptide repeat-containing protein At5g04780, mitochondrial-like isoform X1 [Cucurbita maxima]9.0e-25487.14Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFSFILRNCA L+A+AQAKQAHAQI+IHGLIPH+TL TD++LVY KCGL+HDARKVFDKMTHRNMHSWNILIASYVHSSLH DAI+V NEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKGKE HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E VF NMS++NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYI+EGWRIF SIVSDY VE TVEHYACVVDLLSRFGFLQEAF LIR+MK TAAASIWGALLSGC+MHKNLEIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        NQLFKLEP NPSNFIALI IYES G+L  VSLTR KMRD GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNCETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

XP_023512346.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo]2.6e-25387.68Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+A+ QAKQAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMTHRNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTIRLGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV MD MTIPSVLNACGGEGDLRKGKE HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS++NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        K CGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VE TVEHYACVVDLLSRFGFLQEAF LIRNMK TAAASIWGALLSGC+MHKNLEIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC
        NQLFKLEPMNPSNFIALI IYES G+  G SLTR KMRD GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNC
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC

TrEMBL top hitse value%identityAlignment
A0A1S3CDI3 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like2.0e-25186.12Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        +SHSTSRFSF+LR+CAH  AIAQAKQ HAQILIHG +PH+TL TDL+LVY KCG LHDAR VFDKMTHRNMHSWNILIASYVH+SL+ DA++VFNEFRD 
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFKASVG GDVYLGKRLHCWTI+LGF GYVVVDSTVLD YAK G VGDA+KVFD MI KDT+SWNSMISGYGRAG+Y DALDCFK MLL
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EG  MDFMTIPSVLNACGGEGDLRKGKE HCLVLKS V AADVA+GNSLIDMYSKCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIF+SIVSD  VEPTVEHYACVVDLLSRFGFL+EAF LIRNMK  AAASIWGALLSGC++H+NLE GEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        NQLFKLEP NPSNFIALISIYES G+L GVS+TR KMR  GLTK+PGCS ITIDGVVHKFYGG NSHPLAL+IFETLN++RQA V+CETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

A0A5D3BTE8 Pentatricopeptide repeat-containing protein2.0e-25186.12Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        +SHSTSRFSF+LR+CAH  AIAQAKQ HAQILIHG +PH+TL TDL+LVY KCG LHDAR VFDKMTHRNMHSWNILIASYVH+SL+ DA++VFNEFRD 
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFKASVG GDVYLGKRLHCWTI+LGF GYVVVDSTVLD YAK G VGDA+KVFD MI KDT+SWNSMISGYGRAG+Y DALDCFK MLL
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EG  MDFMTIPSVLNACGGEGDLRKGKE HCLVLKS V AADVA+GNSLIDMYSKCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIF+SIVSD  VEPTVEHYACVVDLLSRFGFL+EAF LIRNMK  AAASIWGALLSGC++H+NLE GEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        NQLFKLEP NPSNFIALISIYES G+L GVS+TR KMR  GLTK+PGCS ITIDGVVHKFYGG NSHPLAL+IFETLN++RQA V+CETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

A0A6J1EV55 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X21.2e-25186.45Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+ +AQAKQAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMT RNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKG+E HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS+ NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VEPTVEHYACVVDLLSRFGFLQEAF LI NMK TAAAS+WGALLSGC+MHKN+EIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC
        N+LFKLEP NPSNFIALI IYES G+  GVSLTR KMR+ GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNC
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNC

A0A6J1F0Z8 pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X12.8e-25386.53Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFS ILRNCA L+ +AQAKQAHAQILIHGLIPH+TL TDL+LVY KCG+LHDARKVFDKMT RNMHSWNILIASYVHSSLH DAI+VFNEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKG+E HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E+VF NMS+ NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKS VSDY VEPTVEHYACVVDLLSRFGFLQEAF LI NMK TAAAS+WGALLSGC+MHKN+EIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        N+LFKLEP NPSNFIALI IYES G+  GVSLTR KMR+ GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNCETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

A0A6J1I2G9 pentatricopeptide repeat-containing protein At5g04780, mitochondrial-like isoform X14.4e-25487.14Show/hide
Query:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL
        MS STSRFSFILRNCA L+A+AQAKQAHAQI+IHGLIPH+TL TD++LVY KCGL+HDARKVFDKMTHRNMHSWNILIASYVHSSLH DAI+V NEFR L
Subjt:  MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDL

Query:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL
        GFLPDHYTLPQMFK SVGIGDVY+GKRLHCWTI+LGFEGYVVV STVLD YAK G VGDAKKVFD+MILKDTISWNSMISGYGRAGLYGDALDCFK ML 
Subjt:  GFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLL

Query:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM
        EGV+MD MTIPSVLNACGGEGDLRKGKE HCLVLKS +F ADVAIGNSLIDMY+KCGSL D+E VF NMS++NIVTWTTMISCYGAHGKGEKSL LFNKM
Subjt:  EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKM

Query:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA
        KDCGIQPNSVTLTAILASCSHAGYI+EGWRIF SIVSDY VE TVEHYACVVDLLSRFGFLQEAF LIR+MK TAAASIWGALLSGC+MHKNLEIGEIAA
Subjt:  KDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAA

Query:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT
        NQLFKLEP NPSNFIALI IYES G+L  VSLTR KMRD GLTKLPGCSWITI+GVVHKFYGGD SHPL L+IFETL+A++QA+VNCETT
Subjt:  NQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT

SwissProt top hitse value%identityAlignment
Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial9.2e-9235.2Show/hide
Query:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHY
        ++ +L+ C     + Q +  HA IL       I +   L+ +Y+KCG L +ARKVF+KM  R+  +W  LI+ Y       DA+  FN+    G+ P+ +
Subjt:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHY

Query:  TLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDF
        TL  + KA+        G +LH + ++ GF+  V V S +LDLY + G + DA+ VFD++  ++ +SWN++I+G+ R      AL+ F+ ML +G     
Subjt:  TLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDF

Query:  MTIPSVLNACGGEGDLRKGKETHCLVLKS----LVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDC
         +  S+  AC   G L +GK  H  ++KS    + FA     GN+L+DMY+K GS+ D+ K+F  ++  ++V+W ++++ Y  HG G++++  F +M+  
Subjt:  MTIPSVLNACGGEGDLRKGKETHCLVLKS----LVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDC

Query:  GIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQL
        GI+PN ++  ++L +CSH+G ++EGW  ++ +  D G+ P   HY  VVDLL R G L  A   I  M     A+IW ALL+ C MHKN E+G  AA  +
Subjt:  GIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQL

Query:  FKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHP----LALKIFETLNAIRQ
        F+L+P +P   + L +IY S G     +  R KM++ G+ K P CSW+ I+  +H F   D  HP    +A K  E L  I++
Subjt:  FKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHP----LALKIFETLNAIRQ

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.0e-9034.26Show/hide
Query:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHR-------------------------------NMHSWNI
        F F+L++CA   A  + +Q H  +L  G    + + T L+ +Y + G L DA KVFDK  HR                               ++ SWN 
Subjt:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHR-------------------------------NMHSWNI

Query:  LIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWN
        +I+ Y  +  +++A+ +F +       PD  T+  +  A    G + LG+++H W    GF   + + + ++DLY+KCG +  A  +F+ +  KD ISWN
Subjt:  LIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWN

Query:  SMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADV-AIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIV
        ++I GY    LY +AL  F+ ML  G   + +T+ S+L AC   G +  G+  H  + K L    +  ++  SLIDMY+KCG +  + +VF ++ + ++ 
Subjt:  SMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADV-AIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIV

Query:  TWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTA
        +W  MI  +  HG+ + S  LF++M+  GIQP+ +T   +L++CSH+G ++ G  IF+++  DY + P +EHY C++DLL   G  +EA  +I  M+   
Subjt:  TWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTA

Query:  AASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFE
           IW +LL  C MH N+E+GE  A  L K+EP NP +++ L +IY S G    V+ TRA + D+G+ K+PGCS I ID VVH+F  GD  HP   +I+ 
Subjt:  AASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFE

Query:  TL
         L
Subjt:  TL

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial9.8e-9439.42Show/hide
Query:  SRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVF-------NEFR
        S F   ++ C+ L  I   KQ H Q  + G    I + + L+++YS CG L DARKVFD++  RN+ SW  +I  Y  +    DA+S+F       N+  
Subjt:  SRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVF-------NEFR

Query:  DLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKC--GAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFK
        D  FL D   L  +  A   +    L + +H + I+ GF+  V V +T+LD YAK   G V  A+K+FD ++ KD +S+NS++S Y ++G+  +A + F+
Subjt:  DLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKC--GAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFK

Query:  HMLL-EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLV
         ++  + V  + +T+ +VL A    G LR GK  H  V++ +    DV +G S+IDMY KCG +  + K F  M N N+ +WT MI+ YG HG   K+L 
Subjt:  HMLL-EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLV

Query:  LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEI
        LF  M D G++PN +T  ++LA+CSHAG   EGWR F ++   +GVEP +EHY C+VDLL R GFLQ+A+ LI+ MK    + IW +LL+ C +HKN+E+
Subjt:  LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEI

Query:  GEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETL
         EI+  +LF+L+  N   ++ L  IY   G    V   R  M+++GL K PG S + ++G VH F  GD  HP   KI+E L
Subjt:  GEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETL

Q9LZ19 Pentatricopeptide repeat-containing protein At5g04780, mitochondrial2.8e-9637.61Show/hide
Query:  ILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLP
        IL+ CA   A+ +AK  H +I+   L   +TL   L+  YSKCG +  AR+VFD M  R++ SWN +I  Y  + +  +A+ +F E R+ GF    +T+ 
Subjt:  ILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLP

Query:  QMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTI
         +  A     D    K+LHC +++   +  + V + +LDLYAKCG + DA +VF+SM  K +++W+SM++GY +   Y +AL  ++      +E +  T+
Subjt:  QMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTI

Query:  PSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSV
         SV+ AC     L +GK+ H ++ KS  F ++V + +S +DMY+KCGSL +S  +F  +   N+  W T+IS +  H + ++ ++LF KM+  G+ PN V
Subjt:  PSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSV

Query:  TLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMN
        T +++L+ C H G + EG R FK + + YG+ P V HY+C+VD+L R G L EA+ LI+++     ASIWG+LL+ C ++KNLE+ E+AA +LF+LEP N
Subjt:  TLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMN

Query:  PSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLN
          N + L +IY +      ++ +R  +RD  + K+ G SWI I   VH F  G++ HP   +I  TL+
Subjt:  PSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLN

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic5.0e-9038.27Show/hide
Query:  LVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIR--LGFEGYVVV
        L+ +YSKCG L  A+ VF +M+ R++ S+  +IA Y    L  +A+ +F E  + G  PD YT+  +         +  GKR+H W     LGF+  + V
Subjt:  LVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIR--LGFEGYVVV

Query:  DSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLE-GVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAAD
         + ++D+YAKCG++ +A+ VF  M +KD ISWN++I GY +     +AL  F  +L E     D  T+  VL AC       KG+E H  ++++  F +D
Subjt:  DSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLE-GVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAAD

Query:  VAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVE
          + NSL+DMY+KCG+LL +  +F ++++ ++V+WT MI+ YG HG G++++ LFN+M+  GI+ + ++  ++L +CSH+G ++EGWR F  +  +  +E
Subjt:  VAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVE

Query:  PTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGL
        PTVEHYAC+VD+L+R G L +A+  I NM     A+IWGALL GC +H ++++ E  A ++F+LEP N   ++ + +IY        V   R ++  +GL
Subjt:  PTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGL

Query:  TKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIR
         K PGCSWI I G V+ F  GD+S+P    I   L  +R
Subjt:  TKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIR

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.2e-9234.26Show/hide
Query:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHR-------------------------------NMHSWNI
        F F+L++CA   A  + +Q H  +L  G    + + T L+ +Y + G L DA KVFDK  HR                               ++ SWN 
Subjt:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHR-------------------------------NMHSWNI

Query:  LIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWN
        +I+ Y  +  +++A+ +F +       PD  T+  +  A    G + LG+++H W    GF   + + + ++DLY+KCG +  A  +F+ +  KD ISWN
Subjt:  LIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWN

Query:  SMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADV-AIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIV
        ++I GY    LY +AL  F+ ML  G   + +T+ S+L AC   G +  G+  H  + K L    +  ++  SLIDMY+KCG +  + +VF ++ + ++ 
Subjt:  SMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADV-AIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIV

Query:  TWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTA
        +W  MI  +  HG+ + S  LF++M+  GIQP+ +T   +L++CSH+G ++ G  IF+++  DY + P +EHY C++DLL   G  +EA  +I  M+   
Subjt:  TWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTA

Query:  AASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFE
           IW +LL  C MH N+E+GE  A  L K+EP NP +++ L +IY S G    V+ TRA + D+G+ K+PGCS I ID VVH+F  GD  HP   +I+ 
Subjt:  AASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFE

Query:  TL
         L
Subjt:  TL

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-9335.2Show/hide
Query:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHY
        ++ +L+ C     + Q +  HA IL       I +   L+ +Y+KCG L +ARKVF+KM  R+  +W  LI+ Y       DA+  FN+    G+ P+ +
Subjt:  FSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHY

Query:  TLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDF
        TL  + KA+        G +LH + ++ GF+  V V S +LDLY + G + DA+ VFD++  ++ +SWN++I+G+ R      AL+ F+ ML +G     
Subjt:  TLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDF

Query:  MTIPSVLNACGGEGDLRKGKETHCLVLKS----LVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDC
         +  S+  AC   G L +GK  H  ++KS    + FA     GN+L+DMY+K GS+ D+ K+F  ++  ++V+W ++++ Y  HG G++++  F +M+  
Subjt:  MTIPSVLNACGGEGDLRKGKETHCLVLKS----LVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDC

Query:  GIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQL
        GI+PN ++  ++L +CSH+G ++EGW  ++ +  D G+ P   HY  VVDLL R G L  A   I  M     A+IW ALL+ C MHKN E+G  AA  +
Subjt:  GIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQL

Query:  FKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHP----LALKIFETLNAIRQ
        F+L+P +P   + L +IY S G     +  R KM++ G+ K P CSW+ I+  +H F   D  HP    +A K  E L  I++
Subjt:  FKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHP----LALKIFETLNAIRQ

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-9539.42Show/hide
Query:  SRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVF-------NEFR
        S F   ++ C+ L  I   KQ H Q  + G    I + + L+++YS CG L DARKVFD++  RN+ SW  +I  Y  +    DA+S+F       N+  
Subjt:  SRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVF-------NEFR

Query:  DLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKC--GAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFK
        D  FL D   L  +  A   +    L + +H + I+ GF+  V V +T+LD YAK   G V  A+K+FD ++ KD +S+NS++S Y ++G+  +A + F+
Subjt:  DLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKC--GAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFK

Query:  HMLL-EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLV
         ++  + V  + +T+ +VL A    G LR GK  H  V++ +    DV +G S+IDMY KCG +  + K F  M N N+ +WT MI+ YG HG   K+L 
Subjt:  HMLL-EGVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLV

Query:  LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEI
        LF  M D G++PN +T  ++LA+CSHAG   EGWR F ++   +GVEP +EHY C+VDLL R GFLQ+A+ LI+ MK    + IW +LL+ C +HKN+E+
Subjt:  LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEI

Query:  GEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETL
         EI+  +LF+L+  N   ++ L  IY   G    V   R  M+++GL K PG S + ++G VH F  GD  HP   KI+E L
Subjt:  GEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-9138.27Show/hide
Query:  LVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIR--LGFEGYVVV
        L+ +YSKCG L  A+ VF +M+ R++ S+  +IA Y    L  +A+ +F E  + G  PD YT+  +         +  GKR+H W     LGF+  + V
Subjt:  LVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLPQMFKASVGIGDVYLGKRLHCWTIR--LGFEGYVVV

Query:  DSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLE-GVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAAD
         + ++D+YAKCG++ +A+ VF  M +KD ISWN++I GY +     +AL  F  +L E     D  T+  VL AC       KG+E H  ++++  F +D
Subjt:  DSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLE-GVEMDFMTIPSVLNACGGEGDLRKGKETHCLVLKSLVFAAD

Query:  VAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVE
          + NSL+DMY+KCG+LL +  +F ++++ ++V+WT MI+ YG HG G++++ LFN+M+  GI+ + ++  ++L +CSH+G ++EGWR F  +  +  +E
Subjt:  VAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFKSIVSDYGVE

Query:  PTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGL
        PTVEHYAC+VD+L+R G L +A+  I NM     A+IWGALL GC +H ++++ E  A ++F+LEP N   ++ + +IY        V   R ++  +GL
Subjt:  PTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQGL

Query:  TKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIR
         K PGCSWI I G V+ F  GD+S+P    I   L  +R
Subjt:  TKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIR

AT5G04780.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-9737.61Show/hide
Query:  ILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLP
        IL+ CA   A+ +AK  H +I+   L   +TL   L+  YSKCG +  AR+VFD M  R++ SWN +I  Y  + +  +A+ +F E R+ GF    +T+ 
Subjt:  ILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLP

Query:  QMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTI
         +  A     D    K+LHC +++   +  + V + +LDLYAKCG + DA +VF+SM  K +++W+SM++GY +   Y +AL  ++      +E +  T+
Subjt:  QMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTI

Query:  PSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSV
         SV+ AC     L +GK+ H ++ KS  F ++V + +S +DMY+KCGSL +S  +F  +   N+  W T+IS +  H + ++ ++LF KM+  G+ PN V
Subjt:  PSVLNACGGEGDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSV

Query:  TLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMN
        T +++L+ C H G + EG R FK + + YG+ P V HY+C+VD+L R G L EA+ LI+++     ASIWG+LL+ C ++KNLE+ E+AA +LF+LEP N
Subjt:  TLTAILASCSHAGYINEGWRIFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMN

Query:  PSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLN
          N + L +IY +      ++ +R  +RD  + K+ G SWI I   VH F  G++ HP   +I  TL+
Subjt:  PSNFIALISIYESRGILCGVSLTRAKMRDQGLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCATAGCACTTCTCGTTTCTCTTTCATTCTCAGAAACTGTGCTCATCTTTCTGCAATTGCTCAAGCAAAGCAGGCCCACGCTCAAATCCTCATCCATGGCTTAAT
TCCACATATCACTCTTCAAACAGATCTCGTGTTGGTGTATAGCAAATGTGGGCTTCTGCATGACGCCCGCAAAGTGTTTGACAAAATGACTCACAGAAACATGCACTCCT
GGAACATCTTGATTGCTTCTTATGTTCATAGTTCTTTGCACCGTGATGCAATAAGTGTGTTTAATGAGTTTAGGGACCTTGGTTTTCTACCTGACCACTATACTCTGCCC
CAGATGTTTAAGGCTAGTGTTGGTATAGGAGATGTCTATCTGGGGAAGAGACTCCATTGTTGGACAATTAGGCTTGGGTTTGAAGGATATGTTGTTGTAGACAGTACAGT
TTTGGACTTATATGCAAAATGTGGGGCTGTGGGTGATGCCAAGAAGGTGTTTGATAGTATGATCTTGAAAGATACAATTTCTTGGAATTCAATGATTTCTGGGTATGGGA
GGGCTGGGCTTTATGGAGATGCATTGGATTGTTTCAAGCACATGCTCTTGGAAGGAGTGGAGATGGATTTTATGACAATTCCTAGTGTTTTGAATGCTTGTGGAGGGGAA
GGAGATTTAAGAAAAGGCAAAGAGACTCATTGCTTAGTTTTGAAGAGTCTGGTATTTGCTGCAGACGTTGCAATTGGGAACTCGTTAATTGATATGTATTCAAAGTGTGG
AAGCCTGCTTGATTCTGAAAAGGTCTTTTGGAATATGAGCAACTTGAATATTGTTACATGGACCACAATGATATCTTGTTATGGGGCCCATGGTAAAGGAGAGAAATCCT
TGGTCCTGTTTAACAAAATGAAAGATTGTGGAATCCAACCCAATTCTGTCACACTGACAGCCATTTTGGCTAGCTGCAGCCATGCAGGTTACATCAATGAAGGTTGGAGA
ATTTTCAAGTCCATTGTTTCAGATTATGGGGTTGAACCGACCGTAGAACATTATGCTTGTGTTGTTGATCTTCTGAGTCGTTTCGGCTTTTTGCAGGAAGCATTTTCATT
AATAAGAAACATGAAATCCACAGCTGCTGCAAGCATCTGGGGTGCTCTGCTTTCTGGTTGTATAATGCACAAGAACCTTGAGATTGGAGAAATTGCAGCCAACCAGCTTT
TCAAGTTGGAACCTATGAATCCAAGCAATTTTATAGCCTTAATTAGTATATATGAATCTCGGGGTATATTGTGCGGTGTTTCACTGACTAGAGCGAAAATGAGAGACCAG
GGTTTGACCAAACTCCCTGGCTGCAGCTGGATAACCATTGATGGAGTTGTACATAAATTCTATGGAGGAGACAATTCTCATCCTCTGGCTTTGAAAATATTTGAAACGTT
AAATGCAATAAGACAAGCGACTGTTAACTGTGAAACAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCATAGCACTTCTCGTTTCTCTTTCATTCTCAGAAACTGTGCTCATCTTTCTGCAATTGCTCAAGCAAAGCAGGCCCACGCTCAAATCCTCATCCATGGCTTAAT
TCCACATATCACTCTTCAAACAGATCTCGTGTTGGTGTATAGCAAATGTGGGCTTCTGCATGACGCCCGCAAAGTGTTTGACAAAATGACTCACAGAAACATGCACTCCT
GGAACATCTTGATTGCTTCTTATGTTCATAGTTCTTTGCACCGTGATGCAATAAGTGTGTTTAATGAGTTTAGGGACCTTGGTTTTCTACCTGACCACTATACTCTGCCC
CAGATGTTTAAGGCTAGTGTTGGTATAGGAGATGTCTATCTGGGGAAGAGACTCCATTGTTGGACAATTAGGCTTGGGTTTGAAGGATATGTTGTTGTAGACAGTACAGT
TTTGGACTTATATGCAAAATGTGGGGCTGTGGGTGATGCCAAGAAGGTGTTTGATAGTATGATCTTGAAAGATACAATTTCTTGGAATTCAATGATTTCTGGGTATGGGA
GGGCTGGGCTTTATGGAGATGCATTGGATTGTTTCAAGCACATGCTCTTGGAAGGAGTGGAGATGGATTTTATGACAATTCCTAGTGTTTTGAATGCTTGTGGAGGGGAA
GGAGATTTAAGAAAAGGCAAAGAGACTCATTGCTTAGTTTTGAAGAGTCTGGTATTTGCTGCAGACGTTGCAATTGGGAACTCGTTAATTGATATGTATTCAAAGTGTGG
AAGCCTGCTTGATTCTGAAAAGGTCTTTTGGAATATGAGCAACTTGAATATTGTTACATGGACCACAATGATATCTTGTTATGGGGCCCATGGTAAAGGAGAGAAATCCT
TGGTCCTGTTTAACAAAATGAAAGATTGTGGAATCCAACCCAATTCTGTCACACTGACAGCCATTTTGGCTAGCTGCAGCCATGCAGGTTACATCAATGAAGGTTGGAGA
ATTTTCAAGTCCATTGTTTCAGATTATGGGGTTGAACCGACCGTAGAACATTATGCTTGTGTTGTTGATCTTCTGAGTCGTTTCGGCTTTTTGCAGGAAGCATTTTCATT
AATAAGAAACATGAAATCCACAGCTGCTGCAAGCATCTGGGGTGCTCTGCTTTCTGGTTGTATAATGCACAAGAACCTTGAGATTGGAGAAATTGCAGCCAACCAGCTTT
TCAAGTTGGAACCTATGAATCCAAGCAATTTTATAGCCTTAATTAGTATATATGAATCTCGGGGTATATTGTGCGGTGTTTCACTGACTAGAGCGAAAATGAGAGACCAG
GGTTTGACCAAACTCCCTGGCTGCAGCTGGATAACCATTGATGGAGTTGTACATAAATTCTATGGAGGAGACAATTCTCATCCTCTGGCTTTGAAAATATTTGAAACGTT
AAATGCAATAAGACAAGCGACTGTTAACTGTGAAACAACTTGAACTAAAGGAATAGAAGCTCAAGTTGGTAGTAGAGCCTATGAGACTTGCGTCCATAGCATCCATTAAC
GTAAATTCTTTAGCAACCAGTTGGAACTTGAGCTGTTCAATCGGCCTACAATTGGTCATTGTATCAATTAAACTTGACTTCAACGACTGCATCTTCTGGAGACCACAAGT
GCTCTCTGTTGTACGTGCACATGATCTTGAAGATTTCTTGTTTGATTCAACATAGACGCCATTAAGTTATATCTCACACAACCTGTTGAGATTAGCAGTACAAGCTACAT
TCAATTGAACTCAAAGTACACATTTCAATCTTGTTGTTGGAGGCAGCTGAGTTATTGAAAATTCTCAATCACAGTCGGTTACGATTAAGAACGGCTCGATGAGCATTCAT
GACTATATACTGAAGATGGAAAACTGTAGCTGATAGTTTTTCCATTACAGAAATTAACGACAATGAAGAGTTATTGATGCACACTGATAGGTTATGGTAAAATTAATTAT
ATCAACACTAACAGTTTCTAACTAGTATAGTTGTTTGGACCATAGTTTTAGACCTCATTCTCAGGTACTTACCATTTCCTTCCTAGAAATGACAACTCTAATCAGTATTT
GTTGCTTCTACTAACGGGAGTATACCAACAAAGGAGCAACGAATCATATTGACTGGTGACATGGAAAAACCCACCTTCAACAATGGGTGCCAAGGGAATAGTAAGTTGGC
AGAAATACTGAGTTCGCTGCTTTAAAAAAAAAAATCTCCAGTAGTTCAGCACTCTCAAAGGAGAGATTGTGGTCCTTAAGCGCTAGGAGAATAGTAATTTAGATGTCAGG
CTGTTACATGTTAGTTAGATTTGAAGTTTGAGCCTTTGAGGGTTTTTGTTATCTCTGTTAGCGAACTGGGAATGATTGCTTTGTGGACTTTAATTGGTTTTTCAACATTC
AGG
Protein sequenceShow/hide protein sequence
MSHSTSRFSFILRNCAHLSAIAQAKQAHAQILIHGLIPHITLQTDLVLVYSKCGLLHDARKVFDKMTHRNMHSWNILIASYVHSSLHRDAISVFNEFRDLGFLPDHYTLP
QMFKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDLYAKCGAVGDAKKVFDSMILKDTISWNSMISGYGRAGLYGDALDCFKHMLLEGVEMDFMTIPSVLNACGGE
GDLRKGKETHCLVLKSLVFAADVAIGNSLIDMYSKCGSLLDSEKVFWNMSNLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWR
IFKSIVSDYGVEPTVEHYACVVDLLSRFGFLQEAFSLIRNMKSTAAASIWGALLSGCIMHKNLEIGEIAANQLFKLEPMNPSNFIALISIYESRGILCGVSLTRAKMRDQ
GLTKLPGCSWITIDGVVHKFYGGDNSHPLALKIFETLNAIRQATVNCETT