; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014070 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014070
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTetratricopeptide repeat (TPR)-like superfamily protein
Genome locationtig00154217:107212..110917
RNA-Seq ExpressionSgr014070
SyntenySgr014070
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022386.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-22671.38Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLIVRNSLQW+N+STT++S PPF P  PSIWATNLIKSYFDKGL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           + + PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCNCL DAKR FGE+TE+NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SM+EDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

XP_022157012.1 putative pentatricopeptide repeat-containing protein At1g56570 [Momordica charantia]2.6e-23170.95Show/hide
Query:  NHPATAKKGECSASSGELKNSVR-ELEVRTKTTGMSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNL
        NH     + E    +  LKN ++    +R  TTGMSANKLASSTHF  IPL+VRNSLQ VNSSTTI+ HPPF+P GPSIWATNLIKSYFDKGLT EARNL
Subjt:  NHPATAKKGECSASSGELKNSVR-ELEVRTKTTGMSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNL

Query:  FDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL-----TGRS----------------
        FDEMPERDVVAWT +I                     +E+EPNAFTMSS+LKA KGM+ALSCG L HGLA  + +      G +                
Subjt:  FDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL-----TGRS----------------

Query:  ------LKTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAK
              LKTAVSWTTLIA +THRGDGYSGLQVFRQMLLEDVE +SFSFSIAVRACASIGS SYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCL DAK
Subjt:  ------LKTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAK

Query:  RYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSH
        R FGE+T KNLITWNTL+AGYERSDS+ESLRLFS MGCEGYEPNCFTFTS+TAACANLAVLSCGQQVHGGIVRRGFDKSVAL+NALIDMYAKCGNVNDSH
Subjt:  RYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSH

Query:  KLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEE
        KLF DM QRDLVSWTTMMIGYGAHGYGKEAIKLFDE                                  SMLEDY +NPDQEIYGCVVDLLGRAGRVEE
Subjt:  KLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEE

Query:  AFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
        AFQL ESMPFEPDESVWGALLGACK YELSNLGNLAAQRVL+ RPNMAGTYLLLSNIYAAEGKW EFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  AFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

XP_022931578.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita moschata]8.8e-22771.38Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLIVRNSLQW+N+STT++S PPF P  PSIWATNLIKSYFDKGL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +++ PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCNCL DAKR FGE+TE+NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SM+EDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

XP_022989366.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita maxima]4.8e-22571.02Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLI+RNSLQW+N+STT++S PPF P  PSIWATNLIKSYFD+GL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +++ PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCN L DAKR FGE+T +NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMPQRDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SMLEDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

XP_023520790.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita pepo subsp. pepo]6.7e-22771.55Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLIVRNSLQW+N+STT++S PPF P  PSIWATNLIKSYFDKGL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           + + PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCNCL DAKR FGE+TE+NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SMLEDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

TrEMBL top hitse value%identityAlignment
A0A0A0LW37 Uncharacterized protein1.7e-21569.07Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MS +KLASS HF  IPLIVRNSLQW+ S++T++S+PPF P GPS+WATNLIKSYFDKGLT EA NLF+E+PERDVV WTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL----------------------------TGRSLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +EV+PNAFTMSS+LKACKGMKALSCG L H LA    +                                LKTAVSWTTLIAG+THRGDGYSGL  F
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL----------------------------TGRSLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLLEDV  +SFSFSIA RACASI S+S GKQIHAAVTKYGLH D PVMNSILDMYCRCN L DAKR FGE+TEKNLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
         QMG EGY+PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDK+VAL+N+LIDMYAKCG+++DSHKLFCDMP RDLVSWTTMMIGYGAHGYGKEA+KL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SMLEDY+INPDQEIY CVVDLLGRAGRVEEAFQLVE+MPFEPDESVWGALLGACKAY+LSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLVYAE
        NLAAQRVL+ RPNMAGTYLLLS IYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V A+
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLVYAE

A0A1S3BQ30 putative pentatricopeptide repeat-containing protein At1g565703.2e-21468.89Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MS +KLASS HF  IPLIVRNSLQW+ S++T++S+PPF P GPS WATNLIKSYFDKGLT EA NLF+E+PERDVV WTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL----------------------------TGRSLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +EV+PNAFTMSS+LKACKGMKALSCG L H LA  + +                                LKTAVSWTTLIAG THRGDGYSGL  F
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL----------------------------TGRSLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        R+MLLEDV  +SFSFSIA RACASI S+S GKQIHAAVTKYGLH D PVMNSILDMYCRCN L DAKR F E+TEKNLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
         QMG EGY+PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDK+VAL+N+LIDMYAKCG++NDSHKLFCDMP RDLVSWTTMMIGYG HGYGKEA+KL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SMLEDY+INPDQEIY CVVDLLGRAGRVEEAFQLVE+MPFEPDESVWGALLGACKA +LSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLVYAE
        NLAAQRVL TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V A+
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLVYAE

A0A6J1DTG1 putative pentatricopeptide repeat-containing protein At1g565701.3e-23170.95Show/hide
Query:  NHPATAKKGECSASSGELKNSVR-ELEVRTKTTGMSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNL
        NH     + E    +  LKN ++    +R  TTGMSANKLASSTHF  IPL+VRNSLQ VNSSTTI+ HPPF+P GPSIWATNLIKSYFDKGLT EARNL
Subjt:  NHPATAKKGECSASSGELKNSVR-ELEVRTKTTGMSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNL

Query:  FDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL-----TGRS----------------
        FDEMPERDVVAWT +I                     +E+EPNAFTMSS+LKA KGM+ALSCG L HGLA  + +      G +                
Subjt:  FDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMAL-----TGRS----------------

Query:  ------LKTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAK
              LKTAVSWTTLIA +THRGDGYSGLQVFRQMLLEDVE +SFSFSIAVRACASIGS SYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCL DAK
Subjt:  ------LKTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAK

Query:  RYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSH
        R FGE+T KNLITWNTL+AGYERSDS+ESLRLFS MGCEGYEPNCFTFTS+TAACANLAVLSCGQQVHGGIVRRGFDKSVAL+NALIDMYAKCGNVNDSH
Subjt:  RYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSH

Query:  KLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEE
        KLF DM QRDLVSWTTMMIGYGAHGYGKEAIKLFDE                                  SMLEDY +NPDQEIYGCVVDLLGRAGRVEE
Subjt:  KLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEE

Query:  AFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
        AFQL ESMPFEPDESVWGALLGACK YELSNLGNLAAQRVL+ RPNMAGTYLLLSNIYAAEGKW EFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  AFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

A0A6J1EZ33 putative pentatricopeptide repeat-containing protein At1g565704.2e-22771.38Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLIVRNSLQW+N+STT++S PPF P  PSIWATNLIKSYFDKGL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +++ PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCNCL DAKR FGE+TE+NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SM+EDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

A0A6J1JJV4 putative pentatricopeptide repeat-containing protein At1g565702.3e-22571.02Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------
        MSANKLASST F  IPLI+RNSLQW+N+STT++S PPF P  PSIWATNLIKSYFD+GL+  ARNLFDEMPERDVVAWTAMI                  
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI------------------

Query:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF
           +++ PNAFT+SS+LKACKGMKALSCGTL H LA    + G                              LKTAVSWTTLIAG+THRGDGYSGLQVF
Subjt:  ---NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQVF

Query:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF
        RQMLL++VE +SFSFSIAVRACASIGS++YGKQIHAAVTKYGLHSD+PV+NSILDMYCRCN L DAKR FGE+T +NLITWNTL+AGYERSDS+ESL LF
Subjt:  RQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLF

Query:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL
        SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKCGN+NDSHKLFCDMPQRDLVSWTTMMIGYG+HGYGKE IKL
Subjt:  SQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKL

Query:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
        FDE                                  SMLEDY++NPD EIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG
Subjt:  FDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLG

Query:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV
         LAAQRVL+TRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGM  +K+  +  + +  E  + +V
Subjt:  NLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLV

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic9.2e-7031.21Show/hide
Query:  EARNLFDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLA----------LSMAL----------
        EAR +FD MPERD+V+W  ++                       ++P+  T+ S+L A   ++ +S G  +HG A          +S AL          
Subjt:  EARNLFDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLA----------LSMAL----------

Query:  -TGRSL------KTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNC
         T R L      +  VSW ++I  Y    +    + +F++ML E V+ +  S   A+ ACA +G    G+ IH    + GL  +V V+NS++ MYC+C  
Subjt:  -TGRSL------KTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNC

Query:  LSDAKRYFGEVTEKNLITWNTLVAGYERSDSN-ESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCG
        +  A   FG++  + L++WN ++ G+ ++    ++L  FSQM     +P+ FT+ S+  A A L++    + +HG ++R   DK+V +  AL+DMYAKCG
Subjt:  LSDAKRYFGEVTEKNLITWNTLVAGYERSDSN-ESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCG

Query:  NVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE---------------------------------WSMLEDYSINPDQEIYGCVVDLLGR
         +  +  +F  M +R + +W  M+ GYG HG+GK A++LF+E                                 + M E+YSI    + YG +VDLLGR
Subjt:  NVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE---------------------------------WSMLEDYSINPDQEIYGCVVDLLGR

Query:  AGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRK
        AGR+ EA+  +  MP +P  +V+GA+LGAC+ ++  N    AA+R+    P+  G ++LL+NIY A   W +  ++R  M     RK
Subjt:  AGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRK

Q9FXA9 Putative pentatricopeptide repeat-containing protein At1g565701.4e-14749.28Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVN-SSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI-----------------
        MS  KLA S  F+ IP  VR+SL+     S+    +PP++P    I ATNLI SYF+KGL  EAR+LFDEMP+RDVVAWTAMI                 
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVN-SSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI-----------------

Query:  ----NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQV
                PN FT+SS+LK+C+ MK L+ G LVHG+ + + + G                              +K  V+WTTLI G+TH GDG  GL++
Subjt:  ----NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQV

Query:  FRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRL
        ++QMLLE+ E++ +  +IAVRA ASI S + GKQIHA+V K G  S++PVMNSILD+YCRC  LS+AK YF E+ +K+LITWNTL++  ERSDS+E+L +
Subjt:  FRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRL

Query:  FSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAI
        F +   +G+ PNC+TFTS+ AACAN+A L+CGQQ+HG I RRGF+K+V L NALIDMYAKCGN+ DS ++F + + +R+LVSWT+MMIGYG+HGYG EA+
Subjt:  FSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAI

Query:  KLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSN
        +LFD+                                   M  +Y INPD++IY CVVDLLGRAG++ EA++LVE MPF+PDES WGA+LGACKA++ + 
Subjt:  KLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSN

Query:  L-GNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK
        L   LAA++V+  +P M GTY++LS IYAAEGKW +FA++RK+M+ M  +K+
Subjt:  L-GNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168606.0e-6932.69Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMINEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLK-TAVSWTTLIAGYTHRGDGYSGL
        L+  Y   G+  EA  +F  M  +DVV+W AM+                         G     + L   +    +K   V+W+  I+GY  RG GY  L
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMINEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLK-TAVSWTTLIAGYTHRGDGYSGL

Query:  QVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGL------HSDV-PVMNSILDMYCRCNCLSDAKRYFGEVT--EKNLITWNTLVAGY
         V RQML   ++ +  +    +  CAS+G+  +GK+IH    KY +      H D   V+N ++DMY +C  +  A+  F  ++  E++++TW  ++ GY
Subjt:  QVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGL------HSDV-PVMNSILDMYCRCNCLSDAKRYFGEVT--EKNLITWNTLVAGY

Query:  -ERSDSNESLRLFSQMGCEGYE--PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV-NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTM
         +  D+N++L L S+M  E  +  PN FT +    ACA+LA L  G+Q+H   +R   +     V N LIDMYAKCG+++D+  +F +M  ++ V+WT++
Subjt:  -ERSDSNESLRLFSQMGCEGYE--PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV-NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTM

Query:  MIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVW
        M GYG HGYG+EA+ +FDE                                   M   + ++P  E Y C+VDLLGRAGR+  A +L+E MP EP   VW
Subjt:  MIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVW

Query:  GALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK
         A L  C+ +    LG  AA+++     N  G+Y LLSN+YA  G+W +  ++R LM+    +K+
Subjt:  GALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic8.3e-7130.75Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALS-----------
        LI  Y   G    AR LFD MP RD+++W AMI+                      V+P+  T++S++ AC+ +     G  +H   ++           
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALS-----------

Query:  ----MALTGRSLKTA------------VSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVM
            M L   S + A            VSWTT+I+GY +       +  +R M  + V+    + +  + ACA++G    G ++H    K  L S V V 
Subjt:  ----MALTGRSLKTA------------VSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVM

Query:  NSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV
        N++++MY +C C+  A   F  +  KN+I+W +++AG   ++      +F +      +PN  T T+  AACA +  L CG+++H  ++R G      L 
Subjt:  NSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV

Query:  NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE--------------------------------WSMLEDYSINPDQEI
        NAL+DMY +CG +N +   F +  ++D+ SW  ++ GY   G G   ++LFD                                 +S +EDY + P+ + 
Subjt:  NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE--------------------------------WSMLEDYSINPDQEI

Query:  YGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK
        Y CVVDLLGRAG ++EA + ++ MP  PD +VWGALL AC+ +   +LG L+AQ +        G Y+LL N+YA  GKW E AK+R++MK
Subjt:  YGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK

Q9SVA5 Pentatricopeptide repeat-containing protein At4g395303.2e-7030.99Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLKT-
        LI SY   G  I A  LF+ MP +++++WT +++                      ++P+ +  SSIL +C  + AL  GT VH   +   L   S  T 
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLKT-

Query:  --------------------------AVSWTTLIAGYTHRG---DGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDV
                                   V +  +I GY+  G   + +  L +FR M    +  S  +F   +RA AS+ S    KQIH  + KYGL+ D+
Subjt:  --------------------------AVSWTTLIAGYTHRG---DGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDV

Query:  PVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGY-ERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS
           ++++D+Y  C CL D++  F E+  K+L+ WN++ AGY ++S++ E+L LF ++      P+ FTF ++  A  NLA +  GQ+ H  +++RG + +
Subjt:  PVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGY-ERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS

Query:  VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW----------------------SMLED----------YSINP
          + NAL+DMYAKCG+  D+HK F     RD+V W +++  Y  HG GK+A+++ ++                        ++ED          + I P
Subjt:  VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW----------------------SMLED----------YSINP

Query:  DQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK--GMT
        + E Y C+V LLGRAGR+ +A +L+E MP +P   VW +LL  C       L   AA+  + + P  +G++ +LSNIYA++G W E  K+R+ MK  G+ 
Subjt:  DQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK--GMT

Query:  TRKKRVRVGLNME
            R  +G+N E
Subjt:  TRKKRVRVGLNME

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-7131.21Show/hide
Query:  EARNLFDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLA----------LSMAL----------
        EAR +FD MPERD+V+W  ++                       ++P+  T+ S+L A   ++ +S G  +HG A          +S AL          
Subjt:  EARNLFDEMPERDVVAWTAMI---------------------NEVEPNAFTMSSILKACKGMKALSCGTLVHGLA----------LSMAL----------

Query:  -TGRSL------KTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNC
         T R L      +  VSW ++I  Y    +    + +F++ML E V+ +  S   A+ ACA +G    G+ IH    + GL  +V V+NS++ MYC+C  
Subjt:  -TGRSL------KTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNC

Query:  LSDAKRYFGEVTEKNLITWNTLVAGYERSDSN-ESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCG
        +  A   FG++  + L++WN ++ G+ ++    ++L  FSQM     +P+ FT+ S+  A A L++    + +HG ++R   DK+V +  AL+DMYAKCG
Subjt:  LSDAKRYFGEVTEKNLITWNTLVAGYERSDSN-ESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCG

Query:  NVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE---------------------------------WSMLEDYSINPDQEIYGCVVDLLGR
         +  +  +F  M +R + +W  M+ GYG HG+GK A++LF+E                                 + M E+YSI    + YG +VDLLGR
Subjt:  NVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE---------------------------------WSMLEDYSINPDQEIYGCVVDLLGR

Query:  AGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRK
        AGR+ EA+  +  MP +P  +V+GA+LGAC+ ++  N    AA+R+    P+  G ++LL+NIY A   W +  ++R  M     RK
Subjt:  AGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRK

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.9e-7230.75Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALS-----------
        LI  Y   G    AR LFD MP RD+++W AMI+                      V+P+  T++S++ AC+ +     G  +H   ++           
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALS-----------

Query:  ----MALTGRSLKTA------------VSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVM
            M L   S + A            VSWTT+I+GY +       +  +R M  + V+    + +  + ACA++G    G ++H    K  L S V V 
Subjt:  ----MALTGRSLKTA------------VSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVM

Query:  NSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV
        N++++MY +C C+  A   F  +  KN+I+W +++AG   ++      +F +      +PN  T T+  AACA +  L CG+++H  ++R G      L 
Subjt:  NSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV

Query:  NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE--------------------------------WSMLEDYSINPDQEI
        NAL+DMY +CG +N +   F +  ++D+ SW  ++ GY   G G   ++LFD                                 +S +EDY + P+ + 
Subjt:  NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDE--------------------------------WSMLEDYSINPDQEI

Query:  YGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK
        Y CVVDLLGRAG ++EA + ++ MP  PD +VWGALL AC+ +   +LG L+AQ +        G Y+LL N+YA  GKW E AK+R++MK
Subjt:  YGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK

AT1G56570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-14849.28Show/hide
Query:  MSANKLASSTHFQLIPLIVRNSLQWVN-SSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI-----------------
        MS  KLA S  F+ IP  VR+SL+     S+    +PP++P    I ATNLI SYF+KGL  EAR+LFDEMP+RDVVAWTAMI                 
Subjt:  MSANKLASSTHFQLIPLIVRNSLQWVN-SSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVVAWTAMI-----------------

Query:  ----NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQV
                PN FT+SS+LK+C+ MK L+ G LVHG+ + + + G                              +K  V+WTTLI G+TH GDG  GL++
Subjt:  ----NEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGR----------------------------SLKTAVSWTTLIAGYTHRGDGYSGLQV

Query:  FRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRL
        ++QMLLE+ E++ +  +IAVRA ASI S + GKQIHA+V K G  S++PVMNSILD+YCRC  LS+AK YF E+ +K+LITWNTL++  ERSDS+E+L +
Subjt:  FRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRL

Query:  FSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAI
        F +   +G+ PNC+TFTS+ AACAN+A L+CGQQ+HG I RRGF+K+V L NALIDMYAKCGN+ DS ++F + + +R+LVSWT+MMIGYG+HGYG EA+
Subjt:  FSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAI

Query:  KLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSN
        +LFD+                                   M  +Y INPD++IY CVVDLLGRAG++ EA++LVE MPF+PDES WGA+LGACKA++ + 
Subjt:  KLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSN

Query:  L-GNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK
        L   LAA++V+  +P M GTY++LS IYAAEGKW +FA++RK+M+ M  +K+
Subjt:  L-GNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-7130.99Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLKT-
        LI SY   G  I A  LF+ MP +++++WT +++                      ++P+ +  SSIL +C  + AL  GT VH   +   L   S  T 
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMIN---------------------EVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLKT-

Query:  --------------------------AVSWTTLIAGYTHRG---DGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDV
                                   V +  +I GY+  G   + +  L +FR M    +  S  +F   +RA AS+ S    KQIH  + KYGL+ D+
Subjt:  --------------------------AVSWTTLIAGYTHRG---DGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGLHSDV

Query:  PVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGY-ERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS
           ++++D+Y  C CL D++  F E+  K+L+ WN++ AGY ++S++ E+L LF ++      P+ FTF ++  A  NLA +  GQ+ H  +++RG + +
Subjt:  PVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGY-ERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS

Query:  VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW----------------------SMLED----------YSINP
          + NAL+DMYAKCG+  D+HK F     RD+V W +++  Y  HG GK+A+++ ++                        ++ED          + I P
Subjt:  VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEW----------------------SMLED----------YSINP

Query:  DQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK--GMT
        + E Y C+V LLGRAGR+ +A +L+E MP +P   VW +LL  C       L   AA+  + + P  +G++ +LSNIYA++G W E  K+R+ MK  G+ 
Subjt:  DQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMK--GMT

Query:  TRKKRVRVGLNME
            R  +G+N E
Subjt:  TRKKRVRVGLNME

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-7032.69Show/hide
Query:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMINEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLK-TAVSWTTLIAGYTHRGDGYSGL
        L+  Y   G+  EA  +F  M  +DVV+W AM+                         G     + L   +    +K   V+W+  I+GY  RG GY  L
Subjt:  LIKSYFDKGLTIEARNLFDEMPERDVVAWTAMINEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLK-TAVSWTTLIAGYTHRGDGYSGL

Query:  QVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGL------HSDV-PVMNSILDMYCRCNCLSDAKRYFGEVT--EKNLITWNTLVAGY
         V RQML   ++ +  +    +  CAS+G+  +GK+IH    KY +      H D   V+N ++DMY +C  +  A+  F  ++  E++++TW  ++ GY
Subjt:  QVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIHAAVTKYGL------HSDV-PVMNSILDMYCRCNCLSDAKRYFGEVT--EKNLITWNTLVAGY

Query:  -ERSDSNESLRLFSQMGCEGYE--PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV-NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTM
         +  D+N++L L S+M  E  +  PN FT +    ACA+LA L  G+Q+H   +R   +     V N LIDMYAKCG+++D+  +F +M  ++ V+WT++
Subjt:  -ERSDSNESLRLFSQMGCEGYE--PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALV-NALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTM

Query:  MIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVW
        M GYG HGYG+EA+ +FDE                                   M   + ++P  E Y C+VDLLGRAGR+  A +L+E MP EP   VW
Subjt:  MIGYGAHGYGKEAIKLFDEW---------------------------------SMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVW

Query:  GALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK
         A L  C+ +    LG  AA+++     N  G+Y LLSN+YA  G+W +  ++R LM+    +K+
Subjt:  GALLGACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCACCCGGCCACCGCGAAGAAGGGAGAGTGCTCGGCATCCTCCGGCGAATTGAAGAACTCAGTTCGTGAATTGGAGGTAAGAACGAAAACAACAGGGATGAGTGC
TAACAAATTGGCATCTTCCACTCATTTCCAGCTAATCCCATTGATAGTCAGAAACTCTCTTCAATGGGTAAACAGCTCCACCACTATACGATCACACCCACCTTTCAGGC
CGATGGGACCATCTATTTGGGCCACAAATCTGATCAAATCATACTTTGACAAGGGCCTGACTATAGAAGCTCGTAACCTGTTTGATGAAATGCCTGAAAGAGATGTGGTT
GCCTGGACTGCTATGATTAATGAGGTTGAGCCAAATGCCTTCACTATGTCTAGTATTCTTAAGGCTTGCAAGGGCATGAAGGCTCTTTCATGTGGGACTTTGGTTCATGG
TTTGGCACTAAGCATGGCATTGACGGGTCGATCTCTGAAGACTGCTGTGTCATGGACTACTTTGATTGCAGGGTACACTCACAGAGGTGATGGCTACAGCGGGCTTCAAG
TCTTCAGGCAAATGTTGTTGGAAGATGTTGAACTAAGCTCGTTTAGCTTTTCCATTGCGGTTAGAGCTTGTGCTTCGATTGGCTCGCATTCATATGGAAAGCAAATACAT
GCAGCAGTCACCAAATATGGCCTCCACTCTGATGTTCCAGTAATGAATTCCATACTTGACATGTATTGCAGGTGTAATTGTTTAAGTGATGCAAAAAGATACTTTGGTGA
AGTGACTGAAAAGAATTTGATTACATGGAACACCTTGGTAGCAGGATATGAAAGGTCAGATTCGAATGAATCTCTACGTTTGTTTTCGCAAATGGGATGCGAAGGCTATG
AACCGAATTGTTTTACATTCACAAGTATTACAGCTGCATGTGCAAATTTAGCAGTCTTGAGTTGTGGACAACAGGTTCATGGTGGAATTGTTCGTAGGGGATTTGACAAG
AGTGTGGCATTGGTCAATGCACTTATTGACATGTATGCAAAGTGCGGAAACGTAAATGATTCACACAAACTCTTCTGTGATATGCCTCAAAGAGACTTGGTTTCCTGGAC
TACCATGATGATTGGGTATGGAGCACATGGATATGGAAAAGAGGCCATTAAGTTGTTTGATGAATGGTCAATGCTGGAGGATTACAGTATTAACCCTGATCAAGAGATCT
ATGGGTGTGTGGTGGACTTGCTTGGCCGCGCTGGGAGAGTTGAGGAGGCTTTTCAACTCGTCGAGAGCATGCCATTTGAACCGGACGAGTCTGTTTGGGGTGCCCTCCTG
GGAGCTTGTAAAGCATATGAACTTTCAAATCTGGGAAATTTAGCAGCTCAGAGAGTATTGAATACTAGGCCGAACATGGCGGGGACTTACCTGCTGCTATCCAATATATA
TGCTGCTGAAGGTAAATGGGGCGAGTTCGCCAAAATGAGGAAGCTGATGAAAGGGATGACAACAAGAAAGAAGCGGGTAAGAGTTGGATTGAATATGGAGGAGGAGACGA
AAGCTGACCTTGTTTATGCTGAGAACACCTCAACTTGCAGAAATGTAGACGATGCAAACCAACATAATGCTGATGGCACCTTGCTTTCCTCATTGAGATGCTCAAAGCCC
TGCTGGAGTCTGGAAGGTGATTTTTCCAGTTGGTGGGTCATCAACATAGATATCACTCAGTGTGACCCAAAGCAGAAGCTCCTTGGTCTTAACCCCAGTGAGTTTCTTGA
TCTTGTTCTGCTCTACAATGCTGTGACTTCAGTGGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCACCCGGCCACCGCGAAGAAGGGAGAGTGCTCGGCATCCTCCGGCGAATTGAAGAACTCAGTTCGTGAATTGGAGGTAAGAACGAAAACAACAGGGATGAGTGC
TAACAAATTGGCATCTTCCACTCATTTCCAGCTAATCCCATTGATAGTCAGAAACTCTCTTCAATGGGTAAACAGCTCCACCACTATACGATCACACCCACCTTTCAGGC
CGATGGGACCATCTATTTGGGCCACAAATCTGATCAAATCATACTTTGACAAGGGCCTGACTATAGAAGCTCGTAACCTGTTTGATGAAATGCCTGAAAGAGATGTGGTT
GCCTGGACTGCTATGATTAATGAGGTTGAGCCAAATGCCTTCACTATGTCTAGTATTCTTAAGGCTTGCAAGGGCATGAAGGCTCTTTCATGTGGGACTTTGGTTCATGG
TTTGGCACTAAGCATGGCATTGACGGGTCGATCTCTGAAGACTGCTGTGTCATGGACTACTTTGATTGCAGGGTACACTCACAGAGGTGATGGCTACAGCGGGCTTCAAG
TCTTCAGGCAAATGTTGTTGGAAGATGTTGAACTAAGCTCGTTTAGCTTTTCCATTGCGGTTAGAGCTTGTGCTTCGATTGGCTCGCATTCATATGGAAAGCAAATACAT
GCAGCAGTCACCAAATATGGCCTCCACTCTGATGTTCCAGTAATGAATTCCATACTTGACATGTATTGCAGGTGTAATTGTTTAAGTGATGCAAAAAGATACTTTGGTGA
AGTGACTGAAAAGAATTTGATTACATGGAACACCTTGGTAGCAGGATATGAAAGGTCAGATTCGAATGAATCTCTACGTTTGTTTTCGCAAATGGGATGCGAAGGCTATG
AACCGAATTGTTTTACATTCACAAGTATTACAGCTGCATGTGCAAATTTAGCAGTCTTGAGTTGTGGACAACAGGTTCATGGTGGAATTGTTCGTAGGGGATTTGACAAG
AGTGTGGCATTGGTCAATGCACTTATTGACATGTATGCAAAGTGCGGAAACGTAAATGATTCACACAAACTCTTCTGTGATATGCCTCAAAGAGACTTGGTTTCCTGGAC
TACCATGATGATTGGGTATGGAGCACATGGATATGGAAAAGAGGCCATTAAGTTGTTTGATGAATGGTCAATGCTGGAGGATTACAGTATTAACCCTGATCAAGAGATCT
ATGGGTGTGTGGTGGACTTGCTTGGCCGCGCTGGGAGAGTTGAGGAGGCTTTTCAACTCGTCGAGAGCATGCCATTTGAACCGGACGAGTCTGTTTGGGGTGCCCTCCTG
GGAGCTTGTAAAGCATATGAACTTTCAAATCTGGGAAATTTAGCAGCTCAGAGAGTATTGAATACTAGGCCGAACATGGCGGGGACTTACCTGCTGCTATCCAATATATA
TGCTGCTGAAGGTAAATGGGGCGAGTTCGCCAAAATGAGGAAGCTGATGAAAGGGATGACAACAAGAAAGAAGCGGGTAAGAGTTGGATTGAATATGGAGGAGGAGACGA
AAGCTGACCTTGTTTATGCTGAGAACACCTCAACTTGCAGAAATGTAGACGATGCAAACCAACATAATGCTGATGGCACCTTGCTTTCCTCATTGAGATGCTCAAAGCCC
TGCTGGAGTCTGGAAGGTGATTTTTCCAGTTGGTGGGTCATCAACATAGATATCACTCAGTGTGACCCAAAGCAGAAGCTCCTTGGTCTTAACCCCAGTGAGTTTCTTGA
TCTTGTTCTGCTCTACAATGCTGTGACTTCAGTGGCATAA
Protein sequenceShow/hide protein sequence
MNHPATAKKGECSASSGELKNSVRELEVRTKTTGMSANKLASSTHFQLIPLIVRNSLQWVNSSTTIRSHPPFRPMGPSIWATNLIKSYFDKGLTIEARNLFDEMPERDVV
AWTAMINEVEPNAFTMSSILKACKGMKALSCGTLVHGLALSMALTGRSLKTAVSWTTLIAGYTHRGDGYSGLQVFRQMLLEDVELSSFSFSIAVRACASIGSHSYGKQIH
AAVTKYGLHSDVPVMNSILDMYCRCNCLSDAKRYFGEVTEKNLITWNTLVAGYERSDSNESLRLFSQMGCEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDK
SVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEWSMLEDYSINPDQEIYGCVVDLLGRAGRVEEAFQLVESMPFEPDESVWGALL
GACKAYELSNLGNLAAQRVLNTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKLMKGMTTRKKRVRVGLNMEEETKADLVYAENTSTCRNVDDANQHNADGTLLSSLRCSKP
CWSLEGDFSSWWVINIDITQCDPKQKLLGLNPSEFLDLVLLYNAVTSVA