CuGenDBv2

Sgr025437 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025437
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00006406:1322551..1330364
RNA-Seq Expression: Sgr025437
Synteny: Sgr025437
Gene Ontology terms:
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0046933 - proton-transporting ATP synthase activity, rotational mechanism (molecular function)
InterPro domains:
IPR000504 - RNA recognition motif domain
IPR000711 - ATPase, OSCP/delta subunit
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR026015 - F1F0 ATP synthase OSCP/delta subunit, N-terminal domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582687.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 84.14)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNGVD+TKFLI+KLLQIPNL YACTLFDLIPKP+VFLYNKFIQ FSS GH HRCWLLYYQMCL+GCSPN HSFTFLF ACAS L+ YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HF KSGFASDVFALTALLDMY KLG+L+SARQLFDEM VRD PTWNSMIAGYARSG MGAALELF +MP RNV+SWTALISGYAQNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEVTIASVLPACA LGALDIG+RIEAYARKNGFFKNLYVS AILEVHARCGNIEEAR+VF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QML +R RPDDVTFVGLLLACTHGGMVAKGR+LFESME KFQIAPKLEHY CLVDLLGR+GE++EAY+LIQ+MPM+PDSVIWGALLGACSFH NVEL E
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNEV
        VAAESLFKLEPWNPGNYVILSNIYASAG+W GVAR RKMMKGGH+ KRAG SYIE                              ++  +   P  +  V
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNEV

Query:  ATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFEDERDAEDAIRGLDNMPFGYDRRRLSVEWARGERGRHRDGSKSLANQRPTKTLFVI
        ATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFE+ERDAED+IR LDNMPFGYDRRRLSVEWARGERGR RDGSKS+ANQRPTKTLFVI
Subjt:  ATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFEDERDAEDAIRGLDNMPFGYDRRRLSVEWARGERGRHRDGSKSLANQRPTKTLFVI

Query:  NFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQEDATKALECTHMSKILDRVVSVEYALRDDGERGDPYDDSPRRAAYGRPGDSPYRRSPSPVF
        NFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQE+ATKALECTHMSKILDRVVSVEYALRDDGERGD YDDSPRRAAYGR GDSPYRRSPSPV+
Subjt:  NFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQEDATKALECTHMSKILDRVVSVEYALRDDGERGDPYDDSPRRAAYGRPGDSPYRRSPSPVF

Query:  RRRPSPDYGRARSPAYDRYNGP-YERRRSPEYGRNRSPEYGRFR
         RRPSPDYGRARSPAYDR   P   R RSPE GRNRSP+YGR R
Subjt:  RRRPSPDYGRARSPAYDRYNGP-YERRRSPEYGRNRSPEYGRFR

KAG7019084.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 87.83)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNGVD+TKFLI+KLLQIPNL YACTLFDLIPKP+VFLYNKFIQ FSS GH HRCWLLYYQMCL+GCSPN HSFTFLF ACAS L+ YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HF KSGFASDVFALTALLDMY KLG+L+SARQLFDEM VRD PTWNSMIAGYARSG MGAALELF +MP RNV+SWTALISGYAQNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEVTIASVLPACA LGALDIG+RIEAYARKNGFFKNLYVS AILEVHARCGNIEEAR+VF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTR---------RMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACS
        +QML R         R RPDDVTFVGLLLACTHGGMVAKGR+LFESME KFQIAPKLEHY CLVDLLGR+GE++EAY+LIQ+MPM+PDSVIWGALLGACS
Subjt:  EQMLTR---------RMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACS

Query:  FHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK
        FH NVEL EVAAESLFKLEPWNPGNYVILSNIYASAG+W GVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KSDEIYALLH +YA IK   
Subjt:  FHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK

Query:  PAPHCQNE----VATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFEDERDAEDAIRGLDNMPFGYDRRRLSVEWARGERGRHRDGSKS
           H QNE    VATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFE+ERDAED+IR LDNMPFGYDRRRLSVEWARGERGR RDGSKS
Subjt:  PAPHCQNE----VATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFEDERDAEDAIRGLDNMPFGYDRRRLSVEWARGERGRHRDGSKS

Query:  LANQRPTKTLFVINFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQEDATKALECTHMSKILDRVVSVEYALRDDGERGDPYDDSPRRAAYGRP
        +ANQRPTKTLFVINFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQE+ATKALECTHMSKILDRVVSVEYALRDDGERGD YDDSPRRAAYGR 
Subjt:  LANQRPTKTLFVINFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQEDATKALECTHMSKILDRVVSVEYALRDDGERGDPYDDSPRRAAYGRP

Query:  GDSPYRRSPSPVFRRRPSPDYGRARSPAYDRYNGPYERRRSPEYGRNRSPEYGRFR
        GDSPYRRSPSPV+ RRPSPDYGRARSPAYDRYNGPYERRRSP+YGRNRSP+YGR R
Subjt:  GDSPYRRSPSPVFRRRPSPDYGRARSPAYDRYNGPYERRRSPEYGRNRSPEYGRFR

XP_022147487.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica charantia] (e value: 1.3e-267; % identity: 89.78)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAYGLR+GVD+TKFLIEKLLQIPNL YAC LFDLIPKP+VFLYNKFIQ++SS+G  HRCW LYYQMC +GCSPNQHSFTFLFAACAS+ + +PGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEM VRD PTWNSM+AGY+RSGDMGAALELF RMPVRNVVSWTALISGY+QNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENERG KPNEVT+ASVLPACAQLGALDIGRRIEAYAR NGFFKNLYVS AILEVHARCGNIEEARQVF+EIGSKRNLCSWNTMIMGLAVHGRCS A++LY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QMLT+R+RPDDVTF+GLLLACTHGGMVAKGR+LFESMESKFQIAPKLEHY CLVDLLGR+GEL+EAYNLIQ MPMVPDSVIWGALLGACSFH +VELAE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+WRGVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIY+ IK QKPA H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

XP_022147489.1 pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Momordica charantia] (e value: 1.3e-267; % identity: 89.78)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAYGLR+GVD+TKFLIEKLLQIPNL YAC LFDLIPKP+VFLYNKFIQ++SS+G  HRCW LYYQMC +GCSPNQHSFTFLFAACAS+ + +PGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEM VRD PTWNSM+AGY+RSGDMGAALELF RMPVRNVVSWTALISGY+QNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENERG KPNEVT+ASVLPACAQLGALDIGRRIEAYAR NGFFKNLYVS AILEVHARCGNIEEARQVF+EIGSKRNLCSWNTMIMGLAVHGRCS A++LY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QMLT+R+RPDDVTF+GLLLACTHGGMVAKGR+LFESMESKFQIAPKLEHY CLVDLLGR+GEL+EAYNLIQ MPMVPDSVIWGALLGACSFH +VELAE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+WRGVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIY+ IK QKPA H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima] (e value: 1.4e-258; % identity: 87.37)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNGVD+TKFLIEKLLQIPNL YACTLFDLIPKP+VFLYNKFIQ FSS GH HRCWLLYYQMCL+GCSPN HSFTFLF ACAS L+ YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEM VRD PTWNSMIAGYARSG MGAALELF +MP+RNV+SWTALISGYAQNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEVTIASVLPACAQLGALDIG+RIE YARKNGFFKNLYVS AILEVHARCGNIEEAR+VF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QML +R RPDDVTFVGLLLACTHGGMVAKGR++FESME KFQIAPKLEHY CLVDLLGR+GE++EAY+LIQ+MPM PDSVIWGALLGACSFH NVEL E
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+W GVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KSDEIYALLH IYA IK      H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LY28 Uncharacterized protein (e value: 9.4e-256; % identity: 85.57)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNG+DHTKFLIEKLLQ+P+L YACTLFD IPKP+V+LYNKFIQ FSS GH HRCWLLY QMC +GCSPNQ+SFTFLF ACAS+ + YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HFCKSGFASD+FA+TALLDMYAKLGMLRSARQLFDEM VRD PTWNS+IAGYARSG M AALELF++MPVRNV+SWTALISGYAQNGKYAKALEMF+ L
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEV+IASVLPAC+QLGALDIG+RIEAYAR NGFFKN YVS A+LE+HARCGNIEEA+QVF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QML R+MRPDDVTFVGLLLACTHGGMVA+GR+LFESMESKFQ+APKLEHY CLVDLLGR+GELQEAYNLIQNMPM PDSVIWG LLGACSFH NVEL E
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYA AG+W GVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY  IK  K   H  NE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X2 (e value: 6.3e-268; % identity: 89.78)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAYGLR+GVD+TKFLIEKLLQIPNL YAC LFDLIPKP+VFLYNKFIQ++SS+G  HRCW LYYQMC +GCSPNQHSFTFLFAACAS+ + +PGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEM VRD PTWNSM+AGY+RSGDMGAALELF RMPVRNVVSWTALISGY+QNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENERG KPNEVT+ASVLPACAQLGALDIGRRIEAYAR NGFFKNLYVS AILEVHARCGNIEEARQVF+EIGSKRNLCSWNTMIMGLAVHGRCS A++LY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QMLT+R+RPDDVTF+GLLLACTHGGMVAKGR+LFESMESKFQIAPKLEHY CLVDLLGR+GEL+EAYNLIQ MPMVPDSVIWGALLGACSFH +VELAE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+WRGVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIY+ IK QKPA H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X1 (e value: 6.3e-268; % identity: 89.78)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAYGLR+GVD+TKFLIEKLLQIPNL YAC LFDLIPKP+VFLYNKFIQ++SS+G  HRCW LYYQMC +GCSPNQHSFTFLFAACAS+ + +PGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEM VRD PTWNSM+AGY+RSGDMGAALELF RMPVRNVVSWTALISGY+QNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENERG KPNEVT+ASVLPACAQLGALDIGRRIEAYAR NGFFKNLYVS AILEVHARCGNIEEARQVF+EIGSKRNLCSWNTMIMGLAVHGRCS A++LY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QMLT+R+RPDDVTF+GLLLACTHGGMVAKGR+LFESMESKFQIAPKLEHY CLVDLLGR+GEL+EAYNLIQ MPMVPDSVIWGALLGACSFH +VELAE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+WRGVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIY+ IK QKPA H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X1 (e value: 2.5e-256; % identity: 86.57)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNGVD+TKFLI+KLLQIPNL YACTLFDLIPKP+VFLYNKFIQ FSS GH HRCWLLYYQMCL+GCSPN HSFTFLF ACAS L+ YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HF KSGFASDVFALTALLDMY KLG+L+SARQLFDEM VRD PTWNSMIAGYARSG MGAALELF +MP RNV+SWTALISGYAQNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEVTIASVLPACA LGALDIG+RIEAYARKNGFFKNLYVS AILEVHARCGNIEEAR+VF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QML +R RPDDVTFVGLLLACTHGGMVAKGR+LFESME KFQIAPKLEHY CLVDLLGR+GE++EAY+LIQ+MPM+PDSVIWGALLGACSFH NVEL E
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+W GVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KSDEIYALLH +YA IK      H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X1 (e value: 6.9e-259; % identity: 87.37)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        +IHAY LRNGVD+TKFLIEKLLQIPNL YACTLFDLIPKP+VFLYNKFIQ FSS GH HRCWLLYYQMCL+GCSPN HSFTFLF ACAS L+ YPGQMLH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        +HFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEM VRD PTWNSMIAGYARSG MGAALELF +MP+RNV+SWTALISGYAQNGKYAKALEMFLRL
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        ENE+GTKPNEVTIASVLPACAQLGALDIG+RIE YARKNGFFKNLYVS AILEVHARCGNIEEAR+VF+EIGSKRNLCSWNTMIMGLAVHGRC DALQLY
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
        +QML +R RPDDVTFVGLLLACTHGGMVAKGR++FESME KFQIAPKLEHY CLVDLLGR+GE++EAY+LIQ+MPM PDSVIWGALLGACSFH NVEL E
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE
        VAAESLFKLEPWNPGNYVILSNIYASAG+W GVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KSDEIYALLH IYA IK      H QNE
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNE

SwissProt top hits (e value, % identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic (e value: 2.6e-101; % identity: 37.47)
Query:  IHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQ
        +H   +++ V    F+   L+       +L  AC +F  I +  V  +N  I  F   G   +   L+ +M       +  +   + +ACA I +   G+
Subjt:  IHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQ

Query:  MLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF
         + ++  ++    ++    A+LDMY K G +  A++LFD M  +D  TW +M+ GYA S D  AA E+ + MP +++V+W ALIS Y QNGK  +AL +F
Subjt:  MLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF

Query:  LRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDAL
          L+ ++  K N++T+ S L ACAQ+GAL++GR I +Y +K+G   N +V+ A++ ++++CG++E++R+VFN +  KR++  W+ MI GLA+HG  ++A+
Subjt:  LRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDAL

Query:  QLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVE
         ++ +M    ++P+ VTF  +  AC+H G+V +   LF  MES + I P+ +HY+C+VD+LGRSG L++A   I+ MP+ P + +WGALLGAC  H+N+ 
Subjt:  QLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVE

Query:  LAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIK
        LAE+A   L +LEP N G +V+LSNIYA  G W  V+ LRK M+   + K  G S IE+   IHEF+  D +H  S+++Y  LH +   +K
Subjt:  LAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIK

Q9C501 Pentatricopeptide repeat-containing protein At1g33350 (e value: 2.7e-106; % identity: 39.6)
Query:  KIHAYGLRNGVDHTKFLIEKL-----LQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSST--GHSHRCWLLYYQMCLRGC-SPNQHSFTFLFAACASILS
        ++ ++ + +G+ H+ FL  KL     L++ NLSYA  +FD    P   LY   + A+SS+   H+   +  +  M  R    PN   +  +  +   + S
Subjt:  KIHAYGLRNGVDHTKFLIEKL-----LQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSST--GHSHRCWLLYYQMCLRGC-SPNQHSFTFLFAACASILS

Query:  GYPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYA
         +   ++H H  KSGF   V   TALL  YA  +  +  ARQLFDEMS R+  +W +M++GYARSGD+  A+ LF  MP R+V SW A+++   QNG + 
Subjt:  GYPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYA

Query:  KALEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHG
        +A+ +F R+ NE   +PNEVT+  VL ACAQ G L + + I A+A +     +++VS ++++++ +CGN+EEA  VF ++ SK++L +WN+MI   A+HG
Subjt:  KALEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHG

Query:  RCSDALQLYEQML---TRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLG
        R  +A+ ++E+M+      ++PD +TF+GLL ACTHGG+V+KGR  F+ M ++F I P++EHY CL+DLLGR+G   EA  ++  M M  D  IWG+LL 
Subjt:  RCSDALQLYEQML---TRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLG

Query:  ACSFHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI
        AC  H +++LAEVA ++L  L P N G   +++N+Y   GNW    R RKM+K  +  K  G+S IE+ + +H+F   D+SH +++EIY +L  +
Subjt:  ACSFHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI

Q9FNN7 Pentatricopeptide repeat-containing protein At5g08510 (e value: 1.7e-166; % identity: 57.32)
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        ++HA+ LR GVD TK L+++LL IPNL YA  LFD       FLYNK IQA+      H   +LY  +   G  P+ H+F F+FAA AS  S  P ++LH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        + F +SGF SD F  T L+  YAKLG L  AR++FDEMS RD P WN+MI GY R GDM AA+ELF  MP +NV SWT +ISG++QNG Y++AL+MFL +
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        E ++  KPN +T+ SVLPACA LG L+IGRR+E YAR+NGFF N+YV  A +E++++CG I+ A+++F E+G++RNLCSWN+MI  LA HG+  +AL L+
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
         QML    +PD VTFVGLLLAC HGGMV KG+ELF+SME   +I+PKLEHY C++DLLGR G+LQEAY+LI+ MPM PD+V+WG LLGACSFH NVE+AE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK
        +A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +K +K
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK

Q9LS72 Pentatricopeptide repeat-containing protein At3g29230 (e value: 4.2e-104; % identity: 36.7)
Query:  SSRKMEKEQLFGIAKKVQNLSGAAMVKIHAYGLRNGVDHTKFLIEKLLQIPNL----SYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCL
        SSR++ +E+L  +  K  NL+   + ++HA  +R  +     +  KL+   +L    + A  +F+ + +P V L N  I+A +     ++ + ++ +M  
Subjt:  SSRKMEKEQLFGIAKKVQNLSGAAMVKIHAYGLRNGVDHTKFLIEKLLQIPNL----SYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCL

Query:  RGCSPNQHSFTFLFAACASILSGYPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDE
         G   +  ++ FL  AC+        +M+H H  K G +SD++   AL+D Y+                                 K G LR AR+LFDE
Subjt:  RGCSPNQHSFTFLFAACASILSGYPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDE

Query:  MSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF------------------------LRLENER--------G
        M  RD  +WN+M+ GYAR  +M  A ELF +MP RN VSW+ ++ GY++ G    A  MF                        L  E +R        G
Subjt:  MSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF------------------------LRLENER--------G

Query:  TKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLYEQMLT
         K +   + S+L AC + G L +G RI +  +++    N YV  A+L+++A+CGN+++A  VFN+I  K++L SWNTM+ GL VHG   +A++L+ +M  
Subjt:  TKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLYEQMLT

Query:  RRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAEVAAES
          +RPD VTF+ +L +C H G++ +G + F SME  + + P++EHY CLVDLLGR G L+EA  ++Q MPM P+ VIWGALLGAC  H+ V++A+   ++
Subjt:  RRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAEVAAES

Query:  LFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL
        L KL+P +PGNY +LSNIYA+A +W GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KSD+IY +L
Subjt:  LFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 (e value: 1.2e-106; % identity: 41.6)
Query:  KIHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLL-YYQMCLRGC--SPNQHSFTFLFAACASILSG
        KI+A  + +G+  + F++ K++    +I ++ YA  LF+ +  P VFLYN  I+A+  T +S  C ++  Y+  LR     P++ +F F+F +CAS+ S 
Subjt:  KIHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLL-YYQMCLRGC--SPNQHSFTFLFAACASILSG

Query:  YPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKA
        Y G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD  +WNS+++GYAR G M  A  LF  M  + +VSWTA+ISGY   G Y +A
Subjt:  YPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKA

Query:  LEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRC
        ++ F  ++   G +P+E+++ SVLP+CAQLG+L++G+ I  YA + GF K   V  A++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG  
Subjt:  LEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRC

Query:  SDALQLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFH
          A++ + +M   +++P+ +TF+GLL AC+H GM  +G   F+ M   +QI PK+EHY CL+D+L R+G+L+ A  + + MPM PDS IWG+LL +C   
Subjt:  SDALQLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFH

Query:  SNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
         N+++A VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  SNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

Arabidopsis top hits (e value, % identity, alignment)
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-107; % identity: 39.6)
Query:  KIHAYGLRNGVDHTKFLIEKL-----LQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSST--GHSHRCWLLYYQMCLRGC-SPNQHSFTFLFAACASILS
        ++ ++ + +G+ H+ FL  KL     L++ NLSYA  +FD    P   LY   + A+SS+   H+   +  +  M  R    PN   +  +  +   + S
Subjt:  KIHAYGLRNGVDHTKFLIEKL-----LQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSST--GHSHRCWLLYYQMCLRGC-SPNQHSFTFLFAACASILS

Query:  GYPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYA
         +   ++H H  KSGF   V   TALL  YA  +  +  ARQLFDEMS R+  +W +M++GYARSGD+  A+ LF  MP R+V SW A+++   QNG + 
Subjt:  GYPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYA

Query:  KALEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHG
        +A+ +F R+ NE   +PNEVT+  VL ACAQ G L + + I A+A +     +++VS ++++++ +CGN+EEA  VF ++ SK++L +WN+MI   A+HG
Subjt:  KALEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHG

Query:  RCSDALQLYEQML---TRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLG
        R  +A+ ++E+M+      ++PD +TF+GLL ACTHGG+V+KGR  F+ M ++F I P++EHY CL+DLLGR+G   EA  ++  M M  D  IWG+LL 
Subjt:  RCSDALQLYEQML---TRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLG

Query:  ACSFHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI
        AC  H +++LAEVA ++L  L P N G   +++N+Y   GNW    R RKM+K  +  K  G+S IE+ + +H+F   D+SH +++EIY +L  +
Subjt:  ACSFHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI

AT2G20540.1 mitochondrial editing factor 21 (e value: 8.5e-108; % identity: 41.6)
Query:  KIHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLL-YYQMCLRGC--SPNQHSFTFLFAACASILSG
        KI+A  + +G+  + F++ K++    +I ++ YA  LF+ +  P VFLYN  I+A+  T +S  C ++  Y+  LR     P++ +F F+F +CAS+ S 
Subjt:  KIHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLL-YYQMCLRGC--SPNQHSFTFLFAACASILSG

Query:  YPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKA
        Y G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD  +WNS+++GYAR G M  A  LF  M  + +VSWTA+ISGY   G Y +A
Subjt:  YPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKA

Query:  LEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRC
        ++ F  ++   G +P+E+++ SVLP+CAQLG+L++G+ I  YA + GF K   V  A++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG  
Subjt:  LEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRC

Query:  SDALQLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFH
          A++ + +M   +++P+ +TF+GLL AC+H GM  +G   F+ M   +QI PK+EHY CL+D+L R+G+L+ A  + + MPM PDS IWG+LL +C   
Subjt:  SDALQLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFH

Query:  SNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
         N+++A VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  SNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.8e-102 | %identity: 37.47
Query:  IHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQ
        +H   +++ V    F+   L+       +L  AC +F  I +  V  +N  I  F   G   +   L+ +M       +  +   + +ACA I +   G+
Subjt:  IHAYGLRNGVDHTKFLIEKLL----QIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQ

Query:  MLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF
         + ++  ++    ++    A+LDMY K G +  A++LFD M  +D  TW +M+ GYA S D  AA E+ + MP +++V+W ALIS Y QNGK  +AL +F
Subjt:  MLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF

Query:  LRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDAL
          L+ ++  K N++T+ S L ACAQ+GAL++GR I +Y +K+G   N +V+ A++ ++++CG++E++R+VFN +  KR++  W+ MI GLA+HG  ++A+
Subjt:  LRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDAL

Query:  QLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVE
         ++ +M    ++P+ VTF  +  AC+H G+V +   LF  MES + I P+ +HY+C+VD+LGRSG L++A   I+ MP+ P + +WGALLGAC  H+N+ 
Subjt:  QLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVE

Query:  LAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIK
        LAE+A   L +LEP N G +V+LSNIYA  G W  V+ LRK M+   + K  G S IE+   IHEF+  D +H  S+++Y  LH +   +K
Subjt:  LAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIK

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 3.0e-105 | %identity: 36.7
Query:  SSRKMEKEQLFGIAKKVQNLSGAAMVKIHAYGLRNGVDHTKFLIEKLLQIPNL----SYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCL
        SSR++ +E+L  +  K  NL+   + ++HA  +R  +     +  KL+   +L    + A  +F+ + +P V L N  I+A +     ++ + ++ +M  
Subjt:  SSRKMEKEQLFGIAKKVQNLSGAAMVKIHAYGLRNGVDHTKFLIEKLLQIPNL----SYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCL

Query:  RGCSPNQHSFTFLFAACASILSGYPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDE
         G   +  ++ FL  AC+        +M+H H  K G +SD++   AL+D Y+                                 K G LR AR+LFDE
Subjt:  RGCSPNQHSFTFLFAACASILSGYPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDE

Query:  MSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF------------------------LRLENER--------G
        M  RD  +WN+M+ GYAR  +M  A ELF +MP RN VSW+ ++ GY++ G    A  MF                        L  E +R        G
Subjt:  MSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMF------------------------LRLENER--------G

Query:  TKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLYEQMLT
         K +   + S+L AC + G L +G RI +  +++    N YV  A+L+++A+CGN+++A  VFN+I  K++L SWNTM+ GL VHG   +A++L+ +M  
Subjt:  TKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLYEQMLT

Query:  RRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAEVAAES
          +RPD VTF+ +L +C H G++ +G + F SME  + + P++EHY CLVDLLGR G L+EA  ++Q MPM P+ VIWGALLGAC  H+ V++A+   ++
Subjt:  RRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAEVAAES

Query:  LFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL
        L KL+P +PGNY +LSNIYA+A +W GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KSD+IY +L
Subjt:  LFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.2e-167 | %identity: 57.32
Query:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH
        ++HA+ LR GVD TK L+++LL IPNL YA  LFD       FLYNK IQA+      H   +LY  +   G  P+ H+F F+FAA AS  S  P ++LH
Subjt:  KIHAYGLRNGVDHTKFLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLH

Query:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL
        + F +SGF SD F  T L+  YAKLG L  AR++FDEMS RD P WN+MI GY R GDM AA+ELF  MP +NV SWT +ISG++QNG Y++AL+MFL +
Subjt:  AHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRL

Query:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY
        E ++  KPN +T+ SVLPACA LG L+IGRR+E YAR+NGFF N+YV  A +E++++CG I+ A+++F E+G++RNLCSWN+MI  LA HG+  +AL L+
Subjt:  ENERGTKPNEVTIASVLPACAQLGALDIGRRIEAYARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLY

Query:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE
         QML    +PD VTFVGLLLAC HGGMV KG+ELF+SME   +I+PKLEHY C++DLLGR G+LQEAY+LI+ MPM PD+V+WG LLGACSFH NVE+AE
Subjt:  EQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIAPKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAE

Query:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK
        +A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +K +K
Subjt:  VAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQK


Sequences
CDS sequence
ATGGATACTTTAACGAGCTCTGTATCAACCCTCAGAGTTCCGGCGACTCTCCACTCATCATCCCGCGACTTCTTTCACCTGAAAAACACTGCCCATCTTCCTCACCGCTC
CTCCTCTCGCAGATCCAGCTCAATCTCCAGACCCACTAATTCCCTCACTACCCACAAGCCCCTCCAGCCGCTCCCAGTCTCCCCCTGCCGCTCCTCTTCCCCGGCCGCCA
CTGGCTACGCCGCCGCCCTGGTCGGGGTAGCTCAATCCAGCGGCTCGCTTCACTCGGTTGCGCACGACGTTGGGAGATTCTCGAAGCTTCTGAGGACGAAGCAAATCGGA
AGGGTGTTGAACGACCCGTTCGTTGGGGAGGAAGAGAAGGGGCGGACGGTGAGAGAAGTTGCAGAAAAGGGAGGGTTTCAGCGGCAGGTGGTGAGGTTGACGAAGATGTT
GGTTGAGAAGAACAAGGTGGGGATTTTGAAACAAGTGTTAAGCGAATTCGAGAGGATTTACGATGAGTTGTGTGGAACTGAGGTGGTTTTTGTTTCTTCGAGCAGGAAGA
TGGAGAAAGAGCAGTTGTTTGGGATTGCTAAGAAAGTGCAGAACCTGAGTGGGGCTGCCATGGTGAAGATTCATGCTTACGGCCTCAGAAATGGCGTAGATCATACCAAA
TTCCTCATCGAAAAACTTCTGCAGATCCCAAATCTTTCATATGCTTGCACCCTGTTCGACCTTATTCCTAAGCCGACTGTTTTTCTCTACAACAAGTTCATTCAAGCATT
TTCTTCTACTGGTCACTCCCACCGATGTTGGTTGCTTTACTACCAAATGTGCCTCCGAGGCTGCTCCCCGAACCAGCATTCCTTTACCTTTCTCTTTGCCGCATGCGCTT
CGATTTTATCTGGTTACCCAGGTCAGATGCTTCATGCCCATTTTTGTAAGTCGGGATTTGCCTCAGATGTATTTGCTTTGACGGCATTGTTGGACATGTATGCCAAACTG
GGAATGTTGAGGTCCGCTCGCCAACTGTTTGATGAAATGTCTGTTCGAGACACACCCACTTGGAATTCGATGATTGCTGGTTATGCGAGGTCCGGGGACATGGGGGCAGC
GTTAGAATTGTTCAGCCGCATGCCTGTGAGAAATGTGGTGTCCTGGACAGCATTGATATCTGGGTATGCTCAGAATGGCAAGTATGCGAAGGCCTTGGAGATGTTTCTGA
GATTGGAAAACGAGAGAGGCACTAAACCAAATGAGGTGACCATAGCAAGTGTTCTTCCTGCCTGTGCACAGCTTGGGGCGTTGGATATTGGGAGGAGGATTGAAGCATAC
GCACGAAAGAATGGGTTTTTCAAAAACTTGTATGTAAGCATTGCGATACTGGAAGTGCATGCCAGGTGCGGTAATATTGAGGAAGCGAGGCAAGTTTTTAATGAGATTGG
AAGCAAAAGAAACTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCAGCGACGCTCTGCAGCTTTATGAACAAATGTTGACACGAAGAATGA
GACCCGATGATGTAACATTTGTGGGGCTTCTCTTAGCTTGCACTCATGGAGGCATGGTTGCGAAAGGCCGAGAACTCTTTGAATCAATGGAGAGTAAGTTTCAGATTGCT
CCCAAATTAGAGCACTATAGTTGCTTGGTTGATCTATTAGGCAGGTCTGGGGAGCTACAAGAAGCTTATAATCTTATTCAAAACATGCCAATGGTTCCCGATTCTGTAAT
ATGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATAGCAATGTTGAATTGGCTGAAGTAGCGGCTGAGTCTCTCTTCAAACTTGAGCCATGGAACCCTGGAAATTATGTCA
TTCTCTCTAACATTTATGCATCGGCTGGCAATTGGCGTGGAGTTGCAAGGCTGAGGAAGATGATGAAGGGAGGGCATATAACGAAGAGAGCAGGATATAGTTATATTGAA
GTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATCTGAAGAGTGATGAAATATATGCTTTACTTCATGGAATTTATGCAAATATTAAACATCAGAAGCC
TGCACCTCATTGTCAAAATGAAGTAGCAACGATGAGGCCGATATTTGTTGGGAACTTTGGGTATGACACCCGGCAATCTGAGCTGGAGCGGTTATTCGCCAAATACGGAA
GAGTTGAAAGGATCGACATGAAATCTGGTTTTGCTTTTGTTTACTTTGAAGATGAGAGGGATGCGGAAGATGCCATTCGTGGCCTTGATAATATGCCATTTGGTTATGAT
AGACGCAGATTGTCTGTGGAATGGGCTAGGGGTGAAAGAGGTCGTCATCGTGATGGATCCAAGTCATTGGCAAATCAGAGGCCAACCAAAACCTTGTTTGTAATCAACTT
TGATCCAATTCGTACCAGAGTTCGTGATATTGAAAGACACTTTGAACCCTATGGAAAGGTTCTCAATGTTCGCATAAGAAGGAACTTTGCATTTGTACAGTTTGAGACAC
AAGAGGATGCAACCAAAGCCCTCGAGTGTACCCACATGAGCAAAATATTAGACAGGGTTGTGTCAGTTGAGTATGCTTTGAGGGATGATGGTGAGAGGGGTGACCCTTAT
GATGATAGCCCTCGAAGAGCAGCTTATGGGCGGCCTGGGGATAGTCCTTATCGGAGGTCACCTAGTCCTGTGTTTCGCCGTCGACCAAGTCCTGACTATGGCCGAGCTCG
CAGCCCTGCTTATGATAGGTATAATGGCCCATATGAACGACGCAGGAGTCCTGAATATGGTCGAAATCGGAGCCCTGAATATGGTAGATTTCGCAGGCTCCTGCGCCATA
TGCCGGCGTACCACGGCTTTATGGCCCTCCGGCAGGCCCAGTGGGCTCCATTAAATAGAATTGAATGCTCACCGTATTCGTTACAAGGCCATCTGGATGGCTAA
mRNA sequence
ATGGATACTTTAACGAGCTCTGTATCAACCCTCAGAGTTCCGGCGACTCTCCACTCATCATCCCGCGACTTCTTTCACCTGAAAAACACTGCCCATCTTCCTCACCGCTC
CTCCTCTCGCAGATCCAGCTCAATCTCCAGACCCACTAATTCCCTCACTACCCACAAGCCCCTCCAGCCGCTCCCAGTCTCCCCCTGCCGCTCCTCTTCCCCGGCCGCCA
CTGGCTACGCCGCCGCCCTGGTCGGGGTAGCTCAATCCAGCGGCTCGCTTCACTCGGTTGCGCACGACGTTGGGAGATTCTCGAAGCTTCTGAGGACGAAGCAAATCGGA
AGGGTGTTGAACGACCCGTTCGTTGGGGAGGAAGAGAAGGGGCGGACGGTGAGAGAAGTTGCAGAAAAGGGAGGGTTTCAGCGGCAGGTGGTGAGGTTGACGAAGATGTT
GGTTGAGAAGAACAAGGTGGGGATTTTGAAACAAGTGTTAAGCGAATTCGAGAGGATTTACGATGAGTTGTGTGGAACTGAGGTGGTTTTTGTTTCTTCGAGCAGGAAGA
TGGAGAAAGAGCAGTTGTTTGGGATTGCTAAGAAAGTGCAGAACCTGAGTGGGGCTGCCATGGTGAAGATTCATGCTTACGGCCTCAGAAATGGCGTAGATCATACCAAA
TTCCTCATCGAAAAACTTCTGCAGATCCCAAATCTTTCATATGCTTGCACCCTGTTCGACCTTATTCCTAAGCCGACTGTTTTTCTCTACAACAAGTTCATTCAAGCATT
TTCTTCTACTGGTCACTCCCACCGATGTTGGTTGCTTTACTACCAAATGTGCCTCCGAGGCTGCTCCCCGAACCAGCATTCCTTTACCTTTCTCTTTGCCGCATGCGCTT
CGATTTTATCTGGTTACCCAGGTCAGATGCTTCATGCCCATTTTTGTAAGTCGGGATTTGCCTCAGATGTATTTGCTTTGACGGCATTGTTGGACATGTATGCCAAACTG
GGAATGTTGAGGTCCGCTCGCCAACTGTTTGATGAAATGTCTGTTCGAGACACACCCACTTGGAATTCGATGATTGCTGGTTATGCGAGGTCCGGGGACATGGGGGCAGC
GTTAGAATTGTTCAGCCGCATGCCTGTGAGAAATGTGGTGTCCTGGACAGCATTGATATCTGGGTATGCTCAGAATGGCAAGTATGCGAAGGCCTTGGAGATGTTTCTGA
GATTGGAAAACGAGAGAGGCACTAAACCAAATGAGGTGACCATAGCAAGTGTTCTTCCTGCCTGTGCACAGCTTGGGGCGTTGGATATTGGGAGGAGGATTGAAGCATAC
GCACGAAAGAATGGGTTTTTCAAAAACTTGTATGTAAGCATTGCGATACTGGAAGTGCATGCCAGGTGCGGTAATATTGAGGAAGCGAGGCAAGTTTTTAATGAGATTGG
AAGCAAAAGAAACTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCAGCGACGCTCTGCAGCTTTATGAACAAATGTTGACACGAAGAATGA
GACCCGATGATGTAACATTTGTGGGGCTTCTCTTAGCTTGCACTCATGGAGGCATGGTTGCGAAAGGCCGAGAACTCTTTGAATCAATGGAGAGTAAGTTTCAGATTGCT
CCCAAATTAGAGCACTATAGTTGCTTGGTTGATCTATTAGGCAGGTCTGGGGAGCTACAAGAAGCTTATAATCTTATTCAAAACATGCCAATGGTTCCCGATTCTGTAAT
ATGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATAGCAATGTTGAATTGGCTGAAGTAGCGGCTGAGTCTCTCTTCAAACTTGAGCCATGGAACCCTGGAAATTATGTCA
TTCTCTCTAACATTTATGCATCGGCTGGCAATTGGCGTGGAGTTGCAAGGCTGAGGAAGATGATGAAGGGAGGGCATATAACGAAGAGAGCAGGATATAGTTATATTGAA
GTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATCTGAAGAGTGATGAAATATATGCTTTACTTCATGGAATTTATGCAAATATTAAACATCAGAAGCC
TGCACCTCATTGTCAAAATGAAGTAGCAACGATGAGGCCGATATTTGTTGGGAACTTTGGGTATGACACCCGGCAATCTGAGCTGGAGCGGTTATTCGCCAAATACGGAA
GAGTTGAAAGGATCGACATGAAATCTGGTTTTGCTTTTGTTTACTTTGAAGATGAGAGGGATGCGGAAGATGCCATTCGTGGCCTTGATAATATGCCATTTGGTTATGAT
AGACGCAGATTGTCTGTGGAATGGGCTAGGGGTGAAAGAGGTCGTCATCGTGATGGATCCAAGTCATTGGCAAATCAGAGGCCAACCAAAACCTTGTTTGTAATCAACTT
TGATCCAATTCGTACCAGAGTTCGTGATATTGAAAGACACTTTGAACCCTATGGAAAGGTTCTCAATGTTCGCATAAGAAGGAACTTTGCATTTGTACAGTTTGAGACAC
AAGAGGATGCAACCAAAGCCCTCGAGTGTACCCACATGAGCAAAATATTAGACAGGGTTGTGTCAGTTGAGTATGCTTTGAGGGATGATGGTGAGAGGGGTGACCCTTAT
GATGATAGCCCTCGAAGAGCAGCTTATGGGCGGCCTGGGGATAGTCCTTATCGGAGGTCACCTAGTCCTGTGTTTCGCCGTCGACCAAGTCCTGACTATGGCCGAGCTCG
CAGCCCTGCTTATGATAGGTATAATGGCCCATATGAACGACGCAGGAGTCCTGAATATGGTCGAAATCGGAGCCCTGAATATGGTAGATTTCGCAGGCTCCTGCGCCATA
TGCCGGCGTACCACGGCTTTATGGCCCTCCGGCAGGCCCAGTGGGCTCCATTAAATAGAATTGAATGCTCACCGTATTCGTTACAAGGCCATCTGGATGGCTAA
Protein sequence
MDTLTSSVSTLRVPATLHSSSRDFFHLKNTAHLPHRSSSRRSSSISRPTNSLTTHKPLQPLPVSPCRSSSPAATGYAAALVGVAQSSGSLHSVAHDVGRFSKLLRTKQIG
RVLNDPFVGEEEKGRTVREVAEKGGFQRQVVRLTKMLVEKNKVGILKQVLSEFERIYDELCGTEVVFVSSSRKMEKEQLFGIAKKVQNLSGAAMVKIHAYGLRNGVDHTK
FLIEKLLQIPNLSYACTLFDLIPKPTVFLYNKFIQAFSSTGHSHRCWLLYYQMCLRGCSPNQHSFTFLFAACASILSGYPGQMLHAHFCKSGFASDVFALTALLDMYAKL
GMLRSARQLFDEMSVRDTPTWNSMIAGYARSGDMGAALELFSRMPVRNVVSWTALISGYAQNGKYAKALEMFLRLENERGTKPNEVTIASVLPACAQLGALDIGRRIEAY
ARKNGFFKNLYVSIAILEVHARCGNIEEARQVFNEIGSKRNLCSWNTMIMGLAVHGRCSDALQLYEQMLTRRMRPDDVTFVGLLLACTHGGMVAKGRELFESMESKFQIA
PKLEHYSCLVDLLGRSGELQEAYNLIQNMPMVPDSVIWGALLGACSFHSNVELAEVAAESLFKLEPWNPGNYVILSNIYASAGNWRGVARLRKMMKGGHITKRAGYSYIE
VGDGIHEFIVEDRSHLKSDEIYALLHGIYANIKHQKPAPHCQNEVATMRPIFVGNFGYDTRQSELERLFAKYGRVERIDMKSGFAFVYFEDERDAEDAIRGLDNMPFGYD
RRRLSVEWARGERGRHRDGSKSLANQRPTKTLFVINFDPIRTRVRDIERHFEPYGKVLNVRIRRNFAFVQFETQEDATKALECTHMSKILDRVVSVEYALRDDGERGDPY
DDSPRRAAYGRPGDSPYRRSPSPVFRRRPSPDYGRARSPAYDRYNGPYERRRSPEYGRNRSPEYGRFRRLLRHMPAYHGFMALRQAQWAPLNRIECSPYSLQGHLDG