; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1244 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1244
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC04:20529762..20531309
RNA-Seq ExpressionMC04g1244
SyntenyMC04g1244
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465802.1 PREDICTED: pentatricopeptide repeat-containing protein At4g08210 isoform X2 [Cucumis melo]0.088.76Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NII+SGYSKAGLM+EAEKLFHCMP PNVVSWNSMIAGFADNGSQRALEFVSMM+ + +KLD FTFPCALKISALHGLL++GKQ+HTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCN LIEAVKLFDQ S+FN+SISDNLALWNSMLSGYVINNCD+AALNL+SEIHCSGALLDSYTFGGALKVCIN+LSRRV LQ+HGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKL +IDDAL +F RLPRKDIIAWSGLIMGCAQ+GLNWLAFSMF+DM+   +EID FVIST LKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF C+QEKDIVSWTGIIVGCGQNG+AAEA+RFFHEM+Q G+ PNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLA  GLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLIN VADGLLEATPNDPSTYVTLSNAYASLGMW TLSKAREA+K
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        T G K+AGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

XP_016903440.1 PREDICTED: pentatricopeptide repeat-containing protein At4g08210 isoform X1 [Cucumis melo]0.088.76Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NII+SGYSKAGLM+EAEKLFHCMP PNVVSWNSMIAGFADNGSQRALEFVSMM+ + +KLD FTFPCALKISALHGLL++GKQ+HTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCN LIEAVKLFDQ S+FN+SISDNLALWNSMLSGYVINNCD+AALNL+SEIHCSGALLDSYTFGGALKVCIN+LSRRV LQ+HGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKL +IDDAL +F RLPRKDIIAWSGLIMGCAQ+GLNWLAFSMF+DM+   +EID FVIST LKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF C+QEKDIVSWTGIIVGCGQNG+AAEA+RFFHEM+Q G+ PNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLA  GLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLIN VADGLLEATPNDPSTYVTLSNAYASLGMW TLSKAREA+K
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        T G K+AGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

XP_022141502.1 pentatricopeptide repeat-containing protein At4g08210 [Momordica charantia]0.0100Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        TVGAKRAGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

XP_022990073.1 pentatricopeptide repeat-containing protein At4g08210 [Cucurbita maxima]0.089.53Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NIIISGYSKAGLM+EAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVS+M+ +G+KLD FTFPCALKISALHGLL++GKQIH+YVTKLGY SSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCNGL EAVKLFDQHS+FN+SIS+NLALWNSMLSGYVINNCD+AALNLIS IHCSG ++DSYTFGGALKVCIN+LS RV  QVHGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKLG IDDAL LF RLPRKDIIAWSGLI+GCAQMGLNWLAFSMF+DM+  AHEID FVISTTLKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF CIQEKDIV+WTGIIVGCGQNGRAAEAVRFFHEMIQ GLNPNEIT LGVLSACRYAGL+EEAR+IFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLALAGLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLINSVA+GLLEATP+DPSTYV+LSNAYASLGMW  LSKAREAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
         VG KRAGLSWIEV+S
Subjt:  TVGAKRAGLSWIEVSS

XP_023527633.1 pentatricopeptide repeat-containing protein At4g08210 [Cucurbita pepo subsp. pepo]0.089.73Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NIIISGYSKAGLM+EAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVS+M+ +G+KLD FTFPCALKISALHGLL++GKQIH+YVTKLGY SSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCNGL EAVKLFDQHS+FN SIS+NLALWNSMLSGYVINNCD+AALNLIS IHCSG ++DSYTFGGALKVCIN+LS RV  QVHGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKLG IDDAL LF RLPRKDIIAWSGLI+GCAQMGLNWLAFSMF+DM+  AHEID FVISTTLKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF CIQEKDIV+WTGIIVGCGQNGRAAEAVRFFHEMIQ GLNPNEITFLGVLSACRYAGL+EEARSIFNSMKS+
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLALAGLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLINSVA GLLEATP+DPSTYV+LSNAYASLGMW  LSKAREAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
         VG KRAGLSWIEV+S
Subjt:  TVGAKRAGLSWIEVSS

TrEMBL top hitse value%identityAlignment
A0A1S3CPP9 pentatricopeptide repeat-containing protein At4g08210 isoform X20.088.76Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NII+SGYSKAGLM+EAEKLFHCMP PNVVSWNSMIAGFADNGSQRALEFVSMM+ + +KLD FTFPCALKISALHGLL++GKQ+HTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCN LIEAVKLFDQ S+FN+SISDNLALWNSMLSGYVINNCD+AALNL+SEIHCSGALLDSYTFGGALKVCIN+LSRRV LQ+HGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKL +IDDAL +F RLPRKDIIAWSGLIMGCAQ+GLNWLAFSMF+DM+   +EID FVIST LKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF C+QEKDIVSWTGIIVGCGQNG+AAEA+RFFHEM+Q G+ PNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLA  GLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLIN VADGLLEATPNDPSTYVTLSNAYASLGMW TLSKAREA+K
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        T G K+AGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

A0A1S3CPQ2 pentatricopeptide repeat-containing protein At4g08210 isoform X30.088.76Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NII+SGYSKAGLM+EAEKLFHCMP PNVVSWNSMIAGFADNGSQRALEFVSMM+ + +KLD FTFPCALKISALHGLL++GKQ+HTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCN LIEAVKLFDQ S+FN+SISDNLALWNSMLSGYVINNCD+AALNL+SEIHCSGALLDSYTFGGALKVCIN+LSRRV LQ+HGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKL +IDDAL +F RLPRKDIIAWSGLIMGCAQ+GLNWLAFSMF+DM+   +EID FVIST LKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF C+QEKDIVSWTGIIVGCGQNG+AAEA+RFFHEM+Q G+ PNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLA  GLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLIN VADGLLEATPNDPSTYVTLSNAYASLGMW TLSKAREA+K
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        T G K+AGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

A0A5A7TZR3 Pentatricopeptide repeat-containing protein0.088.76Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NII+SGYSKAGLM+EAEKLFHCMP PNVVSWNSMIAGFADNGSQRALEFVSMM+ + +KLD FTFPCALKISALHGLL++GKQ+HTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCN LIEAVKLFDQ S+FN+SISDNLALWNSMLSGYVINNCD+AALNL+SEIHCSGALLDSYTFGGALKVCIN+LSRRV LQ+HGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKL +IDDAL +F RLPRKDIIAWSGLIMGCAQ+GLNWLAFSMF+DM+   +EID FVIST LKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF C+QEKDIVSWTGIIVGCGQNG+AAEA+RFFHEM+Q G+ PNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLA  GLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLIN VADGLLEATPNDPSTYVTLSNAYASLGMW TLSKAREA+K
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        T G K+AGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

A0A6J1CK13 pentatricopeptide repeat-containing protein At4g082100.0100Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
        TVGAKRAGLSWIEVSS
Subjt:  TVGAKRAGLSWIEVSS

A0A6J1JS73 pentatricopeptide repeat-containing protein At4g082100.089.53Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        NIIISGYSKAGLM+EAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVS+M+ +G+KLD FTFPCALKISALHGLL++GKQIH+YVTKLGY SSCFTL
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNCNGL EAVKLFDQHS+FN+SIS+NLALWNSMLSGYVINNCD+AALNLIS IHCSG ++DSYTFGGALKVCIN+LS RV  QVHGLIVTCGY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDYV+GSILVDLYAKLG IDDAL LF RLPRKDIIAWSGLI+GCAQMGLNWLAFSMF+DM+  AHEID FVISTTLKVCSNLASLRSGKQVHAFCVK G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YEMEGFTITSLLDMYSKCGEIEDALTLF CIQEKDIV+WTGIIVGCGQNGRAAEAVRFFHEMIQ GLNPNEIT LGVLSACRYAGL+EEAR+IFNSMKSV
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEPHLEHYCCMVDLLALAGLPEEAEK++ANMPF+PDQTTWRTLLGA GTRND KLINSVA+GLLEATP+DPSTYV+LSNAYASLGMW  LSKAREAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWIEVSS
         VG KRAGLSWIEV+S
Subjt:  TVGAKRAGLSWIEVSS

SwissProt top hitse value%identityAlignment
P0C898 Putative pentatricopeptide repeat-containing protein At3g151303.8e-8834.81Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQR-ALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT
        N +I  Y K    L A K+F  MP+ NVVSW+++++G   NG  + +L   S M  +G+  + FTF   LK   L   L  G QIH +  K+G+E     
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQR-ALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT

Query:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSG--ALLDSYTFGGALKVCINVLSRRVALQVHGLIVT
         ++L+DMYS C  + EA K+      F   +  +L  WN+M++G+V       AL+    +  +      D +T    LK C +        Q+HG +V 
Subjt:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSG--ALLDSYTFGGALKVCINVLSRRVALQVHGLIVT

Query:  CGYEL--DYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAF
         G+       I   LVDLY K G +  A + F ++  K +I+WS LI+G AQ G    A  +F+ +     +ID F +S+ + V ++ A LR GKQ+ A 
Subjt:  CGYEL--DYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAF

Query:  CVKGGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFN
         VK    +E   + S++DMY KCG +++A   F+ +Q KD++SWT +I G G++G   ++VR F+EM++  + P+E+ +L VLSAC ++G+++E   +F+
Subjt:  CVKGGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFN

Query:  SMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKA
         +   +G++P +EHY C+VDLL  AG  +EA+ ++  MP KP+   W+TLL       D +L   V   LL     +P+ YV +SN Y   G W     A
Subjt:  SMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKA

Query:  REAAKTVGAKR-AGLSWIEV
        RE     G K+ AG+SW+E+
Subjt:  REAAKTVGAKR-AGLSWIEV

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136002.1e-8634.29Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGK--QIHTYVTKLGYESSC
        N +++G +K G + EA+ LF  MP+ +  +WNSM++GFA +   + AL + +MM+ EG  L+ ++F  A  +SA  GL  M K  Q+H+ + K  + S  
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGK--QIHTYVTKLGYESSC

Query:  FTLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIV-
        +  SAL+DMYS C  + +A ++FD+          N+  WNS+++ +  N     AL++   +  S    D  T    +  C ++ + +V  +VHG +V 
Subjt:  FTLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIV-

Query:  TCGYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIA-------------------------------WSGLIMGCAQMGLNWLAFSMF----RDM
              D ++ +  VD+YAK   I +A  +F  +P +++IA                               W+ LI G  Q G N  A S+F    R+ 
Subjt:  TCGYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIA-------------------------------WSGLIMGCAQMGLNWLAFSMF----RDM

Query:  VSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGGYEMEG------FTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEA
        V   H    +  +  LK C++LA L  G Q H   +K G++ +       F   SL+DMY KCG +E+   +F  + E+D VSW  +I+G  QNG   EA
Subjt:  VSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGGYEMEG------FTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEA

Query:  VRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDP
        +  F EM++ G  P+ IT +GVLSAC +AG VEE R  F+SM   +G+ P  +HY CMVDLL  AG  EEA+ M+  MP +PD   W +LL A     + 
Subjt:  VRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDP

Query:  KLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAKTVG-AKRAGLSWIEV
         L   VA+ LLE  P++   YV LSN YA LG W+ +   R++ +  G  K+ G SWI++
Subjt:  KLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAKTVG-AKRAGLSWIEV

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g331703.8e-8835.84Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSMMNTEGVKLDGFTFPCALK-ISALHGLLLMGKQIHTYVTKLGYESSCF
        N +I+ Y K      A  +F  M + +++SWNS+IAG A NG +  A+     +   G+K D +T    LK  S+L   L + KQ+H +  K+   S  F
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSMMNTEGVKLDGFTFPCALK-ISALHGLLLMGKQIHTYVTKLGYESSCF

Query:  TLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTC
          +ALID YS    + EA  LF++H+        +L  WN+M++GY  ++     L L + +H  G   D +T     K C  + +     QVH   +  
Subjt:  TLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTC

Query:  GYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVK
        GY+LD  + S ++D+Y K G +  A   F  +P  D +AW+ +I GC + G    AF +F  M       D+F I+T  K  S L +L  G+Q+HA  +K
Subjt:  GYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVK

Query:  GGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK
             + F  TSL+DMY+KCG I+DA  LF  I+  +I +W  ++VG  Q+G   E ++ F +M  +G+ P+++TF+GVLSAC ++GLV EA     SM 
Subjt:  GGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK

Query:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA
          YG++P +EHY C+ D L  AGL ++AE ++ +M  +   + +RTLL A   + D +    VA  LLE  P D S YV LSN YA+   W  +  AR  
Subjt:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA

Query:  AKTVGAKR-AGLSWIEVSS
         K    K+  G SWIEV +
Subjt:  AKTVGAKR-AGLSWIEVSS

Q9STE1 Pentatricopeptide repeat-containing protein At4g213004.2e-8734.56Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT
        N ++S YSK G   +A KLF  M + + V+WN MI+G+  +G  + +L F   M + GV  D  TF   L   +    L   KQIH Y+ +       F 
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT

Query:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCG
         SALID Y  C G+  A  +F Q +      S ++ ++ +M+SGY+ N     +L +   +       +  T    L V   +L+ ++  ++HG I+  G
Subjt:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCG

Query:  YELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKG
        ++    IG  ++D+YAK G ++ A E+F+RL ++DI++W+ +I  CAQ      A  +FR M  S    D   IS  L  C+NL S   GK +H F +K 
Subjt:  YELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKG

Query:  GYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQ-VGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK
            + ++ ++L+DMY+KCG ++ A+ +F  ++EK+IVSW  II  CG +G+  +++  FHEM++  G+ P++ITFL ++S+C + G V+E    F SM 
Subjt:  GYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQ-VGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK

Query:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA
          YG++P  EHY C+VDL   AG   EA + V +MPF PD   W TLLGA     + +L    +  L++  P++   YV +SNA+A+   W++++K R  
Subjt:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA

Query:  AKTVGAKR-AGLSWIEVS
         K    ++  G SWIE++
Subjt:  AKTVGAKR-AGLSWIEVS

Q9SUF9 Pentatricopeptide repeat-containing protein At4g082107.7e-17457.81Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        N +ISGY KAGLM EA  LFH MPQPNVVSWN +I+GF D GS RALEF+  M  EG+ LDGF  PC LK  +  GLL MGKQ+H  V K G ESS F +
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNC  LI A  +F Q      +++ ++A+WNSMLSG++IN  ++AAL L+ +I+ S    DSYT  GALK+CIN ++ R+ LQVH L+V  GY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDY++GSILVDL+A +G+I DA +LF RLP KDIIA+SGLI GC + G N LAF +FR+++    + D F++S  LKVCS+LASL  GKQ+H  C+K G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YE E  T T+L+DMY KCGEI++ + LF  + E+D+VSWTGIIVG GQNGR  EA R+FH+MI +G+ PN++TFLG+LSACR++GL+EEARS   +MKS 
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEP+LEHY C+VDLL  AGL +EA +++  MP +PD+T W +LL A GT  +  L+  +A+ LL+  P+DPS Y +LSNAYA+LGMW  LSK REAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWI
         +GAK +G+SWI
Subjt:  TVGAKRAGLSWI

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-8734.29Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGK--QIHTYVTKLGYESSC
        N +++G +K G + EA+ LF  MP+ +  +WNSM++GFA +   + AL + +MM+ EG  L+ ++F  A  +SA  GL  M K  Q+H+ + K  + S  
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGK--QIHTYVTKLGYESSC

Query:  FTLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIV-
        +  SAL+DMYS C  + +A ++FD+          N+  WNS+++ +  N     AL++   +  S    D  T    +  C ++ + +V  +VHG +V 
Subjt:  FTLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIV-

Query:  TCGYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIA-------------------------------WSGLIMGCAQMGLNWLAFSMF----RDM
              D ++ +  VD+YAK   I +A  +F  +P +++IA                               W+ LI G  Q G N  A S+F    R+ 
Subjt:  TCGYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIA-------------------------------WSGLIMGCAQMGLNWLAFSMF----RDM

Query:  VSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGGYEMEG------FTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEA
        V   H    +  +  LK C++LA L  G Q H   +K G++ +       F   SL+DMY KCG +E+   +F  + E+D VSW  +I+G  QNG   EA
Subjt:  VSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGGYEMEG------FTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEA

Query:  VRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDP
        +  F EM++ G  P+ IT +GVLSAC +AG VEE R  F+SM   +G+ P  +HY CMVDLL  AG  EEA+ M+  MP +PD   W +LL A     + 
Subjt:  VRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDP

Query:  KLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAKTVG-AKRAGLSWIEV
         L   VA+ LLE  P++   YV LSN YA LG W+ +   R++ +  G  K+ G SWI++
Subjt:  KLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAKTVG-AKRAGLSWIEV

AT3G15130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-8934.81Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQR-ALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT
        N +I  Y K    L A K+F  MP+ NVVSW+++++G   NG  + +L   S M  +G+  + FTF   LK   L   L  G QIH +  K+G+E     
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQR-ALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT

Query:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSG--ALLDSYTFGGALKVCINVLSRRVALQVHGLIVT
         ++L+DMYS C  + EA K+      F   +  +L  WN+M++G+V       AL+    +  +      D +T    LK C +        Q+HG +V 
Subjt:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSG--ALLDSYTFGGALKVCINVLSRRVALQVHGLIVT

Query:  CGYEL--DYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAF
         G+       I   LVDLY K G +  A + F ++  K +I+WS LI+G AQ G    A  +F+ +     +ID F +S+ + V ++ A LR GKQ+ A 
Subjt:  CGYEL--DYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAF

Query:  CVKGGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFN
         VK    +E   + S++DMY KCG +++A   F+ +Q KD++SWT +I G G++G   ++VR F+EM++  + P+E+ +L VLSAC ++G+++E   +F+
Subjt:  CVKGGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFN

Query:  SMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKA
         +   +G++P +EHY C+VDLL  AG  +EA+ ++  MP KP+   W+TLL       D +L   V   LL     +P+ YV +SN Y   G W     A
Subjt:  SMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKA

Query:  REAAKTVGAKR-AGLSWIEV
        RE     G K+ AG+SW+E+
Subjt:  REAAKTVGAKR-AGLSWIEV

AT4G08210.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.5e-17557.81Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL
        N +ISGY KAGLM EA  LFH MPQPNVVSWN +I+GF D GS RALEF+  M  EG+ LDGF  PC LK  +  GLL MGKQ+H  V K G ESS F +
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTL

Query:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY
        SALIDMYSNC  LI A  +F Q      +++ ++A+WNSMLSG++IN  ++AAL L+ +I+ S    DSYT  GALK+CIN ++ R+ LQVH L+V  GY
Subjt:  SALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGY

Query:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG
        ELDY++GSILVDL+A +G+I DA +LF RLP KDIIA+SGLI GC + G N LAF +FR+++    + D F++S  LKVCS+LASL  GKQ+H  C+K G
Subjt:  ELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGG

Query:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV
        YE E  T T+L+DMY KCGEI++ + LF  + E+D+VSWTGIIVG GQNGR  EA R+FH+MI +G+ PN++TFLG+LSACR++GL+EEARS   +MKS 
Subjt:  YEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSV

Query:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK
        YGLEP+LEHY C+VDLL  AGL +EA +++  MP +PD+T W +LL A GT  +  L+  +A+ LL+  P+DPS Y +LSNAYA+LGMW  LSK REAAK
Subjt:  YGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAK

Query:  TVGAKRAGLSWI
         +GAK +G+SWI
Subjt:  TVGAKRAGLSWI

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-8834.56Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT
        N ++S YSK G   +A KLF  M + + V+WN MI+G+  +G  + +L F   M + GV  D  TF   L   +    L   KQIH Y+ +       F 
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFT

Query:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCG
         SALID Y  C G+  A  +F Q +      S ++ ++ +M+SGY+ N     +L +   +       +  T    L V   +L+ ++  ++HG I+  G
Subjt:  LSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCG

Query:  YELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKG
        ++    IG  ++D+YAK G ++ A E+F+RL ++DI++W+ +I  CAQ      A  +FR M  S    D   IS  L  C+NL S   GK +H F +K 
Subjt:  YELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKG

Query:  GYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQ-VGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK
            + ++ ++L+DMY+KCG ++ A+ +F  ++EK+IVSW  II  CG +G+  +++  FHEM++  G+ P++ITFL ++S+C + G V+E    F SM 
Subjt:  GYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQ-VGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK

Query:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA
          YG++P  EHY C+VDL   AG   EA + V +MPF PD   W TLLGA     + +L    +  L++  P++   YV +SNA+A+   W++++K R  
Subjt:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA

Query:  AKTVGAKR-AGLSWIEVS
         K    ++  G SWIE++
Subjt:  AKTVGAKR-AGLSWIEVS

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-8935.84Show/hide
Query:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSMMNTEGVKLDGFTFPCALK-ISALHGLLLMGKQIHTYVTKLGYESSCF
        N +I+ Y K      A  +F  M + +++SWNS+IAG A NG +  A+     +   G+K D +T    LK  S+L   L + KQ+H +  K+   S  F
Subjt:  NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSMMNTEGVKLDGFTFPCALK-ISALHGLLLMGKQIHTYVTKLGYESSCF

Query:  TLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTC
          +ALID YS    + EA  LF++H+        +L  WN+M++GY  ++     L L + +H  G   D +T     K C  + +     QVH   +  
Subjt:  TLSALIDMYSNCNGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTC

Query:  GYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVK
        GY+LD  + S ++D+Y K G +  A   F  +P  D +AW+ +I GC + G    AF +F  M       D+F I+T  K  S L +L  G+Q+HA  +K
Subjt:  GYELDYVIGSILVDLYAKLGSIDDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVK

Query:  GGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK
             + F  TSL+DMY+KCG I+DA  LF  I+  +I +W  ++VG  Q+G   E ++ F +M  +G+ P+++TF+GVLSAC ++GLV EA     SM 
Subjt:  GGYEMEGFTITSLLDMYSKCGEIEDALTLFSCIQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMK

Query:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA
          YG++P +EHY C+ D L  AGL ++AE ++ +M  +   + +RTLL A   + D +    VA  LLE  P D S YV LSN YA+   W  +  AR  
Subjt:  SVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQTTWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREA

Query:  AKTVGAKR-AGLSWIEVSS
         K    K+  G SWIEV +
Subjt:  AKTVGAKR-AGLSWIEVSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AACATTATCATTTCTGGCTATAGTAAGGCTGGTTTGATGTTGGAGGCCGAAAAACTTTTTCATTGTATGCCACAGCCAAATGTTGTATCTTGGAACAGCATGATTGCTGG
TTTTGCAGACAATGGGAGTCAGCGCGCATTGGAGTTTGTGTCCATGATGAATACAGAGGGCGTTAAGCTTGATGGTTTTACGTTTCCATGTGCTCTTAAGATCAGTGCGC
TTCATGGGTTATTACTCATGGGGAAACAAATTCATACCTATGTCACCAAGTTGGGTTATGAATCTAGCTGTTTTACTCTGTCTGCCCTGATTGATATGTATTCGAATTGC
AATGGCCTGATCGAAGCAGTCAAATTATTTGACCAACACTCTACTTTCAACTCTTCCATTTCTGATAACCTGGCATTGTGGAATTCGATGCTCTCAGGATATGTTATTAA
CAATTGTGACAAAGCAGCTTTGAATCTGATTTCAGAAATCCATTGCTCGGGTGCATTATTGGACTCTTACACCTTTGGTGGTGCTTTAAAGGTTTGTATCAACGTATTAA
GTCGGAGGGTTGCTCTTCAAGTACATGGTTTGATCGTCACCTGTGGTTATGAGTTGGATTATGTTATTGGAAGCATTCTTGTGGATCTTTATGCAAAACTGGGGAGCATT
GATGACGCATTGGAACTGTTCCAGAGGCTTCCGAGGAAAGACATCATAGCTTGGTCCGGTTTGATCATGGGATGTGCTCAAATGGGATTAAACTGGCTAGCTTTCTCAAT
GTTCAGAGATATGGTTTCGTCGGCTCATGAAATAGATGATTTTGTCATTTCCACTACTCTGAAAGTCTGCTCTAATTTAGCATCTCTTAGAAGTGGAAAGCAGGTTCATG
CATTCTGTGTCAAGGGTGGGTATGAAATGGAGGGGTTCACAATCACATCCCTTCTTGATATGTATTCAAAATGCGGTGAAATTGAGGATGCGTTAACGTTGTTTAGCTGT
ATACAAGAAAAAGACATAGTAAGTTGGACTGGGATCATTGTAGGATGTGGACAAAATGGAAGGGCAGCTGAGGCAGTCAGGTTTTTTCACGAGATGATTCAAGTTGGGCT
AAATCCAAATGAGATCACCTTTCTAGGGGTTCTTTCTGCATGTAGATATGCTGGTTTGGTTGAAGAGGCACGAAGTATATTTAATTCCATGAAATCTGTATATGGATTAG
AACCTCATTTAGAGCATTATTGCTGCATGGTCGATCTTCTTGCTCTAGCGGGACTTCCTGAAGAAGCTGAGAAAATGGTTGCAAATATGCCGTTTAAGCCAGATCAGACC
ACATGGCGCACTTTGCTAGGGGCAATTGGAACTCGCAATGATCCGAAGCTTATTAACAGTGTTGCTGATGGCCTTCTTGAAGCCACACCAAATGACCCTTCTACGTACGT
GACGCTTTCAAATGCTTATGCATCACTGGGAATGTGGCAAACCCTGAGCAAAGCGAGAGAGGCTGCCAAAACAGTGGGAGCAAAAAGAGCTGGGTTGAGCTGGATTGAGG
TTTCGAGT
mRNA sequenceShow/hide mRNA sequence
AACATTATCATTTCTGGCTATAGTAAGGCTGGTTTGATGTTGGAGGCCGAAAAACTTTTTCATTGTATGCCACAGCCAAATGTTGTATCTTGGAACAGCATGATTGCTGG
TTTTGCAGACAATGGGAGTCAGCGCGCATTGGAGTTTGTGTCCATGATGAATACAGAGGGCGTTAAGCTTGATGGTTTTACGTTTCCATGTGCTCTTAAGATCAGTGCGC
TTCATGGGTTATTACTCATGGGGAAACAAATTCATACCTATGTCACCAAGTTGGGTTATGAATCTAGCTGTTTTACTCTGTCTGCCCTGATTGATATGTATTCGAATTGC
AATGGCCTGATCGAAGCAGTCAAATTATTTGACCAACACTCTACTTTCAACTCTTCCATTTCTGATAACCTGGCATTGTGGAATTCGATGCTCTCAGGATATGTTATTAA
CAATTGTGACAAAGCAGCTTTGAATCTGATTTCAGAAATCCATTGCTCGGGTGCATTATTGGACTCTTACACCTTTGGTGGTGCTTTAAAGGTTTGTATCAACGTATTAA
GTCGGAGGGTTGCTCTTCAAGTACATGGTTTGATCGTCACCTGTGGTTATGAGTTGGATTATGTTATTGGAAGCATTCTTGTGGATCTTTATGCAAAACTGGGGAGCATT
GATGACGCATTGGAACTGTTCCAGAGGCTTCCGAGGAAAGACATCATAGCTTGGTCCGGTTTGATCATGGGATGTGCTCAAATGGGATTAAACTGGCTAGCTTTCTCAAT
GTTCAGAGATATGGTTTCGTCGGCTCATGAAATAGATGATTTTGTCATTTCCACTACTCTGAAAGTCTGCTCTAATTTAGCATCTCTTAGAAGTGGAAAGCAGGTTCATG
CATTCTGTGTCAAGGGTGGGTATGAAATGGAGGGGTTCACAATCACATCCCTTCTTGATATGTATTCAAAATGCGGTGAAATTGAGGATGCGTTAACGTTGTTTAGCTGT
ATACAAGAAAAAGACATAGTAAGTTGGACTGGGATCATTGTAGGATGTGGACAAAATGGAAGGGCAGCTGAGGCAGTCAGGTTTTTTCACGAGATGATTCAAGTTGGGCT
AAATCCAAATGAGATCACCTTTCTAGGGGTTCTTTCTGCATGTAGATATGCTGGTTTGGTTGAAGAGGCACGAAGTATATTTAATTCCATGAAATCTGTATATGGATTAG
AACCTCATTTAGAGCATTATTGCTGCATGGTCGATCTTCTTGCTCTAGCGGGACTTCCTGAAGAAGCTGAGAAAATGGTTGCAAATATGCCGTTTAAGCCAGATCAGACC
ACATGGCGCACTTTGCTAGGGGCAATTGGAACTCGCAATGATCCGAAGCTTATTAACAGTGTTGCTGATGGCCTTCTTGAAGCCACACCAAATGACCCTTCTACGTACGT
GACGCTTTCAAATGCTTATGCATCACTGGGAATGTGGCAAACCCTGAGCAAAGCGAGAGAGGCTGCCAAAACAGTGGGAGCAAAAAGAGCTGGGTTGAGCTGGATTGAGG
TTTCGAGT
Protein sequenceShow/hide protein sequence
NIIISGYSKAGLMLEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSMMNTEGVKLDGFTFPCALKISALHGLLLMGKQIHTYVTKLGYESSCFTLSALIDMYSNC
NGLIEAVKLFDQHSTFNSSISDNLALWNSMLSGYVINNCDKAALNLISEIHCSGALLDSYTFGGALKVCINVLSRRVALQVHGLIVTCGYELDYVIGSILVDLYAKLGSI
DDALELFQRLPRKDIIAWSGLIMGCAQMGLNWLAFSMFRDMVSSAHEIDDFVISTTLKVCSNLASLRSGKQVHAFCVKGGYEMEGFTITSLLDMYSKCGEIEDALTLFSC
IQEKDIVSWTGIIVGCGQNGRAAEAVRFFHEMIQVGLNPNEITFLGVLSACRYAGLVEEARSIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKMVANMPFKPDQT
TWRTLLGAIGTRNDPKLINSVADGLLEATPNDPSTYVTLSNAYASLGMWQTLSKAREAAKTVGAKRAGLSWIEVSS