CuGenDBv2

PI0026011 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0026011
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Tetratricopeptide repeat-like superfamily protein
Genome location: chr02:24570652..24572454
RNA-Seq Expression: PI0026011
Synteny: PI0026011
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
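
As an illustration, the genome location string in this record can be split into chromosome, start and end. The following is a minimal Python sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr02:24570652..24572454")
# Inclusive length; valid only under the 1-based inclusive assumption.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 1803 bp on chr02.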


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589426.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 7.0e-241 | 76.73
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSF LH LT+ELSK++LTLLRTKELHAFITK++LACDPFYATRIVRLYSIN +LNYARHVFDKTPNR+VYLWNSIIRAYAKAHKFG+ALSLF TM GTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------
        L D+FTYSCIIRAC+EN HRE LKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIE+ASK + +      L  +N + SG+   G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHG CLKC+ DSNEHVASALVSMYSRCNCMDSAYLVF SL+QPDLVTWSALITGYSQ+GDF KA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------
        QGKK D+ILIASILAA AQS NIR GIEIHGY LRQGI+S+EM+SSSLID+YSKCGYLSLGIRVFH+M +K I  YNS++WG+GLHGLAS+  +      
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------

Query:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
            +PNESTFSALLCACCH GLNSVGK IFKRM+DEF IKYRTEHYVYIVKLLGM+GELE AY+LV+SLPEPVDSG+WGALLSCCDACGN++LAE+VAQ
Subjt:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        +L+EN+PDK AY+VMLSNIYAG+GRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
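
In each alignment block, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal Python sketch of how identities and positives could be counted from one such line follows. The function name `midline_stats` is hypothetical, and the input is only the first 16 columns of the alignment above, so the result is not expected to match the % identity reported for the full hit.

```python
def midline_stats(query, midline):
    """Return (identical, positives, aligned_length) for one alignment row."""
    # Text dumps often strip trailing spaces, so pad the midline to full width.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(query)

# First 16 columns of the first GenBank hit shown above.
ident, pos, length = midline_stats("MSFNLHNLTAELSKSF", "MSF LH LT+ELSK++")
print(ident, pos, length)  # 11 14 16
print(round(100 * ident / length, 2))  # 68.75
```

Summing these counts over every row of a hit, and dividing by the total aligned length, is the usual way such a percentage is derived.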

XP_004138268.1 putative pentatricopeptide repeat-containing protein At1g64310 [Cucumis sativus] | 6.3e-258 | 83.64
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSFNLH LT ELSKSFLTLLRTKELHAFITKS+LA DPFYATRIV+LYSIN KL YARH+FDKTPNRSVYLWNSIIRAYAKA+KF DALSLFLTMSGTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGFNHLTSGWIHG----------
          DNFTYSCIIRACSEN HREWLK VHGRVLV+GFGLDPICCSALVTAYSNLDLIEEASK    +    +  W   +CGF      W  G          
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGFNHLTSGWIHG----------

Query:  ---------SWCASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDF KAMLFFQRLNM
Subjt:  ---------SWCASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----
        QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVM QK+ISTYNSV+WGLGLHGLAS+  ++     
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----

Query:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
             PNESTFSALL ACCH GLNSVGK+IFKRMKDEFCIKY+TEHYVYIVKLLGMTGELEVAYNLVMSLPEP DSGIWGALLSCCDACGNV+LAEVVAQ
Subjt:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        RLIENDP+KT YKVMLSNIYAGDGRWDDVKKLRDTMTEKERGK PGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

XP_008464525.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g64310 [Cucumis melo] | 2.6e-259 | 84.15
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSFNLH LT ELSKSFLTLLRTKELHAFITKS+LA DPFYATRIVRLYSIN KL+YARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMS TE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGF---NHLTSGWI---------
        L DNFTYSCIIRACSEN HREWLK VHGRVLV+GFGLDPICCSALVTA SNLDLIEEA+K    +    +  W   +CGF    +   G +         
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGF---NHLTSGWI---------

Query:  -HGSWC-----ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQ
         H   C     ASGIAEPSLLSTGKGIHGLCLK NFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYS+AGDF KAMLFFQ+LNMQ
Subjt:  -HGSWC-----ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQ

Query:  GKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL------
        GKKMDSILI SILAATAQSTN+RHGIEIHGYVLR GIESNEMISSSLIDMYSKCGYL+LGIRVFHVMPQKSISTYNSV+WGLGLHGLAS+  ++      
Subjt:  GKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL------

Query:  ----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQR
            PNESTFSALLCACCHVGLNSVGK+IFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNL+MSLPE VDSGIWGALLSCCDACGNV+LAEVVAQR
Subjt:  ----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQR

Query:  LIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        LIE DPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  LIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

XP_022921364.1 putative pentatricopeptide repeat-containing protein At1g64310 [Cucurbita moschata] | 1.6e-240 | 76.73
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSF LH LT+ELSK++LTLLRTKELHAFITK++LACDPFYATRIVRLYSING+LNYARHVFDKTPNR+VYLWNSIIRAYAKAHKFG+ALSLF TM GTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------
        L DNFTYSCIIRAC+ENLHRE LKLVHGRVL SGFGLDPICCSALVTAYSNLDLIE+ASK + +      L  +N + SG+   G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHG CLKC+ DSNEHVASALVSMYSRCNCMDSAYLVF SL+QPDLVTWSALITGYSQ+GDF KA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------
        QGKK D+ILIASILAA AQS NIR GIEIHGY LRQGI+S+EM+SSSLIDMYSKCGYLSLGIRVF++M +K I  YNS++WG+GLHGLAS+  +      
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------

Query:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
            +PNESTFSALLCACCH GLNSVGK IFKRM +EF IKYRTEHYVYIVKLLGM+GE E AY+LV+SLPEPVDSG+WGALLSCCDACGN++LAE+VAQ
Subjt:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        +L+EN+PDK AY+VMLSNIYAG+GRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

XP_038878555.1 putative pentatricopeptide repeat-containing protein At1g64310 [Benincasa hispida] | 1.2e-243 | 78.36
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MS  LH LT+ELSKSF TLLRTKELHAFITK++LACDPFYATRIVRLYS+N +L+YARHVFDK+PNR+VYLWNSIIRAYAKA+KF DALSLFLTM G E 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------
        L DNFTYSCIIRACS+N H EWLK VHGRVL+SGFGLDPICCSALVTAYSNLDLIE+ASK + +      L  +N + SG+ + G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    A  IAEPSLLSTGK IHG CLKC+FDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDF KA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----
        QGKK DSILI+SILAATAQST+  HGIEIHGYVLR GIES+EM+SSSLIDMYSKCGYLSLGIRVFH+MPQK I TYNSV+WGLGLHGLAS   ++     
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----

Query:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
             PNESTFSALLCACCHVGLNSVGK+IF+RMKDEFCIKYRTEHYVYIVKLLGMTGELE AYNL++SLPEPVDSGIWGALLSCCDACGN++LAEVVAQ
Subjt:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        RLI+N+PDKTAYKVMLSNIYAG+GRWDDVK LRDTMTEKER KLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LP28 Uncharacterized protein | 3.1e-258 | 83.64
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSFNLH LT ELSKSFLTLLRTKELHAFITKS+LA DPFYATRIV+LYSIN KL YARH+FDKTPNRSVYLWNSIIRAYAKA+KF DALSLFLTMSGTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGFNHLTSGWIHG----------
          DNFTYSCIIRACSEN HREWLK VHGRVLV+GFGLDPICCSALVTAYSNLDLIEEASK    +    +  W   +CGF      W  G          
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGFNHLTSGWIHG----------

Query:  ---------SWCASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDF KAMLFFQRLNM
Subjt:  ---------SWCASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----
        QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVM QK+ISTYNSV+WGLGLHGLAS+  ++     
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----

Query:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
             PNESTFSALL ACCH GLNSVGK+IFKRMKDEFCIKY+TEHYVYIVKLLGMTGELEVAYNLVMSLPEP DSGIWGALLSCCDACGNV+LAEVVAQ
Subjt:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        RLIENDP+KT YKVMLSNIYAGDGRWDDVKKLRDTMTEKERGK PGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

A0A1S3CM62 putative pentatricopeptide repeat-containing protein At1g64310 | 1.2e-259 | 84.15
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSFNLH LT ELSKSFLTLLRTKELHAFITKS+LA DPFYATRIVRLYSIN KL+YARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMS TE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGF---NHLTSGWI---------
        L DNFTYSCIIRACSEN HREWLK VHGRVLV+GFGLDPICCSALVTA SNLDLIEEA+K    +    +  W   +CGF    +   G +         
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTW---LCGF---NHLTSGWI---------

Query:  -HGSWC-----ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQ
         H   C     ASGIAEPSLLSTGKGIHGLCLK NFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYS+AGDF KAMLFFQ+LNMQ
Subjt:  -HGSWC-----ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQ

Query:  GKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL------
        GKKMDSILI SILAATAQSTN+RHGIEIHGYVLR GIESNEMISSSLIDMYSKCGYL+LGIRVFHVMPQKSISTYNSV+WGLGLHGLAS+  ++      
Subjt:  GKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL------

Query:  ----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQR
            PNESTFSALLCACCHVGLNSVGK+IFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNL+MSLPE VDSGIWGALLSCCDACGNV+LAEVVAQR
Subjt:  ----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQR

Query:  LIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        LIE DPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  LIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

A0A6J1C2V4 putative pentatricopeptide repeat-containing protein At1g64310 | 2.3e-229 | 74.36
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSF LH LT+ELSKS+LTLLRTKELHA ITK+ LACD FYATRIV LYS+NG+L+Y RHVFDKTP+RSVYLWNSIIRAYAKAHKFGDALSLF  M  +E 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------
          DNFTYSCII+ACSE+ HRE LKLVHGR L SGFGLDPICCSALV AYSNLDLIEEA K + +      L  +N + SG+ + G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEP LL+ GKGIHG CLKCNFDS+EHVASA+VSMYSRC  MDSAYLVFSSLLQPDLVTWSALITGYSQ+ DFGKA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----
        QGKK DSILIASILAA AQSTNIR GIEIHGYVLR GIE NEM+SSSLIDMYSKCG+L+LGI VFH++PQKSI +YNSV+WG+GLHGLAS+  ++     
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL-----

Query:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
             PNESTFSALLCACCH GLNSVGK+IF+RMKDEF IKYR +HYVYIVKLLGMTGELE AYNL++SLPE VDSGIWGALLSCCDACGN +LAE+VAQ
Subjt:  -----PNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        +L++N+ +KTAYKVMLSNIYAG+GRWDDVKKLRDTMT  ERGKLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

A0A6J1E092 putative pentatricopeptide repeat-containing protein At1g64310 | 7.6e-241 | 76.73
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSF LH LT+ELSK++LTLLRTKELHAFITK++LACDPFYATRIVRLYSING+LNYARHVFDKTPNR+VYLWNSIIRAYAKAHKFG+ALSLF TM GTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------
        L DNFTYSCIIRAC+ENLHRE LKLVHGRVL SGFGLDPICCSALVTAYSNLDLIE+ASK + +      L  +N + SG+   G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGW-IHGSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHG CLKC+ DSNEHVASALVSMYSRCNCMDSAYLVF SL+QPDLVTWSALITGYSQ+GDF KA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------
        QGKK D+ILIASILAA AQS NIR GIEIHGY LRQGI+S+EM+SSSLIDMYSKCGYLSLGIRVF++M +K I  YNS++WG+GLHGLAS+  +      
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------

Query:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
            +PNESTFSALLCACCH GLNSVGK IFKRM +EF IKYRTEHYVYIVKLLGM+GE E AY+LV+SLPEPVDSG+WGALLSCCDACGN++LAE+VAQ
Subjt:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        +L+EN+PDK AY+VMLSNIYAG+GRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

A0A6J1JDK9 putative pentatricopeptide repeat-containing protein At1g64310 | 2.7e-238 | 76
Query:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA
        MSF LH L +ELSK++LTLLRTKELHAFITK++LACDPFYATRIVRLYSIN +LNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFG+ALSLF TM GTE 
Subjt:  MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEA

Query:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------
        L DNFTYSCIIR C+EN HRE LKLVHGRVL SGFGLDPICCSALVTAYSNLDLIE+ASK + +      L  +N + SG+ + G W             
Subjt:  LSDNFTYSCIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIH-GSW-------------

Query:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM
                    ASGIAEPSLLSTGKGIHG CLKC+ DSNEHVASALVSMYSRCNCM SAYLVF SL+QPDLVTWSALITGYSQ+GDF KA+ FFQ+LNM
Subjt:  -----------CASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNM

Query:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------
        QG K D+ILIASILAA AQS NIR GIEIHGY LRQGI+SNEM+SSSLIDMYSKCGYLSLGIRVFH+M +K I  YNS++WG+GLHGLAS+  +      
Subjt:  QGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK------

Query:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ
            +PNESTFSALLCACCH GL+SVGK IF+RM++EF IKYRTEHYVYIVKLLGM+GELE AYNLV+SLPEPVDSG+WGALLSCCDACGN++LAE+VAQ
Subjt:  ----LPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQ

Query:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        +L+EN+P K AY+VMLSNIYAG+GRWDDVKKLRDTMTEKERGKLPGLSWI
Subjt:  RLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

SwissProt top hits | e value | % identity | Alignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | 1.5e-65 | 28.44
Query:  KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREW
        KE+H  + KS  + D F  T +  +Y+   ++N AR VFD+ P R +  WN+I+  Y++      AL +  +M          T   ++ A S       
Subjt:  KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREW

Query:  LKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTWLCGFNHLTSGWIHGS-------------------------WCASGIAEP
         K +HG  + SGF       +ALV  Y+    +E A +    ++E ++ +W    N +   ++                                  A+ 
Subjt:  LKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTWLCGFNHLTSGWIHGS-------------------------WCASGIAEP

Query:  SLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQ
          L  G+ IH L ++   D N  V ++L+SMY +C  +D+A  +F  L    LV+W+A+I G++Q G    A+ +F ++  +  K D+    S++ A A+
Subjt:  SLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQ

Query:  STNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNESTFSALLCACC
         +   H   IHG V+R  ++ N  ++++L+DMY+KCG + +   +F +M ++ ++T+N+++ G G HG      +L          PN  TF +++ AC 
Subjt:  STNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNESTFSALLCACC

Query:  HVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNI
        H GL   G   F  MK+ + I+   +HY  +V LLG  G L  A++ +M +P      ++GA+L  C    NV+ AE  A+RL E +PD   Y V+L+NI
Subjt:  HVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNI

Query:  YAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        Y     W+ V ++R +M  +   K PG S +
Subjt:  YAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

Q9C7V5 Putative pentatricopeptide repeat-containing protein At1g64310 | 6.4e-136 | 47.5
Query:  ELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCI
        E ++   T L T++LH+F+TKS LA DP++AT++ R Y++N  L  AR +FD  P RSV+LWNSIIRAYAKAH+F   LSLF  +  ++   DNFTY+C+
Subjt:  ELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCI

Query:  IRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASK---CLVECHIQTW--------LCGF-------NHLTSGWIHGSWC-----
         R  SE+   + L+ +HG  +VSG G D IC SA+V AYS   LI EASK    + +  +  W         CGF        +L     H   C     
Subjt:  IRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASK---CLVECHIQTW--------LCGF-------NHLTSGWIHGSWC-----

Query:  -ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIA
          SG+ +PSLL     +H  CLK N DS+ +V  ALV+MYSRC C+ SA  VF+S+ +PDLV  S+LITGYS+ G+  +A+  F  L M GKK D +L+A
Subjt:  -ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIA

Query:  SILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK----------LPNESTF
         +L + A+ ++   G E+H YV+R G+E +  + S+LIDMYSKCG L   + +F  +P+K+I ++NS++ GLGLHG AS  ++          +P+E TF
Subjt:  SILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK----------LPNESTF

Query:  SALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD-KT
        SALLC CCH GL + G++IF+RMK EF I+ +TEHYVY+VKL+GM G+LE A+  VMSL +P+DSGI GALLSCC+   N  LAEVVA+ + +N  + ++
Subjt:  SALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD-KT

Query:  AYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSW
         YKVMLSN+YA  GRWD+V++LRD ++E   GKLPG+SW
Subjt:  AYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | 1.5e-65 | 29.7
Query:  AELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSC
        A L  S     + K++HA +    L    F  T+++   S  G + +AR VFD  P   ++ WN+IIR Y++ + F DAL ++  M       D+FT+  
Subjt:  AELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSC

Query:  IIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC-----LVECHIQTW----------------LCGFNHLTSGWIHGSWCA
        +++ACS   H +  + VH +V   GF  D    + L+  Y+    +  A        L E  I +W                L  F+ +    +   W A
Subjt:  IIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC-----LVECHIQTW----------------LCGFNHLTSGWIHGSWCA

Query:  -----SGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSI
             +       L  G+ IH   +K   +    +  +L +MY++C  + +A ++F  +  P+L+ W+A+I+GY++ G   +A+  F  +  +  + D+I
Subjt:  -----SGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSI

Query:  LIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNE
         I S ++A AQ  ++     ++ YV R     +  ISS+LIDM++KCG +     VF     + +  +++++ G GLHG A     L          PN+
Subjt:  LIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNE

Query:  STFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD
         TF  LL AC H G+   G   F RM D   I  + +HY  ++ LLG  G L+ AY ++  +P      +WGALLS C    +V+L E  AQ+L   DP 
Subjt:  STFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD

Query:  KTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
         T + V LSN+YA    WD V ++R  M EK   K  G SW+
Subjt:  KTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | 1.7e-64 | 30.33
Query:  LSKSFLTLLRT---KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYS
        +SKSF +L      ++LH FI KS           +V  Y  N +++ AR VFD+   R V  WNSII  Y         LS+F+ M  +    D  T  
Subjt:  LSKSFLTLLRT---KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYS

Query:  CIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGSWCASGI-----------------
         +   C+++      + VH   + + F  +   C+ L+  YS    ++ A     E   ++ +  +  + +G+         +                 
Subjt:  CIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGSWCASGI-----------------

Query:  --------AEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKM--D
                A   LL  GK +H    + +   +  V++AL+ MY++C  M  A LVFS +   D+++W+ +I GYS+     +A+  F  L ++ K+   D
Subjt:  --------AEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKM--D

Query:  SILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKLPN---------
           +A +L A A  +    G EIHGY++R G  S+  +++SL+DMY+KCG L L   +F  +  K + ++  ++ G G+HG       L N         
Subjt:  SILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKLPN---------

Query:  -ESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIEND
         E +F +LL AC H GL   G   F  M+ E  I+   EHY  IV +L  TG+L  AY  + ++P P D+ IWGALL  C    +V LAE VA+++ E +
Subjt:  -ESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIEND

Query:  PDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        P+ T Y V+++NIYA   +W+ VK+LR  + ++   K PG SWI
Subjt:  PDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300 | 4.5e-73 | 28.98
Query:  ELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREWL
        +LH  +  S +  +      ++ +YS  G+ + A  +F          WN +I  Y ++    ++L+ F  M  +  L D  T+S ++ + S+  + E+ 
Subjt:  ELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREWL

Query:  KLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGS----------WCASGIAEPS---------------LLS
        K +H  ++     LD    SAL+ AY     +  A     +C+    +  F  + SG++H            W       P+                L 
Subjt:  KLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGS----------WCASGIAEPS---------------LLS

Query:  TGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQSTNI
         G+ +HG  +K  FD+  ++  A++ MY++C  M+ AY +F  L + D+V+W+++IT  +Q+ +   A+  F+++ + G   D + I++ L+A A   + 
Subjt:  TGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQSTNI

Query:  RHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHG-----------LASRHWKLPNESTFSALLCACCHVG
          G  IHG++++  + S+    S+LIDMY+KCG L   + VF  M +K+I ++NS++   G HG           +  +    P++ TF  ++ +CCHVG
Subjt:  RHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHG-----------LASRHWKLPNESTFSALLCACCHVG

Query:  LNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNIYAG
            G   F+ M +++ I+ + EHY  +V L G  G L  AY  V S+P P D+G+WG LL  C    NV+LAEV + +L++ DP  + Y V++SN +A 
Subjt:  LNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNIYAG

Query:  DGRWDDVKKLRDTMTEKERGKLPGLSWI
           W+ V K+R  M E+E  K+PG SWI
Subjt:  DGRWDDVKKLRDTMTEKERGKLPGLSWI

Arabidopsis top hits | e value | % identity | Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.1e-66 | 28.44
Query:  KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREW
        KE+H  + KS  + D F  T +  +Y+   ++N AR VFD+ P R +  WN+I+  Y++      AL +  +M          T   ++ A S       
Subjt:  KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREW

Query:  LKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTWLCGFNHLTSGWIHGS-------------------------WCASGIAEP
         K +HG  + SGF       +ALV  Y+    +E A +    ++E ++ +W    N +   ++                                  A+ 
Subjt:  LKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC---LVECHIQTWLCGFNHLTSGWIHGS-------------------------WCASGIAEP

Query:  SLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQ
          L  G+ IH L ++   D N  V ++L+SMY +C  +D+A  +F  L    LV+W+A+I G++Q G    A+ +F ++  +  K D+    S++ A A+
Subjt:  SLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQ

Query:  STNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNESTFSALLCACC
         +   H   IHG V+R  ++ N  ++++L+DMY+KCG + +   +F +M ++ ++T+N+++ G G HG      +L          PN  TF +++ AC 
Subjt:  STNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNESTFSALLCACC

Query:  HVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNI
        H GL   G   F  MK+ + I+   +HY  +V LLG  G L  A++ +M +P      ++GA+L  C    NV+ AE  A+RL E +PD   Y V+L+NI
Subjt:  HVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNI

Query:  YAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        Y     W+ V ++R +M  +   K PG S +
Subjt:  YAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

AT1G64310.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.5e-137 | 47.5
Query:  ELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCI
        E ++   T L T++LH+F+TKS LA DP++AT++ R Y++N  L  AR +FD  P RSV+LWNSIIRAYAKAH+F   LSLF  +  ++   DNFTY+C+
Subjt:  ELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCI

Query:  IRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASK---CLVECHIQTW--------LCGF-------NHLTSGWIHGSWC-----
         R  SE+   + L+ +HG  +VSG G D IC SA+V AYS   LI EASK    + +  +  W         CGF        +L     H   C     
Subjt:  IRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASK---CLVECHIQTW--------LCGF-------NHLTSGWIHGSWC-----

Query:  -ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIA
          SG+ +PSLL     +H  CLK N DS+ +V  ALV+MYSRC C+ SA  VF+S+ +PDLV  S+LITGYS+ G+  +A+  F  L M GKK D +L+A
Subjt:  -ASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIA

Query:  SILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK----------LPNESTF
         +L + A+ ++   G E+H YV+R G+E +  + S+LIDMYSKCG L   + +F  +P+K+I ++NS++ GLGLHG AS  ++          +P+E TF
Subjt:  SILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWK----------LPNESTF

Query:  SALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD-KT
        SALLC CCH GL + G++IF+RMK EF I+ +TEHYVY+VKL+GM G+LE A+  VMSL +P+DSGI GALLSCC+   N  LAEVVA+ + +N  + ++
Subjt:  SALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD-KT

Query:  AYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSW
         YKVMLSN+YA  GRWD+V++LRD ++E   GKLPG+SW
Subjt:  AYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSW

AT3G12770.1 mitochondrial editing factor 22; e-value 1.1e-66; identity 29.7%
Query:  AELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSC
        A L  S     + K++HA +    L    F  T+++   S  G + +AR VFD  P   ++ WN+IIR Y++ + F DAL ++  M       D+FT+  
Subjt:  AELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSC

Query:  IIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC-----LVECHIQTW----------------LCGFNHLTSGWIHGSWCA
        +++ACS   H +  + VH +V   GF  D    + L+  Y+    +  A        L E  I +W                L  F+ +    +   W A
Subjt:  IIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKC-----LVECHIQTW----------------LCGFNHLTSGWIHGSWCA

Query:  -----SGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSI
             +       L  G+ IH   +K   +    +  +L +MY++C  + +A ++F  +  P+L+ W+A+I+GY++ G   +A+  F  +  +  + D+I
Subjt:  -----SGIAEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSI

Query:  LIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNE
         I S ++A AQ  ++     ++ YV R     +  ISS+LIDM++KCG +     VF     + +  +++++ G GLHG A     L          PN+
Subjt:  LIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKL----------PNE

Query:  STFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD
         TF  LL AC H G+   G   F RM D   I  + +HY  ++ LLG  G L+ AY ++  +P      +WGALLS C    +V+L E  AQ+L   DP 
Subjt:  STFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPD

Query:  KTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
         T + V LSN+YA    WD V ++R  M EK   K  G SW+
Subjt:  KTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein; e-value 1.2e-65; identity 30.33%
Query:  LSKSFLTLLRT---KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYS
        +SKSF +L      ++LH FI KS           +V  Y  N +++ AR VFD+   R V  WNSII  Y         LS+F+ M  +    D  T  
Subjt:  LSKSFLTLLRT---KELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYS

Query:  CIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGSWCASGI-----------------
         +   C+++      + VH   + + F  +   C+ L+  YS    ++ A     E   ++ +  +  + +G+         +                 
Subjt:  CIIRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGSWCASGI-----------------

Query:  --------AEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKM--D
                A   LL  GK +H    + +   +  V++AL+ MY++C  M  A LVFS +   D+++W+ +I GYS+     +A+  F  L ++ K+   D
Subjt:  --------AEPSLLSTGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKM--D

Query:  SILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKLPN---------
           +A +L A A  +    G EIHGY++R G  S+  +++SL+DMY+KCG L L   +F  +  K + ++  ++ G G+HG       L N         
Subjt:  SILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKLPN---------

Query:  -ESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIEND
         E +F +LL AC H GL   G   F  M+ E  I+   EHY  IV +L  TG+L  AY  + ++P P D+ IWGALL  C    +V LAE VA+++ E +
Subjt:  -ESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIEND

Query:  PDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
        P+ T Y V+++NIYA   +W+ VK+LR  + ++   K PG SWI
Subjt:  PDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e-value 3.2e-74; identity 28.98%
Query:  ELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREWL
        +LH  +  S +  +      ++ +YS  G+ + A  +F          WN +I  Y ++    ++L+ F  M  +  L D  T+S ++ + S+  + E+ 
Subjt:  ELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCIIRACSENLHREWL

Query:  KLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGS----------WCASGIAEPS---------------LLS
        K +H  ++     LD    SAL+ AY     +  A     +C+    +  F  + SG++H            W       P+                L 
Subjt:  KLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGS----------WCASGIAEPS---------------LLS

Query:  TGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQSTNI
         G+ +HG  +K  FD+  ++  A++ MY++C  M+ AY +F  L + D+V+W+++IT  +Q+ +   A+  F+++ + G   D + I++ L+A A   + 
Subjt:  TGKGIHGLCLKCNFDSNEHVASALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQSTNI

Query:  RHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHG-----------LASRHWKLPNESTFSALLCACCHVG
          G  IHG++++  + S+    S+LIDMY+KCG L   + VF  M +K+I ++NS++   G HG           +  +    P++ TF  ++ +CCHVG
Subjt:  RHGIEIHGYVLRQGIESNEMISSSLIDMYSKCGYLSLGIRVFHVMPQKSISTYNSVVWGLGLHG-----------LASRHWKLPNESTFSALLCACCHVG

Query:  LNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNIYAG
            G   F+ M +++ I+ + EHY  +V L G  G L  AY  V S+P P D+G+WG LL  C    NV+LAEV + +L++ DP  + Y V++SN +A 
Subjt:  LNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVDSGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNIYAG

Query:  DGRWDDVKKLRDTMTEKERGKLPGLSWI
           W+ V K+R  M E+E  K+PG SWI
Subjt:  DGRWDDVKKLRDTMTEKERGKLPGLSWI


Sequences
CDS sequence
ATGTCGTTTAATCTTCATAACCTCACGGCGGAGCTTTCCAAGAGTTTTCTAACTCTTTTACGAACAAAGGAACTGCATGCTTTTATTACCAAATCCTATCTTGCTTGCGA
CCCATTTTACGCAACTAGAATTGTTAGGCTCTACTCCATCAATGGTAAACTCAATTATGCGCGCCATGTGTTTGACAAAACTCCCAACCGAAGTGTCTACCTCTGGAACT
CAATCATTCGAGCTTACGCGAAAGCCCATAAATTCGGGGATGCATTATCTCTGTTCCTCACAATGTCTGGAACTGAGGCTTTGTCCGATAACTTCACCTATTCATGCATC
ATAAGGGCATGCTCTGAGAACCTCCATAGAGAATGGCTTAAACTTGTTCATGGACGAGTTTTAGTATCTGGGTTTGGATTAGATCCTATTTGTTGCAGCGCTCTAGTGAC
TGCATACTCAAATCTGGACCTTATTGAAGAGGCTAGCAAGTGTTTGGTGGAATGCCACATCCAGACTTGGTTATGTGGATTCAATCATTTAACATCCGGATGGATACACG
GTAGTTGGTGTGCATCGGGTATAGCAGAACCCAGCCTACTGAGCACTGGCAAAGGTATTCATGGACTTTGCCTGAAGTGTAATTTTGACTCTAATGAGCATGTAGCTAGT
GCACTTGTGAGTATGTATTCGAGGTGTAATTGCATGGATTCCGCGTATTTAGTATTTAGTAGTTTGTTACAGCCTGACTTAGTTACATGGTCTGCTTTAATAACTGGGTA
TTCTCAAGCTGGTGATTTTGGGAAAGCAATGTTGTTCTTTCAGAGACTGAATATGCAGGGTAAGAAGATGGATTCTATTTTGATTGCCAGCATTTTGGCTGCTACTGCTC
AATCGACCAACATAAGGCATGGAATTGAGATACATGGTTATGTTCTTCGACAAGGGATAGAATCAAACGAGATGATATCTTCTTCTCTCATAGACATGTATTCCAAGTGT
GGTTATTTGAGTTTAGGAATTCGTGTTTTTCACGTTATGCCGCAAAAAAGTATCTCGACATACAATTCTGTAGTATGGGGACTTGGTTTGCATGGACTTGCATCAAGGCA
TTGGAAATTGCCTAATGAGTCCACTTTCTCTGCTCTCCTCTGTGCGTGCTGCCATGTTGGTCTTAACTCTGTTGGCAAGGATATTTTCAAACGGATGAAAGATGAGTTTT
GCATCAAATACAGAACAGAGCATTACGTTTACATTGTAAAACTTCTTGGAATGACTGGGGAATTAGAAGTGGCTTACAATCTTGTCATGTCCTTACCAGAGCCTGTAGAC
TCTGGTATTTGGGGAGCTCTACTCTCTTGCTGTGATGCTTGTGGGAACGTTGATCTGGCTGAAGTTGTTGCTCAACGGCTCATAGAAAATGATCCTGACAAAACCGCTTA
TAAAGTAATGCTCTCTAATATTTATGCTGGAGATGGGAGATGGGATGATGTGAAGAAGTTAAGGGATACAATGACAGAAAAAGAACGAGGAAAATTGCCTGGCCTTAGCT
GGATTTGA
mRNA sequence
ATGTCGTTTAATCTTCATAACCTCACGGCGGAGCTTTCCAAGAGTTTTCTAACTCTTTTACGAACAAAGGAACTGCATGCTTTTATTACCAAATCCTATCTTGCTTGCGA
CCCATTTTACGCAACTAGAATTGTTAGGCTCTACTCCATCAATGGTAAACTCAATTATGCGCGCCATGTGTTTGACAAAACTCCCAACCGAAGTGTCTACCTCTGGAACT
CAATCATTCGAGCTTACGCGAAAGCCCATAAATTCGGGGATGCATTATCTCTGTTCCTCACAATGTCTGGAACTGAGGCTTTGTCCGATAACTTCACCTATTCATGCATC
ATAAGGGCATGCTCTGAGAACCTCCATAGAGAATGGCTTAAACTTGTTCATGGACGAGTTTTAGTATCTGGGTTTGGATTAGATCCTATTTGTTGCAGCGCTCTAGTGAC
TGCATACTCAAATCTGGACCTTATTGAAGAGGCTAGCAAGTGTTTGGTGGAATGCCACATCCAGACTTGGTTATGTGGATTCAATCATTTAACATCCGGATGGATACACG
GTAGTTGGTGTGCATCGGGTATAGCAGAACCCAGCCTACTGAGCACTGGCAAAGGTATTCATGGACTTTGCCTGAAGTGTAATTTTGACTCTAATGAGCATGTAGCTAGT
GCACTTGTGAGTATGTATTCGAGGTGTAATTGCATGGATTCCGCGTATTTAGTATTTAGTAGTTTGTTACAGCCTGACTTAGTTACATGGTCTGCTTTAATAACTGGGTA
TTCTCAAGCTGGTGATTTTGGGAAAGCAATGTTGTTCTTTCAGAGACTGAATATGCAGGGTAAGAAGATGGATTCTATTTTGATTGCCAGCATTTTGGCTGCTACTGCTC
AATCGACCAACATAAGGCATGGAATTGAGATACATGGTTATGTTCTTCGACAAGGGATAGAATCAAACGAGATGATATCTTCTTCTCTCATAGACATGTATTCCAAGTGT
GGTTATTTGAGTTTAGGAATTCGTGTTTTTCACGTTATGCCGCAAAAAAGTATCTCGACATACAATTCTGTAGTATGGGGACTTGGTTTGCATGGACTTGCATCAAGGCA
TTGGAAATTGCCTAATGAGTCCACTTTCTCTGCTCTCCTCTGTGCGTGCTGCCATGTTGGTCTTAACTCTGTTGGCAAGGATATTTTCAAACGGATGAAAGATGAGTTTT
GCATCAAATACAGAACAGAGCATTACGTTTACATTGTAAAACTTCTTGGAATGACTGGGGAATTAGAAGTGGCTTACAATCTTGTCATGTCCTTACCAGAGCCTGTAGAC
TCTGGTATTTGGGGAGCTCTACTCTCTTGCTGTGATGCTTGTGGGAACGTTGATCTGGCTGAAGTTGTTGCTCAACGGCTCATAGAAAATGATCCTGACAAAACCGCTTA
TAAAGTAATGCTCTCTAATATTTATGCTGGAGATGGGAGATGGGATGATGTGAAGAAGTTAAGGGATACAATGACAGAAAAAGAACGAGGAAAATTGCCTGGCCTTAGCT
GGATTTGAGAGTTTAATGTTTGTTGGCAGGTACATTCGCTTGGTCATCATTGATCATATTTCAATAATTTACAGAAGCGAATGTCACCATGTGAAGTGATGATTCCTTTA
AATTATGGATATTTTTCCTGGCTTTGAAGATGTTACCAGTGATCTTTGGATATTTA
Protein sequence
MSFNLHNLTAELSKSFLTLLRTKELHAFITKSYLACDPFYATRIVRLYSINGKLNYARHVFDKTPNRSVYLWNSIIRAYAKAHKFGDALSLFLTMSGTEALSDNFTYSCI
IRACSENLHREWLKLVHGRVLVSGFGLDPICCSALVTAYSNLDLIEEASKCLVECHIQTWLCGFNHLTSGWIHGSWCASGIAEPSLLSTGKGIHGLCLKCNFDSNEHVAS
ALVSMYSRCNCMDSAYLVFSSLLQPDLVTWSALITGYSQAGDFGKAMLFFQRLNMQGKKMDSILIASILAATAQSTNIRHGIEIHGYVLRQGIESNEMISSSLIDMYSKC
GYLSLGIRVFHVMPQKSISTYNSVVWGLGLHGLASRHWKLPNESTFSALLCACCHVGLNSVGKDIFKRMKDEFCIKYRTEHYVYIVKLLGMTGELEVAYNLVMSLPEPVD
SGIWGALLSCCDACGNVDLAEVVAQRLIENDPDKTAYKVMLSNIYAGDGRWDDVKKLRDTMTEKERGKLPGLSWI
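
The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the `translate` helper and the `CODON_TABLE` name are illustrative, and it assumes the standard nuclear code applies and that the CDS is in frame from its first base. The two snippets translated at the bottom are the first 24 and last 21 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS above.
    print(translate("ATGTCGTTTAATCTTCATAACCTC"))
    # Last seven codons of the CDS above, including the TGA stop.
    print(translate("CCTGGCCTTAGCTGGATTTGA"))
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is one way to confirm that the two records agree.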