CuGenDBv2

Spg006323 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006323
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold4:5273355..5277653
RNA-Seq Expression: Spg006323
Synteny: Spg006323
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
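The genome location string in this record can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the `name:start..end` layout seen here and assuming that both coordinates are 1-based and inclusive (the record itself does not state the convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold4:5273355..5277653")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 4299 bp on scaffold4. If the coordinates were 0-based half-open instead, the `+ 1` would have to be dropped.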


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 4.3e-111 | identity: 85.54%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQVQIIHSY KAETDLAPEI+GFN LL+ALV+Y LG+LAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLRTVK+DAQ++YGESLEFLEEE+EAAT +  H
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] | e-value: 1.7e-112 | identity: 86.75%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQVQIIHSY KAETDLAPEI+GFN LL+ALVSY LGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLRTVK+DAQ++YGESLEFLEEE+EAAT + TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima] | e-value: 1.1e-111 | identity: 86.35%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQVQIIHSYLKAETDLAPEI+GFN LL+ALVSY LGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLR VK+DAQ++YGESLEFLEEE+EAAT + TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo] | e-value: 1.1e-111 | identity: 85.54%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PSPTILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQ+SLYADI+++LASNGLFE VQIIHSYLKAETDLAPEI+GFN LL+ALVSY LGELAMESYYLMK+VGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLRTVK+DAQ++YGESLEFLEEE++ AT + TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida] | e-value: 4.2e-114 | identity: 87.55%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFL  PPSPTILSP   LLSSGG+ASCL LGRA  YR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLFE+VQIIHSYLKAETDLAPEIDGFNALL+ALVS+ LGELAMESYYLMKE+GCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++ GEA DLRTVKQDAQK+YGESLEFLEEE EAA A+ TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K6A9 Uncharacterized protein | e-value: 2.4e-107 | identity: 85.31%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLATP SPTI SP     SS G A CL+LGRA  Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDM+AVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIT+LASNGLFE+VQII SY+KAE DLAPEIDGFNALL+ALVS+ LGELAMESYYLMK+VGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATA
        SFRIVIKGL++ GEA DLRTVKQDAQ++YGESLEFLEEE E ATA
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATA

A0A5A7UZI6 Pentatricopeptide repeat-containing protein | e-value: 3.1e-107 | identity: 83.94%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFL T PSPTILSPP  L SS  K  CL+LGRA  Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDM+AVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIIT+LASNGLFE+VQII SY+KAETDLAPEIDGFNALL+ALV + LG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL+  GEA DLRTVKQDAQK+YGESLEFLEE  E ATA+  H
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like | e-value: 6.1e-111 | identity: 88.21%
Query:  MSFLATPPSP-TILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVL
        M+FLA PPSP TI S P  LLSSG K+SCLRLGRA EYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K D QQLDRVYDSKI RLLKFDMMAVL
Subjt:  MSFLATPPSP-TILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADIIT+LASNGLFEQVQIIHSYLK ETDLAPEIDGFNALLRALVSY LGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDK

Query:  TSFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATA
        TSFRIVIKGL++T EA DLR VKQDAQKIYG+ LEFLEEE+EAA A
Subjt:  TSFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATA

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e-value: 8.5e-113 | identity: 86.75%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQVQIIHSY KAETDLAPEI+GFN LL+ALVSY LGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLRTVK+DAQ++YGESLEFLEEE+EAAT + TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e-value: 5.5e-112 | identity: 86.35%
Query:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+++LASNGLFEQVQIIHSYLKAETDLAPEI+GFN LL+ALVSY LGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH
        SFRIVIKGL++T E+ DLR VK+DAQ++YGESLEFLEEE+EAAT + TH
Subjt:  SFRIVIKGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAATAVPTH

SwissProt top hits (e-value, % identity, alignment)
O82178 Pentatricopeptide repeat-containing protein At2g35130 | e-value: 1.6e-07 | identity: 28.83%
Query:  KEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRAL--VSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGLDATGEAAD
        + H  KP +  Y  ++   A  GL E+ + I   L+ E  L P++  +NAL+ +     Y  G  A E + LM+ +GCEPD+ S+ I++      G  +D
Subjt:  KEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRAL--VSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGLDATGEAAD

Query:  LRTVKQDAQKI
           V ++ +++
Subjt:  LRTVKQDAQKI

Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | e-value: 5.8e-18 | identity: 36.18%
Query:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAE
        +S E + A + LKR +  S +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAE

Query:  TDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL
          L  +   F  L+R  +  +L   AM  Y  M+E    P    FR+++KGL
Subjt:  TDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | e-value: 2.2e-17 | identity: 37.85%
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       ++RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITLLASNG

Query:  LFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGLDATGE
         F+++  +   +    D   +      L+RA+V  +  E  +  Y LM+E G      E D+    ++ KGL   GE
Subjt:  LFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGLDATGE

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 | e-value: 7.1e-08 | identity: 28.32%
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  +  +L +    +Q  ++   + +E  L P ID + +L+      +L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e-value: 8.1e-20 | identity: 35.19%
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQV

Query:  QIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P++  FR+++KGL
Subjt:  QIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL

Arabidopsis top hits (e-value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 4.1e-19 | identity: 36.18%
Query:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAE
        +S E + A + LKR +  S +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAE

Query:  TDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL
          L  +   F  L+R  +  +L   AM  Y  M+E    P    FR+++KGL
Subjt:  TDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | e-value: 1.6e-18 | identity: 37.85%
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       ++RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIITLLASNG

Query:  LFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGLDATGE
         F+++  +   +    D   +      L+RA+V  +  E  +  Y LM+E G      E D+    ++ KGL   GE
Subjt:  LFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGLDATGE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 5.8e-21 | identity: 35.19%
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQV

Query:  QIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P++  FR+++KGL
Subjt:  QIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGL

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 5.1e-09 | identity: 28.32%
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  +  +L +    +Q  ++   + +E  L P ID + +L+      +L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | e-value: 1.8e-59 | identity: 51.26%
Query:  SGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQN
        +GG+   L+    ++     MR  S+NRKPLQ+GR LSIEAIQAVQ+LKRA                  S  LDRV  SK +RLLKFDM+AVLRELLRQN
Subjt:  SGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQN

Query:  ECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVI
        ECSLALKVFE++RKE+WYKPQV +Y D+IT++A N L E+V  ++S +K+E  L  EI+ FN LL  L+++KL +L M+ Y  M+ +G EPD+ SFR+++
Subjt:  ECSLALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVI

Query:  KGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAAT
         GL++ GE      V+QDA + YGESLEF+EE+ E ++
Subjt:  KGLDATGEAADLRTVKQDAQKIYGESLEFLEEENEAAT
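In the alignment blocks above, the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, the identity of a single block can be estimated by counting the letters in that line. The helper name `percent_identity` is hypothetical, and the result need not match the reported % identity, which the search tool computes over the whole alignment and may round or treat gaps differently.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Estimate % identity for one alignment block.

    midline        -- the match line between Query and Subjt
    aligned_length -- number of aligned columns in the block; passed
                      explicitly because trailing spaces in the match
                      line are often lost when text is copied
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical residues; '+' and ' ' are not identities.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Toy block: 7 identical positions out of 10 aligned columns.
print(percent_identity("MSFL  PS+T", 10))
```

To approximate the figure for a full hit, the letter counts and block lengths would be summed across all blocks of that hit before dividing.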


Sequences
CDS sequence
ATGAGTTTCTTAGCAACTCCTCCGTCGCCGACGATTCTCAGTCCGCCGGACATGTTACTGAGCTCTGGCGGGAAAGCATCTTGCCTGCGACTAGGTAGGGCGGTGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAGTGAGAACCGGAAGCCATTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGCCAAGA
AAGATTCACAACAATTGGATCGAGTGTATGATTCCAAAATTAAGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTTTCATTGTATGCTGATATTATCACATTATTGGCTAGCAATGGATTGTTTGAACAAGT
GCAAATTATTCATTCCTACTTGAAAGCAGAAACTGATTTAGCGCCTGAAATTGACGGGTTTAACGCTCTTTTGAGGGCTTTGGTGAGTTACAAGTTAGGTGAACTTGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTAGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATAAAAGGATTGGACGCAACGGGAGAAGCAGCTGATTTAAGAACT
GTGAAGCAGGATGCACAAAAGATTTATGGTGAATCACTCGAGTTTCTCGAGGAAGAAAACGAGGCAGCTACAGCCGTACCGACGCACTGA
mRNA sequence
ATGAGTTTCTTAGCAACTCCTCCGTCGCCGACGATTCTCAGTCCGCCGGACATGTTACTGAGCTCTGGCGGGAAAGCATCTTGCCTGCGACTAGGTAGGGCGGTGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAGTGAGAACCGGAAGCCATTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGCCAAGA
AAGATTCACAACAATTGGATCGAGTGTATGATTCCAAAATTAAGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTTTCATTGTATGCTGATATTATCACATTATTGGCTAGCAATGGATTGTTTGAACAAGT
GCAAATTATTCATTCCTACTTGAAAGCAGAAACTGATTTAGCGCCTGAAATTGACGGGTTTAACGCTCTTTTGAGGGCTTTGGTGAGTTACAAGTTAGGTGAACTTGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTAGGTTGCGAGCCAGATAAGACTTCTTTCAGGATAGTCATAAAAGGATTGGACGCAACGGGAGAAGCAGCTGATTTAAGAACT
GTGAAGCAGGATGCACAAAAGATTTATGGTGAATCACTCGAGTTTCTCGAGGAAGAAAACGAGGCAGCTACAGCCGTACCGACGCACTGA
Protein sequence
MSFLATPPSPTILSPPDMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRKEHWYKPQVSLYADIITLLASNGLFEQVQIIHSYLKAETDLAPEIDGFNALLRALVSYKLGELAMESYYLMKEVGCEPDKTSFRIVIKGLDATGEAADLRT
VKQDAQKIYGESLEFLEEENEAATAVPTH
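The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard (nuclear) codon table applies and that the CDS starts in frame. The function name `translate` is illustrative and not taken from any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the CDS in this record.
print(translate("ATGAGTTTCTTAGCAACT"))  # MSFLAT
print(translate("GTACCGACGCACTGA"))     # VPTH
```

Passing the full CDS (with its line breaks joined) should reproduce the protein sequence of the record if the assumptions hold; a mismatch would point to a frame shift or a non-standard code.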