CuGenDBv2

Carg04114 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg04114
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr19:7705513..7706509
RNA-Seq Expression: Carg04114
Synteny: Carg04114
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
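The Genome location field above uses a `sequence:start..end` notation. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state); `parse_location` is a hypothetical helper name, not part of CuGenDB:

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, span_length)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return seqid, start, end, end - start + 1

print(parse_location("Carg_Chr19:7705513..7706509"))
```

Under that assumption the gene spans 997 bp on Carg_Chr19.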


Homology
GenBank top hits (columns: hit | e-value | % identity; the alignment follows each hit)
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 7.1e-130 | 99.2
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLG+LAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

KAG7011830.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] | 6.4e-131 | 100
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] | 4.6e-129 | 98.8
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima] | 4.3e-127 | 97.59
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSY KAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLR VKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo] | 6.8e-125 | 95.58
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPS TILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQ+ LYADIVSVLASNGLFE VQIIHSY KAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMK+VGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLEST ESVDLRTVKKDAQELYGESLEFLEEED+ ATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
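In these BLAST-style alignments the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch. As a rough sketch of how the % identity column relates to that line (`percent_identity` is a hypothetical helper, and the input is a 10-residue excerpt from an alignment above, not a whole hit):

```python
def percent_identity(query, midline):
    """Percentage of alignment columns where the midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # A column is identical only when the midline shows the same letter;
    # '+' (similar) and ' ' (mismatch or gap) do not count.
    identical = sum(1 for q, m in zip(query, midline) if m == q and m.isalpha())
    return 100.0 * identical / len(query)

# Excerpt: one blank column out of ten.
print(percent_identity("YKPQVLLYAD", "YKPQV LYAD"))
```

Applied to a full hit, the denominator would be the total alignment length, including gap columns.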

TrEMBL top hits (columns: hit | e-value | % identity; the alignment follows each hit)
A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like | 7.2e-104 | 82.73
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFL T PS TILSPP  L  S  K   L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR E WYKPQV LYADI++VLASNGLFE+VQII SY KAETDLAPEI+GFN LLKALV +NLG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLE   E+VDLRTVK+DAQ+LYGESLEFLEE +E AT IS+H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

A0A5A7UZI6 Pentatricopeptide repeat-containing protein | 7.2e-104 | 82.73
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFL T PS TILSPP  L  S  K   L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR E WYKPQV LYADI++VLASNGLFE+VQII SY KAETDLAPEI+GFN LLKALV +NLG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLE   E+VDLRTVK+DAQ+LYGESLEFLEE +E AT IS+H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like | 1.4e-104 | 83.13
Query:  MSFLATTPS-STILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MSFLATTPS-STILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQV LYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALV+YNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDK

Query:  TSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISM
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   SM
Subjt:  TSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISM

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | 2.2e-129 | 98.8
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | 2.1e-127 | 97.59
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSY KAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH
        SFRIVIKGLESTRESVDLR VKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISMH

SwissProt top hits (columns: hit | e-value | % identity; the alignment follows each hit)
O82178 Pentatricopeptide repeat-containing protein At2g35130 | 2.5e-05 | 30.93
Query:  KPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVI-----KGLESTRESV
        KP +  Y  +V+  A  GL E+ + I    + E  L P++  +N L+++         A E + LM+ +GCEPD+ S+ I++      GL S  E+V
Subjt:  KPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVI-----KGLESTRESV

Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | 2.4e-16 | 32.79
Query:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +    K E
Subjt:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAE

Query:  TDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED
          L  +   F +L++  +   L   AM  Y  M+E    P    FR+++KGL    E  +   VK D  EL+   + +   ED
Subjt:  TDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | 1.1e-16 | 38.37
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVLLYADIVSVLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  ++LYADIV+ L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVLLYADIVSVLASNG

Query:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++  +        D   + +    L++A+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 | 6.7e-06 | 27.43
Query:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+ WY+P+   Y  +  VL +    +Q  ++     +E  L P I+ + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | 2.1e-20 | 33.7
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K++WY+P V +Y D++  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQV

Query:  QIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE
          +    K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+++KGL      +    VKKD +EL+ E
Subjt:  QIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE

Arabidopsis top hits (columns: hit | e-value | % identity; the alignment follows each hit)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.7e-17 | 32.79
Query:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +    K E
Subjt:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAE

Query:  TDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED
          L  +   F +L++  +   L   AM  Y  M+E    P    FR+++KGL    E  +   VK D  EL+   + +   ED
Subjt:  TDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | 7.8e-18 | 38.37
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVLLYADIVSVLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  ++LYADIV+ L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVLLYADIVSVLASNG

Query:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++  +        D   + +    L++A+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.5e-21 | 33.7
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K++WY+P V +Y D++  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQV

Query:  QIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE
          +    K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+++KGL      +    VKKD +EL+ E
Subjt:  QIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.7e-07 | 27.43
Query:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+ WY+P+   Y  +  VL +    +Q  ++     +E  L P I+ + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | 8.2e-60 | 52.26
Query:  GSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQ---------------LDRVYDSKIRRLLKFDMMAVLRELLRQ
        G+GG++  L+   V     + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKRA   L                 LDRV  SK RRLLKFDM+AVLRELLRQ
Subjt:  GSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQ---------------LDRVYDSKIRRLLKFDMMAVLRELLRQ

Query:  NECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIV
        NECSLALKVFE++RKE WYKPQV +Y D+++V+A N L E+V  ++S  K+E  L  EIE FN LL  L+ + L +L M+ Y  M+ +G EPD+ SFR++
Subjt:  NECSLALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIV

Query:  IKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISM
        + GLES  E      V++DA E YGESLEF+EE++E ++G S+
Subjt:  IKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISM


Sequences
CDS sequence
ATGAGTTTCTTAGCAACTACTCCGTCGTCGACGATTCTCAGTCCGCCGTACACGTTACTGGGCTCCGGCGGGAAAGTATCCTATCTGCGACTAGGCAGGGTGGAGGGATA
TCGGCGAGTGACAATGAGAGGCGGAAGTGAAAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCTAAGA
AAGATTTACAACAATTGGACCGAGTGTATGATTCTAAAATTAGACGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTTGAAGATGTTAGAAAGGAGGACTGGTACAAGCCTCAGGTTTTGCTGTATGCTGATATTGTTTCAGTATTGGCTAGCAATGGATTGTTCGAACAAGT
ACAAATAATTCATTCGTACTTCAAAGCAGAAACTGACCTAGCACCTGAAATTGAGGGGTTCAACAATCTTCTGAAGGCTTTGGTTACTTATAATCTAGGTGAACTTGCAA
TGGAGTCGTATTACTTAATGAAAGAAGTAGGTTGTGAGCCAGATAAGACTTCTTTCAGGATTGTCATAAAAGGATTGGAATCAACGAGAGAATCAGTTGATTTAAGAACT
GTGAAGAAGGATGCACAAGAGCTTTATGGTGAATCACTTGAGTTTCTAGAGGAAGAAGATGAAGCAGCTACCGGCATATCGATGCACTGA
mRNA sequence
TGGATGAACAAGTAGGCGGGAGGGACTACGCGGGAGAGAGATATTGAAGGATGAGTTTCTTAGCAACTACTCCGTCGTCGACGATTCTCAGTCCGCCGTACACGTTACTG
GGCTCCGGCGGGAAAGTATCCTATCTGCGACTAGGCAGGGTGGAGGGATATCGGCGAGTGACAATGAGAGGCGGAAGTGAAAACCGGAAGCCGTTGCAGAAGGGGAGGAA
CCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCTAAGAAAGATTTACAACAATTGGACCGAGTGTATGATTCTAAAATTAGACGCTTATTGAAGTTCG
ATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTGGCTCTTAAGGTTTTTGAAGATGTTAGAAAGGAGGACTGGTACAAGCCTCAGGTTTTGCTG
TATGCTGATATTGTTTCAGTATTGGCTAGCAATGGATTGTTCGAACAAGTACAAATAATTCATTCGTACTTCAAAGCAGAAACTGACCTAGCACCTGAAATTGAGGGGTT
CAACAATCTTCTGAAGGCTTTGGTTACTTATAATCTAGGTGAACTTGCAATGGAGTCGTATTACTTAATGAAAGAAGTAGGTTGTGAGCCAGATAAGACTTCTTTCAGGA
TTGTCATAAAAGGATTGGAATCAACGAGAGAATCAGTTGATTTAAGAACTGTGAAGAAGGATGCACAAGAGCTTTATGGTGAATCACTTGAGTTTCTAGAGGAAGAAGAT
GAAGCAGCTACCGGCATATCGATGCACTGA
Protein sequence
MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRKEDWYKPQVLLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVTYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRT
VKKDAQELYGESLEFLEEEDEAATGISMH
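
The protein sequence should be recoverable from the CDS sequence by codon-by-codon translation. A minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper, and the inputs are the first 30 and last 24 nucleotides of the CDS listed above:

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering of codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGTTTCTTAGCAACTACTCCGTCGTCG"))  # start of the CDS
print(translate("GCTACCGGCATATCGATGCACTGA"))        # end of the CDS, including TGA
```

The two fragments translate to MSFLATTPSS and ATGISMH, which match the beginning and end of the protein sequence shown above.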