; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G008330 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G008330
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr19:8165668..8169164
RNA-Seq ExpressionCmoCh19G008330
SyntenyCmoCh19G008330
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.1e-12998.8Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALV+YNLG+LAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

KAG7011830.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]4.6e-12998.8Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQV LYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALV+YNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]8.4e-131100Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima]7.8e-12998.8Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSY KAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLR VKKDAQELYGESLEFLEEEDEAATGISTH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo]1.2e-12696.79Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPS TILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQ+SLYADIVSVLASNGLFE VQIIHSY KAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMK+VGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLEST ESVDLRTVKKDAQELYGESLEFLEEED+ ATGISTH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

TrEMBL top hitse value%identityAlignment
A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like1.4e-10483.13Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFL T PS TILSPP  L  S  K   L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLASNGLFE+VQII SY KAETDLAPEI+GFN LLKALV +NLG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLE   E+VDLRTVK+DAQ+LYGESLEFLEE +E AT IS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

A0A5A7UZI6 Pentatricopeptide repeat-containing protein1.4e-10483.13Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFL T PS TILSPP  L  S  K   L+LGR EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKDLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLASNGLFE+VQII SY KAETDLAPEI+GFN LLKALV +NLG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLE   E+VDLRTVK+DAQ+LYGESLEFLEE +E AT IS H
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like5.0e-10583.87Show/hide
Query:  MSFLATTPS-STILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MSFLATTPS-STILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDK
        RELLRQNEC LALKVFEDVR E WYKPQVSLYADI++VLASNGLFEQVQIIHSY K ETDLAPEI+GFN LL+ALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDK

Query:  TSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS
        TSFRIVIKGLEST E+VDLR VK+DAQ++YG+ LEFLEEEDEAA   S
Subjt:  TSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic4.0e-131100Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic3.8e-12998.8Show/hide
Query:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSY KAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH
        SFRIVIKGLESTRESVDLR VKKDAQELYGESLEFLEEEDEAATGISTH
Subjt:  SFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGISTH

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351301.9e-0530.93Show/hide
Query:  KPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVI-----KGLESTRESV
        KP +  Y  +V+  A  GL E+ + I    + E  L P++  +N L+++         A E + LM+ +GCEPD+ S+ I++      GL S  E+V
Subjt:  KPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVI-----KGLESTRESV

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623502.4e-1632.79Show/hide
Query:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +    K E
Subjt:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAE

Query:  TDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED
          L  +   F +L++  +   L   AM  Y  M+E    P    FR+++KGL    E  +   VK D  EL+   + +   ED
Subjt:  TDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic1.4e-1638.37Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVSLYADIVSVLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADIV+ L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVSLYADIVSVLASNG

Query:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++  +        D   + +    L++A+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531703.0e-0627.43Show/hide
Query:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+ WY+P+   Y  +  VL +    +Q  ++     +E  L P I+ + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic3.6e-2033.7Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K++WY+P V +Y D++  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQV

Query:  QIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE
          +    K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+++KGL      +    VKKD +EL+ E
Subjt:  QIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-1732.79Show/hide
Query:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +    K E
Subjt:  LSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAE

Query:  TDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED
          L  +   F +L++  +   L   AM  Y  M+E    P    FR+++KGL    E  +   VK D  EL+   + +   ED
Subjt:  TDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGESLEFLEEED

AT3G27750.1 FUNCTIONS IN: molecular_function unknown1.0e-1738.37Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVSLYADIVSVLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADIV+ L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKP-QVSLYADIVSVLASNG

Query:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++  +        D   + +    L++A+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-2133.7Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K++WY+P V +Y D++  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQV

Query:  QIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE
          +    K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+++KGL      +    VKKD +EL+ E
Subjt:  QIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRTVKKDAQELYGE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-0727.43Show/hide
Query:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+ WY+P+   Y  +  VL +    +Q  ++     +E  L P I+ + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain6.3e-6052.48Show/hide
Query:  GSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQ---------------LDRVYDSKIRRLLKFDMMAVLRELLRQ
        G+GG++  L+   V     + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKRA   L                 LDRV  SK RRLLKFDM+AVLRELLRQ
Subjt:  GSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQ---------------LDRVYDSKIRRLLKFDMMAVLRELLRQ

Query:  NECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIV
        NECSLALKVFE++RKE WYKPQV +Y D+++V+A N L E+V  ++S  K+E  L  EIE FN LL  L+++ L +L M+ Y  M+ +G EPD+ SFR++
Subjt:  NECSLALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIV

Query:  IKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS
        + GLES  E      V++DA E YGESLEF+EE++E ++G S
Subjt:  IKGLESTRESVDLRTVKKDAQELYGESLEFLEEEDEAATGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTTAGCAACTACTCCGTCGTCGACGATTCTCAGTCCGCCGTACACGTTACTGGGCTCCGGCGGGAAAGTATCCTATCTGCGACTAGGCAGGGTGGAGGGATA
TCGGCGAGTGACAATGAGAGGCGGAAGTGAAAATCGGAAGCCGTTGCAGAAGGGGAGGAACCTTAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCTAAGA
AAGATTTACAACAATTGGACCGAGTGTATGATTCTAAAATTAGGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTAAAGGTTTTTGAAGATGTTAGAAAGGAGGACTGGTACAAGCCTCAGGTTTCGCTGTATGCTGATATTGTTTCAGTATTGGCTAGCAATGGATTGTTCGAACAAGT
ACAAATAATTCATTCGTACTTCAAAGCAGAAACTGACCTAGCACCTGAAATTGAGGGGTTCAACAATCTTCTGAAGGCTTTGGTTAGTTATAATCTAGGTGAACTTGCAA
TGGAGTCGTATTACTTAATGAAAGAAGTAGGTTGTGAACCAGATAAGACTTCTTTCAGGATTGTCATAAAAGGATTGGAATCAACGAGAGAATCAGTTGATTTAAGAACT
GTGAAGAAGGATGCACAAGAGCTTTATGGTGAATCACTTGAGTTTCTAGAGGAAGAAGATGAAGCAGCTACCGGCATATCGACGCACTGA
mRNA sequenceShow/hide mRNA sequence
AGTAGGCGGGAGGGACTACGCGGGAGAGAGATATTGAAGGATGAGTTTCTTAGCAACTACTCCGTCGTCGACGATTCTCAGTCCGCCGTACACGTTACTGGGCTCCGGCG
GGAAAGTATCCTATCTGCGACTAGGCAGGGTGGAGGGATATCGGCGAGTGACAATGAGAGGCGGAAGTGAAAATCGGAAGCCGTTGCAGAAGGGGAGGAACCTTAGCATC
GAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCTAAGAAAGATTTACAACAATTGGACCGAGTGTATGATTCTAAAATTAGGCGCTTATTGAAGTTCGATATGATGGC
TGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTGGCTCTAAAGGTTTTTGAAGATGTTAGAAAGGAGGACTGGTACAAGCCTCAGGTTTCGCTGTATGCTGATA
TTGTTTCAGTATTGGCTAGCAATGGATTGTTCGAACAAGTACAAATAATTCATTCGTACTTCAAAGCAGAAACTGACCTAGCACCTGAAATTGAGGGGTTCAACAATCTT
CTGAAGGCTTTGGTTAGTTATAATCTAGGTGAACTTGCAATGGAGTCGTATTACTTAATGAAAGAAGTAGGTTGTGAACCAGATAAGACTTCTTTCAGGATTGTCATAAA
AGGATTGGAATCAACGAGAGAATCAGTTGATTTAAGAACTGTGAAGAAGGATGCACAAGAGCTTTATGGTGAATCACTTGAGTTTCTAGAGGAAGAAGATGAAGCAGCTA
CCGGCATATCGACGCACTGAGCTGCGGATGAACTTGTGGCGAGAAAGATGGCTTGTCTATGCTTCCTTGGAAGAAACCTGAAAATTCTAAGAAACTTTCATTGGATATGA
ACTTTAACTGTATACCGGCTTAAGTCAACCCGGGCTTAGCAGAGGAAATCTGTGATAGCTTTTACGGTTATGTTGGATCTCATGAGTATTGGATGAATTGAAATCGGTTT
ACAGTTTACTCTCTTCTCCTTGTTAATGCTTCCGTTTCTCAAGCTTCTGGAATCCAAAATCTCCCCTGATTGGCAGTGTCACTAGTCTATGCAGATAAGAGCTGATCATC
TTGCAACATACTTCAGCCACTCTGTTGCAAGCATGAGAAGTTCATCTAAGTCTACTCAAGACAGAAATAGAAGGGAAAAATCTGAAGTATAATGCAAGGTGGTTGTTACA
TGATTTCTTTTTCGTTCCAAAAGTACCGAATTATACGTAGTAGAATACATAAGGTACAAACCCGGTTCGGGATTGCTGCGTATTTCTGTAGAAACAAATATAGTAAGATG
AAAAATTATGTGGGCTTTACGCCCAGACTGCCAACATTGTGCAGTTTAACTCTGATATGATAATCTCGAGGTTTAACATGCCGTTTGCAGGTGCAACGCTGTAGGAAGTA
AGTGTTCTTCCACTGCTTGAGGTAACTTCGAAAGAGAGAGGTTGCCCATGAAGATTGACATTGCTTTGCCAGTTCTGTCCCCAGTTCCTTGCCATTGCAAATGCTGCCTC
TGACATCTCAAAATGTTCTTTCGGAAAATTACACCAACCACCATAATCGGAAGATAGGCCGTAATTTGGAGGACAGAAGTCGGTGGCTGTTAGAACGACGGTCGGGCTTC
CTTGCAAGCACCACAAGATGTGGTCAACACATCTAAGCTCATAGCAAGCTCCACAGACGAAGGAGAAGAGTACCTGTAACAATGGACCCATCTGTTTCTTTGGAGTATGT
TGCAGTAGCAGATTTCCACTCTTCATCCTTAGCCGCAAAGAGATTGGGTGTTTGCAGAAGAACCAAGTAGACTACTACAGCTACAAAGGCAGCCATTAAAGAGAATATAA
GGTCTTGAAGAAGCCTGTGCAGTATGTATACCTAAAATGTGCCCTTCGTTCACATTGGTGGCGCGTTTAACGAGGCTAAGGAAAAGACACGCACCCTGTTCTTGTATCTT
CTAGGCCGCCGTTAATGGGGGCTCAACGAAGCAAACTTGGAGCCTCCATTTCTGCCATTATCGAGGTAGAACGAAGGGTTTGAGGCTTTATGAAAATGGCGGCGGCAGAT
ACAGAGGCTACAGAGCAGAAAGCCATGGAAGCCAGGTATGTGTTCCTTTGCATTTTAAGACCCATGGATATAAAGAAACTTAAGTTAATAAGAACTCAAAAAGTGGCCTG
TCTTGGTTTTGATTGTTGGACCCGAATGTGTTTTTAAAAGAGTGGCTTTTGGTGGTTTAAT
Protein sequenceShow/hide protein sequence
MSFLATTPSSTILSPPYTLLGSGGKVSYLRLGRVEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRKEDWYKPQVSLYADIVSVLASNGLFEQVQIIHSYFKAETDLAPEIEGFNNLLKALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESTRESVDLRT
VKKDAQELYGESLEFLEEEDEAATGISTH