; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014242 (gene) of Snake gourd v1 genome

Gene IDTan0014242
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG02:4875484..4876426
RNA-Seq ExpressionTan0014242
SyntenyTan0014242
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.9e-11187.55Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRL R E YRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKRAKK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLY DI+++LASNGLFEQVQIIHSY KAETDL PEI+GFN LL+ALV+YNLG+LAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES  E+VDLRTVK+DAQ+LYGESLEFLEEEDEAAT ISMH
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

KAG7011830.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]4.3e-11187.55Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRL R E YRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKRAKK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQV LY DI+++LASNGLFEQVQIIHSY KAETDL PEI+GFN LL+ALV+YNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES  E+VDLRTVK+DAQ+LYGESLEFLEEEDEAAT ISMH
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

XP_022135810.1 pentatricopeptide repeat-containing protein At1g62350-like [Momordica charantia]8.6e-11288.35Show/hide
Query:  MSFLATPPSP-TILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA PPSP TI S P+KLLSSG K+SCLRL RAEEYRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKR K +LQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MSFLATPPSP-TILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLY DIIT+LASNGLFEQVQIIHSYLK ETDL PEIDGFNALLRALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDK

Query:  TSFRIVIKGLESG-EAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM
        TSFRIVIKGLES  EAVDLR VKQDAQK+YG+ LEFLEEEDEAA   SM
Subjt:  TSFRIVIKGLESG-EAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]1.9e-11187.95Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRL R E YRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKRAKK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLY DI+++LASNGLFEQVQIIHSY KAETDL PEI+GFN LL+ALVSYNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES  E+VDLRTVK+DAQ+LYGESLEFLEEEDEAAT IS H
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]7.1e-11488.76Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFL  PPSPTILSP  KLLSSGG+ASCL L RAE YR+VTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKR KK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLY DII +LASNGLFE+VQIIHSYLKAETDL PEIDGFNALL+ALVS+NLGELAMESYYLMKE+GCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES GEAVDLRTVKQDAQKLYGESLEFLEEE+EAA  IS H
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

TrEMBL top hitse value%identityAlignment
A0A0A0K6A9 Uncharacterized protein8.2e-10885.48Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLATP SPTI SP  K  SS G A CL+L RAE Y RVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKR KK+LQQLDRVYDSKI RLLKFDM+AVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKEHWYKPQVSLY DIIT+LASNGLFE+VQII SY+KAE DL PEIDGFNALL+ALVS+NLGELAMESYYLMK+VGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM
        SFRIVIKGLES GEAVDLRTVKQDAQ+LYGESLEFLEEE+E AT  S+
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM

A0A5A7UZI6 Pentatricopeptide repeat-containing protein1.1e-10785.14Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFL T PSPTILSPP KL SS  K  CL+L RAE Y RVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKR KK+LQQLDRVYDSKI RLLKFDM+AVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLY DIIT+LASNGLFE+VQII SY+KAETDL PEIDGFNALL+ALV +NLG+LAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLE-SGEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLE  GEAVDLRTVKQDAQKLYGESLEFLEE +E AT IS+H
Subjt:  SFRIVIKGLE-SGEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like4.2e-11288.35Show/hide
Query:  MSFLATPPSP-TILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA PPSP TI S P+KLLSSG K+SCLRL RAEEYRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKR K +LQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MSFLATPPSP-TILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLY DIIT+LASNGLFEQVQIIHSYLK ETDL PEIDGFNALLRALVSYNLGELAMESYYLMK+VGCEPDK
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDK

Query:  TSFRIVIKGLESG-EAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM
        TSFRIVIKGLES  EAVDLR VKQDAQK+YG+ LEFLEEEDEAA   SM
Subjt:  TSFRIVIKGLESG-EAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISM

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic9.3e-11287.95Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRL R E YRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKRAKK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLY DI+++LASNGLFEQVQIIHSY KAETDL PEI+GFN LL+ALVSYNLGELAMESYYLMKEVGCEPDKT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES  E+VDLRTVK+DAQ+LYGESLEFLEEEDEAAT IS H
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic6.0e-11187.55Show/hide
Query:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR
        MSFLAT PS TILSPP  LL SGGK S LRL R E YRRVTMRGG+ENRKPLQKGRNLSIEAIQAVQSLKRAKK+LQQLDRVYDSKI RLLKFDMMAVLR
Subjt:  MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLY DI+++LASNGLFEQVQIIHSYLKAETDL PEI+GFN LL+ALVSYNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKT

Query:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH
        SFRIVIKGLES  E+VDLR VK+DAQ+LYGESLEFLEEEDEAAT IS H
Subjt:  SFRIVIKGLES-GEAVDLRTVKQDAQKLYGESLEFLEEEDEAATTISMH

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351301.7e-0625.42Show/hide
Query:  LQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIH
        L K +  + EAI   Q +KR        DR   +  +  L  ++        + ++  ++ K++ ++R  H  KP +  YT ++   A  GL E+ + I 
Subjt:  LQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIH

Query:  SYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGL-ESGEAVDLRTVKQDAQKL
          L+ E  L P++  +NAL+ +         A E + LM+ +GCEPD+ S+ I++     +G   D   V ++ ++L
Subjt:  SYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGL-ESGEAVDLRTVKQDAQKL

Q1PFH7 Pentatricopeptide repeat-containing protein At1g623505.8e-1833.85Show/hide
Query:  LSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAE
        +S E + A + LKR +    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAE

Query:  TDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKL------YGESLEFLEEEDEAATTIS
          L  +   F  L+R  +   L   AM  Y  M+E    P    FR+++KGL     +    VK D  +L      Y    +  E+ DE A T S
Subjt:  TDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKL------YGESLEFLEEEDEAATTIS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic3.2e-1637.57Show/hide
Query:  GGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYTDIITLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       + RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LY DI+  L  N 
Subjt:  GGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYTDIITLLASNG

Query:  LFEQVQIIHSYLKAETDLVPEIDGFN---------ALLRALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++            L+ EIDG +          L+RA+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKAETDLVPEIDGFN---------ALLRALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g531704.2e-0829.2Show/hide
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   YT +  +L +    +Q  ++   + +E  L P ID + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic8.1e-2033.33Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K++ ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQV

Query:  QIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKLYGE
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P++  FR+++KGL     +    VK+D ++L+ E
Subjt:  QIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKLYGE

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-1933.85Show/hide
Query:  LSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAE
        +S E + A + LKR +    +LDR   S +SRLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAE

Query:  TDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKL------YGESLEFLEEEDEAATTIS
          L  +   F  L+R  +   L   AM  Y  M+E    P    FR+++KGL     +    VK D  +L      Y    +  E+ DE A T S
Subjt:  TDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKL------YGESLEFLEEEDEAATTIS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown2.3e-1737.57Show/hide
Query:  GGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYTDIITLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       + RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LY DI+  L  N 
Subjt:  GGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYTDIITLLASNG

Query:  LFEQVQIIHSYLKAETDLVPEIDGFN---------ALLRALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL
         F+++            L+ EIDG +          L+RA+V     E  +  Y LM+E G      E D+    ++ KGL
Subjt:  LFEQVQIIHSYLKAETDLVPEIDGFN---------ALLRALVSYNLGELAMESYYLMKEVG-----CEPDKTSFRIVIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2133.33Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQV
        R PL +G+ L   EA+  +  LKR K++ ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQV

Query:  QIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKLYGE
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P++  FR+++KGL     +    VK+D ++L+ E
Subjt:  QIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTVKQDAQKLYGE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-0929.2Show/hide
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   YT +  +L +    +Q  ++   + +E  L P ID + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVG

Query:  -CEPDKTSFRIVI
         C+PD  +F ++I
Subjt:  -CEPDKTSFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain1.5e-5851.24Show/hide
Query:  SGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQ---------------LDRVYDSKISRLLKFDMMAVLRELLRQN
        +GG+   L+ A       + MR  ++NRKPLQ+GR LSIEAIQAVQ+LKRA   L                 LDRV  SK  RLLKFDM+AVLRELLRQN
Subjt:  SGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQ---------------LDRVYDSKISRLLKFDMMAVLRELLRQN

Query:  ECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVI
        ECSLALKVFE++RKE+WYKPQV +YTD+IT++A N L E+V  ++S +K+E  L+ EI+ FN LL  L+++ L +L M+ Y  M+ +G EPD+ SFR+++
Subjt:  ECSLALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVI

Query:  KGLESGEAVDLRT-VKQDAQKLYGESLEFLEEEDEAATTISM
         GLES   + L   V+QDA + YGESLEF+EE++E ++  S+
Subjt:  KGLESGEAVDLRT-VKQDAQKLYGESLEFLEEEDEAATTISM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTTAGCAACTCCTCCGTCACCGACGATTCTCAGTCCGCCGAACAAGCTACTGAGCTCCGGCGGGAAAGCATCTTGCCTGCGACTAGCCAGGGCAGAGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAATGAGAACCGGAAGCCGCTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCCAAGA
AAAATTTACAGCAATTGGACCGAGTGTATGATTCCAAAATTAGTCGTTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTCAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTCTCGCTGTATACTGATATTATTACATTATTGGCTAGCAATGGATTGTTCGAACAAGT
ACAGATTATTCATTCCTATTTGAAAGCAGAAACCGACTTAGTGCCTGAAATTGACGGGTTTAACGCTCTTTTGAGGGCCTTGGTTAGTTACAATTTAGGTGAACTCGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTTGGTTGTGAGCCAGATAAGACTTCTTTCAGGATAGTCATCAAAGGACTGGAATCGGGAGAGGCAGTTGATTTAAGAACTGTG
AAGCAGGATGCACAGAAGCTTTATGGTGAATCACTCGAGTTTCTAGAGGAAGAAGACGAGGCAGCTACAACCATATCTATGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTCTTAGCAACTCCTCCGTCACCGACGATTCTCAGTCCGCCGAACAAGCTACTGAGCTCCGGCGGGAAAGCATCTTGCCTGCGACTAGCCAGGGCAGAGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAATGAGAACCGGAAGCCGCTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAGCGAGCCAAGA
AAAATTTACAGCAATTGGACCGAGTGTATGATTCCAAAATTAGTCGTTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTCAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTCTCGCTGTATACTGATATTATTACATTATTGGCTAGCAATGGATTGTTCGAACAAGT
ACAGATTATTCATTCCTATTTGAAAGCAGAAACCGACTTAGTGCCTGAAATTGACGGGTTTAACGCTCTTTTGAGGGCCTTGGTTAGTTACAATTTAGGTGAACTCGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTTGGTTGTGAGCCAGATAAGACTTCTTTCAGGATAGTCATCAAAGGACTGGAATCGGGAGAGGCAGTTGATTTAAGAACTGTG
AAGCAGGATGCACAGAAGCTTTATGGTGAATCACTCGAGTTTCTAGAGGAAGAAGACGAGGCAGCTACAACCATATCTATGCACTGA
Protein sequenceShow/hide protein sequence
MSFLATPPSPTILSPPNKLLSSGGKASCLRLARAEEYRRVTMRGGNENRKPLQKGRNLSIEAIQAVQSLKRAKKNLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRKEHWYKPQVSLYTDIITLLASNGLFEQVQIIHSYLKAETDLVPEIDGFNALLRALVSYNLGELAMESYYLMKEVGCEPDKTSFRIVIKGLESGEAVDLRTV
KQDAQKLYGESLEFLEEEDEAATTISMH