CuGenDBv2

Lag0014823 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014823
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr12:5013773..5014710
RNA-Seq Expression: Lag0014823
Synteny: Lag0014823
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
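
The genome location above is given as a `chrom:start..end` string. As a minimal sketch of how such a string could be split into its parts, the snippet below uses a hypothetical helper, `parse_location`, which is not part of any CuGenDB tooling. The span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr12:5013773..5014710")
# Assumption: coordinates are 1-based and inclusive on both ends.
span = end - start + 1
print(chrom, start, end, span)
```

Under the stated coordinate assumption the locus spans 938 bp.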


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.3e-111 | % identity: 85.14
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PS TILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+ +LASNGLF+QVQIIHSY KAETDL PEI+GFN+LL+ALV+YNLG+LAMESYYLMKEVGCEP+KT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLRTVK+DAQ++YGESL FLEEEDEAAT +  H
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] | e value: 1.3e-112 | % identity: 86.35
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PS TILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+ +LASNGLF+QVQIIHSY KAETDL PEI+GFN+LL+ALVSYNLGELAMESYYLMKEVGCEP+KT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLRTVK+DAQ++YGESL FLEEEDEAAT + TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima] | e value: 8.7e-112 | % identity: 85.94
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PS TILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+ +LASNGLF+QVQIIHSYLKAETDL PEI+GFN+LL+ALVSYNLGELAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLR VK+DAQ++YGESL FLEEEDEAAT + TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 8.7e-112 | % identity: 85.14
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PSPTILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQ+SLYADI+ +LASNGLF+ VQIIHSYLKAETDL PEI+GFN+LL+ALVSYNLGELAMESYYLMK+VGCEP+KT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLRTVK+DAQ++YGESL FLEEED+ AT + TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida] | e value: 1.4e-114 | % identity: 87.15
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFL  PPSPTILSP Y LLSSGG+ASCL LGRA  YR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADIIR+LASNGLF++VQIIHSYLKAETDL PEIDGFN+LL+ALVS+NLGELAMESYYLMKE+GCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLES GEA DLRTVKQDAQK+YGESL FLEEE+EAA A+ TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like | e value: 1.0e-105 | % identity: 82.33
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFL   PSPTILSPP  L SS  K  CL+LGRA  Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDM+AVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLF++VQII SY+KAETDL PEIDGFN+LL+ALV +NLG+LAMESYYLMKEVGCEPNK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLE  GEA DLRTVKQDAQK+YGESL FLEE +E ATA+  H
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

A0A5A7UZI6 Pentatricopeptide repeat-containing protein | e value: 1.0e-105 | % identity: 82.33
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFL   PSPTILSPP  L SS  K  CL+LGRA  Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR KKD QQLDRVYDSKI+RLLKFDM+AVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVR EHWYKPQVSLYADII +LASNGLF++VQII SY+KAETDL PEIDGFN+LL+ALV +NLG+LAMESYYLMKEVGCEPNK 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLE  GEA DLRTVKQDAQK+YGESL FLEE +E ATA+  H
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like | e value: 1.8e-110 | % identity: 87.4
Query:  MSFLAIPPSP-TILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVL
        M+FLAIPPSP TI S P+ LLSSG K+SCLRLGRA EYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K D QQLDRVYDSKI RLLKFDMMAVL
Subjt:  MSFLAIPPSP-TILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVL

Query:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNK
        RELLRQNEC LALKVFEDVR EHWYKPQVSLYADII +LASNGLF+QVQIIHSYLK ETDL PEIDGFN+LLRALVSYNLGELAMESYYLMK+VGCEP+K
Subjt:  RELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNK

Query:  TSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATA
        TSFRIVI+GLEST EA DLR VKQDAQKIYG+ L FLEEEDEAA A
Subjt:  TSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATA

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e value: 6.5e-113 | % identity: 86.35
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PS TILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+ +LASNGLF+QVQIIHSY KAETDL PEI+GFN+LL+ALVSYNLGELAMESYYLMKEVGCEP+KT
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLRTVK+DAQ++YGESL FLEEEDEAAT + TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e value: 4.2e-112 | % identity: 85.94
Query:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR
        MSFLA  PS TILSPPY LL SGGK S LRLGR   YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKD QQLDRVYDSKI+RLLKFDMMAVLR
Subjt:  MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT
        ELLRQNECSLALKVFEDVRKE WYKPQVSLYADI+ +LASNGLF+QVQIIHSYLKAETDL PEI+GFN+LL+ALVSYNLGELAMESYYLMKEVGCEP+K 
Subjt:  ELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKT

Query:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH
        SFRIVI+GLEST E+ DLR VK+DAQ++YGESL FLEEEDEAAT + TH
Subjt:  SFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAATAVPTH

SwissProt top hits (e value, % identity, alignment)
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | e value: 1.8e-19 | % identity: 33.88
Query:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAE
        +S E + A + LKR +  S +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   D+ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAE

Query:  TDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED
          L  +   F  L+R  +   L   AM  Y  M+E    P    FR++++GL    E  +   VK D  +++   +V+   ED
Subjt:  TDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | e value: 1.7e-17 | % identity: 37.63
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIRLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       ++RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIRLLASNG

Query:  LFDQVQIIHSYLKAETDLVPEIDGFN---------SLLRALVSYNLGELAMESYYLMKEVG-----CEPNKTSFRIVIRGLESTGE
         FD++            L+ EIDG +          L+RA+V     E  +  Y LM+E G      E ++    ++ +GL   GE
Subjt:  LFDQVQIIHSYLKAETDLVPEIDGFN---------SLLRALVSYNLGELAMESYYLMKEVG-----CEPNKTSFRIVIRGLESTGE

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 | e value: 3.2e-08 | % identity: 29.2
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  + ++L +    DQ  ++   + +E  L P ID + SL+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVG

Query:  -CEPNKTSFRIVI
         C+P+  +F ++I
Subjt:  -CEPNKTSFRIVI

Q9SJN2 Pentatricopeptide repeat-containing protein At2g36240 | e value: 1.8e-06 | % identity: 28.83
Query:  RQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFR
        R  +   AL  F+ +++    KP V +Y  ++     +G  D+    +  +  E    P++  FN L+      +  +LA++ +  MKE GCEPN  SF 
Subjt:  RQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFR

Query:  IVIRGLESTGE
         +IRG  S+G+
Subjt:  IVIRGLESTGE

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | e value: 8.1e-20 | % identity: 32.12
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   D+ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQV

Query:  QIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P +  FR++++GL           VK+D ++++ E   +   E+
Subjt:  QIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED

Arabidopsis top hits (e value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.3e-20 | % identity: 33.88
Query:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAE
        +S E + A + LKR +  S +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR+E WY+P +  Y D++ +LA N   D+ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAE

Query:  TDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED
          L  +   F  L+R  +   L   AM  Y  M+E    P    FR++++GL    E  +   VK D  +++   +V+   ED
Subjt:  TDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | e value: 1.2e-18 | % identity: 37.63
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIRLLASNG
        G  +NR PL KGR LS EAIQ++QSLKRA +    L       ++RL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKP-QVSLYADIIRLLASNG

Query:  LFDQVQIIHSYLKAETDLVPEIDGFN---------SLLRALVSYNLGELAMESYYLMKEVG-----CEPNKTSFRIVIRGLESTGE
         FD++            L+ EIDG +          L+RA+V     E  +  Y LM+E G      E ++    ++ +GL   GE
Subjt:  LFDQVQIIHSYLKAETDLVPEIDGFN---------SLLRALVSYNLGELAMESYYLMKEVG-----CEPNKTSFRIVIRGLESTGE

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 5.8e-21 | % identity: 32.12
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQV
        R PL +G+ L   EA+  +  LKR K+D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++K+ WY+P V +Y D+I  LA +   D+ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQV

Query:  QIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED
          +   +K E +L P+   +  ++R  +       AM  Y  M +    P +  FR++++GL           VK+D ++++ E   +   E+
Subjt:  QIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRTVKQDAQKIYGESLVFLEEED

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.3e-09 | % identity: 29.2
Query:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +RK+HWY+P+   Y  + ++L +    DQ  ++   + +E  L P ID + SL+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVG

Query:  -CEPNKTSFRIVI
         C+P+  +F ++I
Subjt:  -CEPNKTSFRIVI

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | e value: 4.5e-58 | % identity: 50
Query:  SGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQN
        +GG+   L+    ++     MR  S+NRKPLQ+GR LSIEAIQAVQ+LKRA                  S  LDRV  SK +RLLKFDM+AVLRELLRQN
Subjt:  SGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQN

Query:  ECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVI
        ECSLALKVFE++RKE+WYKPQV +Y D+I ++A N L ++V  ++S +K+E  L+ EI+ FN+LL  L+++ L +L M+ Y  M+ +G EP++ SFR+++
Subjt:  ECSLALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVI

Query:  RGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAAT
         GLES GE      V+QDA + YGESL F+EE++E ++
Subjt:  RGLESTGEAADLRTVKQDAQKIYGESLVFLEEEDEAAT
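
The alignments above appear to follow the usual BLAST pairwise layout, where the middle line repeats a residue letter for an identical position, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch under that assumption, the hypothetical helper `midline_stats` below counts identities and positives for one block. The column count is taken from the query line because trailing spaces in a midline can be lost when a page is extracted.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment block.

    Assumes BLAST-style midlines: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    Hypothetical helper for illustration only.
    """
    columns = len(query)
    padded = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in padded)
    positives = identical + padded.count("+")
    return identical, positives, columns


# Last block of the Q9SCP4 alignment, with the "Query:  " prefix removed.
query = "-CEPNKTSFRIVI"
midline = " C+P+  +F ++I"
identical, positives, columns = midline_stats(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positives")
```

Summing these counts over every block of a hit and dividing identical positions by total columns should approximate the reported % identity, although the exact denominator the site uses is not stated on the page.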


Sequences
CDS sequence
ATGAGTTTCTTAGCAATTCCTCCGTCGCCGACGATTCTCAGTCCGCCGTACATGTTACTGAGCTCTGGCGGGAAAGCATCTTGCCTGCGACTAGGTAGGGCGGTGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAGTGAGAACCGGAAGCCATTGCAGAAGGGGAGGAACCTCAGCATTGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGCCAAGA
AAGATTCACAACAATTGGATCGAGTGTATGATTCCAAAATTAAGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTTTCGTTGTATGCTGATATTATCAGATTATTGGCTAGCAATGGATTGTTTGACCAAGT
GCAAATCATTCATTCCTACTTGAAAGCAGAAACTGATTTAGTGCCTGAAATTGACGGGTTTAACTCTCTTTTGAGGGCTTTGGTGAGTTACAATTTAGGTGAACTTGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTAGGTTGCGAGCCAAATAAGACTTCTTTCAGGATAGTCATAAGAGGATTGGAATCAACGGGAGAAGCAGCTGATTTAAGAACT
GTGAAGCAGGATGCACAAAAGATTTATGGTGAATCACTCGTGTTTCTGGAGGAAGAAGACGAGGCAGCTACAGCCGTACCGACACACTGA
mRNA sequence
ATGAGTTTCTTAGCAATTCCTCCGTCGCCGACGATTCTCAGTCCGCCGTACATGTTACTGAGCTCTGGCGGGAAAGCATCTTGCCTGCGACTAGGTAGGGCGGTGGAATA
TCGGAGAGTGACAATGAGAGGCGGAAGTGAGAACCGGAAGCCATTGCAGAAGGGGAGGAACCTCAGCATTGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGCCAAGA
AAGATTCACAACAATTGGATCGAGTGTATGATTCCAAAATTAAGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAAGGAACACTGGTACAAGCCTCAGGTTTCGTTGTATGCTGATATTATCAGATTATTGGCTAGCAATGGATTGTTTGACCAAGT
GCAAATCATTCATTCCTACTTGAAAGCAGAAACTGATTTAGTGCCTGAAATTGACGGGTTTAACTCTCTTTTGAGGGCTTTGGTGAGTTACAATTTAGGTGAACTTGCGA
TGGAGTCCTATTACTTGATGAAAGAAGTAGGTTGCGAGCCAAATAAGACTTCTTTCAGGATAGTCATAAGAGGATTGGAATCAACGGGAGAAGCAGCTGATTTAAGAACT
GTGAAGCAGGATGCACAAAAGATTTATGGTGAATCACTCGTGTTTCTGGAGGAAGAAGACGAGGCAGCTACAGCCGTACCGACACACTGA
Protein sequence
MSFLAIPPSPTILSPPYMLLSSGGKASCLRLGRAVEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKKDSQQLDRVYDSKIKRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRKEHWYKPQVSLYADIIRLLASNGLFDQVQIIHSYLKAETDLVPEIDGFNSLLRALVSYNLGELAMESYYLMKEVGCEPNKTSFRIVIRGLESTGEAADLRT
VKQDAQKIYGESLVFLEEEDEAATAVPTH
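
One way to sanity-check that the CDS and protein sequences above agree is to translate the CDS with the standard genetic code. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and the input is just the final line of the CDS as displayed, which only works because that line happens to begin on a codon boundary.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS and final line of the protein, as displayed above.
cds_tail = (
    "GTGAAGCAGGATGCACAAAAGATTTATGGTGAATCACTCGTGTTTCTGGAG"
    "GAAGAAGACGAGGCAGCTACAGCCGTACCGACACACTGA"
)
protein_tail = "VKQDAQKIYGESLVFLEEEDEAATAVPTH"
print(translate(cds_tail) == protein_tail)
```

The same function can be applied to the full CDS by joining its lines into one string before calling `translate`.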