CuGenDBv2

HG10005539 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10005539
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr07:3462002..3462952
RNA-Seq Expression: HG10005539
Synteny: HG10005539
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
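
The Genome location field can be split into a chromosome name and two coordinates when the record is processed by a script. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption about how the site reports positions.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'Chr07:3462002..3462952' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr07:3462002..3462952")
print(chrom, start, end, span_length(start, end))  # Chr07 3462002 3462952 951
```

Under that assumption the gene spans 951 bp on Chr07.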


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e-value 4.3e-111, identity 85.94%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLA NGLFEQVQ IHSY KAETDLAPEIEGFN LLKALV+YNLG+LAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT ISMH
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

KAG7011830.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] (e-value 9.6e-111, identity 85.94%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQV LYADI++VLA NGLFEQVQ IHSY KAETDLAPEIEGFN LLKALV+YNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT ISMH
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata] (e-value 4.3e-111, identity 86.35%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLA NGLFEQVQ IHSY KAETDLAPEIEGFN LLKALVSYNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT IS H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

XP_022969229.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita maxima] (e-value 6.6e-112, identity 87.15%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLA NGLFEQVQ IHSYLKAETDLAPEIEGFN LLKALVSYNLGELAMESYYLMKEVGCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT IS H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida] (e-value 6.6e-112, identity 86.75%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+    SPTI SP + L  +GG+A CL LG  EGYR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADII VLA NGLFE+VQ IHSYLKAETDLAPEI+GFNALLKALVS+NLGELAMESYYLMKE+GCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLES  EAVDLR VKQDAQKLYGESLEFLEEEEEAA AIS H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like (e-value 6.1e-111, identity 85.54%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+ TL SPTI SPP  LP +  K CCL+LG  EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLA NGLFE+VQ I SY+KAETDLAPEI+GFNALLKALV +NLG+LAMESYYLMKEVGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLE   EAVDLR VKQDAQKLYGESLEFLEE EE ATAIS+H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

A0A5A7UZI6 Pentatricopeptide repeat-containing protein (e-value 6.1e-111, identity 85.54%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+ TL SPTI SPP  LP +  K CCL+LG  EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLA NGLFE+VQ I SY+KAETDLAPEI+GFNALLKALV +NLG+LAMESYYLMKEVGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLE   EAVDLR VKQDAQKLYGESLEFLEE EE ATAIS+H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

A0A5D3CCT4 Pentatricopeptide repeat-containing protein (e-value 1.8e-110, identity 85.14%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+ TL SPTI SPP  LP +  K CCL+LG  EGY RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKIRRLLKFDM+AVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLA NGLFE+VQ I SY+KAETDLAPEI+GFNALLK LV +NLG+LAMESYYLMKEVGCEP+KA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLE   EAVDLR VKQDAQKLYGESLEFLEE EE ATAIS+H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic (e-value 2.1e-111, identity 86.35%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLA NGLFEQVQ IHSY KAETDLAPEIEGFN LLKALVSYNLGELAMESYYLMKEVGCEPDK 
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT IS H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic (e-value 3.2e-112, identity 87.15%)
Query:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR
        MSF+AT  S TI SPP+ L  +GGK   LRLG  EGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAK DLQQLDRVYDSKIRRLLKFDMMAVLR
Subjt:  MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLR

Query:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA
        ELLRQNECSLALKVFEDVR E WYKPQVSLYADI++VLA NGLFEQVQ IHSYLKAETDLAPEIEGFN LLKALVSYNLGELAMESYYLMKEVGCEPDKA
Subjt:  ELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKA

Query:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH
        SFRI IKGLEST E+VDLR VK+DAQ+LYGESLEFLEEE+EAAT IS H
Subjt:  SFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEEEAATAISMH

SwissProt top hits (e-value, % identity, alignment)
O82178 Pentatricopeptide repeat-containing protein At2g35130 (e-value 1.4e-08, identity 33.04%)
Query:  LALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFI---
        ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I   L+ E  L P++  +NAL+++         A E + LM+ +GCEPD+AS+ I +   
Subjt:  LALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFI---

Query:  --KGLESTAEAV
           GL S AEAV
Subjt:  --KGLESTAEAV

Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 (e-value 1.9e-16, identity 33.72%)
Query:  LSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAE

Query:  TDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLY
          L  +   F  L++  +   L   AM  Y  M+E    P    FR+ +KGL    E  +   VK D  +L+
Subjt:  TDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLY

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic (e-value 1.4e-16, identity 37.79%)
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKP-QVSLYADIITVLACNG
        G  +NR PL KGR LS EAIQ++QSLKRA      L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKP-QVSLYADIITVLACNG

Query:  LFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG-----CEPDKASFRIFIKGL
         F+++  +   +    D   + +    L++A+V     E  +  Y LM+E G      E D+    +  KGL
Subjt:  LFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG-----CEPDKASFRIFIKGL

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 (e-value 3.9e-06, identity 27.43%)
Query:  MMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG
        ++  L E +++N    ALK+F  +R +HWY+P+   Y  +  VL      +Q   +   + +E  L P I+ + +L+       L + A  +   MK V 
Subjt:  MMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG

Query:  -CEPDKASFRIFI
         C+PD  +F + I
Subjt:  -CEPDKASFRIFI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic (e-value 6.9e-19, identity 32.12%)
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQV
        R PL +G+ L   EA+  +  LKR K D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQV

Query:  QTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEE
          +   +K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+ +KGL      +    VK+D ++L+ E   +   EE
Subjt:  QTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value 1.3e-17, identity 33.72%)
Query:  LSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAE
        +S E + A + LKR +    +LDR   S + RLLK D+++VL E  RQN+  L +K++E VR E WY+P +  Y D++ +LA N   ++ + +   LK E
Subjt:  LSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAE

Query:  TDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLY
          L  +   F  L++  +   L   AM  Y  M+E    P    FR+ +KGL    E  +   VK D  +L+
Subjt:  TDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLY

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value 1.0e-09, identity 33.04%)
Query:  LALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFI---
        ++ K++ ++R+ H  KP +  Y  ++   A  GL E+ + I   L+ E  L P++  +NAL+++         A E + LM+ +GCEPD+AS+ I +   
Subjt:  LALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFI---

Query:  --KGLESTAEAV
           GL S AEAV
Subjt:  --KGLESTAEAV

AT3G27750.1 FUNCTIONS IN: molecular_function unknown (e-value 1.0e-17, identity 37.79%)
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKP-QVSLYADIITVLACNG
        G  +NR PL KGR LS EAIQ++QSLKRA      L       +RRL+K D+++VLRELLRQ+ C+LA+ V   +R E  Y P  + LYADI+  L  N 
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKP-QVSLYADIITVLACNG

Query:  LFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG-----CEPDKASFRIFIKGL
         F+++  +   +    D   + +    L++A+V     E  +  Y LM+E G      E D+    +  KGL
Subjt:  LFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVG-----CEPDKASFRIFIKGL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value 4.9e-20, identity 32.12%)
Query:  RKPLQKGRNL-SIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQV
        R PL +G+ L   EA+  +  LKR K D ++LD+   + + RLLK DM+AV+ EL RQ E +LA+K+FE ++ + WY+P V +Y D+I  LA +   ++ 
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQV

Query:  QTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEE
          +   +K E +L P+ + +  +++  +       AM  Y  M +    P++  FR+ +KGL      +    VK+D ++L+ E   +   EE
Subjt:  QTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQDAQKLYGESLEFLEEEE

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain (e-value 3.1e-59, identity 54.91%)
Query:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWY
        + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKRA                +    LDRV  SK RRLLKFDM+AVLRELLRQNECSLALKVFE++R E+WY
Subjt:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRA---------------KNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSLALKVFEDVRNEHWY

Query:  KPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQD
        KPQV +Y D+ITV+A N L E+V  ++S +K+E  L  EIE FN LL  L+++ L +L M+ Y  M+ +G EPD+ASFR+ + GLES  E      V+QD
Subjt:  KPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRAVKQD

Query:  AQKLYGESLEFLEEEEEAATAISM
        A + YGESLEF+EE+EE ++  S+
Subjt:  AQKLYGESLEFLEEEEEAATAISM
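
The alignment blocks above appear to follow the usual BLAST pairwise layout, in which the unlabelled middle line shows a letter where query and subject are identical, a "+" for a conservative substitution, and a space for a mismatch or gap. Under that assumption, a percent identity can be estimated from the middle line alone. The sketch below is illustrative: the function name `midline_identity` is hypothetical, and using the aligned length as the denominator is an assumption that may differ from how the site computed the listed values.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style match line.

    Letters in the match line mark identical residues; '+' and spaces do not count.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# First eight columns of the first GenBank alignment, with the leading label
# padding removed so that the match line lines up with the query.
query = "MSFIATLL"
midline = "MSF+AT  "
print(midline_identity(midline, len(query)))  # 62.5
```

For a full hit, the match-line segments of all blocks would be concatenated (each stripped of its leading padding) before counting.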


Sequences
CDS sequence
ATGAGTTTCATAGCAACTCTTCTGTCGCCGACAATTTTCAGTCCACCGTTCAATTTACCGAGAACTGGCGGGAAAGCATGCTGCCTGCGACTAGGCTGCACGGAGGGATA
TCGGAGAGTGACGATGAGAGGCGGAAGTGAAAACCGGAAGCCTCTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTGCAGTCGTTGAAGCGAGCGAAGA
ATGATTTACAACAACTGGACCGAGTGTATGATTCCAAAATTAGGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAATGAACACTGGTACAAGCCTCAGGTCTCGCTGTATGCTGATATTATTACAGTATTGGCTTGCAATGGATTGTTCGAACAAGT
ACAAACTATTCATTCGTACTTGAAAGCAGAAACTGACTTAGCACCTGAAATTGAAGGGTTTAACGCTCTTTTGAAGGCCTTGGTTAGTTATAACTTAGGTGAACTTGCGA
TGGAGTCGTATTACTTGATGAAAGAAGTAGGTTGTGAGCCAGATAAGGCTTCTTTCAGGATTTTCATAAAAGGATTGGAATCAACGGCAGAGGCAGTTGATTTAAGAGCT
GTGAAGCAGGATGCACAAAAGCTTTATGGTGAATCGCTTGAGTTTCTAGAGGAAGAAGAAGAGGCAGCTACAGCCATATCTATGCACTGA
mRNA sequence
ATGAGTTTCATAGCAACTCTTCTGTCGCCGACAATTTTCAGTCCACCGTTCAATTTACCGAGAACTGGCGGGAAAGCATGCTGCCTGCGACTAGGCTGCACGGAGGGATA
TCGGAGAGTGACGATGAGAGGCGGAAGTGAAAACCGGAAGCCTCTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTGCAGTCGTTGAAGCGAGCGAAGA
ATGATTTACAACAACTGGACCGAGTGTATGATTCCAAAATTAGGCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTCCTTCGCCAGAACGAGTGTTCTTTG
GCTCTTAAGGTTTTCGAAGATGTTAGAAATGAACACTGGTACAAGCCTCAGGTCTCGCTGTATGCTGATATTATTACAGTATTGGCTTGCAATGGATTGTTCGAACAAGT
ACAAACTATTCATTCGTACTTGAAAGCAGAAACTGACTTAGCACCTGAAATTGAAGGGTTTAACGCTCTTTTGAAGGCCTTGGTTAGTTATAACTTAGGTGAACTTGCGA
TGGAGTCGTATTACTTGATGAAAGAAGTAGGTTGTGAGCCAGATAAGGCTTCTTTCAGGATTTTCATAAAAGGATTGGAATCAACGGCAGAGGCAGTTGATTTAAGAGCT
GTGAAGCAGGATGCACAAAAGCTTTATGGTGAATCGCTTGAGTTTCTAGAGGAAGAAGAAGAGGCAGCTACAGCCATATCTATGCACTGA
Protein sequence
MSFIATLLSPTIFSPPFNLPRTGGKACCLRLGCTEGYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRAKNDLQQLDRVYDSKIRRLLKFDMMAVLRELLRQNECSL
ALKVFEDVRNEHWYKPQVSLYADIITVLACNGLFEQVQTIHSYLKAETDLAPEIEGFNALLKALVSYNLGELAMESYYLMKEVGCEPDKASFRIFIKGLESTAEAVDLRA
VKQDAQKLYGESLEFLEEEEEAATAISMH
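
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption (standard nuclear code, reading frame starting at the first base); the names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest), matching the amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed for HG10005539.
cds_start = "ATGAGTTTCATAGCAACTCTTCTGTCGCCG"
print(translate(cds_start))  # MSFIATLLSP
```

The output matches the first ten residues of the listed protein sequence. Passing the full CDS (with the line breaks removed) to `translate` allows the same comparison over the whole sequence.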