CuGenDBv2

Clc09G02030 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G02030
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Genome location: ClcChr09:1661236..1662264
RNA-Seq Expression: Clc09G02030
Synteny: Clc09G02030
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
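
The genome location string in this record can be split into a chromosome name and a coordinate pair. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention that this page does not state). `parse_location` is a hypothetical helper, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: start and end are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Inclusive coordinates: both endpoints count toward the span.
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("ClcChr09:1661236..1662264")
print(chrom, start, end, length)
```

Under that assumption, the gene spans 1029 bp on ClcChr09.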


Homology
GenBank top hits (e value, % identity, alignment)
XP_008462172.1 PREDICTED: uncharacterized protein LOC103500597 [Cucumis melo] (e value: 3.9e-107; identity: 84.65%)
Query:  FDFP----QGYYIHCE--IPNLHSAMALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQV
        FDFP    +GYY HCE   PNLH+AMA SLH+TFLKSQIS+PIP S A AAV VS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQV
Subjt:  FDFP----QGYYIHCE--IPNLHSAMALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQV

Query:  LSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRE
        LSTTLSRLLKADLVA LKELLRQERC LALEVFAVIRSEY A+LGLYAEVA ALSRNGAAEEIDRLVCDLDG DGVI+W  D+KGLIKL+KAVISG+RRE
Subjt:  LSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRE

Query:  STVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS
        STVRIYRMMRRNGWGS IK DDYM++V+SKGLRR+GE+ELADEINREFQDLVGS
Subjt:  STVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata] (e value: 3.2e-101; identity: 86.34%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA SLH+TFLKSQI +PIP+SAAA V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAV+RSEYGADLG+YAEVA ALSRNGA EEIDRLVCDL+  D VIQ D+KGLIKL+KAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSF
        SKGLRRLGEME+ADE+N +FQDLVGSF
Subjt:  SKGLRRLGEMELADEINREFQDLVGSF

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima] (e value: 4.1e-101; identity: 86.34%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA SLH+TFLKSQI +PIP+SA A V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAV+RSEYG DLG+YAEVA ALSRNGA EEIDRLVCDL+  D VIQ D+KGLIKL+KAVI GDRRESTVRIYRMM+R+GWGSTIKADDYMVRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSF
        SKGLRRLGEME+ADEIN +FQDLVGSF
Subjt:  SKGLRRLGEMELADEINREFQDLVGSF

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 7.1e-101; identity: 85.9%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA SLH+TFLKSQI +PIP+SA A V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAV+RSEYGADLG+YAEVA ALSRNGAAEEIDRLVCDL+  D  IQ D+KGLIKL+KAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSF
        SKGLRRLGEME+ADE+N +FQDLVGSF
Subjt:  SKGLRRLGEMELADEINREFQDLVGSF

XP_038898717.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida] (e value: 1.3e-107; identity: 92.04%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA S+H+TFLKSQIS+PIPVSAAAAV VSLPVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA+LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAVIRSEYGADLG+YAEVA ALSRNGAAEEIDRLVCDLDGGD +IQWD+KGLIKL+KAVISGDRRESTVRIYRMMRRNGWGSTIKADDY+VRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRR GEMELADEINREFQDLVG+
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6N6 Uncharacterized protein (e value: 3.2e-99; identity: 86.52%)
Query:  MALSLHATFLKSQISVPIPVSAAAA-VTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQER
        MA SLH+TFLKSQIS+PIP S A + V VS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQER
Subjt:  MALSLHATFLKSQISVPIPVSAAAA-VTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQER

Query:  CVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        C LALEVFAVI+SEY A+LGLYAEVA ALSRNGAAEEIDRLV DLDGGDGVI+W  D+KGLIKL+KAVISG+RRESTVRIYRMMRR GWGS IKADDYM+
Subjt:  CVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGSF
        +VLSKGLRRLGE+ELADEINREF+DLVGSF
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGSF

A0A1S3CGD3 uncharacterized protein LOC103500597 (e value: 1.9e-107; identity: 84.65%)
Query:  FDFP----QGYYIHCE--IPNLHSAMALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQV
        FDFP    +GYY HCE   PNLH+AMA SLH+TFLKSQIS+PIP S A AAV VS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQV
Subjt:  FDFP----QGYYIHCE--IPNLHSAMALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQV

Query:  LSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRE
        LSTTLSRLLKADLVA LKELLRQERC LALEVFAVIRSEY A+LGLYAEVA ALSRNGAAEEIDRLVCDLDG DGVI+W  D+KGLIKL+KAVISG+RRE
Subjt:  LSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRE

Query:  STVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS
        STVRIYRMMRRNGWGS IK DDYM++V+SKGLRR+GE+ELADEINREFQDLVGS
Subjt:  STVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDLVGS

A0A5A7V0E2 Uncharacterized protein (e value: 4.2e-99; identity: 86.9%)
Query:  MALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQER
        MA SL +TFLKSQIS+PIP S A AAV VS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQER
Subjt:  MALSLHATFLKSQISVPIPVS-AAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQER

Query:  CVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        C LALEVFAVIRSEY A+LGLYAEVA ALSRNGAAEEIDRLVCDLDG DGVI+W  D+KGLIKL+KAVISG+RRESTVRIYRMMRRNGWGS IK DDYM+
Subjt:  CVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQW--DEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGS
        +V+SKGLRR+GE+ELADEINREFQDLVGS
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGS

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic (e value: 1.5e-101; identity: 86.34%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA SLH+TFLKSQI +PIP+SAAA V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAV+RSEYGADLG+YAEVA ALSRNGA EEIDRLVCDL+  D VIQ D+KGLIKL+KAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSF
        SKGLRRLGEME+ADE+N +FQDLVGSF
Subjt:  SKGLRRLGEMELADEINREFQDLVGSF

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic (e value: 2.0e-101; identity: 86.34%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC
        MA SLH+TFLKSQI +PIP+SA A V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQ+RC
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERC

Query:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
         LALEVFAV+RSEYG DLG+YAEVA ALSRNGA EEIDRLVCDL+  D VIQ D+KGLIKL+KAVI GDRRESTVRIYRMM+R+GWGSTIKADDYMVRVL
Subjt:  VLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSF
        SKGLRRLGEME+ADEIN +FQDLVGSF
Subjt:  SKGLRRLGEMELADEINREFQDLVGSF

SwissProt top hits (e value, % identity, alignment)
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 (e value: 2.8e-07; identity: 27.12%)
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEEIDRLVCDLDG
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ +  L ++++ V+R E  Y  D+  Y ++ + L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEEIDRLVCDLDG

Query:  GDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
         +  + +D+     L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic (e value: 5.7e-45; identity: 50.45%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQ
        MALSL  T   S +S    +S        + +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQ

Query:  ERCVLALEVFAVIRSEY-GADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGS-TIKADDY
        + C LA+ V + +R+EY   DL LYA++  AL+RN   +EIDRL+ ++DG D   + D+K L KL++AV+  +RRES VR+Y +MR +GWGS + +AD+Y
Subjt:  ERCVLALEVFAVIRSEY-GADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGS-TIKADDY

Query:  MVRVLSKGLRRLGEMELADEIN
        +  VLSKGL RLGE +LA +++
Subjt:  MVRVLSKGLRRLGEMELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic (e value: 2.1e-07; identity: 28.34%)
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQE   LA+++F VI+ +  Y  D+ +Y ++ V+L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEE

Query:  IDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
           L   +   +  +  D +   ++++  +        + +Y  M +    S    ++   RVL KGL  L    L +++ ++F++L
Subjt:  IDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

Arabidopsis top hits (e value, % identity, alignment)
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.0e-08; identity: 27.12%)
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEEIDRLVCDLDG
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ +  L ++++ V+R E  Y  D+  Y ++ + L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEEIDRLVCDLDG

Query:  GDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
         +  + +D+     L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

AT3G27750.1 FUNCTIONS IN: molecular_function unknown (e value: 4.1e-46; identity: 50.45%)
Query:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQ
        MALSL  T   S +S    +S        + +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L+ELLRQ
Subjt:  MALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAALKELLRQ

Query:  ERCVLALEVFAVIRSEY-GADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGS-TIKADDY
        + C LA+ V + +R+EY   DL LYA++  AL+RN   +EIDRL+ ++DG D   + D+K L KL++AV+  +RRES VR+Y +MR +GWGS + +AD+Y
Subjt:  ERCVLALEVFAVIRSEY-GADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGS-TIKADDY

Query:  MVRVLSKGLRRLGEMELADEIN
        +  VLSKGL RLGE +LA +++
Subjt:  MVRVLSKGLRRLGEMELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.5e-08; identity: 28.34%)
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQE   LA+++F VI+ +  Y  D+ +Y ++ V+L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYAEVAVALSRNGAAEE

Query:  IDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
           L   +   +  +  D +   ++++  +        + +Y  M +    S    ++   RVL KGL  L    L +++ ++F++L
Subjt:  IDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain (e value: 3.3e-16; identity: 32.52%)
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L +V+ +   RLLK D+VA L+ELLRQ  C LAL+VF  IR E  Y   + +Y 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSTTLSRLLKADLVAALKELLRQERCVLALEVFAVIRSE--YGADLGLYA

Query:  EVAVALSRNGAAEEIDRLVCDLDGGDGV---IQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINRE
        ++   ++ N   EE++ L   +    G+   I+W       L+  +++    +  +  Y  M+  G+    + D    RVL  GL   GEM L+  + ++
Subjt:  EVAVALSRNGAAEEIDRLVCDLDGGDGV---IQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINRE

Query:  FQDLVG
          +  G
Subjt:  FQDLVG
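
The alignments in this section use a BLAST-style middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below shows how identity and positive counts can be read off one alignment block. `midline_stats` is a hypothetical helper, and the counts cover a single block only, so they need not match the whole-alignment % identity reported for a hit.

```python
def midline_stats(query, midline):
    """Count identical and positive positions in one alignment block.

    Both strings must be the aligned text only, without the 'Query:'
    label or its leading padding, and must have equal length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A position is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # Positives are identities plus conservative substitutions ('+').
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Final block of the Cucurbita moschata GenBank hit listed above.
query = "SKGLRRLGEMELADEINREFQDLVGSF"
midline = "SKGLRRLGEME+ADE+N +FQDLVGSF"
identical, positives, length = midline_stats(query, midline)
print(f"{identical}/{length} identical, {positives}/{length} positive")
```

Summing these counts over every block of a hit and dividing by the total aligned length would give a per-hit identity figure, assuming the page's percentage is defined over the aligned region.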


Sequences
CDS sequence
ATGGGCCAAGCCCAATTTCTGATAACATTCGATTTCCCCCAAGGGTACTACATTCATTGCGAAATCCCCAATCTGCACTCCGCCATGGCTCTGTCTCTTCACGCCACATT
TCTCAAATCCCAAATCTCGGTTCCGATCCCCGTCTCCGCCGCGGCTGCCGTCACCGTTTCCCTTCCGGTACGATGCGGCCCTCGGGACAACCGAGGACCGCTAGTGAAAG
GCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTGAAACGGGCCGAGAGATCCGATCCAACGAAGCTCCAACAAGTCCTCTCCACTACGCTCTCCCGATTG
CTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGAGCGGTGCGTCCTCGCCTTGGAGGTTTTCGCAGTAATCAGATCGGAGTACGGCGCCGATTTGGG
GCTGTACGCGGAGGTGGCAGTGGCGCTGTCGAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTGGACGGCGGAGACGGGGTGATTCAGTGGGATGAGA
AGGGTTTGATTAAGTTGATGAAGGCGGTGATTAGTGGGGATAGAAGGGAATCAACGGTCAGGATTTATCGGATGATGAGAAGGAACGGTTGGGGGTCCACCATTAAAGCT
GATGATTATATGGTTAGGGTTTTGAGTAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTTCAAGATTTAGTGGGCAGTTTTTGA
mRNA sequence
GTGAAATAGTAGGTGAATGAGCCTATTTTTTTCCCGACTAACGGCCGAACAGTACAAGTCAAGTAAACAAGTTACAAGGCCCAACATTACCCAACCGATGGCCCAATAGA
AAAACACATCAATGGGCCAAGCCCAATTTCTGATAACATTCGATTTCCCCCAAGGGTACTACATTCATTGCGAAATCCCCAATCTGCACTCCGCCATGGCTCTGTCTCTT
CACGCCACATTTCTCAAATCCCAAATCTCGGTTCCGATCCCCGTCTCCGCCGCGGCTGCCGTCACCGTTTCCCTTCCGGTACGATGCGGCCCTCGGGACAACCGAGGACC
GCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTGAAACGGGCCGAGAGATCCGATCCAACGAAGCTCCAACAAGTCCTCTCCACTACGC
TCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGAGCGGTGCGTCCTCGCCTTGGAGGTTTTCGCAGTAATCAGATCGGAGTACGGC
GCCGATTTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCGAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTGGACGGCGGAGACGGGGTGATTCA
GTGGGATGAGAAGGGTTTGATTAAGTTGATGAAGGCGGTGATTAGTGGGGATAGAAGGGAATCAACGGTCAGGATTTATCGGATGATGAGAAGGAACGGTTGGGGGTCCA
CCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGTAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTTCAAGATTTAGTGGGC
AGTTTTTGAAAATTTCTGAAGTTGGTAAATTGACACAAAATGTATTTGTTTTATATATTTATATTCTATATTTTTGTTGTATTGTAAGTTGTAACATTATATTACCATTT
ACTTTGGATAATTAACTAATTTGCCAAAGGTTGATGTAA
Protein sequence
MGQAQFLITFDFPQGYYIHCEIPNLHSAMALSLHATFLKSQISVPIPVSAAAAVTVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRL
LKADLVAALKELLRQERCVLALEVFAVIRSEYGADLGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGVIQWDEKGLIKLMKAVISGDRRESTVRIYRMMRRNGWGSTIKA
DDYMVRVLSKGLRRLGEMELADEINREFQDLVGSF
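
The protein sequence above is the conceptual translation of the CDS sequence. The Python sketch below checks that relationship on short fragments, assuming the standard genetic code. `translate` is a hypothetical helper written for illustration and uses only the standard library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 21 and last 12 nucleotides of the CDS listed on this page.
print(translate("ATGGGCCAAGCCCAATTTCTG"))
print(translate("GGCAGTTTTTGA"))
```

The two fragments translate to the first seven and last three residues of the listed protein. Passing the full CDS, with line breaks removed, to `translate` should reproduce the whole protein sequence if the record is internally consistent.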