CuGenDBv2

Lsi02G028110 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G028110
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Genome location: chr02:34300712..34301395
RNA-Seq Expression: Lsi02G028110
Synteny: Lsi02G028110
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology
GenBank top hits | e value | % identity | Alignment
XP_008462172.1 PREDICTED: uncharacterized protein LOC103500597 [Cucumis melo] | 1.7e-101 | 89.13
Query:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR
        MA SLHSTFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        CALALEVFAVIRSEY AELGLYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRIYRMMRRNGWGS IK DDYM+
Subjt:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGSL
        +V+SKGLRR+GE+ELADEINREFQDLVGSL
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGSL

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata] | 9.7e-102 | 87.61
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFLKSQI IPIP+S AA+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAV+RSEYGA+LG+YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRRLGEME+ADE+N +FQDLVGS
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima] | 1.3e-101 | 87.61
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAV+RSEYG +LG+YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRMM+R+GWGSTIKADDYMVRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRRLGEME+ADEIN +FQDLVGS
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] | 1.3e-101 | 87.61
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAV+RSEYGA+LG+YAEVA ALSRNGAAEEIDRLVCDL+  D  IQ DDKGLIKLIKAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRRLGEME+ADE+N +FQDLVGS
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

XP_038898717.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida] | 1.3e-109 | 93.83
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA S+HSTFLKSQISIPIPVS AA+VAVSLPVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA+LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAVIRSEYGA+LG+YAEVA ALSRNGAAEEIDRLVCDLDGGD LIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDY+VRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGSL
        SKGLRR GEMELADEINREFQDLVG++
Subjt:  SKGLRRLGEMELADEINREFQDLVGSL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K6N6 Uncharacterized protein | 1.2e-100 | 89.08
Query:  MALSLHSTFLKSQISIPIPVSGAAS-VAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR
        MA SLHSTFLKSQISIPIP S A S VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLKSQISIPIPVSGAAS-VAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        CALALEVFAVI+SEY AELGLYAEVA ALSRNGAAEEIDRLV DLDGGDG+I+W  DDKGLIKLIKAVISG+RRESTVRIYRMMRR GWGS IKADDYM+
Subjt:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGS
        +VLSKGLRRLGE+ELADEINREF+DLVGS
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGS

A0A1S3CGD3 uncharacterized protein LOC103500597 | 8.0e-102 | 89.13
Query:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR
        MA SLHSTFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        CALALEVFAVIRSEY AELGLYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRIYRMMRRNGWGS IK DDYM+
Subjt:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGSL
        +V+SKGLRR+GE+ELADEINREFQDLVGSL
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGSL

A0A5A7V0E2 Uncharacterized protein | 5.2e-101 | 88.7
Query:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR
        MA SL STFLKSQISIPIP S A A+VAVS  VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLKSQISIPIPVSGA-ASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV
        CALALEVFAVIRSEY AELGLYAEVA ALSRNGAAEEIDRLVCDLDG DG+I+W  DDKGLIKLIKAVISG+RRESTVRIYRMMRRNGWGS IK DDYM+
Subjt:  CALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQW--DDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMV

Query:  RVLSKGLRRLGEMELADEINREFQDLVGSL
        +V+SKGLRR+GE+ELADEINREFQDLVGSL
Subjt:  RVLSKGLRRLGEMELADEINREFQDLVGSL

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic | 4.7e-102 | 87.61
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFLKSQI IPIP+S AA+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAV+RSEYGA+LG+YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRMM+R+GWGSTIKADDY VRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRRLGEME+ADE+N +FQDLVGS
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic | 6.1e-102 | 87.61
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFLKSQI IPIP+S  A+V VSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLSTTLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL
        ALALEVFAV+RSEYG +LG+YAEVA ALSRNGA EEIDRLVCDL+  D +IQ DDKGLIKLIKAVI GDRRESTVRIYRMM+R+GWGSTIKADDYMVRVL
Subjt:  ALALEVFAVIRSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVL

Query:  SKGLRRLGEMELADEINREFQDLVGS
        SKGLRRLGEME+ADEIN +FQDLVGS
Subjt:  SKGLRRLGEMELADEINREFQDLVGS

SwissProt top hits | e value | % identity | Alignment
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | 7.2e-07 | 26.55
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDG
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ V+R E  Y  ++  Y ++ + L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDG

Query:  GDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
         +  + +D      L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | 3.5e-46 | 50.44
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAAL
        MALSL  T        P  +S + +++V +P      +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAAL

Query:  KELLRQDRCALALEVFAVIRSEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGS-T
        +ELLRQD C LA+ V + +R+EY   +L LYA++  AL+RN   +EIDRL+ ++DG D   + DDK L KLI+AV+  +RRES VR+Y +MR +GWGS +
Subjt:  KELLRQDRCALALEVFAVIRSEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGS-T

Query:  IKADDYMVRVLSKGLRRLGEMELADEIN
         +AD+Y+  VLSKGL RLGE +LA +++
Subjt:  IKADDYMVRVLSKGLRRLGEMELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | 1.9e-07 | 28.34
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQ+  ALA+++F VI+ +  Y  ++ +Y ++ V+L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEE

Query:  IDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    ++   RVL KGL  L    L +++ ++F++L
Subjt:  IDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

Arabidopsis top hits | e value | % identity | Alignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | 5.1e-08 | 26.55
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDG
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ V+R E  Y  ++  Y ++ + L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEEIDRLVCDLDG

Query:  GDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
         +  + +D      L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | 2.5e-47 | 50.44
Query:  MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAAL
        MALSL  T        P  +S + +++V +P      +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +     LS T   L RL+K+DL++ L
Subjt:  MALSLHSTFLKSQISIPIPVSGAASVAVSLP------VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTT---LSRLLKADLVAAL

Query:  KELLRQDRCALALEVFAVIRSEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGS-T
        +ELLRQD C LA+ V + +R+EY   +L LYA++  AL+RN   +EIDRL+ ++DG D   + DDK L KLI+AV+  +RRES VR+Y +MR +GWGS +
Subjt:  KELLRQDRCALALEVFAVIRSEY-GAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGS-T

Query:  IKADDYMVRVLSKGLRRLGEMELADEIN
         +AD+Y+  VLSKGL RLGE +LA +++
Subjt:  IKADDYMVRVLSKGLRRLGEMELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.3e-08 | 28.34
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + + T + RLLK D++A + EL RQ+  ALA+++F VI+ +  Y  ++ +Y ++ V+L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYAEVAVALSRNGAAEE

Query:  IDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    ++   RVL KGL  L    L +++ ++F++L
Subjt:  IDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREFQDL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | 2.7e-17 | 33.01
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L +V+ +   RLLK D+VA L+ELLRQ+ C+LAL+VF  IR E  Y  ++ +Y 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVIRSE--YGAELGLYA

Query:  EVAVALSRNGAAEEIDRLVCDLDGGDGL---IQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINRE
        ++   ++ N   EE++ L   +    GL   I+W +     L+  +++    +  +  Y  M+  G+    + D    RVL  GL   GEM L+  + ++
Subjt:  EVAVALSRNGAAEEIDRLVCDLDGGDGL---IQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINRE

Query:  FQDLVG
          +  G
Subjt:  FQDLVG
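
The middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch (the function name midline_stats is hypothetical and not part of CuGenDB), identities and positives can be counted for a single block as below. The example uses the final block of the Benincasa hispida hit. Note that the % identity values listed for each hit are computed over the whole alignment, and whitespace in scraped text may not be reliable, so per-block numbers will not necessarily match them.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline."""
    # Trailing spaces are often lost in copied text, so pad to query length.
    midline = midline.ljust(len(query))
    identical = sum(1 for c in midline if c.isalpha())  # letters = identical residues
    positive = identical + midline.count("+")           # '+' = conservative substitution
    length = len(query)                                  # number of aligned columns
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "pct_identity": round(100.0 * identical / length, 2),
    }


# Last block of the XP_038898717.1 alignment (prefixes and indentation removed).
query = "SKGLRRLGEMELADEINREFQDLVGSL"
midline = "SKGLRR GEMELADEINREFQDLVG++"
stats = midline_stats(query, midline)
print(stats)
```

Summing the "identical" and "length" counts over all blocks of a hit gives an estimate of its overall identity.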


Sequences
CDS sequence
ATGGCTTTGTCTCTTCACTCCACATTTCTCAAATCCCAAATCTCGATTCCGATCCCCGTCTCCGGCGCGGCTTCCGTCGCCGTTTCCCTTCCGGTACGCTGCGGCCCTCG
GGACAACCGAGGACCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTAAAACGAGCCGAGAGATCCGACCCGACGAAGCTCCAACAAG
TGCTCTCCACTACGCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGACCGGTGCGCCCTTGCTTTGGAGGTTTTCGCCGTAATC
CGATCCGAGTACGGCGCCGAATTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCAAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTAGACGGCGG
AGATGGGCTGATTCAGTGGGATGATAAGGGTTTGATTAAGTTGATTAAGGCGGTTATTAGTGGGGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGAAGGA
ACGGTTGGGGATCCACCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTT
CAAGATTTAGTGGGCAGTTTATGA
mRNA sequence
ATGGCTTTGTCTCTTCACTCCACATTTCTCAAATCCCAAATCTCGATTCCGATCCCCGTCTCCGGCGCGGCTTCCGTCGCCGTTTCCCTTCCGGTACGCTGCGGCCCTCG
GGACAACCGAGGACCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCTCTAAAACGAGCCGAGAGATCCGACCCGACGAAGCTCCAACAAG
TGCTCTCCACTACGCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTCCTCCGGCAGGACCGGTGCGCCCTTGCTTTGGAGGTTTTCGCCGTAATC
CGATCCGAGTACGGCGCCGAATTGGGGCTGTACGCGGAGGTGGCAGTGGCGCTGTCAAGGAACGGAGCGGCGGAGGAAATCGACCGGCTGGTTTGCGATTTAGACGGCGG
AGATGGGCTGATTCAGTGGGATGATAAGGGTTTGATTAAGTTGATTAAGGCGGTTATTAGTGGGGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGAAGGA
ACGGTTGGGGATCCACCATTAAAGCTGATGATTATATGGTTAGGGTTTTGAGCAAGGGTTTAAGGAGGCTTGGGGAAATGGAGTTGGCCGATGAGATCAATAGGGAATTT
CAAGATTTAGTGGGCAGTTTATGA
Protein sequence
MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVI
RSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREF
QDLVGSL
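
The genome location, CDS and protein sequence above can be cross-checked against one another. The sketch below assumes the location string uses 1-based inclusive coordinates and that the gene is a single exon without annotated UTRs, which the identical CDS and mRNA sequences suggest. The helper name span_length is hypothetical.

```python
import re


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' location (1-based, inclusive)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return end - start + 1


# Protein sequence as listed on this page, joined from its three lines.
protein = (
    "MALSLHSTFLKSQISIPIPVSGAASVAVSLPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSTTLSRLLKADLVAALKELLRQDRCALALEVFAVI"
    "RSEYGAELGLYAEVAVALSRNGAAEEIDRLVCDLDGGDGLIQWDDKGLIKLIKAVISGDRRESTVRIYRMMRRNGWGSTIKADDYMVRVLSKGLRRLGEMELADEINREF"
    "QDLVGSL"
)

span = span_length("chr02:34300712..34301395")
# One codon per residue plus one stop codon.
expected_cds = 3 * (len(protein) + 1)
print(span, expected_cds, span == expected_cds)
```

If the two numbers agree, the genomic span is fully accounted for by the coding sequence, which is consistent with an intronless gene model.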