; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003018 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003018
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein THYLAKOID ASSEMBLY 8, chloroplastic
Genome locationchr4:47420837..47421523
RNA-Seq ExpressionLag0003018
SyntenyLag0003018
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014066.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-10187.72Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPIS ATA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYGADLGMYAEVAAALSRNGA EE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADE+N +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata]2.8e-10187.72Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPISAA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYGADLGMYAEVAAALSRNGA EE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADE+N +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima]7.5e-10288.16Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPIS ATA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYG DLGMYAEVAAALSRNGA EE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+YMVRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADEIN +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo]3.3e-10288.16Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPIS ATA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADE+N +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

XP_038898717.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida]1.1e-10088.94Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA S+HSTFLKSQ +IPIP+SAA AA AVS PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLS TLSRLLKADLVA+LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAV+RSEYGADLGMYAEVAAALSRNGAAEE+DRLVCDLD G+  IQ DDKGLIKLIKAVI GDRRESTVRIYRMMRR+GWGS IKAD+Y+VRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVG
        LSKGLRR GE+ELADEINREFQDLVG
Subjt:  LSKGLRRLGEVELADEINREFQDLVG

TrEMBL top hitse value%identityAlignment
A0A0A0K6N6 Uncharacterized protein8.6e-9683.48Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SLHSTFLKSQ +IPIP S AT+  AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV++SEY A+LG+YAEVAAALSRNGAAEE+DRLV DLD G+  I+   DDKGLIKLIKAVI G+RRESTVRIYRMMRR GWGS IKAD+YM+
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSKGLRRLGEVELADEINREFQDLVGTF
        +VLSKGLRRLGE+ELADEINREF+DLVG+F
Subjt:  RVLSKGLRRLGEVELADEINREFQDLVGTF

A0A1S3CGD3 uncharacterized protein LOC1035005975.1e-9684.28Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SLHSTFLKSQ +IPIP S ATAA AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLD--DGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV+RSEY A+LG+YAEVAAALSRNGAAEE+DRLVCDLD  DG  +   DDKGLIKLIKAVI G+RRESTVRIYRMMRR+GWGS IK D+YM+
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLD--DGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSKGLRRLGEVELADEINREFQDLVGT
        +V+SKGLRR+GE+ELADEINREFQDLVG+
Subjt:  RVLSKGLRRLGEVELADEINREFQDLVGT

A0A5A7V0E2 Uncharacterized protein7.3e-9583.84Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SL STFLKSQ +IPIP S ATAA AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAERSDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLD--DGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV+RSEY A+LG+YAEVAAALSRNGAAEE+DRLVCDLD  DG  +   DDKGLIKLIKAVI G+RRESTVRIYRMMRR+GWGS IK D+YM+
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLD--DGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSKGLRRLGEVELADEINREFQDLVGT
        +V+SKGLRR+GE+ELADEINREFQDLVG+
Subjt:  RVLSKGLRRLGEVELADEINREFQDLVGT

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic1.4e-10187.72Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPISAA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYGADLGMYAEVAAALSRNGA EE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADE+N +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic3.6e-10288.16Show/hide
Query:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MAFSLHSTFLKSQ  IPIPIS ATA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAE+SDPTKL+QVLS TLSRLLKADLVA LKELLRQDR
Subjt:  MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV
        CALALEVFAVVRSEYG DLGMYAEVAAALSRNGA EE+DRLVCDL+D +  IQCDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+YMVRV
Subjt:  CALALEVFAVVRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRV

Query:  LSKGLRRLGEVELADEINREFQDLVGTF
        LSKGLRRLGE+E+ADEIN +FQDLVG+F
Subjt:  LSKGLRRLGEVELADEINREFQDLVGTF

SwissProt top hitse value%identityAlignment
Q1PFH7 Pentatricopeptide repeat-containing protein At1g623505.5e-0727.68Show/hide
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEEMDRLVCDLDD
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEEMDRLVCDLDD

Query:  GESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL
         + ++  D      L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic3.5e-4654.26Show/hide
Query:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-GADLGMYAEVAAALSR
        +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+EY   DL +YA++  AL+R
Subjt:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-GADLGMYAEVAAALSR

Query:  NGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSKGLRRLGEVELADEIN
        N   +E+DRL+ ++D  + +   DDK L KLI+AV+G +RRES VR+Y +MR SGWGS + +ADEY+  VLSKGL RLGE +LA +++
Subjt:  NGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSKGLRRLGEVELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic1.1e-0728.34Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + +   + RLLK D++A + EL RQ+  ALA+++F V++ +  Y  D+ MY ++  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEE

Query:  MDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    +E   RVL KGL  L    L +++ ++F++L
Subjt:  MDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-0827.68Show/hide
Query:  LSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEEMDRLVCDLDD
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  DL  
Subjt:  LSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEEMDRLVCDLDD

Query:  GESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL
         + ++  D      L++  +  +     +R+Y  MR     S  +      RV+ KGL  +   EL +++  +F +L
Subjt:  GESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL

AT3G27750.1 FUNCTIONS IN: molecular_function unknown2.5e-4754.26Show/hide
Query:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-GADLGMYAEVAAALSR
        +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA R+  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+EY   DL +YA++  AL+R
Subjt:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-GADLGMYAEVAAALSR

Query:  NGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSKGLRRLGEVELADEIN
        N   +E+DRL+ ++D  + +   DDK L KLI+AV+G +RRES VR+Y +MR SGWGS + +ADEY+  VLSKGL RLGE +LA +++
Subjt:  NGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSKGLRRLGEVELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein7.9e-0928.34Show/hide
Query:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEE
        RGPL +G+ L   EA+  I  LKR  + D  KL + +   + RLLK D++A + EL RQ+  ALA+++F V++ +  Y  D+ MY ++  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYAEVAAALSRNGAAEE

Query:  MDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    +E   RVL KGL  L    L +++ ++F++L
Subjt:  MDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQDL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain3.9e-1631.53Show/hide
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L +V+ +   RLLK D+VA L+ELLRQ+ C+LAL+VF  +R E  Y   + MY 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------RSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YGADLGMYA

Query:  EVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQD
        ++   ++ N   EE++ L   +   E  +  + +    L+  ++     +  +  Y  M+  G+    + D    RVL  GL   GE+ L+  + ++  +
Subjt:  EVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINREFQD

Query:  LVG
          G
Subjt:  LVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTTTCTCTTCACTCCACATTTCTCAAATCCCAATTTGCAATTCCGATTCCCATCTCCGCCGCAACCGCCGCCGCCGCCGTCTCTTTTCCGGTACGCTGCGGCCC
ACGCGACAACCGAGGGCCGCTAGTTAAAGGCAGAACCCTAAGCACCGAGGCAATCCAAGCCATTCAATCCCTAAAACGGGCGGAAAGATCCGACCCGACGAAGCTCCAAC
AAGTCCTCTCAAATACTCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTTCTCCGGCAGGACCGGTGCGCCCTCGCTTTGGAGGTTTTCGCCGTC
GTCCGATCGGAGTACGGCGCGGATTTAGGGATGTACGCGGAGGTCGCGGCGGCGCTGTCGAGGAACGGAGCGGCGGAGGAAATGGACCGGCTGGTGTGCGATTTGGACGA
CGGAGAGAGTAAAATCCAGTGCGATGACAAGGGTTTGATTAAGTTGATCAAGGCGGTGATTGGTGGAGATCGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGA
GGAGCGGTTGGGGGTCCGCCATTAAAGCTGATGAGTATATGGTTAGGGTTTTGAGTAAGGGTTTAAGGAGACTTGGGGAAGTGGAATTGGCTGATGAGATCAATAGGGAA
TTTCAAGATTTAGTGGGGACTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTTTCTCTTCACTCCACATTTCTCAAATCCCAATTTGCAATTCCGATTCCCATCTCCGCCGCAACCGCCGCCGCCGCCGTCTCTTTTCCGGTACGCTGCGGCCC
ACGCGACAACCGAGGGCCGCTAGTTAAAGGCAGAACCCTAAGCACCGAGGCAATCCAAGCCATTCAATCCCTAAAACGGGCGGAAAGATCCGACCCGACGAAGCTCCAAC
AAGTCCTCTCAAATACTCTCTCCCGATTGCTCAAAGCCGACCTCGTCGCCGCCCTGAAGGAGCTTCTCCGGCAGGACCGGTGCGCCCTCGCTTTGGAGGTTTTCGCCGTC
GTCCGATCGGAGTACGGCGCGGATTTAGGGATGTACGCGGAGGTCGCGGCGGCGCTGTCGAGGAACGGAGCGGCGGAGGAAATGGACCGGCTGGTGTGCGATTTGGACGA
CGGAGAGAGTAAAATCCAGTGCGATGACAAGGGTTTGATTAAGTTGATCAAGGCGGTGATTGGTGGAGATCGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGA
GGAGCGGTTGGGGGTCCGCCATTAAAGCTGATGAGTATATGGTTAGGGTTTTGAGTAAGGGTTTAAGGAGACTTGGGGAAGTGGAATTGGCTGATGAGATCAATAGGGAA
TTTCAAGATTTAGTGGGGACTTTTTGA
Protein sequenceShow/hide protein sequence
MAFSLHSTFLKSQFAIPIPISAATAAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAERSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAV
VRSEYGADLGMYAEVAAALSRNGAAEEMDRLVCDLDDGESKIQCDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSKGLRRLGEVELADEINRE
FQDLVGTF