CuGenDBv2

Tan0002512 (gene) of Snake gourd v1 genome

Gene ID: Tan0002512
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein THYLAKOID ASSEMBLY 8, chloroplastic
Genome location: LG06:2201960..2203038
RNA-Seq Expression: Tan0002512
Synteny: Tan0002512
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like
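
The genome location field packs a linkage group and a coordinate range into one token. A minimal Python sketch for splitting it, assuming the `seqid:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the record does not state the convention). The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not stated in the record).
    return end - start + 1

seqid, start, end = parse_location("LG06:2201960..2203038")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1079 bp on LG06.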


Homology
GenBank top hits | e value | % identity
KAG7014066.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.0e-99 | 86.34
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY ADLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADE+N +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

XP_022954347.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita moschata] | 9.0e-100 | 86.78
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISAAA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY ADLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADE+N +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

XP_022992386.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Cucurbita maxima] | 1.2e-99 | 86.78
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY  DLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+YMVRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADEIN +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

XP_023522652.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Cucurbita pepo subsp. pepo] | 9.0e-100 | 86.34
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY ADLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADE+N +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

XP_038898717.1 protein THYLAKOID ASSEMBLY 8, chloroplastic [Benincasa hispida] | 1.1e-97 | 85.84
Query:  MALSLHSTFLRYQISIPIPISAA---AVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA S+HSTFL+ QISIPIP+SAA   AVS PVRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAE+SDPTKLQQVLS TLSRLLKADLVA+LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAA---AVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAV+RSEY ADLG+YAEVAAALSRNGA EEIDRLVCDLD G+  I+ DDKGLIKLIKAVI GDRRESTVRIYRMMRR+GWGS IKAD+Y+VRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGS
        S+GLRR GE+ELADEINREFQDLVG+
Subjt:  SRGLRRLGELELADEINREFQDLVGS

TrEMBL top hits | e value | % identity
A0A0A0K6N6 Uncharacterized protein | 1.4e-93 | 83.48
Query:  MALSLHSTFLRYQISIPIPISAA----AVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SLHSTFL+ QISIPIP S A    AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAE+SDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLRYQISIPIPISAA----AVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV++SEY A+LGLYAEVAAALSRNGA EEIDRLV DLD G+  IE   DDKGLIKLIKAVI G+RRESTVRIYRMMRR GWGS IKAD+YM+
Subjt:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSRGLRRLGELELADEINREFQDLVGSF
        +VLS+GLRRLGE+ELADEINREF+DLVGSF
Subjt:  RVLSRGLRRLGELELADEINREFQDLVGSF

A0A1S3CGD3 uncharacterized protein LOC103500597 | 6.7e-93 | 82.97
Query:  MALSLHSTFLRYQISIPIPIS----AAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SLHSTFL+ QISIPIP S    A AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAE+SDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLRYQISIPIPIS----AAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV+RSEY A+LGLYAEVAAALSRNGA EEIDRLVCDLD  +  IE   DDKGLIKLIKAVI G+RRESTVRIYRMMRR+GWGS IK D+YM+
Subjt:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSRGLRRLGELELADEINREFQDLVGS
        +V+S+GLRR+GE+ELADEINREFQDLVGS
Subjt:  RVLSRGLRRLGELELADEINREFQDLVGS

A0A5A7V0E2 Uncharacterized protein | 3.3e-92 | 82.97
Query:  MALSLHSTFLRYQISIPIPIS----AAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR
        MA SL STFL+ QISIPIP S    A AVSF VRCGPRDNRGPLVKGRTLS EAIQAIQSLKRAE+SDPTKLQQVLS TLSRLLKADLVA LKELLRQ+R
Subjt:  MALSLHSTFLRYQISIPIPIS----AAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDR

Query:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV
        CALALEVFAV+RSEY A+LGLYAEVAAALSRNGA EEIDRLVCDLD  +  IE   DDKGLIKLIKAVI G+RRESTVRIYRMMRR+GWGS IK D+YM+
Subjt:  CALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIEC--DDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMV

Query:  RVLSRGLRRLGELELADEINREFQDLVGS
        +V+S+GLRR+GELELADEINREFQDLVGS
Subjt:  RVLSRGLRRLGELELADEINREFQDLVGS

A0A6J1GQS0 protein THYLAKOID ASSEMBLY 8, chloroplastic | 4.3e-100 | 86.78
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISAAA   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY ADLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+Y VRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADE+N +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

A0A6J1JTE8 protein THYLAKOID ASSEMBLY 8, chloroplastic | 5.7e-100 | 86.78
Query:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC
        MA SLHSTFL+ QI IPIPISA A   VS PVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKL+QVLS TLSRLLKADLVA LKELLRQDRC
Subjt:  MALSLHSTFLRYQISIPIPISAAA---VSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRC

Query:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL
        ALALEVFAVVRSEY  DLG+YAEVAAALSRNGA EEIDRLVCDL++ +R I+CDDKGLIKLIKAVIGGDRRESTVRIYRMM+RSGWGS IKAD+YMVRVL
Subjt:  ALALEVFAVVRSEYSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVL

Query:  SRGLRRLGELELADEINREFQDLVGSF
        S+GLRRLGE+E+ADEIN +FQDLVGSF
Subjt:  SRGLRRLGELELADEINREFQDLVGSF

SwissProt top hits | e value | % identity
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350 | 3.5e-06 | 29.71
Query:  LSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEEIDRLVCDLDE
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  DL +
Subjt:  LSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEEIDRLVCDLDE

Query:  GERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRS
         E  +  D      L++  +  +     +R+Y  MR S
Subjt:  GERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRS

Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic | 4.5e-46 | 54.26
Query:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-SADLGLYAEVAAALSR
        +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA ++  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+EY   DL LYA++  AL+R
Subjt:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-SADLGLYAEVAAALSR

Query:  NGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSRGLRRLGELELADEIN
        N   +EIDRL+ ++D  +++   DDK L KLI+AV+G +RRES VR+Y +MR SGWGS + +ADEY+  VLS+GL RLGE +LA +++
Subjt:  NGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSRGLRRLGELELADEIN

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic | 3.2e-07 | 27.81
Query:  RGPLVKGRTL-STEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEE
        RGPL +G+ L   EA+  I  LKR  K D  KL + +   + RLLK D++A + EL RQ+  ALA+++F V++ +  Y  D+ +Y ++  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEE

Query:  IDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    +E   RVL +GL  L    L +++ ++F++L
Subjt:  IDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQDL

Arabidopsis top hits | e value | % identity
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.5e-07 | 29.71
Query:  LSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEEIDRLVCDLDE
        +S E + A + LKR + +   +L + + + +SRLLK+DLV+ L E  RQ++  L ++++ VVR E  Y  D+  Y ++   L+RN   +E  ++  DL +
Subjt:  LSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEEIDRLVCDLDE

Query:  GERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRS
         E  +  D      L++  +  +     +R+Y  MR S
Subjt:  GERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRS

AT3G27750.1 FUNCTIONS IN: molecular_function unknown | 3.2e-47 | 54.26
Query:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-SADLGLYAEVAAALSR
        +RCGPRDNRGPL+KGR LSTEAIQ+IQSLKRA ++  +    +    L RL+K+DL++ L+ELLRQD C LA+ V + +R+EY   DL LYA++  AL+R
Subjt:  VRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSEY-SADLGLYAEVAAALSR

Query:  NGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSRGLRRLGELELADEIN
        N   +EIDRL+ ++D  +++   DDK L KLI+AV+G +RRES VR+Y +MR SGWGS + +ADEY+  VLS+GL RLGE +LA +++
Subjt:  NGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGS-AIKADEYMVRVLSRGLRRLGELELADEIN

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.3e-08 | 27.81
Query:  RGPLVKGRTL-STEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEE
        RGPL +G+ L   EA+  I  LKR  K D  KL + +   + RLLK D++A + EL RQ+  ALA+++F V++ +  Y  D+ +Y ++  +L+++   +E
Subjt:  RGPLVKGRTL-STEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYAEVAAALSRNGAGEE

Query:  IDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQDL
           L   +   +  +  D +   ++I+  +        + +Y  M +    S    +E   RVL +GL  L    L +++ ++F++L
Subjt:  IDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQDL

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain | 2.3e-16 | 31.03
Query:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------KSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYA
        NR PL +GR LS EAIQA+Q+LKRA                S    L +V+ +   RLLK D+VA L+ELLRQ+ C+LAL+VF  +R E  Y   + +Y 
Subjt:  NRGPLVKGRTLSTEAIQAIQSLKRAE--------------KSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE--YSADLGLYA

Query:  EVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQD
        ++   ++ N   EE++ L   + + E+ +  + +    L+  ++     +  +  Y  M+  G+    + D    RVL  GL   GE+ L+  + ++  +
Subjt:  EVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQD

Query:  LVG
          G
Subjt:  LVG
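
In BLAST-style pairwise output, the middle line of each block encodes the match state per column: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below tallies those states for a single block, using the final block of the first GenBank hit as input. The function name `midline_counts` is hypothetical, and a figure computed from one block is not expected to reproduce the table's % identity, which covers the full alignment.

```python
def midline_counts(query, midline):
    """Count identities, positives, and columns in one alignment block.

    `query` and `midline` are the residue strings with the 'Query:  '
    prefix and the matching indentation already removed.
    """
    # A trailing mismatch can leave the midline shorter than the query; pad it.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

query = "SRGLRRLGELELADEINREFQDLVGSF"
midline = "S+GLRRLGE+E+ADE+N +FQDLVGSF"
identities, positives, columns = midline_counts(query, midline)
print(f"{identities}/{columns} identical ({100 * identities / columns:.2f}%), "
      f"{positives}/{columns} positive")
```

Working from the middle line rather than the `Subjt:` line is a deliberate choice here, since the `Subjt:` lines in this record repeat the query text.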


Sequences
CDS sequence
ATGGCTCTTTCTCTTCACTCCACATTTCTCAGATACCAAATATCAATTCCGATCCCCATCTCCGCCGCCGCCGTCTCATTTCCGGTACGCTGTGGCCCACGCGACAACCG
AGGGCCGCTAGTGAAAGGCAGAACCCTAAGCACCGAAGCAATCCAAGCCATTCAATCCCTAAAACGGGCCGAAAAATCCGACCCGACGAAGCTCCAACAAGTCCTCTCAA
ATACTCTCTCCCGATTGCTGAAAGCCGACCTCGTGGCCGCCCTGAAGGAGCTTCTCCGGCAGGATCGATGCGCCCTCGCCTTGGAGGTCTTCGCCGTCGTCCGATCGGAG
TATAGCGCCGATTTAGGTCTGTACGCGGAGGTCGCGGCGGCGCTGTCGAGGAACGGAGCGGGGGAGGAAATCGACCGGCTGGTGTGCGATTTGGACGAGGGAGAGAGGAA
AATCGAGTGCGATGATAAAGGTTTGATTAAGTTGATCAAGGCGGTGATTGGTGGAGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGAGGAGCGGTTGGG
GATCCGCCATTAAAGCTGATGAGTATATGGTTAGGGTTTTGAGTAGGGGTTTAAGGAGGCTTGGGGAATTGGAGTTGGCTGATGAGATCAATAGGGAATTTCAGGATTTA
GTGGGCAGTTTTTGA
mRNA sequence
CTTATTTTTTTTAAGAAGCCGGAGATTCAAATTTCTCTAATATTCTAAAAAAGAAAGTCCATTGATTTAGACAGGCTATTGGGCCCAACCTTTTCCAACTACTGGCCCAA
CAGAATACGTCAAGCAGACAAGTTACTGGGCCCAACATTACACAACTAATGGCCCAATAAGAAAACATTAATGGGCCAAGCCCAAGTCCAAAGTTACGTTCAATTCCCCA
TCGCCCAACAAGGGTATAATAGTCATTACAAAAACACCCCCACGAAAAAACCCAATCCATTACCCCTAATCTGCACTCCGCCATGGCTCTTTCTCTTCACTCCACATTTC
TCAGATACCAAATATCAATTCCGATCCCCATCTCCGCCGCCGCCGTCTCATTTCCGGTACGCTGTGGCCCACGCGACAACCGAGGGCCGCTAGTGAAAGGCAGAACCCTA
AGCACCGAAGCAATCCAAGCCATTCAATCCCTAAAACGGGCCGAAAAATCCGACCCGACGAAGCTCCAACAAGTCCTCTCAAATACTCTCTCCCGATTGCTGAAAGCCGA
CCTCGTGGCCGCCCTGAAGGAGCTTCTCCGGCAGGATCGATGCGCCCTCGCCTTGGAGGTCTTCGCCGTCGTCCGATCGGAGTATAGCGCCGATTTAGGTCTGTACGCGG
AGGTCGCGGCGGCGCTGTCGAGGAACGGAGCGGGGGAGGAAATCGACCGGCTGGTGTGCGATTTGGACGAGGGAGAGAGGAAAATCGAGTGCGATGATAAAGGTTTGATT
AAGTTGATCAAGGCGGTGATTGGTGGAGATAGAAGGGAATCGACGGTCAGGATTTATCGGATGATGAGGAGGAGCGGTTGGGGATCCGCCATTAAAGCTGATGAGTATAT
GGTTAGGGTTTTGAGTAGGGGTTTAAGGAGGCTTGGGGAATTGGAGTTGGCTGATGAGATCAATAGGGAATTTCAGGATTTAGTGGGCAGTTTTTGAAATTTTTTCTCTA
TGTTTTTGTTGTATAATAAGTTAATAACATCATGTATTTTTCTATTTACTTTGGATTATTGACTTAATCAATAATGTTGATATTACTTA
Protein sequence
MALSLHSTFLRYQISIPIPISAAAVSFPVRCGPRDNRGPLVKGRTLSTEAIQAIQSLKRAEKSDPTKLQQVLSNTLSRLLKADLVAALKELLRQDRCALALEVFAVVRSE
YSADLGLYAEVAAALSRNGAGEEIDRLVCDLDEGERKIECDDKGLIKLIKAVIGGDRRESTVRIYRMMRRSGWGSAIKADEYMVRVLSRGLRRLGELELADEINREFQDL
VGSF
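
One quick consistency check on the sequences above is to translate the CDS and compare the result with the listed protein. The following is a sketch using the standard genetic code (NCBI translation table 1) written out by hand rather than taken from any library. The function name `translate` is hypothetical, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTCTTTCTCTTCACTCCACATTTCTC"))  # first ten codons of the CDS
print(translate("GTGGGCAGTTTTTGA"))                  # last five codons of the CDS
```

These fragments translate to MALSLHSTFL and VGSF, matching the start and end of the protein sequence listed above. Passing the whole CDS extends the check to the full length.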