; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21030 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21030
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:17111058..17112624
RNA-Seq ExpressionCSPI03G21030
SyntenyCSPI03G21030
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061366.1 retrotransposon protein [Cucumis melo var. makuwa]1.8e-15857.77Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +SLKEMMLEMKK ++R+ DE++E+H  KK+EE+GTT +G ++KLKGKL++TE   E+ G+   R              NPESW YRAEHFF+INNL EAE
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q GSY+DYVKKFVTYS+PLP MAESVL+DA  TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALK-----LAMEEMDKVEPKRSESSSKLQGKGEKVT------IPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE
        SRHPQTLEE M++AQLVNDRN+ALK     L M E D+    +   + +++    + T      IP+KG ++K +PP+KRL D EFRARLD+G CFRCN+
Subjt:  SRHPQTLEECMKKAQLVNDRNIALK-----LAMEEMDKVEPKRSESSSKLQGKGEKVT------IPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE

Query:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL
        KYSP HRCK +EKRELM  I+NEED+ E  ++ ++ A   +EL  LEL EDT IEL+ +T ++SKGTMKLKG V  KE+V+L NSGAT+NFI Q L  EL
Subjt:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL

Query:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT
        Q+ ++  T+FG TI NGTRC+G+G+C+RV++KLKE+TI+ADFLA ELG VD VLG+QWLD+TGTM++HWPSLTM FW  GR+I+LKGDPSL K+ECSLKT
Subjt:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT

Query:  LEKTWQSGDQGFLLEFQNYEV
        L KTWQ  DQGFLLE+ N E+
Subjt:  LEKTWQSGDQGFLLEFQNYEV

XP_031737605.1 uncharacterized protein LOC116402475 [Cucumis sativus]3.2e-21679.27Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAEK
        MSLK+MM EMKK    +TDELRE+HG KK+EEAGTTE   LKLKGKLE+  T  ESGGSNA R             VNPESWAYRAEHFFDINNL EAEK
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAEK

Query:  VKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVIS
        VKVAVVSFGQEE                                QSLGAR+I I+Q+GSYSDYVKKFV YSAPLPDMA+S+LLDAF TGLEPALQAEVIS
Subjt:  VKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVIS

Query:  RHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE
        RHPQTLEECMK+AQLVNDRNIALKLAMEEMDKVEPKRSE+SSKLQGK EK            VTIPLK DYQKKDPPIKRL D EFRARLDKG CFRCNE
Subjt:  RHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE

Query:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL
        KYSP HRCKGREKRELMLLILNEE+D+ERESNTEDEA EIIELNQLELNEDTPIELRLIT VTSKGTMKLKGHVNGKEVVIL +SGATNNFISQVLVDEL
Subjt:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL

Query:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT
        QLSID GTRFGV I NGTRCEGRGICKRVKVKLKELTIVADFLA ELG+VDLVLG+QWLDSTGTMKVHWPSLTMTFWTKGRRIILKGD SLTK ECSL+T
Subjt:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT

Query:  LEKTWQSGDQGFLLEFQNYEV
        LEKTWQSGDQGFLLEFQNYEV
Subjt:  LEKTWQSGDQGFLLEFQNYEV

XP_031742100.1 uncharacterized protein LOC116404055 [Cucumis sativus]3.5e-19171.98Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAEK
        MSLKEMMLEMKK MDRMT ELRENHG KK+EEAGTT+   LKLKGKLE+  TT ESGGS+A R             VNPESWAYRAEHFFDIN+L EAEK
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAEK

Query:  VKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVIS
        VKVAVVSFGQEE                                QSLGAR+I IKQ+ SYS+YVKKFVTYSAPLPDMAESVLLDAF TGLEP+LQAEVIS
Subjt:  VKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVIS

Query:  RHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE
        RHPQTLEECMK+AQL NDRNIALKLAMEE+D VEPKR+E+SSK QGK EK            VTIPLK DYQKKD PIKRL D EFRARLDKG CFRCNE
Subjt:  RHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE

Query:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL
        KY+P HRCKGREKRELMLLILNEE+D++RE +TEDEA E+IELN LELN D PIELRLIT VTSKGTMKLKGH                           
Subjt:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL

Query:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT
         LSID GTRFGVTI N  +CEG GICKRVKVKLKELTIVADFLA ELG VDLVLG+QWLDSTGTMKVHWPSLTMTFW KGRRIILKGDPSLTK ECSL+T
Subjt:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT

Query:  LEKTWQSGDQGFLLEFQNYEV
        LEKTWQSGDQGFLLEFQNYEV
Subjt:  LEKTWQSGDQGFLLEFQNYEV

XP_031745472.1 uncharacterized protein K02A2.6-like [Cucumis sativus]3.0e-19076.26Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRRVNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE
        MSLKEMMLEMKK MDRMTDELRENHG KK+EEAGTT     K+K  +             + RR   ESW    E  FD  N                 +
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRRVNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE

Query:  QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKL
        QSLGAR+I IKQ+GSYS+YVKKFVTYSAPLPDMAESVLLDAF TGLEP+LQAEVISRHPQTLEECMK+AQLVNDRNIALKLAMEE+D VEPKR+E+SSK 
Subjt:  QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKL

Query:  QGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELN
        QGK EK            VTIPLK DYQKKDPPIKRL D EFRARLDKG CFRCNEKY+P HRCKGREKRELMLLILNEE+D++RE +TEDEA E+IELN
Subjt:  QGKGEK------------VTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELN

Query:  QLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLA
         LELN D PIELRLIT VTSKGTMKLKGHVNG+EVVIL +SGATNNFISQVLVDELQLSID GTRFGVTI NG +CEG GICKRVKVKLKELTIVADFLA
Subjt:  QLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLA

Query:  GELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQNYE
         ELG VDLVLG+QWLDSTGTMKVHWPSLTMTFW KGRRIILKGDPSLTK ECSL+TLEKTWQSGDQGFLLEF+NYE
Subjt:  GELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQNYE

XP_031745726.1 uncharacterized protein K02A2.6-like [Cucumis sativus]1.1e-18979.5Show/hide
Query:  VNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPD
        VNPESWAYRAEHFFDIN+L EAEKVKVAVVSFGQEE                                QSLGAR+I IKQ+GSYS+YVKKFVTYSAPLPD
Subjt:  VNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPD

Query:  MAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDP
        MAESVLLDAF TGLEP+LQAEVISRHPQTLEECMK+AQLVNDRNIALKLAMEE+D VEPKR+E+SSK QGK EK            VTIPLK DYQKKDP
Subjt:  MAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEK------------VTIPLKGDYQKKDP

Query:  PIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNG
        PIKRL D EFRARLDKG CFRCNEKY+P HRCKGREKRELMLLILNEE+D++RE +TEDEA E+IELN LELN D PIELRLIT VTSKGTMKLKGHVNG
Subjt:  PIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNG

Query:  KEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTF
        +EVVIL +SGATNNFISQVLVDELQLSID GTRFGVTI NG +CEG GICKRVKVKLKELTIVADFLA ELG VDLVLG+QWLDSTGTMKVHWPSLTMTF
Subjt:  KEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTF

Query:  WTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQNYE
        W KGRRIILKGDPSLTK ECSL+TLEKTWQSGDQGFLLEF+NYE
Subjt:  WTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQNYE

TrEMBL top hitse value%identityAlignment
A0A5A7T8C0 Ty3/gypsy retrotransposon protein1.2e-15756.71Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +S+KEM+LEMKK +DR+TDE++ENH  KK+EE+GTT +G ++KLKGK+++T+   E  G+   R              NP+SW YRAEHFF+INNL E+E
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q GSY++YVKKFVTYSAPLP MAESVL+DAF TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEM-------------------DKVEPKRSESSSKLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDK
        SRHPQTLE+CM++AQLVNDRN+ALKL+  E+                   DK  P++++   K      ++TIP+KG+++K +PP+KRL D EFRARLD+
Subjt:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEM-------------------DKVEPKRSESSSKLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDK

Query:  GFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFI
        G CFRCN+KYSP HRCK +EKRELM  I+NEE+++E   + E+     +EL  LEL ED  IEL+ +TR++SK TMKLKG +  KE+VIL +SGAT+NFI
Subjt:  GFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFI

Query:  SQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLT
         Q L  +L+L ++  T+FG TI NGTRC+G+GIC+RV+VKL+E+TI+ADFLA ELG VD VLG+QWLD+TGTMK+HWPSLTM+FW  GR+IILKGDPSL 
Subjt:  SQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLT

Query:  KLECSLKTLEKTWQSGDQGFLLEFQNYEV
        + ECSL+TLEKTWQ  DQGFLLE+ N EV
Subjt:  KLECSLKTLEKTWQSGDQGFLLEFQNYEV

A0A5A7UM77 Ty3/gypsy retrotransposon protein2.1e-15756.98Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +S+KEM+LEMKK ++R+ DE++ENH  KK+EE+GTT +G ++KLKGK+++TE   E  G+   R              NPESW YRAEHFF+INNLSE+E
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q+GSY++YVKKFVTYSAPLP MAESVL+DAF TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC
        SRHPQTLE+CM++AQLVNDRN ALKL+  E+   E +   SS              K   + +++TIP+KG+++K +PP+KRL D EFRARLD+G CFRC
Subjt:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC

Query:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD
        N+KYSP HRCK +EKRELM  I+NEE+++E   + E+     +EL  LEL ++  IEL+ +TR++SKGTMKLKG +  KEVV+L +SGAT+NFI   L  
Subjt:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD

Query:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL
        +L+L +D  T FG TI NGTRC G+GIC+RV+VKL E+TI+ADFLA ELG VD VLG+QWLD+TGTMK+HWPSLTM+FW  GR+I+LKGDPSL + ECSL
Subjt:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL

Query:  KTLEKTWQSGDQGFLLEFQNYEV
        +TLEKTWQ  DQGFLLE+ N +V
Subjt:  KTLEKTWQSGDQGFLLEFQNYEV

A0A5A7UYM1 Ty3/gypsy retrotransposon protein1.1e-15857.55Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +S+KEM+LEMKK ++R+ DE++ENH  KK+EE+GTT +G ++KLKGK+++TE   E  G+   R              NPESW YRAEHFF+INNLSE+E
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q+GSY++YVKKFVTYSAPLP MAESVL+DAF TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC
        SRHPQTLE+CM++AQLVNDRN ALKL+  E+   E +   SS              K   + +++TIP+KG+++K +PP+KRL D EFRARLD+G CFRC
Subjt:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC

Query:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD
        N+KYSP HRCK +EKRELM  I+NEE+++E   + E+   E +EL  LEL ++T IEL+ +TR++SKGTMKLKG +  KEVV+L +SGAT+NFI   L  
Subjt:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD

Query:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL
        EL+L +D  T FG TI NGTRC G+GIC+RV+VKL E+TI+ADFLA ELG VD VLG+QWLD+TGTMK++WPSLTM+FW  GR+I+LKGDPSL + ECSL
Subjt:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL

Query:  KTLEKTWQSGDQGFLLEFQNYEV
        +TLEKTWQ  DQGFLLE+ N EV
Subjt:  KTLEKTWQSGDQGFLLEFQNYEV

A0A5A7V4F5 Retrotransposon protein8.6e-15957.77Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +SLKEMMLEMKK ++R+ DE++E+H  KK+EE+GTT +G ++KLKGKL++TE   E+ G+   R              NPESW YRAEHFF+INNL EAE
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q GSY+DYVKKFVTYS+PLP MAESVL+DA  TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALK-----LAMEEMDKVEPKRSESSSKLQGKGEKVT------IPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE
        SRHPQTLEE M++AQLVNDRN+ALK     L M E D+    +   + +++    + T      IP+KG ++K +PP+KRL D EFRARLD+G CFRCN+
Subjt:  SRHPQTLEECMKKAQLVNDRNIALK-----LAMEEMDKVEPKRSESSSKLQGKGEKVT------IPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNE

Query:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL
        KYSP HRCK +EKRELM  I+NEED+ E  ++ ++ A   +EL  LEL EDT IEL+ +T ++SKGTMKLKG V  KE+V+L NSGAT+NFI Q L  EL
Subjt:  KYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDEL

Query:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT
        Q+ ++  T+FG TI NGTRC+G+G+C+RV++KLKE+TI+ADFLA ELG VD VLG+QWLD+TGTM++HWPSLTM FW  GR+I+LKGDPSL K+ECSLKT
Subjt:  QLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKT

Query:  LEKTWQSGDQGFLLEFQNYEV
        L KTWQ  DQGFLLE+ N E+
Subjt:  LEKTWQSGDQGFLLEFQNYEV

A0A5D3BJ50 Ty3/gypsy retrotransposon protein3.3e-15857.17Show/hide
Query:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE
        +S+KEM+LEMKK ++R+ DE++ENH  KK+EE+GTT +G ++KLKGK+++TE   E  G+   R              NPESW YRAEHFF+INNLSE+E
Subjt:  MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTT-EGPMLKLKGKLEDTETTAESGGSNAGRR------------VNPESWAYRAEHFFDINNLSEAE

Query:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI
        KVKVAVVSFGQ+E                                +SLGAR+I I+Q+GSY++YVKKFVTYSAPLP MAESVL+DAF TGLEP+LQAEVI
Subjt:  KVKVAVVSFGQEE--------------------------------QSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVI

Query:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC
        SRHPQTLE+CM++AQLVNDRN ALKL+  E+   E +   SS              K   + +++TIP+KG+++K +PP+KRL D EFRARLD+G CFRC
Subjt:  SRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC

Query:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD
        N+KYSP HRCK +EKRELM  I+NEE+++E   + E+   E +EL  LEL ++  IEL+ +TR++SKGTMKLKG +  KEVV+L +SGAT+NFI   L  
Subjt:  NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVD

Query:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL
        +L+L +D  T FG TI NGTRC G+GIC+RV+VKL E+TI+ADFLA ELG VD VLG+QWLD+TGTMK+HWPSLTM+FW  GR+I+LKGDPSL + ECSL
Subjt:  ELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSL

Query:  KTLEKTWQSGDQGFLLEFQNYEV
        +TLEKTWQ  DQGFLLE+ N +V
Subjt:  KTLEKTWQSGDQGFLLEFQNYEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein7.2e-0929.86Show/hide
Query:  GEIIELNQLELNEDT---PIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLK
        G I EL +LE +  T    +E  +I    +KG M+  G +   +VV+  +SGAT+NFI   L   L+L      +  V +      +  G C  +++ ++
Subjt:  GEIIELNQLELNEDT---PIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLK

Query:  ELTIVADFLAGELGR--VDLVLGIQWLDSTGTMKVHWPSLTMTF
        E+ I  +FL  +L +  VD++LG +WL   G   V+W +   +F
Subjt:  ELTIVADFLAGELGR--VDLVLGIQWLDSTGTMKVHWPSLTMTF

AT3G30770.1 Eukaryotic aspartyl protease family protein1.1e-0430.61Show/hide
Query:  TRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGR--VDLVLG
        T  T    M+  G ++  +VV++ +SGATNNFIS  L   L+L   +  +  V +      +  G C  + + ++E+ I  +FL  +L +  VD++LG
Subjt:  TRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGR--VDLVLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTGAAGGAGATGATGCTGGAAATGAAGAAATTGATGGATCGAATGACAGACGAGTTGAGAGAAAACCATGGCACTAAAAAAAGAGAAGAAGCTGGGACTACGGA
AGGCCCCATGTTGAAACTGAAGGGAAAGCTGGAAGATACCGAAACCACAGCCGAGTCGGGAGGAAGCAACGCAGGCCGAAGGGTGAACCCTGAATCATGGGCTTACCGAG
CAGAACATTTCTTTGATATAAACAATTTATCGGAAGCTGAGAAGGTCAAAGTAGCAGTCGTGAGTTTTGGACAGGAAGAGCAGAGCTTGGGAGCAAGAGTAATCTGTATC
AAGCAAAAAGGCTCATACAGTGATTATGTTAAGAAGTTTGTTACATATTCAGCCCCCCTCCCGGACATGGCCGAAAGTGTTCTGCTGGATGCGTTTTTTACGGGTCTTGA
GCCAGCACTGCAAGCAGAAGTAATCAGCAGGCACCCTCAAACTTTAGAGGAGTGTATGAAGAAAGCTCAACTGGTGAATGATAGGAATATAGCATTAAAACTGGCGATGG
AAGAGATGGATAAAGTTGAGCCCAAAAGAAGTGAGAGCAGTAGCAAACTTCAAGGAAAAGGTGAGAAGGTAACTATCCCACTAAAAGGTGATTACCAGAAAAAAGATCCC
CCAATTAAAAGGCTGTTGGATATGGAATTTAGAGCAAGGCTGGATAAAGGGTTTTGTTTCAGGTGTAATGAAAAGTATTCCCCCCGTCATAGATGTAAAGGCAGAGAAAA
AAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGATGACAACGAAAGGGAATCAAATACAGAGGATGAGGCAGGCGAAATAATAGAACTGAATCAATTGGAATTGAATG
AGGATACCCCTATTGAGTTAAGGTTGATCACGAGGGTTACGTCCAAGGGAACGATGAAACTAAAAGGACATGTGAATGGAAAGGAAGTAGTCATCCTGACTAACAGCGGT
GCAACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAACTGAGCATCGATTCGGGGACTCGTTTCGGAGTTACCATTAGGAATGGCACCCGATGTGAAGGGAG
AGGGATCTGTAAGAGGGTAAAAGTGAAGTTGAAAGAGTTAACGATCGTAGCAGACTTCCTGGCGGGAGAATTAGGAAGGGTAGACTTGGTGTTGGGGATACAATGGTTAG
ATTCGACAGGGACTATGAAAGTACACTGGCCATCTCTAACCATGACGTTTTGGACAAAGGGTAGAAGAATAATCCTAAAAGGGGACCCTTCTCTAACAAAGTTAGAATGT
TCGTTGAAAACCTTAGAGAAAACTTGGCAATCTGGGGACCAAGGATTTCTCTTGGAATTTCAAAATTATGAAGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTGAAGGAGATGATGCTGGAAATGAAGAAATTGATGGATCGAATGACAGACGAGTTGAGAGAAAACCATGGCACTAAAAAAAGAGAAGAAGCTGGGACTACGGA
AGGCCCCATGTTGAAACTGAAGGGAAAGCTGGAAGATACCGAAACCACAGCCGAGTCGGGAGGAAGCAACGCAGGCCGAAGGGTGAACCCTGAATCATGGGCTTACCGAG
CAGAACATTTCTTTGATATAAACAATTTATCGGAAGCTGAGAAGGTCAAAGTAGCAGTCGTGAGTTTTGGACAGGAAGAGCAGAGCTTGGGAGCAAGAGTAATCTGTATC
AAGCAAAAAGGCTCATACAGTGATTATGTTAAGAAGTTTGTTACATATTCAGCCCCCCTCCCGGACATGGCCGAAAGTGTTCTGCTGGATGCGTTTTTTACGGGTCTTGA
GCCAGCACTGCAAGCAGAAGTAATCAGCAGGCACCCTCAAACTTTAGAGGAGTGTATGAAGAAAGCTCAACTGGTGAATGATAGGAATATAGCATTAAAACTGGCGATGG
AAGAGATGGATAAAGTTGAGCCCAAAAGAAGTGAGAGCAGTAGCAAACTTCAAGGAAAAGGTGAGAAGGTAACTATCCCACTAAAAGGTGATTACCAGAAAAAAGATCCC
CCAATTAAAAGGCTGTTGGATATGGAATTTAGAGCAAGGCTGGATAAAGGGTTTTGTTTCAGGTGTAATGAAAAGTATTCCCCCCGTCATAGATGTAAAGGCAGAGAAAA
AAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGATGACAACGAAAGGGAATCAAATACAGAGGATGAGGCAGGCGAAATAATAGAACTGAATCAATTGGAATTGAATG
AGGATACCCCTATTGAGTTAAGGTTGATCACGAGGGTTACGTCCAAGGGAACGATGAAACTAAAAGGACATGTGAATGGAAAGGAAGTAGTCATCCTGACTAACAGCGGT
GCAACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAACTGAGCATCGATTCGGGGACTCGTTTCGGAGTTACCATTAGGAATGGCACCCGATGTGAAGGGAG
AGGGATCTGTAAGAGGGTAAAAGTGAAGTTGAAAGAGTTAACGATCGTAGCAGACTTCCTGGCGGGAGAATTAGGAAGGGTAGACTTGGTGTTGGGGATACAATGGTTAG
ATTCGACAGGGACTATGAAAGTACACTGGCCATCTCTAACCATGACGTTTTGGACAAAGGGTAGAAGAATAATCCTAAAAGGGGACCCTTCTCTAACAAAGTTAGAATGT
TCGTTGAAAACCTTAGAGAAAACTTGGCAATCTGGGGACCAAGGATTTCTCTTGGAATTTCAAAATTATGAAGTGTGA
Protein sequenceShow/hide protein sequence
MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRRVNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEEQSLGARVICI
KQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEKVTIPLKGDYQKKDP
PIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSG
ATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLEC
SLKTLEKTWQSGDQGFLLEFQNYEV