CuGenDBv2

Lag0024148 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024148
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr10:788340..792501
RNA-Seq Expression: Lag0024148
Synteny: Lag0024148
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR021109 - Aspartic peptidase domain superfamily
    IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAD6796719.1 hypothetical protein E3N88_07615 [Mikania micrantha] | e value: 2.9e-71 | % identity: 40.4
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR
        +K  V+ FV +C VCQQ KY  L+P GLLQPLPIPDQIWEDIS+DF++ LP + +FD+I VV                        +F KEI+RLH  P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        SIVSDRD IF S FW+ELF      ++ ST+YHPQTDGQTEV+N  +ETYL CFA   P KW ++L W EYSYNTS+HS L  +PF+VVYGR PP +  Y
Subjt:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE-----------------------------------------------------
         P E++  E+  QL  RD ML L++  L  AQ  M   A++KRR++                                                      
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE-----------------------------------------------------

Query:  -----------LVLRKALGNSFPVVPFPQNVSSDMLIKVVPV-ELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGS
                   L L K L  +FP+ P P   + ++L++   V +   +H S      LE+LI+W +  + EATWE+   +ATQFP F LEDK + +GPGS
Subjt:  -----------LVLRKALGNSFPVVPFPQNVSSDMLIKVVPV-ELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGS

Query:  V
        +
Subjt:  V

KFK22699.1 hypothetical protein AALP_AAs67984U000100 [Arabis alpina] | e value: 1.7e-71 | % identity: 43.45
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR
        M S +K FV+EC VCQ+ KY  LAP+GLLQPLPIP Q+WEDIS+DFV  LP++E FD+I VV                        +FI+EIVRLH  PR
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        ++VSDRD +FT +FW ELF      +  STAYHP TDGQTEV N G+ET L CF    P KW  +L W E+ YN+SYHS ++ +PF+ +YGR PP +L +
Subjt:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVELVLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWA
            ++   ++ QLKERD M+ ++++ +L AQQ M   A+  RR+VE  L  A+G+SF     P +++++ +++  P   MG+  + +   + EV I+W 
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVELVLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWA

Query:  ESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSV
             ++TWE    +  QFP+F LEDK      G V
Subjt:  ESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSV

PWA91318.1 Ty3/gypsy retrotransposon protein [Artemisia annua] | e value: 1.8e-73 | % identity: 38.54
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        M+  +   VSEC VCQ+ KY  LAP+GLLQPL +P+++W++++MDF+  LP +E F  I VV+                        F++E++RLH +P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        ++V+DRD +F S FW+E+F  Q   ++RSTAYHPQTDGQTEVVN  +ETYL CF+   P +W RWLSW EY YNTSYH+  KA+PF+++YGR PP ++PY
Subjt:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------
          + +  FEVD  L+ERDR+L  ++  LL AQQ M   ++  RRDV+                                                     
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------

Query:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSK-LEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPG
                   L+K +G+      FP+ +S+DM + V P E++G+ +   +  +  EVLIRW      E+TWE    +  QFPDFHLEDKV LW  G
Subjt:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSK-LEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPG

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.9e-70 | % identity: 38.77
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        MK+ VK + +EC +CQQ K L L+P GLL PL IP  IW DISMDFV  LP+A  F+ IFVV+                        F+KE+VRLH  P 
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        SIVSDRD +F S FW+E+F     ++ RS+AYHPQ+DGQTEVVN G+E YL CF    P +WV+W++W EY YNT++   L  +PF+VVYGR+PPP+L Y
Subjt:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------
            +    +D QLKERD M+  +RE L  AQ+ M + A+++RRD+E                                                     
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------

Query:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR
                   L+K +G    + P  Q +  + + K  PVE +   ++ +   + EV+IRW   ++ E TWE  V +A ++PDFHLEDKV+L G  +VR 
Subjt:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR

Query:  PIKFE
        PI F+
Subjt:  PIKFE

XP_028780228.1 uncharacterized protein LOC114736538 [Prosopis alba] | e value: 4.9e-71 | % identity: 39.85
Query:  CTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPRSIVSDRDPIFT
        C+ CQQ KY  LAP GLLQPLP+P+QIWEDIS+DF+  LP+++ FD IFVV+                        F KE+VRLH +PRSIVSDRD +F 
Subjt:  CTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPRSIVSDRDPIFT

Query:  SLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPYHPNESSVFEVD
        S+FW+ELF      +  STAYHPQ+DGQTEVVN  +ETYL CF    P KW  +L W EYS+NTSYH     +PF++VYGR+PPP++PY   E++V +++
Subjt:  SLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPYHPNESSVFEVD

Query:  HQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL--------------------------------------------------------------V
         QL+ RD ML ++RE LL AQ  M   A+  RRD++                                                               +
Subjt:  HQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL--------------------------------------------------------------V

Query:  LRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRRPIKFEEQVILE
        LR ALG   P VP P  +SS+M + + P +++    +T     LE+L++W +  + EATWED   LA QFP F LEDK       + R P KF+      
Subjt:  LRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRRPIKFEEQVILE

Query:  KTRG
        + RG
Subjt:  KTRG

TrEMBL top hits | e value | % identity | Alignment
A0A087FYJ7 Uncharacterized protein | e value: 8.2e-72 | % identity: 43.45
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR
        M S +K FV+EC VCQ+ KY  LAP+GLLQPLPIP Q+WEDIS+DFV  LP++E FD+I VV                        +FI+EIVRLH  PR
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        ++VSDRD +FT +FW ELF      +  STAYHP TDGQTEV N G+ET L CF    P KW  +L W E+ YN+SYHS ++ +PF+ +YGR PP +L +
Subjt:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVELVLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWA
            ++   ++ QLKERD M+ ++++ +L AQQ M   A+  RR+VE  L  A+G+SF     P +++++ +++  P   MG+  + +   + EV I+W 
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVELVLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWA

Query:  ESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSV
             ++TWE    +  QFP+F LEDK      G V
Subjt:  ESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSV

A0A2U1PZV9 Ty3/gypsy retrotransposon protein | e value: 8.7e-74 | % identity: 38.54
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        M+  +   VSEC VCQ+ KY  LAP+GLLQPL +P+++W++++MDF+  LP +E F  I VV+                        F++E++RLH +P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        ++V+DRD +F S FW+E+F  Q   ++RSTAYHPQTDGQTEVVN  +ETYL CF+   P +W RWLSW EY YNTSYH+  KA+PF+++YGR PP ++PY
Subjt:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------
          + +  FEVD  L+ERDR+L  ++  LL AQQ M   ++  RRDV+                                                     
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------

Query:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSK-LEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPG
                   L+K +G+      FP+ +S+DM + V P E++G+ +   +  +  EVLIRW      E+TWE    +  QFPDFHLEDKV LW  G
Subjt:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSK-LEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPG

A0A5A7VJA0 Ty3/gypsy retrotransposon protein | e value: 9.0e-71 | % identity: 38.77
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        MK+ VK + +EC +CQQ K L L+P GLL PL IP  IW DISMDFV  LP+A  F+ IFVV+                        F+KE+VRLH  P 
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        SIVSDRD +F S FW+E+F     ++ RS+AYHPQ+DGQTEVVN G+E YL CF    P +WV+W++W EY YNT++   L  +PF+VVYGR+PPP+L Y
Subjt:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------
            +    +D QLKERD M+  +RE L  AQ+ M + A+++RRD+E                                                     
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------

Query:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR
                   L+K +G    + P  Q +  + + K  PVE +   ++ +   + EV+IRW   ++ E TWE  V +A ++PDFHLEDKV+L G  +VR 
Subjt:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR

Query:  PIKFE
        PI F+
Subjt:  PIKFE

A0A5D3BEL2 Ty3/gypsy retrotransposon protein | e value: 9.0e-71 | % identity: 38.77
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        MK+ VK + +EC +CQQ K L L+P GLL PL IP  IW DISMDFV  LP+A  F+ IFVV+                        F+KE+VRLH  P 
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        SIVSDRD +F S FW+E+F     ++ RS+AYHPQ+DGQTEVVN G+E YL CF    P +WV+W++W EY YNT++   L  +PF+VVYGR+PPP+L Y
Subjt:  SIVSDRDPIFTSLFWEELF---CCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------
            +    +D QLKERD M+  +RE L  AQ+ M + A+++RRD+E                                                     
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVEL----------------------------------------------------

Query:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR
                   L+K +G    + P  Q +  + + K  PVE +   ++ +   + EV+IRW   ++ E TWE  V +A ++PDFHLEDKV+L G  +VR 
Subjt:  ----------VLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRR

Query:  PIKFE
        PI F+
Subjt:  PIKFE

A0A5N6PUX8 Uncharacterized protein | e value: 1.4e-71 | % identity: 40.4
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR
        +K  V+ FV +C VCQQ KY  L+P GLLQPLPIPDQIWEDIS+DF++ LP + +FD+I VV                        +F KEI+RLH  P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVV------------------------IFIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
        SIVSDRD IF S FW+ELF      ++ ST+YHPQTDGQTEV+N  +ETYL CFA   P KW ++L W EYSYNTS+HS L  +PF+VVYGR PP +  Y
Subjt:  SIVSDRDPIFTSLFWEELFCCQ---VERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE-----------------------------------------------------
         P E++  E+  QL  RD ML L++  L  AQ  M   A++KRR++                                                      
Subjt:  HPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE-----------------------------------------------------

Query:  -----------LVLRKALGNSFPVVPFPQNVSSDMLIKVVPV-ELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGS
                   L L K L  +FP+ P P   + ++L++   V +   +H S      LE+LI+W +  + EATWE+   +ATQFP F LEDK + +GPGS
Subjt:  -----------LVLRKALGNSFPVVPFPQNVSSDMLIKVVPV-ELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGS

Query:  V
        +
Subjt:  V

SwissProt top hits | e value | % identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | e value: 1.4e-23 | % identity: 28.63
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        ++ +++ +V  C  CQ  K     P G LQP+P  ++ WE +SMDF+  LP +  ++++FVV+                        F + ++     P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
         I++D D IFTS  W++    +   ++ S  Y PQTDGQTE  N  +E  L C     P  WV  +S  + SYN + HS  + +PFE+V+ R  P + P 
Subjt:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
          P+ S   + D   +E  ++   ++E L      M +  + K +++E
Subjt:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE

P0CT35 Transposon Tf2-2 polyprotein | e value: 1.4e-23 | % identity: 28.63
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        ++ +++ +V  C  CQ  K     P G LQP+P  ++ WE +SMDF+  LP +  ++++FVV+                        F + ++     P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
         I++D D IFTS  W++    +   ++ S  Y PQTDGQTE  N  +E  L C     P  WV  +S  + SYN + HS  + +PFE+V+ R  P + P 
Subjt:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
          P+ S   + D   +E  ++   ++E L      M +  + K +++E
Subjt:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE

P0CT36 Transposon Tf2-3 polyprotein | e value: 1.4e-23 | % identity: 28.63
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        ++ +++ +V  C  CQ  K     P G LQP+P  ++ WE +SMDF+  LP +  ++++FVV+                        F + ++     P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
         I++D D IFTS  W++    +   ++ S  Y PQTDGQTE  N  +E  L C     P  WV  +S  + SYN + HS  + +PFE+V+ R  P + P 
Subjt:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
          P+ S   + D   +E  ++   ++E L      M +  + K +++E
Subjt:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE

P0CT41 Transposon Tf2-12 polyprotein | e value: 1.4e-23 | % identity: 28.63
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        ++ +++ +V  C  CQ  K     P G LQP+P  ++ WE +SMDF+  LP +  ++++FVV+                        F + ++     P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
         I++D D IFTS  W++    +   ++ S  Y PQTDGQTE  N  +E  L C     P  WV  +S  + SYN + HS  + +PFE+V+ R  P + P 
Subjt:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
          P+ S   + D   +E  ++   ++E L      M +  + K +++E
Subjt:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE

Q9UR07 Transposon Tf2-11 polyprotein | e value: 1.4e-23 | % identity: 28.63
Query:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR
        ++ +++ +V  C  CQ  K     P G LQP+P  ++ WE +SMDF+  LP +  ++++FVV+                        F + ++     P+
Subjt:  MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVI------------------------FIKEIVRLHDIPR

Query:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY
         I++D D IFTS  W++    +   ++ S  Y PQTDGQTE  N  +E  L C     P  WV  +S  + SYN + HS  + +PFE+V+ R  P + P 
Subjt:  SIVSDRDPIFTSLFWEEL---FCCQVERSTAYHPQTDGQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPY

Query:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
          P+ S   + D   +E  ++   ++E L      M +  + K +++E
Subjt:  H-PNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE

Arabidopsis top hits | e value | % identity | Alignment
AT3G30770.1 Eukaryotic aspartyl protease family protein | e value: 2.5e-04 | % identity: 28.7
Query:  FSSPGTFKVKGKIEDQEVVILIGCGATHNFISQKLVEEFKLSVLETLNYKKIIGTGTAAKGKGICKGVIISLNELMIAKDYLPLKLGSIDA---------
        F+     +  G I   +VV++I  GAT+NFIS +L    KL    T     ++G     +  G C G+ + + E+ I +++L L L   D          
Subjt:  FSSPGTFKVKGKIEDQEVVILIGCGATHNFISQKLVEEFKLSVLETLNYKKIIGTGTAAKGKGICKGVIISLNELMIAKDYLPLKLGSIDA---------

Query:  -ILDMQWL
          L+ QWL
Subjt:  -ILDMQWL


Sequences
CDS sequence
ATGAAGTCCCGTGTGAAGGCCTTTGTCAGTGAGTGCACGGTTTGCCAACAAGCTAAATATTTAGCTTTAGCCCCAACTGGTTTGCTTCAACCACTTCCGATTCCAGATCA
AATTTGGGAAGATATCTCTATGGATTTCGTAATCGATTTACCACGGGCGGAGAAATTTGACTCCATATTCGTTGTGATCTTTATCAAGGAAATTGTACGCTTGCACGACA
TTCCTCGCAGTATTGTATCCGATCGGGACCCGATTTTCACCAGTTTGTTTTGGGAGGAGCTTTTTTGTTGCCAAGTTGAAAGAAGCACAGCTTATCATCCTCAAACGGAT
GGTCAAACCGAGGTTGTAAATGGAGGTATCGAAACGTACTTGCATTGTTTCGCTATGCATTGTCCTTTAAAATGGGTACGTTGGTTATCATGGACTGAATATAGCTACAA
CACGTCGTACCACTCTGGTTTGAAGGCCTCACCTTTTGAGGTTGTATACGGGAGACAACCCCCACCTATTCTACCATACCATCCAAATGAGTCCTCTGTTTTTGAGGTTG
ACCATCAGCTTAAGGAACGAGATCGAATGTTGGCGTTAATCCGAGAACGCCTGCTCGACGCACAACAATGTATGGCTAGGATGGCCAATGAAAAACGCCGAGATGTGGAG
TTGGTGCTTCGGAAGGCCCTGGGAAACTCATTCCCTGTGGTTCCTTTCCCTCAGAATGTCTCCTCGGATATGTTGATTAAAGTTGTTCCCGTGGAATTGATGGGATTGCA
CCAATCTACAATTGATCCTTCGAAACTGGAGGTGTTAATCCGATGGGCAGAATCTAATGTCTCGGAGGCTACGTGGGAAGATGCAGTTCAACTAGCAACACAATTTCCAG
ATTTTCACCTTGAGGATAAGGTGGCGCTATGGGGGCCGGGTAGTGTTAGACGTCCAATTAAGTTTGAAGAGCAAGTCATTCTGGAAAAGACACGAGGAATGGCGCAAAAG
CAACTAGAGGAGCGTATAGTTGAGGCTGAAAAAAGTACGCGAGCTGACGGGCTAGGAGAAAGTAATTGTGACTACAGTGAGTTTCACGAGGGTGGCACTCTGATGGTGAA
AACGGTGGTGATATACCGTGAACAATTCGAAGCATTGGCAGCTTCGTTGCCACACTTAATGAAAAGTCATGGAAAATCTGATGAAAAATATTTAGTGGGACATAGATGCA
AAGGGAAGGAACTAAAGGTCTTGATTGTGTCAGACAAAAACGAAGACAAGGAACAACGCTGTAAAGAAGAGAGAGCAAAGGAGAGTAGGGTCGAGGAAGGGGAAGATTCC
TCAGTGGAAATGGATATGGTCGAGCTCTCTCTCAACACTATGTTGGGATTTTCCTCACCCGGTACTTTTAAGGTTAAAGGCAAGATTGAAGACCAAGAAGTGGTGATTCT
GATTGGTTGCGGTGCGACCCATAATTTTATTTCCCAGAAACTCGTCGAGGAATTCAAGTTGTCGGTATTGGAAACGCTGAATTACAAGAAAATCATTGGAACAGGGACTG
CGGCGAAAGGCAAAGGTATTTGCAAGGGAGTAATTATCTCTCTGAACGAGCTCATGATTGCCAAAGACTATTTGCCTCTCAAACTGGGCAGTATAGACGCAATCCTCGAC
ATGCAATGGCTGCGAACGCTAGGCGTAGCAACTGTGGCTTGGAAAACACTTACTCTAACCATCGAGAGGAGGAAAACCTGCGGTTTTAGAAGTTGTGCCCTTTCTCCCAA
CCCACCATCGATTGATCGATTACTTGAACAATATGATGATGTGCTTTCGATTCCCACACAACTTCCACCATCAAGGGAGGTCGACCATCGCATCTACCTCAAAGATGAAC
AATAG
mRNA sequence
ATGAAGTCCCGTGTGAAGGCCTTTGTCAGTGAGTGCACGGTTTGCCAACAAGCTAAATATTTAGCTTTAGCCCCAACTGGTTTGCTTCAACCACTTCCGATTCCAGATCA
AATTTGGGAAGATATCTCTATGGATTTCGTAATCGATTTACCACGGGCGGAGAAATTTGACTCCATATTCGTTGTGATCTTTATCAAGGAAATTGTACGCTTGCACGACA
TTCCTCGCAGTATTGTATCCGATCGGGACCCGATTTTCACCAGTTTGTTTTGGGAGGAGCTTTTTTGTTGCCAAGTTGAAAGAAGCACAGCTTATCATCCTCAAACGGAT
GGTCAAACCGAGGTTGTAAATGGAGGTATCGAAACGTACTTGCATTGTTTCGCTATGCATTGTCCTTTAAAATGGGTACGTTGGTTATCATGGACTGAATATAGCTACAA
CACGTCGTACCACTCTGGTTTGAAGGCCTCACCTTTTGAGGTTGTATACGGGAGACAACCCCCACCTATTCTACCATACCATCCAAATGAGTCCTCTGTTTTTGAGGTTG
ACCATCAGCTTAAGGAACGAGATCGAATGTTGGCGTTAATCCGAGAACGCCTGCTCGACGCACAACAATGTATGGCTAGGATGGCCAATGAAAAACGCCGAGATGTGGAG
TTGGTGCTTCGGAAGGCCCTGGGAAACTCATTCCCTGTGGTTCCTTTCCCTCAGAATGTCTCCTCGGATATGTTGATTAAAGTTGTTCCCGTGGAATTGATGGGATTGCA
CCAATCTACAATTGATCCTTCGAAACTGGAGGTGTTAATCCGATGGGCAGAATCTAATGTCTCGGAGGCTACGTGGGAAGATGCAGTTCAACTAGCAACACAATTTCCAG
ATTTTCACCTTGAGGATAAGGTGGCGCTATGGGGGCCGGGTAGTGTTAGACGTCCAATTAAGTTTGAAGAGCAAGTCATTCTGGAAAAGACACGAGGAATGGCGCAAAAG
CAACTAGAGGAGCGTATAGTTGAGGCTGAAAAAAGTACGCGAGCTGACGGGCTAGGAGAAAGTAATTGTGACTACAGTGAGTTTCACGAGGGTGGCACTCTGATGGTGAA
AACGGTGGTGATATACCGTGAACAATTCGAAGCATTGGCAGCTTCGTTGCCACACTTAATGAAAAGTCATGGAAAATCTGATGAAAAATATTTAGTGGGACATAGATGCA
AAGGGAAGGAACTAAAGGTCTTGATTGTGTCAGACAAAAACGAAGACAAGGAACAACGCTGTAAAGAAGAGAGAGCAAAGGAGAGTAGGGTCGAGGAAGGGGAAGATTCC
TCAGTGGAAATGGATATGGTCGAGCTCTCTCTCAACACTATGTTGGGATTTTCCTCACCCGGTACTTTTAAGGTTAAAGGCAAGATTGAAGACCAAGAAGTGGTGATTCT
GATTGGTTGCGGTGCGACCCATAATTTTATTTCCCAGAAACTCGTCGAGGAATTCAAGTTGTCGGTATTGGAAACGCTGAATTACAAGAAAATCATTGGAACAGGGACTG
CGGCGAAAGGCAAAGGTATTTGCAAGGGAGTAATTATCTCTCTGAACGAGCTCATGATTGCCAAAGACTATTTGCCTCTCAAACTGGGCAGTATAGACGCAATCCTCGAC
ATGCAATGGCTGCGAACGCTAGGCGTAGCAACTGTGGCTTGGAAAACACTTACTCTAACCATCGAGAGGAGGAAAACCTGCGGTTTTAGAAGTTGTGCCCTTTCTCCCAA
CCCACCATCGATTGATCGATTACTTGAACAATATGATGATGTGCTTTCGATTCCCACACAACTTCCACCATCAAGGGAGGTCGACCATCGCATCTACCTCAAAGATGAAC
AATAG
Protein sequence
MKSRVKAFVSECTVCQQAKYLALAPTGLLQPLPIPDQIWEDISMDFVIDLPRAEKFDSIFVVIFIKEIVRLHDIPRSIVSDRDPIFTSLFWEELFCCQVERSTAYHPQTD
GQTEVVNGGIETYLHCFAMHCPLKWVRWLSWTEYSYNTSYHSGLKASPFEVVYGRQPPPILPYHPNESSVFEVDHQLKERDRMLALIRERLLDAQQCMARMANEKRRDVE
LVLRKALGNSFPVVPFPQNVSSDMLIKVVPVELMGLHQSTIDPSKLEVLIRWAESNVSEATWEDAVQLATQFPDFHLEDKVALWGPGSVRRPIKFEEQVILEKTRGMAQK
QLEERIVEAEKSTRADGLGESNCDYSEFHEGGTLMVKTVVIYREQFEALAASLPHLMKSHGKSDEKYLVGHRCKGKELKVLIVSDKNEDKEQRCKEERAKESRVEEGEDS
SVEMDMVELSLNTMLGFSSPGTFKVKGKIEDQEVVILIGCGATHNFISQKLVEEFKLSVLETLNYKKIIGTGTAAKGKGICKGVIISLNELMIAKDYLPLKLGSIDAILD
MQWLRTLGVATVAWKTLTLTIERRKTCGFRSCALSPNPPSIDRLLEQYDDVLSIPTQLPPSREVDHRIYLKDEQ
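
The protein sequence above can be checked against the CDS sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that contains only A, C, G and T; the names `CODON_TABLE` and `translate_cds` are hypothetical and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted in as the wrapped lines shown on the page.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":  # stop codon: end of the reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate_cds("ATGAAGTCCCGTGTGAAGGCC"))  # MKSRVKA
```

Passing the full CDS text to `translate_cds` and comparing the result with the protein sequence, after removing line breaks, is one way to confirm that the two listings agree.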