CuGenDBv2

Lag0031657 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031657
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr11:11506735..11507931
RNA-Seq Expression: Lag0031657
Synteny: Lag0031657
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
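The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr11:11506735..11507931")
# Span length under the inclusive-coordinate assumption.
span_length = end - start + 1
print(chrom, start, end, span_length)
```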


Homology
GenBank top hits | e value | % identity
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 1.6e-96 | 45.87
Query:  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTI
        S    N W +D G   H+T+DLANL     Y G++NIT+ NGQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F I
Subjt:  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTI

Query:  QDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL-------------------------TAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVS
        QDK+T ++LF GPS +GLYPL    + K  +P+ Q  L                         TA +G + ST++WHD LGHP  + L S+L+S+SI   
Subjt:  QDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL-------------------------TAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVS

Query:  RSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSD
        R    +C+HCL GK++K PF LS++ S +PL+L+HSD WGPAP  S +   YYVSFVDDFS                                    RSD
Subjt:  RSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSD

Query:  GGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHV
        GGGEY   +L  L    GI H++SCP+TP+QNGI ERKHRHIV   L+LLS++S+P+++W  AF+TA YLINR+P+  L+H SP+E LF  PPDYT L  
Subjt:  GGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHV

Query:  FGCACYHLLRPY
        FG ACY LL+PY
Subjt:  FGCACYHLLRPY

KAB2610253.1 hypothetical protein D8674_018285 [Pyrus ussuriensis x Pyrus communis] | 4.3e-97 | 44.1
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF
        ++S   + +WL+D G   H+T+DL+NL +++ Y   + +   NG+ L +SH G   +  P     L+++  VP +S NLLSVH++C+DNNC  IFD+  F
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF

Query:  TIQDKSTGKVLFHGPSVNGLYPL--VAKSP---SPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIG-VCKHCLDGKLSKQPFLL
         IQDK TG++L+ G   NGLYP+  +AK P   SP  +  +A +G   S+ +WH  LGHP  +I++++L+ ++I  S+ D+  VC  CL+GK +K PF  
Subjt:  TIQDKSTGKVLFHGPSVNGLYPL--VAKSP---SPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIG-VCKHCLDGKLSKQPFLL

Query:  SSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQ
        S+  S  P E++HSD WGPAP  SI+G ++YV+ +D+ +R+ W+FP+  KSD F  F  F  F +   S+ + + +SDGGGEY+++ L+     +G++H 
Subjt:  SSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQ

Query:  KSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
         SCPYTPEQNG+ ERKHRH++   ++LL  + +P +FW FA   A YLINR+P+P L HKSPFELLF   P  T L VFGC+C+ LL+PY
Subjt:  KSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

PKU75882.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] | 8.2e-96 | 45.52
Query:  PGTLLNTSPS--DSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCF
        P T   +SPS   S+ W  D G + HLTSD      S  Y G   + +GNG  LPI + G G L  P  S  L NL  VP++S NLLSV+QL  DNNC  
Subjt:  PGTLLNTSPS--DSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCF

Query:  IFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGVCKHCLDGKLSKQPFL
         F S  F I+D  T +VL  GP +NGLY + A SP+ ++  L A + ++A   +WH  LGHP  S L+S+    S     S   +C  C   K  ++PF 
Subjt:  IFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGVCKHCLDGKLSKQPFL

Query:  LSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILH
        +S S S SP +L+HSD WGP+P  S  G+RYYVSF+D+FS++TW++P+ +KS+VF  F +F    K    + + + R+DGGGE+++N    L  N GI+H
Subjt:  LSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILH

Query:  QKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        Q +CPY+P QNG+ ERKHRH+     SLL ++S+P  FW     TA YLINRLPSPN +HKSP+E+L+++ P+Y  L VFGC CY  L+PY
Subjt:  QKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae] | 4.2e-100 | 46.44
Query:  WLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGK
        W +D G   H+TS++ NL + + Y+  + ++VGNG  L ISH G   +S P+++F L+N+  VP ISTNL+SVH+   DNNC FIFDSS F I+DK++GK
Subjt:  WLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGK

Query:  VLFHGPSVNGLYPL-VAKSPSPAQVT-LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLSSSLSCSPLEL
         LF G S NGLYP  + + P+ +      A VG + +  +WH  LGHP  ++   + ++  +PV  S     +C  C  GK  K PF +SSS+S +PL+L
Subjt:  VLFHGPSVNGLYPL-VAKSPSPAQVT-LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLSSSLSCSPLEL

Query:  LHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNG
        +H D WG +P+ SI+G+ YYVSF+DD ++Y W +P+  KS  F  F +F  + +N+LS+ +  F+SDGGGE++SN  +N   + GI H+ SCP+TPEQNG
Subjt:  LHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNG

Query:  IDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        + ERKH HIV M L+LL+ S +P+++W  AF TA +LINRLP+  L +KSP+E LF + P+Y  LH FGC C+  LRPY
Subjt:  IDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

TQE09310.1 hypothetical protein C1H46_005046 [Malus baccata] | 1.3e-98 | 46.85
Query:  ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCID
        A T+ S+P    ++S   S  WL D G   H+TSDL+NL ++  Y+  + IT  NG  L I+H G   LSLP  +  L+++  VP +S +LLS+HQLC D
Subjt:  ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCID

Query:  NNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIG-VCKHCLDGKL
        NNC  I D  S  IQDK T KVL+ G S N +YPL     SP  V+  A +G + S+ +WH  LGHP   +L + L+ + I  S +D    CK CL GK 
Subjt:  NNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIG-VCKHCLDGKL

Query:  SKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFA
        +  PF   +S S  P E++H+D WGP+P  SI  +RYYVSF+D+ +RYTWIFP+  K+ VF +F QF  F  N     + + +SDGGGEY+    +N   
Subjt:  SKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFA

Query:  NQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
         +GILH KSCPYTP+QNG+ ERK+RHI   A++LL ++ +P +FW+ A ATA YLINR+P+P LA +SPFE L+  PP    L +FGCACY  LRPY
Subjt:  NQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

TrEMBL top hits | e value | % identity
A0A2N9FMC6 Integrase catalytic domain-containing protein | 7.4e-103 | 47.59
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF
        N +PS +  W+SD G   H T DL NL     Y G + +++GNG  LPI+H G  QL   +  F L  + RVP + TNLLSV++ C DN C F FD++ F
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF

Query:  TIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SIPVSRSDIGVCKHCLDGKLSK
        +IQD  +G+ L+ G S +GLYP++  S S    T         +A +G K +  VWH  LGHP   +L+SVLN     S+  ++     C HC+ GKL +
Subjt:  TIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SIPVSRSDIGVCKHCLDGKLSK

Query:  QPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ
         PF  SS  + +PLEL+HSD WGPAP  SING R+YVSFVD F+R+TW+FP+ +KS V + F  F    +N+L++R+ V R+D GGEY ++  ++  + +
Subjt:  QPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ

Query:  GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        GILHQ SCP+TP+QNG+ ERKHRHIV  AL+L+S+SS+P+++W +AF+TA YLINR+P+PNL   SP++LLF   PDY+ L  FGC C+ LLRPY
Subjt:  GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

A0A2N9G7E3 Integrase catalytic domain-containing protein | 1.3e-102 | 49
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF
        +++ S SN W+SD G   H T DLANL  +  YNG + +TVGNGQ LPI+H G  QL        L    RVP++ TNLLSV + C DNNCCF FD+S F
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF

Query:  TIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQVGIKASTIVWHDWLGHPCLSILNSV---LNSSSIPVSRSDIGVCKHC
        +IQD  +GKVL+ G +  GLYP+          ++P P            +A    K S+  WH  LGHP   IL SV   L +S I  S S+   CKHC
Subjt:  TIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQVGIKASTIVWHDWLGHPCLSILNSV---LNSSSIPVSRSDIGVCKHC

Query:  LDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDL
          GK+S+ PF  S + +  PL+L+HSD WGPAP  SING RYYVSF+DDFS++TW FP+ +KS V S F  F    +NLL+ +L V R+D GGEY  +  
Subjt:  LDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDL

Query:  KNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLR
        ++  ++QGI HQ SCP+TP+QNG+ ERKHRHI+  AL+L+S+SS+P+ +W +AFA++ +LINRLP+ +L  KSP+E+LF  PPDY+   VFGC+CY LL 
Subjt:  KNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLR

Query:  PY
        PY
Subjt:  PY

A0A2N9GRJ0 Uncharacterized protein | 7.4e-103 | 47.59
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF
        N +PS +  W+SD G   H T DL NL     Y G + +++GNG  LPI+H G  QL   +  F L  + RVP + TNLLSV++ C DN C F FD++ F
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF

Query:  TIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SIPVSRSDIGVCKHCLDGKLSK
        +IQD  +G+ L+ G S +GLYP++  S S    T         +A +G K +  VWH  LGHP   +L+SVLN     S+  ++     C HC+ GKL +
Subjt:  TIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SIPVSRSDIGVCKHCLDGKLSK

Query:  QPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ
         PF  SS  + +PLEL+HSD WGPAP  SING R+YVSFVD F+R+TW+FP+ +KS V + F  F    +N+L++R+ V R+D GGEY ++  ++  + +
Subjt:  QPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ

Query:  GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        GILHQ SCP+TP+QNG+ ERKHRHIV  AL+L+S+SS+P+++W +AF+TA YLINR+P+PNL   SP++LLF   PDY+ L  FGC C+ LLRPY
Subjt:  GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

A0A2N9I765 Integrase catalytic domain-containing protein | 5.3e-101 | 49.38
Query:  ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCID
        ASTS+ + G          + WL+D G   HLT+++ NL + T Y G + + VGNGQS+PI++ G GQL+     F L+NL     IS+NLLSVH+LC D
Subjt:  ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCID

Query:  NNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSP----AQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGV---CKH
        N C   FDS+ F IQD  +GKVL+ G S NGLYP +   PSP    A  T++A +  K    +WH  LGHP   +L S L S S  +S  +  V   CKH
Subjt:  NNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSP----AQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGV---CKH

Query:  CLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND
        CL GK+ K PF  S   S  PLEL+HSD WGPAP  S NG++YY+ FVDDFS+Y+W+F +  KSDV + F  F    +  LS+++   R+D GGEY SN 
Subjt:  CLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND

Query:  LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLL
          +  ++ GI HQ SCP+TP+QNGI ERKHRHI+  AL+LLS +S+P   W +A  TA +LINRLPSP+L+HKSP+E LF K PD T L  FGC CY  L
Subjt:  LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLL

Query:  RPY
        RPY
Subjt:  RPY

A0A2N9IEP2 Uncharacterized protein | 1.8e-101 | 46.79
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF
        N+  SD + W+SD G   H T DL+ +     Y G +  TVGNGQ++PI+H G  QL   +  F L  + RVP +++NLLSV++ C DNNCCF+FD++ F
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSF

Query:  TIQDKSTGKVLFHGPSVNGLYPLVAKS-PSPAQVT--LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLS
         I+D  TGK+L+ GPS NGLYP+   S P P   +   + Q     S+ VWHD LGHP   +   + ++S +  S S+     C HC+ GK++  PF  S
Subjt:  TIQDKSTGKVLFHGPSVNGLYPLVAKS-PSPAQVT--LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLS

Query:  SSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQK
         S +C PLE++HSD WGP+P  S  G R+YV FVD+F+R+TW +P+  KS V S F  F    +NLL+ ++ + R+D GGEY SN+  +   + GI HQ 
Subjt:  SSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQK

Query:  SCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        +CP+T +QNG+ ERKHRHIV++AL+L+S+SS+P+ FW +AF+TA YLINR+P  N    SP+ELLF + P+Y SL  FGC CY L+RPY
Subjt:  SCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

SwissProt top hits | e value | % identity
P04146 Copia protein | 7.8e-33 | 29.55
Query:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVG-NGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSS
        NTS  D+  ++ D G + HL +D +    S        I V   G+ +  +  G  +L   +   TL ++    + + NL+SV +L  +      FD S 
Subjt:  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVG-NGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSS

Query:  FTIQD------KSTGKVLFHGPSVN-GLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGH----PCLSILNSVLNSSSIPVSRSDIG--VCKHCLDGK
         TI        K++G +L + P +N   Y + AK               K +  +WH+  GH      L I    + S    ++  ++   +C+ CL+GK
Subjt:  FTIQD------KSTGKVLFHGPSVN-GLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGH----PCLSILNSVLNSSSIPVSRSDIG--VCKHCLDGK

Query:  LSKQPF--LLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKN
         ++ PF  L   +    PL ++HSD  GP    +++   Y+V FVD F+ Y   + + YKSDVFS+F  F+  ++   + ++     D G EYLSN+++ 
Subjt:  LSKQPF--LLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKN

Query:  LFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNL--AHKSPFELLFKKPPDYTSLHVFGCACY
            +GI +  + P+TP+ NG+ ER  R I   A +++S + +   FW  A  TA YLINR+PS  L  + K+P+E+   K P    L VFG   Y
Subjt:  LFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNL--AHKSPFELLFKKPPDYTSLHVFGCACY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-38 | 32.81
Query:  SFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNS
        +  L ++  VPD+  NL+S   L  D    + F +  + +   S   V+  G +   LY   A+     Q  L A    + S  +WH  +GH     L  
Subjt:  SFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNS

Query:  VLNSSSIPVSR-SDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNL
        +   S I  ++ + +  C +CL GK  +  F  SS    + L+L++SD  GP   +S+ G++Y+V+F+DD SR  W++ +  K  VF +F +F    +  
Subjt:  VLNSSSIPVSR-SDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNL

Query:  LSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLF
           +L   RSD GGEY S + +   ++ GI H+K+ P TP+ NG+ ER +R IV    S+L  + +P  FW  A  TA YLINR PS  LA + P  +  
Subjt:  LSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLF

Query:  KKPPDYTSLHVFGCACY
         K   Y+ L VFGC  +
Subjt:  KKPPDYTSLHVFGCACY

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein | 2.1e-25 | 27.25
Query:  SSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDN-NCC
        S P   ++++    +  L D G +  L      L  +T  N E NI     Q +PI+  G    +  N + T       P+I+ +LLS+ +L   N   C
Subjt:  SSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDN-NCC

Query:  FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT-LTAQVGIKASTI------VWHDWLGHPCL-SILNSV-------LNSSSIPVSRSDIG
        F  ++      ++S G VL         Y L  K   P+ ++ LT     K+ ++      + H  LGH    SI  S+       L  S I  S +   
Subjt:  FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVT-LTAQVGIKASTI------VWHDWLGHPCL-SILNSV-------LNSSSIPVSRSDIG

Query:  VCKHCLDGKLSKQPFLLSSSL----SCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSD--VFSIFSQFLPFAKNLLSSRLNVFRS
         C  CL GK +K   +  S L    S  P + LH+D +GP      +   Y++SF D+ +R+ W++P+  + +  + ++F+  L F KN  ++R+ V + 
Subjt:  VCKHCLDGKLSKQPFLLSSSL----SCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSD--VFSIFSQFLPFAKNLLSSRLNVFRS

Query:  DGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSP
        D G EY +  L   F N+GI    +       +G+ ER +R ++N   +LL  S +P   WF A   +  + N L SP
Subjt:  DGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 4.8e-83 | 44.53
Query:  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTI
        SP  SN WL D G   H+TSD  NL +   Y G +++ V +G ++PISH G   LS  +    L N+  VP+I  NL+SV++LC  N     F  +SF +
Subjt:  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTI

Query:  QDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLSSSLSC
        +D +TG  L  G + + LY     S  P  V+L A    KA+   WH  LGHP  SILNSV+++ S+ V         C  CL  K +K PF  S+  S 
Subjt:  QDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLSSSLSC

Query:  SPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYT
         PLE ++SD W  +P  S + +RYYV FVD F+RYTW++P+  KS V   F  F    +N   +R+  F SD GGE+++  L   F+  GI H  S P+T
Subjt:  SPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYT

Query:  PEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
        PE NG+ ERKHRHIV   L+LLS +SIP  +W +AFA A YLINRLP+P L  +SPF+ LF   P+Y  L VFGCACY  LRPY
Subjt:  PEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.4e-82 | 42.18
Query:  STSHSSP----GTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQL
        STS  +P      L   SP ++N WL D G   H+TSD  NL     Y G +++ + +G ++PI+H G   L   + S  L+ +  VP+I  NL+SV++L
Subjt:  STSHSSP----GTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQL

Query:  CIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKH
        C  N     F  +SF ++D +TG  L  G + + LY  P+     S   V++ A    KA+   WH  LGHP L+ILNSV+++ S+PV      +  C  
Subjt:  CIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKH

Query:  CLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND
        C   K  K PF  S+  S  PLE ++SD W  +P  SI+ +RYYV FVD F+RYTW++P+  KS V   F  F    +N   +R+    SD GGE++   
Subjt:  CLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND

Query:  LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLL
        L++  +  GI H  S P+TPE NG+ ERKHRHIV M L+LLS +S+P  +W +AF+ A YLINRLP+P L  +SPF+ LF +PP+Y  L VFGCACY  L
Subjt:  LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLL

Query:  RPY
        RPY
Subjt:  RPY

Arabidopsis top hits | e value | % identity
No hits found
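
In the BLAST-style alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and other positions with a space. The sketch below counts identities and positives for one short block, using the final block of the KAA8524269.1 alignment as input. The helper name `midline_counts` is hypothetical, and the counts cover only this block, so they are not expected to reproduce the % identity reported for the full alignment.

```python
def midline_counts(midline, length):
    """Return (identities, positives) for one alignment block.

    The midline is padded to the block length because trailing
    spaces are often lost when alignments are copied as text.
    """
    padded = midline.ljust(length)
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives


query = "FGCACYHLLRPY"
midline = "FG ACY LL+PY"
identities, positives = midline_counts(midline, len(query))
print(identities, positives, len(query))
```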

Sequences
CDS sequence
ATGGCTGCTTCTACTTCGCACTCTTCTCCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATATAGGGTGTAATGCACATCTTACTAGTGA
CCTTGCAAACTTGGGCATTTCTACTGCTTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTC
CCAATGCCTCTTTTACTTTATCTAATCTTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGAT
TCATCCTCTTTTACCATTCAGGACAAATCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGT
AACCCTTACGGCCCAAGTTGGTATCAAGGCTTCCACTATTGTGTGGCATGATTGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTCTATTC
CAGTTAGTCGGTCTGATATTGGTGTTTGTAAACATTGTCTTGATGGCAAGTTGTCTAAACAACCTTTTCTCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTG
CATAGTGATGCATGGGGCCCTGCTCCTGATAAATCAATAAATGGTCATCGCTATTATGTTTCTTTTGTTGATGATTTTTCACGCTATACTTGGATCTTTCCCATGTGTTA
CAAATCTGATGTGTTTTCTATTTTTAGTCAGTTTCTGCCATTTGCTAAAAACCTACTTTCTTCCCGCCTTAACGTTTTTCGTAGTGATGGGGGTGGTGAATATCTTAGCA
ATGACCTTAAAAATTTATTTGCTAATCAGGGTATACTTCACCAAAAATCTTGTCCTTACACCCCTGAGCAAAATGGTATCGATGAACGTAAACATCGACATATTGTCAAT
ATGGCATTGTCATTACTATCTAAATCATCTATTCCTATGCGGTTCTGGTTCTTCGCCTTTGCAACTGCCGAATATCTCATAAATCGCCTACCGTCTCCAAACTTAGCTCA
CAAATCTCCTTTTGAACTTCTCTTTAAAAAACCTCCAGATTATACTTCTCTTCATGTTTTTGGGTGTGCCTGTTATCATTTATTACGTCCTTATTGA
mRNA sequence
ATGGCTGCTTCTACTTCGCACTCTTCTCCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATATAGGGTGTAATGCACATCTTACTAGTGA
CCTTGCAAACTTGGGCATTTCTACTGCTTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTC
CCAATGCCTCTTTTACTTTATCTAATCTTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGAT
TCATCCTCTTTTACCATTCAGGACAAATCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGT
AACCCTTACGGCCCAAGTTGGTATCAAGGCTTCCACTATTGTGTGGCATGATTGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTCTATTC
CAGTTAGTCGGTCTGATATTGGTGTTTGTAAACATTGTCTTGATGGCAAGTTGTCTAAACAACCTTTTCTCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTG
CATAGTGATGCATGGGGCCCTGCTCCTGATAAATCAATAAATGGTCATCGCTATTATGTTTCTTTTGTTGATGATTTTTCACGCTATACTTGGATCTTTCCCATGTGTTA
CAAATCTGATGTGTTTTCTATTTTTAGTCAGTTTCTGCCATTTGCTAAAAACCTACTTTCTTCCCGCCTTAACGTTTTTCGTAGTGATGGGGGTGGTGAATATCTTAGCA
ATGACCTTAAAAATTTATTTGCTAATCAGGGTATACTTCACCAAAAATCTTGTCCTTACACCCCTGAGCAAAATGGTATCGATGAACGTAAACATCGACATATTGTCAAT
ATGGCATTGTCATTACTATCTAAATCATCTATTCCTATGCGGTTCTGGTTCTTCGCCTTTGCAACTGCCGAATATCTCATAAATCGCCTACCGTCTCCAAACTTAGCTCA
CAAATCTCCTTTTGAACTTCTCTTTAAAAAACCTCCAGATTATACTTCTCTTCATGTTTTTGGGTGTGCCTGTTATCATTTATTACGTCCTTATTGA
Protein sequence
MAASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFD
SSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELL
HSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVN
MALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
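
The relationship between the CDS and protein sequences can be spot-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (the page does not name the translation table) and uses only the first 30 and last 12 nucleotides of the CDS listed above. `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCTGCTTCTACTTCGCACTCTTCTCCA"  # first 30 nt of the CDS
cds_end = "CGTCCTTATTGA"                      # last 12 nt of the CDS
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first ten residues of the protein sequence and to its last three residues followed by the stop codon.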