CuGenDBv2

CSPI02G16360 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G16360
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr2:15759836..15761626
RNA-Seq Expression: CSPI02G16360
Synteny: CSPI02G16360
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
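
The genome location field uses a "chromosome:start..end" form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the span could be parsed and measured as follows. The helpers parse_location and span_length are hypothetical names for illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr2:15759836..15761626")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 1791 bp, a multiple of three.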


Homology
GenBank top hits (e value, % identity, alignment)
RVW16151.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 3.8e-194; % identity: 59.33)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISS LPQN+WGEA+LT+NYLLN++P KK++   YE WKGRK SY +L++WGCLAKV +P PK VKIGPKTIDCIFIGYA NS AYRF+V++S+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA
         DIH NTIM SRNA+FFE++FP    C+++ +  S   +         VE+    E R+SKR+R  KSFGPD+LT++LE EPQTFKEA++S E   WKEA
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA

Query:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN
        + SEI+SI+ NHTWE V+LP   KP   KWIFK K+K DGSIDKYKARLV KGY+Q EGLDYF TYSPVTRI SIRM++A +AL   EIHQMDVKT FLN
Subjt:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN

Query:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------
        G+LDEEIYM+QPE                                 + M+++GFKINECDKCVYVK+ EH +VIV                         
Subjt:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------

Query:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY
                         K  RT   L+LSQSHY+DKIL K+ K    +A+TP+D +LHLSKN G+S++Q+EYSRIIGSLM +MSCTRPDIAYAV KLSRY
Subjt:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY

Query:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        TSNPG  HW+ I+RVL YL+ T++YG++YTRYPAVLEGYSD NWIS+ KDSKS SGY+FTLGG AVSWKSSKQT IARSTME EFIALDK G+EA
Subjt:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

RVW67960.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 5.2e-207; % identity: 62.71)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAML+SS LPQNLWGEALL++NY+LN++PHKK+    YE WKG K  YK+LKVWGCLAKV +PKPK VKIGPKTIDCIFIGYA+NS AYRF+V+KS+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYH---NRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA FFE IFP+    E   QKR+ D          N+ + E   +++LR+ KR R S SFGPD+LTYLLEN+PQTFKEAMSSPEASYW
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYH---NRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEA+NSEIESI+ NHTWE V+LP  +K  GCKWIFK K+K +GSIDKYKARLVAKGYKQ+EGLDYF TYSPV++ITSIRMLIA +A++  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------
        FLNGELDEEIYM QPE                                 +AM++NGFKINECDKCVYVKN +  +VIV                      
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------

Query:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL
                            K S+T  GL+LSQSHYI+KIL+K+ K++I   KT +D +LHL KN G   +QLEYSRIIGSLM +M+CT PDIAY+VSKL
Subjt:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL

Query:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        SR+TSNPG +HWK I+RVLGYLK+T+NYG+++TRYPAVLEGYSD NWIS +KDSKSTSGY+FTLGG AVSWKSSKQTCIARSTME EFIA+DKAGEEA
Subjt:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

RVW71630.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 5.9e-195; % identity: 59.5)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISS LPQN+WGEA+LT+NYLLN++P KK++   YE WKGRK SY +L++WGCLAKV +P PK VKIGPKTIDCIFIGYA NS AYRF+V++S+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA
         DIH NTIM SRNA+FFE++FP    C+++ +  S   +         VE+    E R+SKR+R  KSFGPD+LT++LE EPQTFKEA++S E   WKEA
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA

Query:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN
        + SEI+SI+ NHTWE V+LP   KP   KWIFK K+K DGSIDKYKARLV KGY+Q EGLDYF TYSPVTRI SIRM++A +AL   EIHQMDVKT FLN
Subjt:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN

Query:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------
        G+LDEEIYM+QPE                                 + M+++GFKINECDKCVYVK+ EH +VIV                         
Subjt:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------

Query:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY
                         K  RT   L+LSQSHY+DKIL K+ K    +A+TP+D +LHLSKN G+S++Q+EYSR+IGSLM +MSCTRPDIAYAVSKLSRY
Subjt:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY

Query:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        TSNPG  HW+ I+RVL YL+ T++YG++YTRYPAVLEGYSD NWIS+ KDSKS SGY+FTLGG AVSWKSSKQT IARSTME EFIALDK GEEA
Subjt:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

RVW97088.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 4.5e-195; % identity: 59.66)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISS LPQN+WGEA+LT+NYLLN++P KK++   YE WKGRK SY +L++WGCLAKV +P PK VKIGPKTIDCIFIGYA NS AYRF+V++S+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA
         DIH NTIM SRNA+FFE++FP    C+++ +  S   +         VE+    E R+SKR+R  KSFGPD+LT++LE EPQTFKEA++S E   WKEA
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA

Query:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN
        + SEI+SI+ NHTWE V+LP   KP   KWIFK K+K DGSIDKYKARLV KGY+Q EGLDYF TYSPVTRI SIRM++A +AL   EIHQMDVKT FLN
Subjt:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN

Query:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------
        G+LDEEIYM+QPE                                 + M+++GFKINECDKCVYVK+ EH +VIV                         
Subjt:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------

Query:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY
                         K  RT   L+LSQSHY+DKIL K+ K    +A+TP+D +LHLSKN G+S++Q+EYSRIIGSLM +MSCTRPDIAYAVSKLSRY
Subjt:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY

Query:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        TSNPG  HW+ I+RVL YL+ T++YG++YTRYPAVLEGYSD NWIS+ KDSKS SGY+FTLGG AVSWKSSKQT IARSTME EFIALDK GEEA
Subjt:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

RVX15136.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 7.7e-195; % identity: 59.5)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISS LPQN+WGEA+LT+NYLLN++P KK++   YE WKGRK SY +L++WGCLAKV +P PK VKIGPKTIDCIFIGYA NS AYRF+V++S+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA
         DIH NTIM SRNA+FFE++FP    C+++ +  S   +         VE+    E R+SKR+R  KSFGPD+LT++LE+EPQTFKEA++S E+  WKEA
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEA

Query:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN
        + SEI+SI+ NHTWE V+LP   KP   KWIFK K+K DGSIDKYKARLV KGY+Q EGLDYF TYSPVTRI SIRM++A +AL   EIHQMDVKT FLN
Subjt:  VNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN

Query:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------
        G+LDEEIYM+QPE                                 + M+++GFKINECDKCVYVK+ EH +VIV                         
Subjt:  GELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS------------------------

Query:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY
                         K  RT   L+LSQSHY+DKIL K+ K    +A+TP+D +LHLSKN G+S++Q+EYSRIIGSLM +MSCTRPDIAYAV KLSRY
Subjt:  -----------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRY

Query:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        TSNPG  HW+ I+RVL YL+ T++YG++YTRYPAVLEGYSD NWIS+ KDSKS SGY+FTLGG AVSWKSSKQT IARSTME EFIALDK GEEA
Subjt:  TSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

TrEMBL top hits (e value, % identity, alignment)
A0A2N9F2Q4 Integrase catalytic domain-containing protein (e value: 1.2e-206; % identity: 62.54)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISSGLPQNLWGEA+L++NY+LN++P KK     YE WKGR  SY+FLKVWGCLAKV +P PK VKIGPKT+DC+FIGYA NS AYRF++HKSDI
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA+FFE IFP K   E       KR+ ++ +S  H++  +E  N  E R+SKR + SK+FGPD+LT++LE+EPQ+FKEAMS PEA  W
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEAVNSEIESI+ NHTWE V+LP   KP G KWIFK K+K DGSIDKYKARLV KGYKQ+EG+DYF TYSPVTRITSIRMLIA +AL+  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------
        FLNGELDEEIYM+QPE                                 +AMM+NGF+INECDKCVYVKN    +VIV                      
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------

Query:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL
                            K +RT  GLVLSQ+HYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLM IM+CTRPDIAY+VSKL
Subjt:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL

Query:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        SRYTSNPG DHWK I+RVL YLK+T NYG++YTRYPAVLEGYSD NWIS T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTME EFIALDKAGEEA
Subjt:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

A0A2N9GPK3 Integrase catalytic domain-containing protein (e value: 1.6e-209; % identity: 65.38)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISSGLPQNLWGEA+L++NY+LN++P KK     YE WKGR  SY+FLKVWGCLAKV +P PK VKIGPKT+DC+FIGYA NS AYRF++HKSDI
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA+FFE IFP K   E       KR+ ++ +S  H++  +E  N  E R+SKR + SK+FGPD+LT++LE+EPQ+FKEAMS PEA  W
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEAVNSEIESI+ NHTWE V+LP   KP G KWIFK K+K DGSIDKYKARLV KGYKQ+EG+DYF TYSPVTRITSIRMLIA +AL+  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------KNSRTP
        FLNGELDEEIYM+QPE                                 +AMM+NGF+INECDKCVY+KN    +VIV                K +RT 
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------KNSRTP

Query:  QGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTK
         GLVLSQSHYI K+L+K+ +++    KTPID +LHL+KN G+ I+QLEYS+IIGSLM IM+CTRPDIAY+VSKLSRYTSNPG DHWK I+RVL YLK+T 
Subjt:  QGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTK

Query:  NYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        NYG++YTRYPAVLEGYSD NWIS T D+KSTSGY+FTLGG AVSWKSSKQTCIARST+E EFIALDKAGEEA
Subjt:  NYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

A0A2N9GRE4 Integrase catalytic domain-containing protein (e value: 5.6e-207; % identity: 62.71)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISSGLPQNLWGEA+L++NY+LN++P KK     YE WKGR  SY+FLKVWGCLAKV +P PK VKIGPKT+DC+FIGYA NS AYRF++HKSDI
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA+FFE IFP K   E       KR+ ++ +S  H++  +E  N  E R+SKR + SK+FGPD+LT++LE+EPQ+FKEAMS PEA  W
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEAVNSEIESI+ NHTWE V+LP   KP G KWIFK K+K DGSIDKYKARLV KGYKQ+EG+DYF TYSPVTRITSIRMLIA +AL+  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------
        FLNGELDEEIYM+QPE                                 +AMM+NGF+INECDKCVYVKN    +VIV                      
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------

Query:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL
                            K +RT  GLVLSQSHYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLM IM+CTRPDIAY+VSKL
Subjt:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL

Query:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        SRYTSNPG DHWK I+RVL YLK+T NYG++YTRYPAVLEGYSD NWIS T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTME EFIALDKAGEEA
Subjt:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

A0A2N9HDE2 Integrase catalytic domain-containing protein (e value: 5.6e-207; % identity: 62.71)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAMLISSGLPQNLWGEA+L++NY+LN++P KK     YE WKGR  SY+FLKVWGCLAKV +P PK VKIGPKT+DC+FIGYA NS AYRF++HKSDI
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA+FFE IFP K   E       KR+ ++ +S  H++  +E  N  E R+SKR + SK+FGPD+LT++LE+EPQ+FKEAMS PEA  W
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEA---RLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEAVNSEIESI+ NHTWE V+LP   KP G KWIFK K+K DGSIDKYKARLV KGYKQ+EG+DYF TYSPVTRITSIRMLIA +AL+  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------
        FLNGELDEEIYM+QPE                                 +AMM+NGF+INECDKCVYVKN    +VIV                      
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------

Query:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL
                            K +RT  GLVLSQSHYI K+L+K+ +++    KTP+D +LHL+KN G+ I+QLEYS+IIGSLM IM+CTRPDIAY+VSKL
Subjt:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL

Query:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        SRYTSNPG DHWK I+RVL YLK+T NYG++YTRYPAVLEGYSD NWIS T D+KSTSGY+FTLGG AVSWKSSKQTCIARSTME EFIALDKAGEEA
Subjt:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

A0A438G6X1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.5e-207; % identity: 62.71)
Query:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        MMNAML+SS LPQNLWGEALL++NY+LN++PHKK+    YE WKG K  YK+LKVWGCLAKV +PKPK VKIGPKTIDCIFIGYA+NS AYRF+V+KS+I
Subjt:  MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYH---NRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW
         D+HVNTI+ SRNA FFE IFP+    E   QKR+ D          N+ + E   +++LR+ KR R S SFGPD+LTYLLEN+PQTFKEAMSSPEASYW
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQKRSFDAITSEYH---NRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW

Query:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT
        KEA+NSEIESI+ NHTWE V+LP  +K  GCKWIFK K+K +GSIDKYKARLVAKGYKQ+EGLDYF TYSPV++ITSIRMLIA +A++  EIHQMDVKT 
Subjt:  KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTT

Query:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------
        FLNGELDEEIYM QPE                                 +AM++NGFKINECDKCVYVKN +  +VIV                      
Subjt:  FLNGELDEEIYMQQPE---------------------------------SAMMANGFKINECDKCVYVKNNEHDHVIVS---------------------

Query:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL
                            K S+T  GL+LSQSHYI+KIL+K+ K++I   KT +D +LHL KN G   +QLEYSRIIGSLM +M+CT PDIAY+VSKL
Subjt:  --------------------KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKL

Query:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA
        SR+TSNPG +HWK I+RVLGYLK+T+NYG+++TRYPAVLEGYSD NWIS +KDSKSTSGY+FTLGG AVSWKSSKQTCIARSTME EFIA+DKAGEEA
Subjt:  SRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.1e-53; % identity: 26.47)
Query:  MLISSGLPQNLWGEALLTSNYLLNRIPHK---KSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASN------SCAYRFIV
        M+  + L ++ WGEA+LT+ YL+NRIP +    S    YE W  +K   K L+V+G    V + K K  K   K+   IF+GY  N      +   +FIV
Subjt:  MLISSGLPQNLWGEALLTSNYLLNRIPHK---KSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASN------SCAYRFIV

Query:  HKSDISDIHVNTIMVSRNATFFENIF-------PHKMF------------------CE-------------ARLQKRSFDAITSEYHNRS----NVELT-
         +  + D    T MV+  A  FE +F        +K F                  C+                   S   I +E+ N S    N++   
Subjt:  HKSDISDIHVNTIMVSRNATFFENIF-------PHKMF------------------CE-------------ARLQKRSFDAITSEYHNRS----NVELT-

Query:  -----------------------------NNEELRQSKRMRISKSFGPDYLT--------------------------------------YLLENEPQTF
                                     N  E R+S+     K  G D  T                                       +  + P +F
Subjt:  -----------------------------NNEELRQSKRMRISKSFGPDYLT--------------------------------------YLLENEPQTF

Query:  KEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALH
         E     + S W+EA+N+E+ +   N+TW     P        +W+F  K    G+  +YKARLVA+G+ Q+  +DY  T++PV RI+S R +++    +
Subjt:  KEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALH

Query:  GFEIHQMDVKTTFLNGELDEEIYMQQPE--SAMMANGFKIN----------------------EC-------DKCVYVKN--------------------
          ++HQMDVKT FLNG L EEIYM+ P+  S    N  K+N                      EC       D+C+Y+ +                    
Subjt:  GFEIHQMDVKTTFLNGELDEEIYMQQPE--SAMMANGFKIN----------------------EC-------DKCVYVKN--------------------

Query:  -----------------------NEHDHVIVSKNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSC
                               NE  H I  +       + LSQS Y+ KIL K+         TP+   ++    N D         +IG LM IM C
Subjt:  -----------------------NEHDHVIVSKNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSC

Query:  TRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTR---YPAVLEGYSDVNWISSTKDSKSTSGYIFTL-GGGAVSWKSSKQTCIARSTM
        TRPD+  AV+ LSRY+S    + W+ + RVL YLK T +  + + +   +   + GY D +W  S  D KST+GY+F +     + W + +Q  +A S+ 
Subjt:  TRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTR---YPAVLEGYSDVNWISSTKDSKSTSGYIFTL-GGGAVSWKSSKQTCIARSTM

Query:  EFEFIALDKAGEEA
        E E++AL +A  EA
Subjt:  EFEFIALDKAGEEA

P0CV72 Secreted RxLR effector protein 161 (e value: 7.1e-26; % identity: 46.4)
Query:  YSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRY-PAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKS
        Y   +G++M +M  TRPD+A AV  LS++ S+P   HW+ + RVL YL+ T+ YG+ +TR   A L GYSD +W    +  +STSGY+F L GG VSW+S
Subjt:  YSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRY-PAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKS

Query:  SKQTCIARSTMEFEFIALDKAGEEA
         KQ  +A S+ E E++AL +A +EA
Subjt:  SKQTCIARSTMEFEFIALDKAGEEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.8e-80; % identity: 31.16)
Query:  MNAMLISSGLPQNLWGEALLTSNYLLNRIPH-KKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI
        + +ML  + LP++ WGEA+ T+ YL+NR P    +  I    W  +++SY  LKV+GC A   +PK +  K+  K+I CIFIGY      YR       +
Subjt:  MNAMLISSGLPQNLWGEALLTSNYLLNRIPH-KKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDI

Query:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQK---RSFDAITSEYHNRSNVELTNNE------------------------------------ELRQSK
         D     ++ SR+  F E+          +++     +F  I S  +N ++ E T +E                                     LR+S+
Subjt:  SDIHVNTIMVSRNATFFENIFPHKMFCEARLQK---RSFDAITSEYHNRSNVELTNNE------------------------------------ELRQSK

Query:  RMRISKSFGP--DYLTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEG
        R R+     P  +Y+    + EP++ KE +S PE +   +A+  E+ES+  N T++ V LP   +P  CKW+FK K   D  + +YKARLV KG++Q++G
Subjt:  RMRISKSFGP--DYLTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEG

Query:  LDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPE---------------------------------SAMMANGFKINEC
        +D+   +SPV ++TSIR +++ +A    E+ Q+DVKT FL+G+L+EEIYM+QPE                                 S M +  +     
Subjt:  LDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPE---------------------------------SAMMANGFKINEC

Query:  DKCVYVK----NN-------EHDHVIVSKN---------------------------------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGS
        D CVY K    NN         D +IV K+                                  RT + L LSQ  YI+++L+++         TP+ G 
Subjt:  DKCVYVK----NN-------EHDHVIVSKN---------------------------------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGS

Query:  LHLSK-------NNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTK
        L LSK           ++A++ YS  +GSLM  M CTRPDIA+AV  +SR+  NPG++HW+ +  +L YL+ T    + +     +L+GY+D +      
Subjt:  LHLSK-------NNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTK

Query:  DSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEE
        + KS++GY+FT  GGA+SW+S  Q C+A ST E E+IA  + G+E
Subjt:  DSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.9e-48; % identity: 31.6)
Query:  ENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPP------GCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRI
        E+EP+T  +A+       W+ A+ SEI + + NHTW+     LV  PP      GC+WIF  K  +DGS+++YKARLVAKGY Q+ GLDY  T+SPV + 
Subjt:  ENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPP------GCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRI

Query:  TSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQP---------------------------------ESAMMANGFKINECDKCVY--------
        TSIR+++  +    + I Q+DV   FL G L +++YM QP                                  + ++  GF  +  D  ++        
Subjt:  TSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQP---------------------------------ESAMMANGFKINECDKCVY--------

Query:  --------------------------------VKNNEHDHVIVS-KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LE
                                        VK++E  H  +  +  R P GL LSQ  YI  +L +          TP+  S  LS  +G  +    E
Subjt:  --------------------------------VKNNEHDHVIVS-KNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQ-LE

Query:  YSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKS
        Y  I+GSL   ++ TRPDI+YAV++LS++   P  +H + + R+L YL  T N+GI   +   + L  YSD +W     D  ST+GYI  LG   +SW S
Subjt:  YSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKS

Query:  SKQTCIARSTMEFEFIALDKAGEE
         KQ  + RS+ E E+ ++     E
Subjt:  SKQTCIARSTMEFEFIALDKAGEE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.9e-51; % identity: 32.48)
Query:  YLTYLLEN-EPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPP------GCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVT
        Y T L  N EP+T  +AM       W++A+ SEI + + NHTW+     LV  PP      GC+WIF  K  +DGS+++YKARLVAKGY Q+ GLDY  T
Subjt:  YLTYLLEN-EPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPP------GCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVT

Query:  YSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQP---------------------------------ESAMMANGFKINECDKCVYV
        +SPV + TSIR+++  +    + I Q+DV   FL G L +E+YM QP                                  + ++  GF  +  D  ++V
Subjt:  YSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQP---------------------------------ESAMMANGFKINECDKCVYV

Query:  KNNEH---------DHVIVSKN--------------------------------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGD
                      D ++++ N                                 R PQGL LSQ  Y   +L +          TP+  S  L+ ++G 
Subjt:  KNNEH---------DHVIVSKN--------------------------------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGD

Query:  SIAQ-LEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGG
         +    EY  I+GSL   ++ TRPD++YAV++LS+Y   P  DHW  + RVL YL  T ++GI   +   + L  YSD +W   T D  ST+GYI  LG 
Subjt:  SIAQ-LEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGG

Query:  GAVSWKSSKQTCIARSTMEFEFIALDKAGEE
          +SW S KQ  + RS+ E E+ ++     E
Subjt:  GAVSWKSSKQTCIARSTMEFEFIALDKAGEE

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.1e-53; % identity: 29.81)
Query:  AITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLK
        A   +Y+  S   LT ++  +     ++S  +    +      EP T+ EA    E   W  A++ EI ++   HTWE   LP   KP GCKW++K K  
Subjt:  AITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLK

Query:  TDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPES-----------------------
        +DG+I++YKARLVAKGY QQEG+D+  T+SPV ++TS+++++A SA++ F +HQ+D+   FLNG+LDEEIYM+ P                         
Subjt:  TDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPES-----------------------

Query:  --------------AMMANGFKINECDKCVYVKNNE----------HDHVIVSKN-------------------------------SRTPQGLVLSQSHY
                       ++  GF  +  D   ++K              D +I S N                               +R+  G+ + Q  Y
Subjt:  --------------AMMANGFKINECDKCVYVKNNE----------HDHVIVSKN-------------------------------SRTPQGLVLSQSHY

Query:  IDKILKKYTKHEIVIAKTPIDGSLHLSKNN-GDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRY
           +L +        +  P+D S+  S ++ GD +    Y R+IG LM  +  TR DI++AV+KLS+++  P   H + ++++L Y+K T   G+ Y+  
Subjt:  IDKILKKYTKHEIVIAKTPIDGSLHLSKNN-GDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRY

Query:  PAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEE
          + L+ +SD ++ S     +ST+GY   LG   +SWKS KQ  +++S+ E E+ AL  A +E
Subjt:  PAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEE

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e value: 5.4e-05; % identity: 28.95)
Query:  MSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGY
        ++ TRPD+ +AV++LS+++S       + + +VL Y+K T   G+ Y+    + L+ ++D +W S     +S +G+
Subjt:  MSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 1.5e-15; % identity: 29.34)
Query:  PQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHT
        P GL LSQ+ Y ++IL      +     TP+   L+ S +        ++  I+G+L   ++ TRPDI+YAV+ + +    P    + ++ RVL Y+K T
Subjt:  PQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHT

Query:  KNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIAL
          +G+   +   + ++ + D +W   T   +ST+G+   LG   +SW + +Q  ++RS+ E E+ AL
Subjt:  KNYGINYTRYPAV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIAL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value: 3.3e-18; % identity: 40.57
Query:  LTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRIT
        +T  ++ EP++   A+  P    W +A+  E++++  N TW  V  P+     GCKW+FK KL +DG++D+ KARLVAKG+ Q+EG+ +  TYSPV R  
Subjt:  LTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRIT

Query:  SIRMLI
        +IR ++
Subjt:  SIRMLI


Sequences
CDS sequence
ATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTGTTAACATCAAATTATTTATTAAACAGAATTCCTCATAAGAAGTCACAAAA
TATTTCTTATGAAAAATGGAAAGGAAGAAAACTTTCATATAAATTCTTAAAAGTATGGGGGTGCCTAGCAAAGGTTGTCATGCCTAAACCTAAAATGGTTAAAATTGGAC
CAAAAACTATTGATTGCATATTCATTGGTTATGCTAGTAACAGTTGTGCATATCGATTTATAGTTCATAAATCAGATATTTCAGATATACATGTTAATACAATCATGGTA
TCGAGGAATGCAACATTCTTTGAGAATATTTTTCCTCATAAAATGTTTTGTGAAGCAAGGTTACAAAAACGTTCGTTTGATGCTATAACGAGTGAATATCACAATAGATC
AAATGTTGAGTTAACGAACAATGAGGAACTCCGACAAAGTAAAAGGATGAGGATCTCAAAATCATTTGGTCCTGATTATTTAACTTATTTGTTAGAAAACGAACCTCAAA
CATTTAAAGAGGCCATGTCCTCTCCTGAAGCTTCATATTGGAAAGAGGCTGTGAATAGTGAAATTGAGTCCATTATGCATAACCATACTTGGGAACCAGTTAATCTTCCA
TTAGTAAGTAAACCACCTGGTTGCAAGTGGATTTTCAAATGGAAATTGAAGACCGATGGGTCAATAGATAAATATAAAGCAAGACTTGTCGCTAAAGGTTACAAGCAACA
AGAAGGACTTGACTATTTTGTTACATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAACATCAGCTTTGCATGGATTTGAGATACATCAGATGGATG
TCAAGACGACATTTTTAAACGGTGAGTTAGATGAAGAGATCTACATGCAACAACCCGAAAGTGCAATGATGGCCAATGGGTTTAAAATCAATGAATGTGACAAATGTGTA
TATGTCAAAAACAATGAGCATGACCATGTCATTGTATCAAAAAATTCTAGAACCCCACAAGGGTTAGTGCTATCTCAATCACATTATATTGATAAAATATTGAAGAAATA
TACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAATAATGGAGATAGTATAGCACAATTGGAATACTCTCGCATCATTGGTA
GTTTGATGTGCATCATGAGTTGTACACGTCCTGATATAGCGTATGCGGTAAGCAAATTAAGTCGCTATACAAGTAATCCAGGTCGTGATCATTGGAAAGTTATCTTGAGA
GTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATAAACTATACTCGATATCCTGCTGTACTTGAAGGCTATAGTGATGTCAATTGGATATCGAGCACTAAAGACTC
CAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAAACATGTATAGCACGATCCACAATGGAATTTGAATTTATAGCCT
TAGACAAGGCTGGAGAAGAAGCATAA
mRNA sequence
AATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTGTTAACATCAAATTATTTATTAAACAGAATTCCTCATAAGAAGTCACAAA
ATATTTCTTATGAAAAATGGAAAGGAAGAAAACTTTCATATAAATTCTTAAAAGTATGGGGGTGCCTAGCAAAGGTTGTCATGCCTAAACCTAAAATGGTTAAAATTGGA
CCAAAAACTATTGATTGCATATTCATTGGTTATGCTAGTAACAGTTGTGCATATCGATTTATAGTTCATAAATCAGATATTTCAGATATACATGTTAATACAATCATGGT
ATCGAGGAATGCAACATTCTTTGAGAATATTTTTCCTCATAAAATGTTTTGTGAAGCAAGGTTACAAAAACGTTCGTTTGATGCTATAACGAGTGAATATCACAATAGAT
CAAATGTTGAGTTAACGAACAATGAGGAACTCCGACAAAGTAAAAGGATGAGGATCTCAAAATCATTTGGTCCTGATTATTTAACTTATTTGTTAGAAAACGAACCTCAA
ACATTTAAAGAGGCCATGTCCTCTCCTGAAGCTTCATATTGGAAAGAGGCTGTGAATAGTGAAATTGAGTCCATTATGCATAACCATACTTGGGAACCAGTTAATCTTCC
ATTAGTAAGTAAACCACCTGGTTGCAAGTGGATTTTCAAATGGAAATTGAAGACCGATGGGTCAATAGATAAATATAAAGCAAGACTTGTCGCTAAAGGTTACAAGCAAC
AAGAAGGACTTGACTATTTTGTTACATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAACATCAGCTTTGCATGGATTTGAGATACATCAGATGGAT
GTCAAGACGACATTTTTAAACGGTGAGTTAGATGAAGAGATCTACATGCAACAACCCGAAAGTGCAATGATGGCCAATGGGTTTAAAATCAATGAATGTGACAAATGTGT
ATATGTCAAAAACAATGAGCATGACCATGTCATTGTATCAAAAAATTCTAGAACCCCACAAGGGTTAGTGCTATCTCAATCACATTATATTGATAAAATATTGAAGAAAT
ATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAATAATGGAGATAGTATAGCACAATTGGAATACTCTCGCATCATTGGT
AGTTTGATGTGCATCATGAGTTGTACACGTCCTGATATAGCGTATGCGGTAAGCAAATTAAGTCGCTATACAAGTAATCCAGGTCGTGATCATTGGAAAGTTATCTTGAG
AGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATAAACTATACTCGATATCCTGCTGTACTTGAAGGCTATAGTGATGTCAATTGGATATCGAGCACTAAAGACT
CCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAAACATGTATAGCACGATCCACAATGGAATTTGAATTTATAGCC
TTAGACAAGGCTGGAGAAGAAGCATAA
Protein sequence
MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAKVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMV
SRNATFFENIFPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLP
LVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPESAMMANGFKINECDKCV
YVKNNEHDHVIVSKNSRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILR
VLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIALDKAGEEA