CuGenDBv2

Carg26168 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg26168
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Carg_Chr04:11332147..11336925
RNA-Seq Expression: Carg26168
Synteny: Carg26168
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN69607.1 hypothetical protein VITISV_009561 [Vitis vinifera] (e value: 6.5e-180; % identity: 53.35)
Query:  AKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKT
        AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNS FLYYVIFID+                                   KV QSDG  EFT+TCFK 
Subjt:  AKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKT

Query:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFS--------TTAYIINRLPTPLLGAFSPQH-------PLYFLGVIVLL
        HL  SGI+HQLS PYTPAQNGRAERK+RHV E GLALLFHSH+SPRFW DA +        +T++ I   P   LG +SP H       P      I   
Subjt:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFS--------TTAYIINRLPTPLLGAFSPQH-------PLYFLGVIVLL

Query:  IKGSANHILII--------------------LIH----PP--TTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLG
         +    H   +                    L H    PP  T  S HIPRS+SSPC                   S+LPP  S+  SIE  VD SSSLG
Subjt:  IKGSANHILII--------------------LIH----PP--TTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLG

Query:  THPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQ--------------------------------LSQKYSNLQLRILLGL----------LPWMK
        +HPMITRAK GIFK RHP NLG+ G SGLL A LAS                                 L  +  N  +R+   L          L +  
Subjt:  THPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQ--------------------------------LSQKYSNLQLRILLGL----------LPWMK

Query:  KFERYNKMILGLWFLALPTPTSWA----------LNGC-----------------FVLNICLMDPSSV----------------------------SSLV
         F    K       L+L     W           LNG                  F  ++CL+  +                              +SL 
Subjt:  KFERYNKMILGLWFLALPTPTSWA----------LNGC-----------------FVLNICLMDPSSV----------------------------SSLV

Query:  LLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGS
        +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPTPDGLFI QLKYARDILT  QLLDSKPVHTPMVVSQHLT  G 
Subjt:  LLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGS

Query:  PFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN
        PFS+PTLYRSLVG LQYL ITR DIA+AVNSVSQFL+APT+DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGN
Subjt:  PFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN

Query:  NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI
        NLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHDL VPI
Subjt:  NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI

CAN73071.1 hypothetical protein VITISV_032383 [Vitis vinifera] (e value: 1.7e-183; % identity: 54.34)
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK
        +AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNSGFLYYVIFID+                                   KV QSDGG EFT+TCFK
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQH-PLYFLGVIVLLIKGSANHILIILI
         HLR SGI+HQLSCPYT AQNGRAERK+RHV E GLALLFH HLSPRFWV+ F                    QH  LY   V       S +  LI+L 
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQH-PLYFLGVIVLLIKGSANHILIILI

Query:  HPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS---
                                          L G +LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALL+S   
Subjt:  HPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS---

Query:  --------------QLSQKYSNLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPT
                       + ++   LQ      L+P           W+ +         ER    ++   +                        L+L    
Subjt:  --------------QLSQKYSNLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPT

Query:  SWA----------LNGC-----------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVT
         W           LNG                  F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVT
Subjt:  SWA----------LNGC-----------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVT

Query:  GNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTIT
        GNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTIT
Subjt:  GNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTIT

Query:  RPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE
        RPDIA+AVNSVSQFL+APT+DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE
Subjt:  RPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE

Query:  YRALATTTAELLWVTHLLHDLKVPI
        YRALA T AELLW+THLLHDLKVPI
Subjt:  YRALATTTAELLWVTHLLHDLKVPI

KAG7032431.1 putative mitochondrial protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 100)
Query:  MDGRSSQTFSGFRVDFGARSTALWSGNDKLLVEDTSGLKQVTMVDSLGLRASIGLYTVAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYV
        MDGRSSQTFSGFRVDFGARSTALWSGNDKLLVEDTSGLKQVTMVDSLGLRASIGLYTVAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYV
Subjt:  MDGRSSQTFSGFRVDFGARSTALWSGNDKLLVEDTSGLKQVTMVDSLGLRASIGLYTVAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYV

Query:  IFIDEKVSQSDGGTEFTSTCFKTHLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY
        IFIDEKVSQSDGGTEFTSTCFKTHLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY
Subjt:  IFIDEKVSQSDGGTEFTSTCFKTHLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY

Query:  FLGVIVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVDSSSLGTHPMITRAKTGIFKPRHPA
        FLGVIVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVDSSSLGTHPMITRAKTGIFKPRHPA
Subjt:  FLGVIVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVDSSSLGTHPMITRAKTGIFKPRHPA

Query:  NLGMFGSSGLLFALLASQLSQKYSNLQLRILLGLLPWMKKFERYNKMILGLWFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVD
        NLGMFGSSGLLFALLASQLSQKYSNLQLRILLGLLPWMKKFERYNKMILGLWFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVD
Subjt:  NLGMFGSSGLLFALLASQLSQKYSNLQLRILLGLLPWMKKFERYNKMILGLWFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVD

Query:  DIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQ
        DIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQ
Subjt:  DIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQ

Query:  YLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS
        YLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS
Subjt:  YLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS

Query:  SCESEYRALATTTAELLWVTHLLHDLKVPIS
        SCESEYRALATTTAELLWVTHLLHDLKVPIS
Subjt:  SCESEYRALATTTAELLWVTHLLHDLKVPIS

RVW43615.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 2.8e-175; % identity: 54.55)
Query:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------
        HLR SGI+HQLSCPYTPAQNGRAERK+RHV E GLALLFHSHLSPRFWVDAFST  YIINRLPTPLLG  SP   LY                       
Subjt:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------

Query:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC
                     FLG                                              I   ++   +HI      PP+ SS HIPRS+SSPC+IC
Subjt:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC

Query:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS
        SDLVDE V+VDTSLAGS+LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALLAS                  + ++  
Subjt:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS

Query:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----
         LQ      L+P           W+ +         ER    ++   +                        L+L     W           LNG     
Subjt:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----

Query:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK
                     F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHSEFATK
Subjt:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK

Query:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD
        DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFL+APT+D
Subjt:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD

Query:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL
        HFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTV RSSCESEYRALA T AELLW+THLLHDL
Subjt:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL

Query:  KVPI
        KVPI
Subjt:  KVPI

RVW45095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 9.6e-208; % identity: 54.53)
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK
        +AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNSGFLYYVIFID+                                   KV QSDGG EFT+TCFK
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY----------------------
         HLR SGI+HQLSCPYTPAQNGRAERK+RHV E GLALLFHSHLSPRFWVDAFSTT YIINRLPTPLLG  SP   LY                      
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY----------------------

Query:  --------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDI
                      FLG                                              I   ++   +HI      PP+ SS HIPRS+SSPC+I
Subjt:  --------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDI

Query:  CSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQLSQKYSNL------------QL
        CSDLVDE V+VDTSLAGS+LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALLAS   + + +             ++
Subjt:  CSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQLSQKYSNL------------QL

Query:  RILLGLLPWMKKFERYNKMILG-LW-----------------------------------------------FLALPTPTSWA----------LNGC---
        + L     W+      N  I+G  W                                                L+L     W           LNG    
Subjt:  RILLGLLPWMKKFERYNKMILG-LW-----------------------------------------------FLALPTPTSWA----------LNGC---

Query:  --------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFAT
                      F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHS+FAT
Subjt:  --------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFAT

Query:  KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTV
        KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFL+A T+
Subjt:  KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTV

Query:  DHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHD
        DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHD
Subjt:  DHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHD

Query:  LKVPI
        LKVPI
Subjt:  LKVPI

TrEMBL top hits (e value, % identity, alignment)
A0A438E275 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-175; % identity: 54.4)
Query:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------
        HLR SGI+HQLSCPYTPAQNGRAERK+RHV E GLALLFHSHLSPRFWVDAFST  YIIN LPTPLLG  SP   LY                       
Subjt:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------

Query:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC
                     FLG                                              I   ++   +HI      PP+ SS HIPRS+SSPC+IC
Subjt:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC

Query:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS
        SDLVDE V+VDTSLAG +LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLLFALLAS                  + ++  
Subjt:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS

Query:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----
         LQ      L+P           W+ +         ER    ++   +                        L+L     W           LNG     
Subjt:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----

Query:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK
                     F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHSEFATK
Subjt:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK

Query:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD
        DLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFL+APT+D
Subjt:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD

Query:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL
        HFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHDL
Subjt:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL

Query:  KVPI
        KVPI
Subjt:  KVPI

A0A438E763 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.4e-175; % identity: 54.55)
Query:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------
        HLR SGI+HQLSCPYTPAQNGRAERK+RHV E GLALLFHSHLSPRFWVDAFST  YIINRLPTPLLG  SP   LY                       
Subjt:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY-----------------------

Query:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC
                     FLG                                              I   ++   +HI      PP+ SS HIPRS+SSPC+IC
Subjt:  -------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDIC

Query:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS
        SDLVDE V+VDTSLAGS+LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALLAS                  + ++  
Subjt:  SDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS-----------------QLSQKYS

Query:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----
         LQ      L+P           W+ +         ER    ++   +                        L+L     W           LNG     
Subjt:  NLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPTSWA----------LNGC----

Query:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK
                     F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHSEFATK
Subjt:  -------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATK

Query:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD
        DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFL+APT+D
Subjt:  DLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVD

Query:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL
        HFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTV RSSCESEYRALA T AELLW+THLLHDL
Subjt:  HFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL

Query:  KVPI
        KVPI
Subjt:  KVPI

A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.7e-208; % identity: 54.53)
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK
        +AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNSGFLYYVIFID+                                   KV QSDGG EFT+TCFK
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY----------------------
         HLR SGI+HQLSCPYTPAQNGRAERK+RHV E GLALLFHSHLSPRFWVDAFSTT YIINRLPTPLLG  SP   LY                      
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY----------------------

Query:  --------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDI
                      FLG                                              I   ++   +HI      PP+ SS HIPRS+SSPC+I
Subjt:  --------------FLGV---------------------------------------------IVLLIKGSANHILIILIHPPTTSSLHIPRSSSSPCDI

Query:  CSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQLSQKYSNL------------QL
        CSDLVDE V+VDTSLAGS+LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALLAS   + + +             ++
Subjt:  CSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQLSQKYSNL------------QL

Query:  RILLGLLPWMKKFERYNKMILG-LW-----------------------------------------------FLALPTPTSWA----------LNGC---
        + L     W+      N  I+G  W                                                L+L     W           LNG    
Subjt:  RILLGLLPWMKKFERYNKMILG-LW-----------------------------------------------FLALPTPTSWA----------LNGC---

Query:  --------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFAT
                      F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHS+FAT
Subjt:  --------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFAT

Query:  KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTV
        KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFL+A T+
Subjt:  KDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTV

Query:  DHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHD
        DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHD
Subjt:  DHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHD

Query:  LKVPI
        LKVPI
Subjt:  LKVPI

A5B2A6 Integrase catalytic domain-containing protein (e value: 3.1e-180; % identity: 53.35)
Query:  AKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKT
        AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNS FLYYVIFID+                                   KV QSDG  EFT+TCFK 
Subjt:  AKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKT

Query:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFS--------TTAYIINRLPTPLLGAFSPQH-------PLYFLGVIVLL
        HL  SGI+HQLS PYTPAQNGRAERK+RHV E GLALLFHSH+SPRFW DA +        +T++ I   P   LG +SP H       P      I   
Subjt:  HLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFS--------TTAYIINRLPTPLLGAFSPQH-------PLYFLGVIVLL

Query:  IKGSANHILII--------------------LIH----PP--TTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLG
         +    H   +                    L H    PP  T  S HIPRS+SSPC                   S+LPP  S+  SIE  VD SSSLG
Subjt:  IKGSANHILII--------------------LIH----PP--TTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLG

Query:  THPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQ--------------------------------LSQKYSNLQLRILLGL----------LPWMK
        +HPMITRAK GIFK RHP NLG+ G SGLL A LAS                                 L  +  N  +R+   L          L +  
Subjt:  THPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQ--------------------------------LSQKYSNLQLRILLGL----------LPWMK

Query:  KFERYNKMILGLWFLALPTPTSWA----------LNGC-----------------FVLNICLMDPSSV----------------------------SSLV
         F    K       L+L     W           LNG                  F  ++CL+  +                              +SL 
Subjt:  KFERYNKMILGLWFLALPTPTSWA----------LNGC-----------------FVLNICLMDPSSV----------------------------SSLV

Query:  LLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGS
        +  +QS+LIYLLLYVDDIIVTGNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPTPDGLFI QLKYARDILT  QLLDSKPVHTPMVVSQHLT  G 
Subjt:  LLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGS

Query:  PFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN
        PFS+PTLYRSLVG LQYL ITR DIA+AVNSVSQFL+APT+DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGN
Subjt:  PFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN

Query:  NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI
        NLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHDL VPI
Subjt:  NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI

A5C5R8 Integrase catalytic domain-containing protein (e value: 8.0e-184; % identity: 54.34)
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK
        +AK+H+LPYSRNEHRSSHVLDLIHCDLWGPSP+KSNSGFLYYVIFID+                                   KV QSDGG EFT+TCFK
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQH-PLYFLGVIVLLIKGSANHILIILI
         HLR SGI+HQLSCPYT AQNGRAERK+RHV E GLALLFH HLSPRFWV+ F                    QH  LY   V       S +  LI+L 
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQH-PLYFLGVIVLLIKGSANHILIILI

Query:  HPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS---
                                          L G +LPP  S+  SIE   D SSSLG+HPMITRAK GIFK RHPANLG+ GSSGLL ALL+S   
Subjt:  HPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVD-SSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLAS---

Query:  --------------QLSQKYSNLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPT
                       + ++   LQ      L+P           W+ +         ER    ++   +                        L+L    
Subjt:  --------------QLSQKYSNLQLRILLGLLP-----------WMKK--------FERYNKMILGLWF------------------------LALPTPT

Query:  SWA----------LNGC-----------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVT
         W           LNG                  F  ++CL+  +                            + +SL +  +QS+LIYLLLYVDDIIVT
Subjt:  SWA----------LNGC-----------------FVLNICLMDPS----------------------------SVSSLVLLPKQSNLIYLLLYVDDIIVT

Query:  GNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTIT
        GNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTIT
Subjt:  GNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTIT

Query:  RPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE
        RPDIA+AVNSVSQFL+APT+DHFLAVKRI RYVKGTLHFGLTF PST+PS LVAY D DWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE
Subjt:  RPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESE

Query:  YRALATTTAELLWVTHLLHDLKVPI
        YRALA T AELLW+THLLHDLKVPI
Subjt:  YRALATTTAELLWVTHLLHDLKVPI

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.5e-33; % identity: 31)
Query:  RYNKMILGL------WFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNL---IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSY
        + NK I GL      WF         AL  C  +N      SSV   + +  + N+   IY+LLYVDD+++   + + +++F   L  +F   DL  + +
Subjt:  RYNKMILGL------WFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNL---IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSY

Query:  FLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNSVSQFLNAPTVDHFLAVK
        F+G+      D +++SQ  Y + IL++  + +   V TP+    +     S     T  RSL+G L Y+ + TRPD+  AVN +S++ +    + +  +K
Subjt:  FLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNSVSQFLNAPTVDHFLAVK

Query:  RIFRYVKGTLHFGLTFHPS-TVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN-NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI
        R+ RY+KGT+   L F  +    + ++ Y+D+DWAG    R+ST+GY   + + NL+ W+ K+Q +V+ SS E+EY AL     E LW+  LL  + + +
Subjt:  RIFRYVKGTLHFGLTFHPS-TVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGN-NLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.5e-33; % identity: 37.1)
Query:  QSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPF
        ++N I LLLYVDD+++ G +  LI      L   F  KDLG     LG++     T   L++SQ KY   +L R  + ++KPV TP+     L+    P 
Subjt:  QSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPF

Query:  S-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGY
        +           Y S VG+L Y +  TRPDIA+AV  VS+FL  P  +H+ AVK I RY++GT    L F  S    +L  Y D D AG  D R+S++GY
Subjt:  S-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGY

Query:  SIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL
                +SW +K Q  V+ S+ E+EY A   T  E++W+   L +L
Subjt:  SIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.6e-14; % identity: 29.07)
Query:  KSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKTH
        K H++ +  +  R  ++LDL++ D+ GP  ++S  G  Y+V FID+                                   K  +SD G E+TS  F+ +
Subjt:  KSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDE-----------------------------------KVSQSDGGTEFTSTCFKTH

Query:  LRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQ
          + GI H+ + P TP  NG AER  R + E   ++L  + L   FW +A  T  Y+INR P+  L    P+
Subjt:  LRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQ

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 1.8e-63; % identity: 52.65)
Query:  IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q ++ PT+  F  +KR+ RYVKGT+  GL  H ++  + + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSSCESEYRALATTTAELLW
        +QPTVSRSS E+EYRALA T AEL W
Subjt:  KQPTVSRSSCESEYRALATTTAELLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.7e-59; % identity: 44.91)
Query:  FVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSK
        ++L I  ++  S +SL +L +  +++Y+L+YVDDI++TGN+ +L+ +    L   F+ KD   L YFLG+EA   P GL +SQ +Y  D+L R  ++ +K
Subjt:  FVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSK

Query:  PVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWA
        PV TPM  S  L+   G+  +DPT YR +VG+LQYL  TRPDI+YAVN +SQF++ PT +H  A+KRI RY+ GT + G+        S L AY D DWA
Subjt:  PVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWA

Query:  GCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS
        G  D   ST+GY +YLG++ +SWS+KKQ  V RSS E+EYR++A T++E+ W+  LL +L + ++
Subjt:  GCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.4e-20; % identity: 34.27)
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFID-----------EKVSQ------------------------SDGGTEFTSTCFK
        + KS+++P+S++   S+  L+ I+ D+W  SP+ S+  + YYVIF+D           ++ SQ                        SD G EF +    
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFID-----------EKVSQ------------------------SDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY
         +    GI H  S P+TP  NG +ERK+RH+ E GL LL H+ +   +W  AF+   Y+INRLPTPLL   SP   L+
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 5.3e-60; % identity: 43.64)
Query:  RYNKMILGLWFLALPTPTSWALN-GCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASP
        R  K I GL       P +W +    ++L +  ++  S +SL +L +  ++IY+L+YVDDI++TGN++ L+      L   F+ K+   L YFLG+EA  
Subjt:  RYNKMILGLWFLALPTPTSWALN-GCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASP

Query:  TPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKG
         P GL +SQ +Y  D+L R  +L +KPV TPM  S  LT   G+   DPT YR +VG+LQYL  TRPD++YAVN +SQ+++ PT DH+ A+KR+ RY+ G
Subjt:  TPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKG

Query:  TLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS
        T   G+        S L AY D DWAG  D   ST+GY +YLG++ +SWS+KKQ  V RSS E+EYR++A T++EL W+  LL +L + +S
Subjt:  TLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    e value: 7.5e-22    %identity: 36.52
Query:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFID-----------EKVSQ------------------------SDGGTEFTSTCFK
        + KSH++P+S +   SS  L+ I+ D+W  SP+ S   + YYVIF+D           ++ SQ                        SD G EF     +
Subjt:  VAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFID-----------EKVSQ------------------------SDGGTEFTSTCFK

Query:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY
         +L   GI H  S P+TP  NG +ERK+RH+ E+GL LL H+ +   +W  AFS   Y+INRLPTPLL   SP   L+
Subjt:  THLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLY

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    e value: 8.5e-53    %identity: 45.8
Query:  LLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYR
        +L+YVDDII+  NN + +D    +L S F  +DLG L YFLGLE + +  G+ I Q KYA D+L    LL  KP   PM  S   +A  G  F D   YR
Subjt:  LLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYR

Query:  SLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKK
         L+G L YL ITR DI++AVN +SQF  AP + H  AV +I  Y+KGT+  GL F+ S     L  + D  +  C DTRRST+GY ++LG +L+SW +KK
Subjt:  SLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKK

Query:  QPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS
        Q  VS+SS E+EYRAL+  T E++W+     +L++P+S
Subjt:  QPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS

ATMG00240.1 Gag-Pol-related retrotransposon family protein    e value: 8.3e-16    %identity: 51.28
Query:  YLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGY
        YLTITRPD+ +AVN +SQF +A       AV ++  YVKGT+  GL F+ +T    L A+ D+DWA CPDTRRS +G+
Subjt:  YLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein    e value: 1.3e-64    %identity: 52.65
Query:  IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q ++ PT+  F  +KR+ RYVKGT+  GL  H ++  + + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHPSTVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSSCESEYRALATTTAELLW
        +QPTVSRSS E+EYRALA T AEL W
Subjt:  KQPTVSRSSCESEYRALATTTAELLW


Sequences
CDS sequence
ATGGACGGCCGCTCTTCACAGACGTTTTCAGGATTCCGGGTGGACTTTGGGGCTAGGTCCACTGCCTTATGGTCAGGTAATGACAAATTGTTGGTCGAGGACACGAGTGG
TCTCAAGCAAGTAACCATGGTAGATAGCTTAGGTTTGAGAGCCTCCATCGGTTTGTATACGGTTGCAAAAAGTCATCAATTGCCTTATTCCCGCAATGAACATAGGTCGT
CTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCGTCAAATCAAATTCGGGTTTTCTTTATTATGTTATTTTTATTGATGAAAAGGTATCTCAAAGC
GATGGAGGTACCGAATTTACTAGTACTTGTTTCAAAACACATTTACGTAATTCTGGCATCTACCATCAACTCTCTTGTCCATATACACCTGCTCAAAATGGTCGTGCTGA
GAGAAAATATCGTCATGTGAATGAGATTGGCTTGGCTCTTCTCTTTCACTCTCATCTTTCTCCTCGTTTTTGGGTTGATGCCTTCAGCACTACAGCTTATATTATCAATC
GGTTGCCTACTCCACTTCTTGGAGCTTTCTCCCCGCAGCATCCCTTGTATTTTTTGGGGGTTATAGTCCTACTCATAAAGGGTTCCGCTAACCACATCCTTATCATATTG
ATTCATCCCCCTACCACTTCATCATTGCATATTCCTCGATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTGGATGAGTTTGTGGAGGTTGATACTTCTCTTGC
AGGTTCCACTTTGCCACCCTCGACTTCTAATTCGACCTCTATTGAACCTACTGTTGATTCCTCTTCTTTGGGCACTCATCCTATGATCACAAGAGCCAAAACTGGTATAT
TCAAGCCTCGTCATCCAGCAAATCTTGGTATGTTTGGCTCATCTGGACTTCTTTTTGCTCTTCTTGCATCACAACTGAGCCAAAAGTATTCAAATCTGCAACTAAGAATC
CTGCTTGGGTTGCTGCCATGGATGAAGAAATTTGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACACCAACATCGTGGGCTCTAAATGGGTG
TTTCGTATTAAATATTTGCCTGATGGATCCGTCGAGCGTTTCAAGCCTCGTCTTGTTGCCAAAGCAATCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTG
TTACCGGCAACAACTCATCTCTTATTGATAGCTTTACTCTCAAACTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCA
CCCACTCCTGATGGTCTCTTTATTAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTTGATAGCAAACCAGTCCACACTCCCATGGTTGTTTCTCA
ACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTATAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATTACGCGTCCAGATATTGCCTATGCTGTCA
ATTCTGTCAGTCAATTCTTGAATGCCCCTACTGTAGATCACTTTCTTGCTGTCAAACGTATTTTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCATCCA
TCCACTGTTCCTAGTATGCTAGTCGCTTATTTGGATACTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCGT
TTCTTGGAGTGCCAAAAAGCAACCTACGGTCTCACGCTCCAGCTGTGAATCTGAATATCGTGCTCTTGCCACAACTACTGCTGAACTTCTTTGGGTTACGCATCTTTTGC
ATGACCTCAAGGTCCCTATTTCATAG
mRNA sequence
ATGGACGGCCGCTCTTCACAGACGTTTTCAGGATTCCGGGTGGACTTTGGGGCTAGGTCCACTGCCTTATGGTCAGGTAATGACAAATTGTTGGTCGAGGACACGAGTGG
TCTCAAGCAAGTAACCATGGTAGATAGCTTAGGTTTGAGAGCCTCCATCGGTTTGTATACGGTTGCAAAAAGTCATCAATTGCCTTATTCCCGCAATGAACATAGGTCGT
CTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCGTCAAATCAAATTCGGGTTTTCTTTATTATGTTATTTTTATTGATGAAAAGGTATCTCAAAGC
GATGGAGGTACCGAATTTACTAGTACTTGTTTCAAAACACATTTACGTAATTCTGGCATCTACCATCAACTCTCTTGTCCATATACACCTGCTCAAAATGGTCGTGCTGA
GAGAAAATATCGTCATGTGAATGAGATTGGCTTGGCTCTTCTCTTTCACTCTCATCTTTCTCCTCGTTTTTGGGTTGATGCCTTCAGCACTACAGCTTATATTATCAATC
GGTTGCCTACTCCACTTCTTGGAGCTTTCTCCCCGCAGCATCCCTTGTATTTTTTGGGGGTTATAGTCCTACTCATAAAGGGTTCCGCTAACCACATCCTTATCATATTG
ATTCATCCCCCTACCACTTCATCATTGCATATTCCTCGATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTGGATGAGTTTGTGGAGGTTGATACTTCTCTTGC
AGGTTCCACTTTGCCACCCTCGACTTCTAATTCGACCTCTATTGAACCTACTGTTGATTCCTCTTCTTTGGGCACTCATCCTATGATCACAAGAGCCAAAACTGGTATAT
TCAAGCCTCGTCATCCAGCAAATCTTGGTATGTTTGGCTCATCTGGACTTCTTTTTGCTCTTCTTGCATCACAACTGAGCCAAAAGTATTCAAATCTGCAACTAAGAATC
CTGCTTGGGTTGCTGCCATGGATGAAGAAATTTGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACACCAACATCGTGGGCTCTAAATGGGTG
TTTCGTATTAAATATTTGCCTGATGGATCCGTCGAGCGTTTCAAGCCTCGTCTTGTTGCCAAAGCAATCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTG
TTACCGGCAACAACTCATCTCTTATTGATAGCTTTACTCTCAAACTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCA
CCCACTCCTGATGGTCTCTTTATTAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTTGATAGCAAACCAGTCCACACTCCCATGGTTGTTTCTCA
ACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTATAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATTACGCGTCCAGATATTGCCTATGCTGTCA
ATTCTGTCAGTCAATTCTTGAATGCCCCTACTGTAGATCACTTTCTTGCTGTCAAACGTATTTTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCATCCA
TCCACTGTTCCTAGTATGCTAGTCGCTTATTTGGATACTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCGT
TTCTTGGAGTGCCAAAAAGCAACCTACGGTCTCACGCTCCAGCTGTGAATCTGAATATCGTGCTCTTGCCACAACTACTGCTGAACTTCTTTGGGTTACGCATCTTTTGC
ATGACCTCAAGGTCCCTATTTCATAG
Protein sequence
MDGRSSQTFSGFRVDFGARSTALWSGNDKLLVEDTSGLKQVTMVDSLGLRASIGLYTVAKSHQLPYSRNEHRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDEKVSQS
DGGTEFTSTCFKTHLRNSGIYHQLSCPYTPAQNGRAERKYRHVNEIGLALLFHSHLSPRFWVDAFSTTAYIINRLPTPLLGAFSPQHPLYFLGVIVLLIKGSANHILIIL
IHPPTTSSLHIPRSSSSPCDICSDLVDEFVEVDTSLAGSTLPPSTSNSTSIEPTVDSSSLGTHPMITRAKTGIFKPRHPANLGMFGSSGLLFALLASQLSQKYSNLQLRI
LLGLLPWMKKFERYNKMILGLWFLALPTPTSWALNGCFVLNICLMDPSSVSSLVLLPKQSNLIYLLLYVDDIIVTGNNSSLIDSFTLKLHSEFATKDLGSLSYFLGLEAS
PTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLNAPTVDHFLAVKRIFRYVKGTLHFGLTFHP
STVPSMLVAYLDTDWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTTAELLWVTHLLHDLKVPIS