; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G11060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G11060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy1-copia retrotransposon protein
Genome locationChr2:11298333..11299541
RNA-Seq ExpressionCSPI02G11060
SyntenyCSPI02G11060
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha]5.9e-10553.25Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF  SG E+K             APK+WYEKF+ TL  +G+ VN+SD+CVYSK S    +LICLYVDDMLIFG ++  
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        I  TK FLSS FEMKDLGEAD+ILGVKI++  N +SL QSHY E +LKKFD F+++ V+TPYD S  LKKN  ESVS+ +YAKIIG VM+LMN+TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        Y VSRLSRYTHNPS+ HW A+  L+R L+GT+   LH+NKFP+VLEGYCDANWVTDNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAIS+AKN VYNGK+RHIRLRH  VKH+LK   + ++FVRSE+NL D  TK L +KMV  +S  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

KAG7551885.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]5.9e-9748.88Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF   GQENK             APKQW+EKF++TL+ NGF  N  DTCV+SK+     ++ICLYVDDMLI GT+L++
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        + DTK+FLSS F+MKDLGEAD+ILG+K+ K ++  SL+QSHY E +LKKF  +D    ++PYD S +L +N+GESV++ +YAK+IG VMYLMN TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        Y VSRLSRYTHNP   HW AL  L+R LKGTI+++L ++    VLE YCDANW +DNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAI++A N++YNGK+RHIR+RH V++ L++   + LEFVRS KN+ D LTKGL  +MVLD++  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

Query:  F
        F
Subjt:  F

KAG7571733.1 Integrase catalytic core [Arabidopsis suecica]2.5e-9548.39Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF   GQENK             APKQW+EKF++TL+ NGF  N  DTCV+SK+     ++ICLYVDDMLI GT+L++
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        + DTK+FLSS F+MKDLGEAD+ILG+K+ K ++  SL+QSHY E +LKKF  +D    ++PYD S +L +N+GESV++ +YAK+IG VMYLMN TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTHNP   HW AL  L+R LKGTI+++L ++    VLE YCDANW +DNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAI++A N++YNGK+RHIR+RH V++ L++   + LEFVRS KN+ D LTKGL  +MVLD++  MG   
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

Query:  FGD
         GD
Subjt:  FGD

KAG7592238.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.5e-9748.64Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF   GQENK             APKQW+EKF++TL+ NGF  N  DTCV+SK+     ++ICLYVDDMLI GT+L++
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        + DTK+FLSS F+MKDLGEAD+ILG+K+ K ++  SL+QSHY E +LKKF  +D    ++PYD S +L +N+GESV++ +YAK+IG VMYLMN TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        Y VSRLSRYTHNP   HW AL  L+R LKGTI+++L ++    VLE YCDANW +DNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAI++A N++YNGK+RHIR+RH V++ L++   + LEFVRS KN+ D LTKGL  +MVLD++  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

Query:  FGD
        F +
Subjt:  FGD

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]2.5e-14066.5Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNG+L EEIYM QPEGFK SGQENK             APKQWYEKFN+TLI NGFK+NSSDTCVYSKM GA+CILICLYVDDMLIFGTN++L
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        I DTK FLSSHFEMKDLGEAD+ILGVKI KN+  LSL QSHY E +LKKFDSFDVS VRTP+D SK+LKKNKG+SVS+P+YAKIIG VMYLMN+TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTHNP+RYHWDALRHLLR LKGTI+Y LHF KFP+VLEGYCDANWVTDNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  ------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKPF
                                      AAI  AKNSVYNGK RH+RLRH VVK LLKE TI LEFVRSEKNL D LTKGL+RKMVLDSS+NMGLKPF
Subjt:  ------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKPF

Query:  GDP
        GDP
Subjt:  GDP

TrEMBL top hitse value%identityAlignment
A0A151TB60 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)3.5e-9547.25Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNG+L EEIYM QPEG +  GQENK             APKQW+EKF+  L+N+GF  +S+D CVY+K    +C++ICLYVDDMLIFGT  D+
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        +  TK FL+S+F+MKD+GEA +ILGVK+ +  + + LSQ HY E LLKKFD +D   V TPYDV+  LK+NKG+S+++ +YA+IIG +++LMN +RP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTH P++ HW+AL  L+R L+GT++Y + ++ FP+VLEGY DANW++D+DE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       +AI+IAKN  YNGK RHI+LRH +VK LLK+ TI + +V+SE NL D LTK L RKM+ ++S  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

A0A2N9EQT1 Integrase catalytic domain-containing protein2.6e-10653Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF   GQENK             APKQW+EKF+ TL++NGF VN SD CVYSK SGA  ++ICLYVDDMLIFGT+++ 
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        +K+TK FLSS+F+MKDLGEAD+ILG++I +N   L+LSQSHY E +LKKF+ +D   VRTPYD S +LKKN G  VS+ +YAKIIG VM+LMN TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTHNP+  HW+A+  LL+ LKGT+N  L +   P+VLEGYCDANW++DNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAI+ AKN +YNGK+RHIRLRH +V+ L+    I +EFVRSEKNL D LTKGL+RK+V D+S  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

A0A2N9H4B0 Uncharacterized protein2.6e-10653Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF   GQENK             APKQW+EKF+ TL++NGF VN SD CVYSK SGA  ++ICLYVDDMLIFGT+++ 
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        +K+TK FLSS+F+MKDLGEAD+ILG++I +N   L+LSQSHY E +LKKF+ +D   VRTPYD S +LKKN G  VS+ +YAKIIG VM+LMN TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTHNP+  HW+A+  LL+ LKGT+N  L +   P+VLEGYCDANW++DNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAI+ AKN +YNGK+RHIRLRH +V+ L+    I +EFVRSEKNL D LTKGL+RK+V D+S  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

A0A5D3C5T2 Ty1-copia retrotransposon protein1.2e-14066.5Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNG+L EEIYM QPEGFK SGQENK             APKQWYEKFN+TLI NGFK+NSSDTCVYSKM GA+CILICLYVDDMLIFGTN++L
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        I DTK FLSSHFEMKDLGEAD+ILGVKI KN+  LSL QSHY E +LKKFDSFDVS VRTP+D SK+LKKNKG+SVS+P+YAKIIG VMYLMN+TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        YAVSRLSRYTHNP+RYHWDALRHLLR LKGTI+Y LHF KFP+VLEGYCDANWVTDNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  ------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKPF
                                      AAI  AKNSVYNGK RH+RLRH VVK LLKE TI LEFVRSEKNL D LTKGL+RKMVLDSS+NMGLKPF
Subjt:  ------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKPF

Query:  GDP
        GDP
Subjt:  GDP

A0A5N6PGV2 Reverse transcriptase Ty1/copia-type domain-containing protein2.9e-10553.25Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNGDL EEIYM QPEGF  SG E+K             APK+WYEKF+ TL  +G+ VN+SD+CVYSK S    +LICLYVDDMLIFG ++  
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENK-------------APKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA
        I  TK FLSS FEMKDLGEAD+ILGVKI++  N +SL QSHY E +LKKFD F+++ V+TPYD S  LKKN  ESVS+ +YAKIIG VM+LMN+TRP+IA
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIA

Query:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------
        Y VSRLSRYTHNPS+ HW A+  L+R L+GT+   LH+NKFP+VLEGYCDANWVTDNDE                                         
Subjt:  YAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDE-----------------------------------------

Query:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP
                                       AAIS+AKN VYNGK+RHIRLRH  VKH+LK   + ++FVRSE+NL D  TK L +KMV  +S  MGLKP
Subjt:  -------------------------------AAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-3227.68Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQE-----------NKAPKQWYEKFNDTLINNGFKVNSSDTCVY--SKMSGAECILICLYVDDMLIFGTNLDL
        MDVK AFLNG L EEIYM  P+G   +               +A + W+E F   L    F  +S D C+Y   K +  E I + LYVDD++I   ++  
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQE-----------NKAPKQWYEKFNDTLINNGFKVNSSDTCVY--SKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVS-KYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNI
        + + K +L   F M DL E    +G++IE  E+ + LSQS Y + +L KF+  + + V TP      Y   N  E  + P    +IG +MY+M  TRP++
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVS-KYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNI

Query:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNK---FPSVLEGYCDANW-------------------------------------------
          AV+ LSRY+   +   W  L+ +LR LKGTI+  L F K   F + + GY D++W                                           
Subjt:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNK---FPSVLEGYCDANW-------------------------------------------

Query:  ----------------------------VTDNDEAAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMG
                                    + ++++  ISIA N   + + +HI +++   +  ++   I LE++ +E  L D+ TK L     ++    +G
Subjt:  ----------------------------VTDNDEAAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMG

Query:  L
        L
Subjt:  L

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-4339.44Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSK-MSGAECILICLYVDDMLIFGTNLD
        +DVK AFL+GDL EEIYM QPEGF+ +G+++             +AP+QWY KF+  + +  +    SD CVY K  S    I++ LYVDDMLI G +  
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSK-MSGAECILICLYVDDMLIFGTNLD

Query:  LIKDTKLFLSSHFEMKDLGEADIILGVKI--EKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNK---------GESVSKPKYAKIIGRV
        LI   K  LS  F+MKDLG A  ILG+KI  E+    L LSQ  Y E +L++F+  +   V TP  ++ +LK +K           +++K  Y+  +G +
Subjt:  LIKDTKLFLSSHFEMKDLGEADIILGVKI--EKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNK---------GESVSKPKYAKIIGRV

Query:  MYLMNHTRPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDEAAISIAKNSVYNG
        MY M  TRP+IA+AV  +SR+  NP + HW+A++ +LR L+GT    L F     +L+GY DA+   D D    S      ++G
Subjt:  MYLMNHTRPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSVLEGYCDANWVTDNDEAAISIAKNSVYNG

P25600 Putative transposon Ty5-1 protein YCL074W5.9e-2331.37Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGF-----------KFSGQE--NKAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        MDV  AFLN  + E IY+ QP GF            + G     +AP  W E  N+TL   GF  +  +  +Y + +    I I +YVDD+L+   +  +
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGF-----------KFSGQE--NKAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNEN-DLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSK-PKYAKIIGRVMYLMNHTRPN
            K  L+  + MKDLG+ D  LG+ I ++ N D++LS   Y      + +     L +TP   SK L +     +     Y  I+G++++  N  RP+
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNEN-DLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSK-PKYAKIIGRVMYLMNHTRPN

Query:  IAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDAN
        I+Y VS LSR+   P   H ++ R +LR L  T +  L +     + L  YCDA+
Subjt:  IAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDAN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-3334.48Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        +DV  AFL G L +++YM+QP GF    + N             +AP+ WY +  + L+  GF  + SDT ++    G   + + +YVDD+LI G +  L
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKP-KYAKIIGRVMYLMNHTRPNI
        + +T   LS  F +KD  E    LG++ ++    L LSQ  Y   LL + +      V TP   S  L    G  ++ P +Y  I+G + YL   TRP+I
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKP-KYAKIIGRVMYLMNHTRPNI

Query:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDNDE
        +YAV+RLS++ H P+  H  AL+ +LR L GT N+ +   K  ++ L  Y DA+W  D D+
Subjt:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDNDE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.4e-3434.48Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL
        +DV  AFL G L +E+YM+QP GF    + +             +AP+ WY +    L+  GF  + SDT ++    G   I + +YVDD+LI G +  L
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDL

Query:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKP-KYAKIIGRVMYLMNHTRPNI
        +K T   LS  F +K+  +    LG++ ++    L LSQ  Y   LL + +      V TP   S  L  + G  +  P +Y  I+G + YL   TRP++
Subjt:  IKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKP-KYAKIIGRVMYLMNHTRPNI

Query:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDNDE
        +YAV+RLS+Y H P+  HW+AL+ +LR L GT ++ +   K  ++ L  Y DA+W  D D+
Subjt:  AYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDNDE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.9e-2729.92Show/hide
Query:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-----------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGT
        +D+  AFLNGDL EEIYM  P G+     ++                 +A +QW+ KF+ TLI  GF  + SD   + K++    + + +YVDD++I   
Subjt:  MDVKIAFLNGDLGEEIYMAQPEGFKFSGQEN-----------------KAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGT

Query:  NLDLIKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVS-KYLKKNKGESVSKPKYAKIIGRVMYLMNHT
        N   + + K  L S F+++DLG     LG++I ++   +++ Q  Y   LL +           P D S  +   + G+ V    Y ++IGR+MYL   T
Subjt:  NLDLIKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVS-KYLKKNKGESVSKPKYAKIIGRVMYLMNHT

Query:  RPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDND
        R +I++AV++LS+++  P   H  A+  +L  +KGT+   L ++    + L+ + DA++ +  D
Subjt:  RPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANWVTDND

ATMG00810.1 DNA/RNA polymerases superfamily protein4.8e-2034.86Show/hide
Query:  ICLYVDDMLIFGTNLDLIKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSK----P
        + LYVDD+L+ G++  L+      LSS F MKDLG     LG++I+ + + L LSQ+ Y E +L      D   + TP      LK N   S +K     
Subjt:  ICLYVDDMLIFGTNLDLIKDTKLFLSSHFEMKDLGEADIILGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSK----P

Query:  KYAKIIGRVMYLMNHTRPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANW
         +  I+G + YL   TRP+I+YAV+ + +  H P+   +D L+ +LR +KGTI + L+ +K   + ++ +CD++W
Subjt:  KYAKIIGRVMYLMNHTRPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTINYSLHFNKFPSV-LEGYCDANW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAAAAATTGCATTTCTAAATGGTGATTTAGGAGAAGAAATCTATATGGCTCAACCAGAAGGATTCAAATTTTCTGGCCAAGAAAACAAAGCTCCTAAGCAATG
GTATGAAAAATTTAATGACACTTTGATAAACAATGGATTTAAGGTAAATTCCTCAGACACATGTGTTTATTCAAAGATGTCTGGAGCTGAGTGCATACTAATATGTCTAT
ATGTCGATGACATGTTGATCTTTGGAACAAATTTGGATTTAATAAAGGACACCAAGTTGTTCCTGTCATCACACTTTGAAATGAAAGACCTAGGTGAAGCAGACATAATT
CTTGGAGTTAAAATTGAGAAAAATGAAAATGATTTATCTTTATCTCAATCTCATTATAATGAGATTTTACTAAAGAAATTTGACTCATTCGATGTCTCTCTGGTGAGAAC
TCCCTATGATGTTAGTAAATACCTTAAAAAGAACAAAGGAGAGAGTGTATCTAAACCTAAATATGCTAAGATCATAGGTCGTGTTATGTACTTAATGAACCACACTAGAC
CAAATATTGCATATGCTGTAAGTAGATTGAGTAGATATACACATAATCCTAGTAGATACCACTGGGATGCCTTACGTCATCTGTTGAGATGTCTTAAAGGAACTATAAAC
TACAGTTTGCACTTCAATAAGTTTCCTTCTGTATTAGAAGGATATTGTGATGCAAATTGGGTTACAGATAACGATGAGGCAGCCATAAGTATCGCCAAGAACAGTGTTTA
TAATGGGAAGAGAAGACACATACGTCTTAGACATGGAGTCGTGAAACACTTACTGAAGGAAAGAACTATTTTCTTAGAATTTGTTCGATCTGAGAAAAACCTGGTTGATC
TTTTAACCAAAGGACTGTCTAGAAAAATGGTTTTAGATTCCTCAATAAACATGGGACTAAAGCCCTTCGGAGATCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAAAAATTGCATTTCTAAATGGTGATTTAGGAGAAGAAATCTATATGGCTCAACCAGAAGGATTCAAATTTTCTGGCCAAGAAAACAAAGCTCCTAAGCAATG
GTATGAAAAATTTAATGACACTTTGATAAACAATGGATTTAAGGTAAATTCCTCAGACACATGTGTTTATTCAAAGATGTCTGGAGCTGAGTGCATACTAATATGTCTAT
ATGTCGATGACATGTTGATCTTTGGAACAAATTTGGATTTAATAAAGGACACCAAGTTGTTCCTGTCATCACACTTTGAAATGAAAGACCTAGGTGAAGCAGACATAATT
CTTGGAGTTAAAATTGAGAAAAATGAAAATGATTTATCTTTATCTCAATCTCATTATAATGAGATTTTACTAAAGAAATTTGACTCATTCGATGTCTCTCTGGTGAGAAC
TCCCTATGATGTTAGTAAATACCTTAAAAAGAACAAAGGAGAGAGTGTATCTAAACCTAAATATGCTAAGATCATAGGTCGTGTTATGTACTTAATGAACCACACTAGAC
CAAATATTGCATATGCTGTAAGTAGATTGAGTAGATATACACATAATCCTAGTAGATACCACTGGGATGCCTTACGTCATCTGTTGAGATGTCTTAAAGGAACTATAAAC
TACAGTTTGCACTTCAATAAGTTTCCTTCTGTATTAGAAGGATATTGTGATGCAAATTGGGTTACAGATAACGATGAGGCAGCCATAAGTATCGCCAAGAACAGTGTTTA
TAATGGGAAGAGAAGACACATACGTCTTAGACATGGAGTCGTGAAACACTTACTGAAGGAAAGAACTATTTTCTTAGAATTTGTTCGATCTGAGAAAAACCTGGTTGATC
TTTTAACCAAAGGACTGTCTAGAAAAATGGTTTTAGATTCCTCAATAAACATGGGACTAAAGCCCTTCGGAGATCCATAA
Protein sequenceShow/hide protein sequence
MDVKIAFLNGDLGEEIYMAQPEGFKFSGQENKAPKQWYEKFNDTLINNGFKVNSSDTCVYSKMSGAECILICLYVDDMLIFGTNLDLIKDTKLFLSSHFEMKDLGEADII
LGVKIEKNENDLSLSQSHYNEILLKKFDSFDVSLVRTPYDVSKYLKKNKGESVSKPKYAKIIGRVMYLMNHTRPNIAYAVSRLSRYTHNPSRYHWDALRHLLRCLKGTIN
YSLHFNKFPSVLEGYCDANWVTDNDEAAISIAKNSVYNGKRRHIRLRHGVVKHLLKERTIFLEFVRSEKNLVDLLTKGLSRKMVLDSSINMGLKPFGDP