CuGenDBv2

Cmc04g0103701 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0103701
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integrase catalytic domain-containing protein
Genome location: CMiso1.1chr04:20875303..20876133
RNA-Seq Expression: Cmc04g0103701
Synteny: Cmc04g0103701
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
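
The genome location listed above has the form `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its coordinate convention), the span length can be computed as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end = parse_location("CMiso1.1chr04:20875303..20876133")
print(seqid, span_length(start, end))  # CMiso1.1chr04 831
```

Under that assumption the span is 831 bp, which equals the length of the CDS sequence listed on this page.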


Homology
GenBank top hits (e value, % identity, alignment)
KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha]; e value: 4.8e-111; % identity: 71.38
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QPEGF +SG E+KVCKLRKSLYG KQAPK+WYEKF+ TL  +G+ +N+SD+CVYSK      +LICLYVDDMLIFG +M  
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I  TK FLSS FEMKDL EADVIL VKI++    +SLCQSHY+E++LKKFD F+++PV+TP+D S  L KN  +SVSQ EYAKIIGSVM+LMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        Y VSRLSRYTHNP + HW A+  L+RYL+GT++ CLH+NKFPAVLEGY DANWVTDNDEV+STSGYVF++GGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

KAE8670806.1 hypothetical protein F3Y22_tig00112079pilonHSYRG00011 [Hibiscus syriacus]; e value: 6.0e-106; % identity: 69.2
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QP GF+  G E KV +L+KSLYG KQAPKQWYEKF+ T+++ GF +N SD CVYSK+F  +C++I LYVDDMLIF +N+E 
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I   K FLS+ FEM  L E DVIL V++ K +   SLCQ+HY++K+LKKFDSFDV PVRTP+D S HL KNKG SVSQ EYAK+IGS+M+LMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP   HW AL+ LL+YLKGT+D+ L F  FPAVLEGY DANWV+DNDEV+STSGYVF LGG AIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

KAG7551885.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]; e value: 3.7e-103; % identity: 68.12
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL EEIYM QPEGF I GQENKVCKL KSLYG KQAPKQW+EKF+NTL+ NGF  N  DTCV+SKV     ++ICLYVDDMLI GT++E+
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        + DTK+FLSS F+MKDL EADVIL +K+ K  +  SL QSHY+EKILKKF  +D    ++P+D+S HL +N+G+SV+Q EYAK+IGSVMYLMN TRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        Y VSRLSRYTHNP  NHW AL  L+RYLKGTID+ L ++    VLE Y DANW +DNDEVNSTSG+VF L GGAI+
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

KAG7571733.1 Integrase catalytic core [Arabidopsis suecica]; e value: 1.3e-103; % identity: 68.48
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL EEIYM QPEGF I GQENKVCKL KSLYG KQAPKQW+EKF+NTL+ NGF  N  DTCV+SKV     ++ICLYVDDMLI GT++E+
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        + DTK+FLSS F+MKDL EADVIL +K+ K  +  SL QSHY+EKILKKF  +D    ++P+D+S HL +N+G+SV+Q EYAK+IGSVMYLMN TRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP  NHW AL  L+RYLKGTID+ L ++    VLE Y DANW +DNDEVNSTSG+VF L GGAI+
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]; e value: 7.1e-147; % identity: 94.57
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNG+LEEEIYMTQPEGFKISGQENKVCKLRKSLYG KQAPKQWYEKFNNTLITNGFKINSSDTCVYSK+ GADCILICLYVDDMLIFGTNMEL
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I DTK FLSSHFEMKDL EADVIL VKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHL KNKGDSVSQ EYAKIIGSVMYLMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP+R HWDALRHLLRYLKGTIDYCLHF KFPAVLEGY DANWVTDNDEVNSTSGYVFLLGGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EQT1 Integrase catalytic domain-containing protein; e value: 2.8e-109; % identity: 70.29
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QPEGF + GQENKVCKLRKSLYG KQAPKQW+EKF+ TL++NGF +N SD CVYSK  GA  ++ICLYVDDMLIFGT+M  
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        + +TK FLSS+F+MKDL EAD+IL ++I +N   L+L QSHY+EK+LKKF+ +D  PVRTP+D S HL KN G  VSQ EYAKIIGSVM+LMN TRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP   HW+A+  LL+YLKGT++  L +   PAVLEGY DANW++DNDE NSTSGYVF LGGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

A0A2N9H4B0 Uncharacterized protein; e value: 2.8e-109; % identity: 70.29
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QPEGF + GQENKVCKLRKSLYG KQAPKQW+EKF+ TL++NGF +N SD CVYSK  GA  ++ICLYVDDMLIFGT+M  
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        + +TK FLSS+F+MKDL EAD+IL ++I +N   L+L QSHY+EK+LKKF+ +D  PVRTP+D S HL KN G  VSQ EYAKIIGSVM+LMN TRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP   HW+A+  LL+YLKGT++  L +   PAVLEGY DANW++DNDE NSTSGYVF LGGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

A0A5D3C5T2 Ty1-copia retrotransposon protein; e value: 3.4e-147; % identity: 94.57
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNG+LEEEIYMTQPEGFKISGQENKVCKLRKSLYG KQAPKQWYEKFNNTLITNGFKINSSDTCVYSK+ GADCILICLYVDDMLIFGTNMEL
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I DTK FLSSHFEMKDL EADVIL VKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHL KNKGDSVSQ EYAKIIGSVMYLMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP+R HWDALRHLLRYLKGTIDYCLHF KFPAVLEGY DANWVTDNDEVNSTSGYVFLLGGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

A0A5N6PGV2 Reverse transcriptase Ty1/copia-type domain-containing protein; e value: 2.3e-111; % identity: 71.38
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QPEGF +SG E+KVCKLRKSLYG KQAPK+WYEKF+ TL  +G+ +N+SD+CVYSK      +LICLYVDDMLIFG +M  
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I  TK FLSS FEMKDL EADVIL VKI++    +SLCQSHY+E++LKKFD F+++PV+TP+D S  L KN  +SVSQ EYAKIIGSVM+LMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        Y VSRLSRYTHNP + HW A+  L+RYL+GT++ CLH+NKFPAVLEGY DANWVTDNDEV+STSGYVF++GGGAIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

A0A6A2Y4J7 Uncharacterized protein; e value: 2.9e-106; % identity: 69.2
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDVKT FLNGDL+EEIYM QP GF+  G E KV +L+KSLYG KQAPKQWYEKF+ T+++ GF +N SD CVYSK+F  +C++I LYVDDMLIF +N+E 
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA
        I   K FLS+ FEM  L E DVIL V++ K +   SLCQ+HY++K+LKKFDSFDV PVRTP+D S HL KNKG SVSQ EYAK+IGS+M+LMNYTRPDIA
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIA

Query:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        YAVSRLSRYTHNP   HW AL+ LL+YLKGT+D+ L F  FPAVLEGY DANWV+DNDEV+STSGYVF LGG AIS
Subjt:  YAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 5.2e-44; % identity: 39.56
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFG--ADCILICLYVDDMLIFGTNM
        MDVKT FLNG L+EEIYM  P+G  IS   + VCKL K++YG KQA + W+E F   L    F  +S D C+Y    G   + I + LYVDD++I   +M
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFG--ADCILICLYVDDMLIFGTNM

Query:  ELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPD
          + + K +L   F M DL E    + ++I   +  + L QS YV+KIL KF+  + + V TP  +  +      D         +IG +MY+M  TRPD
Subjt:  ELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPD

Query:  IAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNK---FPAVLEGYRDANWVTDNDEVNSTSGYVF
        +  AV+ LSRY+   +   W  L+ +LRYLKGTID  L F K   F   + GY D++W     +  ST+GY+F
Subjt:  IAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNK---FPAVLEGYRDANWVTDNDEVNSTSGYVF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.5e-64; % identity: 46.15
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGA-DCILICLYVDDMLIFGTNME
        +DVKT FL+GDLEEEIYM QPEGF+++G+++ VCKL KSLYG KQAP+QWY KF++ + +  +    SD CVY K F   + I++ LYVDDMLI G +  
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGA-DCILICLYVDDMLIFGTNME

Query:  LIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTS--LSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNK-------NKGDSVSQLEYAKIIGSVMY
        LI   K  LS  F+MKDL  A  IL +KI + +TS  L L Q  Y+E++L++F+  +  PV TP      L+K        +  +++++ Y+  +GS+MY
Subjt:  LIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTS--LSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNK-------NKGDSVSQLEYAKIIGSVMY

Query:  LMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
         M  TRPDIA+AV  +SR+  NP + HW+A++ +LRYL+GT   CL F     +L+GY DA+   D D   S++GY+F   GGAIS
Subjt:  LMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

P25600 Putative transposon Ty5-1 protein YCL074W; e value: 3.5e-32; % identity: 32.97
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        MDV T FLN  ++E IY+ QP GF      + V +L   +YG KQAP  W E  NNTL   GF  +  +  +Y +      I I +YVDD+L+   + ++
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKT-SLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQL-EYAKIIGSVMYLMNYTRPD
            K  L+  + MKDL + D  L + I ++    ++L    Y+ K   + +       +TP   SK L +     +  +  Y  I+G +++  N  RPD
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKT-SLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQL-EYAKIIGSVMYLMNYTRPD

Query:  IAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNK-FPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        I+Y VS LSR+   P   H ++ R +LRYL  T   CL +       L  Y DA+    +D  +ST GYV LL G  ++
Subjt:  IAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNK-FPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.3e-50; % identity: 40.29
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        +DV   FL G L +++YM+QP GF    + N VCKLRK+LYG KQAP+ WY +  N L+T GF  + SDT ++    G   + + +YVDD+LI G +  L
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQ-LEYAKIIGSVMYLMNYTRPDI
        + +T   LS  F +KD EE    L ++ ++  T L L Q  Y+  +L + +     PV TP   S  L+   G  ++   EY  I+GS+ YL  +TRPDI
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQ-LEYAKIIGSVMYLMNYTRPDI

Query:  AYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        +YAV+RLS++ H P   H  AL+ +LRYL GT ++ +   K   + L  Y DA+W  D D+  ST+GY+  LG   IS
Subjt:  AYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.1e-49; % identity: 39.21
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL
        +DV   FL G L +E+YM+QP GF    + + VC+LRK++YG KQAP+ WY +    L+T GF  + SDT ++    G   I + +YVDD+LI G +  L
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMEL

Query:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSV-SQLEYAKIIGSVMYLMNYTRPDI
        +  T   LS  F +K+ E+    L ++ ++    L L Q  Y   +L + +     PV TP   S  L  + G  +    EY  I+GS+ YL  +TRPD+
Subjt:  IIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSV-SQLEYAKIIGSVMYLMNYTRPDI

Query:  AYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        +YAV+RLS+Y H P  +HW+AL+ +LRYL GT D+ +   K   + L  Y DA+W  D D+  ST+GY+  LG   IS
Subjt:  AYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 1.6e-40; % identity: 34.75
Query:  MDVKTTFLNGDLEEEIYMTQPEGFKI----SGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGT
        +D+   FLNGDL+EEIYM  P G+      S   N VC L+KS+YG KQA +QW+ KF+ TLI  GF  + SD   + K+     + + +YVDD++I   
Subjt:  MDVKTTFLNGDLEEEIYMTQPEGFKI----SGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGT

Query:  NMELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNK-GDSVSQLEYAKIIGSVMYLMNYT
        N   + + K  L S F+++DL      L ++I ++   +++CQ  Y   +L +       P   P D S   + +  GD V    Y ++IG +MYL   T
Subjt:  NMELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNK-GDSVSQLEYAKIIGSVMYLMNYT

Query:  RPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        R DI++AV++LS+++  P   H  A+  +L Y+KGT+   L ++    + L+ + DA++ +  D   ST+GY   LG   IS
Subjt:  RPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS

ATMG00240.1 Gag-Pol-related retrotransposon family protein; e value: 1.7e-05; % identity: 31.33
Query:  MYLMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLL
        MYL   TRPD+ +AV+RLS+++         A+  +L Y+KGT+   L ++    + L+ + D++W +  D   S +G+  L+
Subjt:  MYLMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLL

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 1.5e-22; % identity: 32.47
Query:  ICLYVDDMLIFGTNMELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAK
        + LYVDD+L+ G++  L+      LSS F MKDL      L ++I+ + + L L Q+ Y E+IL      D  P+ TP     + + +        ++  
Subjt:  ICLYVDDMLIFGTNMELIIDTKLFLSSHFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAK

Query:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
        I+G++ YL   TRPDI+YAV+ + +  H P    +D L+ +LRY+KGTI + L+ +K   + ++ + D++W        ST+G+   LG   IS
Subjt:  IIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRNHWDALRHLLRYLKGTIDYCLHFNKFPAV-LEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS


Sequences
CDS sequence
ATGGACGTAAAAACAACTTTTTTAAATGGTGACCTAGAAGAAGAAATTTATATGACACAACCAGAAGGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTGAG
AAAATCCCTATACGGTCCTAAGCAAGCTCCCAAGCAGTGGTATGAAAAATTTAATAATACGTTGATAACCAATGGGTTTAAAATAAATTCCTCTGACACGTGTGTTTATT
CAAAGGTGTTTGGAGCTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAATTGACACTAAGTTATTTCTCTCGTCA
CACTTTGAAATGAAAGACCTAGAAGAAGCAGACGTAATCCTACGTGTTAAAATCAGGAAAAATAAAACTAGTTTGTCTCTTTGTCAATCTCACTACGTGGAGAAAATACT
AAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACGCTAGTAAACATCTTAATAAGAATAAAGGAGATAGTGTGTCTCAACTTGAATATGCAAAGA
TCATAGGTAGTGTGATGTATTTAATGAATTACACTAGACCGGATATTGCATATGCTGTCAGTAGATTAAGTAGATATACACATAATCCTGATAGAAACCACTGGGATGCC
TTACGCCATCTATTGAGATATCTTAAAGGGACAATAGATTACTGTTTACACTTCAACAAATTTCCTGCCGTATTAGAAGGATATCGTGATGCTAACTGGGTCACAGATAA
TGATGAGGTTAACTCTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGA
mRNA sequence
ATGGACGTAAAAACAACTTTTTTAAATGGTGACCTAGAAGAAGAAATTTATATGACACAACCAGAAGGCTTTAAAATCTCTGGGCAAGAAAACAAAGTGTGTAAACTGAG
AAAATCCCTATACGGTCCTAAGCAAGCTCCCAAGCAGTGGTATGAAAAATTTAATAATACGTTGATAACCAATGGGTTTAAAATAAATTCCTCTGACACGTGTGTTTATT
CAAAGGTGTTTGGAGCTGATTGCATATTAATATGTCTATATGTTGATGACATGTTAATCTTTGGAACAAACATGGAGTTAATAATTGACACTAAGTTATTTCTCTCGTCA
CACTTTGAAATGAAAGACCTAGAAGAAGCAGACGTAATCCTACGTGTTAAAATCAGGAAAAATAAAACTAGTTTGTCTCTTTGTCAATCTCACTACGTGGAGAAAATACT
AAAGAAGTTTGATTCCTTTGATGTTTCTCCTGTGAGAACTCCCTTTGACGCTAGTAAACATCTTAATAAGAATAAAGGAGATAGTGTGTCTCAACTTGAATATGCAAAGA
TCATAGGTAGTGTGATGTATTTAATGAATTACACTAGACCGGATATTGCATATGCTGTCAGTAGATTAAGTAGATATACACATAATCCTGATAGAAACCACTGGGATGCC
TTACGCCATCTATTGAGATATCTTAAAGGGACAATAGATTACTGTTTACACTTCAACAAATTTCCTGCCGTATTAGAAGGATATCGTGATGCTAACTGGGTCACAGATAA
TGATGAGGTTAACTCTACTAGTGGGTATGTATTTTTGCTCGGAGGTGGAGCAATATCTTGA
Protein sequence
MDVKTTFLNGDLEEEIYMTQPEGFKISGQENKVCKLRKSLYGPKQAPKQWYEKFNNTLITNGFKINSSDTCVYSKVFGADCILICLYVDDMLIFGTNMELIIDTKLFLSS
HFEMKDLEEADVILRVKIRKNKTSLSLCQSHYVEKILKKFDSFDVSPVRTPFDASKHLNKNKGDSVSQLEYAKIIGSVMYLMNYTRPDIAYAVSRLSRYTHNPDRNHWDA
LRHLLRYLKGTIDYCLHFNKFPAVLEGYRDANWVTDNDEVNSTSGYVFLLGGGAIS
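
The protein sequence corresponds to the CDS read codon by codon. The following is a minimal sketch of that relationship, assuming the standard genetic code (the page does not name the translation table it used). To stay short it uses only the first 36 and the last 15 nucleotides of the CDS listed on this page. `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

BASES = "TCAG"
# Assumption: standard genetic code, amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGACGTAAAAACAACTTTTTTAAATGGTGACCTA"  # first 36 nt of the CDS
cds_end = "GGAGCAATATCTTGA"                         # last 15 nt, including TGA
print(translate(cds_start))  # MDVKTTFLNGDL
print(translate(cds_end))    # GAIS
```

Both fragments agree with the start (MDVKTTFLNGDL) and end (GAIS) of the protein sequence listed on this page.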