; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0224031 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0224031
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:13810122..13810928
RNA-Seq ExpressionCmc08g0224031
SyntenyCmc08g0224031
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.5e-11981.72Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQW+KAM+L++ESMY NSVWTLVD  +DVK IGCKWIYKRKRDQAGKVQTFKARLV KGY +KEGV+Y+ETFSPV MLKSIRILLSI TFY+YE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKTAFLNGNLEESIYMVQPEGFI + QEQKV KLQKSIYGLKQASRSWNIRFDTAIKSYGFE NVDEPCVYK+I+ S VAFL+L VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        + +LT++KKWL TQFQMKDL  AQY+LGIQIV NRKNK L MSQ SYIDK+LSRYKMQNSK G L +R
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-11478.73Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQWVKAMDL++ESMY NSVW LVD    VK IGCKWIYKRKRD AGKVQTFKARLV KGY ++EGV+Y+ETFSPV MLKSIRILLSI TFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKTAFLNGNLEESI+M QPEGFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIKSYGF+ NVDEPCVYK+I K  VAFLVL VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        +G+LT++K WLA QFQMKDL  AQYVLGIQI+ +RKNK L +SQ +YIDK+L RY MQNSK GLL +R
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

KAA0043610.1 gag/pol protein [Cucumis melo var. makuwa]7.9e-12793.52Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSY
        IGHLTNIKKWLATQFQMKDLENAQYVL       R+ ++ +MS   Y
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSY

KAA0045356.1 gag/pol protein [Cucumis melo var. makuwa]9.6e-12584.64Show/hide
Query:  NDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEI
        NDVD DQW+KAMDLK+ESMYSNSVWTLVDQ N+++ IGCKWIYKRKRDQ  KVQTF+ARLV KGY +KEG++Y+ETFSP+ M+KSIRILLSI TFYDYEI
Subjt:  NDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEI

Query:  CQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDI
         QMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKV KLQKSIYGLKQASRSWNIRFDT IKSYGFE NVDEPCVYKRII STVAFLVL VD+ILLIGN++
Subjt:  CQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDI

Query:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
         HLT+IK+WL TQFQMKDL +AQYVLGIQIV NRKNK L MSQTSYIDKMLSRYKMQNSK GLL YR
Subjt:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-12083.96Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQW+KAMDL++ESMYSNSVWTLVDQ NDVK IGCKWIYKRKRDQAGKVQTFKARLV KGY +KEG++Y+E FS   M+KSIRILLSI TFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKT FLN NLEESIYMVQPE FIQKGQEQK+ KLQKSIYGLKQASRS NIRFDTAIKSYG E NVDEPCVYKRI+ STVAFLVL VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        +GHL +IKKWLA QFQMKDL NAQYVLG+QIV NRKNK L MSQTSYIDKMLSRYKM NSK GLL YR
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

TrEMBL top hitse value%identityAlignment
A0A5A7TK28 Gag/pol protein3.8e-12793.52Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSY
        IGHLTNIKKWLATQFQMKDLENAQYVL       R+ ++ +MS   Y
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSY

A0A5A7TTA2 Gag/pol protein4.7e-12584.64Show/hide
Query:  NDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEI
        NDVD DQW+KAMDLK+ESMYSNSVWTLVDQ N+++ IGCKWIYKRKRDQ  KVQTF+ARLV KGY +KEG++Y+ETFSP+ M+KSIRILLSI TFYDYEI
Subjt:  NDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEI

Query:  CQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDI
         QMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKV KLQKSIYGLKQASRSWNIRFDT IKSYGFE NVDEPCVYKRII STVAFLVL VD+ILLIGN++
Subjt:  CQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDI

Query:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
         HLT+IK+WL TQFQMKDL +AQYVLGIQIV NRKNK L MSQTSYIDKMLSRYKMQNSK GLL YR
Subjt:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

A0A5A7TZD0 Gag/pol protein1.3e-11478.73Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQWVKAMDL++ESMY NSVW LVD    VK IGCKWIYKRKRD AGKVQTFKARLV KGY ++EGV+Y+ETFSPV MLKSIRILLSI TFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKTAFLNGNLEESI+M QPEGFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIKSYGF+ NVDEPCVYK+I K  VAFLVL VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        +G+LT++K WLA QFQMKDL  AQYVLGIQI+ +RKNK L +SQ +YIDK+L RY MQNSK GLL +R
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

A0A5D3BX45 Gag/pol protein9.1e-12183.96Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQW+KAMDL++ESMYSNSVWTLVDQ NDVK IGCKWIYKRKRDQAGKVQTFKARLV KGY +KEG++Y+E FS   M+KSIRILLSI TFYDYE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKT FLN NLEESIYMVQPE FIQKGQEQK+ KLQKSIYGLKQASRS NIRFDTAIKSYG E NVDEPCVYKRI+ STVAFLVL VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        +GHL +IKKWLA QFQMKDL NAQYVLG+QIV NRKNK L MSQTSYIDKMLSRYKM NSK GLL YR
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

E2GK51 Gag/pol protein (Fragment)1.7e-11981.72Show/hide
Query:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE
        MNDVD DQW+KAM+L++ESMY NSVWTLVD  +DVK IGCKWIYKRKRDQAGKVQTFKARLV KGY +KEGV+Y+ETFSPV MLKSIRILLSI TFY+YE
Subjt:  MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYE

Query:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND
        I QMDVKTAFLNGNLEESIYMVQPEGFI + QEQKV KLQKSIYGLKQASRSWNIRFDTAIKSYGFE NVDEPCVYK+I+ S VAFL+L VD+ILLIGND
Subjt:  ICQMDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGND

Query:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR
        + +LT++KKWL TQFQMKDL  AQY+LGIQIV NRKNK L MSQ SYIDK+LSRYKMQNSK G L +R
Subjt:  IGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.1e-4234.88Show/hide
Query:  DCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQM
        D   W +A++ ++ +   N+ WT+  +  +  ++  +W++  K ++ G    +KARLV +G+ +K  ++Y+ETF+PV  + S R +LS+   Y+ ++ QM
Subjt:  DCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQM

Query:  DVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVY---KRIIKSTVAFLVLDVDNILLIGNDI
        DVKTAFLNG L+E IYM  P+G         V KL K+IYGLKQA+R W   F+ A+K   F ++  + C+Y   K  I   + +++L VD++++   D+
Subjt:  DVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVY---KRIIKSTVAFLVLDVDNILLIGNDI

Query:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQN
          + N K++L  +F+M DL   ++ +GI+I   +++KI  +SQ++Y+ K+LS++ M+N
Subjt:  GHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-5744.53Show/hide
Query:  DQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMDV
        +Q +KAM  ++ES+  N  + LV+     + + CKW++K K+D   K+  +KARLV KG+ +K+G+++ E FSPVV + SIR +LS+    D E+ Q+DV
Subjt:  DQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMDV

Query:  KTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVY-KRIIKSTVAFLVLDVDNILLIGNDIGHLT
        KTAFL+G+LEE IYM QPEGF   G++  V KL KS+YGLKQA R W ++FD+ +KS  +     +PCVY KR  ++    L+L VD++L++G D G + 
Subjt:  KTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVY-KRIIKSTVAFLVLDVDNILLIGNDIGHLT

Query:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK
         +K  L+  F MKDL  AQ +LG++IV  R ++ L +SQ  YI+++L R+ M+N+K
Subjt:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK

P25600 Putative transposon Ty5-1 protein YCL074W1.9e-1431.65Show/hide
Query:  MDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGH
        MDV TAFLN  ++E IY+ QP GF+ +     V++L   +YGLKQA   WN   +  +K  GF  +  E  +Y R       ++ + VD++L+       
Subjt:  MDVKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGH

Query:  LTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK
           +K+ L   + MKDL      LG+ I     N  + +S   YI K  S  ++   K
Subjt:  LTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-4337.5Show/hide
Query:  DQWVKAMDLKIESMYSNSVWTLV-DQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMD
        ++W  AM  +I +   N  W LV    + V ++GC+WI+ +K +  G +  +KARLV KGY ++ G++Y ETFSPV+   SIRI+L +     + I Q+D
Subjt:  DQWVKAMDLKIESMYSNSVWTLV-DQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMD

Query:  VKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHLT
        V  AFL G L + +YM QP GFI K +   V KL+K++YGLKQA R+W +     + + GF ++V +  ++      ++ ++++ VD+IL+ GND   L 
Subjt:  VKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHLT

Query:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK
        N    L+ +F +KD E   Y LGI+    R    L +SQ  YI  +L+R  M  +K
Subjt:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-4236.33Show/hide
Query:  DQWVKAMDLKIESMYSNSVWTLV-DQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMD
        D+W +AM  +I +   N  W LV      V ++GC+WI+ +K +  G +  +KARLV KGY ++ G++Y ETFSPV+   SIRI+L +     + I Q+D
Subjt:  DQWVKAMDLKIESMYSNSVWTLV-DQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMD

Query:  VKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHLT
        V  AFL G L + +YM QP GF+ K +   V +L+K+IYGLKQA R+W +   T + + GF +++ +  ++      ++ ++++ VD+IL+ GND   L 
Subjt:  VKTAFLNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHLT

Query:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK
        +    L+ +F +K+ E+  Y LGI+    R  + L +SQ  Y   +L+R  M  +K
Subjt:  NIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-4236.56Show/hide
Query:  WVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMDVKT
        W  AMD +I +M +   W +     + K IGCKW+YK K +  G ++ +KARLV KGY ++EG+++ ETFSPV  L S++++L+I   Y++ + Q+D+  
Subjt:  WVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMDVKT

Query:  AFLNGNLEESIYMVQPEGFIQKGQE----QKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHL
        AFLNG+L+E IYM  P G+  +  +      V  L+KSIYGLKQASR W ++F   +  +GF  +  +   + +I  +    +++ VD+I++  N+   +
Subjt:  AFLNGNLEESIYMVQPEGFIQKGQE----QKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHL

Query:  TNIKKWLATQFQMKDLENAQYVLGIQI
          +K  L + F+++DL   +Y LG++I
Subjt:  TNIKKWLATQFQMKDLENAQYVLGIQI

ATMG00810.1 DNA/RNA polymerases superfamily protein9.0e-0438.16Show/hide
Query:  FLVLDVDNILLIGNDIGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK
        +L+L VD+ILL G+    L  +   L++ F MKDL    Y LGIQI  +     L +SQT Y +++L+   M + K
Subjt:  FLVLDVDNILLIGNDIGHLTNIKKWLATQFQMKDLENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.3e-1437.65Show/hide
Query:  WVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSI
        W +AM  +++++  N  W LV    +  ++GCKW++K K    G +   KARLV KG+ ++EG+ + ET+SPVV   +IR +L++
Subjt:  WVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATGTGGACTGTGATCAATGGGTCAAAGCCATGGACCTCAAAATAGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAATCAAATGATGTAAAACT
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGATCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGATTAGTGGGAAAAGGTTATATTGAAAAGGAGGGAGTGAATT
ATAAAGAAACTTTCTCTCCTGTTGTCATGCTGAAGTCGATCAGAATACTCTTATCTATCCCCACTTTTTATGATTATGAAATTTGCCAGATGGATGTCAAGACAGCCTTT
TTGAATGGAAATCTTGAGGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTATAAGCTTCAAAAATCCATATATGGATTAAA
ACAAGCATCTAGATCCTGGAATATAAGGTTTGATACTGCGATCAAATCTTATGGTTTTGAACACAATGTTGATGAACCTTGTGTTTACAAAAGGATCATTAAATCCACTG
TAGCATTCTTAGTTCTAGATGTAGATAACATTCTACTCATTGGGAATGATATAGGTCATCTAACTAATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTG
GAAAATGCTCAATATGTTCTTGGTATCCAAATAGTTTGGAATCGAAAGAACAAAATATTAGACATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGAT
GCAGAATTCCAAAAATGGTCTGTTGCTGTACAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGATGTGGACTGTGATCAATGGGTCAAAGCCATGGACCTCAAAATAGAATCTATGTATTCCAATTCTGTCTGGACTCTAGTAGATCAATCAAATGATGTAAAACT
TATTGGTTGTAAATGGATCTACAAGAGAAAACGAGATCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGATTAGTGGGAAAAGGTTATATTGAAAAGGAGGGAGTGAATT
ATAAAGAAACTTTCTCTCCTGTTGTCATGCTGAAGTCGATCAGAATACTCTTATCTATCCCCACTTTTTATGATTATGAAATTTGCCAGATGGATGTCAAGACAGCCTTT
TTGAATGGAAATCTTGAGGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTATAAGCTTCAAAAATCCATATATGGATTAAA
ACAAGCATCTAGATCCTGGAATATAAGGTTTGATACTGCGATCAAATCTTATGGTTTTGAACACAATGTTGATGAACCTTGTGTTTACAAAAGGATCATTAAATCCACTG
TAGCATTCTTAGTTCTAGATGTAGATAACATTCTACTCATTGGGAATGATATAGGTCATCTAACTAATATTAAGAAATGGCTAGCTACGCAATTCCAAATGAAAGATTTG
GAAAATGCTCAATATGTTCTTGGTATCCAAATAGTTTGGAATCGAAAGAACAAAATATTAGACATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGAT
GCAGAATTCCAAAAATGGTCTGTTGCTGTACAGATAA
Protein sequenceShow/hide protein sequence
MNDVDCDQWVKAMDLKIESMYSNSVWTLVDQSNDVKLIGCKWIYKRKRDQAGKVQTFKARLVGKGYIEKEGVNYKETFSPVVMLKSIRILLSIPTFYDYEICQMDVKTAF
LNGNLEESIYMVQPEGFIQKGQEQKVYKLQKSIYGLKQASRSWNIRFDTAIKSYGFEHNVDEPCVYKRIIKSTVAFLVLDVDNILLIGNDIGHLTNIKKWLATQFQMKDL
ENAQYVLGIQIVWNRKNKILDMSQTSYIDKMLSRYKMQNSKNGLLLYR