CuGenDBv2

Cmc04g0095521 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0095521
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr04:9131553..9132356
RNA-Seq Expression: Cmc04g0095521
Synteny: Cmc04g0095521
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
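
As a quick sanity check on the genome location listed above, the following minimal sketch parses the location string and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page. The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("CMiso1.1chr04:9131553..9132356")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, span)  # CMiso1.1chr04 804
```

Under that assumption the gene spans 804 bp on CMiso1.1chr04.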


Homology
GenBank top hits | e value | % identity
CAA7040802.1 unnamed protein product [Microthlaspi erraticum] | 9.7e-93 | 60.46
Query:  EDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRI
        E+AT  E++LE +++   K     + P DLPP+ DIQH+I+L+PG+SLP+L HYRMSP E +IL + IE+LLKKG I++  SPCAVP LL PKK   W++
Subjt:  EDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRI

Query:  CVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVY
        CVDSRAINKIT+KYRF I R+ D+LD+  G+ +FSK DLRSGYHQIRIR GD+WKT FK+ +GL+EWLVMPFG+SNAPSTFMRLMN++L PF   F++VY
Subjt:  CVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVY

Query:  FDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        FDDIL++SKT ++HL H+ Q+ QVL  N+LYVNLKKC FC+N++ FLGF++ ++ + +DE KV
Subjt:  FDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | 1.2e-135 | 90.26
Query:  MSGTEDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDG
        MSGTED T+DEQI EAIKELFKKYPKISKEPT LPPL DI HNIELL GAS PHL HY MSPNEYKILHD IEELLKKGHIK SFS C VPALLTPKKDG
Subjt:  MSGTEDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDG

Query:  TWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKF
        TWR+CVDSRAINKITVKYRF I RVSDLLDQ GGACIFSK DLRS YHQIRIR GD+WKT FKTNEGLFEWLVMPF +SNAPSTFMRLMNKVLHPFLNKF
Subjt:  TWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKF

Query:  IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDE KV
Subjt:  IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua] | 3.0e-94 | 62.45
Query:  MSGTEDATKDEQILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKK
        + G ED   D  I E IK + +++ K+  +  P  LPPL +IQH I+L+PGASLP+L HYRMSP E  IL + +EELL+KGHI++S SPCAVPALLTPKK
Subjt:  MSGTEDATKDEQILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKK

Query:  DGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLN
        DG+WR+CVDSRAINKITV+YRF I R+ DLLDQ  GA +FSK DLRSGYHQIRI+ GD+WKT FKT +GL+EWLVMPFG+SNAPSTFMRLM +VL PF+ 
Subjt:  DGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLN

Query:  KFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        KF++VYFDDILV+S+T  +HL H+ ++ + L  NEL+VNLKKC F +N++ FLG+I+  D + +DE+KV
Subjt:  KFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

XP_020877199.1 uncharacterized protein LOC110227443 [Arabidopsis lyrata subsp. lyrata] | 1.7e-92 | 59.63
Query:  MSGTEDATKDE-QILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPK
        + G   A K+E  I E + E+ +++ ++  +  P DLPP+ DIQH I+L+PG+SLP+L HYRMSP E +IL + IE+LLKKG I++S SPCAVP LL PK
Subjt:  MSGTEDATKDE-QILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPK

Query:  KDGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFL
        K   WR+CVDSRAINKIT+KYRF I R+ D+LD+  G+ +FSK DLRSGYHQIRIR GD+WKT FK+ +GL+EWLVMPFG+SNAPSTFMRLMN+VL PF 
Subjt:  KDGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFL

Query:  NKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
          F++VYFDDIL++SKT ++HL+H+ Q+ QVL  N+L+VNLKKC+F +N++ FLGF++ +D + +DE+KV
Subjt:  NKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

XP_040245606.1 uncharacterized protein LOC109732219 isoform X1 [Aegilops tauschii subsp. strangulata] | 2.5e-93 | 62.89
Query:  QILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAI
        +I   ++ L  ++  I  EP  LPP+  IQH I+L+PGASLP+L HYRMSP E+ IL + +EELL+KGHI++S SPCAVPALL PKKDG+WR+C DSRA+
Subjt:  QILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAI

Query:  NKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVF
        NKITV+YRF I R+ D+LDQ  GA +F+K DLRSGYHQIRIR GD+WKT FKT EGLFEWLVMPFG+SNAPSTFM LMN+VL PFL+ F++VYFDDIL++
Subjt:  NKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVF

Query:  SKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        SK  D+H  HI ++ +VL  NELYVNLKKC+F   ++ FLGF+I  D + +D++KV
Subjt:  SKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

TrEMBL top hits | e value | % identity
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein | 1.5e-94 | 62.45
Query:  MSGTEDATKDEQILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKK
        + G ED   D  I E IK + +++ K+  +  P  LPPL +IQH I+L+PGASLP+L HYRMSP E  IL + +EELL+KGHI++S SPCAVPALLTPKK
Subjt:  MSGTEDATKDEQILEAIKELFKKYPKISKE--PTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKK

Query:  DGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLN
        DG+WR+CVDSRAINKITV+YRF I R+ DLLDQ  GA +FSK DLRSGYHQIRI+ GD+WKT FKT +GL+EWLVMPFG+SNAPSTFMRLM +VL PF+ 
Subjt:  DGTWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLN

Query:  KFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        KF++VYFDDILV+S+T  +HL H+ ++ + L  NEL+VNLKKC F +N++ FLG+I+  D + +DE+KV
Subjt:  KFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

A0A5D3DGR0 Reverse transcriptase | 5.8e-136 | 90.26
Query:  MSGTEDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDG
        MSGTED T+DEQI EAIKELFKKYPKISKEPT LPPL DI HNIELL GAS PHL HY MSPNEYKILHD IEELLKKGHIK SFS C VPALLTPKKDG
Subjt:  MSGTEDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDG

Query:  TWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKF
        TWR+CVDSRAINKITVKYRF I RVSDLLDQ GGACIFSK DLRS YHQIRIR GD+WKT FKTNEGLFEWLVMPF +SNAPSTFMRLMNKVLHPFLNKF
Subjt:  TWRICVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKF

Query:  IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDE KV
Subjt:  IIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

A0A6D2HLB5 Reverse transcriptase | 1.0e-92 | 61.07
Query:  DATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRIC
        +AT  E++LE +++   K     + P  LPP+ DIQH+I+L+PG+SLP+L HYRMSP E +IL + IE+LLKKG I++S SPCAVP LL PKK   WR+C
Subjt:  DATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRIC

Query:  VDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYF
        VDSRAINKIT+KYRF I R+ D+LD+  G+ IFSK DLRSGYHQIRIR GD+WKT FK+ +GL+EWLVMPFG+SNAPSTFMRLMN++L PF   F++VYF
Subjt:  VDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYF

Query:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        DDIL++SKT ++HL H+ Q+ QVL  N+LYVNLKKC FC+N++ FLGF++ ++ + +DE KV
Subjt:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

A0A6D2IKM3 Reverse transcriptase | 1.0e-92 | 61.07
Query:  DATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRIC
        +AT  E++LE +++   K     + P  LPP+ DIQH+I+L+PG+SLP+L HYRMSP E +IL + IE+LLKKG I++S SPCAVP LL PKK   WR+C
Subjt:  DATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRIC

Query:  VDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYF
        VDSRAINKIT+KYRF I R+ D+LD+  G+ IFSK DLRSGYHQIRIR GD+WKT FK+ +GL+EWLVMPFG+SNAPSTFMRLMN++L PF   F++VYF
Subjt:  VDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYF

Query:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        DDIL++SKT ++HL H+ Q+ QVL  N+LYVNLKKC FC+N++ FLGF++ ++ + +DE KV
Subjt:  DDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

A0A6D2JW30 Reverse transcriptase domain-containing protein | 4.7e-93 | 60.46
Query:  EDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRI
        E+AT  E++LE +++   K     + P DLPP+ DIQH+I+L+PG+SLP+L HYRMSP E +IL + IE+LLKKG I++  SPCAVP LL PKK   W++
Subjt:  EDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRI

Query:  CVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVY
        CVDSRAINKIT+KYRF I R+ D+LD+  G+ +FSK DLRSGYHQIRIR GD+WKT FK+ +GL+EWLVMPFG+SNAPSTFMRLMN++L PF   F++VY
Subjt:  CVDSRAINKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVY

Query:  FDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
        FDDIL++SKT ++HL H+ Q+ QVL  N+LYVNLKKC FC+N++ FLGF++ ++ + +DE KV
Subjt:  FDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV

SwissProt top hits | e value | % identity
P0CT34 Transposon Tf2-1 polyprotein | 1.2e-37 | 33.06
Query:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK
        + +++K++  I+ E     LP P+  ++  +EL        + +Y + P + + ++D I + LK G I++S +  A P +  PKK+GT R+ VD + +NK
Subjt:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK

Query:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
              + +  +  LL +  G+ IF+K DL+S YH IR+R GD+ K  F+   G+FE+LVMP+G+S AP+ F   +N +L       ++ Y DDIL+ SK
Subjt:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII
        +  +H++H+  + Q L +  L +N  KC F  +++ F+G+ I
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII

P0CT35 Transposon Tf2-2 polyprotein | 1.2e-37 | 33.06
Query:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK
        + +++K++  I+ E     LP P+  ++  +EL        + +Y + P + + ++D I + LK G I++S +  A P +  PKK+GT R+ VD + +NK
Subjt:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK

Query:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
              + +  +  LL +  G+ IF+K DL+S YH IR+R GD+ K  F+   G+FE+LVMP+G+S AP+ F   +N +L       ++ Y DDIL+ SK
Subjt:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII
        +  +H++H+  + Q L +  L +N  KC F  +++ F+G+ I
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII

P0CT41 Transposon Tf2-12 polyprotein | 1.2e-37 | 33.06
Query:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK
        + +++K++  I+ E     LP P+  ++  +EL        + +Y + P + + ++D I + LK G I++S +  A P +  PKK+GT R+ VD + +NK
Subjt:  IKELFKKYPKISKEPT--DLP-PLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK

Query:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
              + +  +  LL +  G+ IF+K DL+S YH IR+R GD+ K  F+   G+FE+LVMP+G+S AP+ F   +N +L       ++ Y DDIL+ SK
Subjt:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII
        +  +H++H+  + Q L +  L +N  KC F  +++ F+G+ I
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 3.4e-48 | 42.15
Query:  LFKKYPKISKEPTDLPPLH------DIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK
        L +KY +I +   DLPP         ++H+IE+ PGA LP L  Y ++    + ++  +++LL    I  S SPC+ P +L PKKDGT+R+CVD R +NK
Subjt:  LFKKYPKISKEPTDLPPLH------DIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK

Query:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
         T+   F + R+ +LL + G A IF+  DL SGYHQI +   D++KT F T  G +E+ VMPFG+ NAPSTF R M         +F+ VY DDIL+FS+
Subjt:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII
        + ++H +H+D + + L +  L V  KKC F S E  FLG+ I
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 3.4e-48 | 42.15
Query:  LFKKYPKISKEPTDLPPLH------DIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK
        L +KY +I +   DLPP         ++H+IE+ PGA LP L  Y ++    + ++  +++LL    I  S SPC+ P +L PKKDGT+R+CVD R +NK
Subjt:  LFKKYPKISKEPTDLPPLH------DIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRAINK

Query:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK
         T+   F + R+ +LL + G A IF+  DL SGYHQI +   D++KT F T  G +E+ VMPFG+ NAPSTF R M         +F+ VY DDIL+FS+
Subjt:  ITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSK

Query:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII
        + ++H +H+D + + L +  L V  KKC F S E  FLG+ I
Subjt:  TYDQHLQHIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFII

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTCGGGGACTGAAGATGCAACAAAAGATGAACAAATCCTTGAAGCCATTAAAGAATTGTTCAAGAAGTATCCTAAAATCAGTAAAGAACCTACAGACCTTCCTCCTTT
GCATGATATACAACACAATATTGAATTACTTCCAGGTGCCTCTCTTCCTCACTTATCTCACTATCGCATGAGCCCAAATGAATATAAAATTTTACATGATACTATTGAAG
AATTACTAAAAAAGGGGCACATTAAGCAAAGCTTTAGCCCTTGTGCAGTACCAGCACTACTAACACCTAAAAAAGATGGAACTTGGAGGATATGTGTGGATAGTAGAGCT
ATCAATAAAATCACAGTAAAATATAGATTTTCAATCCTCAGGGTCAGTGATTTGTTAGATCAATTCGGAGGTGCTTGTATCTTTTCGAAGTTTGATCTAAGGAGTGGCTA
TCATCAAATACGTATCAGACTTGGAGATAAATGGAAAACAACCTTCAAGACCAATGAAGGACTTTTTGAGTGGCTTGTAATGCCATTTGGCGTTTCTAATGCTCCTAGTA
CTTTCATGAGGTTGATGAACAAGGTACTGCATCCTTTCTTAAACAAGTTCATTATAGTTTACTTTGATGACATTCTTGTCTTTAGCAAAACCTATGATCAACACCTCCAA
CACATTGACCAGCTGTTCCAAGTACTTAATCATAATGAACTTTATGTAAATCTCAAGAAGTGCATTTTCTGCTCTAATGAAATAGCCTTCTTGGGGTTCATAATCAGAAA
AGATCATGTTCTAATGGATGAGAATAAGGTATAA
mRNA sequence
ATGTCGGGGACTGAAGATGCAACAAAAGATGAACAAATCCTTGAAGCCATTAAAGAATTGTTCAAGAAGTATCCTAAAATCAGTAAAGAACCTACAGACCTTCCTCCTTT
GCATGATATACAACACAATATTGAATTACTTCCAGGTGCCTCTCTTCCTCACTTATCTCACTATCGCATGAGCCCAAATGAATATAAAATTTTACATGATACTATTGAAG
AATTACTAAAAAAGGGGCACATTAAGCAAAGCTTTAGCCCTTGTGCAGTACCAGCACTACTAACACCTAAAAAAGATGGAACTTGGAGGATATGTGTGGATAGTAGAGCT
ATCAATAAAATCACAGTAAAATATAGATTTTCAATCCTCAGGGTCAGTGATTTGTTAGATCAATTCGGAGGTGCTTGTATCTTTTCGAAGTTTGATCTAAGGAGTGGCTA
TCATCAAATACGTATCAGACTTGGAGATAAATGGAAAACAACCTTCAAGACCAATGAAGGACTTTTTGAGTGGCTTGTAATGCCATTTGGCGTTTCTAATGCTCCTAGTA
CTTTCATGAGGTTGATGAACAAGGTACTGCATCCTTTCTTAAACAAGTTCATTATAGTTTACTTTGATGACATTCTTGTCTTTAGCAAAACCTATGATCAACACCTCCAA
CACATTGACCAGCTGTTCCAAGTACTTAATCATAATGAACTTTATGTAAATCTCAAGAAGTGCATTTTCTGCTCTAATGAAATAGCCTTCTTGGGGTTCATAATCAGAAA
AGATCATGTTCTAATGGATGAGAATAAGGTATAA
Protein sequence
MSGTEDATKDEQILEAIKELFKKYPKISKEPTDLPPLHDIQHNIELLPGASLPHLSHYRMSPNEYKILHDTIEELLKKGHIKQSFSPCAVPALLTPKKDGTWRICVDSRA
INKITVKYRFSILRVSDLLDQFGGACIFSKFDLRSGYHQIRIRLGDKWKTTFKTNEGLFEWLVMPFGVSNAPSTFMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQ
HIDQLFQVLNHNELYVNLKKCIFCSNEIAFLGFIIRKDHVLMDENKV
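
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The sketch below is a minimal illustration that assumes the standard nuclear genetic code and a CDS that begins in frame at its first base. The function name `translate` is hypothetical. Only the first 30 and last 18 nucleotides of the CDS are used here to keep the example short; the full sequence can be substituted.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 18 nt of the CDS shown above.
cds_start = "ATGTCGGGGACTGAAGATGCAACAAAAGAT"
cds_end = "GATGAGAATAAGGTATAA"

print(translate(cds_start))  # MSGTEDATKD
print(translate(cds_end))    # DENKV*
```

Both fragments agree with the start (MSGTEDATKD) and end (DENKV) of the listed protein sequence, and the trailing `*` corresponds to the TAA stop codon.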