; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0108401 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0108401
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:27599277..27600285
RNA-Seq ExpressionCmc04g0108401
SyntenyCmc04g0108401
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032794.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-15688.89Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLRMVSF+GHVVSKAGVS+DPAKIEAV+SWPRPSTVSEVRSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGL GYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLK+KLV      VPDGSGSF+IY DASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

KAA0035816.1 pol protein [Cucumis melo var. makuwa]8.0e-15588.57Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY+ LNKVTVKN+YPLPRIDDLFDQLQGA VFS+IDL+ GYHQ RIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEH+EHLR+VSF+GHVVSKAGVS+DPAKIEAV+ W RPSTVSEVRSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGLTGYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

KAA0036553.1 pol protein [Cucumis melo var. makuwa]2.0e-15387.94Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLP IDDLFDQLQ ATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLR+VSF+GHVVSKAGVS+DPAKIEAV+ W RPSTVSE RSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGL GYY+RF+ENFS IATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

KAA0040695.1 pol protein [Cucumis melo var. makuwa]1.7e-15283.14Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFSEIDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLRM                       VSF+GHVVSKAGVS+
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM

Query:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLVVVP-----DGSGSFVIYSDASKKGLGCV
        DPAKIEAV+ W RPSTVSEVRSFLGL GYY++F+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV  P     DGSGSFVIYSDASKKGLGCV
Subjt:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLVVVP-----DGSGSFVIYSDASKKGLGCV

Query:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI
        LMQQGKVVAYAS QLKSHEQNYPTHDLELA +VFA+KI
Subjt:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI

KAA0048687.1 pol protein [Cucumis melo var. makuwa]5.7e-15383.43Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLRM                       VSF+GHVVSKAGVS+
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM

Query:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV
        DPAKIEAV+ W RPSTVSEVRSFLGL GYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCV
Subjt:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV

Query:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI
        LMQQGKVVAYAS QLKSHEQNYPTHDLELA +VFA+KI
Subjt:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI

TrEMBL top hitse value%identityAlignment
A0A5A7T0S7 Reverse transcriptase3.9e-15588.57Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY+ LNKVTVKN+YPLPRIDDLFDQLQGA VFS+IDL+ GYHQ RIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEH+EHLR+VSF+GHVVSKAGVS+DPAKIEAV+ W RPSTVSEVRSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGLTGYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

A0A5A7T0Y9 Reverse transcriptase9.5e-15487.94Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLP IDDLFDQLQ ATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLR+VSF+GHVVSKAGVS+DPAKIEAV+ W RPSTVSE RSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGL GYY+RF+ENFS IATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

A0A5A7T190 Reverse transcriptase8.1e-15383.14Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+ EHEEHLRM                       VSF+GHVVSKAGVS+
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM

Query:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV
        DPAKIEAV+ W RPSTVSEVRSFLGL GYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCV
Subjt:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV

Query:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI
        LMQQGKVVAYAS QLKSHEQNYPTHDLELA +VFA+KI
Subjt:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI

A0A5A7U330 Reverse transcriptase2.8e-15383.43Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM
        RS YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLRM                       VSF+GHVVSKAGVS+
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVSM

Query:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV
        DPAKIEAV+ W RPSTVSEVRSFLGL GYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLKQKLV      VPDGSGSFVIYSDASKKGLGCV
Subjt:  DPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCV

Query:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI
        LMQQGKVVAYAS QLKSHEQNYPTHDLELA +VFA+KI
Subjt:  LMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI

A0A5D3E456 Reverse transcriptase9.2e-15788.89Show/hide
Query:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF
        AEL+ELKVQLQELLDKGF RPSVSPW APVLFVKKKDGSMRL IDY++LNKVTVKN+YPLPRIDDLFDQLQGATVFS+IDL+ GYHQLRIKD DV KTAF
Subjt:  AELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAF

Query:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF
        RS YGHYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKT+AEHEEHLRMVSF+GHVVSKAGVS+DPAKIEAV+SWPRPSTVSEVRSF
Subjt:  RSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSF

Query:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP
        LGL GYY+RF+ENFSRIATPLTQLTRKGAPFVWSKACE SFQNLK+KLV      VPDGSGSF+IY DASKKGLGCVLMQQGKVVAYAS QLKSHEQNYP
Subjt:  LGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLV-----VVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYP

Query:  THDLELATIVFAMKI
        THDLELA +VFA+KI
Subjt:  THDLELATIVFAMKI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.67.0e-6137.06Show/hide
Query:  EELKVQLQELLDKGFSRPSVSPWAAPVLFV-KKKDGS----MRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKT
        +E++ Q+Q++L++G  R S SP+ +P+  V KK+D S     R+ IDY+KLN++TV +++P+P +D++  +L     F+ IDL  G+HQ+ +    VSKT
Subjt:  EELKVQLQELLDKGFSRPSVSPWAAPVLFV-KKKDGS----MRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKT

Query:  AFRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHVVSKAGV
        AF + +GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V                       +F+GHV++  G+
Subjt:  AFRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHVVSKAGV

Query:  SMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPF-VWSKACEGSFQNLK-----QKLVVVPDGSGSFVIYSDASKKGL
          +P KIEA+  +P P+   E+++FLGLTGYY++F+ NF+ IA P+T+  +K       +   + +F+ LK       ++ VPD +  F + +DAS   L
Subjt:  SMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPF-VWSKACEGSFQNLK-----QKLVVVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMK
        G VL Q G  ++Y S  L  HE NY T + EL  IV+A K
Subjt:  GCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy5.4e-5333.53Show/hide
Query:  QLQELLDKGFSRPSVSPWAAPVLFVKKK------DGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAFRS
        ++++LL  G  RPS SP+ +P   V KK      + + RL ID++KLN+ T+ ++YP+P I  +   L  A  F+ +DL+ GYHQ+ + ++D  KT+F  
Subjt:  QLQELLDKGFSRPSVSPWAAPVLFVKKK------DGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAFRS

Query:  GYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHL-----------------------RMVSFIGHVVSKAGVSMDP
          G YEF  + FGL NA ++F   ++ V RE +     V++DD++I+S+ +++H  H+                         V ++G +VSK G   DP
Subjt:  GYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHL-----------------------RMVSFIGHVVSKAGVSMDP

Query:  AKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTR-----------KGAPFVWSKACEGSFQNLKQKL------VVVPDGSGSFVIYS
         K++A+  +P P  V +VRSFLGL  YY+ F+++F+ IA P+T + +           K  P  +++    +FQ L+  L      +  PD    F + +
Subjt:  AKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTR-----------KGAPFVWSKACEGSFQNLKQKL------VVVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAM
        DAS  G+G VL Q+G+ +   S  LK  EQNY T++ EL  IV+A+
Subjt:  DASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAM

P20825 Retrovirus-related Pol polyprotein from transposon 2971.3e-5936.13Show/hide
Query:  LAELEELKV--QLQELLDKGFSRPSVSPWAAPVLFVKKKD-----GSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKD
        LA+  E++V  Q+QE+L++G  R S SP+ +P   V KK         R+ IDY+KLN++T+ ++YP+P +D++  +L     F+ IDL  G+HQ+ + +
Subjt:  LAELEELKV--QLQELLDKGFSRPSVSPWAAPVLFVKKKD-----GSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKD

Query:  NDVSKTAFRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHV
          +SKTAF +  GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V                       +F+GH+
Subjt:  NDVSKTAFRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHV

Query:  VSKAGVSMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSK-ACEGSFQNLK-----QKLVVVPDGSGSFVIYSD
        V+  G+  +P K++A+ S+P P+   E+R+FLGLTGYY++F+ N++ IA P+T   +K       K     +F+ LK       ++ +PD    FV+ +D
Subjt:  VSKAGVSMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSK-ACEGSFQNLK-----QKLVVVPDGSGSFVIYSD

Query:  ASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMK
        AS   LG VL Q G  +++ S  L  HE NY   + EL  IV+A K
Subjt:  ASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.1e-5336.28Show/hide
Query:  EELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAFRSG
        +E+   +Q+LLD  F  PS SP ++PV+ V KKDG+ RL +DY+ LNK T+ + +PLPRID+L  ++  A +F+ +DL  GYHQ+ ++  D  KTAF + 
Subjt:  EELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAFRSG

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHVVSKAGVSMDPA
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  V                        F+G+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMV-----------------------SFIGHVVSKAGVSMDPA

Query:  KIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKL----VVVP-DGSGSFVIYSDASKKGLGCVLMQ
        K  A+  +P P TV + + FLG+  YY+RF+ N S+IA P+       +   W++  + + + LK  L    V+VP +   ++ + +DASK G+G VL +
Subjt:  KIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKL----VVVP-DGSGSFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYASCQLKSHEQNYPTHDLELATIVFAM
                 VV Y S  L+S ++NYP  +LEL  I+ A+
Subjt:  QGK------VVAYASCQLKSHEQNYPTHDLELATIVFAM

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus8.3e-5432.67Show/hide
Query:  ELKVQLQELLDKGFSRPSVSPWAAPVLFVKKK-----DGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTA
        E++ Q+ ELL  G  RPS SP+ +P+  V KK     +   R+ +D+K+LN VT+ + YP+P I+     L  A  F+ +DL  G+HQ+ +K++D+ KTA
Subjt:  ELKVQLQELLDKGFSRPSVSPWAAPVLFVKKK-----DGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTA

Query:  FRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVS
        F +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI+++S+    H ++LR+                       V F+G++V+  G+ 
Subjt:  FRSGYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRM-----------------------VSFIGHVVSKAGVS

Query:  MDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTR-----------KGAPFVWSKACEGSFQNLK-----QKLVVVPDGSGSFVI
         DP K+ A+S  P P++V E++ FLG+T YY++F+++++++A PLT LTR              P    +    SF +LK      +++  P  +  F +
Subjt:  MDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTR-----------KGAPFVWSKACEGSFQNLK-----QKLVVVPDGSGSFVI

Query:  YSDASKKGLGCVLMQ----QGKVVAYASCQLKSHEQNYPTHDLELATIVFAM
         +DAS   +G VL Q    + + +AY S  L   E+NY T + E+  I++++
Subjt:  YSDASKKGLGCVLMQ----QGKVVAYASCQLKSHEQNYPTHDLELATIVFAM

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.5e-1842.39Show/hide
Query:  VSFIG--HVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLVVVP
        ++++G  H++S  GVS DPAK+EA+  WP P   +E+R FLGLTGYY+RF++N+ +I  PLT+L +K +   W++    +F+ LK  +  +P
Subjt:  VSFIG--HVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRIATPLTQLTRKGAPFVWSKACEGSFQNLKQKLVVVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTAGCAGAGTTGGAAGAGCTGAAGGTGCAGTTACAAGAGTTGCTTGATAAAGGCTTCAGTCGGCCGAGTGTGTCACCTTGGGCTGCACCAGTCTTATTTGTTAA
GAAGAAGGATGGATCGATGCGCCTATATATTGACTACAAGAAGTTGAATAAGGTCACCGTTAAGAACAAATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTAC
AAGGAGCTACAGTGTTCTCTGAGATCGACCTTCAGTTGGGATATCATCAACTGAGGATTAAGGATAACGATGTATCGAAGACAGCCTTTCGTTCCGGATATGGGCACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTCATGGATTTGATGAACAGAGTGTTTAGAGAATTCCTAGATACTTTTGTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAAAGGCCGAGCATGAGGAGCATTTACGTATGGTATCCTTTATAGGCCATGTGGTTTCTAAAGCTGGTGTTTCAATGGATCCAGCTA
AGATAGAGGCAGTCAGCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAACAGGTTATTATCAACGGTTTATGGAGAACTTTTCCCGTATA
GCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTCGTGTGGAGCAAGGCCTGTGAGGGCAGCTTTCAGAACCTTAAACAGAAGCTAGTTGTTGTACCTGATGG
TTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAAGGTTTAGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCGCTTACGCTTCTTGTCAGTTGAAGAGTCATG
AGCAAAATTACCCTACACACGATTTAGAGTTGGCAACAATAGTTTTTGCAATGAAGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGACCTAGCAGAGTTGGAAGAGCTGAAGGTGCAGTTACAAGAGTTGCTTGATAAAGGCTTCAGTCGGCCGAGTGTGTCACCTTGGGCTGCACCAGTCTTATTTGTTAA
GAAGAAGGATGGATCGATGCGCCTATATATTGACTACAAGAAGTTGAATAAGGTCACCGTTAAGAACAAATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTAC
AAGGAGCTACAGTGTTCTCTGAGATCGACCTTCAGTTGGGATATCATCAACTGAGGATTAAGGATAACGATGTATCGAAGACAGCCTTTCGTTCCGGATATGGGCACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTCATGGATTTGATGAACAGAGTGTTTAGAGAATTCCTAGATACTTTTGTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAAAGGCCGAGCATGAGGAGCATTTACGTATGGTATCCTTTATAGGCCATGTGGTTTCTAAAGCTGGTGTTTCAATGGATCCAGCTA
AGATAGAGGCAGTCAGCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAACAGGTTATTATCAACGGTTTATGGAGAACTTTTCCCGTATA
GCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTCGTGTGGAGCAAGGCCTGTGAGGGCAGCTTTCAGAACCTTAAACAGAAGCTAGTTGTTGTACCTGATGG
TTCAGGGAGTTTTGTGATTTACAGTGATGCTTCTAAGAAAGGTTTAGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCGCTTACGCTTCTTGTCAGTTGAAGAGTCATG
AGCAAAATTACCCTACACACGATTTAGAGTTGGCAACAATAGTTTTTGCAATGAAGATATAG
Protein sequenceShow/hide protein sequence
MDLAELEELKVQLQELLDKGFSRPSVSPWAAPVLFVKKKDGSMRLYIDYKKLNKVTVKNKYPLPRIDDLFDQLQGATVFSEIDLQLGYHQLRIKDNDVSKTAFRSGYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTKAEHEEHLRMVSFIGHVVSKAGVSMDPAKIEAVSSWPRPSTVSEVRSFLGLTGYYQRFMENFSRI
ATPLTQLTRKGAPFVWSKACEGSFQNLKQKLVVVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELATIVFAMKI