CuGenDBv2

Cmc08g0221561 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0221561
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr08:9371436..9372239
RNA-Seq Expression: Cmc08g0221561
Synteny: Cmc08g0221561
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0038231.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value: 1.7e-134; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEHKEHLRMVL+TLR N+L++KFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFL LAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

KAA0048687.1 pol protein [Cucumis melo var. makuwa]; e value: 2.3e-134; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFL LAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

KAA0052348.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 1.0e-134; % identity: 89.51
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIR SVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
        I FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        + +DPAKIEA+TSWPRP TVS+VHSFL LAGYYR+F+ENFSRIATPLTQLTRKRA  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

KAA0058399.1 pol protein [Cucumis melo var. makuwa]; e value: 2.3e-134; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFL LAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

TYK01903.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 1.0e-134; % identity: 89.51
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIR SVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
        I FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        + +DPAKIEA+TSWPRP TVS+VHSFL LAGYYR+F+ENFSRIATPLTQLTRKRA  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T9B7 Gag protease polyprotein; e value: 8.4e-135; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEHKEHLRMVL+TLR N+L++KFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFL LAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

A0A5A7TXE4 Reverse transcriptase; e value: 1.1e-134; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFLSLAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+++F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

A0A5A7UAV9 Reverse transcriptase; e value: 5.0e-135; % identity: 89.51
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIR SVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
        I FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        + +DPAKIEA+TSWPRP TVS+VHSFL LAGYYR+F+ENFSRIATPLTQLTRKRA  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

A0A5D3BPI1 Reverse transcriptase; e value: 1.1e-134; % identity: 89.89
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        +S+DPAKIEAVT W RPSTVSEV SFL LAGYYRRF+ENFSRIATPLTQLTRK A  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

A0A5D3BQD5 Reverse transcriptase; e value: 5.0e-135; % identity: 89.51
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK
        MAPAELKELKVQLQELLDKGFIR SVSP G PVLFVKKKDGSMRLCIDYRELNKVTV+N+YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPK

Query:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG
        I FRSRYGHYEFI+MSFGLTNAP VFMDLMNRVFRE LDTFVIVFIDDILIYSKTEAEH+EHLRMVL+TLR N+L+AKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  ITFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAG

Query:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
        + +DPAKIEA+TSWPRP TVS+VHSFL LAGYYR+F+ENFSRIATPLTQLTRKRA  VWSKAC+D+F
Subjt:  ISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF

SwissProt top hits (columns: e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value: 5.4e-54; % identity: 40.23
Query:  KELKVQLQELLDKGFIRPSVSPSGVPVLFV-KKKDGS----MRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKI
        +E++ Q+Q++L++G IR S SP   P+  V KK+D S     R+ IDYR+LN++TV +++P+P +D++  +L     F+ IDL  G+HQ+ +    V K 
Subjt:  KELKVQLQELLDKGFIRPSVSPSGVPVLFV-KKKDGS----MRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKI

Query:  TFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGI
         F +++GHYE++ M FGL NAP  F   MN + R LL+   +V++DDI+++S +  EH + L +V E L    L  +  KCEF  ++ +FLGHV++  GI
Subjt:  TFRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGI

Query:  SMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASL
          +P KIEA+  +P P+   E+ +FL L GYYR+FI NF+ IA P+T+  +K   +
Subjt:  SMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASL

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value: 5.4e-54; % identity: 39.22
Query:  ELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKD-----GSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKIT
        E++ Q+QE+L++G IR S SP   P   V KK         R+ IDYR+LN++T+ ++YP+P +D++  +L     F+ IDL  G+HQ+ + +  + K  
Subjt:  ELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKD-----GSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKIT

Query:  FRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGIS
        F ++ GHYE++ M FGL NAP  F   MN + R LL+   +V++DDI+I+S +  EH   +++V   L    L  +  KCEF  K+ +FLGH+V+  GI 
Subjt:  FRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGIS

Query:  MDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASL
         +P K++A+ S+P P+   E+ +FL L GYYR+FI N++ IA P+T   +KR  +
Subjt:  MDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value: 3.5e-53; % identity: 44.4
Query:  KELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKITFRSR
        +E+   +Q+LLD  FI PS SP   PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  K  F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKITFRSR

Query:  YGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGISMDPA
         G YE+ +M FGL NAP  F   M   FR+L   FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F  ++  FLG+ +    I+    
Subjt:  YGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGISMDPA

Query:  KIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPL
        K  A+  +P P TV +   FL +  YYRRFI N S+IA P+
Subjt:  KIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value: 2.5e-51; % identity: 39.6
Query:  ELKVQLQELLDKGFIRPSVSPSGVPVLFVKKK-----DGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKIT
        E++ Q+ ELL  G IRPS SP   P+  V KK     +   R+ +D++ LN VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+SD+PK  
Subjt:  ELKVQLQELLDKGFIRPSVSPSGVPVLFVKKK-----DGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKIT

Query:  FRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGIS
        F +  G YEF+ + FGL NAP +F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  GI 
Subjt:  FRSRYGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGIS

Query:  MDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTR
         DP K+ A++  P P++V E+  FL +  YYR+FI++++++A PLT LTR
Subjt:  MDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value: 3.5e-53; % identity: 44.4
Query:  KELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKITFRSR
        +E+   +Q+LLD  FI PS SP   PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  K  F + 
Subjt:  KELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKITFRSR

Query:  YGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGISMDPA
         G YE+ +M FGL NAP  F   M   FR+L   FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F  ++  FLG+ +    I+    
Subjt:  YGHYEFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGISMDPA

Query:  KIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPL
        K  A+  +P P TV +   FL +  YYRRFI N S+IA P+
Subjt:  KIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPL

Arabidopsis top hits (columns: e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein; e value: 3.6e-21; % identity: 44.12
Query:  HLRMVLETLRANELFAKFSKCEFWLKQVSFLG--HVVSKAGISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVW
        HL MVL+    ++ +A   KC F   Q+++LG  H++S  G+S DPAK+EA+  WP P   +E+  FL L GYYRRF++N+ +I  PLT+L +K  SL W
Subjt:  HLRMVLETLRANELFAKFSKCEFWLKQVSFLG--HVVSKAGISMDPAKIEAVTSWPRPSTVSEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVW

Query:  SK
        ++
Subjt:  SK


Sequences
CDS sequence
ATGGCCCCAGCAGAGTTGAAAGAGCTGAAGGTGCAGTTACAAGAGTTACTTGATAAAGGCTTCATTCGACCGAGTGTGTCACCTTCGGGTGTACCAGTCTTATTTGTTAA
GAAGAAGGATGGATCGATGCGCCTATGTATTGACTACAGGGAGTTGAATAAGGTCACCGTTCAGAACAAATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AAGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCGGGATATCATCAGCTGAGGATTAAGGATAGCGATGTACCGAAGATAACCTTTCGTTCCAGATATGGACACTAT
GAGTTTATTATAATGTCTTTTGGTTTGACGAATGCTCCGATAGTGTTTATGGATTTGATGAACAGAGTGTTTAGGGAGCTCCTAGACACTTTTGTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAGAGGCCGAGCATAAGGAGCATTTACGTATGGTTTTAGAAACCCTTCGAGCTAATGAACTGTTTGCAAAGTTCTCAAAATGTGAGT
TTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGGTTTCTAAAGCTGGTATTTCGATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCTCGACCTTCCACAGTC
AGTGAGGTTCATAGTTTTCTTAGTTTAGCGGGTTATTATCGACGGTTTATTGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGAGAGCTTCTTT
AGTGTGGAGCAAGGCCTGTAAGGACAATTTTTAG
mRNA sequence
ATGGCCCCAGCAGAGTTGAAAGAGCTGAAGGTGCAGTTACAAGAGTTACTTGATAAAGGCTTCATTCGACCGAGTGTGTCACCTTCGGGTGTACCAGTCTTATTTGTTAA
GAAGAAGGATGGATCGATGCGCCTATGTATTGACTACAGGGAGTTGAATAAGGTCACCGTTCAGAACAAATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AAGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCGGGATATCATCAGCTGAGGATTAAGGATAGCGATGTACCGAAGATAACCTTTCGTTCCAGATATGGACACTAT
GAGTTTATTATAATGTCTTTTGGTTTGACGAATGCTCCGATAGTGTTTATGGATTTGATGAACAGAGTGTTTAGGGAGCTCCTAGACACTTTTGTGATCGTGTTTATTGA
TGATATTTTGATATATTCCAAGACAGAGGCCGAGCATAAGGAGCATTTACGTATGGTTTTAGAAACCCTTCGAGCTAATGAACTGTTTGCAAAGTTCTCAAAATGTGAGT
TTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGGTTTCTAAAGCTGGTATTTCGATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGCCTCGACCTTCCACAGTC
AGTGAGGTTCATAGTTTTCTTAGTTTAGCGGGTTATTATCGACGGTTTATTGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGAGAGCTTCTTT
AGTGTGGAGCAAGGCCTGTAAGGACAATTTTTAG
Protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPSGVPVLFVKKKDGSMRLCIDYRELNKVTVQNKYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDVPKITFRSRYGHY
EFIIMSFGLTNAPIVFMDLMNRVFRELLDTFVIVFIDDILIYSKTEAEHKEHLRMVLETLRANELFAKFSKCEFWLKQVSFLGHVVSKAGISMDPAKIEAVTSWPRPSTV
SEVHSFLSLAGYYRRFIENFSRIATPLTQLTRKRASLVWSKACKDNF
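
The genome location, CDS and protein sequence in this record can be cross-checked against each other. The Python sketch below is only an illustration and is not part of the database: the names `span_length` and `translate` are hypothetical helpers, the codon table is deliberately partial (only the codons that occur in the first 30 nt of the CDS), and the protein length of 267 residues was counted from the sequence above. It assumes the location uses 1-based inclusive coordinates.

```python
import re

# Values copied from the record above.
LOCATION = "CMiso1.1chr08:9371436..9372239"
CDS_START = "ATGGCCCCAGCAGAGTTGAAAGAGCTGAAG"  # first 30 nt of the CDS
PROTEIN_START = "MAPAELKELK"                  # first 10 residues of the protein
PROTEIN_LENGTH = 267                          # residues, counted from the record

# Partial codon table: only the codons present in CDS_START (illustrative).
CODONS = {
    "ATG": "M", "GCC": "A", "CCA": "P", "GCA": "A", "GAG": "E",
    "TTG": "L", "AAA": "K", "CTG": "L", "AAG": "K",
}


def span_length(location: str) -> int:
    """Length of a 'seqid:start..end' span, assuming 1-based inclusive ends."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def translate(nucleotides: str) -> str:
    """Translate complete codons using the partial table above."""
    usable = len(nucleotides) - len(nucleotides) % 3
    return "".join(CODONS[nucleotides[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    length = span_length(LOCATION)
    # The genomic span should cover every codon plus the stop codon.
    print(length, length == 3 * (PROTEIN_LENGTH + 1))
    print(translate(CDS_START) == PROTEIN_START)
```

Under these assumptions the genomic span equals three times the protein length plus one stop codon, which is consistent with the CDS and mRNA sequences being identical in this record.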