; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0061771 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0061771
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRibonuclease H
Genome locationCMiso1.1chr03:1977715..1978614
RNA-Seq ExpressionCmc03g0061771
SyntenyCmc03g0061771
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK28756.1 uncharacterized protein E5676_scaffold403G001090 [Cucumis melo var. makuwa]3.3e-15892.43Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEYKDIFAW YKEMPGLDPKV VHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIR+ALSDEEMTAFRTPKGIYCYKVMPFGLKN GATYQRAMQKVFDDMLHKYVE YV DLVVKSK
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFG+TSGKFL FIVRHRGIEIDQSKID IQKMPR KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENF+
Subjt:  ENFV

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]1.9e-15892.76Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEY+DIFAW YKEMPGLDPKVAVHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLH+YVE YV DLVVK+K
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIEIDQSKID IQKM R KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENFV
Subjt:  ENFV

XP_031737039.1 uncharacterized protein LOC116402129 [Cucumis sativus]1.9e-15892.76Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEY+DIFAW YKEMPGLDPKVAVHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLH+YVE YV DLVVK+K
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIEIDQSKID IQKM R KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENFV
Subjt:  ENFV

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]5.6e-15892.43Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEY+DIFAW YKEMPGLDPKVAVHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKN GATYQRAMQKVFDDMLH+YVE YV DLVVK+K
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIEIDQSKID IQKM R KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENFV
Subjt:  ENFV

XP_031742390.1 uncharacterized protein LOC116401672 [Cucumis sativus]5.6e-15892.43Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEY+DIFAW YKEMPGLDPKVAVHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKN GATYQRAMQKVFDDMLH+YVE YV DLVVK+K
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIEIDQSKID IQKM R KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENFV
Subjt:  ENFV

TrEMBL top hitse value%identityAlignment
A0A5A7SPV8 Ribonuclease H1.8e-15491.97Show/hide
Query:  EYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC---
        EYKDIFAW YKEMPGLDPKVAVHHL IKP YR IKQ QRRFR ELIPQI+VEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC   
Subjt:  EYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC---

Query:  --PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNH
          PLPITEIMVDATTGHE LSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKV+PFGLKN GATYQRAMQKVFDDMLHKYVE YV DLVVKSKR+Q+H
Subjt:  --PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNH

Query:  LKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKGENFV
        LKDLKVVFDRLRKYQLRMNPLKCAF VTSGKFL FIVRHRGIEIDQSKID IQKMPR KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKGENFV
Subjt:  LKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKGENFV

A0A5A7TZU9 Ribonuclease H2.5e-14383.5Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        ++LL  YKD+FAW YKEMPGLDPKVAVH L IKPE+RP+KQ QRRFR ELI QIE EVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPI EIM+DAT GHEALSFMDGSSGYNQIRMAL DEE TAFRTPKGIYCYKVMPFGLKN GATYQRAMQ++FDDMLHK+VE YV DLVVKSK
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        ++ +HLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIE+D SKID IQKMP  K+LH+LR LQGRLAYI+RFISNLAGRCQPFQ+LMRK 
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENF
          F
Subjt:  ENF

A0A5D3CXS1 Uncharacterized protein2.5e-14383.5Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        ++LL  YKD+FAW YKEMPGLDPKVAVH L IKPE+RP+KQ QRRFR ELI QIE EVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPI EIM+DAT GHEALSFMDGSSGYNQIRMAL DEE TAFRTPKGIYCYKVMPFGLKN GATYQRAMQ++FDDMLHK+VE YV DLVVKSK
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        ++ +HLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIE+D SKID IQKMP  K+LH+LR LQGRLAYI+RFISNLAGRCQPFQ+LMRK 
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENF
          F
Subjt:  ENF

A0A5D3D1E5 Ribonuclease H2.5e-14383.5Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        ++LL  YKD+FAW YKEMPGLDPKVAVH L IKPE+RP+KQ QRRFR ELI QIE EVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPI EIM+DAT GHEALSFMDGSSGYNQIRMAL DEE TAFRTPKGIYCYKVMPFGLKN GATYQRAMQ++FDDMLHK+VE YV DLVVKSK
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        ++ +HLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL FIVRHRGIE+D SKID IQKMP  K+LH+LR LQGRLAYI+RFISNLAGRCQPFQ+LMRK 
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENF
          F
Subjt:  ENF

A0A5D3DYG9 Reverse transcriptase domain-containing protein1.6e-15892.43Show/hide
Query:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
        MSLLTEYKDIFAW YKEMPGLDPKV VHHL IKP YRPIKQ QRRFR ELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN
Subjt:  MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNN

Query:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK
        AC     PLPITEIMVDATTGHEALSFMDGSSGYNQIR+ALSDEEMTAFRTPKGIYCYKVMPFGLKN GATYQRAMQKVFDDMLHKYVE YV DLVVKSK
Subjt:  AC-----PLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSK

Query:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG
        R+Q+HLKDLKVVFDRLRKYQLRMNPLKCAFG+TSGKFL FIVRHRGIEIDQSKID IQKMPR KSLHDLRSLQGRLAYI+RFISNLAGRCQPFQKLMRKG
Subjt:  RQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKG

Query:  ENFV
        ENF+
Subjt:  ENFV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.7e-3030.49Show/hide
Query:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK
        E+N+ +++G IRE K      N  PV    KK G LR+ VD++ LN     N  PLP+ E ++    G    + +D  S Y+ IR+   DE   AFR P+
Subjt:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  +  +  +    +V  Y+ D+++ SK +  H+K +K V  +L+   L +N  KC F  +  KF+ + +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK

Query:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK
        ID + +  + K+  +LR   G + Y+++FI   +    P   L++K
Subjt:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK

P0CT35 Transposon Tf2-2 polyprotein2.7e-3030.49Show/hide
Query:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK
        E+N+ +++G IRE K      N  PV    KK G LR+ VD++ LN     N  PLP+ E ++    G    + +D  S Y+ IR+   DE   AFR P+
Subjt:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  +  +  +    +V  Y+ D+++ SK +  H+K +K V  +L+   L +N  KC F  +  KF+ + +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK

Query:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK
        ID + +  + K+  +LR   G + Y+++FI   +    P   L++K
Subjt:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK

P0CT41 Transposon Tf2-12 polyprotein2.7e-3030.49Show/hide
Query:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK
        E+N+ +++G IRE K      N  PV    KK G LR+ VD++ LN     N  PLP+ E ++    G    + +D  S Y+ IR+   DE   AFR P+
Subjt:  EVNKLIEAGFIREVKYPTWIANIVPVR---KKNGQLRVCVDFRDLN-----NACPLPITEIMVDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPK

Query:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK
        G++ Y VMP+G+    A +Q  +  +  +    +V  Y+ D+++ SK +  H+K +K V  +L+   L +N  KC F  +  KF+ + +  +G    Q  
Subjt:  GIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLDFIVRHRGIEIDQSK

Query:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK
        ID + +  + K+  +LR   G + Y+++FI   +    P   L++K
Subjt:  IDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-3033.33Show/hide
Query:  HHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC-----PLPITEIMVDATTGHEALSF
        H +EIKP  R  +        +   +I   V KL++  FI   K P   + +V V KK+G  R+CVD+R LN A      PLP  + ++      +  + 
Subjt:  HHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC-----PLPITEIMVDATTGHEALSF

Query:  MDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLK
        +D  SGY+QI M   D   TAF TP G Y Y VMPFGL N  +T+ R M   F D+  ++V  Y+ D+++ S+  + H K L  V +RL+   L +   K
Subjt:  MDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLK

Query:  CAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQ
        C F     +FL + +  + I   Q K   I+  P  K++   +   G + Y +RFI N +   QP Q
Subjt:  CAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-3033.33Show/hide
Query:  HHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC-----PLPITEIMVDATTGHEALSF
        H +EIKP  R  +        +   +I   V KL++  FI   K P   + +V V KK+G  R+CVD+R LN A      PLP  + ++      +  + 
Subjt:  HHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNAC-----PLPITEIMVDATTGHEALSF

Query:  MDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLK
        +D  SGY+QI M   D   TAF TP G Y Y VMPFGL N  +T+ R M   F D+  ++V  Y+ D+++ S+  + H K L  V +RL+   L +   K
Subjt:  MDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNPLK

Query:  CAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQ
        C F     +FL + +  + I   Q K   I+  P  K++   +   G + Y +RFI N +   QP Q
Subjt:  CAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTGCTTACAGAGTATAAGGACATTTTTGCTTGGCCGTATAAGGAGATGCCAGGACTTGATCCAAAGGTAGCAGTCCATCATCTCGAAATTAAACCAGAGTATCG
ACCGATTAAGCAAGTACAACGACGTTTTCGATCAGAGCTTATTCCCCAGATCGAGGTTGAAGTCAACAAGTTGATTGAAGCAGGATTCATTCGCGAAGTCAAATATCCCA
CATGGATAGCAAACATTGTCCCTGTCAGAAAAAAGAACGGGCAGCTTCGTGTCTGTGTAGACTTTCGTGACCTGAATAATGCGTGTCCTTTACCCATCACAGAAATCATG
GTTGACGCAACTACTGGACACGAGGCACTGTCCTTTATGGATGGGTCGTCTGGATATAATCAAATACGAATGGCCCTTTCGGATGAAGAAATGACAGCTTTCAGGACCCC
AAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTAAAAAATGTTGGTGCCACTTATCAACGTGCTATGCAAAAAGTGTTTGACGATATGCTACATAAGTATGTCG
AACGTTACGTTCATGACCTTGTGGTCAAATCCAAGAGACAACAAAACCATTTGAAGGATCTAAAGGTTGTGTTCGATCGCTTACGAAAATATCAGCTAAGGATGAACCCT
CTCAAATGCGCGTTCGGTGTGACTTCAGGAAAGTTTCTTGACTTTATTGTAAGGCATCGAGGGATTGAGATAGACCAGTCCAAGATTGATGTCATTCAGAAGATGCCAAG
GTCAAAGAGTTTGCATGACCTAAGAAGTCTCCAGGGACGATTGGCATACATTCAAAGGTTCATCTCCAACCTGGCCGGTCGGTGCCAACCTTTTCAAAAGTTGATGAGAA
AAGGAGAAAATTTTGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTGCTTACAGAGTATAAGGACATTTTTGCTTGGCCGTATAAGGAGATGCCAGGACTTGATCCAAAGGTAGCAGTCCATCATCTCGAAATTAAACCAGAGTATCG
ACCGATTAAGCAAGTACAACGACGTTTTCGATCAGAGCTTATTCCCCAGATCGAGGTTGAAGTCAACAAGTTGATTGAAGCAGGATTCATTCGCGAAGTCAAATATCCCA
CATGGATAGCAAACATTGTCCCTGTCAGAAAAAAGAACGGGCAGCTTCGTGTCTGTGTAGACTTTCGTGACCTGAATAATGCGTGTCCTTTACCCATCACAGAAATCATG
GTTGACGCAACTACTGGACACGAGGCACTGTCCTTTATGGATGGGTCGTCTGGATATAATCAAATACGAATGGCCCTTTCGGATGAAGAAATGACAGCTTTCAGGACCCC
AAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTAAAAAATGTTGGTGCCACTTATCAACGTGCTATGCAAAAAGTGTTTGACGATATGCTACATAAGTATGTCG
AACGTTACGTTCATGACCTTGTGGTCAAATCCAAGAGACAACAAAACCATTTGAAGGATCTAAAGGTTGTGTTCGATCGCTTACGAAAATATCAGCTAAGGATGAACCCT
CTCAAATGCGCGTTCGGTGTGACTTCAGGAAAGTTTCTTGACTTTATTGTAAGGCATCGAGGGATTGAGATAGACCAGTCCAAGATTGATGTCATTCAGAAGATGCCAAG
GTCAAAGAGTTTGCATGACCTAAGAAGTCTCCAGGGACGATTGGCATACATTCAAAGGTTCATCTCCAACCTGGCCGGTCGGTGCCAACCTTTTCAAAAGTTGATGAGAA
AAGGAGAAAATTTTGTGTGA
Protein sequenceShow/hide protein sequence
MSLLTEYKDIFAWPYKEMPGLDPKVAVHHLEIKPEYRPIKQVQRRFRSELIPQIEVEVNKLIEAGFIREVKYPTWIANIVPVRKKNGQLRVCVDFRDLNNACPLPITEIM
VDATTGHEALSFMDGSSGYNQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKYVERYVHDLVVKSKRQQNHLKDLKVVFDRLRKYQLRMNP
LKCAFGVTSGKFLDFIVRHRGIEIDQSKIDVIQKMPRSKSLHDLRSLQGRLAYIQRFISNLAGRCQPFQKLMRKGENFV