; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G16410 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G16410
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr6:14796686..14797642
RNA-Seq ExpressionCSPI06G16410
SyntenyCSPI06G16410
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-14889.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-14889.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-14989.9Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY FNQIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAV SLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDYMLVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-14889.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-14889.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.5e-14989.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

A0A5A7TWB9 Gag/pol protein5.5e-14989.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDYMLVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+F LNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

A0A5A7V4M1 Gag/pol protein1.9e-14989.9Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY FNQIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAV SLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDYMLVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

A0A5D3CPJ6 Gag/pol protein5.5e-14989.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

A0A5D3CSZ6 Gag/pol protein5.5e-14989.56Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+VKTAFLNGNLEETIYMQQ EGFI PGQEQKICKLNRSIYGLKQASRSWNIRF+ AIKSY F+QIVDEPCVYKRIIN SVAFLVL VDDILLIGNDIG+
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L DIKQW+A QFQMKDLGEA FVLGIQIFRDR+NK LALS+ASYIDK+VVKYSMQNSKRGLLPF+HGVTLSKEQ PKTPQ+VEEMRHIPYASAVGSLMY 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS
        MLCTRPDICYAVGIV RYQSNPGLAHWTAVKTI  YLRRTRDY LVYGSKDLIL  Y DSDFQTDRDSRKSTSGS+FTLNG AVVWRSIKQGCIADS
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADS

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.3e-4135.08Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVY---KRIINNSVAFLVLDVDDILLIGND
        M+VKTAFLNG L+E IYM+  +G         +CKLN++IYGLKQA+R W   FE A+K   F     + C+Y   K  IN ++ +++L VDD+++   D
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVY---KRIINNSVAFLVLDVDDILLIGND

Query:  IGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSL
        +  + + K+++  +F+M DL E    +GI+I  + +   + LS+++Y+ K++ K++M+N      P       SK  Y       +E  + P  S +G L
Subjt:  IGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSL

Query:  MYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLI----LIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVV-WRSIKQG
        MY+MLCTRPD+  AV I+ RY S      W  +K +  YL+ T D  L++  K+L     +I Y DSD+      RKST+G +F +    ++ W + +Q 
Subjt:  MYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLI----LIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVV-WRSIKQG

Query:  CIADS
         +A S
Subjt:  CIADS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-7446.96Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVY-KRIINNSVAFLVLDVDDILLIGNDIG
        ++VKTAFL+G+LEE IYM+Q EGF   G++  +CKLN+S+YGLKQA R W ++F++ +KS ++ +   +PCVY KR   N+   L+L VDD+L++G D G
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVY-KRIINNSVAFLVLDVDDILLIGNDIG

Query:  VLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMY
        ++A +K  ++  F MKDLG A  +LG++I R+R ++ L LS+  YI++V+ +++M+N+K    P    + LSK+  P T +E   M  +PY+SAVGSLMY
Subjt:  VLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMY

Query:  VMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIA
         M+CTRPDI +AVG+V R+  NPG  HW AVK I  YLR T    L +G  D IL  Y D+D   D D+RKS++G +FT +G A+ W+S  Q C+A
Subjt:  VMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIA

P25600 Putative transposon Ty5-1 protein YCL074W3.2e-2930.58Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        M+V TAFLN  ++E IY++Q  GF+       + +L   +YGLKQA   WN    N +K   F +   E  +Y R  ++   ++ + VDD+L+      +
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
           +KQ +   + MKDLG+    LG+ I +   N  + LS   YI K   +  +   K    P  +    SK  +  T   ++++   PY S VG L++ 
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGS-KDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIK
            RPDI Y V ++ R+   P   H  + + +  YL  TR   L Y S   L L  Y D+      D   ST G +  L G  V W S K
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGS-KDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.2e-3331.03Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        ++V  AFL G L + +YM Q  GFI   +   +CKL +++YGLKQA R+W +   N + +  F   V +  ++      S+ ++++ VDDIL+ GND  +
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L +    ++ +F +KD  E  + LGI+    R    L LS+  YI  ++ + +M  +K    P      LS     K     E      Y   VGSL Y+
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQ-GCIADSK
           TRPDI YAV  + ++   P   H  A+K I  YL  T ++ + +     L L  Y D+D+  D+D   ST+G +  L    + W S KQ G +  S 
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQ-GCIADSK

Query:  KGPMLWPRKKPTNSEQYQW
        +      R     S + QW
Subjt:  KGPMLWPRKKPTNSEQYQW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-3329.47Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV
        ++V  AFL G L + +YM Q  GF+   +   +C+L ++IYGLKQA R+W +     + +  F   + +  ++      S+ ++++ VDDIL+ GND  +
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGV

Query:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV
        L      ++ +F +K+  +  + LGI+    R  + L LS+  Y   ++ + +M  +K    P      L+     K P   E      Y   VGSL Y+
Subjt:  LADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYV

Query:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQ-GCIADSK
           TRPD+ YAV  + +Y   P   HW A+K +  YL  T D+ + +     L L  Y D+D+  D D   ST+G +  L    + W S KQ G +  S 
Subjt:  MLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQ-GCIADSK

Query:  KGPMLWPRKKPTNSEQYQW
        +      R     S + QW
Subjt:  KGPMLWPRKKPTNSEQYQW

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-3128.48Show/hide
Query:  MNVKTAFLNGNLEETIYMQQAEGFITPGQE----QKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGN
        +++  AFLNG+L+E IYM+   G+     +      +C L +SIYGLKQASR W ++F   +  + F Q   +   + +I       +++ VDDI++  N
Subjt:  MNVKTAFLNGNLEETIYMQQAEGFITPGQE----QKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGN

Query:  DIGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGS
        +   + ++K  + + F+++DLG   + LG++I R      + + +  Y   ++ +  +   K   +P    VT S           + +    Y   +G 
Subjt:  DIGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGS

Query:  LMYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSK-DLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIA
        LMY+ + TR DI +AV  + ++   P LAH  AV  I +Y++ T    L Y S+ ++ L  + D+ FQ+ +D+R+ST+G    L    + W+S KQ  ++
Subjt:  LMYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDYMLVYGSK-DLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIA

Query:  DS
         S
Subjt:  DS

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-1631.19Show/hide
Query:  FLVLDVDDILLIGNDIGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSK--RGLLPFKHGVTLSKEQYPKTPQ
        +L+L VDDILL G+   +L  +   +++ F MKDLG   + LGIQI        L LS+  Y ++++    M + K     LP K   ++S  +YP  P 
Subjt:  FLVLDVDDILLIGNDIGVLADIKQWMAAQFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSK--RGLLPFKHGVTLSKEQYPKTPQ

Query:  EVEEMRHIPYASAVGSLMYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTL
        +        + S VG+L Y+ L TRPDI YAV IV +    P LA +  +K +  Y++ T  + + ++ +  L +  + DSD+     +R+ST+G    L
Subjt:  EVEEMRHIPYASAVGSLMYVMLCTRPDICYAVGIVGRYQSNPGLAHWTAVKTIFNYLRRTRDY-MLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTL

Query:  NGVAVVWRSIKQGCIADS
            + W + +Q  ++ S
Subjt:  NGVAVVWRSIKQGCIADS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGTTAAGACTGCTTTTTTGAATGGCAATCTTGAGGAAACCATCTACATGCAACAAGCAGAAGGATTCATAACTCCAGGTCAAGAGCAAAAAATTTGCAAGCTTAA
TCGTTCCATTTATGGATTGAAGCAAGCTTCTCGATCTTGGAACATAAGATTTGAGAACGCAATAAAATCTTATAGCTTTAATCAAATCGTTGATGAACCTTGTGTCTACA
AGAGAATCATCAACAATTCAGTAGCTTTCTTAGTTCTGGACGTAGATGATATCCTACTCATTGGGAATGATATAGGTGTACTAGCTGATATCAAACAATGGATGGCGGCC
CAATTTCAAATGAAAGATTTGGGAGAGGCACTATTTGTTCTGGGCATTCAGATCTTTAGAGATCGAGAGAACAAAACGCTAGCTTTGTCTAAAGCATCGTATATTGACAA
GGTAGTTGTTAAATATTCAATGCAAAACTCCAAGAGAGGCTTACTACCTTTCAAGCATGGAGTTACTTTGTCTAAGGAACAATATCCTAAGACACCTCAAGAGGTTGAGG
AAATGAGACATATCCCCTATGCATCAGCTGTTGGTAGCTTGATGTATGTGATGTTATGTACTAGACCTGACATCTGTTATGCGGTGGGAATAGTCGGTAGATATCAATCA
AATCCAGGATTAGCTCACTGGACTGCCGTTAAAACTATCTTCAATTATCTTAGGAGAACAAGGGACTACATGCTTGTGTATGGTTCTAAGGATTTGATCCTTATAAGATA
CAAAGACTCTGACTTTCAAACTGATAGAGATTCTAGGAAATCTACTTCAGGTTCAATGTTCACTCTTAATGGAGTAGCTGTAGTTTGGAGAAGTATCAAGCAAGGATGTA
TTGCTGACTCCAAAAAAGGGCCAATGCTATGGCCTAGAAAGAAACCGACCAACTCAGAACAATATCAATGGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGTTAAGACTGCTTTTTTGAATGGCAATCTTGAGGAAACCATCTACATGCAACAAGCAGAAGGATTCATAACTCCAGGTCAAGAGCAAAAAATTTGCAAGCTTAA
TCGTTCCATTTATGGATTGAAGCAAGCTTCTCGATCTTGGAACATAAGATTTGAGAACGCAATAAAATCTTATAGCTTTAATCAAATCGTTGATGAACCTTGTGTCTACA
AGAGAATCATCAACAATTCAGTAGCTTTCTTAGTTCTGGACGTAGATGATATCCTACTCATTGGGAATGATATAGGTGTACTAGCTGATATCAAACAATGGATGGCGGCC
CAATTTCAAATGAAAGATTTGGGAGAGGCACTATTTGTTCTGGGCATTCAGATCTTTAGAGATCGAGAGAACAAAACGCTAGCTTTGTCTAAAGCATCGTATATTGACAA
GGTAGTTGTTAAATATTCAATGCAAAACTCCAAGAGAGGCTTACTACCTTTCAAGCATGGAGTTACTTTGTCTAAGGAACAATATCCTAAGACACCTCAAGAGGTTGAGG
AAATGAGACATATCCCCTATGCATCAGCTGTTGGTAGCTTGATGTATGTGATGTTATGTACTAGACCTGACATCTGTTATGCGGTGGGAATAGTCGGTAGATATCAATCA
AATCCAGGATTAGCTCACTGGACTGCCGTTAAAACTATCTTCAATTATCTTAGGAGAACAAGGGACTACATGCTTGTGTATGGTTCTAAGGATTTGATCCTTATAAGATA
CAAAGACTCTGACTTTCAAACTGATAGAGATTCTAGGAAATCTACTTCAGGTTCAATGTTCACTCTTAATGGAGTAGCTGTAGTTTGGAGAAGTATCAAGCAAGGATGTA
TTGCTGACTCCAAAAAAGGGCCAATGCTATGGCCTAGAAAGAAACCGACCAACTCAGAACAATATCAATGGTCTTGA
Protein sequenceShow/hide protein sequence
MNVKTAFLNGNLEETIYMQQAEGFITPGQEQKICKLNRSIYGLKQASRSWNIRFENAIKSYSFNQIVDEPCVYKRIINNSVAFLVLDVDDILLIGNDIGVLADIKQWMAA
QFQMKDLGEALFVLGIQIFRDRENKTLALSKASYIDKVVVKYSMQNSKRGLLPFKHGVTLSKEQYPKTPQEVEEMRHIPYASAVGSLMYVMLCTRPDICYAVGIVGRYQS
NPGLAHWTAVKTIFNYLRRTRDYMLVYGSKDLILIRYKDSDFQTDRDSRKSTSGSMFTLNGVAVVWRSIKQGCIADSKKGPMLWPRKKPTNSEQYQWS