CuGenDBv2

CSPI06G15740 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G15740
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr6:14129671..14131045
RNA-Seq Expression: CSPI06G15740
Synteny: CSPI06G15740
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily
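
The genome location in the record above uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and the span length computed as shown here. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr6:14129671..14131045' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates (not confirmed by the page)."""
    return end - start + 1


chrom, start, end = parse_location("Chr6:14129671..14131045")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1375 bp on Chr6.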


Homology
GenBank top hits | e value | % identity | Alignment
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] | 1.7e-157 | 61.25
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR+HSEAF G+P   +ME+FD+ IR+ DLVEPSVQGNWFTWTSK+ GSG++RR++ VL+NDE LSAWP MR+N+LPWGISDHSP+L YPS Q N+R
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFP-----------------------------IVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRAL
        V                               +VS+++NL +LKPIL   FG+HI+++ E+V +AK+  + AQREVE NPLS+ LS  ASLAT  FW A+
Subjt:  VFP-----------------------------IVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRAL

Query:  RVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQA
        R+EEA++RQKS++RW  LGDQNTAFFHRSVRSR S N+L S                               IG++ELS  I+DIV+F W+EECCQ LQ 
Subjt:  RVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQA

Query:  PIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLR
        PI REEVRRVLFSMDSGKAP  DG+SVGF+KGAW++VGED C+ VLHFFET Y P GVN TAITLIPK   A+RLEDFRPISCCNV+YKCIS+ILADRLR
Subjt:  PIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLR

Query:  VWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
        +WLPSF+S+NQS FIPGRSII+NILLCQELV GYH+N+GKP CT+KVDL
Subjt:  VWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa] | 1.3e-141 | 63.12
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR HSEA  G+P   +MEDFDM IR+ DLVEPSVQGNWFTWTSK+ GSG+MRR++ VL+ND+ LSAWP M VN+LPWGISDHSP+L+YPS Q+N++
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS
        V  +   + N     P     FG+HIR++ E+VR+AK+  + AQREVE NP+S+ LS  ASLAT  FW A+R+E+           P+ G +N  F   S
Subjt:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS

Query:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV
        V SR S             SN+L S  IG++EL+  I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVGFFKGAW+++GED CD V
Subjt:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV

Query:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM
        LHFFET Y P GVN TAITLIPK N A+RLEDFRPISCCNV+YKCIS+ILADRLRVWLPSF+S+NQS FI GRSII+NILLCQELV GYH+N+GKP CT+
Subjt:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM

Query:  KVDL
        KVDL
Subjt:  KVDL

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] | 2.3e-146 | 61.76
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPM-LVYPSHQRNT
        DFN IR+H EAF G+P   +MEDFD+  R+ DLVEPSVQGNWFTWTSK+HGSG++RR++ +L+NDE LSAWP + V  L   + D S + +V     R+ 
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPM-LVYPSHQRNT

Query:  RVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHR
         V P+VS+++NL+NLKP L   FG+HI+++ E+V +AK+  +RAQREVE NP+S+ LS    LAT  FW A+R+EEA++RQKSRIRW +LGDQNTAFFHR
Subjt:  RVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHR

Query:  SVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVG
         VRSR S N+L S                               IG++EL   I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVG
Subjt:  SVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVG

Query:  FFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQ
        FFKGAW++V ED CDVVLHFFET Y P GVN T ITLIPKR  A+++E+FRPISCCNVIYKCIS+ILADRLRVWLPSF+ +NQS FIPGRSIIDNILLCQ
Subjt:  FFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQ

Query:  ELVWGYHMNTGKPWCTMKVDL
        ELV GYH+N+GKP CT+KVDL
Subjt:  ELVWGYHMNTGKPWCTMKVDL

XP_031740402.1 uncharacterized protein LOC116403409 [Cucumis sativus] | 1.1e-143 | 70.48
Query:  MRVNILPWGISDHSPMLVYPSHQR-----------------------------NTRVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQ
        MRVN+LPWGISDHSP+LVYPS+QR                             +TRV PIV+IV+NLRNLK IL  HFG+HIRTI EDVRLA DT +RA+
Subjt:  MRVNILPWGISDHSPMLVYPSHQR-----------------------------NTRVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQ

Query:  REVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NI
        RE+E N LSE+ SNHASLATVNFW+A+RVEEAAMRQKSR RW KL DQNTAFFHRSVRSRQSSNALRS                              NI
Subjt:  REVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NI

Query:  GHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDAD
         + ELSTSIE+IV+F WTEECCQ LQ+PIGREEVRRVLFSMD GKAP  DGYSVGFFKGAWT+VGE  CDVVLHFFETNYFPQGVNTTAITLIPKRN AD
Subjt:  GHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDAD

Query:  RLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
        RLEDF PISCC+VIYKCISRILADRLRVWLPSFVS NQ  FIPGRSIIDNILLCQELV  YH++ GKP CTMKVDL
Subjt:  RLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL

XP_031745634.1 uncharacterized protein LOC116406053 [Cucumis sativus] | 5.1e-138 | 66.58
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFNTIRL SEAF GAPN               L+  +VQGNWFTWTSKIHGSGLM+R++ +L+NDEGLS WPNMRVN+LPW        +V  +  ++TR
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALR--VEEAAMRQKSRIRWPKLGDQNTAFFH
        V PIV+IV+NLRNLK IL  HFG+HIRTI EDVRLA DT +RA+RE+E N LSE+ SNHASLAT +   ALR  ++    R  +        DQ T    
Subjt:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALR--VEEAAMRQKSRIRWPKLGDQNTAFFH

Query:  RSVRSRQSSNALRSNIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQG
           +    S     NI + ELSTSIE+IV+F WTEECCQ LQ+PIGR EVRRVLFSMD GKAP  DGYSVGFFKGAWT+VGE  CDVVLHFFETNYFPQG
Subjt:  RSVRSRQSSNALRSNIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQG

Query:  VNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
        VNTTAITLIPKRN ADRLEDF PISCC+VIYKCISRILADRLRVWLPSFVS NQ  FIPGRSIIDNILLCQELV  YH++ GKP CTMKVDL
Subjt:  VNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BSI8 uncharacterized protein LOC103493225 | 5.0e-123 | 57.67
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR HSEA  G+P   +MEDFDM IR+ DLVEPSVQ NWFTWTSK+ GSG+MRR++ VL+ND+ LSAWP +R                         
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS
                             FG+HIR++ E+VR+AK+  + AQREVE NP+S+ LS  ASLAT  FW A+R+E+           P+ G +N  F   S
Subjt:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS

Query:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV
        V SR S             SN+L S  IG++EL+  I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVGFFKGAW+++GED CD V
Subjt:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV

Query:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM
        LHFFET Y P GVN TAITLIPK N A+RLEDFRPISCCNV+YKCIS+ILADRLRVWLPSF+S+NQS FI GRSII+NILLCQELV GYH+N+GKP CT+
Subjt:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM

Query:  KVDL
        KVDL
Subjt:  KVDL

A0A5A7TZS0 Reverse transcriptase domain-containing protein | 8.1e-158 | 61.25
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR+HSEAF G+P   +ME+FD+ IR+ DLVEPSVQGNWFTWTSK+ GSG++RR++ VL+NDE LSAWP MR+N+LPWGISDHSP+L YPS Q N+R
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFP-----------------------------IVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRAL
        V                               +VS+++NL +LKPIL   FG+HI+++ E+V +AK+  + AQREVE NPLS+ LS  ASLAT  FW A+
Subjt:  VFP-----------------------------IVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRAL

Query:  RVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQA
        R+EEA++RQKS++RW  LGDQNTAFFHRSVRSR S N+L S                               IG++ELS  I+DIV+F W+EECCQ LQ 
Subjt:  RVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQA

Query:  PIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLR
        PI REEVRRVLFSMDSGKAP  DG+SVGF+KGAW++VGED C+ VLHFFET Y P GVN TAITLIPK   A+RLEDFRPISCCNV+YKCIS+ILADRLR
Subjt:  PIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLR

Query:  VWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
        +WLPSF+S+NQS FIPGRSII+NILLCQELV GYH+N+GKP CT+KVDL
Subjt:  VWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL

A0A5A7UP65 Reverse transcriptase | 6.3e-142 | 63.12
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR HSEA  G+P   +MEDFDM IR+ DLVEPSVQGNWFTWTSK+ GSG+MRR++ VL+ND+ LSAWP M VN+LPWGISDHSP+L+YPS Q+N++
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS
        V  +   + N     P     FG+HIR++ E+VR+AK+  + AQREVE NP+S+ LS  ASLAT  FW A+R+E+           P+ G +N  F   S
Subjt:  VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRS

Query:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV
        V SR S             SN+L S  IG++EL+  I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVGFFKGAW+++GED CD V
Subjt:  VRSRQS-------------SNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVV

Query:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM
        LHFFET Y P GVN TAITLIPK N A+RLEDFRPISCCNV+YKCIS+ILADRLRVWLPSF+S+NQS FI GRSII+NILLCQELV GYH+N+GKP CT+
Subjt:  LHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTM

Query:  KVDL
        KVDL
Subjt:  KVDL

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein | 1.1e-146 | 61.76
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPM-LVYPSHQRNT
        DFN IR+H EAF G+P   +MEDFD+  R+ DLVEPSVQGNWFTWTSK+HGSG++RR++ +L+NDE LSAWP + V  L   + D S + +V     R+ 
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPM-LVYPSHQRNT

Query:  RVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHR
         V P+VS+++NL+NLKP L   FG+HI+++ E+V +AK+  +RAQREVE NP+S+ LS    LAT  FW A+R+EEA++RQKSRIRW +LGDQNTAFFHR
Subjt:  RVFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHR

Query:  SVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVG
         VRSR S N+L S                               IG++EL   I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVG
Subjt:  SVRSRQSSNALRS------------------------------NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVG

Query:  FFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQ
        FFKGAW++V ED CDVVLHFFET Y P GVN T ITLIPKR  A+++E+FRPISCCNVIYKCIS+ILADRLRVWLPSF+ +NQS FIPGRSIIDNILLCQ
Subjt:  FFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQ

Query:  ELVWGYHMNTGKPWCTMKVDL
        ELV GYH+N+GKP CT+KVDL
Subjt:  ELVWGYHMNTGKPWCTMKVDL

A0A5D3D7P6 Reverse transcriptase | 7.4e-135 | 60.71
Query:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR
        DFN IR HSEA  G+P   +MEDFDM IR+ DLVEPSVQ NWFTWTSK+ GSG+MRR++ VL+ND+ LSAWP M VN+LPWGISDHSP+L+YPS Q+N++
Subjt:  DFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTR

Query:  ------VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNT
              V P+V +++NL  LKPIL   FG+HIR++ E+VR+AK+  + AQRE++                   WR+L + E +    S ++         
Subjt:  ------VFPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNT

Query:  AFFHRSVRSRQSSNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETN
        A     +     SN+L S  IG++EL+  I+DIV+F W+EECCQ LQ PI REEVRRVLFSMDSGKAP  DG+SVGFFKGAW+++GED CD VLHFFET 
Subjt:  AFFHRSVRSRQSSNALRS-NIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETN

Query:  YFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
        Y P GVN TAITLIPK N A+RLEDFRPISCCNV+YKCIS+ILADRLRVWLPSF+S+NQS FI GRSII+NILLCQELV GYH+N+GKP CT+KVDL
Subjt:  YFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL

SwissProt top hits | e value | % identity | Alignment
P11369 LINE-1 retrotransposable element ORF2 protein | 7.7e-12 | 32.8
Query:  LQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFE----TNYFPQGVNTTAITLIPK-RNDADRLEDFRPISCCNVIYKCIS
        L +PI  +E+  V+ S+ + K+P  DG+S  F++       EDL  ++   F         P       ITLIPK + D  ++E+FRPIS  N+  K ++
Subjt:  LQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFE----TNYFPQGVNTTAITLIPK-RNDADRLEDFRPISCCNVIYKCIS

Query:  RILADRLRVWLPSFVSANQSTFIPG
        +ILA+R++  + + +  +Q  FIPG
Subjt:  RILADRLRVWLPSFVSANQSTFIPG

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.6e-17 | 30.6
Query:  LQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILAD
        L+ PI  +E+ + L  M   K+P LDG ++ FF+  W  +G D   V+   F+    P       ++L+PK+ D   ++++RP+S  +  YK +++ ++ 
Subjt:  LQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCISRILAD

Query:  RLRVWLPSFVSANQSTFIPGRSIIDNILLCQELV
        RL+  L   +  +QS  +PGR+I DN+ L ++L+
Subjt:  RLRVWLPSFVSANQSTFIPGRSIIDNILLCQELV

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.2e-33 | 26.57
Query:  DFNTIRLHSEAFRGAPNS---RDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSH--
        DF+ I   S+ +     S   R +E+F   +R+ DLV+   +G  +TW++    + ++R+++  + N +  S++P+        G+SDHSP ++   +  
Subjt:  DFNTIRLHSEAFRGAPNS---RDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSH--

Query:  QRNTRVFPIVSIVKNLRNLKPILCNHF------GKHIRTIGEDVRLAK----------------------DTTNRAQREVEINPLSEDL--SNHASLATV
        +R+ + F   S +         L   +      G H+ ++GE ++ AK                      D+    Q ++  NP S+ L    H +    
Subjt:  QRNTRVFPIVSIVKNLRNLKPILCNHF------GKHIRTIGEDVRLAK----------------------DTTNRAQREVEINPLSEDL--SNHASLATV

Query:  NFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALR-------------------------------SNIGHKELSTSIEDIVKFSWTEE
        NF+ A    E+  RQKSRI+W + GD NT FFH+ + + Q+ N ++                               S+I   +    I+DI  F   + 
Subjt:  NFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALR-------------------------------SNIGHKELSTSIEDIVKFSWTEE

Query:  CCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCIS
            L A    +E+   +F+M   KAP  D ++  FF  +W +V +     V  FF T +  +  N TAITLIPK    D+L  FRP+SCC V+YK I+
Subjt:  CCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRPISCCNVIYKCIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 2.7e-04 | 35.71
Query:  LADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTG-KPWCTMKVDL
        + +RL+  + + +   Q++FIPGR   DNI+  QE V       G K W  +K+DL
Subjt:  LADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTG-KPWCTMKVDL


Sequences
CDS sequence
ATGGAAAGGGCCGAGTATGGTCATTGGGATTTTAATACCATCAGACTCCATTCTGAAGCATTCAGGGGAGCTCCTAACTCGAGGGATATGGAGGATTTTGACATGACTAT
TCGAGAGGAAGACCTTGTTGAACCCTCGGTTCAGGGGAACTGGTTTACCTGGACTAGTAAGATCCATGGTTCTGGTTTGATGAGGAGAATTAATCCTGTCTTAATGAATG
ATGAGGGACTTAGTGCTTGGCCTAACATGAGGGTTAACATCCTCCCTTGGGGTATATCTGATCATTCTCCCATGCTTGTTTATCCCAGCCATCAACGAAATACTAGAGTT
TTTCCGATTGTGAGTATTGTGAAGAACTTAAGAAATCTCAAGCCAATTCTTTGCAACCATTTTGGTAAGCATATCCGGACCATCGGTGAGGATGTTCGTCTTGCTAAGGA
TACCACGAATCGAGCTCAAAGAGAGGTGGAGATTAACCCTCTGTCAGAGGATCTGAGTAATCACGCGAGCTTAGCCACTGTGAACTTTTGGAGAGCGCTTAGAGTGGAGG
AAGCCGCTATGCGACAAAAGTCGCGAATCAGGTGGCCGAAGTTAGGTGACCAAAATACTGCCTTTTTTCATCGGTCTGTCCGATCAAGGCAGAGCAGTAATGCATTGCGA
TCAAATATTGGACATAAAGAGCTTTCTACTAGTATTGAGGATATTGTTAAGTTTAGTTGGACTGAGGAGTGCTGCCAGGTCCTTCAGGCGCCAATTGGTAGAGAGGAGGT
GAGACGTGTTCTATTTTCCATGGATAGTGGAAAGGCTCCAAGCCTTGATGGGTATTCGGTTGGCTTCTTCAAAGGAGCATGGACGATGGTTGGAGAGGATTTATGTGATG
TCGTCTTACACTTCTTTGAGACTAATTACTTCCCTCAAGGGGTGAATACAACTGCTATTACGCTAATTCCTAAAAGGAACGATGCTGATCGGTTGGAGGATTTCAGGCCT
ATCTCTTGTTGTAATGTTATTTACAAGTGCATTTCAAGAATATTGGCTGATAGGCTTCGTGTGTGGCTTCCTTCTTTTGTAAGTGCAAACCAATCAACTTTCATCCCTGG
GAGGAGTATTATTGACAATATACTTCTTTGTCAAGAGCTTGTATGGGGTTACCATATGAACACAGGAAAACCTTGGTGCACTATGAAGGTTGACCTCTAA
mRNA sequence
ATGGAAAGGGCCGAGTATGGTCATTGGGATTTTAATACCATCAGACTCCATTCTGAAGCATTCAGGGGAGCTCCTAACTCGAGGGATATGGAGGATTTTGACATGACTAT
TCGAGAGGAAGACCTTGTTGAACCCTCGGTTCAGGGGAACTGGTTTACCTGGACTAGTAAGATCCATGGTTCTGGTTTGATGAGGAGAATTAATCCTGTCTTAATGAATG
ATGAGGGACTTAGTGCTTGGCCTAACATGAGGGTTAACATCCTCCCTTGGGGTATATCTGATCATTCTCCCATGCTTGTTTATCCCAGCCATCAACGAAATACTAGAGTT
TTTCCGATTGTGAGTATTGTGAAGAACTTAAGAAATCTCAAGCCAATTCTTTGCAACCATTTTGGTAAGCATATCCGGACCATCGGTGAGGATGTTCGTCTTGCTAAGGA
TACCACGAATCGAGCTCAAAGAGAGGTGGAGATTAACCCTCTGTCAGAGGATCTGAGTAATCACGCGAGCTTAGCCACTGTGAACTTTTGGAGAGCGCTTAGAGTGGAGG
AAGCCGCTATGCGACAAAAGTCGCGAATCAGGTGGCCGAAGTTAGGTGACCAAAATACTGCCTTTTTTCATCGGTCTGTCCGATCAAGGCAGAGCAGTAATGCATTGCGA
TCAAATATTGGACATAAAGAGCTTTCTACTAGTATTGAGGATATTGTTAAGTTTAGTTGGACTGAGGAGTGCTGCCAGGTCCTTCAGGCGCCAATTGGTAGAGAGGAGGT
GAGACGTGTTCTATTTTCCATGGATAGTGGAAAGGCTCCAAGCCTTGATGGGTATTCGGTTGGCTTCTTCAAAGGAGCATGGACGATGGTTGGAGAGGATTTATGTGATG
TCGTCTTACACTTCTTTGAGACTAATTACTTCCCTCAAGGGGTGAATACAACTGCTATTACGCTAATTCCTAAAAGGAACGATGCTGATCGGTTGGAGGATTTCAGGCCT
ATCTCTTGTTGTAATGTTATTTACAAGTGCATTTCAAGAATATTGGCTGATAGGCTTCGTGTGTGGCTTCCTTCTTTTGTAAGTGCAAACCAATCAACTTTCATCCCTGG
GAGGAGTATTATTGACAATATACTTCTTTGTCAAGAGCTTGTATGGGGTTACCATATGAACACAGGAAAACCTTGGTGCACTATGAAGGTTGACCTCTAA
Protein sequence
MERAEYGHWDFNTIRLHSEAFRGAPNSRDMEDFDMTIREEDLVEPSVQGNWFTWTSKIHGSGLMRRINPVLMNDEGLSAWPNMRVNILPWGISDHSPMLVYPSHQRNTRV
FPIVSIVKNLRNLKPILCNHFGKHIRTIGEDVRLAKDTTNRAQREVEINPLSEDLSNHASLATVNFWRALRVEEAAMRQKSRIRWPKLGDQNTAFFHRSVRSRQSSNALR
SNIGHKELSTSIEDIVKFSWTEECCQVLQAPIGREEVRRVLFSMDSGKAPSLDGYSVGFFKGAWTMVGEDLCDVVLHFFETNYFPQGVNTTAITLIPKRNDADRLEDFRP
ISCCNVIYKCISRILADRLRVWLPSFVSANQSTFIPGRSIIDNILLCQELVWGYHMNTGKPWCTMKVDL
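
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and an in-frame sequence containing only A, C, G and T. The function name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS into protein, dropping any trailing stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence block can be
    pasted in directly. Ambiguous bases such as N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First ten codons and last five codons of the CDS shown above.
print(translate("ATGGAAAGGGCCGAGTATGGTCATTGGGAT"))
print(translate("AAGGTTGACCTCTAA"))
```

The two fragments translate to the start (MERAEYGHWD) and end (KVDL) of the listed protein sequence. Passing the full CDS block to `translate` would allow the same comparison over the whole sequence.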