; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G22000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G22000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr6:19940628..19941778
RNA-Seq ExpressionCSPI06G22000
SyntenyCSPI06G22000
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0015074 - DNA integration (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034826.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]3.1e-13162.11Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMK D+K+YV QCE+CQRNKYEATKPA VL     P+KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        ++K     +S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQPHKWD+FIPWAELWYNTTFHAST++ PFE VYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG +KT+NNEVE++LKERDLA                               E +  +  PYRQRSLARK+SEKLAP++YGPYKIIEE+G V
Subjt:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYR  LPPEA IHNVFHI QLK KLG+QQV Q Q P+LT+ FELQLW E +LG+ WNK+LG NEWL+KW+ LPE+EAT E
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

KAA0063769.1 small RNA 2'-O-methyltransferase isoform X1 [Cucumis melo var. makuwa]3.9e-13463.42Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL WKGMK+DVKR+V+QCEICQRNKYEATKPAGVL      +KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        +++      S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQP KWD+FIPWAELWYNTTFHAST++ PFE V+G
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG +KT NNEVE++LK RDLA                               E +  +  PYRQ SLARKRSEKLAP+FYGPYKIIEE+G V
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRL+LPPEA IHNVFHI QLK KLG+QQV Q+Q P+LT+EFELQLW ET+LG+ WNK+LG NEWL+KWK LPE+EATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]5.9e-13564.14Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMKAD+K+YV+QCEICQRNKYEATKPAGVL     P+ ILEEWSMDFIEGLPKA G+NVIMVVVDRLSKY+YFI +KHPFT K+VA  FI++
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        I+       S +S ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQP KW +FI WAELWYNTTFH+S RSNPF+IVYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        RQ PP++SYG++KT N+EVE MLKERDLA                               + +  +  PY QRSLARKR EKLAPKFYGPY IIEE+GEV
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWEVM
        AYRL+LPPEA IH+VFHI QLKLKLG Q V QQQQP+LT +FELQLW E +LG+ WNKDLG NEWLVKWK+LPETEATWEV+
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWEVM

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]2.8e-13261.58Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL WKGMKAD+KRYV++C+ CQRNK+EATKPAGVL     P+KILE+W+MDFIEGLP A G NVIMVVVDRLSKYSYF+PLKHP+T K+VA +F+++
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        ++       S I+ ++           ++  G  ++    F PQTDGQTERVN+CLETY+RCFCNEQP KWDK IPWAELWYNTTFHAST++ P++ V+G
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-ECLEG------------------------------EFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG K++ NN+VE MLKERDLA   LE                               +  PYRQRSLARK+ EKL+PKFYGPY++IEE+GEV
Subjt:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-ECLEG------------------------------EFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRLQLPPEA IHNVFH+ QLKLKLG Q V QQQQPILT++FELQLW E +LG+ WNK+LGGNEWL+KWK+LP++EATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]3.0e-13464.14Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        M GEL WKGMKADVK+YV++CE+CQRNK EATKPAGVL     PE+ILE+WSMDFIEGLPKA G+NVIMV+VDRLSKYSYFI ++HPF  ++VAEVFIDR
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        ++       S IS ++           ++  G  ++    F PQTDGQTERVN+C+ETY+RCFCNEQP KW+KFIPWAELWYNTTFHAS+RSNPF+IVYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLAECLEGE-------------------------------FEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG+ KT +NEVE MLKERDLA     E                                 PYRQ SLARKRSEKLAPKFYGPY+IIEE+GEV
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLAECLEGE-------------------------------FEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWEVM
        AYRLQLPPEA IH+VFHI QLKLKLG   V Q QQP+LT+EFELQL  ET+LG+ WNKDLG NEWLVKWK+LPE EATWE +
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWEVM

TrEMBL top hitse value%identityAlignment
A0A5A7SY68 Transposon Ty3-G Gag-Pol polyprotein1.5e-13162.11Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMK D+K+YV QCE+CQRNKYEATKPA VL     P+KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        ++K     +S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQPHKWD+FIPWAELWYNTTFHAST++ PFE VYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG +KT+NNEVE++LKERDLA                               E +  +  PYRQRSLARK+SEKLAP++YGPYKIIEE+G V
Subjt:  R-QPPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYR  LPPEA IHNVFHI QLK KLG+QQV Q Q P+LT+ FELQLW E +LG+ WNK+LG NEWL+KW+ LPE+EAT E
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

A0A5A7T8C0 Ty3/gypsy retrotransposon protein7.3e-13162.37Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMK D+K+YV QCE+CQRNKYEATK AGVL     P+KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        I++     +S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQPHKWD+FIPWAELWYNTTFHAST++ PFE VYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG KKT+NNEVE MLKERD A                               E +  + +PYRQRSLARK+SEKLAP++YGPYKIIEE+G V
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRL LPPEA IHNVFHI QLK KLG+QQV Q Q P+LT+ FELQL  E +LG+ WNK+LG NEWL+KW+ L E+EATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

A0A5A7UM77 Ty3/gypsy retrotransposon protein7.3e-13161.84Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMK D+K+YV+QCE+CQRNKYEATKPAGVL     P+KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        I++     +S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQPHKWD+FIPWAELWYNTTFH+ST++ PFE VYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG KK +NNEVE MLKERD A                               E +  + +PYRQRSLARK+SEKLAP++YGPYKIIEE+G V
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRL LPPEA IHNVFHI QLK KLG+QQV Q Q P+LT+ FELQL  E +LG+ WNK+LG NEWL+KW+ L E++ATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

A0A5A7UYM1 Ty3/gypsy retrotransposon protein7.3e-13162.11Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL W+GMK D+K+YV+QCE+CQRNKYEATKPAGVL     P+KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        I++     +S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQPHKWD+FIPWAELWYNT FHAST++ PFE VYG
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG KKT NNEVE MLKERD A                               E +  + +PYRQRSLARK+SEKLAP++YGPYKIIEE+G V
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRL LPPEA IHNVFHI QLK KLG+QQV Q Q P+LT+ FELQL  E +LG+ WNK+LG NEWL+KW+ L E++ATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

A0A5D3C0R9 Rotamase1.9e-13463.42Show/hide
Query:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR
        MSGEL WKGMK+DVKR+V+QCEICQRNKYEATKPAGVL      +KILE+W+MDFIEGLPKA G+NVIMVVVDRL+KY+YFI LKHPF+ K+VA  FID+
Subjt:  MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDR

Query:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG
        +++      S IS ++           ++  G  ++    F PQTDGQTERVNQCLETY+RCFCNEQP KWD+FIPWAELWYNTTFHAST++ PFE V+G
Subjt:  IIKIRS--SSAISGKN-----------YSPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYG

Query:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV
        R  PPL+SYG +KT NNEVE++LK RDLA                               E +  +  PYRQ SLARKRSEKLAP+FYGPYKIIEE+G V
Subjt:  RQ-PPLISYGSKKTSNNEVEAMLKERDLA-------------------------------ECLEGEFEPYRQRSLARKRSEKLAPKFYGPYKIIEEMGEV

Query:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
        AYRL+LPPEA IHNVFHI QLK KLG+QQV Q+Q P+LT+EFELQLW ET+LG+ WNK+LG NEWL+KWK LPE+EATWE
Subjt:  AYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.8e-3331.31Show/hide
Query:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--
        WKG++  ++ YV+ C  CQ NK    KP G L      E+  E  SMDFI  LP++ G N + VVVDR SK +  +P     T ++ A +F  R+I    
Subjt:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--

Query:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-
             I+  ++   S        +Y F         PQTDGQTER NQ +E  +RC C+  P+ W   I   +  YN   H++T+  PFEIV+   P L 
Subjt:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-

Query:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E
           +   S KT  N  E +   + + E L                   EF+P       R ++    +S KLAP F GP+ ++++ G   Y L LP   +
Subjt:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E

Query:  AVIHNVFHIFQLK
         +  + FH+  L+
Subjt:  AVIHNVFHIFQLK

P0CT35 Transposon Tf2-2 polyprotein1.8e-3331.31Show/hide
Query:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--
        WKG++  ++ YV+ C  CQ NK    KP G L      E+  E  SMDFI  LP++ G N + VVVDR SK +  +P     T ++ A +F  R+I    
Subjt:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--

Query:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-
             I+  ++   S        +Y F         PQTDGQTER NQ +E  +RC C+  P+ W   I   +  YN   H++T+  PFEIV+   P L 
Subjt:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-

Query:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E
           +   S KT  N  E +   + + E L                   EF+P       R ++    +S KLAP F GP+ ++++ G   Y L LP   +
Subjt:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E

Query:  AVIHNVFHIFQLK
         +  + FH+  L+
Subjt:  AVIHNVFHIFQLK

P0CT36 Transposon Tf2-3 polyprotein1.8e-3331.31Show/hide
Query:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--
        WKG++  ++ YV+ C  CQ NK    KP G L      E+  E  SMDFI  LP++ G N + VVVDR SK +  +P     T ++ A +F  R+I    
Subjt:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--

Query:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-
             I+  ++   S        +Y F         PQTDGQTER NQ +E  +RC C+  P+ W   I   +  YN   H++T+  PFEIV+   P L 
Subjt:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-

Query:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E
           +   S KT  N  E +   + + E L                   EF+P       R ++    +S KLAP F GP+ ++++ G   Y L LP   +
Subjt:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E

Query:  AVIHNVFHIFQLK
         +  + FH+  L+
Subjt:  AVIHNVFHIFQLK

P0CT41 Transposon Tf2-12 polyprotein1.8e-3331.31Show/hide
Query:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--
        WKG++  ++ YV+ C  CQ NK    KP G L      E+  E  SMDFI  LP++ G N + VVVDR SK +  +P     T ++ A +F  R+I    
Subjt:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--

Query:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-
             I+  ++   S        +Y F         PQTDGQTER NQ +E  +RC C+  P+ W   I   +  YN   H++T+  PFEIV+   P L 
Subjt:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-

Query:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E
           +   S KT  N  E +   + + E L                   EF+P       R ++    +S KLAP F GP+ ++++ G   Y L LP   +
Subjt:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E

Query:  AVIHNVFHIFQLK
         +  + FH+  L+
Subjt:  AVIHNVFHIFQLK

Q9UR07 Transposon Tf2-11 polyprotein1.8e-3331.31Show/hide
Query:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--
        WKG++  ++ YV+ C  CQ NK    KP G L      E+  E  SMDFI  LP++ G N + VVVDR SK +  +P     T ++ A +F  R+I    
Subjt:  WKGMKADVKRYVKQCEICQRNKYEATKPAGVL-----PEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKI--

Query:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-
             I+  ++   S        +Y F         PQTDGQTER NQ +E  +RC C+  P+ W   I   +  YN   H++T+  PFEIV+   P L 
Subjt:  RSSSAISGKNY---SPHGDSIEEEYCF--------SPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPL-

Query:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E
           +   S KT  N  E +   + + E L                   EF+P       R ++    +S KLAP F GP+ ++++ G   Y L LP   +
Subjt:  ---ISYGSKKTSNNEVEAMLKERDLAECLE-----------------GEFEP------YRQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPP--E

Query:  AVIHNVFHIFQLK
         +  + FH+  L+
Subjt:  AVIHNVFHIFQLK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGGGAATTAGATTGGAAAGGAATGAAAGCAGATGTCAAGAGATATGTCAAGCAGTGTGAGATCTGCCAGAGAAATAAATACGAGGCAACTAAACCAGCAGGGGT
TCTTCCGGAGAAAATCCTCGAAGAATGGTCCATGGATTTCATTGAGGGGCTGCCCAAAGCAAGAGGAATCAATGTAATCATGGTAGTAGTTGATAGACTCAGCAAGTACT
CTTACTTCATCCCTCTGAAACATCCATTCACAACCAAAAGAGTAGCGGAAGTGTTCATTGATAGAATCATCAAGATAAGATCTTCATCAGCAATTTCTGGAAAGAATTAT
TCACCACATGGGGACAGTATTGAAGAGGAGTACTGCTTTTCACCCCAAACCGATGGGCAGACCGAAAGAGTCAACCAGTGTTTAGAAACATATATGAGATGTTTCTGTAA
TGAACAACCCCATAAATGGGATAAGTTCATTCCGTGGGCAGAGTTGTGGTATAACACCACCTTCCATGCCTCTACGAGAAGCAATCCATTTGAGATAGTGTATGGAAGAC
AACCACCTCTAATTTCCTATGGTAGTAAGAAGACATCGAACAATGAGGTGGAGGCCATGTTGAAAGAAAGAGACTTAGCAGAATGCCTTGAAGGAGAATTTGAGCCTTAT
CGGCAGCGATCTTTAGCACGCAAGAGGAGTGAAAAATTAGCTCCGAAGTTCTATGGTCCCTACAAGATAATTGAGGAAATGGGTGAAGTAGCCTATAGACTGCAACTTCC
CCCAGAGGCTGTGATACATAATGTTTTCCATATTTTTCAACTGAAACTCAAGTTAGGCAGTCAGCAAGTGGCCCAGCAGCAACAGCCCATACTAACCAAAGAATTTGAGC
TCCAACTATGGTCAGAAACTATGCTGGGAGTTCATTGGAACAAAGACTTGGGAGGAAATGAATGGTTGGTGAAGTGGAAAGATCTTCCAGAGACTGAAGCAACTTGGGAA
GTTATGGAGATAGGCTATTGTAGCCGCCAGGGGTCTCTTTTTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGGGAATTAGATTGGAAAGGAATGAAAGCAGATGTCAAGAGATATGTCAAGCAGTGTGAGATCTGCCAGAGAAATAAATACGAGGCAACTAAACCAGCAGGGGT
TCTTCCGGAGAAAATCCTCGAAGAATGGTCCATGGATTTCATTGAGGGGCTGCCCAAAGCAAGAGGAATCAATGTAATCATGGTAGTAGTTGATAGACTCAGCAAGTACT
CTTACTTCATCCCTCTGAAACATCCATTCACAACCAAAAGAGTAGCGGAAGTGTTCATTGATAGAATCATCAAGATAAGATCTTCATCAGCAATTTCTGGAAAGAATTAT
TCACCACATGGGGACAGTATTGAAGAGGAGTACTGCTTTTCACCCCAAACCGATGGGCAGACCGAAAGAGTCAACCAGTGTTTAGAAACATATATGAGATGTTTCTGTAA
TGAACAACCCCATAAATGGGATAAGTTCATTCCGTGGGCAGAGTTGTGGTATAACACCACCTTCCATGCCTCTACGAGAAGCAATCCATTTGAGATAGTGTATGGAAGAC
AACCACCTCTAATTTCCTATGGTAGTAAGAAGACATCGAACAATGAGGTGGAGGCCATGTTGAAAGAAAGAGACTTAGCAGAATGCCTTGAAGGAGAATTTGAGCCTTAT
CGGCAGCGATCTTTAGCACGCAAGAGGAGTGAAAAATTAGCTCCGAAGTTCTATGGTCCCTACAAGATAATTGAGGAAATGGGTGAAGTAGCCTATAGACTGCAACTTCC
CCCAGAGGCTGTGATACATAATGTTTTCCATATTTTTCAACTGAAACTCAAGTTAGGCAGTCAGCAAGTGGCCCAGCAGCAACAGCCCATACTAACCAAAGAATTTGAGC
TCCAACTATGGTCAGAAACTATGCTGGGAGTTCATTGGAACAAAGACTTGGGAGGAAATGAATGGTTGGTGAAGTGGAAAGATCTTCCAGAGACTGAAGCAACTTGGGAA
GTTATGGAGATAGGCTATTGTAGCCGCCAGGGGTCTCTTTTTCTGTAA
Protein sequenceShow/hide protein sequence
MSGELDWKGMKADVKRYVKQCEICQRNKYEATKPAGVLPEKILEEWSMDFIEGLPKARGINVIMVVVDRLSKYSYFIPLKHPFTTKRVAEVFIDRIIKIRSSSAISGKNY
SPHGDSIEEEYCFSPQTDGQTERVNQCLETYMRCFCNEQPHKWDKFIPWAELWYNTTFHASTRSNPFEIVYGRQPPLISYGSKKTSNNEVEAMLKERDLAECLEGEFEPY
RQRSLARKRSEKLAPKFYGPYKIIEEMGEVAYRLQLPPEAVIHNVFHIFQLKLKLGSQQVAQQQQPILTKEFELQLWSETMLGVHWNKDLGGNEWLVKWKDLPETEATWE
VMEIGYCSRQGSLFL