CuGenDBv2

CsGy3G022845 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G022845
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Reverse transcriptase
Genome location: Gy14Chr3:21650470..21652072
RNA-Seq Expression: CsGy3G022845
Synteny: CsGy3G022845
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
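The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field can be split and the locus length computed as follows. The names `Locus` and `parse_locus` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as 'Gy14Chr3:21650470..21652072'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(seqid, start, end)


locus = parse_locus("Gy14Chr3:21650470..21652072")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the locus spans 1603 bp.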


Homology
GenBank top hits (e-value, % identity, alignment)
CAA7016811.1 unnamed protein product [Microthlaspi erraticum] (e-value: 2.05e-143; identity: 64.67%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARW+SFLQ+F F+I+H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        I++ H GGL+GH G++KT+    +RYYWP +R+D+   VKRC ICQ +KG S N GLY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF+ACKKT DA  IA LFF+EVVRLHGVPK+I+SDRD KFLSHFW TLW+ F TTLK S+TAHPQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q E
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

KAA0033030.1 serine/threonine-protein kinase TIO-like [Cucumis melo var. makuwa] (e-value: 2.13e-152; identity: 81.97%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARWISFLQRFDF IKH++GKENKVADALSRK SLLT+LS EI+AFKHLP LYEED DF DIWYKCSN+L  DDYHIV+G+LFKGEQLCIPHTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +KEAHS GLAGHFGQ+KT EI SKRYYWPQ+R+DS NFVK CP+CQR KG   NAGLYSPLPIPTSIWEDLS+DFV+GLPKTQR +DS++V+VDRFS+MT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHF
        HF+ACKKT DAIYIANLFF+E+V LHGVPK+IVS+RDVKFLSHF
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHF

KAA0063034.1 serine/threonine-protein kinase TIO-like [Cucumis melo var. makuwa] (e-value: 3.51e-169; identity: 74.33%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARWISFLQRFDFVIKHQ GKENKVADALSRK SLLT+LS EI AFKHLP LYE+D+DF + W KCSNF+ A+D+HI+EG+LFK +QLCI HTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +KEAHSG L GHFGQ+KT E  SKRYYWPQ+R+D NNFV+RCP CQR KG+STNAGLYSPLP      +   +   +  PK + + DS+MV+VDRFSKMT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF++CKK NDAIYIANLFF+E+VRLHGV K+IVSDRDVKFLSHFW+TLW+K DTTLK+ +TAHPQTD QTEVTN+TLGNL+RCLSG+KPKQWDL LAQAE
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

KAE8652794.1 hypothetical protein Csa_022828 [Cucumis sativus] (e-value: 3.84e-159; identity: 71.57%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARW+ FLQRFDFVIKH SGK NKVADALSRKG LLT L S+IIAF HL  LY  DIDF+ IW  CSN     DYHIV  +LFKG+ LC+PHTSLREA+
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        IKE HS GLAGHFG++KTL     +++WPQ+ ++  NF+KRC ICQ  KG+S N GLY+PLPI ++IWEDLS+DFV+GLP+TQR  DSI V+V+RFSKM 
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQA
        HF+ CKKT+DA+ IANLFF+E+VRLHG+PK+IVSDRDVKFLSHFWR+L KKFDT L FST +HPQTDGQTEVTNRTLGNL+RCLSG KPKQWDLAL QA
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQA

PNX90096.1 kanadaptin-like protein, partial [Trifolium pratense] (e-value: 3.31e-144; identity: 63.67%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARW+SFLQ+F F+I+H+SG  NKVADALSR+ SLL  L+ E++ F+ L  LYE D++FK++W KC      DD+H+ EG+LFKG +LCIP +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +++ H GGL+GH G++KT+    +R+YWP +RKD    VK+C  CQ +KG S N GLY PLPIP  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKM+
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF+ CKKT DA  IANLFF+EVVRLHGVPKSI SD D KFLSHFW TLWK FDT L  S+ AHPQTDGQTEVTN+TLGN++RC+ G KPKQWDLAL Q E
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

TrEMBL top hits (e-value, % identity, alignment)
A0A2K3MH35 Kanadaptin-like protein (Fragment) (e-value: 1.60e-144; identity: 63.67%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARW+SFLQ+F F+I+H+SG  NKVADALSR+ SLL  L+ E++ F+ L  LYE D++FK++W KC      DD+H+ EG+LFKG +LCIP +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +++ H GGL+GH G++KT+    +R+YWP +RKD    VK+C  CQ +KG S N GLY PLPIP  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKM+
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF+ CKKT DA  IANLFF+EVVRLHGVPKSI SD D KFLSHFW TLWK FDT L  S+ AHPQTDGQTEVTN+TLGN++RC+ G KPKQWDLAL Q E
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

A0A5A7V732 Serine/threonine-protein kinase TIO-like (e-value: 1.70e-169; identity: 74.33%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARWISFLQRFDFVIKHQ GKENKVADALSRK SLLT+LS EI AFKHLP LYE+D+DF + W KCSNF+ A+D+HI+EG+LFK +QLCI HTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +KEAHSG L GHFGQ+KT E  SKRYYWPQ+R+D NNFV+RCP CQR KG+STNAGLYSPLP      +   +   +  PK + + DS+MV+VDRFSKMT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF++CKK NDAIYIANLFF+E+VRLHGV K+IVSDRDVKFLSHFW+TLW+K DTTLK+ +TAHPQTD QTEVTN+TLGNL+RCLSG+KPKQWDL LAQAE
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

A0A5B7BER3 Uncharacterized protein (e-value: 1.30e-143; identity: 65%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MH RWI+FLQRF FV+KH++G++NKVADALSR+ +LL V+SSEI +F+ L +LY+ED DF+  W KC     + ++HI +GYLFKG QLCIP TSLRE +
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +++ HSGGL GH G++KT+ +  +RYYWPQ+++D   FV++CPICQ  KG + N GLY+PLP+P  IWEDL++DF++GLP+TQR  DS+ V+VDRFSKM 
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF+ CKKT+DA ++ANLFF+E+VRLHGVPKSI SDRDVKFLSHFWRTLW+KFDT+L++S+TAHPQTDGQTEVTNRTLGNL+RC SG +PKQWD+ L Q E
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

A0A5D3BWJ1 Serine/threonine-protein kinase TIO-like (e-value: 1.03e-152; identity: 81.97%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARWISFLQRFDF IKH++GKENKVADALSRK SLLT+LS EI+AFKHLP LYEED DF DIWYKCSN+L  DDYHIV+G+LFKGEQLCIPHTSLREAL
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        +KEAHS GLAGHFGQ+KT EI SKRYYWPQ+R+DS NFVK CP+CQR KG   NAGLYSPLPIPTSIWEDLS+DFV+GLPKTQR +DS++V+VDRFS+MT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHF
        HF+ACKKT DAIYIANLFF+E+V LHGVPK+IVS+RDVKFLSHF
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHF

A0A6D2HS27 Uncharacterized protein (e-value: 9.94e-144; identity: 64.67%)
Query:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL
        MHARW+SFLQ+F F+I+H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE L
Subjt:  MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREAL

Query:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT
        I++ H GGL+GH G++KT+    +RYYWP +R+D+   VKRC ICQ +KG S N GLY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMT
Subjt:  IKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMT

Query:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE
        HF+ACKKT DA  IA LFF+EVVRLHGVPK+I+SDRD KFLSHFW TLW+ F TTLK S+TAHPQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q E
Subjt:  HFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAE

SwissProt top hits (e-value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e-value: 1.7e-40; identity: 36.13%)
Query:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------
        ARW  FLQ F+F I ++ G  N +ADALSR       +   SE  +   +  +   D DFK+          K  N L+ +D  + E    K        
Subjt:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------

Query:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT
        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP++
Subjt:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT

Query:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR
           ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  LLR
Subjt:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR

Query:  CLSGSKPKQW
        C+  + P  W
Subjt:  CLSGSKPKQW

P0CT41 Transposon Tf2-12 polyprotein (e-value: 1.7e-40; identity: 36.13%)
Query:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------
        ARW  FLQ F+F I ++ G  N +ADALSR       +   SE  +   +  +   D DFK+          K  N L+ +D  + E    K        
Subjt:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------

Query:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT
        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP++
Subjt:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT

Query:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR
           ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  LLR
Subjt:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR

Query:  CLSGSKPKQW
        C+  + P  W
Subjt:  CLSGSKPKQW

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e-value: 8.4e-43; identity: 32.2%)
Query:  RWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEII-----------------AFKHLPDLYEEDIDFKDI-----WYKCSNFLDA--DDYHIV
        RW+  L  +DF +++ +G +N VADA+SR    +T  +S  I                    H+ +L + ++  +D+     + K     +    +Y + 
Subjt:  RWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEII-----------------AFKHLPDLYEEDIDFKDI-----WYKCSNFLDA--DDYHIV

Query:  EGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFVI
        +  ++  ++L +P    + A+++  H   L  GHFG   TL   S  YYWP+++     +++ C  CQ  K       GL  PLPI    W D+S+DFV 
Subjt:  EGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFVI

Query:  GLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTL
        GLP T    + I+V+VDRFSK  HF+A +KT DA  + +L F+ +   HG P++I SDRDV+  +  ++ L K+       S+  HPQTDGQ+E T +TL
Subjt:  GLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTL

Query:  GNLLRCLSGSKPKQWDLALAQAE
          LLR    +  + W + L Q E
Subjt:  GNLLRCLSGSKPKQWDLALAQAE

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e-value: 2.9e-43; identity: 32.2%)
Query:  RWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEII-----------------AFKHLPDLYEEDIDFKDI-----WYKCSNFLDA--DDYHIV
        RW+  L  +DF +++ +G +N VADA+SR    +T  +S  I                    H+ +L + ++  +D+     + K     +    +Y + 
Subjt:  RWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEII-----------------AFKHLPDLYEEDIDFKDI-----WYKCSNFLDA--DDYHIV

Query:  EGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFVI
        +  ++  ++L +P    + A+++  H   L  GHFG   TL   S  YYWP+++     +++ C  CQ  K       GL  PLPI    W D+S+DFV 
Subjt:  EGYLFKGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNA-GLYSPLPIPTSIWEDLSIDFVI

Query:  GLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTL
        GLP T    + I+V+VDRFSK  HF+A +KT DA  + +L F+ +   HG P++I SDRDV+  +  ++ L K+       S+  HPQTDGQ+E T +TL
Subjt:  GLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTL

Query:  GNLLRCLSGSKPKQWDLALAQAE
          LLR  + +  + W + L Q E
Subjt:  GNLLRCLSGSKPKQWDLALAQAE

Q9UR07 Transposon Tf2-11 polyprotein (e-value: 1.7e-40; identity: 36.13%)
Query:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------
        ARW  FLQ F+F I ++ G  N +ADALSR       +   SE  +   +  +   D DFK+          K  N L+ +D  + E    K        
Subjt:  ARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVL--SSEIIAFKHLPDLYEEDIDFKDIWY-------KCSNFLDADDYHIVEGYLFKG-------

Query:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT
        +Q+ +P+ T L   +IK+ H  G   H G      I  +R+ W  IRK    +V+ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP++
Subjt:  EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPL-PIPTS--IWEDLSIDFVIGLPKT

Query:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR
           ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  LLR
Subjt:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLR

Query:  CLSGSKPKQW
        C+  + P  W
Subjt:  CLSGSKPKQW

Arabidopsis top hits (e-value, % identity, alignment)
No hits found
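In the alignments above, the line between each Query and Subjt row appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch under that assumption, identities and positives can be counted per block as shown below. The function name `midline_stats` is hypothetical, and the example uses only the final 10-column block of the Tf2-1 alignment (Query `CLSGSKPKQW`), so its percentage is not comparable to the full-length % identity reported for that hit.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes BLAST-style midlines: letters are identities, '+' marks a
    conservative substitution, anything else is a mismatch or gap.
    """
    # Trailing blanks are often stripped from midlines, so pad to the query width.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


query = "CLSGSKPKQW"
midline = "C+  + P  W"
ident, pos, cols = midline_stats(query, midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.1f}%), {pos}/{cols} positive")
```

Summing these counts over every block of a hit gives the identity of the displayed region only; the page may show a truncated alignment.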

Sequences
CDS sequence
ATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACTTCGTGATCAAACACCAATCAGGCAAAGAGAACAAGGTGGCCGATGCTCTAAGCAGAAAAGGCTCCCTACT
CACAGTACTGTCTTCGGAAATCATAGCATTCAAACATTTACCCGACTTATACGAAGAAGATATTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTTAGACGCTG
ATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGAACAGTTATGCATCCCGCACACCTCACTACGTGAAGCCTTGATAAAGGAAGCACATTCTGGAGGGCTAGCT
GGACATTTCGGACAGAATAAGACATTGGAGATCACTTCCAAACGATACTACTGGCCGCAAATAAGAAAAGACTCCAATAATTTCGTAAAAAGATGCCCCATCTGCCAAAG
AACCAAAGGCTCCAGCACGAATGCAGGATTATACTCGCCACTACCCATCCCGACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAAAACACAAA
GACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAGCAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCTTCTTTAAA
GAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTATCAGACAGAGATGTCAAGTTCCTGAGTCACTTTTGGCGAACACTGTGGAAGAAGTTTGACACAACACTGAA
ATTCAGCACCACAGCCCACCCACAGACAGATGGACAAACTGAAGTAACAAACAGGACCCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACAATGGGATC
TAGCATTGGCTCAAGCTGAATTGCCTTCAATAATATGA
mRNA sequence
CCCTATCCTTAAATTACTAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCATGCTGTACAGGGATTGGAGCTGTCCTAGTACAACAAGGACATCCTATCGAATACT
TCAGTGAAAAACTCAGCACCGCAAGACAGACCTGAAGCACATACGAACAAGAGCTGTATGCCCTCGTCCGAGCACTAAAACAATGGGAACACTACCTGATCTCTAAAGAA
TTTGTACTCCTAACTGACCATTTCTCACTAAAATACCTTCAAGCTCAAAAGAATATCAGTAGGATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACTTCGTGAT
CAAACACCAATCAGGCAAAGAGAACAAGGTGGCCGATGCTCTAAGCAGAAAAGGCTCCCTACTCACAGTACTGTCTTCGGAAATCATAGCATTCAAACATTTACCCGACT
TATACGAAGAAGATATTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTTAGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGAACAGTTA
TGCATCCCGCACACCTCACTACGTGAAGCCTTGATAAAGGAAGCACATTCTGGAGGGCTAGCTGGACATTTCGGACAGAATAAGACATTGGAGATCACTTCCAAACGATA
CTACTGGCCGCAAATAAGAAAAGACTCCAATAATTTCGTAAAAAGATGCCCCATCTGCCAAAGAACCAAAGGCTCCAGCACGAATGCAGGATTATACTCGCCACTACCCA
TCCCGACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAAAACACAAAGACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAGCAAAATG
ACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCTTCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTATCAGACAG
AGATGTCAAGTTCCTGAGTCACTTTTGGCGAACACTGTGGAAGAAGTTTGACACAACACTGAAATTCAGCACCACAGCCCACCCACAGACAGATGGACAAACTGAAGTAA
CAAACAGGACCCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACAATGGGATCTAGCATTGGCTCAAGCTGAATTGCCTTCAATAATATGAAGAACAGAT
CAACAGGAAAGTCCCCCTTCGAAGTAGTTTATACCAAACTACCACGATTAACCTTTGATCTCACTACACTCCCCACAACCGTGGATCTCAACAACGAAGCAGAATGCATG
GCAGAAAATATAAAAAAACTACACAAGGAAGTCCATGATCATCTTATACAGACAACAGACTCCTACAAAAAGGCAGCAGATAAAAAAAAGAAGACAAGCCCACTTCAATA
AAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACTGGCACCTACAACAAGCTGAAAGACAGACAAATTGGGTCATTCCCTATATTAGAGAAATACGGA
GATAATGCCTTCAAGATCGATCTACCACTACACATACACCCAGTCTTCAATGTTGCTGACCTA
Protein sequence
MHARWISFLQRFDFVIKHQSGKENKVADALSRKGSLLTVLSSEIIAFKHLPDLYEEDIDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLA
GHFGQNKTLEITSKRYYWPQIRKDSNNFVKRCPICQRTKGSSTNAGLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFK
EVVRLHGVPKSIVSDRDVKFLSHFWRTLWKKFDTTLKFSTTAHPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAELPSII
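The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and uses a hypothetical helper named `translate`. Only the first and last 21 nucleotides of the CDS are translated here; the full sequence can be passed in the same way after joining its lines.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each position. Use of this table is an assumption about the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon.

    Codons with ambiguous bases are reported as 'X'; a trailing partial
    codon is ignored.
    """
    seq = "".join(cds.split()).upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


cds_start = "ATGCACGCACGCTGGATATCC"  # first 21 nt of the CDS above
cds_end = "GAATTGCCTTCAATAATATGA"    # last 21 nt of the CDS above
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MHARWIS` and `ELPSII*`, which agree with the start and end of the protein sequence listed above.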