; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G12038 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G12038
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
Genome locationctg1820:337479..340115
RNA-Seq ExpressionCucsat.G12038
SyntenyCucsat.G12038
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049562.1 Retrotransposable element Tf2 [Cucumis melo var. makuwa]2.00e-7372.79Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+  PAGLLQP+PIP L+LEDW+MDF+EGLP AG  ++IMVVVDRLSK A+F+ L+HPFS KQVAE F++++I KHG+P  I  DRDK FLS+FWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGTSLKRSTAFHPQTDGQTERVNRCLETYL C+ NEQPTKWH  IP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

KAA0057186.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]7.50e-7373.47Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T PAG+LQP+PIP+ +LEDW++DF+EGLP AG +N IMVVVDRLSK AYFVT++HPFS KQVA  FIDK++R+HG+PK I  DRDK FLSNFWKE F 
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         M T LKRSTAFHPQTDGQTERVN+CLETYL C+ NEQP KWH FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]7.90e-7174.15Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T PAG+LQPIPIPE +LEDWSMDF+EGLP AG +N IMV+VDRLSK +YF+T+RHPF+ +QVAEVFID+V+ +HG+PK I  DRDK F+SNFWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGT LKRSTAFHPQTDGQTERVNRC+ETYL C+ NEQPTKW+ FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

TYK09687.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.37e-7072.11Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T PAG+LQP+PIP  +L+DW+MDF+EGLP AG +N IMVVVDR SK AYFVT++HPFS KQVA  FIDK++ +HG+PK I  DRDK FLSNFWKE F 
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         M T LKRSTAFHPQTDGQTERVN+CLETYL C+ NEQP KWH FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

TYK29995.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cucumis melo var. makuwa]2.29e-7070.07Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T P G+LQP+PIP  +L+DW+MDF+EGLP A  VN IMVVVDRL+K AYF+TL+HPFS KQVA  FIDKV+RKH +PK I  DRDK FL NFWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGT LKRST FHPQTDGQTE+VN+CLETYL C+ NEQ  KW  FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

TrEMBL top hitse value%identityAlignment
A0A5A7U1C1 Retrotransposable element Tf29.67e-7472.79Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+  PAGLLQP+PIP L+LEDW+MDF+EGLP AG  ++IMVVVDRLSK A+F+ L+HPFS KQVAE F++++I KHG+P  I  DRDK FLS+FWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGTSLKRSTAFHPQTDGQTERVNRCLETYL C+ NEQPTKWH  IP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

A0A5A7UCA3 Retrovirus-related Pol polyprotein from transposon 297 family1.11e-7070.07Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T P G+LQP+PIP  +L+DW+MDF+EGLP A  VN IMVVVDRL+K AYF+TL+HPFS KQVA  FIDKV+RKH +PK I  DRDK FL NFWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGT LKRST FHPQTDGQTE+VN+CLETYL C+ NEQ  KW  FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

A0A5A7USP1 Transposon Ty3-G Gag-Pol polyprotein3.63e-7373.47Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T PAG+LQP+PIP+ +LEDW++DF+EGLP AG +N IMVVVDRLSK AYFVT++HPFS KQVA  FIDK++R+HG+PK I  DRDK FLSNFWKE F 
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         M T LKRSTAFHPQTDGQTERVN+CLETYL C+ NEQP KWH FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

A0A5D3CCG2 Transposon Tf2-1 polyprotein isoform X16.64e-7172.11Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T PAG+LQP+PIP  +L+DW+MDF+EGLP AG +N IMVVVDR SK AYFVT++HPFS KQVA  FIDK++ +HG+PK I  DRDK FLSNFWKE F 
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         M T LKRSTAFHPQTDGQTERVN+CLETYL C+ NEQP KWH FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

A0A5D3E3X6 Retrovirus-related Pol polyprotein from transposon 297 family1.11e-7070.07Show/hide
Query:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS
        E+T P G+LQP+PIP  +L+DW+MDF+EGLP A  VN IMVVVDRL+K AYF+TL+HPFS KQVA  FIDKV+RKH +PK I  DRDK FL NFWKE F+
Subjt:  ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFS

Query:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
         MGT LKRST FHPQTDGQTE+VN+CLETYL C+ NEQ  KW  FIP
Subjt:  PMGTSLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.3e-2142.03Show/hide
Query:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT
        P G LQPIP  E   E  SMDF+  LP +   NA+ VVVDR SK A  V      + +Q A +F  +VI   G PK I  D D  F S  WK+       
Subjt:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT

Query:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW
         +K S  + PQTDGQTER N+ +E  L C  +  P  W
Subjt:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW

P0CT41 Transposon Tf2-12 polyprotein7.3e-2142.03Show/hide
Query:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT
        P G LQPIP  E   E  SMDF+  LP +   NA+ VVVDR SK A  V      + +Q A +F  +VI   G PK I  D D  F S  WK+       
Subjt:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT

Query:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW
         +K S  + PQTDGQTER N+ +E  L C  +  P  W
Subjt:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.3e-2240.85Show/hide
Query:  GLLQPIPIPELMLEDWSMDFVEGL-PTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGTS
        GLLQP+PI E    D SMDFV GL PT+ N+N I+VVVDR SK A+F+  R      Q+ ++    +   HG P+ I  DRD +  ++ ++E    +G  
Subjt:  GLLQPIPIPELMLEDWSMDFVEGL-PTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGTS

Query:  LKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
           S+A HPQTDGQ+ER  + L   L  Y +     WH+++P
Subjt:  LKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.0e-2240.85Show/hide
Query:  GLLQPIPIPELMLEDWSMDFVEGL-PTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGTS
        GLLQP+PI E    D SMDFV GL PT+ N+N I+VVVDR SK A+F+  R      Q+ ++    +   HG P+ I  DRD +  ++ ++E    +G  
Subjt:  GLLQPIPIPELMLEDWSMDFVEGL-PTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGTS

Query:  LKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP
           S+A HPQTDGQ+ER  + L   L  Y +     WH+++P
Subjt:  LKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP

Q9UR07 Transposon Tf2-11 polyprotein7.3e-2142.03Show/hide
Query:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT
        P G LQPIP  E   E  SMDF+  LP +   NA+ VVVDR SK A  V      + +Q A +F  +VI   G PK I  D D  F S  WK+       
Subjt:  PAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPK-IHHDRDKKFLSNFWKEHFSPMGT

Query:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW
         +K S  + PQTDGQTER N+ +E  L C  +  P  W
Subjt:  SLKRSTAFHPQTDGQTERVNRCLETYLTCYYNEQPTKW

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAATCAACCTTACCAGCAGGGTTACTTCAACCTATTCCTATACCAGAACTCATGCTTGAAGATTGGTCCATGGATTTTGTGGAAGGACTACCAACGGCTGGAAATGTGAA
TGCAATAATGGTTGTGGTGGATAGGCTAAGCAAATGTGCCTACTTTGTGACATTGAGACACCCCTTCTCGGTGAAACAAGTTGCTGAAGTCTTTATTGATAAAGTGATAA
GGAAACATGGGGTACCAAAAATCCATCATGACCGAGACAAAAAATTTCTAAGTAACTTCTGGAAGGAACATTTTTCCCCTATGGGCACCTCCTTGAAGAGAAGCACAGCT
TTCCATCCCCAAACAGATGGACAAACTGAGAGAGTTAACCGTTGCCTTGAGACTTACCTAACGTGTTACTACAATGAACAACCCACAAAATGGCACATGTTTATTCCGTA
G
mRNA sequenceShow/hide mRNA sequence
GAATCAACCTTACCAGCAGGGTTACTTCAACCTATTCCTATACCAGAACTCATGCTTGAAGATTGGTCCATGGATTTTGTGGAAGGACTACCAACGGCTGGAAATGTGAA
TGCAATAATGGTTGTGGTGGATAGGCTAAGCAAATGTGCCTACTTTGTGACATTGAGACACCCCTTCTCGGTGAAACAAGTTGCTGAAGTCTTTATTGATAAAGTGATAA
GGAAACATGGGGTACCAAAAATCCATCATGACCGAGACAAAAAATTTCTAAGTAACTTCTGGAAGGAACATTTTTCCCCTATGGGCACCTCCTTGAAGAGAAGCACAGCT
TTCCATCCCCAAACAGATGGACAAACTGAGAGAGTTAACCGTTGCCTTGAGACTTACCTAACGTGTTACTACAATGAACAACCCACAAAATGGCACATGTTTATTCCGTA
G
Protein sequenceShow/hide protein sequence
ESTLPAGLLQPIPIPELMLEDWSMDFVEGLPTAGNVNAIMVVVDRLSKCAYFVTLRHPFSVKQVAEVFIDKVIRKHGVPKIHHDRDKKFLSNFWKEHFSPMGTSLKRSTA
FHPQTDGQTERVNRCLETYLTCYYNEQPTKWHMFIP