; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G3771 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G3771
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationctg105:1058850..1074413
RNA-Seq ExpressionCucsat.G3771
SyntenyCucsat.G3771
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016311 - dephosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003993 - acid phosphatase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
GO:0052745 - inositol phosphate phosphatase activity (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia]6.78e-7058.5Show/hide
Query:  YLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKE
        Y NPYFLHH+DNT+LVLV++ LT ENY SWSR+M I L+VKNK+GFVDG+I RPTGDLL  WI  NN+VISWILNS+SK ISA+ILFSD AR IW++LKE
Subjt:  YLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKE

Query:  RFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFLWDSMRIFSRHKLNSYLWILFHLQAEP
        RF+K+N PRIFQL+R L+ L Q+Q S+  YFT  KTL+ ELN+Y P+C  G C+C G+KE+  F Q E+++ FL      FS+ ++   L     ++ EP
Subjt:  RFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFLWDSMRIFSRHKLNSYLWILFHLQAEP

XP_022154608.1 uncharacterized protein LOC111021831 [Momordica charantia]5.34e-6968Show/hide
Query:  LNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKER
        LNPY+LHH DNT LVLVT+ LTEENY SWSR+M I LS+KNK+GF+DG+I+RP G+LLP WI NN++VI+WILNSVSK IS++ILFS+ AR IW++LKER
Subjt:  LNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKER

Query:  FQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCG
        F+K N PRIFQLKR LA L QNQ S+ +YFTK K ++DEL  YRP C+C 
Subjt:  FQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCG

XP_031736904.1 uncharacterized protein LOC105434586 isoform X1 [Cucumis sativus]1.59e-10891.3Show/hide
Query:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
        MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
Subjt:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI

Query:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA
        RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIG   T     +   + Y+P+
Subjt:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA

XP_031736905.1 uncharacterized protein LOC105434586 isoform X2 [Cucumis sativus]4.42e-10991.3Show/hide
Query:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
        MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
Subjt:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI

Query:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA
        RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIG   T     +   + Y+P+
Subjt:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA

XP_031736906.1 uncharacterized protein LOC105434586 isoform X3 [Cucumis sativus]4.27e-10991.3Show/hide
Query:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
        MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI
Subjt:  MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWI

Query:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA
        RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIG   T     +   + Y+P+
Subjt:  RNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPA

TrEMBL top hitse value%identityAlignment
A0A5J5B2C5 Uncharacterized protein8.05e-6760Show/hide
Query:  NPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTG---DLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELK
        NPY+LHH+++   VLV++QLT ENY +WSRAM I LSVKNK+GFVDG I  P G   +LL  WIRNNNIVISWILNS+SK ISA+I+F+  AR IW++L+
Subjt:  NPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTG---DLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELK

Query:  ERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL
        +RFQ++N PRIFQLKR L  L Q Q S+ +YFTK KT+++EL+ YRP C+CG C C G+K + D+ Q EY++ FL
Subjt:  ERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL

A0A5J5BKC2 Uncharacterized protein9.33e-6760.57Show/hide
Query:  NPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTG---DLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELK
        NPY+LHH+D+   +LV++QLT ENY +WSRAM I LSVKNK+GFVDG+I  P G   +LL  WIRNNNIVISWILNSVSK ISA+I+F+  AR IW++L+
Subjt:  NPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTG---DLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELK

Query:  ERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL
        +RFQ++N PRIFQLKR L  L Q Q S+ +YFTK KT+++EL+ YR  C+CG C+C G+K + D  QMEY++ FL
Subjt:  ERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL

A0A6J1DIP8 uncharacterized protein LOC1110203993.28e-7058.5Show/hide
Query:  YLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKE
        Y NPYFLHH+DNT+LVLV++ LT ENY SWSR+M I L+VKNK+GFVDG+I RPTGDLL  WI  NN+VISWILNS+SK ISA+ILFSD AR IW++LKE
Subjt:  YLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKE

Query:  RFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFLWDSMRIFSRHKLNSYLWILFHLQAEP
        RF+K+N PRIFQL+R L+ L Q+Q S+  YFT  KTL+ ELN+Y P+C  G C+C G+KE+  F Q E+++ FL      FS+ ++   L     ++ EP
Subjt:  RFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFLWDSMRIFSRHKLNSYLWILFHLQAEP

A0A6J1DKR8 uncharacterized protein LOC1110218312.59e-6968Show/hide
Query:  LNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKER
        LNPY+LHH DNT LVLVT+ LTEENY SWSR+M I LS+KNK+GF+DG+I+RP G+LLP WI NN++VI+WILNSVSK IS++ILFS+ AR IW++LKER
Subjt:  LNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKER

Query:  FQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCG
        F+K N PRIFQLKR LA L QNQ S+ +YFTK K ++DEL  YRP C+C 
Subjt:  FQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCG

A0A6J1DLQ9 uncharacterized protein LOC1110221173.24e-6858.33Show/hide
Query:  LHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKN
        +HHND +NLVLV++ LT  NYVSWSR+MTI LS+KNK+GF++G++ +P GDLLPVWIRN ++VI+W LNSVSKPISA+++F++    IW++LK+RFQ +N
Subjt:  LHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKN

Query:  APRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL
         P+IFQL+R LATL+Q+Q S+ MY+TK K L+DE  +YRP C CG+C+C G + +  F+Q E+L+ FL
Subjt:  APRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.6e-2330.69Show/hide
Query:  SAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPT--GDLLPVWIRNNNIVISWILNSVSKPISANILF
        S   + D +  Y  P  +HH  + ++  +++   E+NYV+W       L V  K GF+DGT+ +P     L   W + N +V+ W++NS++  +  ++++
Subjt:  SAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPT--GDLLPVWIRNNNIVISWILNSVSKPISANILF

Query:  SDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYR--PACNCGTCTCDGLKEMADFLQMEYLVDFL
        ++ A  +W +L+  F      +I+QL+R LATL Q  DS+  YF K   ++ EL+ Y   P C CG C C+  K   +  + E   +FL
Subjt:  SDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYR--PACNCGTCTCDGLKEMADFLQMEYLVDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACTCCGACCATCAATGTGTCACACACCTCGCCGAAAAACCAAGAAAATCCCAATTCAAATCAGTCTGATACCTCCGCCCAAACCTCTTTCGATCAAAACCAAGG
ATATTTGAATCCTTATTTCCTTCATCACAACGATAATACAAACTTAGTACTTGTCACGGAGCAGTTGACTGAGGAGAATTACGTCTCTTGGAGCCGAGCAATGACCATTG
GACTCTCTGTGAAGAATAAGATTGGTTTCGTTGATGGAACTATCGCACGACCAACTGGAGATCTCCTTCCAGTCTGGATCAGAAATAACAATATTGTTATTTCTTGGATA
CTAAACTCAGTCTCCAAACCCATCTCAGCCAATATTCTCTTCTCAGATTTGGCAAGAACAATATGGGTAGAGCTCAAGGAAAGATTCCAAAAGAAGAATGCCCCAAGGAT
ATTTCAATTGAAACGATCCCTGGCAACACTATCACAAAACCAAGACTCCATTGGTATGTACTTTACTAAGTTCAAAACTTTGTTTGATGAATTAAATACATACAGACCAG
CCTGCAACTGTGGAACTTGTACTTGTGATGGCCTAAAAGAAATGGCAGACTTCCTTCAGATGGAGTATCTCGTGGATTTCTTATGGGACTCAATGAGAATTTTTTCCAGG
CACAAGCTCAACTCCTACTTATGGATCCTCTTCCATCTACAAGCCGAGCCTATTCTCTTCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGACTCCGACCATCAATGTGTCACACACCTCGCCGAAAAACCAAGAAAATCCCAATTCAAATCAGTCTGATACCTCCGCCCAAACCTCTTTCGATCAAAACCAAGG
ATATTTGAATCCTTATTTCCTTCATCACAACGATAATACAAACTTAGTACTTGTCACGGAGCAGTTGACTGAGGAGAATTACGTCTCTTGGAGCCGAGCAATGACCATTG
GACTCTCTGTGAAGAATAAGATTGGTTTCGTTGATGGAACTATCGCACGACCAACTGGAGATCTCCTTCCAGTCTGGATCAGAAATAACAATATTGTTATTTCTTGGATA
CTAAACTCAGTCTCCAAACCCATCTCAGCCAATATTCTCTTCTCAGATTTGGCAAGAACAATATGGGTAGAGCTCAAGGAAAGATTCCAAAAGAAGAATGCCCCAAGGAT
ATTTCAATTGAAACGATCCCTGGCAACACTATCACAAAACCAAGACTCCATTGGTATGTACTTTACTAAGTTCAAAACTTTGTTTGATGAATTAAATACATACAGACCAG
CCTGCAACTGTGGAACTTGTACTTGTGATGGCCTAAAAGAAATGGCAGACTTCCTTCAGATGGAGTATCTCGTGGATTTCTTATGGGACTCAATGAGAATTTTTTCCAGG
CACAAGCTCAACTCCTACTTATGGATCCTCTTCCATCTACAAGCCGAGCCTATTCTCTTCTCCTAA
Protein sequenceShow/hide protein sequence
MTTPTINVSHTSPKNQENPNSNQSDTSAQTSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIARPTGDLLPVWIRNNNIVISWI
LNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQLKRSLATLSQNQDSIGMYFTKFKTLFDELNTYRPACNCGTCTCDGLKEMADFLQMEYLVDFLWDSMRIFSR
HKLNSYLWILFHLQAEPILFS