CuGenDBv2

CcUC11G219720 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC11G219720
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CicolChr11:23656026..23657524
RNA-Seq Expression: CcUC11G219720
Synteny: CcUC11G219720
Gene Ontology terms:
GO:0006952 - defense response (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA
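
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses that string and computes the length of the genomic span. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


seqid, start, end = parse_location("CicolChr11:23656026..23657524")

# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # CicolChr11 23656026 23657524 1499
```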


Homology
GenBank top hits | e value | % identity
KAG8475622.1 hypothetical protein CXB51_032632 [Gossypium anomalum] | 2.5e-14 | 48.1
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM
        FCDSQSA+ L+ +  +H+RTKHIDV++H+VR+II +G++ + KIS   NP D+MT +L   KFE+  DL++++ V+  M
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM

KAG8487895.1 hypothetical protein CXB51_018354 [Gossypium anomalum] | 1.9e-14 | 52
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEV
        +CD QSA+ L+NN  YH RTKHIDV+FH+VREII +G++ L KI    NP D+MTN + + KFE+  +L+ + +V
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEV

KAG8498396.1 hypothetical protein CXB51_007029 [Gossypium anomalum] | 4.3e-14 | 48.1
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM
        FCDSQSA+ L  +  +H+RTKHIDV++H+VR+II +G++ + KIS   NP D+MT +L   KFE+  DL+ LE+ ++ +
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM

KAG8500107.1 hypothetical protein CXB51_003710 [Gossypium anomalum] | 3.3e-14 | 52
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEV
        +CDSQSA+ L+ N  YH RTKHIDV+FH+VREII +G++ L KI    NP D+MTN + + KFE+  +L+ + +V
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEV

XP_024006146.1 uncharacterized protein LOC112082836 [Eutrema salsugineum] | 2.5e-14 | 48.75
Query:  FTPCSAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKL
        F+  S +  CDSQSAL L+ N  YH+RTKH+  K+H++R+I+ +G++TL KI    NP D +T AL   KFE  CDLL +
Subjt:  FTPCSAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKL

TrEMBL top hits | e value | % identity
A0A2G3B6L9 NB-ARC domain-containing protein | 2.1e-14 | 50
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVI
        FCDSQSA+ L+ +  +H+RTKHIDV++H+VREII +G++ + KIS  +NP D+MT  L S KF++  DL+ L +++
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVI

A0A2R6RIX9 Endonuclease | 6.1e-14 | 47.37
Query:  EAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEE
        + +CDSQSA+ L+ N  +H RTKHIDV+FH++REI+ +G++ L KIS   NP D++TN ++  KF++  DL+ + E
Subjt:  EAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEE

A0A438JQU4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.7e-14 | 52.7
Query:  EAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKL
        E F DSQSA+ L NNP +HDRTKHIDV++H++RE I  G + L KIS + NP D+ T  L   KF+Y  DL+++
Subjt:  EAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKL

Q9M1F5 Copia-like polyprotein | 3.6e-14 | 50.65
Query:  SAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLE
        + E FCDSQSA+ LS N  +H+RTKHIDVKFH++REII  G++ + KIS   NP D+ T  L  +KF+   D L+++
Subjt:  SAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLE

Q9SH77 Putative retroelement pol polyprotein | 3.6e-14 | 51.95
Query:  SAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLE
        S E FCDSQSA+ LS N  +H+RTKHIDVK+H++REII  G + + KIS   NP D+ T  LA  KF+   +LL+++
Subjt:  SAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLE

SwissProt top hits | e value | % identity
P04146 Copia protein | 7.0e-07 | 36.07
Query:  DSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKF
        D+Q  ++++NNP+ H R KHID+K+H+ RE +Q   + L  I   +   D+ T  L + +F
Subjt:  DSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 8.5e-13 | 50
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE
        +CDSQSA+ LS N  YH RTKHIDV++H++RE++    L + KIS   NP D++T  +  +KFE
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.7e-05 | 32.81
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE
        +CD+  A  L  NP +H R KHI + +H++R  +Q G L +  +S      D +T  L+   F+
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.0e-05 | 32.81
Query:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE
        +CD+  A  L  NP +H R KHI + +H++R  +Q G L +  +S      D +T  L+   F+
Subjt:  FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFE

Arabidopsis top hits | e value | % identity
No hits found
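
The % identity values in the hit tables above can be cross-checked against the alignment midlines (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch, with identity taken as identities divided by alignment length. The function name `percent_identity` is hypothetical, and the strings are copied from the first GenBank hit (KAG8475622.1) with the `Query:` prefix and leading indentation removed.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters on the midline are counted as identities; '+' and spaces are not.
    The alignment length is passed separately because trailing spaces on the
    midline may be lost when the page is copied.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aligned_length


# Query and midline of the first GenBank hit (KAG8475622.1).
query = "FCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM"
midline = "FCDSQSA+ L+ +  +H+RTKHIDV++H+VR+II +G++ + KIS   NP D+MT +L   KFE+  DL++++ V+  M"

# The page lists 48.1 for this hit.
print(round(percent_identity(midline, len(query)), 1))
```

This alignment has no gap columns, so the query length equals the alignment length; for gapped alignments the length of the gapped query line would be the denominator instead.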

Sequences
CDS sequence
ATGTGGTTTACACCTTGTTCTGCTGAAGCTTTTTGTGACAGTCAAAGTGCTCTTACACTTTCTAACAATCCAACTTATCATGATCGAACTAAACATATAGATGTTAAGTT
CCATTATGTTCGAGAAATAATTCAGAAGGGAGAACTTACTTTACACAAAATCAGTGCAATGCATAATCCAACAGATGTTATGACAAATGCTTTGGCAAGTGACAAGTTTG
AGTATCCATGTGACTTGCTAAAGTTAGAGGAGGTAATTGTTGCAATGGTGACTTTGACAGAGTACTGGCTAGAGGAATTTGACTCAACGGGAGGTGGTGAAAATAGAGTG
GAAGGAGAAGTCGGCAACAATTGTTTAGGCACCATAGTCGCCAATAATATTAGTTGGTAG
mRNA sequence
ATGTGGTTTACACCTTGTTCTGCTGAAGCTTTTTGTGACAGTCAAAGTGCTCTTACACTTTCTAACAATCCAACTTATCATGATCGAACTAAACATATAGATGTTAAGTT
CCATTATGTTCGAGAAATAATTCAGAAGGGAGAACTTACTTTACACAAAATCAGTGCAATGCATAATCCAACAGATGTTATGACAAATGCTTTGGCAAGTGACAAGTTTG
AGTATCCATGTGACTTGCTAAAGTTAGAGGAGGTAATTGTTGCAATGGTGACTTTGACAGAGTACTGGCTAGAGGAATTTGACTCAACGGGAGGTGGTGAAAATAGAGTG
GAAGGAGAAGTCGGCAACAATTGTTTAGGCACCATAGTCGCCAATAATATTAGTTGGTAG
Protein sequence
MWFTPCSAEAFCDSQSALTLSNNPTYHDRTKHIDVKFHYVREIIQKGELTLHKISAMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAMVTLTEYWLEEFDSTGGGENRV
EGEVGNNCLGTIVANNISW
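
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, the snippet below builds the standard genetic code table and translates the first and last few codons of the CDS. It assumes the standard nuclear genetic code applies; the function name `translate` is hypothetical, and only short fragments of the CDS are used to keep the example compact.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS.
print(translate("ATGTGGTTTACACCTTGTTCTGCTGAAGCT"))  # MWFTPCSAEA
# Last 15 nucleotides of the CDS, ending in the TAG stop codon.
print(translate("AATATTAGTTGGTAG"))  # NISW
```

Passing the full CDS (with its line breaks) to `translate` should reproduce the listed protein sequence under the same assumption.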