CuGenDBv2

Clc06G10970 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G10970
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: ClcChr06:19174593..19175057
RNA-Seq Expression: Clc06G10970
Synteny: Clc06G10970
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function); GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
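The genome location above can be split into its chromosome and coordinates to check the size of the locus. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr06:19174593..19175057")
print(chrom, span_length(start, end))  # ClcChr06 465
```

Under that assumption the locus spans 465 bp.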


Homology
GenBank top hits (e value, % identity, alignment)
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] (e value 1.7e-28, identity 41.56%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  L G    PP FLD QQ Q N  F  W+RYNR +M WI +S++E  + +IV     + IWE L+  Y + + A +  L+T LQ I+K+GL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
          Y+ + + + +  ++IGEP++Y DHL Y L GL   YN FV SIQ++  RPS+
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] (e value 2.2e-28, identity 41.56%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  L G    PP FLD QQ Q N  F  W+RYNR +M WI +S++E  + +IV     + IWE L+  Y + + A +  L+T LQ I+K+GL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
          Y+ + + + +  ++IGEP++Y DHL Y L GL   YN FV SIQ++  RPS+
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] (e value 3.5e-26, identity 40.67%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  + G    PP F D  +   N  +  W+R+NR IM WI +SL++  M +IV       IWE L   Y S+++AKI  L+ +LQ++RKDGL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLD
         +Y+ + K + +  +A+GEP+S +DHL Y+  GL  +YNAFV SI  R D
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLD

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value 2.8e-28, identity 44.81%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  +    +SPP++LD    Q N  F  W+R N+ +M WI SSL+   + +IV       IW  L   Y+S +IA +M+L +QLQ I+K  + +
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
        S+YLS++K V D+F+ IGEP+SYRD L  IL+GL  +Y+ FV SI NR DRPSL
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value 1.8e-51, identity 64.29%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+G L G +  PP+FLD  Q+QPN  +  WERYNR +MCWI SSLSEEKM E+VSLE T  IW  L   YDS T A+IM LKT+LQ++RKDG SV
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
        SQYL++IKE+ DKF+A+GEP+SYRDHLA++LDGL S+YNAFV SI NR D PSL
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BGF8 Uncharacterized protein (e value 1.7e-26, identity 40.67%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  + G    PP F D  +   N  +  W+R+NR IM WI +SL++  M +IV       IWE L   Y S+++AKI  L+ +LQ++RKDGL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLD
         +Y+ + K + +  +A+GEP+S +DHL Y+  GL  +YNAFV SI  R D
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLD

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 (e value 1.4e-28, identity 44.81%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  +    +SPP++LD    Q N  F  W+R N+ +M WI SSL+   + +IV       IW  L   Y+S +IA +M+L +QLQ I+K  + +
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
        S+YLS++K V D+F+ IGEP+SYRD L  IL+GL  +Y+ FV SI NR DRPSL
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value 8.8e-52, identity 64.29%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+G L G +  PP+FLD  Q+QPN  +  WERYNR +MCWI SSLSEEKM E+VSLE T  IW  L   YDS T A+IM LKT+LQ++RKDG SV
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
        SQYL++IKE+ DKF+A+GEP+SYRDHLA++LDGL S+YNAFV SI NR D PSL
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

A0A7J0EGI5 Uncharacterized protein (e value 8.0e-29, identity 41.56%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  L G    PP FLD QQ Q N  F  W+RYNR +M WI +S++E  + +IV     + IWE L+  Y + + A +  L+T LQ I+K+GL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
          Y+ + + + +  ++IGEP++Y DHL Y L GL   YN FV SIQ++  RPS+
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

A0A7J0GPN0 UBX domain-containing protein (e value 1.0e-28, identity 41.56%)
Query:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV
        +++NGL+  L G    PP FLD QQ Q N  F  W+RYNR +M WI +S++E  + +IV     + IWE L+  Y + + A +  L+T LQ I+K+GL+ 
Subjt:  MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSV

Query:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
          Y+ + + + +  ++IGEP++Y DHL Y L GL   YN FV SIQ++  RPS+
Subjt:  SQYLSQIKEVTDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value 3.6e-05, identity 20.83%)
Query:  WERYNRFIMCWISSSLSEEK-MAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSVSQYLSQIKEVTDKFSAIGEPISYRDHLAYILDG
        W++ +  +   +  +L+ ++     V+   +  IW  +K  + +N  A+ + L ++L+      + V+ Y  ++K++ D    +  P++ R+ + Y+L+G
Subjt:  WERYNRFIMCWISSSLSEEK-MAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSVSQYLSQIKEVTDKFSAIGEPISYRDHLAYILDG

Query:  LRSKYNAFVISIQNRLDRPS
        L  K++  +  I++R   PS
Subjt:  LRSKYNAFVISIQNRLDRPS

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value 4.5e-08, identity 24.17%)
Query:  WERYNRFIMCWISSSLSEEKMAEIVSLEMTTA-IWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSVSQYLSQIKEVTDKFSAIGEPISYRDHLAYILDG
        W+  +  +  WI  ++++  +  I+ +  T   +W  L+  +  N  A+ +  + +L+    D LSV +Y  ++K ++D  + +  PIS R  + ++L+G
Subjt:  WERYNRFIMCWISSSLSEEKMAEIVSLEMTTA-IWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSVSQYLSQIKEVTDKFSAIGEPISYRDHLAYILDG

Query:  LRSKYNAFVISIQNRLDRPS
        L  KY+  +  I+++   PS
Subjt:  LRSKYNAFVISIQNRLDRPS
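The % identity values in the hit lists appear consistent with the usual BLAST convention of identical columns divided by alignment length; for instance, 64.29 would correspond to 99 identical columns out of 154. The sketch below shows that calculation from an alignment midline. It is an assumption about how the values were derived, not something this record documents, and the function name `percent_identity` is hypothetical. Letters on the midline mark identical residues, while `+` and spaces mark conservative substitutions and mismatches.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # A letter on the midline means query and subject share that residue.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy input: the first eight columns of the first GenBank alignment above.
query = "MLSNGLQG"
midline = "+++NGL+ "
print(round(percent_identity(midline, len(query)), 2))  # 37.5
```

For a multi-line alignment, the midline segments would need to be concatenated with their original spacing preserved before counting.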


Sequences
CDS sequence
ATGCTCTCAAATGGACTCCAAGGTATTTTGTATGGTATGGTAACATCCCCTCCAGAATTTCTTGATCAGCAACAAATTCAACCGAATCTTGGTTTTCAGGTATGGGAGAG
GTATAATAGATTTATTATGTGCTGGATTTCCTCTTCTCTGTCTGAAGAAAAGATGGCTGAAATTGTGAGTTTAGAAATGACTACGGCTATTTGGGAACCTTTGAAATGTA
CCTACGATTCTAATACTATTGCGAAGATTATGGCTTTAAAAACTCAGTTGCAGCATATTAGAAAGGATGGATTATCTGTAAGTCAATATTTGTCTCAAATTAAAGAAGTG
ACTGATAAATTTTCCGCTATAGGAGAGCCTATATCATATCGTGACCATCTTGCTTATATTTTAGATGGATTACGAAGTAAATACAATGCCTTTGTCATATCTATACAAAA
TAGATTAGATAGACCTTCTCTTTAA
mRNA sequence
ATGCTCTCAAATGGACTCCAAGGTATTTTGTATGGTATGGTAACATCCCCTCCAGAATTTCTTGATCAGCAACAAATTCAACCGAATCTTGGTTTTCAGGTATGGGAGAG
GTATAATAGATTTATTATGTGCTGGATTTCCTCTTCTCTGTCTGAAGAAAAGATGGCTGAAATTGTGAGTTTAGAAATGACTACGGCTATTTGGGAACCTTTGAAATGTA
CCTACGATTCTAATACTATTGCGAAGATTATGGCTTTAAAAACTCAGTTGCAGCATATTAGAAAGGATGGATTATCTGTAAGTCAATATTTGTCTCAAATTAAAGAAGTG
ACTGATAAATTTTCCGCTATAGGAGAGCCTATATCATATCGTGACCATCTTGCTTATATTTTAGATGGATTACGAAGTAAATACAATGCCTTTGTCATATCTATACAAAA
TAGATTAGATAGACCTTCTCTTTAA
Protein sequence
MLSNGLQGILYGMVTSPPEFLDQQQIQPNLGFQVWERYNRFIMCWISSSLSEEKMAEIVSLEMTTAIWEPLKCTYDSNTIAKIMALKTQLQHIRKDGLSVSQYLSQIKEV
TDKFSAIGEPISYRDHLAYILDGLRSKYNAFVISIQNRLDRPSL
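The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not say which translation table was used); the `translate` helper is hypothetical and stops at the first stop codon. The two calls use the first 18 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCTCTCAAATGGACTC"))  # MLSNGL
print(translate("AGACCTTCTCTTTAA"))     # RPSL
```

Both fragments agree with the start (MLSNGL) and end (RPSL) of the protein sequence above. Passing the full CDS, with its line breaks, to `translate` would allow a complete comparison.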