; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G035005 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G035005
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCla97Chr02:10248812..10249195
RNA-Seq ExpressionCla97C02G035005
SyntenyCla97C02G035005
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]3.1e-2546.46Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M+ +YASL +  MG+IV + + ++IW +L++ Y SSS  ++  L+A++Q ++KDGLT  +Y+ K K+I + L+A+ EP+S KDH+ Y+  GL  EYNAFV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSI  + D   LE++ +LLLSY++RLE
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

RVW33435.1 hypothetical protein CK203_098877 [Vitis vinifera]3.5e-2444.88Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M+ LYASL E+ M +IV ++T  +IW++L++ Y +SS  R   L+ ++Q ++KDGL+  +Y+ + K I + ++AI EP+S K H+ Y+  GL  EYN+FV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSIQ + D P ++ + +LLLSYD+RLE
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]1.2e-2472.84Show/hide
Query:  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE
        +IQ+++KDGL+V+QYLAK K+I+ KLS+I EPIS KDHISYI+EGLG+EYNAFVTSIQN+ D+  LEDV TLLL+YDYRLE
Subjt:  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]6.3e-3455.91Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M  +Y+SL EEKMGE+VS  TT+DIW SL R Y+S +  R++ LK ++Q ++KDG +V+QYLAK K+I+DK +A+ EP+S++DH++++L+GLG EYNAFV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSI N+ D P LEDV +LLL+Y+ RL+
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]3.0e-2855.65Show/hide
Query:  MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPML
        MGEIV + + +DIW +L   YESSS   ++   +Q+QKI+KDGLTV+QYLA+ KD+ D  +AI EP+S++DH+SYILEGLG EYN FV+SI N+ + P +
Subjt:  MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPML

Query:  EDVITLLLSYDYRLE
         DV  LL++YD RLE
Subjt:  EDVITLLLSYDYRLE

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein1.5e-2546.46Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M+ +YASL +  MG+IV + + ++IW +L++ Y SSS  ++  L+A++Q ++KDGLT  +Y+ K K+I + L+A+ EP+S KDH+ Y+  GL  EYNAFV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSI  + D   LE++ +LLLSY++RLE
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

A0A438DD82 Uncharacterized protein1.7e-2444.88Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M+ LYASL E+ M +IV ++T  +IW++L++ Y +SS  R   L+ ++Q ++KDGL+  +Y+ + K I + ++AI EP+S K H+ Y+  GL  EYN+FV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSIQ + D P ++ + +LLLSYD+RLE
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

A0A6J1D6N7 uncharacterized protein LOC1110174385.8e-2572.84Show/hide
Query:  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE
        +IQ+++KDGL+V+QYLAK K+I+ KLS+I EPIS KDHISYI+EGLG+EYNAFVTSIQN+ D+  LEDV TLLL+YDYRLE
Subjt:  QIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIPMLEDVITLLLSYDYRLE

A0A6J1DQX7 uncharacterized protein LOC1110223153.1e-3455.91Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M  +Y+SL EEKMGE+VS  TT+DIW SL R Y+S +  R++ LK ++Q ++KDG +V+QYLAK K+I+DK +A+ EP+S++DH++++L+GLG EYNAFV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        TSI N+ D P LEDV +LLL+Y+ RL+
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

A0A7J6E2L1 Uncharacterized protein6.4e-2445.67Show/hide
Query:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV
        M+ LYASL +  + +IV+FTT  +IW SL R+Y ++S+ R    +  +Q ++KDGL  + YL K K + + L+++ EPIS ++H++Y+L GLG EYNAFV
Subjt:  MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFV

Query:  TSIQNKGDIPMLEDVITLLLSYDYRLE
        T I  +   P++E+V  LLLSY+ RLE
Subjt:  TSIQNKGDIPMLEDVITLLLSYDYRLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0924Show/hide
Query:  LYASLIEEK-MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTS
        LY +L  ++  G  V+ +T+ DIW  +   + ++   R L L ++++      + V  Y  K K ++D L  +  P++ ++ + Y+L GL  +++  +  
Subjt:  LYASLIEEK-MGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTS

Query:  IQNKGDIPMLEDVITLLLSYDYRLE
        I+++   P  +D  T+L   + RL+
Subjt:  IQNKGDIPMLEDVITLLLSYDYRLE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-0925.81Show/hide
Query:  LYASLIEEKMGEIVSF-TTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTS
        +Y ++ +  +  I+    T  D+W SL   +  +   R L  + +++    D L+V +Y  K K +SD L+ +  PIS +  + ++L GL  +Y+  +  
Subjt:  LYASLIEEKMGEIVSF-TTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTS

Query:  IQNKGDIPMLEDVITLLLSYDYRL
        I++K   P   +  ++LL  + RL
Subjt:  IQNKGDIPMLEDVITLLLSYDYRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTATACTTTATGCTTCCCTTATAGAGGAAAAGATGGGAGAAATAGTATCCTTCACAACTACCTATGATATATGGCATTCACTTCACCGATCATATGAGTCATCATC
ATACACGCGAGTTCTCAGTCTTAAAGCCCAAATACAAAAAATTCAAAAGGACGGTCTTACTGTCACACAATATTTGGCCAAATTTAAAGATATTTCTGATAAGTTGTCTG
CGATCAGTGAACCCATATCTCATAAAGACCACATCTCCTATATTTTAGAGGGTCTTGGAGTTGAGTACAATGCTTTTGTCACCTCCATCCAGAACAAGGGGGATATTCCA
ATGCTTGAGGATGTTATCACACTTCTTCTCAGTTACGATTATCGTCTTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACTATACTTTATGCTTCCCTTATAGAGGAAAAGATGGGAGAAATAGTATCCTTCACAACTACCTATGATATATGGCATTCACTTCACCGATCATATGAGTCATCATC
ATACACGCGAGTTCTCAGTCTTAAAGCCCAAATACAAAAAATTCAAAAGGACGGTCTTACTGTCACACAATATTTGGCCAAATTTAAAGATATTTCTGATAAGTTGTCTG
CGATCAGTGAACCCATATCTCATAAAGACCACATCTCCTATATTTTAGAGGGTCTTGGAGTTGAGTACAATGCTTTTGTCACCTCCATCCAGAACAAGGGGGATATTCCA
ATGCTTGAGGATGTTATCACACTTCTTCTCAGTTACGATTATCGTCTTGAATGA
Protein sequenceShow/hide protein sequence
MTILYASLIEEKMGEIVSFTTTYDIWHSLHRSYESSSYTRVLSLKAQIQKIQKDGLTVTQYLAKFKDISDKLSAISEPISHKDHISYILEGLGVEYNAFVTSIQNKGDIP
MLEDVITLLLSYDYRLE