; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G09087 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G09087
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr03:9795150..9796906
RNA-Seq ExpressionClc03G09087
SyntenyClc03G09087
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]4.1e-1134.75Show/hide
Query:  SLLCKVEDMHRRGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-----
        +++ ++     RG LR+     A NA ++    G GES+     K      LKALGA+T   T +  D EAW++ +EKCF+V RCPEDRK+ L       
Subjt:  SLLCKVEDMHRRGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-----

Query:  ---------EGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK
                 E R +   + +W+ F +AF +++ PRSF  AK
Subjt:  ---------EGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK

TYK11725.1 uncharacterized protein E5676_scaffold304G00370 [Cucumis melo var. makuwa]5.4e-1139.06Show/hide
Query:  RRGSLRKQCADGAQNAISKSLVEGEGESSHPQM--TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWKE-----
        R    R+Q  DG Q+A      E    S   +M   + LK LGA+  E + DLADA+AW++ LEKCF VM CP++RK+ L         EG WK      
Subjt:  RRGSLRKQCADGAQNAISKSLVEGEGESSHPQM--TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWKE-----

Query:  NSEPT--WEYFNEAFNEEYVPRSFCYAK
        N   T  W+ F   F E+Y P ++C AK
Subjt:  NSEPT--WEYFNEAFNEEYVPRSFCYAK

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]7.0e-1137.69Show/hide
Query:  RGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EG
        RG LR+     A NA ++    G GES+     K      LKALGA+T   T +  D EAW++ +EKCF+V RCPEDRK+ L                E 
Subjt:  RGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EG

Query:  RWKENSEPTWEYFNEAFNEEYVPRSFCYAK
        R +   + +W+ F +AF +++ PRSF  AK
Subjt:  RWKENSEPTWEYFNEAFNEEYVPRSFCYAK

XP_038877203.1 uncharacterized protein LOC120069501 [Benincasa hispida]9.1e-1148.84Show/hide
Query:  LKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWK-------ENSEPTWEYFNEAFNEEYVPRSF
        LKALGA+  E T D ADAE W++Q+EKCF+VMRCP+ RK+SL        VE  WK       +  E  WE F ++F E + PRSF
Subjt:  LKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWK-------ENSEPTWEYFNEAFNEEYVPRSF

XP_038877272.1 uncharacterized protein LOC120069556 [Benincasa hispida]1.1e-1136.81Show/hide
Query:  NAISKSLVEGEGESSHPQ--------------------------------MTKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISL----
        +A SK+    +GESSHPQ                                  + LKALGAST E T++ ADAEAW   +EKCF+VM CPEDRK++L    
Subjt:  NAISKSLVEGEGESSHPQ--------------------------------MTKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISL----

Query:  ----------VVEGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK
                  V + R +   E TWE F +AF+E++ P++F  AK
Subjt:  ----------VVEGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK

TrEMBL top hitse value%identityAlignment
A0A5A7SRT9 Reverse transcriptase1.3e-1040Show/hide
Query:  EGESSHPQMT---KTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-------EGRWK-------ENSEPTWEYFNEAFNEEYVPRS
        +GE S P+     + LK LGA+  E +ID ADAE W++ LEKCF VM CPE+RK+ L +       EG WK       +     W+ F   F ++Y P +
Subjt:  EGESSHPQMT---KTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-------EGRWK-------ENSEPTWEYFNEAFNEEYVPRS

Query:  FCYAK
        +C AK
Subjt:  FCYAK

A0A5A7TBS0 CCHC-type domain-containing protein2.0e-1134.75Show/hide
Query:  SLLCKVEDMHRRGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-----
        +++ ++     RG LR+     A NA ++    G GES+     K      LKALGA+T   T +  D EAW++ +EKCF+V RCPEDRK+ L       
Subjt:  SLLCKVEDMHRRGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV-----

Query:  ---------EGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK
                 E R +   + +W+ F +AF +++ PRSF  AK
Subjt:  ---------EGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK

A0A5A7UZM6 Gag protease polyprotein-like protein5.8e-1137.98Show/hide
Query:  RGSLRKQCADGAQNAISKSLV-EGEGESSHPQM---TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EGR
        +G  RK     A NA  ++ +  GE   S P+     + LKALGA+T   T + ADAEAW++ +EKCF+V RCPEDRK+ L                E R
Subjt:  RGSLRKQCADGAQNAISKSLV-EGEGESSHPQM---TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EGR

Query:  WKENSEPTWEYFNEAFNEEYVPRSFCYAK
         +   + +W  F +AF +++ PRSF  AK
Subjt:  WKENSEPTWEYFNEAFNEEYVPRSFCYAK

A0A5D3CJL4 Retrotrans_gag domain-containing protein2.6e-1139.06Show/hide
Query:  RRGSLRKQCADGAQNAISKSLVEGEGESSHPQM--TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWKE-----
        R    R+Q  DG Q+A      E    S   +M   + LK LGA+  E + DLADA+AW++ LEKCF VM CP++RK+ L         EG WK      
Subjt:  RRGSLRKQCADGAQNAISKSLVEGEGESSHPQM--TKTLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLV-------VEGRWKE-----

Query:  NSEPT--WEYFNEAFNEEYVPRSFCYAK
        N   T  W+ F   F E+Y P ++C AK
Subjt:  NSEPT--WEYFNEAFNEEYVPRSFCYAK

A0A5D3CTK6 CCHC-type domain-containing protein3.4e-1137.69Show/hide
Query:  RGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EG
        RG LR+     A NA ++    G GES+     K      LKALGA+T   T +  D EAW++ +EKCF+V RCPEDRK+ L                E 
Subjt:  RGSLRKQCADGAQNAISKSLVEGEGESSHPQMTK-----TLKALGASTLERTIDLADAEAWISQLEKCFQVMRCPEDRKISLVV--------------EG

Query:  RWKENSEPTWEYFNEAFNEEYVPRSFCYAK
        R +   + +W+ F +AF +++ PRSF  AK
Subjt:  RWKENSEPTWEYFNEAFNEEYVPRSFCYAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTACCTTGGTCCTGACCATGCATCCAACATGGCATCTAACAGAGGAAACCCTGCGTCCAACATTGTGCAGTCCTGCATCCAAGAGAAGGGTTTCCAAGGCTCCAA
AGGATTCCCAATTGTTGTTTTCTTAGTTCCAAAAGCTAGGGAAGAGCCAGAAAGTCGAGACCAAGCTAACTTGTGTTCAGAGCAGGGAAAAGAGGCTCTAAATTGTGAGC
CTGTGCTGGATTTTAACTTCACAGGATGGTTAGGGGTTGCTAGAAACCTAGTTCCTTATCCAATGAGATGCATCTCTTGTGAGATGAGCATAGATGCAGTTGTTGTGCTA
GCAAATCAGTTCAGTACCGAGTTTAAGCTGAAATGGGAGCTTCGTGAAGCTCATCCTTTCTTTATGTTTTCTTTTCAAGTAGCAAACGAGTTCTCGTTGCTGTGCAAAGT
CGAGGATATGCATAGAAGAGGCAGTCTTCGCAAGCAATGTGCAGACGGAGCCCAGAATGCTATCTCTAAATCCCTAGTAGAAGGGGAGGGAGAATCAAGTCATCCTCAGA
TGACGAAAACATTAAAAGCCTTAGGTGCAAGTACATTAGAGAGGACTATAGATCTAGCTGATGCAGAAGCATGGATCAGTCAGTTGGAAAAATGTTTCCAAGTAATGAGG
TGCCCTGAGGATAGAAAAATCTCTTTGGTAGTTGAAGGAAGATGGAAGGAGAATTCAGAACCTACCTGGGAGTATTTTAATGAAGCTTTTAATGAAGAATATGTTCCTCG
ATCATTCTGTTATGCGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTTACCTTGGTCCTGACCATGCATCCAACATGGCATCTAACAGAGGAAACCCTGCGTCCAACATTGTGCAGTCCTGCATCCAAGAGAAGGGTTTCCAAGGCTCCAA
AGGATTCCCAATTGTTGTTTTCTTAGTTCCAAAAGCTAGGGAAGAGCCAGAAAGTCGAGACCAAGCTAACTTGTGTTCAGAGCAGGGAAAAGAGGCTCTAAATTGTGAGC
CTGTGCTGGATTTTAACTTCACAGGATGGTTAGGGGTTGCTAGAAACCTAGTTCCTTATCCAATGAGATGCATCTCTTGTGAGATGAGCATAGATGCAGTTGTTGTGCTA
GCAAATCAGTTCAGTACCGAGTTTAAGCTGAAATGGGAGCTTCGTGAAGCTCATCCTTTCTTTATGTTTTCTTTTCAAGTAGCAAACGAGTTCTCGTTGCTGTGCAAAGT
CGAGGATATGCATAGAAGAGGCAGTCTTCGCAAGCAATGTGCAGACGGAGCCCAGAATGCTATCTCTAAATCCCTAGTAGAAGGGGAGGGAGAATCAAGTCATCCTCAGA
TGACGAAAACATTAAAAGCCTTAGGTGCAAGTACATTAGAGAGGACTATAGATCTAGCTGATGCAGAAGCATGGATCAGTCAGTTGGAAAAATGTTTCCAAGTAATGAGG
TGCCCTGAGGATAGAAAAATCTCTTTGGTAGTTGAAGGAAGATGGAAGGAGAATTCAGAACCTACCTGGGAGTATTTTAATGAAGCTTTTAATGAAGAATATGTTCCTCG
ATCATTCTGTTATGCGAAATAG
Protein sequenceShow/hide protein sequence
MRYLGPDHASNMASNRGNPASNIVQSCIQEKGFQGSKGFPIVVFLVPKAREEPESRDQANLCSEQGKEALNCEPVLDFNFTGWLGVARNLVPYPMRCISCEMSIDAVVVL
ANQFSTEFKLKWELREAHPFFMFSFQVANEFSLLCKVEDMHRRGSLRKQCADGAQNAISKSLVEGEGESSHPQMTKTLKALGASTLERTIDLADAEAWISQLEKCFQVMR
CPEDRKISLVVEGRWKENSEPTWEYFNEAFNEEYVPRSFCYAK