; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G02000 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G02000
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr04:6274145..6274684
RNA-Seq ExpressionClc04G02000
SyntenyClc04G02000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038874978.1 putative uncharacterized protein DDB_G0279653 [Benincasa hispida]1.0e-2944.62Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGY-SRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNH--KNASGG
        M+   + L +++Q   +A+A G  +DKT+TEAK ILDRISRN +DW D GY  R  +R  +    +  D +  L AQ+AA+T+LLQ M +N    + S  
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGY-SRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNH--KNASGG

Query:  QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNN----QPEQHGAPMNDKGGSSGFHQGQ
        Q NA  Q+AA+ CV     H  ++CP N QSV  I+NN YSNTYNP  RNHPNF WGGN+    Q         ++G  S FHQ Q
Subjt:  QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNN----QPEQHGAPMNDKGGSSGFHQGQ

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]1.2e-2746.47Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYS--RSGRRCNSALGALENDNVVALQAQVAAITNLLQ-IMTMNHKNASGG
        M+    GL +ASQIA +AAAA  L+DK++TEAK+IL  I++++ +W D  Y      RR +  + +++ + +  L  QVA +T+LLQ IM  N  N    
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYS--RSGRRCNSALGALENDNVVALQAQVAAITNLLQ-IMTMNHKNASGG

Query:  QVNAVNQMAAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH
            VNQ+   G     CVGC + H+Y  CPQN QSVCFI+NN +SNTYNP   NHPNFSW G NQ E H
Subjt:  QVNAVNQMAAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH

XP_038882276.1 uncharacterized protein LOC120073506 [Benincasa hispida]3.6e-2746.07Show/hide
Query:  LTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYS--RSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQM
        L +ASQ+A +AAAA  L++K++TEAK+ILDRI++++ +W D  Y      RR +  + +++ + +  L AQVA +T+LLQ +T+   NA   Q   VNQ+
Subjt:  LTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYS--RSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQM

Query:  AAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH-GAPMNDKGGSSGFHQ
         A G     CVGC + H+Y  C QN QSVCFI+NN +SNTYNP  RNHPNFSW   NQ E H  A    + G S   Q
Subjt:  AAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH-GAPMNDKGGSSGFHQ

XP_038902511.1 uncharacterized protein LOC120089170 [Benincasa hispida]1.7e-2944.62Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSG---RRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGG
        M+   +GL ++ Q+   A+AA   +DK +TEAK ILDRI +N +DW D+GY   G   R+  SA+  +  D +  L AQ+A +T+LLQ+M +NH   S G
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSG---RRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGG

Query:  --QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNN-QPEQHGAPMN--DKGGSSGFHQG
          Q N + Q+A + C  C E H+ ++CP N Q+V  I+NN Y+NTYNP  RNHPNF+WGGNN Q  Q     N  ++G    FHQG
Subjt:  --QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNN-QPEQHGAPMN--DKGGSSGFHQG

XP_038904327.1 uncharacterized protein LOC120090680 [Benincasa hispida]1.5e-2541.42Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGY-SRSGRRCNSA-LGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQ
        M++   GL +A Q+A +AAAA  L+DK++ EAK+IL RI+++  +W D  Y  R+ R+  S  + +++++ +  L +QVA +T+LLQ +T+ + +     
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGY-SRSGRRCNSA-LGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQ

Query:  VNAVNQMAAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH
        +  VNQ+ A G     CVGC +PH+Y  CP N QS+C I+NN + NTYN   +NHPNFSW G N  + H
Subjt:  VNAVNQMAAMG-----CVGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQH

TrEMBL top hitse value%identityAlignment
A0A1S4AR95 uncharacterized protein LOC1078004524.3e-1836.53Show/hide
Query:  DGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGR-RCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQ
        +GL    +I  +AA  G +L+K F E   +L++ S+++ DW+       GR +   + G LE D + AL AQ++ +TN +  M ++       QV A   
Subjt:  DGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGR-RCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQ

Query:  MAAMGCVGCNEPHTYKVCPQNSQSVCFIRN------NLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP
             C  C E HT  +CP N +S+CF+ N      N Y NTYNPN R HPNFSWGGN   +    P
Subjt:  MAAMGCVGCNEPHTYKVCPQNSQSVCFIRN------NLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP

A0A1U7X6N4 uncharacterized protein LOC1042312803.3e-1836.53Show/hide
Query:  DGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGR-RCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQ
        +GL    +I  +AA  G +L+K F E   +L++ S+++ DW+       GR +   + G LE D + AL AQ++ +TN +  M ++       QV A   
Subjt:  DGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGR-RCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQ

Query:  MAAMGCVGCNEPHTYKVCPQNSQSVCFIRN------NLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP
             C  C E HT  +CP N +S+CF+ N      N Y NTYNPN R HPNFSWGGN   +    P
Subjt:  MAAMGCVGCNEPHTYKVCPQNSQSVCFIRN------NLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP

A0A6J1EEI2 uncharacterized protein LOC1114333941.6e-2038.42Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVN
        M+   +GL  A++   +A+A G +L KT+ EA +IL+RI+ N+  W D   S  GR+     G LE D + ++ AQ+A++TN+LQ + +   +     V+
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVN

Query:  AV---NQMAAMGCVGCNEPHTYKVCPQNSQSVCFI---------RNNLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP
         V   NQ AA  CV C E HT+  CP N  S+ ++         +NN +SNTYNP  RNHPNFSW G     Q   P
Subjt:  AV---NQMAAMGCVGCNEPHTYKVCPQNSQSVCFI---------RNNLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP

A0A6J1EQ90 uncharacterized protein LOC1114364112.7e-2037.85Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVN
        M+   +GL   ++   +A+A G +L KT+ EA +IL+RI+ N+  W D   S  GR+     G LE D + ++ AQ+A++TN+LQ + +   +     V+
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVN

Query:  ---AVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFI---------RNNLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP
           A+NQ AA  CV C E HT+  CP N  S+ ++         +NN +SNTYNP  RNHPNFSW G +   Q   P
Subjt:  ---AVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFI---------RNNLYSNTYNPNCRNHPNFSWGGNNQPEQHGAP

A0A6J1H7E4 uncharacterized protein LOC1114611687.4e-1836.99Show/hide
Query:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHK---NASGG
        M+   +GL  A++   +A+A G +L KT+ EA +IL+RI+ N+  W D   S  G++     G LE D + ++ AQ+A++TN+LQ +         A   
Subjt:  MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHK---NASGG

Query:  QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIR---------NNLYSNTYNPNCRNHPNFSWGGNNQPEQ
            + Q A   CV C E HT+  CP+N  S+ ++R         NN  SNTYNP  RNHPNFSW G     Q
Subjt:  QVNAVNQMAAMGCVGCNEPHTYKVCPQNSQSVCFIR---------NNLYSNTYNPNCRNHPNFSWGGNNQPEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATATTATGTGATGGTCTAACTAAGGCGTCTCAAATAGCTAGAAATGCTGCCGCAGCTGGAGAATTACTAGATAAAACTTTCACTGAGGCTAAAGATATCTTAGA
TAGAATTTCCAGAAATCATGAAGATTGGGAAGACCACGGCTATAGTCGATCGGGCAGACGATGCAATAGTGCATTGGGAGCATTAGAAAATGATAATGTTGTTGCGTTGC
AAGCACAAGTCGCCGCAATAACCAACCTGCTCCAAATTATGACTATGAATCATAAGAACGCAAGTGGAGGGCAGGTGAATGCAGTGAATCAGATGGCTGCAATGGGATGT
GTTGGATGCAATGAGCCTCATACGTACAAAGTTTGTCCACAGAATTCACAGTCTGTGTGTTTTATACGAAACAACCTTTATTCCAACACATACAATCCTAACTGCAGGAA
TCATCCCAATTTCTCATGGGGTGGAAATAACCAGCCCGAGCAGCACGGTGCCCCAATGAACGATAAAGGTGGATCATCTGGATTCCACCAAGGACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATATTATGTGATGGTCTAACTAAGGCGTCTCAAATAGCTAGAAATGCTGCCGCAGCTGGAGAATTACTAGATAAAACTTTCACTGAGGCTAAAGATATCTTAGA
TAGAATTTCCAGAAATCATGAAGATTGGGAAGACCACGGCTATAGTCGATCGGGCAGACGATGCAATAGTGCATTGGGAGCATTAGAAAATGATAATGTTGTTGCGTTGC
AAGCACAAGTCGCCGCAATAACCAACCTGCTCCAAATTATGACTATGAATCATAAGAACGCAAGTGGAGGGCAGGTGAATGCAGTGAATCAGATGGCTGCAATGGGATGT
GTTGGATGCAATGAGCCTCATACGTACAAAGTTTGTCCACAGAATTCACAGTCTGTGTGTTTTATACGAAACAACCTTTATTCCAACACATACAATCCTAACTGCAGGAA
TCATCCCAATTTCTCATGGGGTGGAAATAACCAGCCCGAGCAGCACGGTGCCCCAATGAACGATAAAGGTGGATCATCTGGATTCCACCAAGGACAATAG
Protein sequenceShow/hide protein sequence
MKILCDGLTKASQIARNAAAAGELLDKTFTEAKDILDRISRNHEDWEDHGYSRSGRRCNSALGALENDNVVALQAQVAAITNLLQIMTMNHKNASGGQVNAVNQMAAMGC
VGCNEPHTYKVCPQNSQSVCFIRNNLYSNTYNPNCRNHPNFSWGGNNQPEQHGAPMNDKGGSSGFHQGQ