; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG03G016367 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG03G016367
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr03:31500618..31504532
RNA-Seq ExpressionClCG03G016367
SyntenyClCG03G016367
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]5.1e-3342.97Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKE
        CV C   HT+E C  N  SVC++        NN YSN+YNP  ++ PN SWGG  Q KQ   P        GF Q  + Q QP Q   S  SS+E+L+++
Subjt:  CVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKE

Query:  YMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAP------HCAG---SSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSS----PTIE
        YM KND  IQ+QA+S+RNLE+ LGQ+A +LK R +G+ PS TE P      HC      SGK         ++  PS+  +  +   +P +S    P + 
Subjt:  YMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAP------HCAG---SSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSS----PTIE

Query:  YNSTTQEKQADKSAFTSVSTELQK--PPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
          +++Q   A+KS        LQK  PP+PQR KK++ D+ QF+RFLDVLKQLHINIPLVE L
Subjt:  YNSTTQEKQADKSAFTSVSTELQK--PPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

XP_030504954.1 uncharacterized protein LOC115719921 [Cannabis sativa]2.8e-3139.07Show/hide
Query:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALL
        + CV C + H +E C  N +SVC++        N  +SN+YN   +N PNLSWGG       +     +    GF Q  +   Q Q S P   SS+E+L+
Subjt:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALL

Query:  KEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQAD
        ++YM KND  IQ+QA+S+RNLEL LG +A ELK R +GSFP+ TE     G  GKEQ K        +     E    T     SP      TT  +Q D
Subjt:  KEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQAD

Query:  KSAFTSVSTELQKP-PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL-------PFLRFLVSPGSNAGICDKLGV
              V +  + P P+PQR +K+  D  +FK+FLDVLKQLHINIPLVE L        FL+ +++  S  G  + LG+
Subjt:  KSAFTSVSTELQKP-PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL-------PFLRFLVSPGSNAGICDKLGV

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.7e-3140.23Show/hide
Query:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQ-NQPQQSTPSPLSSIEAL
        + CV CR+ H +E C  NP+SVC++        N  +SN+YN   +N PNLSWG  ++ K      H   G   +  G+ +Q   PQ +  S  SS+E+L
Subjt:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQ-NQPQQSTPSPLSSIEAL

Query:  LKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNS--TTQE-
        +++YM KND  IQ+QA+ +RNLEL LG +A ELK R +GS PS TE P      GKEQ K        +  N  E  + + +P S    E  S  T QE 
Subjt:  LKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNS--TTQE-

Query:  -----------KQADKSAFTSVSTELQKP-PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
                   +Q++      V +  + P P+PQR +K++ D  QFK+FLDVLKQLHINIPLVE L
Subjt:  -----------KQADKSAFTSVSTELQKP-PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

XP_030509134.1 uncharacterized protein LOC115723804 [Cannabis sativa]2.8e-3140.3Show/hide
Query:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQ---NQPQQSTPSPLSSIE
        + CV C    T+E    NP SVC++        NN YSN+YNP  ++ PN SWGG       S+      G   F  G+ +Q    QP Q   S  SS+E
Subjt:  MGCVGCRKPHTYEVCLQNPQSVCFI-------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQ---NQPQQSTPSPLSSIE

Query:  ALLKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKA--------PTNKTTNPSNQSENSQPTNQPLSSPTIE
        +L+++YM KND  IQ+QA+S+RNLE+ LGQ+A +LK R +G+ PS T+ P      GKE  K           +N     S +  + Q   +   + T  
Subjt:  ALLKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKA--------PTNKTTNPSNQSENSQPTNQPLSSPTIE

Query:  YNSTTQEKQADKSAFTSVSTELQKP--PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
         N    +   D + F S    LQKP  P+PQR KK++ D+ QF+RFLDVLKQLHINIPLVE L
Subjt:  YNSTTQEKQADKSAFTSVSTELQKP--PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

XP_038874882.1 uncharacterized protein LOC120067385 [Benincasa hispida]1.4e-3040.86Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFIQNNRYSNTYNPDR---RNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQS-------TPSPLSSIEAL
        C+GC  PH+Y  C QNP  VCFI+NN +SNTYN      R   ++SW G N   Q+  P  N+   +G    +Q+ +QP Q          S  S +E L
Subjt:  CVGCRKPHTYEVCLQNPQSVCFIQNNRYSNTYNPDR---RNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQS-------TPSPLSSIEAL

Query:  LKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQA
        LKEY+ +NDV +++QASSIRNLE+ +GQIA ELK RQ G  PS T+ P    ++GKEQ         +  S +     P N   S   I ++   +E+  
Subjt:  LKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQA

Query:  DKSAFTSVST-ELQKPPYPQR-----LKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
          ++ +  ST    KP  P       +  KKNDE+QFKRFL++L+QLHINIPL+E L
Subjt:  DKSAFTSVST-ELQKPPYPQR-----LKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

TrEMBL top hitse value%identityAlignment
A0A1S4DBU7 uncharacterized protein LOC1078279691.6e-1935.41Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQ--STPSPLSSIEALLK
        C  C + H   +CL NP+SV F+      Q N+Y +TYNP+ RN PN SWGGN   + Q  P           Q  Q+Q +P Q     SP S +E +LK
Subjt:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQ--STPSPLSSIEALLK

Query:  EYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQP-----TNQP---LSSPTIEYNST
        + M +     Q  A+++RNLE  +GQ+A    TR  G+ PS TE P+           KA  N  T  + ++    P     T+ P   L+   +E N  
Subjt:  EYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQP-----TNQP---LSSPTIEYNST

Query:  TQEKQADKSAFTSVSTELQKPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
           ++ DK     + T    PP+PQRL+K+K D+ ++K+FLD+L Q+ +N+PLVE+L
Subjt:  TQEKQADKSAFTSVSTELQKPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.6e-2437.6Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY
        C  C   H+Y+ C  N +SV F+      QNN YSNTYNP  RN PN SW  N  P      M           G+Q+Q +PQ   P   S +E LL +Y
Subjt:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY

Query:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAP--TNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADK
        + K D  IQ+Q +S+RNLE  +GQ+A  +  R +GS PS T+        GKEQ +     + K     NQ                E     Q+K  DK
Subjt:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAP--TNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADK

Query:  SAFTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
        +     S  +   PP+PQRL+K+K  E+QF++FL+V K+LHINIP  E L
Subjt:  SAFTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

A0A6J1BDW4 uncharacterized protein LOC1104265841.4e-2338.4Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY
        C  C   H+Y+ C  N +SV F+      QNN YSNTYNP  RN PN SW  N  P      M       GFHQ    Q +PQ S     S +E LL +Y
Subjt:  CVGCRKPHTYEVCLQNPQSVCFI------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY

Query:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAP--TNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADK
        + K D  IQ+Q +S+RNLE  +GQ+A  +  R +GS PS T+        GKEQ +     + K     NQ                E     Q+K  DK
Subjt:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAP--TNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADK

Query:  SAFTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
        +     S  +   PP+PQRL+K+K  E+QF++FL+V K+LHINIP  E L
Subjt:  SAFTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

A0A6J1DWN2 uncharacterized protein LOC1110252036.1e-2436.82Show/hide
Query:  CRKPHTYEVCLQNPQSVCFIQ------NNRYSNTYNPDRRNDPNLSWG---GNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY
        C   H Y  C  NP+SV ++       NN YSNTYN    + PN SW    G N     +AP + + G+       Q Q   Q+      +S+E L+K+Y
Subjt:  CRKPHTYEVCLQNPQSVCFIQ------NNRYSNTYNPDRRNDPNLSWG---GNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEY

Query:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADKSA
        M+KN+V +Q+ A+S+RNLEL +GQ+A +LK+R  G+ PS T+       S + +  +   +K  NP N   ++  T+          ++   EK+  K  
Subjt:  MQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADKSA

Query:  FTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVE----VLPFLRFL
             TE +  PPYP+RLKKK+ D  QF++FLDVL QLH+NIPLVE    +  ++RFL
Subjt:  FTSVSTELQ-KPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVE----VLPFLRFL

A0A6J1EQ90 uncharacterized protein LOC1114364115.8e-2234.09Show/hide
Query:  CVGCRKPHTYEVCLQNPQSVCFI---------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQ--------------PQ
        CV C + HT++ C  NP S+ ++         +NN +SNTYNP  RN PN SW G +   QQ  P  N      +  G++ QNQ                
Subjt:  CVGCRKPHTYEVCLQNPQSVCFI---------QNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQ--------------PQ

Query:  QSTPSPLSSIEALLKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSP
        Q+  +  +SIE+L+KEYM KND  IQ+Q +S+RNLE+   QI GE    Q  S    T       +  +++ ++A   K  +        QP  Q     
Subjt:  QSTPSPLSSIEALLKEYMQKNDVQIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSP

Query:  TIEYNSTTQEKQADKSAFTSVSTELQKPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL
              TT   + +   +T        PP+PQR+K+KK +E  F++F+D+LK++HINIPLVE L
Subjt:  TIEYNSTTQEKQADKSAFTSVSTELQKPPYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACTATGGGATGTGTTGGATGTAGGAAGCCGCACACGTATGAAGTCTGCCTGCAGAACCCTCAGTCTGTATGTTTCATACAGAACAACCGGTACTCCAAC
ACCTATAATCCCGACAGGAGAAATGATCCCAATTTATCATGGGGTGGCAACAACCAACCTAAGCAACAAAGTGCACCCATGCACAACAAGGATGGATCATCTGGA
TTCCACCAAGGATATCAGAAGCAAAATCAACCGCAACAATCAACGCCTAGTCCATTGTCATCAATAGAAGCTCTTCTGAAAGAATATATGCAGAAGAACGACGTC
CAAATCCAAACCCAGGCGTCGTCTATAAGGAATCTGGAGCTTCCGCTGGGCCAGATCGCTGGAGAACTGAAAACACGGCAGAAAGGCTCTTTTCCGAGCACTACC
GAAGCCCCGCACTGTGCAGGCAGTTCAGGAAAGGAGCAATGGAAGAAGGCTCCGACCAACAAGACTACGAATCCTTCTAACCAATCAGAAAACTCTCAACCAACA
AATCAGCCTCTTTCTAGCCCTACAATCGAGTACAACAGCACAACTCAAGAAAAACAAGCTGATAAATCTGCATTTACAAGCGTATCAACTGAGCTACAAAAGCCT
CCTTACCCACAAAGGTTAAAGAAGAAGAAAAATGATGAAGAGCAGTTCAAGCGCTTCTTGGATGTATTGAAACAGTTGCATATCAACATCCCTCTGGTGGAAGTG
TTGCCTTTTTTACGTTTCTTGGTGAGTCCTGGGTCGAACGCAGGGATTTGTGATAAATTAGGTGTTGAGAATAGATACAAAAACTATGCGGTGACTATACATGTT
CTCACTCGCCGAGAGATGACTCTAAAATTTCTTAGAATCCTTCTCACAGAAGAGCTCGAAATGGAGGTCCACGGCTACTTTGTCTCGTGTAAGGCACTCTCCATA
ATATGGTTTTACGCGTTGACCATTTACTTTAAATGCATTGGTGCCGTCTTCATTCCTCAGCTCCACGGCACAATGAGGAAAGATTTCTTTGATCACAAAGGTACC
AGACCATCTTGTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATACTATGGGATGTGTTGGATGTAGGAAGCCGCACACGTATGAAGTCTGCCTGCAGAACCCTCAGTCTGTATGTTTCATACAGAACAACCGGTACTCCAAC
ACCTATAATCCCGACAGGAGAAATGATCCCAATTTATCATGGGGTGGCAACAACCAACCTAAGCAACAAAGTGCACCCATGCACAACAAGGATGGATCATCTGGA
TTCCACCAAGGATATCAGAAGCAAAATCAACCGCAACAATCAACGCCTAGTCCATTGTCATCAATAGAAGCTCTTCTGAAAGAATATATGCAGAAGAACGACGTC
CAAATCCAAACCCAGGCGTCGTCTATAAGGAATCTGGAGCTTCCGCTGGGCCAGATCGCTGGAGAACTGAAAACACGGCAGAAAGGCTCTTTTCCGAGCACTACC
GAAGCCCCGCACTGTGCAGGCAGTTCAGGAAAGGAGCAATGGAAGAAGGCTCCGACCAACAAGACTACGAATCCTTCTAACCAATCAGAAAACTCTCAACCAACA
AATCAGCCTCTTTCTAGCCCTACAATCGAGTACAACAGCACAACTCAAGAAAAACAAGCTGATAAATCTGCATTTACAAGCGTATCAACTGAGCTACAAAAGCCT
CCTTACCCACAAAGGTTAAAGAAGAAGAAAAATGATGAAGAGCAGTTCAAGCGCTTCTTGGATGTATTGAAACAGTTGCATATCAACATCCCTCTGGTGGAAGTG
TTGCCTTTTTTACGTTTCTTGGTGAGTCCTGGGTCGAACGCAGGGATTTGTGATAAATTAGGTGTTGAGAATAGATACAAAAACTATGCGGTGACTATACATGTT
CTCACTCGCCGAGAGATGACTCTAAAATTTCTTAGAATCCTTCTCACAGAAGAGCTCGAAATGGAGGTCCACGGCTACTTTGTCTCGTGTAAGGCACTCTCCATA
ATATGGTTTTACGCGTTGACCATTTACTTTAAATGCATTGGTGCCGTCTTCATTCCTCAGCTCCACGGCACAATGAGGAAAGATTTCTTTGATCACAAAGGTACC
AGACCATCTTGTTCATAA
Protein sequenceShow/hide protein sequence
MNTMGCVGCRKPHTYEVCLQNPQSVCFIQNNRYSNTYNPDRRNDPNLSWGGNNQPKQQSAPMHNKDGSSGFHQGYQKQNQPQQSTPSPLSSIEALLKEYMQKNDV
QIQTQASSIRNLELPLGQIAGELKTRQKGSFPSTTEAPHCAGSSGKEQWKKAPTNKTTNPSNQSENSQPTNQPLSSPTIEYNSTTQEKQADKSAFTSVSTELQKP
PYPQRLKKKKNDEEQFKRFLDVLKQLHINIPLVEVLPFLRFLVSPGSNAGICDKLGVENRYKNYAVTIHVLTRREMTLKFLRILLTEELEMEVHGYFVSCKALSI
IWFYALTIYFKCIGAVFIPQLHGTMRKDFFDHKGTRPSCS