CuGenDBv2

Clc09G15340 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G15340
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: ClcChr09:20839208..20839679
RNA-Seq Expression: Clc09G15340
Synteny: Clc09G15340
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
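
The genome location field above follows a `seqid:start..end` pattern. A minimal sketch of parsing it is shown here; the function names are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, since the record does not state its coordinate convention.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of positions covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("ClcChr09:20839208..20839679")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 472 positions on ClcChr09.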


Homology
GenBank top hits (e value, %identity, alignment)
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia] | e value: 6.0e-21 | %identity: 41.5
Query:  MKILCWNARGLGNPRAFRALSNLI-----------RVRLKS--------------------AGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT
        MK+LCWN+RGLGNP+  R L +LI             +LK+                     GRSGGL LLW   + V V S+S +HID  +   +G   
Subjt:  MKILCWNARGLGNPRAFRALSNLI-----------RVRLKS--------------------AGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT

Query:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
         F+G+YG+PE   + LTW LLR L+ G D PWLVGGD NE++   EK
Subjt:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

KAF5477448.1 hypothetical protein F2P56_004088 [Juglans regia] | e value: 6.0e-21 | %identity: 43.75
Query:  MKILCWNARGLGNPRAFRALSNLIRVR------------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEG-RVTCFSGIYGHPESHRKILTWQ
        +K LCWN+RGLGNP   RAL +LI               +   G SGGL LLW+  + +E+ SYS+ HID  V  +  R   F+GIYG+PE+  ++ TW 
Subjt:  MKILCWNARGLGNPRAFRALSNLIRVR------------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEG-RVTCFSGIYGHPESHRKILTWQ

Query:  LLRHLHGGNDEPWLVGGDINEIMKSEEK
        L+R LH     PWL+GGD NE++   EK
Subjt:  LLRHLHGGNDEPWLVGGDINEIMKSEEK

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris] | e value: 1.3e-20 | %identity: 38.51
Query:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC
        +K+L WNA+G  NPR   ALSNLIR                               + +++ GRSGG+ LLW   + + +L YS  HID K+  E  + C
Subjt:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC

Query:  F--SGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
        +  +GIYGHPE+ +++ TW LL+ L   + E WLV GD NEI+ +EEK
Subjt:  F--SGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

XP_022854365.1 uncharacterized protein LOC111375728 [Olea europaea var. sylvestris] | e value: 7.9e-21 | %identity: 40.41
Query:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC
        MK+L WNARGLGNPR   ALS LIR                                 ++S GRSGGL +LW   + + +LSYS++HID K+        
Subjt:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC

Query:  FSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
         +GIYGHP++++ I TW L+R +     EPWLV  D NEI+ +EEK
Subjt:  FSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis] | e value: 1.3e-20 | %identity: 40.14
Query:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT
        MK +CWN+RGLGNP   RAL +LI                                  + S GRSGGL LLW+  + VE+ S+SQ HID  + I +  V 
Subjt:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT

Query:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
         F+G+YGHP++ R+  TW L+R L      PWLVGGD+NE++   EK
Subjt:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

TrEMBL top hits (e value, %identity, alignment)
A0A2N9EH45 Reverse transcriptase domain-containing protein | e value: 5.5e-20 | %identity: 41.5
Query:  MKILCWNARGLGNPRAFRALSNLI-----------------------RVRLK--------SAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWE-GRVT
        M IL WN RGLGN  A R L NL+                       RV+L+        S GRSGGL LLW++  ++ V ++SQNHID+ V  + G + 
Subjt:  MKILCWNARGLGNPRAFRALSNLI-----------------------RVRLK--------SAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWE-GRVT

Query:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
         F+G YG PE HRK  +W L+  LHG +  PWL  GD NEI+  EE+
Subjt:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

A0A2N9ESV7 Uncharacterized protein | e value: 6.5e-21 | %identity: 38.78
Query:  MKILCWNARGLGNPRAFRALSNLIRVR-------------------------------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC
        M I+ WN +GLGNP+A R L NL + +                               + S GRSGGL L+W + IEV V ++SQ+H+D  V  +     
Subjt:  MKILCWNARGLGNPRAFRALSNLIRVR-------------------------------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTC

Query:  -FSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
          +G YGHPE H++  TW+LL HL   N  PWL  GD NEI+  EEK
Subjt:  -FSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

A0A2N9FD73 Uncharacterized protein | e value: 3.8e-21 | %identity: 39.22
Query:  MKILCWNARGLGNPRAFRALSNLI---------------------RVR----------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIW-EGRVT
        M++L WN RGLGNP A RAL +L+                     R+R          + S GRSGGL LLW E I +E+ ++S +HID+ + + +GR  
Subjt:  MKILCWNARGLGNPRAFRALSNLI---------------------RVR----------LKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIW-EGRVT

Query:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEKLEALQR
          +G YG PE HR+  +W LL HL+     PWL  GD NEI+  EEK+   Q+
Subjt:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEKLEALQR

A0A5B6WXI9 Reverse transcriptase | e value: 1.4e-18 | %identity: 38.26
Query:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVT-
        MK +CWN RGLG+PRA R L  L +                                 +++ G  GGLCL W + I V + SYS+ HID  ++WEG +  
Subjt:  MKILCWNARGLGNPRAFRALSNLIR-------------------------------VRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVT-

Query:  --CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK
           F+G YG P S  +   W LL+ L  GN  PWLV GD NEI+ S EK
Subjt:  --CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEK

A0A803PTM0 Uncharacterized protein | e value: 2.3e-18 | %identity: 37.18
Query:  MKILCWNARGLGNPRAFRALSNLI-----------------------RVRL--------KSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT
        M +L WNA+GLGNP   R+L++L+                       RV+L        ++ G+SGGL LLWS  +E +VLS+S  HID+ V I  G+  
Subjt:  MKILCWNARGLGNPRAFRALSNLI-----------------------RVRL--------KSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKV-IWEGRVT

Query:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEKLEALQRKAY
         F+G YG P+  ++  +W+LLR +      PW VGGD NEI+  +EK+  L +  Y
Subjt:  CFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIMKSEEKLEALQRKAY

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAAATATTATGTTGGAATGCTCGGGGATTGGGGAATCCTAGAGCATTCCGAGCCTTGTCCAACCTAATCCGGGTAAGACTCAAATCTGCTGGTCGCAGTGGTGGCTT
GTGTCTGCTATGGTCTGAGATGATTGAAGTGGAAGTATTATCATACTCCCAAAATCATATAGATACAAAGGTGATCTGGGAAGGTCGTGTTACTTGTTTCTCTGGTATTT
ATGGTCATCCAGAAAGTCACAGGAAAATTCTCACATGGCAACTTCTAAGACACCTACACGGAGGAAATGATGAACCTTGGTTGGTGGGGGGAGACATAAATGAGATTATG
AAATCAGAAGAAAAGCTTGAGGCCCTCCAAAGGAAAGCATATTTCTAG
mRNA sequence
ATGAAAATATTATGTTGGAATGCTCGGGGATTGGGGAATCCTAGAGCATTCCGAGCCTTGTCCAACCTAATCCGGGTAAGACTCAAATCTGCTGGTCGCAGTGGTGGCTT
GTGTCTGCTATGGTCTGAGATGATTGAAGTGGAAGTATTATCATACTCCCAAAATCATATAGATACAAAGGTGATCTGGGAAGGTCGTGTTACTTGTTTCTCTGGTATTT
ATGGTCATCCAGAAAGTCACAGGAAAATTCTCACATGGCAACTTCTAAGACACCTACACGGAGGAAATGATGAACCTTGGTTGGTGGGGGGAGACATAAATGAGATTATG
AAATCAGAAGAAAAGCTTGAGGCCCTCCAAAGGAAAGCATATTTCTAG
Protein sequence
MKILCWNARGLGNPRAFRALSNLIRVRLKSAGRSGGLCLLWSEMIEVEVLSYSQNHIDTKVIWEGRVTCFSGIYGHPESHRKILTWQLLRHLHGGNDEPWLVGGDINEIM
KSEEKLEALQRKAYF
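
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and uses a hypothetical `translate` helper. It stops at the first stop codon and raises `KeyError` on ambiguous bases such as `N`.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line wraps and whitespace, normalize case, accept RNA input.
    seq = "".join(cds.split()).upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Last wrapped line of the CDS shown above.
print(translate("AAATCAGAAGAAAAGCTTGAGGCCCTCCAAAGGAAAGCATATTTCTAG"))
# KSEEKLEALQRKAYF
```

Passing the full CDS with its line breaks intact works as well, because whitespace is removed before translation.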