; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G10448 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G10448
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr06:15526208..15527034
RNA-Seq ExpressionClc06G10448
SyntenyClc06G10448
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN60722.1 hypothetical protein VITISV_022058 [Vitis vinifera]2.8e-1731.55Show/hide
Query:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK
        ++  N  +V+ QETK    D   + S+WS     W  L    +S GILI W+    S  +V+  +FSVS++  L      WL AVY   +     DFW +
Subjt:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK

Query:  FDDLACLGGDNWVIGLEGNLMIERYHPI----FVHLIVGYN-----YQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSD
          DL CL   +W +G + N++  R   +    F   +  ++      +LHD PL N  FTWS+         LDRFF        F     + L R TSD
Subjt:  FDDLACLGGDNWVIGLEGNLMIERYHPI----FVHLIVGYN-----YQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSD

Query:  HFPLDL
        H+ + L
Subjt:  HFPLDL

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]1.0e-1934.52Show/hide
Query:  IVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDLACL
        IV+L ETK SS+++  IKS+WSS  I W SL A  +S GI++ W +   S  +VI  +FS+S+   LAD +++WL  VY          FWQ+  DL  L
Subjt:  IVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDLACL

Query:  GGDNWVIGLEGNLMIERYHPIFVH-LIVGYNYQLHDIP--------LFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL
         G  W++G + N+    +     +    G N   H I         + NG +TWS+      +S ++RF   K    KF    +K L R  SDH+P+
Subjt:  GGDNWVIGLEGNLMIERYHPIFVH-LIVGYNYQLHDIP--------LFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]5.6e-2635.44Show/hide
Query:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK
        I + N  +V+LQETKLS +D  ++KS+WS+  I W++L A   ++GILI W++ D    ++I+  FS++I   L+DG+ FW+  +Y  +   +H  FWQ+
Subjt:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK

Query:  FDDLACLGGDNWVIGLEGNLMIERY-------HPIFVHLIVGYNY----QLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTT
          DL+ L  ++W+  L G+  + R+        P+   + +  ++     L D+PL NG  TWS   S    SL+D F +   C+ K G+   K + RTT
Subjt:  FDDLACLGGDNWVIGLEGNLMIERY-------HPIFVHLIVGYNY----QLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTT

Query:  SDHFPL
        SDHFP+
Subjt:  SDHFPL

XP_023905831.1 uncharacterized protein LOC112017605 [Quercus suber]1.4e-1633.82Show/hide
Query:  LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVT-LADGYSFWLCA-VYCLTDNVYHADF
        L+++    IV +QETKL S  S+++KS+W S ++ W +L AI ++ G+ + W    F I D +  +FS+SI +  +ADG+  W+C+ VY  TD       
Subjt:  LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVT-LADGYSFWLCA-VYCLTDNVYHADF

Query:  WQKFDDLA-------CLGGDNWVIGLEG-NLMIERYHP-IFVHLIVGYNYQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERT
        W + D +        CL GD  VI      L    + P IF        + L D+PL  G +TW        MS LDR  +  +  + F  VT ++L R 
Subjt:  WQKFDDLA-------CLGGDNWVIGLEG-NLMIERYHP-IFVHLIVGYNYQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERT

Query:  TSDHFPL
         SDH PL
Subjt:  TSDHFPL

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]1.6e-1735.33Show/hide
Query:  LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQ
        L+ + N  +V+LQ++K+S+V+ HL+KS+WSS ++GW +L A  SS GILI W E   ++ D IQ  FS+SI      G+S W+  VY  +       FW 
Subjt:  LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQ

Query:  KFDDLACLGGDNWVIGLEGNLM-----------IERYHPIFVHLIVGYNYQLHDIPLFNGCFTWSSF
        +   L  L  +NW +G + N++             R    F  LI      + DIP  NG F+W  F
Subjt:  KFDDLACLGGDNWVIGLEGNLM-----------IERYHPIFVHLIVGYNYQLHDIPLFNGCFTWSSF

TrEMBL top hitse value%identityAlignment
A0A6J1CVN2 uncharacterized protein LOC1110146575.0e-2034.52Show/hide
Query:  IVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDLACL
        IV+L ETK SS+++  IKS+WSS  I W SL A  +S GI++ W +   S  +VI  +FS+S+   LAD +++WL  VY          FWQ+  DL  L
Subjt:  IVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDLACL

Query:  GGDNWVIGLEGNLMIERYHPIFVH-LIVGYNYQLHDIP--------LFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL
         G  W++G + N+    +     +    G N   H I         + NG +TWS+      +S ++RF   K    KF    +K L R  SDH+P+
Subjt:  GGDNWVIGLEGNLMIERYHPIFVH-LIVGYNYQLHDIP--------LFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL

A0A6J1E2G6 uncharacterized protein LOC1110254052.7e-2635.44Show/hide
Query:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK
        I + N  +V+LQETKLS +D  ++KS+WS+  I W++L A   ++GILI W++ D    ++I+  FS++I   L+DG+ FW+  +Y  +   +H  FWQ+
Subjt:  IQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQK

Query:  FDDLACLGGDNWVIGLEGNLMIERY-------HPIFVHLIVGYNY----QLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTT
          DL+ L  ++W+  L G+  + R+        P+   + +  ++     L D+PL NG  TWS   S    SL+D F +   C+ K G+   K + RTT
Subjt:  FDDLACLGGDNWVIGLEGNLMIERY-------HPIFVHLIVGYNY----QLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTT

Query:  SDHFPL
        SDHFP+
Subjt:  SDHFPL

A0A7N2L897 Uncharacterized protein6.1e-1829.25Show/hide
Query:  PPITSKKPKLQRELQGLHSPIHYNKTTTLAIREG----------SFLIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSE
        PP+ + +  L REL+GL S ++Y+   TL+ + G            L+++    +V LQETKL+S++S L++S+W S ++ W  L A+ +S G+L+ W +
Subjt:  PPITSKKPKLQRELQGLHSPIHYNKTTTLAIREG----------SFLIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSE

Query:  SDFSINDVIQVNFSVSIQVT-LADGYSFWLC-AVYCLTDNVYHADFWQKFDDLACLGGDNWVIGLEGNLM--------IERYHP-IFVHLIVGYNYQLHD
              DV    FSVS+ +  + DG+  W+C  +Y    + + A  W++   +       W +  + N++         E + P +F       N  L D
Subjt:  SDFSINDVIQVNFSVSIQVT-LADGYSFWLC-AVYCLTDNVYHADFWQKFDDLACLGGDNWVIGLEGNLM--------IERYHP-IFVHLIVGYNYQLHD

Query:  IPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL
        +PL    FTW      Q MS +DR  +  + +  FG+++ + L R  SDH PL
Subjt:  IPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL

A0A7N2M0A0 Kinesin motor domain-containing protein3.0e-1730.99Show/hide
Query:  KPKLQRELQGLHSPIHYNKTTTLAIREGSF---LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNF
        K K +REL+ L S ++Y+       R+ S    L+++    ++ LQETKLS +D  ++ S+WS  Y+ W +L A+ ++ G+L+ W        + +  +F
Subjt:  KPKLQRELQGLHSPIHYNKTTTLAIREGSF---LIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNF

Query:  SVSI-QVTLADGYSFWLCA-VYCLTDNVYHADFWQKFDDL--------ACLGGDNWVIGLEGNLMIERYHP---IFVHLIVGYNYQLHDIPLFNGCFTWS
        SVS+    L DG+  W C+ VY   D+      W K   +         C+G  N V      L   R  P   +F   I   N  L D+PL  G +TWS
Subjt:  SVSI-QVTLADGYSFWLCA-VYCLTDNVYHADFWQKFDDL--------ACLGGDNWVIGLEGNLMIERYHP---IFVHLIVGYNYQLHDIPLFNGCFTWS

Query:  SFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL
        S  +   MS LDR  +  +  + +  V  + L R  SDHFP+
Subjt:  SFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHFPL

A0A803P8A0 Uncharacterized protein5.1e-1732.67Show/hide
Query:  NTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDL
        N  +V+LQE K ++VD   I SIW S +  W  L AI  S G L+ W     S+ D +   FS+S+ +       +W   VY          FW +   L
Subjt:  NTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSINDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDL

Query:  ACLGGDNWVIGLEGNL-----------MIERYHPIFVHLIVGYNYQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHF
        + + G++W +G + N+              R   +F  LI     QL D  L NG FTWS+F +I   S LDRF  + N    F  V  + L R  SDH 
Subjt:  ACLGGDNWVIGLEGNL-----------MIERYHPIFVHLIVGYNYQLHDIPLFNGCFTWSSFGSIQYMSLLDRFFIMKNCLQKFGVVTLKSLERTTSDHF

Query:  PL
        P+
Subjt:  PL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCCCTTCCATTACTTCACAATCCAAACCCAAATGTCCACCGCCAATTACAAGCAAAAAACCCAAGTTACAGAGAGAGTTGCAAGGACTTCATTCCCCAATACA
CTACAATAAGACAACGACATTGGCTATTAGGGAGGGATCTTTTCTCATTCAACAACATAATACGGGTATTGTTCTTTTACAAGAAACAAAGCTGTCATCTGTGGATTCCC
ATTTGATTAAATCAATATGGAGCTCTACTTACATTGGTTGGACATCCTTGTATGCTATTGATTCTTCGGCGGGTATTCTCATTTTTTGGAGTGAATCAGACTTTTCTATC
AATGATGTCATTCAAGTTAATTTTTCTGTCTCTATTCAAGTAACATTGGCTGATGGTTATTCTTTTTGGCTTTGCGCTGTTTATTGTCTGACCGATAATGTCTACCATGC
TGATTTTTGGCAAAAGTTTGATGATTTGGCGTGTTTGGGAGGAGATAACTGGGTTATTGGTTTGGAAGGAAATCTCATGATCGAACGATATCATCCAATATTCGTGCATT
TAATCGTTGGATATAACTACCAGCTCCATGATATCCCTTTATTTAATGGCTGCTTTACTTGGTCTAGCTTTGGCTCTATTCAATACATGTCCCTCCTTGACAGATTTTTT
ATCATGAAAAATTGTCTACAGAAGTTCGGAGTTGTTACTCTTAAAAGTCTAGAAAGAACTACATCAGATCACTTTCCTTTGGATTTAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCCCTTCCATTACTTCACAATCCAAACCCAAATGTCCACCGCCAATTACAAGCAAAAAACCCAAGTTACAGAGAGAGTTGCAAGGACTTCATTCCCCAATACA
CTACAATAAGACAACGACATTGGCTATTAGGGAGGGATCTTTTCTCATTCAACAACATAATACGGGTATTGTTCTTTTACAAGAAACAAAGCTGTCATCTGTGGATTCCC
ATTTGATTAAATCAATATGGAGCTCTACTTACATTGGTTGGACATCCTTGTATGCTATTGATTCTTCGGCGGGTATTCTCATTTTTTGGAGTGAATCAGACTTTTCTATC
AATGATGTCATTCAAGTTAATTTTTCTGTCTCTATTCAAGTAACATTGGCTGATGGTTATTCTTTTTGGCTTTGCGCTGTTTATTGTCTGACCGATAATGTCTACCATGC
TGATTTTTGGCAAAAGTTTGATGATTTGGCGTGTTTGGGAGGAGATAACTGGGTTATTGGTTTGGAAGGAAATCTCATGATCGAACGATATCATCCAATATTCGTGCATT
TAATCGTTGGATATAACTACCAGCTCCATGATATCCCTTTATTTAATGGCTGCTTTACTTGGTCTAGCTTTGGCTCTATTCAATACATGTCCCTCCTTGACAGATTTTTT
ATCATGAAAAATTGTCTACAGAAGTTCGGAGTTGTTACTCTTAAAAGTCTAGAAAGAACTACATCAGATCACTTTCCTTTGGATTTAACATGA
Protein sequenceShow/hide protein sequence
MAVPSITSQSKPKCPPPITSKKPKLQRELQGLHSPIHYNKTTTLAIREGSFLIQQHNTGIVLLQETKLSSVDSHLIKSIWSSTYIGWTSLYAIDSSAGILIFWSESDFSI
NDVIQVNFSVSIQVTLADGYSFWLCAVYCLTDNVYHADFWQKFDDLACLGGDNWVIGLEGNLMIERYHPIFVHLIVGYNYQLHDIPLFNGCFTWSSFGSIQYMSLLDRFF
IMKNCLQKFGVVTLKSLERTTSDHFPLDLT