; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G09220 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G09220
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationClcChr07:23597128..23598408
RNA-Seq ExpressionClc07G09220
SyntenyClc07G09220
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038339.1 uncharacterized protein E6C27_scaffold270G002130 [Cucumis melo var. makuwa]3.9e-2131.92Show/hide
Query:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------
        + G+   +K LA +R + K NPE+VLIQE+K +  +I  IK++WS ++IGW + E+ G SG +LT+WD S + V+ETLK        G +++        
Subjt:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------

Query:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL
            + KC    RK +++       E+S S        G F W    F    S   +T+   +IE +    +  GWAGF+ S KLR V+ ++K W A + 
Subjt:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL

Query:  GKQRKQEEEILQN
          Q ++E++++ +
Subjt:  GKQRKQEEEILQN

RVW13148.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.1e-2028.89Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVK-----C
        MKI+SWN+RG+G ++K   +K  +   NP++V+IQETKK++ +   + ++W+ +   W+   A G SG +L +WD   +   E + G +S+SVK     C
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVK-----C

Query:  KTL------NRKLHSIQKLGVWEEISTSPELYK-------------KEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVR
          L           S++K   W E+     L +              +   F WGP+PF+F N WL +T   +     ++     GW G     +L+ V+
Subjt:  KTL------NRKLHSIQKLGVWEEISTSPELYK-------------KEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVR

Query:  RSLKNWHAEYLGKQRKQEEEILQNL
          LK W+    G+ +++++ IL +L
Subjt:  RSLKNWHAEYLGKQRKQEEEILQNL

TYJ97056.1 uncharacterized protein E5676_scaffold506G001240 [Cucumis melo var. makuwa]8.8e-2131.46Show/hide
Query:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------
        + G+   +K LA +R + K NPE+VLIQE+K +  +I  IK++WS +++GW + E+ G SG +LT+WD S + V+ETLK        G +++        
Subjt:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------

Query:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL
            + KC    RK +++       E+S S        G F W    F    S   +T+   +IE +    +  GWAGF+ S KLR V+ ++K W A + 
Subjt:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL

Query:  GKQRKQEEEILQN
          Q ++E++++ +
Subjt:  GKQRKQEEEILQN

XP_010263157.1 PREDICTED: uncharacterized protein LOC104601500 [Nelumbo nucifera]5.1e-2126.84Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR
        MKIVSWN+RG+G+  K   IK ++ K NP++ LIQE+K  +I+   ++++W +  + W+ S ++G SG ++T+W +  +  +E L G +S+S+K K +  
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR

Query:  KLHSI-----------QKLGVWEEI----------------------STSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFT-VGNHQGWAGF
        +   +           Q+  +W+E+                      S    +   +A     GPSPF+F N WLL+    + +++ +  +     WAG 
Subjt:  KLHSI-----------QKLGVWEEI----------------------STSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFT-VGNHQGWAGF

Query:  IISSKLRLVRRSLKNWHAEYLGKQRKQEEEI
            KL+ ++  +K W+  +L + R++E  +
Subjt:  IISSKLRLVRRSLKNWHAEYLGKQRKQEEEI

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]1.8e-2650.39Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR
        MKIV+W+ RG+GD SK L +KR + K NP++VLIQETKKD IE   IK++WSSKE+G  + EA GKSG LLT+WD+SKI V    K  +SLS+KC+T+N+
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR

Query:  KLHSI----------QKLGVWEEISTSPE
        K+  I          ++  +W E+S+  E
Subjt:  KLHSI----------QKLGVWEEISTSPE

TrEMBL top hitse value%identityAlignment
A0A1U8A916 uncharacterized protein LOC1046015002.5e-2126.84Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR
        MKIVSWN+RG+G+  K   IK ++ K NP++ LIQE+K  +I+   ++++W +  + W+ S ++G SG ++T+W +  +  +E L G +S+S+K K +  
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR

Query:  KLHSI-----------QKLGVWEEI----------------------STSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFT-VGNHQGWAGF
        +   +           Q+  +W+E+                      S    +   +A     GPSPF+F N WLL+    + +++ +  +     WAG 
Subjt:  KLHSI-----------QKLGVWEEI----------------------STSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFT-VGNHQGWAGF

Query:  IISSKLRLVRRSLKNWHAEYLGKQRKQEEEI
            KL+ ++  +K W+  +L + R++E  +
Subjt:  IISSKLRLVRRSLKNWHAEYLGKQRKQEEEI

A0A438BQB2 Transposon TX1 uncharacterized 149 kDa protein5.5e-2128.89Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVK-----C
        MKI+SWN+RG+G ++K   +K  +   NP++V+IQETKK++ +   + ++W+ +   W+   A G SG +L +WD   +   E + G +S+SVK     C
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVK-----C

Query:  KTL------NRKLHSIQKLGVWEEISTSPELYK-------------KEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVR
          L           S++K   W E+     L +              +   F WGP+PF+F N WL +T   +     ++     GW G     +L+ V+
Subjt:  KTL------NRKLHSIQKLGVWEEISTSPELYK-------------KEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVR

Query:  RSLKNWHAEYLGKQRKQEEEILQNL
          LK W+    G+ +++++ IL +L
Subjt:  RSLKNWHAEYLGKQRKQEEEILQNL

A0A5A7TAF5 Uncharacterized protein1.9e-2131.92Show/hide
Query:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------
        + G+   +K LA +R + K NPE+VLIQE+K +  +I  IK++WS ++IGW + E+ G SG +LT+WD S + V+ETLK        G +++        
Subjt:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------

Query:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL
            + KC    RK +++       E+S S        G F W    F    S   +T+   +IE +    +  GWAGF+ S KLR V+ ++K W A + 
Subjt:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL

Query:  GKQRKQEEEILQN
          Q ++E++++ +
Subjt:  GKQRKQEEEILQN

A0A5D3BDI9 Uncharacterized protein4.2e-2131.46Show/hide
Query:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------
        + G+   +K LA +R + K NPE+VLIQE+K +  +I  IK++WS +++GW + E+ G SG +LT+WD S + V+ETLK        G +++        
Subjt:  IRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLK--------GGYSL--------

Query:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL
            + KC    RK +++       E+S S        G F W    F    S   +T+   +IE +    +  GWAGF+ S KLR V+ ++K W A + 
Subjt:  ----SVKCKTLNRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYL

Query:  GKQRKQEEEILQN
          Q ++E++++ +
Subjt:  GKQRKQEEEILQN

A0A803PZR9 Uncharacterized protein1.3e-2230.04Show/hide
Query:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR
        MKI++WNIRG GD+ K  AIK  I K N +LV++QE KK +I+   I  +W S+   WIY  A+G+SG  L +WD   ++V+++L G +S+S   +   +
Subjt:  MKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTLNR

Query:  K----------LHSIQKLGVWEEISTSPELYKKE---AGAF-----------------------------EWGPSPFQFCNSWLLNTQCCKIIERSFTVG
                        +   W+E++   E+        G F                             +WG SPF+F N WL N    K+ E  +   
Subjt:  K----------LHSIQKLGVWEEISTSPELYKKE---AGAF-----------------------------EWGPSPFQFCNSWLLNTQCCKIIERSFTVG

Query:  NHQGWAGFIISSKLRLVRRSLKNWHAEYLGKQR
        N  GW G    +KLR+V+ ++K W     G  +
Subjt:  NHQGWAGFIISSKLRLVRRSLKNWHAEYLGKQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTAAAACACAATAAGCATTCTAAAGAAGAAATGAAGATTGTCTCATGGAACATTAGAGGCATTGGAGATCAATCAAAACACTTGGCAATCAAACGCTTAATCAT
GAAGACAAATCCGGAGTTGGTTTTGATTCAAGAAACAAAGAAAGACTCAATTGAAATTGACATCATTAAAGCAATGTGGAGCTCAAAAGAGATTGGATGGATTTACTCGG
AAGCCTATGGAAAATCAGGTAGATTATTGACAATGTGGGATGAAAGCAAGATATCGGTGATTGAAACACTTAAAGGAGGGTACTCACTCTCAGTAAAATGCAAAACTCTG
AACAGAAAACTGCATAGTATACAGAAGCTTGGTGTTTGGGAGGAGATTTCAACATCACCAGAACTATACAAGAAAGAAGCAGGAGCTTTTGAATGGGGACCTTCCCCTTT
CCAGTTTTGCAATAGTTGGCTGCTTAACACTCAGTGCTGTAAGATCATTGAAAGATCGTTTACAGTGGGTAATCATCAAGGATGGGCTGGTTTCATTATATCTTCTAAGT
TGCGATTAGTAAGAAGATCCTTAAAGAATTGGCATGCAGAGTATCTTGGAAAACAAAGAAAGCAGGAGGAGGAAATCTTACAAAATCTTGAAAAAGAAGAACAGCGGGCA
GAATTGGAAGAAATCACCCCTAGAACAGGATATGAGGAATTCCTTAAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTAAAACACAATAAGCATTCTAAAGAAGAAATGAAGATTGTCTCATGGAACATTAGAGGCATTGGAGATCAATCAAAACACTTGGCAATCAAACGCTTAATCAT
GAAGACAAATCCGGAGTTGGTTTTGATTCAAGAAACAAAGAAAGACTCAATTGAAATTGACATCATTAAAGCAATGTGGAGCTCAAAAGAGATTGGATGGATTTACTCGG
AAGCCTATGGAAAATCAGGTAGATTATTGACAATGTGGGATGAAAGCAAGATATCGGTGATTGAAACACTTAAAGGAGGGTACTCACTCTCAGTAAAATGCAAAACTCTG
AACAGAAAACTGCATAGTATACAGAAGCTTGGTGTTTGGGAGGAGATTTCAACATCACCAGAACTATACAAGAAAGAAGCAGGAGCTTTTGAATGGGGACCTTCCCCTTT
CCAGTTTTGCAATAGTTGGCTGCTTAACACTCAGTGCTGTAAGATCATTGAAAGATCGTTTACAGTGGGTAATCATCAAGGATGGGCTGGTTTCATTATATCTTCTAAGT
TGCGATTAGTAAGAAGATCCTTAAAGAATTGGCATGCAGAGTATCTTGGAAAACAAAGAAAGCAGGAGGAGGAAATCTTACAAAATCTTGAAAAAGAAGAACAGCGGGCA
GAATTGGAAGAAATCACCCCTAGAACAGGATATGAGGAATTCCTTAAAAGTTGA
Protein sequenceShow/hide protein sequence
MRLKHNKHSKEEMKIVSWNIRGIGDQSKHLAIKRLIMKTNPELVLIQETKKDSIEIDIIKAMWSSKEIGWIYSEAYGKSGRLLTMWDESKISVIETLKGGYSLSVKCKTL
NRKLHSIQKLGVWEEISTSPELYKKEAGAFEWGPSPFQFCNSWLLNTQCCKIIERSFTVGNHQGWAGFIISSKLRLVRRSLKNWHAEYLGKQRKQEEEILQNLEKEEQRA
ELEEITPRTGYEEFLKS