; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G07230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G07230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr07:19417952..19419567
RNA-Seq ExpressionClc07G07230
SyntenyClc07G07230
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]9.6e-2249.25Show/hide
Query:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSCAI
        V L NG RI VD IG++++  SL L DVL + +FAYNLISVSCLL   +I+L+F    C I D     MIGKA+ ++GLY+LN   ++N   A +   AI
Subjt:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSCAI

Query:  SVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHT
        SV+ WH RLGHLSPK L  L  TL L +  + H+
Subjt:  SVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHT

KAG7544005.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa]2.3e-1537.75Show/hide
Query:  YFKVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNF-IDS
        +F   +    + V+L N  R+ + + G+V++  SL+L+DVL +P F +NLISVS LLK+  ++ +FY D+C I + I   MIG+    H LY+L     S
Subjt:  YFKVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNF-IDS

Query:  SNHHTAVIV-----SCAISVEIWHHRLGHLSPKHLFLLKDTLSL-PSSLLQ
        S  H +        S A+   +WH RLGH S   L +L  TLS+  +S+LQ
Subjt:  SNHHTAVIV-----SCAISVEIWHHRLGHLSPKHLFLLKDTLSL-PSSLLQ

KAG7600358.1 Zinc finger CCHC-type [Arabidopsis suecica]2.3e-1537.5Show/hide
Query:  YFKVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNF-IDS
        +F   +    + V+L N  R+ + + G+V++  SL+L+DVL +P F +NLISVS LLK+  ++ +FY D+C I + I   MIG+    H LY+L     S
Subjt:  YFKVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNF-IDS

Query:  SNHHTAVIV-----SCAISVEIWHHRLGHLSPKHLFLLKDTLSL
        S  H +        S A+   +WH RLGH S   L +L  TLS+
Subjt:  SNHHTAVIV-----SCAISVEIWHHRLGHLSPKHLFLLKDTLSL

KAG7609732.1 Retrotransposon gag domain [Arabidopsis suecica]2.2e-1828Show/hide
Query:  IPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC
        + V+L N  R+++ + G + +  SL+L+DVL +P F +NLISVS LLK+ + + +F+ D+C IH+ I   MIGK    H LY+L  +DS +  T   +S 
Subjt:  IPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC

Query:  -------AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNH--VLPLPIPGTLQQNEENHTGVDIYDALNLG----NTPHT
                +   +WH RLGH S   L LL  TLS+P +               S ++ +H  V PL     L     NH     +D ++L         +
Subjt:  -------AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNH--VLPLPIPGTLQQNEENHTGVDIYDALNLG----NTPHT

Query:  TEGQMVFNEG----------------------------FVQNPSTNITTDSTEIIKPKNIVEPNE---------VANPPHDIAVGLRRSIRRHQPPGFRQ
         EG ++F+E                              + +PS +I  D+++I  P     P+          V +    +++   R  R  + PG+  
Subjt:  TEGQMVFNEG----------------------------FVQNPSTNITTDSTEIIKPKNIVEPNE---------VANPPHDIAVGLRRSIRRHQPPGFRQ

Query:  DYDCNLLQSQ---VLNTTTAYSISN
        DY C L+Q+       TTT Y IS+
Subjt:  DYDCNLLQSQ---VLNTTTAYSISN

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]1.2e-1635.6Show/hide
Query:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLN--FIDSSNHHT------
        V L N  RI V ++G V+   +LML DVL +P F +NL+SVS L+K  +I + F+ + C I D  + KMIGK     GLY+++   +DS   H+      
Subjt:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLN--FIDSSNHHT------

Query:  AVIVSCA-----ISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNHVLPLPIPGTLQQNEENHTGVDIYDALN
         V VS +     ++   WHHRLGHLS K L  LKD+L L        F   ++HN +     ++V PL     L     NH   + +D ++
Subjt:  AVIVSCA-----ISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNHVLPLPIPGTLQQNEENHTGVDIYDALN

TrEMBL top hitse value%identityAlignment
A0A151S098 gag_pre-integrs domain-containing protein6.1e-1437.31Show/hide
Query:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC-A
        V L NG RI ++ IG+V ++  L+L +V  +P F ++LIS+S LL   S+ + F+D+   I DK+   +IGK  + +GLY+L+      +  +    C +
Subjt:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC-A

Query:  ISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQH
        +S++IWHHRLGHLS      L +  +L SS + H
Subjt:  ISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQH

A0A1S3CJ63 uncharacterized protein LOC1035014521.6e-1429.2Show/hide
Query:  KVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNH
        ++G+      +  L    I VD IG++ +  SL+  DVL +P+FAYNLIS                      D     MIGKA+ ++GLY+LN   ++N 
Subjt:  KVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNH

Query:  HTAVIVSCAISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNHVLPLPIPGTLQQNEENHTGVDIYDALNLGNTPHTTEGQM
          A +   AISV+ WH RLGHLSPK L  L  TL L +    H+ H    H ++ D+L   ++ +       Q  +        +A  L  T    +   
Subjt:  HTAVIVSCAISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNHVLPLPIPGTLQQNEENHTGVDIYDALNLGNTPHTTEGQM

Query:  VFNEGFVQNPSTN---------ITTDSTEIIKPKNIVEPNEVANPPHDIAVGLRRSIRRHQPPGFRQDYDCNLL
        V     V+ P  N         +  D+T +    N         P + +    R+S R+H+PP   +DY C+LL
Subjt:  VFNEGFVQNPSTN---------ITTDSTEIIKPKNIVEPNEVANPPHDIAVGLRRSIRRHQPPGFRQDYDCNLL

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 84.6e-2249.25Show/hide
Query:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSCAI
        V L NG RI VD IG++++  SL L DVL + +FAYNLISVSCLL   +I+L+F    C I D     MIGKA+ ++GLY+LN   ++N   A +   AI
Subjt:  VALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSCAI

Query:  SVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHT
        SV+ WH RLGHLSPK L  L  TL L +  + H+
Subjt:  SVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHT

A0A6D2HGS3 Integrase catalytic domain-containing protein7.9e-1437.78Show/hide
Query:  IPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC
        + V+L NG R  + + G V ++ SL L++VL +P F +NLISVS LLK    + +FY D+C I + I   MIG+ N  + LY+L  I  S   +    S 
Subjt:  IPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAVIVSC

Query:  AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQH
             +WH RLGH SP  L  +   L L  +   H
Subjt:  AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQH

A0A6J1CR17 uncharacterized protein LOC1110134412.7e-1430.14Show/hide
Query:  KFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLK-YGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLL-NFIDSSNHHT
        K   + V L N  R  V+Y G VR+ + L ++ VL +P+F +NLISV+ LL+   S+++ F +D C I DK   K I K    HGLYLL N  + S+   
Subjt:  KFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLK-YGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLL-NFIDSSNHHT

Query:  AVIVSC--AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTF-------HLEENHNDLS-----DMLVNHVLPLPIPGTLQQNEENHTGVDIYDALN
        ++ VS    +S ++WH+RLGH S   L  LK  L + +  L+           LE + ND+S     D+++ +V+   I       +  +  +DI   + 
Subjt:  AVIVSC--AISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTF-------HLEENHNDLS-----DMLVNHVLPLPIPGTLQQNEENHTGVDIYDALN

Query:  LGNTPHTTEGQMVFNEGFVQN--PSTNITTDSTEIIKPKNIVEPNEVANP---------PHDIAVGLRRSIRRHQPPGFRQDYDCNLLQSQV
            P T     V  E       PS + ++ +  + +P     P+ V+ P         P DI    RRS R  + P + QD+ C+LL + +
Subjt:  LGNTPHTTEGQMVFNEGFVQN--PSTNITTDSTEIIKPKNIVEPNEVANP---------PHDIAVGLRRSIRRHQPPGFRQDYDCNLLQSQV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGTTCTTTATTTCAAAGTTGGAATCAAGTTTTTTGATATTCCTGTTGCTTTGCTTAATGGTTTTCGAATTAAGGTTGATTATATTGGGAATGTTCGTGTATATGA
ATCATTGATGCTAAATGATGTTCTCTTGTTACCAAAATTCGCCTACAACTTGATATCAGTGAGTTGCTTATTGAAATATGGATCTATAGCACTCAATTTTTATGATGATT
ACTGCACCATACATGACAAAATTTCCTTGAAGATGATTGGCAAGGCTAACAATAAACATGGACTCTATTTGCTCAACTTTATTGACAGCTCCAATCATCATACTGCTGTT
ATTGTTTCTTGTGCAATTTCTGTTGAAATTTGGCATCACCGTTTGGGCCATTTATCTCCCAAACATTTATTCTTGTTAAAAGATACTTTGTCTTTACCAAGTTCTCTGTT
ACAACATACTTTTCATCTAGAAGAAAATCACAATGATTTATCTGATATGTTGGTGAATCATGTTTTACCTCTTCCTATTCCAGGAACATTACAGCAAAATGAGGAGAATC
ACACTGGTGTTGACATTTATGATGCACTTAATCTTGGGAATACTCCTCATACAACAGAAGGGCAAATGGTTTTTAATGAAGGATTTGTGCAAAATCCTTCTACCAACATC
ACTACTGACTCCACTGAAATTATTAAGCCCAAAAATATAGTTGAACCTAATGAAGTTGCCAATCCTCCACATGACATTGCTGTTGGTCTAAGAAGATCAATAAGAAGGCA
TCAACCACCTGGTTTTCGTCAAGATTACGATTGTAATTTGCTTCAAAGCCAAGTTTTGAACACTACAACTGCATATTCCATCAGCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCGTTCTTTATTTCAAAGTTGGAATCAAGTTTTTTGATATTCCTGTTGCTTTGCTTAATGGTTTTCGAATTAAGGTTGATTATATTGGGAATGTTCGTGTATATGA
ATCATTGATGCTAAATGATGTTCTCTTGTTACCAAAATTCGCCTACAACTTGATATCAGTGAGTTGCTTATTGAAATATGGATCTATAGCACTCAATTTTTATGATGATT
ACTGCACCATACATGACAAAATTTCCTTGAAGATGATTGGCAAGGCTAACAATAAACATGGACTCTATTTGCTCAACTTTATTGACAGCTCCAATCATCATACTGCTGTT
ATTGTTTCTTGTGCAATTTCTGTTGAAATTTGGCATCACCGTTTGGGCCATTTATCTCCCAAACATTTATTCTTGTTAAAAGATACTTTGTCTTTACCAAGTTCTCTGTT
ACAACATACTTTTCATCTAGAAGAAAATCACAATGATTTATCTGATATGTTGGTGAATCATGTTTTACCTCTTCCTATTCCAGGAACATTACAGCAAAATGAGGAGAATC
ACACTGGTGTTGACATTTATGATGCACTTAATCTTGGGAATACTCCTCATACAACAGAAGGGCAAATGGTTTTTAATGAAGGATTTGTGCAAAATCCTTCTACCAACATC
ACTACTGACTCCACTGAAATTATTAAGCCCAAAAATATAGTTGAACCTAATGAAGTTGCCAATCCTCCACATGACATTGCTGTTGGTCTAAGAAGATCAATAAGAAGGCA
TCAACCACCTGGTTTTCGTCAAGATTACGATTGTAATTTGCTTCAAAGCCAAGTTTTGAACACTACAACTGCATATTCCATCAGCAATTAA
Protein sequenceShow/hide protein sequence
MIVLYFKVGIKFFDIPVALLNGFRIKVDYIGNVRVYESLMLNDVLLLPKFAYNLISVSCLLKYGSIALNFYDDYCTIHDKISLKMIGKANNKHGLYLLNFIDSSNHHTAV
IVSCAISVEIWHHRLGHLSPKHLFLLKDTLSLPSSLLQHTFHLEENHNDLSDMLVNHVLPLPIPGTLQQNEENHTGVDIYDALNLGNTPHTTEGQMVFNEGFVQNPSTNI
TTDSTEIIKPKNIVEPNEVANPPHDIAVGLRRSIRRHQPPGFRQDYDCNLLQSQVLNTTTAYSISN