CuGenDBv2

CaUC10G181560 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC10G181560
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Integrase catalytic domain-containing protein
Genome location: Ciama_Chr10:2662811..2664154
RNA-Seq Expression: CaUC10G181560
Synteny: CaUC10G181560
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0009536 - plastid (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR022546 - Uncharacterised protein family Ycf68
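The genome location follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as below. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("Ciama_Chr10:2662811..2664154")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene span comes to 1344 bp, which is longer than the CDS listed under Sequences.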


Homology
GenBank top hits (e value, % identity, alignment)
CAD5336140.1 unnamed protein product [Arabidopsis thaliana] (e value: 3.5e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

CAD5336141.1 unnamed protein product [Arabidopsis thaliana] (e value: 3.5e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

CAD5336145.1 unnamed protein product [Arabidopsis thaliana] (e value: 3.5e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

KAF3622453.1 hypothetical protein FXO37_32330 [Capsicum annuum] (e value: 1.5e-47, % identity: 71.52)
Query:  ESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKG-----------EVQTKKGLRWIPRHP
        ESVGG  GGGLPCGGCQRFESAYLQLVNLADTKLYD T FFRF  SIYDLSF+DVDKI PFSSTLGWHSLK +G           E  TK       R+ 
Subjt:  ESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKG-----------EVQTKKGLRWIPRHP

Query:  ETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGG
           K  GS+SASET+GDKLHRRE NSPDHQLRPLN RS+IKEVGV+RQPGG
Subjt:  ETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGG

KAF3622463.1 hypothetical protein FXO38_31271 [Capsicum annuum] (e value: 2.9e-46, % identity: 70.86)
Query:  ESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKG-----------EVQTKKGLRWIPRHP
        ESVGG  GGGLP GGCQRFESAYLQLVNLADTKLYD T FFRF  SIYDLSF+DVDKI PFSSTLGWHSLK +G           E  TK       R+ 
Subjt:  ESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKG-----------EVQTKKGLRWIPRHP

Query:  ETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGG
           K  GS+SASET+GDKLHRRE NSPDHQLRPLN RS+IKEVGV+RQPGG
Subjt:  ETRKGVGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGG

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GIA5 Uncharacterized protein ycf68 (e value: 3.4e-61, % identity: 68.42)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKG----
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKG    
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKG----

Query:  ----------------------------------------------------VGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                                             GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ----------------------------------------------------VGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A2N9I366 Uncharacterized protein ycf68 (e value: 6.3e-55, % identity: 52.19)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV---
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV   
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                      GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  --------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A7G2FJL3 Uncharacterized protein ycf68 (e value: 1.7e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A7G2FKR6 Uncharacterized protein ycf68 (e value: 1.7e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A7G2FMH4 Uncharacterized protein ycf68 (e value: 1.7e-60, % identity: 75.3)
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV S 
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGS-

Query:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                                       RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  -------------------------------RSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G07706.1 unknown protein (e value: 1.5e-08, % identity: 56.45)
Query:  SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRSSH
        SET G ++ R           EGNSPDHQLRP N RSVIKEVGVQRQP  L      PRS H
Subjt:  SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRSSH
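The % identity column can be checked against the alignment text itself. The sketch below assumes the usual BLAST-style convention, which the page does not spell out: letters in the middle line mark identical residues, `+` marks conservative substitutions, and the denominator is the full alignment length including gap columns. The function name `percent_identity` is illustrative only.

```python
def percent_identity(query, midline):
    """Identical positions (letters in the midline) over the aligned length."""
    # Assumption: only alphabetic characters in the midline count as identities;
    # '+' (similar residues) and spaces (mismatches or gaps) do not.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)

# Query and midline copied from the AT2G07706.1 alignment.
query = "SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRSSH"
midline = "SET G ++ R           EGNSPDHQLRP N RSVIKEVGVQRQP  L      PRS H"
print(round(percent_identity(query, midline), 2))
```

For this hit the count is 35 identities over 62 alignment columns, which agrees with the 56.45 listed for AT2G07706.1.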


Sequences
CDS sequence
ATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATA
TGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAG
TTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTT
CATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAG
CCACCCTTGA
mRNA sequence
ATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATA
TGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAG
TTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTT
CATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGGAGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAG
CCACCCTTGA
Protein sequence
MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVGSRSASETMGDKL
HRREGNSPDHQLRPLNDRSVIKEVGVQRQPGGLPRSSHP
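The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code applies to this gene; it translates only the first 60 nucleotides of the CDS and compares them with the first 20 residues of the protein. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 60 nt of the CDS and first 20 residues of the protein from this page.
cds_prefix = "ATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGG"
protein_prefix = "MVESVGGLRGGGLPCGGCQR"
print(translate(cds_prefix) == protein_prefix)
```

The same function can be applied to the full CDS once its lines are joined into a single string; the final `TGA` codon translates to `*`.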