CuGenDBv2

CaUC06G114380 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC06G114380
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Integrase catalytic domain-containing protein
Genome location: Ciama_Chr06:9900442..9901442
RNA-Seq Expression: CaUC06G114380
Synteny: CaUC06G114380
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0009507 - chloroplast (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR022546 - Uncharacterised protein family Ycf68
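
As a hedged illustration of how the genome location field above can be read, the following Python sketch splits a `chromosome:start..end` string into its parts. The names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
import re

# Hypothetical helper: matches strings shaped like "chrom:start..end".
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (chromosome, start, end) from a 'chrom:start..end' string."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Ciama_Chr06:9900442..9901442")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 1001 bases long.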


Homology
GenBank top hits (e value, % identity; alignment shown under each hit)
CAD5336141.1 unnamed protein product [Arabidopsis thaliana] | e value 2.2e-73 | % identity 77.25
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR             + + ++  KL   GS     +    +++SV GL
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL

Query:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
         GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

CAD5336145.1 unnamed protein product [Arabidopsis thaliana] | e value 2.2e-73 | % identity 77.25
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR             + + ++  KL   GS     +    +++SV GL
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL

Query:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
         GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

KAD3640919.1 hypothetical protein E3N88_30142 [Mikania micrantha] | e value 1.2e-82 | % identity 80.2
Query:  LVRSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGF
        LVRSSMDRTWTVVGVGGS RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR             PG+     
Subjt:  LVRSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGF

Query:  SGTTRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRK
                  G LRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRK
Subjt:  SGTTRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRK

Query:  GV
        GV
Subjt:  GV

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica] | e value 2.6e-74 | % identity 81.11
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP---KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGG
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR    + + ++  KL   GS     +    +++SV GL GGGLPCGG
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP---KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGLRGGGLPCGG

Query:  CQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
        CQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  CQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

OVA05688.1 hypothetical protein BVC80_4285g1 [Macleaya cordata] | e value 4.7e-79 | % identity 81.22
Query:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK--
        VVGVGGS RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR P   + +  +     S  A F    R+    
Subjt:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK--

Query:  ---SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
           S GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV
Subjt:  ---SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

TrEMBL top hits (e value, % identity; alignment shown under each hit)
A0A200Q5G5 Uncharacterized protein ycf68 | e value 2.3e-79 | % identity 81.22
Query:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK--
        VVGVGGS RVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR P   + +  +     S  A F    R+    
Subjt:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR-PKRRAEKGGKLSVPGSPVAGFSGTTRILK--

Query:  ---SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
           S GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV
Subjt:  ---SVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

A0A5N6MLP8 Uncharacterized protein ycf68 | e value 5.7e-83 | % identity 80.2
Query:  LVRSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGF
        LVRSSMDRTWTVVGVGGS RVP SGIPGEEDQVGPCEQLDALSPFNPLSE+RQKEGKSMDRPH LHPVGTTR PQGRLR             PG+     
Subjt:  LVRSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGF

Query:  SGTTRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRK
                  G LRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SIYDLSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRK
Subjt:  SGTTRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRK

Query:  GV
        GV
Subjt:  GV

A0A7G2FJL3 Uncharacterized protein ycf68 | e value 1.1e-73 | % identity 77.25
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR             + + ++  KL   GS     +    +++SV GL
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL

Query:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
         GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

A0A7G2FKR6 Uncharacterized protein ycf68 | e value 1.1e-73 | % identity 77.25
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR             + + ++  KL   GS     +    +++SV GL
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL

Query:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
         GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

A0A7G2FMH4 Uncharacterized protein ycf68 | e value 1.1e-73 | % identity 77.25
Query:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL
        IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLR             + + ++  KL   GS     +    +++SV GL
Subjt:  IPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRP------------KRRAEKGGKLSVPGSPVAGFSGTTRILKSVGGL

Query:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
         GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGV
Subjt:  RGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGATATTGGTCAGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTT
GGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTA
GGAACTACGAGATCACCCCAAGGACGCCTTCGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATTCTCCGGAACC
ACAAGAATCCTTAAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGAT
ACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGA
TGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTA
mRNA sequence
ATGATATTGGTCAGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTT
GGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTA
GGAACTACGAGATCACCCCAAGGACGCCTTCGACCAAAGAGGCGGGCGGAAAAGGGGGGAAAGCTCTCCGTTCCTGGTTCTCCTGTAGCTGGATTCTCCGGAACC
ACAAGAATCCTTAAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGAT
ACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGA
TGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTA
Protein sequence
MILVRSSMDRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGRLRPKRRAEKGGKLSVPGSPVAGFSGT
TRILKSVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGV
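
The CDS and protein sequences listed above are related through the genetic code. The following is a minimal Python sketch of that relationship, assuming the standard genetic code applies to this gene (the record does not say which translation table was used). The function name `translate` and the use of `X` for unrecognised codons are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Using the standard table here is an assumption about this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not form a full codon are ignored, and any
    codon containing a non-ACGT character is reported as 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First ten codons of the CDS sequence shown in this record.
print(translate("ATGATATTGGTCAGATCTAGTATGGATCGT"))  # prints MILVRSSMDR
```

The printed residues match the first ten residues of the protein sequence in this record.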