; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G012580 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G012580
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionReverse transcriptase
Genome locationCiama_Chr01:24594215..24598458
RNA-Seq ExpressionCaUC01G012580
SyntenyCaUC01G012580
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]1.1e-1168.63Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        R+F+RYGPPTF   SEKAT  E+WI ELE+L+ YL C D LKV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia]7.5e-1370.59Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTF  GS+KATTAE+W+ ELE+L+ YL CED  KV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]3.4e-1370.59Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTFG GSE+AT AE+W+ ELE+L+ YL CED  KV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

XP_022158749.1 uncharacterized protein LOC111025213 [Momordica charantia]3.7e-1256.58Show/hide
Query:  PPRGWRPRGGLGM-PALPGDGAN--RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        PP   R R    + PA+P   A   ++FKRYGPPTF   SE+AT AE+WI ELE+L+ YL CED  KV+GAVFMLR
Subjt:  PPRGWRPRGGLGM-PALPGDGAN--RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

XP_022159307.1 uncharacterized protein LOC111025716, partial [Momordica charantia]3.7e-1268.63Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTF  GSEKAT AE+W+ ELE+L+ YL CED  K +GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196035.3e-1268.63Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        R+F+RYGPPTF   SEKAT  E+WI ELE+L+ YL C D LKV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

A0A6J1DQ01 uncharacterized protein LOC1110232503.6e-1370.59Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTF  GS+KATTAE+W+ ELE+L+ YL CED  KV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

A0A6J1DVA0 uncharacterized protein LOC1110234241.6e-1370.59Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTFG GSE+AT AE+W+ ELE+L+ YL CED  KV+GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

A0A6J1DZH0 uncharacterized protein LOC1110257161.8e-1268.63Show/hide
Query:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        ++FKRYGPPTF  GSEKAT AE+W+ ELE+L+ YL CED  K +GAVFMLR
Subjt:  RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

A0A6J1E0B4 uncharacterized protein LOC1110252131.8e-1256.58Show/hide
Query:  PPRGWRPRGGLGM-PALPGDGAN--RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR
        PP   R R    + PA+P   A   ++FKRYGPPTF   SE+AT AE+WI ELE+L+ YL CED  KV+GAVFMLR
Subjt:  PPRGWRPRGGLGM-PALPGDGAN--RNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKVRGAVFMLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGGCGTTATAGATTCTTTGGTACAGGAAGTAATTTGGGACACCTACGATGATGAGGATGAGGATGAAAATTATGGAGAATGCACGATGTTATGGGATGAGATTGCTTC
CGTCACATTGTCACATCTAACGATGCCACCACGTGGATGGAGACCTAGAGGAGGCTTAGGCATGCCTGCGCTTCCCGGCGACGGAGCAAACAGGAATTTCAAGCGCTATG
GGCCTCCGACCTTCGGCGATGGGTCAGAGAAAGCTACTACAGCTGAGCAGTGGATTGTAGAGCTGGAGTCATTGTTTGACTACCTAAATTGCGAGGATCATCTTAAGGTC
AGAGGAGCAGTTTTCATGCTCCGAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATTGGCGTTATAGATTCTTTGGTACAGGAAGTAATTTGGGACACCTACGATGATGAGGATGAGGATGAAAATTATGGAGAATGCACGATGTTATGGGATGAGATTGCTTC
CGTCACATTGTCACATCTAACGATGCCACCACGTGGATGGAGACCTAGAGGAGGCTTAGGCATGCCTGCGCTTCCCGGCGACGGAGCAAACAGGAATTTCAAGCGCTATG
GGCCTCCGACCTTCGGCGATGGGTCAGAGAAAGCTACTACAGCTGAGCAGTGGATTGTAGAGCTGGAGTCATTGTTTGACTACCTAAATTGCGAGGATCATCTTAAGGTC
AGAGGAGCAGTTTTCATGCTCCGAGACTAG
Protein sequenceShow/hide protein sequence
IGVIDSLVQEVIWDTYDDEDEDENYGECTMLWDEIASVTLSHLTMPPRGWRPRGGLGMPALPGDGANRNFKRYGPPTFGDGSEKATTAEQWIVELESLFDYLNCEDHLKV
RGAVFMLRD