; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G087670 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G087670
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCiama_Chr05:8010589..8015678
RNA-Seq ExpressionCaUC05G087670
SyntenyCaUC05G087670
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]1.7e-3240.19Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK+L+WN RG+GS  KRA+ K  I + SP+FVILTET L + +K+IIKS W S SIN     ++G+ GGI+I+WD+  H      +   SL  +   ++N
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR
          WW++G+YGP  RR R  FW +L  L    +  W L  D NV R   ETTS   S  + R  N FI    L++P   NN FT          SN+ +P 
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR

Query:  RISLVFNLL
          S +   L
Subjt:  RISLVFNLL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]1.7e-3240.19Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK+L+WN RG+GS  KRA+ K  I + SP+FVILTET L + +K+IIKS W S SIN     ++G+ GGI+I+WD+  H      +   SL  +   ++N
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR
          WW++G+YGP  RR R  FW +L  L    +  W L  D NV R   ETTS   S  + R  N FI    L++P   NN FT          SN+ +P 
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR

Query:  RISLVFNLL
          S +   L
Subjt:  RISLVFNLL

XP_011650214.1 uncharacterized protein LOC105434766 [Cucumis sativus]5.9e-3845.9Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MKIL+WNVRGIGS QKR   K VI+  +P+FV L+ETKL+ V+ K++KS+WSSISI       +G  GGI++MWD L H       G  ++  +   +D 
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
          WWI+ +Y   +R  RK FW+EL  LA TC S W+L GDFN  RW  ET+S  P + +M KFN  I    L++P   N  +T
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]5.5e-4448.63Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        M IL+WNVRG+GSA KRA  K  IT++ P+ VIL+ETK   ++ K IKSLWSSISI  A   ++G  GGII++WD L   A EV  G  S+ V  K +DN
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
          WW++G+Y P  ++ RK FW+EL  L   C   W+LG DFN+YRW++ET+S  P +  M KFN FI+   L++P   N  +T
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]5.7e-3343.17Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK L+WNVRG+ S +K A+ K  I+ ++PN VIL ETKL  +D  I+KSLWS+  IN +   ++G   GI+I+W+     A E+ +G  SL ++   SD 
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
         L+W+SGIYGP++      FW+EL  L+  C + WIL GDFNV RW+ E ++ RP   +M  FN+FIE ++L++    N   T
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

TrEMBL top hitse value%identityAlignment
A0A5A7UV84 Reverse transcriptase domain-containing protein8.0e-3340.19Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK+L+WN RG+GS  KRA+ K  I + SP+FVILTET L + +K+IIKS W S SIN     ++G+ GGI+I+WD+  H      +   SL  +   ++N
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR
          WW++G+YGP  RR R  FW +L  L    +  W L  D NV R   ETTS   S  + R  N FI    L++P   NN FT          SN+ +P 
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR

Query:  RISLVFNLL
          S +   L
Subjt:  RISLVFNLL

A0A5D3CI86 Reverse transcriptase domain-containing protein8.0e-3340.19Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK+L+WN RG+GS  KRA+ K  I + SP+FVILTET L + +K+IIKS W S SIN     ++G+ GGI+I+WD+  H      +   SL  +   ++N
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR
          WW++G+YGP  RR R  FW +L  L    +  W L  D NV R   ETTS   S  + R  N FI    L++P   NN FT          SN+ +P 
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNMASPR

Query:  RISLVFNLL
          S +   L
Subjt:  RISLVFNLL

A0A6J1CVN2 uncharacterized protein LOC1110146572.7e-4448.63Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        M IL+WNVRG+GSA KRA  K  IT++ P+ VIL+ETK   ++ K IKSLWSSISI  A   ++G  GGII++WD L   A EV  G  S+ V  K +DN
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
          WW++G+Y P  ++ RK FW+EL  L   C   W+LG DFN+YRW++ET+S  P +  M KFN FI+   L++P   N  +T
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

A0A6J1E2G6 uncharacterized protein LOC1110254052.8e-3343.17Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MK L+WNVRG+ S +K A+ K  I+ ++PN VIL ETKL  +D  I+KSLWS+  IN +   ++G   GI+I+W+     A E+ +G  SL ++   SD 
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
         L+W+SGIYGP++      FW+EL  L+  C + WIL GDFNV RW+ E ++ RP   +M  FN+FIE ++L++    N   T
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

A0A6P5T1U8 uncharacterized protein LOC1107621451.8e-3239.89Show/hide
Query:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN
        MKI+SWN+RG+GS +KR + K  +T + P+ VIL ETK   +D++++ S+W S   +     STG  GGI+I+W++      +      S+ + I+ +  
Subjt:  MKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYDGSNSLLVSIKFSDN

Query:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT
          WW+SGIYGP  +R+R  FW EL  L   C   W +GGDFNV R+ ++ ++      +MR FN FI  TNL +PR  N  FT
Subjt:  VLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGATTGAAGTTTCCTGATAACCAGGTGCTCATGAAGTTGATAACATGCAAGGAGCACCAAGGGTCCATGAAGATCCTCTCATGGAATGTGAGGGGCATTGGCTC
AGCCCAGAAAAGAGCCATTACAAAGGGTGTTATCACGGCTGTAAGCCCGAACTTTGTCATTCTAACTGAAACGAAATTAGTCTTAGTGGATAAGAAGATTATCAAATCTC
TCTGGAGTTCCATCAGTATCAACGGGGCACATACGCTTTCCACTGGTACATTTGGGGGAATCATAATCATGTGGGACTCTTTAATCCACTGTGCAGATGAGGTTTATGAT
GGAAGCAACTCCTTATTAGTCTCTATCAAGTTTTCAGATAATGTTCTGTGGTGGATATCGGGAATATATGGTCCTGCTAGCAGACGAAATAGAAAGTTTTTTTGGGAAGA
GCTTGACACTCTAGCTTCAACTTGCAATAGCTGCTGGATTTTGGGGGGAGACTTCAATGTTTATAGATGGACCAATGAAACTACATCATGCAGACCATCCAGACTCAACA
TGAGAAAATTCAACGCATTTATTGAAAAGACAAATCTTATGGAACCGAGATCGTTTAATAACCCCTTTACTTGCTACCTCCATCCATGCTTATTAATGCTCTCAAATATG
GCTTCCCCTAGGAGGATTTCCCTGGTGTTCAACCTTCTTTGCTTTGCTTTCCTTCTCATTTTCTCTATTGTTTCTGCAACCAATAACGTCCAGCCTTTGCTTCACAAACC
TCATATTCGTAAAATGGCTCTGGAGTACAAAAGTGATGGGAAGGCTGTTGTTGTTCCTAAAGTAATCAAACCAACTTCTGAGGATTTCCCATGGGGGGGAGGTTATATTG
ATCATCCAATCTTAAGCAACCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGATTGAAGTTTCCTGATAACCAGGTGCTCATGAAGTTGATAACATGCAAGGAGCACCAAGGGTCCATGAAGATCCTCTCATGGAATGTGAGGGGCATTGGCTC
AGCCCAGAAAAGAGCCATTACAAAGGGTGTTATCACGGCTGTAAGCCCGAACTTTGTCATTCTAACTGAAACGAAATTAGTCTTAGTGGATAAGAAGATTATCAAATCTC
TCTGGAGTTCCATCAGTATCAACGGGGCACATACGCTTTCCACTGGTACATTTGGGGGAATCATAATCATGTGGGACTCTTTAATCCACTGTGCAGATGAGGTTTATGAT
GGAAGCAACTCCTTATTAGTCTCTATCAAGTTTTCAGATAATGTTCTGTGGTGGATATCGGGAATATATGGTCCTGCTAGCAGACGAAATAGAAAGTTTTTTTGGGAAGA
GCTTGACACTCTAGCTTCAACTTGCAATAGCTGCTGGATTTTGGGGGGAGACTTCAATGTTTATAGATGGACCAATGAAACTACATCATGCAGACCATCCAGACTCAACA
TGAGAAAATTCAACGCATTTATTGAAAAGACAAATCTTATGGAACCGAGATCGTTTAATAACCCCTTTACTTGCTACCTCCATCCATGCTTATTAATGCTCTCAAATATG
GCTTCCCCTAGGAGGATTTCCCTGGTGTTCAACCTTCTTTGCTTTGCTTTCCTTCTCATTTTCTCTATTGTTTCTGCAACCAATAACGTCCAGCCTTTGCTTCACAAACC
TCATATTCGTAAAATGGCTCTGGAGTACAAAAGTGATGGGAAGGCTGTTGTTGTTCCTAAAGTAATCAAACCAACTTCTGAGGATTTCCCATGGGGGGGAGGTTATATTG
ATCATCCAATCTTAAGCAACCCATGATGAGATTTGTAACTATGTTTCATTGGATGATCAGCTGTGTTTTTTCTTTGCTTGGTGGTGTTGACTTTTTTGGGAGCATCAATC
TCAAGCTCTCAACCTTACTATAAA
Protein sequenceShow/hide protein sequence
MQGLKFPDNQVLMKLITCKEHQGSMKILSWNVRGIGSAQKRAITKGVITAVSPNFVILTETKLVLVDKKIIKSLWSSISINGAHTLSTGTFGGIIIMWDSLIHCADEVYD
GSNSLLVSIKFSDNVLWWISGIYGPASRRNRKFFWEELDTLASTCNSCWILGGDFNVYRWTNETTSCRPSRLNMRKFNAFIEKTNLMEPRSFNNPFTCYLHPCLLMLSNM
ASPRRISLVFNLLCFAFLLIFSIVSATNNVQPLLHKPHIRKMALEYKSDGKAVVVPKVIKPTSEDFPWGGGYIDHPILSNP