CuGenDBv2

CcUC04G060110 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC04G060110
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: CicolChr04:2611068..2612149
RNA-Seq Expression: CcUC04G060110
Synteny: CcUC04G060110
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa] | 4.2e-23 | 43.45
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        MSW Y+S+ E  +G+IVGY +A +IWEA                      IKKDGL+   Y+ + + + +   +IGEP++Y DHL Y L  LG +YNPFV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+IQ++  R SI +V +LL+ Y+  LE+Q ATD L+ +QANLAN+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa] | 4.2e-23 | 43.45
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        MSW Y+S+ E  +G+IVGY +A +IWEA                      IKKDGL+   Y+ + + + +   +IGEP++Y DHL Y L  LG +YNPFV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+IQ++  R SI +V +LL+ Y+  LE+Q ATD L+ +QANLAN+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia] | 4.2e-23 | 56.25
Query:  RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        ++KKDGLS++QYLA+IK++  K ++IGEP+S +DH+ YI+E LG EYN FVT+IQNR+D  ++ DVR LL+ Y+  LEKQ + D LN++QAN+AN+
Subjt:  RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | 1.6e-30 | 49.66
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAF----------RI--------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        M W YSSL E+K+GE+V  +T ++IW +           RI         ++KDG S++QYLA+IK++ DKF A+GEPLSYRDHL ++L+ LG+EYN FV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAF----------RI--------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+I NR D  S+ DVR+LL+ YE  L+KQ   D LN+ QANL N+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida] | 3.7e-35 | 62.41
Query:  VGEIVGYDTAYEIWEAFRI------------------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSI
        +GEIVGY++A++IWEA R                   +IKKDGL+++QYLAQIKDV+D F AIGEPLSYRDHL YILE LG+EYNPFV++I NRT+R SI
Subjt:  VGEIVGYDTAYEIWEAFRI------------------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSI

Query:  ADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        ADVRNLLI Y+  LEKQ ATD L LIQAN+A++
Subjt:  ADVRNLLIIYEICLEKQIATDLLNLIQANLANV

TrEMBL top hits | e value | % identity
A0A2P5BGF8 Uncharacterized protein | 5.6e-21 | 38.46
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        MSW Y+SL +  +G+IVGY +A+EIWEA                      ++KDGL+  +Y+ + K++ +   A+GEP+S +DHL Y+   L  EYN FV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANVLSLFKRRFIPS
        T+I  R D L + ++ +LL+ YE  LE Q A+  L+ +QANLA+ L++ K+ + P+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANVLSLFKRRFIPS

A0A6J1D6N7 uncharacterized protein LOC111017438 | 2.0e-23 | 56.25
Query:  RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        ++KKDGLS++QYLA+IK++  K ++IGEP+S +DH+ YI+E LG EYN FVT+IQNR+D  ++ DVR LL+ Y+  LEKQ + D LN++QAN+AN+
Subjt:  RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

A0A6J1DQX7 uncharacterized protein LOC111022315 | 7.7e-31 | 49.66
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAF----------RI--------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        M W YSSL E+K+GE+V  +T ++IW +           RI         ++KDG S++QYLA+IK++ DKF A+GEPLSYRDHL ++L+ LG+EYN FV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAF----------RI--------RIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+I NR D  S+ DVR+LL+ YE  L+KQ   D LN+ QANL N+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

A0A7J0DER3 Uncharacterized protein | 2.0e-23 | 43.45
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        MSW Y+S+ E  +G+IVGY +A +IWEA                      IKKDGL+   Y+ + + + +   +IGEP++Y DHL Y L  LG +YNPFV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+IQ++  R SI +V +LL+ Y+  LE+Q ATD L+ +QANLAN+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

A0A7J0E8R3 Uncharacterized protein | 2.0e-23 | 43.45
Query:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV
        MSW Y+S+ E  +G+IVGY +A +IWEA                      IKKDGL+   Y+ + + + +   +IGEP++Y DHL Y L  LG +YNPFV
Subjt:  MSWFYSSLIEDKVGEIVGYDTAYEIWEAFR------------------IRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFV

Query:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV
        T+IQ++  R SI +V +LL+ Y+  LE+Q ATD L+ +QANLAN+
Subjt:  TTIQNRTDRLSIADVRNLLIIYEICLEKQIATDLLNLIQANLANV

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGAGTTGGTTCTACTCCTCACTTATTGAAGACAAGGTGGGTGAGATAGTTGGCTATGATACTGCTTATGAAATTTGGGAGGCATTCCGCATAAGAATCAAGAAAGATGG
TCTCTCCATGGCTCAGTATTTGGCTCAGATCAAGGACGTCGTCGATAAATTTAATGCAATTGGAGAACCCTTGTCTTATAGAGATCATTTGGGATATATCCTCGAAAGAC
TTGGAAATGAATATAATCCTTTTGTGACTACAATTCAAAATCGTACCGATCGTCTTTCTATTGCTGATGTCCGCAATTTACTCATTATTTATGAAATTTGTCTGGAGAAA
CAAATAGCCACTGATCTGTTAAATTTGATTCAAGCCAATCTTGCAAATGTCTTAAGTCTATTTAAAAGACGATTTATTCCATCCGAGGGACATCAATTCTTCATGGGATC
TAAACCTAAGGCAAAGCTCTGCCATCTGAAAGAGTGCACTGCTGCTGGTGATCAATTCGAGGCCCGAAGACGAAGGTCGGGGGTCAAGATTGGGAGACGGCAACGGTGA
mRNA sequence
ATGAGTTGGTTCTACTCCTCACTTATTGAAGACAAGGTGGGTGAGATAGTTGGCTATGATACTGCTTATGAAATTTGGGAGGCATTCCGCATAAGAATCAAGAAAGATGG
TCTCTCCATGGCTCAGTATTTGGCTCAGATCAAGGACGTCGTCGATAAATTTAATGCAATTGGAGAACCCTTGTCTTATAGAGATCATTTGGGATATATCCTCGAAAGAC
TTGGAAATGAATATAATCCTTTTGTGACTACAATTCAAAATCGTACCGATCGTCTTTCTATTGCTGATGTCCGCAATTTACTCATTATTTATGAAATTTGTCTGGAGAAA
CAAATAGCCACTGATCTGTTAAATTTGATTCAAGCCAATCTTGCAAATGTCTTAAGTCTATTTAAAAGACGATTTATTCCATCCGAGGGACATCAATTCTTCATGGGATC
TAAACCTAAGGCAAAGCTCTGCCATCTGAAAGAGTGCACTGCTGCTGGTGATCAATTCGAGGCCCGAAGACGAAGGTCGGGGGTCAAGATTGGGAGACGGCAACGGTGA
Protein sequence
MSWFYSSLIEDKVGEIVGYDTAYEIWEAFRIRIKKDGLSMAQYLAQIKDVVDKFNAIGEPLSYRDHLGYILERLGNEYNPFVTTIQNRTDRLSIADVRNLLIIYEICLEK
QIATDLLNLIQANLANVLSLFKRRFIPSEGHQFFMGSKPKAKLCHLKECTAAGDQFEARRRRSGVKIGRRQR
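
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a reading frame that starts at the first base. The `translate` helper is hypothetical and is not part of CuGenDB. It raises `KeyError` on codons containing characters other than A, C, G, or T.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated over the base order T, C, A, G (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGAGTTGGTTCTACTCCTCACTTATTGAA"))  # MSWFYSSLIE
print(translate("CGGCAACGGTGA"))                    # RQR
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence shown in this record if the assumptions above hold.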