; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G035280 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G035280
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationCmU531Chr02:13290400..13295505
RNA-Seq ExpressionCmUC02G035280
SyntenyCmUC02G035280
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143616.1 uncharacterized protein LOC111013476 [Momordica charantia]8.9e-0660.87Show/hide
Query:  LNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        L KF +D SQ  + +YI YEIGTRFKDYR++LY++YKK+ DL +AR
Subjt:  LNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

XP_022159083.1 uncharacterized protein LOC111025525 [Momordica charantia]9.5e-0836.43Show/hide
Query:  MSSSNEENIVDTSNTPTNSQIRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNKFDM
        MSS +E    DT    T +Q  G T+G  L RV+    G+I + W    G+PVG  +  F+SEI                              L KF +
Subjt:  MSSSNEENIVDTSNTPTNSQIRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNKFDM

Query:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        D SQ  + +YI YEIGTRFKDYR++L+++YKK  D  +AR
Subjt:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]2.0e-1337.76Show/hide
Query:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK
        S  N ++ V  +T+N  T ++  +RG ++GV L +   AT GRI ++W P  GKP+G +A++F+ EI                            + LN+
Subjt:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK

Query:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        FD+D SQ  I++YI YEIG RFKDYR  LY++Y+K  D V+AR
Subjt:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]2.0e-1337.76Show/hide
Query:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK
        S  N ++ V  +T+N  T ++  +RG ++GV L +   AT GRI ++W P  GKP+G +A++F+ EI                            + LN+
Subjt:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK

Query:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        FD+D SQ  I++YI YEIG RFKDYR  LY++Y+K  D V+AR
Subjt:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]2.0e-1337.76Show/hide
Query:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK
        S  N ++ V  +T+N  T ++  +RG ++GV L +   AT GRI ++W P  GKP+G +A++F+ EI                            + LN+
Subjt:  SSSNEENIV--DTSNTPTNSQ--IRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNK

Query:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        FD+D SQ  I++YI YEIG RFKDYR  LY++Y+K  D V+AR
Subjt:  FDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

TrEMBL top hitse value%identityAlignment
A0A2N9EEK2 Reverse transcriptase domain-containing protein3.1e-0430.48Show/hide
Query:  IVDTSNTPTNSQI-------RGVTQGVGLYRVIEATGG-RIPISWDPYPGKPVGKVANIFSSEIRTLNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQ
        +V +++TP++S +       RG+T+G+G+  +++  G   +P S D     PVGK A    S++  LN+FD+ C  ++I K +    G R  D++++L+ 
Subjt:  IVDTSNTPTNSQI-------RGVTQGVGLYRVIEATGG-RIPISWDPYPGKPVGKVANIFSSEIRTLNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQ

Query:  YYKKL
         YK++
Subjt:  YYKKL

A0A438JBT2 Uncharacterized protein6.9e-0426.87Show/hide
Query:  PTNSQIRGVTQGVGLYRVIEATGGR-IPISWDPYPGKPVGKVANIFSSEI------------------------------------RTLNKFDMDCSQLQ
        P    +RG T+GV L ++IEA GG+ +PI+  P  GK +GK     S+EI                                        KF +D +Q  
Subjt:  PTNSQIRGVTQGVGLYRVIEATGGR-IPISWDPYPGKPVGKVANIFSSEI------------------------------------RTLNKFDMDCSQLQ

Query:  IRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        ++K +E ++  RF+++R  L++++KK   +V+A+
Subjt:  IRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

A0A6J1CQT5 uncharacterized protein LOC1110134764.3e-0660.87Show/hide
Query:  LNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        L KF +D SQ  + +YI YEIGTRFKDYR++LY++YKK+ DL +AR
Subjt:  LNKFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

A0A6J1D6S9 uncharacterized protein LOC1110174612.8e-0565Show/hide
Query:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        D SQL + +YI YEIGTRFKDYR++LY++YKK+ DL +AR
Subjt:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

A0A6J1DXU5 uncharacterized protein LOC1110255254.6e-0836.43Show/hide
Query:  MSSSNEENIVDTSNTPTNSQIRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNKFDM
        MSS +E    DT    T +Q  G T+G  L RV+    G+I + W    G+PVG  +  F+SEI                              L KF +
Subjt:  MSSSNEENIVDTSNTPTNSQIRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEI----------------------------RTLNKFDM

Query:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR
        D SQ  + +YI YEIGTRFKDYR++L+++YKK  D  +AR
Subjt:  DCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGACGAAAACGCTTGTCGTCGCAGGGTATTGCAACAAAAACATTTTGTGTCGCAGGGTTTTCGACGATAACTTTCAGCAATTTTCATCGGCTGCCAATCGCCGATTT
CATCCATGGGAAGCTCAGCTGCTGTCGTTCTCTCCATTCGCCACCGTCGCTCCCTCCATTCGCCGTCGTCGACGGATTTCCGGTAGAGAACACGAGATTGAGCTACAGAA
CAGAAATCTCCATCATGATCTTGAGATGCATGAAGCACAATTTCCATATTGGTTTAAAGATAAAAGATGATGTAGATTCAACTGTAGTTTGTGATGACGGTGATTTGATT
GAGATAGTTTTAGATTTTGATGAACGAGTGGAAGACAAATTAGAAGACGATGAGAATAAGGGTGACGAGGAAGATGGAGAAGTTGAAGAAGATGATGGGGATGAAGTTGA
GGACAATGATGAAAGGATGTCATCAAGTAATGAAGAGAACATTGTAGACACTAGTAACACTCCCACAAATAGCCAAATTCGTGGTGTTACACAAGGAGTCGGATTATATC
GTGTGATTGAGGCAACTGGAGGAAGAATACCTATCTCATGGGATCCTTACCCCGGGAAACCAGTTGGGAAAGTTGCGAATATTTTTAGCAGTGAGATTCGTACACTGAAT
AAATTTGACATGGATTGCTCTCAACTGCAAATCAGAAAGTACATTGAATACGAGATTGGTACTCGCTTTAAGGACTATAGATCAAGATTGTACCAGTATTATAAAAAATT
GGGTGATCTGGTCCAAGCTCGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGACGAAAACGCTTGTCGTCGCAGGGTATTGCAACAAAAACATTTTGTGTCGCAGGGTTTTCGACGATAACTTTCAGCAATTTTCATCGGCTGCCAATCGCCGATTT
CATCCATGGGAAGCTCAGCTGCTGTCGTTCTCTCCATTCGCCACCGTCGCTCCCTCCATTCGCCGTCGTCGACGGATTTCCGGTAGAGAACACGAGATTGAGCTACAGAA
CAGAAATCTCCATCATGATCTTGAGATGCATGAAGCACAATTTCCATATTGGTTTAAAGATAAAAGATGATGTAGATTCAACTGTAGTTTGTGATGACGGTGATTTGATT
GAGATAGTTTTAGATTTTGATGAACGAGTGGAAGACAAATTAGAAGACGATGAGAATAAGGGTGACGAGGAAGATGGAGAAGTTGAAGAAGATGATGGGGATGAAGTTGA
GGACAATGATGAAAGGATGTCATCAAGTAATGAAGAGAACATTGTAGACACTAGTAACACTCCCACAAATAGCCAAATTCGTGGTGTTACACAAGGAGTCGGATTATATC
GTGTGATTGAGGCAACTGGAGGAAGAATACCTATCTCATGGGATCCTTACCCCGGGAAACCAGTTGGGAAAGTTGCGAATATTTTTAGCAGTGAGATTCGTACACTGAAT
AAATTTGACATGGATTGCTCTCAACTGCAAATCAGAAAGTACATTGAATACGAGATTGGTACTCGCTTTAAGGACTATAGATCAAGATTGTACCAGTATTATAAAAAATT
GGGTGATCTGGTCCAAGCTCGCTAA
Protein sequenceShow/hide protein sequence
MRRKRLSSQGIATKTFCVAGFSTITFSNFHRLPIADFIHGKLSCCRSLHSPPSLPPFAVVDGFPVENTRLSYRTEISIMILRCMKHNFHIGLKIKDDVDSTVVCDDGDLI
EIVLDFDERVEDKLEDDENKGDEEDGEVEEDDGDEVEDNDERMSSSNEENIVDTSNTPTNSQIRGVTQGVGLYRVIEATGGRIPISWDPYPGKPVGKVANIFSSEIRTLN
KFDMDCSQLQIRKYIEYEIGTRFKDYRSRLYQYYKKLGDLVQAR