; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:10494961..10497662
RNA-Seq ExpressionMoc03g15710
SyntenyMoc03g15710
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]7.7e-2844.44Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        IISWN+RG+ +  KR+L+K+ L +  PD+V+L ETK + VDR+++   W SR   W+   ++G SGGI V+WN  S+S++DS++  FSVS+ I    G  
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W+SG+YGP     R  FW+EL DL   C   WC+
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium]1.4e-2944.44Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        IISWN+RG+ +  KR+++K+ L +  PD+V+L ETK   +DR+++ S W SR   W+ + + G SGGI+++WN   +S++DS +  FSVS+ IR   G  
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W+SG+YGP     R  FW EL  L  LC  NWCI
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]1.8e-2948.15Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        I++WNVRG+ + SKR  IKD +    PD+V+L ETK  S++ K IKS WSS  I W SLDA G+SGGI+++W++ S S V+ +   FS+S+  + AD F 
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W++GVY P     R LFWQEL DL  LC   W +
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]4.5e-2844.03Show/hide
Query:  ISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQG
        ++WNVRG+ +  K  LIK F+ + +P+VV+L ETKL  +D  I+KS WS+  I W +LDA G + GIL++WN+  +   + +   FS+++    +DGF  
Subjt:  ISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQG

Query:  WISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
        W+SG+YGPS+T    LFWQEL DL+ LC+ +W +
Subjt:  WISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]3.9e-3249.23Show/hide
Query:  VRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQGWISG
        + G++  + + ++K+ L K +PDVV+L ++K+ +V+R ++KS WSS  +GW +L+A GSSGGIL++W EDSI++VDS+   FS+S+  +F  GF GWI+G
Subjt:  VRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQGWISG

Query:  VYGPSSTTGRDLFWQELRDLARLCDANWCI
        VYGPSS   RD FW EL  L  LC+ NWC+
Subjt:  VYGPSSTTGRDLFWQELRDLARLCDANWCI

TrEMBL top hitse value%identityAlignment
A0A6J1CVN2 uncharacterized protein LOC1110146578.9e-3048.15Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        I++WNVRG+ + SKR  IKD +    PD+V+L ETK  S++ K IKS WSS  I W SLDA G+SGGI+++W++ S S V+ +   FS+S+  + AD F 
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W++GVY P     R LFWQEL DL  LC   W +
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

A0A6J1E2G6 uncharacterized protein LOC1110254052.2e-2844.03Show/hide
Query:  ISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQG
        ++WNVRG+ +  K  LIK F+ + +P+VV+L ETKL  +D  I+KS WS+  I W +LDA G + GIL++WN+  +   + +   FS+++    +DGF  
Subjt:  ISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQG

Query:  WISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
        W+SG+YGPS+T    LFWQEL DL+ LC+ +W +
Subjt:  WISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

A0A6P5T1U8 uncharacterized protein LOC1107621456.8e-3044.44Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        IISWN+RG+ +  KR+++K+ L +  PD+V+L ETK   +DR+++ S W SR   W+ + + G SGGI+++WN   +S++DS +  FSVS+ IR   G  
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W+SG+YGP     R  FW EL  L  LC  NWCI
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

A0A803P8A0 Uncharacterized protein2.2e-2845.93Show/hide
Query:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ
        I++WN+RG     KR  IK  +CKA+PD+V+L E K  +VDR+ I S W SR   WI L A+G SGG L++W+   IS++DS++  FS+S+LI       
Subjt:  IISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQ

Query:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
         W SGVYGP S   R +FW EL  L+ +C  +WC+
Subjt:  GWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)5.7e-2941.29Show/hide
Query:  KMVLYHQKHNLCIRAIPSKAIISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIV
        K  L H++H   +  +    IISWN+RG+ +  KR+L+K+ L +  PD+V+L ETK + VDR+++   W SR   W+   ++G SGGI V+WN  S+S++
Subjt:  KMVLYHQKHNLCIRAIPSKAIISWNVRGISAPSKRVLIKDFLCKADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIV

Query:  DSVLDSFSVSMLIRFADGFQGWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI
        DS++  FSVS+ I    G   W+SG+YGP     R+ FW+EL DL   C   WC+
Subjt:  DSVLDSFSVSMLIRFADGFQGWISGVYGPSSTTGRDLFWQELRDLARLCDANWCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGTATCATGGTATGGACGAGCGTTGCAATACACGTGTCTTCTTGGACAACTTATTTTGAGAATTGGAAACTCAATAACAGTATTGGGTGGCACAAATGTG
CTAGAATTGCGCCTTATGGATGGATGTCTGGAAGGTTCTGGTTGGAATTATGAACTCGTAGTTCCCACAGAAGCCATCCGTTTGAACAAAATCAGGAAAGGTAAA
GAAAAAGAATTTGAGGTGATGTTTGATTTTTCCGTTTCCAGTTTTGATAGTGAAACTCAGAACTTTGACTTGGGGGCAATGCAAAGAAAGTCGGGGACAATGACT
GAAGAGGTGGACCCACCTTTAGAATACTATAATATTTTCCAAGAAGGAGAGAATGGCCAAACTCCCATAGATAACGGAATAATGATAGAGTTAGGACAGGAACAC
CAGGATGATCAAATTCAGTGGGTGGAGCAAGTTTCGGAGGAAGACGAAGAGGATGACAATAATTTCCCTTTCACTAGAAAAATGGTTCTTTACCATCAAAAACAT
AATCTGTGCATTAGAGCTATTCCTTCTAAAGCGATAATATCGTGGAATGTAAGAGGGATAAGCGCCCCCTCGAAGAGAGTGCTGATAAAGGACTTCCTTTGCAAG
GCTGATCCTGATGTTGTTTTGCTGCACGAAACTAAATTAGATTCAGTAGATAGAAAGATCATTAAGTCCACTTGGAGTTCTAGGCATATCGGCTGGATTTCTTTA
GATGCGGTGGGTTCTTCTGGTGGTATCTTAGTTATGTGGAACGAAGATAGTATTTCTATAGTAGATTCTGTTTTGGACAGCTTCTCTGTTTCAATGCTGATAAGA
TTTGCGGATGGTTTTCAGGGTTGGATTTCAGGAGTTTATGGACCTTCGTCAACCACGGGCAGAGATTTGTTTTGGCAAGAACTTAGAGATTTAGCTCGTTTATGC
GATGCCAATTGGTGCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGTATCATGGTATGGACGAGCGTTGCAATACACGTGTCTTCTTGGACAACTTATTTTGAGAATTGGAAACTCAATAACAGTATTGGGTGGCACAAATGTG
CTAGAATTGCGCCTTATGGATGGATGTCTGGAAGGTTCTGGTTGGAATTATGAACTCGTAGTTCCCACAGAAGCCATCCGTTTGAACAAAATCAGGAAAGGTAAA
GAAAAAGAATTTGAGGTGATGTTTGATTTTTCCGTTTCCAGTTTTGATAGTGAAACTCAGAACTTTGACTTGGGGGCAATGCAAAGAAAGTCGGGGACAATGACT
GAAGAGGTGGACCCACCTTTAGAATACTATAATATTTTCCAAGAAGGAGAGAATGGCCAAACTCCCATAGATAACGGAATAATGATAGAGTTAGGACAGGAACAC
CAGGATGATCAAATTCAGTGGGTGGAGCAAGTTTCGGAGGAAGACGAAGAGGATGACAATAATTTCCCTTTCACTAGAAAAATGGTTCTTTACCATCAAAAACAT
AATCTGTGCATTAGAGCTATTCCTTCTAAAGCGATAATATCGTGGAATGTAAGAGGGATAAGCGCCCCCTCGAAGAGAGTGCTGATAAAGGACTTCCTTTGCAAG
GCTGATCCTGATGTTGTTTTGCTGCACGAAACTAAATTAGATTCAGTAGATAGAAAGATCATTAAGTCCACTTGGAGTTCTAGGCATATCGGCTGGATTTCTTTA
GATGCGGTGGGTTCTTCTGGTGGTATCTTAGTTATGTGGAACGAAGATAGTATTTCTATAGTAGATTCTGTTTTGGACAGCTTCTCTGTTTCAATGCTGATAAGA
TTTGCGGATGGTTTTCAGGGTTGGATTTCAGGAGTTTATGGACCTTCGTCAACCACGGGCAGAGATTTGTTTTGGCAAGAACTTAGAGATTTAGCTCGTTTATGC
GATGCCAATTGGTGCATATGA
Protein sequenceShow/hide protein sequence
MCVSWYGRALQYTCLLGQLILRIGNSITVLGGTNVLELRLMDGCLEGSGWNYELVVPTEAIRLNKIRKGKEKEFEVMFDFSVSSFDSETQNFDLGAMQRKSGTMT
EEVDPPLEYYNIFQEGENGQTPIDNGIMIELGQEHQDDQIQWVEQVSEEDEEDDNNFPFTRKMVLYHQKHNLCIRAIPSKAIISWNVRGISAPSKRVLIKDFLCK
ADPDVVLLHETKLDSVDRKIIKSTWSSRHIGWISLDAVGSSGGILVMWNEDSISIVDSVLDSFSVSMLIRFADGFQGWISGVYGPSSTTGRDLFWQELRDLARLC
DANWCI