CuGenDBv2

Moc01g04010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g04010
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon Ty3-I Gag-Pol polyprotein isoform X1
Genome location: chr1:2598449..2601761
RNA-Seq Expression: Moc01g04010
Synteny: Moc01g04010
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
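
The genome location string above can be split into chromosome, start, and end with a short helper. This is only a sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr1:2598449..2601761")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 3313 bp on chr1.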


Homology
GenBank top hits: e value, % identity, alignment
KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa]; e value 1.4e-14; identity 31.1%
Query:  SDKYLGLLLDMAGKKVIPVEGDSSLVDKVAENATVLSPRSSTNRLLMVEGVVENLQWNMVEIKQILSVSVDKIDNLQEQQDQPR----------------
        ++K+LG    M  ++  P   +    ++ AE   VLSP++++ RLL +E  VE ++  +  + Q L       +  QE Q++ R                
Subjt:  SDKYLGLLLDMAGKKVIPVEGDSSLVDKVAENATVLSPRSSTNRLLMVEGVVENLQWNMVEIKQILSVSVDKIDNLQEQQDQPR----------------

Query:  ---KRNGEDVQE---PHQRYNNP-RRNRMTNRRRGTPFESSSDDDEYQEWEM--DHRG-----NQRHNQSRPCRFK-------AKKRGFRLVGA------
           + +  DVQE   P Q Y NP  RN+   +    P + SS DDE QE  +   +RG     ++R  +    + K        K+  F  + A      
Subjt:  ---KRNGEDVQE---PHQRYNNP-RRNRMTNRRRGTPFESSSDDDEYQEWEM--DHRG-----NQRHNQSRPCRFK-------AKKRGFRLVGA------

Query:  -----VRDKSTKGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
                   +G+R++ADYI+EFH LGAR NL EN+Q+Q+A F+GGL  DIKE++++QP  FL EAI+    +EE    R K
Subjt:  -----VRDKSTKGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]; e value 2.4e-14; identity 61.11%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +G RS+A+YIEEFHRL ARTNL EN+Q+QVA FVGGL  DIKE++++QP  FL EAI+    +EE I  R K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]; e value 5.3e-14; identity 59.72%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +G R++A+YIEEFHRL ARTNL EN+Q+QVA FVGGL  DIKE++++QP  FL EAI+    +EE I  R K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

XP_031745468.1 uncharacterized protein LOC116405837 [Cucumis sativus]; e value 5.3e-14; identity 59.72%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +G R++A+YIEEFHRL ARTNL EN+Q+QVA FVGGL  DIKE++++QP  FL EAI+    +EE I  R K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

XP_031745523.1 uncharacterized protein LOC116405899 [Cucumis sativus]; e value 9.0e-14; identity 55.56%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +G+R++ +YIEEFHRL ARTNL EN+Q+Q+A FVGGL  DIKE++++QP+ FL EAI+    +EE I  + K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

TrEMBL top hits: e value, % identity, alignment
A0A5A7U7S6 E3 ubiquitin-protein ligase SHPRH isoform X4; e value 2.8e-13; identity 50.67%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFKYQY
        +G RS+A+YIEEFHRL  RTNL EN+ +Q+A F+GGL  DIKE++++QP  FL EAI+    ++E I    +Y++
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFKYQY

A0A5D3C3X9 Reverse transcriptase; e value 6.8e-15; identity 31.1%
Query:  SDKYLGLLLDMAGKKVIPVEGDSSLVDKVAENATVLSPRSSTNRLLMVEGVVENLQWNMVEIKQILSVSVDKIDNLQEQQDQPR----------------
        ++K+LG    M  ++  P   +    ++ AE   VLSP++++ RLL +E  VE ++  +  + Q L       +  QE Q++ R                
Subjt:  SDKYLGLLLDMAGKKVIPVEGDSSLVDKVAENATVLSPRSSTNRLLMVEGVVENLQWNMVEIKQILSVSVDKIDNLQEQQDQPR----------------

Query:  ---KRNGEDVQE---PHQRYNNP-RRNRMTNRRRGTPFESSSDDDEYQEWEM--DHRG-----NQRHNQSRPCRFK-------AKKRGFRLVGA------
           + +  DVQE   P Q Y NP  RN+   +    P + SS DDE QE  +   +RG     ++R  +    + K        K+  F  + A      
Subjt:  ---KRNGEDVQE---PHQRYNNP-RRNRMTNRRRGTPFESSSDDDEYQEWEM--DHRG-----NQRHNQSRPCRFK-------AKKRGFRLVGA------

Query:  -----VRDKSTKGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
                   +G+R++ADYI+EFH LGAR NL EN+Q+Q+A F+GGL  DIKE++++QP  FL EAI+    +EE    R K
Subjt:  -----VRDKSTKGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

A0A5D3DGR0 Reverse transcriptase; e value 1.4e-12; identity 52.78%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +G R  A+YIEEFHRLG RTNL E +++ ++WFVGGL  D+KE++++QP   L EAI     +EE I NR K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X1; e value 2.0e-11; identity 51.39%
Query:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK
        +GS+ +A+YIEEFHRLGAR NL EN+Q+Q+A F+GGL  DIKE++++     L EAI+    +EE +  R K
Subjt:  KGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFK

A0A6J1DLQ6 uncharacterized protein LOC111022320; e value 7.0e-12; identity 65.52%
Query:  ARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFKYQY
        ARTNLGEN+ YQVA  +GGL  DI+ER+QVQ IG+L+EAIAT + IEEQ  N++K+QY
Subjt:  ARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFLDEAIATTIIIEEQIGNRFKYQY

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGTCCTTAGGCTTGATGTTCCGATGTCAAAAAGCGATGGATAATGCTGGTTTGAGTCCAAATAAGGGTGGGGTCACCATCGGGACGCAGCAGAAGCGCCACTGCCACTG
CAAGAACGCCAGAACCGCCCTGGGTTTGGGTTCAAGACACCGTTTGAGACACCGGAATTGGCTGGAAACGCGCCGCTGCTGCCGTCGACGATTGCCGAAGGGGTTTCGAA
GTCGCACCGTCGCTGAAGTTAAAATGCTGCCGTTGCTCGCGAACCGCCGCCGATGGAGATCACCGAGGGGTCGCGCTGCTGCTGGTTCGAGGAGACGTTGCTGCTACCGT
TGCCGCCGTGAGAGCGACAAATATCTTGGGCTCCTTTTGGATATGGCCGGCAAGAAAGTTATTCCAGTTGAAGGGGATTCTTCTCTCGTCGACAAAGTGGCGGAAAATGC
AACCGTCCTCTCTCCACGATCATCAACCAATCGTTTACTGATGGTTGAAGGTGTTGTAGAGAATTTGCAGTGGAATATGGTAGAGATTAAGCAAATCTTGAGTGTTTCGG
TGGACAAGATTGACAACTTGCAAGAACAACAAGATCAGCCAAGAAAGAGGAATGGTGAGGATGTTCAAGAACCTCATCAACGTTACAACAATCCACGCCGAAATCGTATG
ACCAACCGACGTAGGGGCACTCCATTTGAGTCCTCTAGTGATGATGATGAGTATCAAGAATGGGAAATGGATCATAGGGGCAACCAACGACATAATCAAAGTCGACCTTG
TCGCTTTAAAGCTAAAAAGAGGGGCTTCCGCTTGGTGGGAGCAGTTAGAGACAAATCGACAAAGGGAAGCCGATCGATTGCAGACTACATTGAGGAGTTTCACCGATTGG
GGGCCAGAACAAATTTAGGGGAGAATGACCAATACCAAGTGGCATGGTTCGTAGGAGGCCTCTGGATTGACATCAAAGAGCGGCTGCAAGTGCAACCCATTGGATTCTTG
GATGAAGCCATTGCCACAACTATCATTATTGAAGAACAAATTGGCAATCGGTTTAAGTACCAATATTAG
mRNA sequence
ATGTCCTTAGGCTTGATGTTCCGATGTCAAAAAGCGATGGATAATGCTGGTTTGAGTCCAAATAAGGGTGGGGTCACCATCGGGACGCAGCAGAAGCGCCACTGCCACTG
CAAGAACGCCAGAACCGCCCTGGGTTTGGGTTCAAGACACCGTTTGAGACACCGGAATTGGCTGGAAACGCGCCGCTGCTGCCGTCGACGATTGCCGAAGGGGTTTCGAA
GTCGCACCGTCGCTGAAGTTAAAATGCTGCCGTTGCTCGCGAACCGCCGCCGATGGAGATCACCGAGGGGTCGCGCTGCTGCTGGTTCGAGGAGACGTTGCTGCTACCGT
TGCCGCCGTGAGAGCGACAAATATCTTGGGCTCCTTTTGGATATGGCCGGCAAGAAAGTTATTCCAGTTGAAGGGGATTCTTCTCTCGTCGACAAAGTGGCGGAAAATGC
AACCGTCCTCTCTCCACGATCATCAACCAATCGTTTACTGATGGTTGAAGGTGTTGTAGAGAATTTGCAGTGGAATATGGTAGAGATTAAGCAAATCTTGAGTGTTTCGG
TGGACAAGATTGACAACTTGCAAGAACAACAAGATCAGCCAAGAAAGAGGAATGGTGAGGATGTTCAAGAACCTCATCAACGTTACAACAATCCACGCCGAAATCGTATG
ACCAACCGACGTAGGGGCACTCCATTTGAGTCCTCTAGTGATGATGATGAGTATCAAGAATGGGAAATGGATCATAGGGGCAACCAACGACATAATCAAAGTCGACCTTG
TCGCTTTAAAGCTAAAAAGAGGGGCTTCCGCTTGGTGGGAGCAGTTAGAGACAAATCGACAAAGGGAAGCCGATCGATTGCAGACTACATTGAGGAGTTTCACCGATTGG
GGGCCAGAACAAATTTAGGGGAGAATGACCAATACCAAGTGGCATGGTTCGTAGGAGGCCTCTGGATTGACATCAAAGAGCGGCTGCAAGTGCAACCCATTGGATTCTTG
GATGAAGCCATTGCCACAACTATCATTATTGAAGAACAAATTGGCAATCGGTTTAAGTACCAATATTAG
Protein sequence
MSLGLMFRCQKAMDNAGLSPNKGGVTIGTQQKRHCHCKNARTALGLGSRHRLRHRNWLETRRCCRRRLPKGFRSRTVAEVKMLPLLANRRRWRSPRGRAAAGSRRRCCYR
CRRESDKYLGLLLDMAGKKVIPVEGDSSLVDKVAENATVLSPRSSTNRLLMVEGVVENLQWNMVEIKQILSVSVDKIDNLQEQQDQPRKRNGEDVQEPHQRYNNPRRNRM
TNRRRGTPFESSSDDDEYQEWEMDHRGNQRHNQSRPCRFKAKKRGFRLVGAVRDKSTKGSRSIADYIEEFHRLGARTNLGENDQYQVAWFVGGLWIDIKERLQVQPIGFL
DEAIATTIIIEEQIGNRFKYQY
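
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene model; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino = CODON_TABLE[cds[pos:pos + 3]]
        if amino == "*":  # stop codon ends the protein
            break
        protein.append(amino)
    return "".join(protein)


# First five and last four codons of the CDS listed above.
print(translate("ATGTCCTTAGGCTTG"))
print(translate("TACCAATATTAG"))
```

The two fragments translate to MSLGL and YQY, matching the start and end of the protein sequence. Passing the full CDS to `translate` and comparing the result with the protein sequence gives a complete check.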