; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g27170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g27170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:20486601..20488953
RNA-Seq ExpressionMoc06g27170
SyntenyMoc06g27170
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74964.1 hypothetical protein VITISV_006810 [Vitis vinifera]6.0e-3537.24Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------
        DDH T + P +E++K W+R+DARL LQI+NSI+ +I+G++N CE VKEL+ YLEFLYSGK N+SR++++CK+FY+ E +                     
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPV------------------------LASQFNSALVGRST
           S D KV+  Q E++ ++SFL+GLP  F+ AK Q+LS SEI +L++ +  +L  E +  +                        +A+   +     S+
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPV------------------------LASQFNSALVGRST

Query:  NSYRVTILEEEFTKFQQYQESLTTSFSNPITVIAETDDT
            V +  +EF KF QYQESL    S P+T +AET  T
Subjt:  NSYRVTILEEEFTKFQQYQESLTTSFSNPITVIAETDDT

XP_021674814.1 uncharacterized protein LOC110660727 [Hevea brasiliensis]6.0e-3545.2Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------
        DDH T+DPP D++++ WLR DARL LQ++NSI  ++I ++N CE VKEL+ YL+FLYSGK N+SRI+++CK+FY++E +                     
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTN
           S D KV+ AQ EQLA++SFL GLP  ++ AK Q+LS SEI +L + +T VL  E +Q   +   +SAL+ R+ N
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTN

XP_022850817.1 uncharacterized protein LOC111372670 [Olea europaea var. sylvestris]7.1e-3645.25Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------
        DDH  +DPPKD++K++WLR DA+L LQI+NSI+ ++IG++N CE VKEL+ YLEFLYS K NVSRI+E+C++FY++E +                     
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNSY
           S D KV+  Q EQ+A++SFL GLP  F+ +K Q+LS SEIP L+  ++ VL  E + P+   Q N+ LV +   +Y
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNSY

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]2.1e-3535Show/hide
Query:  MDDHTTEDPPKD-ESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD-------------------
        MDDH TEDPPKD + KK WLR+DARL LQIKNSIE +IIG+V+ CESVKELL++L+FLYSGKE V R+FE+C  F+++E                     
Subjt:  MDDHTTEDPPKD-ESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD-------------------

Query:  -----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKS-QPVLASQFNSALVGRSTN--------------------
             S D KV+  Q E++A++ FL GL P F +AK Q+LS S+IP+L+ A+T VL +E S   V   Q +SAL  ++ N                    
Subjt:  -----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKS-QPVLASQFNSALVGRSTN--------------------

Query:  -------------------------------------SYRVTILEEEFTKFQQYQESLTTSFSN------------------------------------
                                                VTI  +EF KFQ YQESL  S S+                                    
Subjt:  -------------------------------------SYRVTILEEEFTKFQQYQESLTTSFSN------------------------------------

Query:  ----------PITVIAETDDTTSPVLGSGTVHLSKSLSLT
                  P   +   D +TS VLGSGT+HL+ S SL+
Subjt:  ----------PITVIAETDDTTSPVLGSGTVHLSKSLSLT

XP_038882618.1 uncharacterized protein LOC120073824 [Benincasa hispida]9.9e-3848.62Show/hide
Query:  MDDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQ-------------------SEFD-
        MDDH TE+ P D +KK W R+D+R++LQIKNSI+ +I+ +VN CESVK+LL+YL+FLYSGKEN++R+F++CK+ YQ                   +EF+ 
Subjt:  MDDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQ-------------------SEFD-

Query:  ----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNSYR
            S D KV +A+ E+L I+SFL+GL P++++AKDQ+LS   I +LE+AYT +L  EK+Q V +   +S L+GR TN YR
Subjt:  ----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNSYR

TrEMBL top hitse value%identityAlignment
A0A438GWA1 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-3540.27Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSF------------YQSEFD---------
        DDH TE+PP D ++K W+++DARL LQ+KNSI  DI+G+++ CE VKEL+ YL+FLYSGK NVSR++++  +F            Y  +F          
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSF------------YQSEFD---------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVG------RSTNSYRVTILEEEFTKFQQY
           S D +V+ AQ EQ+A++SFL GLP  F+ AK Q+LS S+I +L++ ++ VL  E    V +SQ  + L+       + +++  VT+  EEF+K+ QY
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVG------RSTNSYRVTILEEEFTKFQQY

Query:  QESLTTSFSNPITVIAETDDT
        Q++L    S P++ +AE+  T
Subjt:  QESLTTSFSNPITVIAETDDT

A0A5N5JJ99 Uncharacterized protein9.4e-3432.25Show/hide
Query:  DDHTTEDPPKDES-KKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD--------------------
        DDH  E+PP DE+ KK W+R+DARL LQI+NSI+ +I+G++N CE VKEL+ YLEFLYSGK N+SR++++CK+FY++E +                    
Subjt:  DDHTTEDPPKDES-KKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD--------------------

Query:  ----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLH------VEKSQPVLASQFNSALVGRSTNSYRVTILEEEFT----
            S D KV+  Q E++A++SFL GLP   +  K Q+LS  EI +L++ ++ +L       ++ +  VL ++      GR  N    +   + +T    
Subjt:  ----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLH------VEKSQPVLASQFNSALVGRSTNSYRVTILEEEFT----

Query:  ------------------KFQQYQE----------SLTTSFSNPITVIAETDDTTSPVLGSGTVHLSKSLSLTFDLTTKKLLLGGAFTNDD-------FL
                          K Q + +          S T+S S+  T++   DD     L   ++ +  S   T    T+      +  N D       +L
Subjt:  ------------------KFQQYQE----------SLTTSFSNPITVIAETDDTTSPVLGSGTVHLSKSLSLTFDLTTKKLLLGGAFTNDD-------FL

Query:  VYFIVFTSTEVLPSNTSPS----TPDLSLPTITQVYSHRQPPTDSCPIPVASSSVDPGTSDDLPIALSK
        +Y +   +T    S T        P L+ P I QVYS RQ  TD+CP P    S DP    DLPI L K
Subjt:  VYFIVFTSTEVLPSNTSPS----TPDLSLPTITQVYSHRQPPTDSCPIPVASSSVDPGTSDDLPIALSK

A5BSK6 Integrase catalytic domain-containing protein2.9e-3537.24Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------
        DDH T + P +E++K W+R+DARL LQI+NSI+ +I+G++N CE VKEL+ YLEFLYSGK N+SR++++CK+FY+ E +                     
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPV------------------------LASQFNSALVGRST
           S D KV+  Q E++ ++SFL+GLP  F+ AK Q+LS SEI +L++ +  +L  E +  +                        +A+   +     S+
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPV------------------------LASQFNSALVGRST

Query:  NSYRVTILEEEFTKFQQYQESLTTSFSNPITVIAETDDT
            V +  +EF KF QYQESL    S P+T +AET  T
Subjt:  NSYRVTILEEEFTKFQQYQESLTTSFSNPITVIAETDDT

Q6L3Q0 Polyprotein, putative8.5e-3548.28Show/hide
Query:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------
        DDH  +DPP D++KKAWLR+DARLILQI NSI+ +++G+VN CE VKEL+ YLE+LYSGK N+SRI+E+ K+FY+SE +                     
Subjt:  DDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD---------------------

Query:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGR
           S D KV+ AQ EQ+AI+SFL GLP  F+ AK Q+LS SEI +L+  ++ VL  E +    A+Q  + LV +
Subjt:  ---SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGR

U5CZW1 Uncharacterized protein (Fragment)2.1e-3345Show/hide
Query:  DDHTTEDPPKD--ESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD-------------------
        DDH T+DPP++  +S+K WLR DARL LQI+NSI+ ++I ++N CE VKEL+ YLEFLYSGK+N+SRI+++CK+FY++E                     
Subjt:  DDHTTEDPPKD--ESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFD-------------------

Query:  -----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNS
             S D KV+  Q EQ+AI+SFL GL   FD AK Q+LS S++ +L+  +T VL  E +    ++  NSALV R+ ++
Subjt:  -----SNDAKVRLAQHEQLAIISFLLGLPPRFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGATCACACAACTGAAGATCCTCCTAAGGATGAGAGCAAGAAAGCTTGGTTGAGAAATGATGCTCGACTGATTCTACAAATCAAGAATTCGATCGAGGGT
GACATTATTGGCATGGTCAACGAGTGTGAGTCTGTTAAAGAGTTGCTTAAATACTTAGAATTCCTTTATTCTGGAAAAGAAAATGTTAGTCGAATATTTGAAATC
TGCAAGAGCTTCTACCAATCTGAGTTTGATAGTAATGATGCAAAAGTTCGGCTTGCCCAACACGAACAACTAGCAATTATAAGTTTTCTTCTTGGTCTTCCACCT
AGATTTGATGTGGCCAAAGACCAATTACTCTCTCATTCAGAAATTCCAGCTTTAGAGAAGGCATACACTTGGGTACTTCACGTTGAGAAGTCACAACCCGTCTTG
GCATCTCAGTTTAACAGTGCTTTGGTTGGACGTAGTACAAATTCATACCGAGTTACAATTCTTGAGGAAGAGTTTACTAAGTTCCAACAATATCAAGAGTCATTG
ACAACATCATTTTCTAATCCGATTACCGTCATCGCTGAGACAGATGATACCACTTCTCCTGTTCTTGGTTCTGGAACAGTTCATCTTTCCAAATCTCTTTCATTG
ACCTTTGATCTTACGACGAAGAAACTATTGTTAGGGGGAGCATTCACAAATGATGACTTTCTTGTCTATTTCATTGTCTTTACTTCTACTGAAGTGCTTCCTAGC
AATACATCTCCCTCTACGCCTGATCTTTCTCTCCCCACTATTACTCAAGTTTATTCTCACCGACAACCTCCTACAGACTCATGCCCTATACCAGTAGCTTCTTCG
TCCGTGGATCCAGGAACGAGTGATGACCTTCCTATTGCCCTTTCGAAAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGATCACACAACTGAAGATCCTCCTAAGGATGAGAGCAAGAAAGCTTGGTTGAGAAATGATGCTCGACTGATTCTACAAATCAAGAATTCGATCGAGGGT
GACATTATTGGCATGGTCAACGAGTGTGAGTCTGTTAAAGAGTTGCTTAAATACTTAGAATTCCTTTATTCTGGAAAAGAAAATGTTAGTCGAATATTTGAAATC
TGCAAGAGCTTCTACCAATCTGAGTTTGATAGTAATGATGCAAAAGTTCGGCTTGCCCAACACGAACAACTAGCAATTATAAGTTTTCTTCTTGGTCTTCCACCT
AGATTTGATGTGGCCAAAGACCAATTACTCTCTCATTCAGAAATTCCAGCTTTAGAGAAGGCATACACTTGGGTACTTCACGTTGAGAAGTCACAACCCGTCTTG
GCATCTCAGTTTAACAGTGCTTTGGTTGGACGTAGTACAAATTCATACCGAGTTACAATTCTTGAGGAAGAGTTTACTAAGTTCCAACAATATCAAGAGTCATTG
ACAACATCATTTTCTAATCCGATTACCGTCATCGCTGAGACAGATGATACCACTTCTCCTGTTCTTGGTTCTGGAACAGTTCATCTTTCCAAATCTCTTTCATTG
ACCTTTGATCTTACGACGAAGAAACTATTGTTAGGGGGAGCATTCACAAATGATGACTTTCTTGTCTATTTCATTGTCTTTACTTCTACTGAAGTGCTTCCTAGC
AATACATCTCCCTCTACGCCTGATCTTTCTCTCCCCACTATTACTCAAGTTTATTCTCACCGACAACCTCCTACAGACTCATGCCCTATACCAGTAGCTTCTTCG
TCCGTGGATCCAGGAACGAGTGATGACCTTCCTATTGCCCTTTCGAAAAGGTAA
Protein sequenceShow/hide protein sequence
MDDHTTEDPPKDESKKAWLRNDARLILQIKNSIEGDIIGMVNECESVKELLKYLEFLYSGKENVSRIFEICKSFYQSEFDSNDAKVRLAQHEQLAIISFLLGLPP
RFDVAKDQLLSHSEIPALEKAYTWVLHVEKSQPVLASQFNSALVGRSTNSYRVTILEEEFTKFQQYQESLTTSFSNPITVIAETDDTTSPVLGSGTVHLSKSLSL
TFDLTTKKLLLGGAFTNDDFLVYFIVFTSTEVLPSNTSPSTPDLSLPTITQVYSHRQPPTDSCPIPVASSSVDPGTSDDLPIALSKR