; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:1328337..1330852
RNA-Seq ExpressionMoc01g01970
SyntenyMoc01g01970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8550233.1 hypothetical protein F0562_001917 [Nyssa sinensis]1.3e-1735.88Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------
        H+NE   ++N+L  M + I++E++A+ LL+SL DSWET+ + +SNS  D  L    +  +   EETRRKL +     +    WK ++  E  N       
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------

Query:  NTTNVVKGEEWIEEFLACVETNSF-VYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL
        NTT   +G+E +   L+C E     V +  EWVV+T AS H T ++ +F+S+ AG++G++++GN S S++
Subjt:  NTTNVVKGEEWIEEFLACVETNSF-VYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL

KAF7129215.1 hypothetical protein RHSIM_Rhsim10G0143100 [Rhododendron simsii]3.8e-1428.8Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKL------------------------------
        H+NE+  I N+L  M I  D+E++A+ LL+SL ++WET+ + VSNS  D  +  S +  + L EETRRK                               
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKL------------------------------

Query:  -----GKMPASTS--------------EAEMWKFKE-----HFEKGNNTTNVVKGEEWIEEFLACVET-NSFVYHSSEWVVNTAASTHVTSDRCWFSSFA
             G+ P+                 E    KFKE     H +K    T  V  +   E  + C E   + V   + WV+++ AS HVTS   +F+S+ 
Subjt:  -----GKMPASTS--------------EAEMWKFKE-----HFEKGNNTTNVVKGEEWIEEFLACVET-NSFVYHSSEWVVNTAASTHVTSDRCWFSSFA

Query:  AGNYGSMRIGN-------------VSVSELNDDGPNSEFVGVCWKLKRES
         G++G +RIGN             +S  +L+D+G N+ F    WKL ++S
Subjt:  AGNYGSMRIGN-------------VSVSELNDDGPNSEFVGVCWKLKRES

KAF8378616.1 hypothetical protein HHK36_029964 [Tetracentron sinense]4.1e-1633.33Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGNNTTNVVK
        H+NE   ++N+L  M + ID+E++A+ LL+SL D WET+ + VSNS  +  L  S + ++   EETRRK       T  A+    +       ++ +VV 
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGNNTTNVVK

Query:  GEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSELN--DDGPNSEFVGVCWKLKRESRVVATCRKRFRELVKS
             EE+L   +         EWVV+TAAS H T ++ +F+S+  G++G++++GN S S++    DG  + F     KL   S VVA  +  F  L K+
Subjt:  GEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSELN--DDGPNSEFVGVCWKLKRESRVVATCRKRFRELVKS

Query:  HRRI
        H ++
Subjt:  HRRI

KAF8387595.1 hypothetical protein HHK36_026248 [Tetracentron sinense]1.8e-1634.71Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------
        H+NE   ++N+L  M + +D+E++A+ LL+SL DSWET+ + VSNS  +  L  S + ++   EETRRK     A T  A+    +    KG+       
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------

Query:  -NTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL
         NTT  +  E+ +       E         EWVV+TAAS H T ++ +F+S+ AG++G++++GN S S++
Subjt:  -NTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL

RVW91307.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.1e-1629.6Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK----------------LGKMPASTSEAEMWK
        H+NE+  I+N+L  M I  D+E++A+ LL+SL +SWET+ + VSNS  D  +  S +  + L EETRRK                 GK      EA + K
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK----------------LGKMPASTSEAEMWK

Query:  F-------KEHFEKGNNTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGN---------------------
                +E+ EK  N T VV   E +   +    + + +   ++WV+++ AS HVTS   +F+S+  G++G++R+GN                     
Subjt:  F-------KEHFEKGNNTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGN---------------------

Query:  ----------------VSVSELNDDGPNSEFVGVCWKLKRESRVVATCRK
                        +S  +L+D+G N+ F    WKL + S VVA   K
Subjt:  ----------------VSVSELNDDGPNSEFVGVCWKLKRESRVVATCRK

TrEMBL top hitse value%identityAlignment
A0A2N9FAH0 Integrase catalytic domain-containing protein9.9e-1635.05Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-LGKMPASTSEAEMWKFKEHFEKGNNTTNVV
        H++E  D++N+L  M + +D+E++A+ LL+SL DSWET+ +++SNS  +  L+ + + D+   EETRRK +GK  A     E         KG N+    
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-LGKMPASTSEAEMWKFKEHFEKGNNTTNVV

Query:  KGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYG-------SMRIGNVSVSELNDDGPNSEFVGVCWKLKRESRVVA
        KG +  +                EWV+++AAS HVT  R +F+S+ AGN G        MR+  +SVS L+ +G  S      WKL + S V A
Subjt:  KGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYG-------SMRIGNVSVSELNDDGPNSEFVGVCWKLKRESRVVA

A0A2N9GH12 Uncharacterized protein2.9e-1533.04Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-----LGKMPASTSEAEMWKFKEHFEKGNNT
        H+NE   I N+L  + I+ D+EI+A+ +L SL +SWE M++AV+NS G + L++  I D  L EE RR+       K  A  +E      K   EKG  T
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-----LGKMPASTSEAEMWKFKEHFEKGNNT

Query:  TNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSELNDDGP-----NSEFVGVC------WKLKRESRV
        T VV  E+ +   +   +         EWVVN+AA+ HV   +  F+++ AG++G++++GN S S++   G      N  F  +       WKL +   V
Subjt:  TNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSELNDDGP-----NSEFVGVC------WKLKRESRV

Query:  VA---TCRKRFRELVKSHRRISASKGT
        VA    C   +R  VK+ ++   + GT
Subjt:  VA---TCRKRFRELVKSHRRISASKGT

A0A2N9IPN0 Integrase catalytic domain-containing protein4.9e-1536.2Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-LGKMPASTSEAEMWKFKEHFEKGNNTTNVV
        H++E  D++N+L  M + +D+E++A+ LL+SL DSWET+ +++SNS  +  L+ + + D+   EETRRK +GK  A     E         KG N+    
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK-LGKMPASTSEAEMWKFKEHFEKGNNTTNVV

Query:  KGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL
        +GE+         E    V    EWV+++AAS HVT  R +F+S+ AGN G +++GN S +++
Subjt:  KGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL

A0A438I3N7 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-1629.6Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK----------------LGKMPASTSEAEMWK
        H+NE+  I+N+L  M I  D+E++A+ LL+SL +SWET+ + VSNS  D  +  S +  + L EETRRK                 GK      EA + K
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK----------------LGKMPASTSEAEMWK

Query:  F-------KEHFEKGNNTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGN---------------------
                +E+ EK  N T VV   E +   +    + + +   ++WV+++ AS HVTS   +F+S+  G++G++R+GN                     
Subjt:  F-------KEHFEKGNNTTNVVKGEEWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGN---------------------

Query:  ----------------VSVSELNDDGPNSEFVGVCWKLKRESRVVATCRK
                        +S  +L+D+G N+ F    WKL + S VVA   K
Subjt:  ----------------VSVSELNDDGPNSEFVGVCWKLKRESRVVATCRK

A0A5J5C4A8 gag_pre-integrs domain-containing protein6.2e-1835.88Show/hide
Query:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------
        H+NE   ++N+L  M + I++E++A+ LL+SL DSWET+ + +SNS  D  L    +  +   EETRRKL +     +    WK ++  E  N       
Subjt:  HINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGN-------

Query:  NTTNVVKGEEWIEEFLACVETNSF-VYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL
        NTT   +G+E +   L+C E     V +  EWVV+T AS H T ++ +F+S+ AG++G++++GN S S++
Subjt:  NTTNVVKGEEWIEEFLACVETNSF-VYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSEL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-0624.31Show/hide
Query:  SHINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK------------------------LGKMPA
        SH+N    ++ +L  + +KI+EE KA+ LL SL  S++ +   + +  G  ++    +  A L  E  RK                         G+  A
Subjt:  SHINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRK------------------------LGKMPA

Query:  STSEAEMWKFK----------EHFE-------KGNNTTNVVKGEE--------------WIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSF
                K +           HF+       KG   T+  K ++              +I E   C+  +      SEWVV+TAAS H T  R  F  +
Subjt:  STSEAEMWKFK----------EHFE-------KGNNTTNVVKGEE--------------WIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSF

Query:  AAGNYGSMRIGNVSVSEL
         AG++G++++GN S S++
Subjt:  AAGNYGSMRIGNVSVSEL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCCACATTAATGAGCTCACTGATATATTGAACAAACTAAAAGGGATGTGCATCAAAATTGATGAGGAGATCAAGGCTATGAGGCTGCTGACATCTTTG
TCTGACAGTTGGGAGACGATGAAGATCGCAGTGTCGAATTCATTGGGGGATAATAGTCTGAGATTTTCAGCTATTTGTGATGCTGCCTTATTTGAGGAAACCAGG
AGGAAATTGGGTAAAATGCCTGCATCTACTTCAGAGGCAGAAATGTGGAAGTTTAAAGAACATTTTGAGAAGGGGAACAACACTACAAATGTTGTAAAAGGAGAA
GAATGGATTGAAGAGTTTCTAGCTTGTGTTGAGACGAACTCATTTGTATACCATTCTTCAGAGTGGGTAGTGAACACTGCAGCATCAACGCATGTTACTTCAGAC
AGATGTTGGTTCTCATCCTTTGCTGCAGGTAATTATGGCTCAATGAGGATAGGGAATGTGAGTGTCTCCGAGCTAAATGATGATGGCCCCAATAGTGAGTTTGTT
GGGGTTTGCTGGAAGCTCAAGAGGGAATCCAGGGTAGTGGCTACATGCCGCAAGAGATTTCGCGAACTAGTGAAGTCGCATAGGCGAATTAGTGCATCGAAGGGT
ACGAGTTCGGTTTCTAGCGTGGCGACATGCTTGCGTAGGAGTGCCAAGCCTAATGCAGAGAGAAAATCTTCTTTCAAAGTGGATTTAATATCTAGTCTTTATCTT
TTGATTCTTCTTCTCGAACTTGATGTGAAGATTGAGATACTTGTTCTTGAATTGGGATTGAAGTTCTATATGCTTCGAAAGTCTTCAAGTCATCAAAGTCTTCAG
GTTGGTCTTCAAATCTTCAAGTCTTTAGAGGGAGTCTTCAATTTGATGTATGAATTCTCTCAATTCTTCACTTATGAACCCTTGAAAATAAGGGAGACACCCCTA
TTTATAGAGCTTGCCACAAGCTATAGTCGGCTTTGGGCTTGGGTGATTTTGGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCCCACATTAATGAGCTCACTGATATATTGAACAAACTAAAAGGGATGTGCATCAAAATTGATGAGGAGATCAAGGCTATGAGGCTGCTGACATCTTTG
TCTGACAGTTGGGAGACGATGAAGATCGCAGTGTCGAATTCATTGGGGGATAATAGTCTGAGATTTTCAGCTATTTGTGATGCTGCCTTATTTGAGGAAACCAGG
AGGAAATTGGGTAAAATGCCTGCATCTACTTCAGAGGCAGAAATGTGGAAGTTTAAAGAACATTTTGAGAAGGGGAACAACACTACAAATGTTGTAAAAGGAGAA
GAATGGATTGAAGAGTTTCTAGCTTGTGTTGAGACGAACTCATTTGTATACCATTCTTCAGAGTGGGTAGTGAACACTGCAGCATCAACGCATGTTACTTCAGAC
AGATGTTGGTTCTCATCCTTTGCTGCAGGTAATTATGGCTCAATGAGGATAGGGAATGTGAGTGTCTCCGAGCTAAATGATGATGGCCCCAATAGTGAGTTTGTT
GGGGTTTGCTGGAAGCTCAAGAGGGAATCCAGGGTAGTGGCTACATGCCGCAAGAGATTTCGCGAACTAGTGAAGTCGCATAGGCGAATTAGTGCATCGAAGGGT
ACGAGTTCGGTTTCTAGCGTGGCGACATGCTTGCGTAGGAGTGCCAAGCCTAATGCAGAGAGAAAATCTTCTTTCAAAGTGGATTTAATATCTAGTCTTTATCTT
TTGATTCTTCTTCTCGAACTTGATGTGAAGATTGAGATACTTGTTCTTGAATTGGGATTGAAGTTCTATATGCTTCGAAAGTCTTCAAGTCATCAAAGTCTTCAG
GTTGGTCTTCAAATCTTCAAGTCTTTAGAGGGAGTCTTCAATTTGATGTATGAATTCTCTCAATTCTTCACTTATGAACCCTTGAAAATAAGGGAGACACCCCTA
TTTATAGAGCTTGCCACAAGCTATAGTCGGCTTTGGGCTTGGGTGATTTTGGACTAA
Protein sequenceShow/hide protein sequence
MNSHINELTDILNKLKGMCIKIDEEIKAMRLLTSLSDSWETMKIAVSNSLGDNSLRFSAICDAALFEETRRKLGKMPASTSEAEMWKFKEHFEKGNNTTNVVKGE
EWIEEFLACVETNSFVYHSSEWVVNTAASTHVTSDRCWFSSFAAGNYGSMRIGNVSVSELNDDGPNSEFVGVCWKLKRESRVVATCRKRFRELVKSHRRISASKG
TSSVSSVATCLRRSAKPNAERKSSFKVDLISSLYLLILLLELDVKIEILVLELGLKFYMLRKSSSHQSLQVGLQIFKSLEGVFNLMYEFSQFFTYEPLKIRETPL
FIELATSYSRLWAWVILD