; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g13080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g13080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:10227955..10229362
RNA-Seq ExpressionMoc06g13080
SyntenyMoc06g13080
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.8e-2236.93Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L ++L VPNI KNL+SVSKL  +NNI +EF  N C  KD  +GKV+L G L+D LY+L+                                  N    V 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K++ H RLGHP+++V   +++ C + +   DN +FCE C+YGK H+L F   SS A      VH  +W  AP+M+
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.3e-2237.5Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L +VL VP I KNL+SVSKLT +NNI++EF AN C  KD  +G+ +L G L+D LY+L+ V        SP        +NK  C             + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K++ H +LGHP+++V   ++K CN+ +   D  +FCE C++GK H+L F   SS        +H+ +W  AP++S
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense]9.6e-2336.93Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L +VL VP I KNL+SVSKLT +NNI +EF A+ C  KD  +GK +L G L++ LY+++ V                S +NK  C+            + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K++ H +LGHP+++V   ++KHCN+     D   FCE C++GK H+L F    S A      +H  +W  AP+MS
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]1.0e-2438.64Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L NVL VP I KNL+SVSKLT +NN  +EF AN C  KD  +GK +L G LRD LY+L++V               +S  NK  C+            + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K+  H +LGHP+++V   ++K+CN+    +D  +FCE C++GK H+L F   SS A      +H+ +W  AP++S
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

TXG52359.1 hypothetical protein EZV62_021528 [Acer yangbiense]1.5e-2335.87Show/hide
Query:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKT--ICSLSLPKLS----
        +L N+L  P+I KNL+S+S+LT +NN+++EF++  CL KD   G V+L G L+D LY L+ + A I  +   V     S+ +    +C++   K      
Subjt:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKT--ICSLSLPKLS----

Query:  -NSINVVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         N  ++V +K+T H  L HPSS+V   ++ +CN  +K++ +  FC+  +YGKSH+L + + +S A+     VH  +W LAP+ S
Subjt:  -NSINVVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

TrEMBL top hitse value%identityAlignment
A0A2K3NEN7 Copia-like polyprotein (Fragment)6.1e-2337.5Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L +VL VP I KNL+SVSKLT +NNI++EF AN C  KD  +G+ +L G L+D LY+L+ V        SP        +NK  C             + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K++ H +LGHP+++V   ++K CN+ +   D  +FCE C++GK H+L F   SS        +H+ +W  AP++S
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment)4.7e-2336.93Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L +VL VP I KNL+SVSKLT +NNI +EF A+ C  KD  +GK +L G L++ LY+++ V                S +NK  C+            + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K++ H +LGHP+++V   ++KHCN+     D   FCE C++GK H+L F    S A      +H  +W  AP+MS
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

A0A445H1W7 Retrovirus-related Pol polyprotein from transposon RE15.0e-2538.64Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L NVL VP I KNL+SVSKLT +NN  +EF AN C  KD  +GK +L G LRD LY+L++V               +S  NK  C+            + 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         K+  H +LGHP+++V   ++K+CN+    +D  +FCE C++GK H+L F   SS A      +H+ +W  AP++S
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

A0A5C7H6K5 Integrase catalytic domain-containing protein7.2e-2435.87Show/hide
Query:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKT--ICSLSLPKLS----
        +L N+L  P+I KNL+S+S+LT +NN+++EF++  CL KD   G V+L G L+D LY L+ + A I  +   V     S+ +    +C++   K      
Subjt:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKT--ICSLSLPKLS----

Query:  -NSINVVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
         N  ++V +K+T H  L HPSS+V   ++ +CN  +K++ +  FC+  +YGKSH+L + + +S A+     VH  +W LAP+ S
Subjt:  -NSINVVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

A0A803P5A9 Uncharacterized protein9.1e-2739.66Show/hide
Query:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESA--ANKTICSLSLPKLSNSIN
        +L++VL VP +AK L+S+SKLT +N+I +EF ++ C  KD  + KV+L G L+D LY+LN+        S PV    +SA   +  +CS S+ + SN ++
Subjt:  MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESA--ANKTICSLSLPKLSNSIN

Query:  VVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS
            KD  H RLGHPSS++ + ++   N+P+  ++N +FC+ C+YGKSH L F L +S+A      +H  +W  AP+ S
Subjt:  VVVAKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-1026.11Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV
        L N+L VPNI KNL+SV +L   N + +EF   S   KD+ +G  +L G  +DELY          ++S PV      ++  T                 
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVV

Query:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNF--CETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMSFD
           + H RLGHP+  +  +++ + +L + ++ +  F  C  C   KS+ + F   +  +     ++++ +W  +P++S D
Subjt:  AKDTCHCRLGHPSSQVCRNLVKHCNLPMKVHDNVNF--CETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMSFD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-1128.89Show/hide
Query:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGA-AIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVV
        L  VL VPNI KNL+SV +L   N + +EF   S   KD+ +G  +L G  +DELY      + A+   +SP             CS    K ++S    
Subjt:  LENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGA-AIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVV

Query:  VAKDTCHCRLGHPSSQVCRNLVKHCNLP-MKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMSFD
            + H RLGHPS  +  +++ + +LP +     +  C  C   KSH + F   +  ++    ++++ +W  +P++S D
Subjt:  VAKDTCHCRLGHPSSQVCRNLVKHCNLP-MKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMSFD

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein8.2e-0432.81Show/hide
Query:  HCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIW
        H RL H S +    LVK   L      ++ FCE C YGK+H ++F            +VH+ +W
Subjt:  HCRLGHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGAAAATGTGCTTTGCGTACCTAACATAGCTAAAAATCTAGTTAGTGTGTCCAAACTCACTAAAAACAATAACATATACCTTGAATTTCATGCTAATTCTTGTCT
TGCAAAGGATATACGTTCGGGCAAGGTGGTACTTATAGGAGATCTTAGAGATGAGCTTTATCGCCTCAATACAGTTGGAGCAGCCATTAGGAGTACTTCGAGTCCAGTTG
ACTGTGGCCTGGAGTCGGCTGCTAATAAAACTATTTGTTCTTTGTCTCTTCCCAAATTATCTAATAGTATAAATGTTGTGGTAGCCAAGGACACTTGTCATTGTCGACTT
GGACATCCGTCTTCTCAAGTTTGTAGAAATTTAGTTAAACATTGTAATCTGCCAATGAAAGTTCATGATAATGTCAACTTTTGTGAAACATGCAAATATGGTAAATCTCA
TGTCCTGTCTTTCCCTCTGTTTAGTTCACAAGCTAATGCTTCATTTATGTTCGTGCATGCTGGTATATGGGAACTTGCACCTGTTATGTCATTTGATCGGAAGCAAAACG
ATCCAAGTGGGAAGGGAGATTGGACCCCACCTCAGTGCAGAGACCTTCTGGACCACAGGTCGAGAAGATATCTCGACCTCTCTCCCGGTTCCCATGCCAATGGACTTGCA
CTATCACCTGGGCCCAAAAACGATATAGGAACCCACAACATGCAAAAGACCTGGGAAATCCGACTATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGAAAATGTGCTTTGCGTACCTAACATAGCTAAAAATCTAGTTAGTGTGTCCAAACTCACTAAAAACAATAACATATACCTTGAATTTCATGCTAATTCTTGTCT
TGCAAAGGATATACGTTCGGGCAAGGTGGTACTTATAGGAGATCTTAGAGATGAGCTTTATCGCCTCAATACAGTTGGAGCAGCCATTAGGAGTACTTCGAGTCCAGTTG
ACTGTGGCCTGGAGTCGGCTGCTAATAAAACTATTTGTTCTTTGTCTCTTCCCAAATTATCTAATAGTATAAATGTTGTGGTAGCCAAGGACACTTGTCATTGTCGACTT
GGACATCCGTCTTCTCAAGTTTGTAGAAATTTAGTTAAACATTGTAATCTGCCAATGAAAGTTCATGATAATGTCAACTTTTGTGAAACATGCAAATATGGTAAATCTCA
TGTCCTGTCTTTCCCTCTGTTTAGTTCACAAGCTAATGCTTCATTTATGTTCGTGCATGCTGGTATATGGGAACTTGCACCTGTTATGTCATTTGATCGGAAGCAAAACG
ATCCAAGTGGGAAGGGAGATTGGACCCCACCTCAGTGCAGAGACCTTCTGGACCACAGGTCGAGAAGATATCTCGACCTCTCTCCCGGTTCCCATGCCAATGGACTTGCA
CTATCACCTGGGCCCAAAAACGATATAGGAACCCACAACATGCAAAAGACCTGGGAAATCCGACTATCCTGA
Protein sequenceShow/hide protein sequence
MLENVLCVPNIAKNLVSVSKLTKNNNIYLEFHANSCLAKDIRSGKVVLIGDLRDELYRLNTVGAAIRSTSSPVDCGLESAANKTICSLSLPKLSNSINVVVAKDTCHCRL
GHPSSQVCRNLVKHCNLPMKVHDNVNFCETCKYGKSHVLSFPLFSSQANASFMFVHAGIWELAPVMSFDRKQNDPSGKGDWTPPQCRDLLDHRSRRYLDLSPGSHANGLA
LSPGPKNDIGTHNMQKTWEIRLS