; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:27262345..27267536
RNA-Seq ExpressionMoc08g36680
SyntenyMoc08g36680
Gene Ontology termsGO:0019538 - protein metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74381.1 hypothetical protein VITISV_007944 [Vitis vinifera]7.5e-3851.18Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEF----TIDKETNPQFTQ-WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNET
        LW+SQ LPL+RSLG+ HHL SEN +       T  KET+ Q  + W++NDGLL SWLLG ++E+++ +++ TETA  VW+SL E+LL M+KE E+ L   
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEF----TIDKETNPQFTQ-WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNET

Query:  LLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
        L  +KKG+ ++DEYL++ K IC+     +KPV DL K F +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.8e-3950.3Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       E T+  ET     Q    W++NDGLL SWLLG ++E+++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.6e-3849.7Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       + T+  ET     Q    W++NDGLL SWLLG ++E+++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

RVW93768.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.6e-3849.7Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       E T+  ET     Q    W++NDGLL SWLLG ++++++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

XP_022154021.1 uncharacterized protein LOC111021379 [Momordica charantia]5.3e-5261.11Show/hide
Query:  RLGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKE----TNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKEN
        +L      LWKSQ LPLIR+LG+EHHL  E    +    KE       Q   W NNDGLL SWLLG I+ED+L ++E TETA++VW SLEE LLTM+KEN
Subjt:  RLGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKE----TNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKEN

Query:  EIHLNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVLLSK
        EIHLNE LLTLKKGSL++DEY++K K++C+     KKP+DDLTKVFH+ARGLG  Y+ F+TAMLSKAPYP+YN+FVL  K
Subjt:  EIHLNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVLLSK

TrEMBL top hitse value%identityAlignment
A0A2C9VFN4 Uncharacterized protein2.8e-3850Show/hide
Query:  RLGKGENQLWKSQTLPLIRSLGIEHHL-KSENIYEEFTIDKETNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIH
        +LG     +W+SQ L L RSLG+ HHL K+ +  E+   + +TNP +  W  NDGL++SWLLGTI E++   +    T   VWSSLEEQLL ++ E E  
Subjt:  RLGKGENQLWKSQTLPLIRSLGIEHHL-KSENIYEEFTIDKETNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIH

Query:  LNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
        L   L+T+KKGS ++D +LK+ KSIC+     KKPVDDL KVF +ARGLGS Y+ F+ AM++K PY T+N+FVL
Subjt:  LNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

A0A438C9J9 Retrovirus-related Pol polyprotein from transposon RE14.3e-3950.3Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       E T+  ET     Q    W++NDGLL SWLLG ++E+++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3849.7Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       + T+  ET     Q    W++NDGLL SWLLG ++E+++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

A0A438IAM2 Retrovirus-related Pol polyprotein from transposon RE11.2e-3849.7Show/hide
Query:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL
        LW+SQ LPL+RSLG+ HHL       E T+  ET     Q    W++NDGLL SWLLG ++++++ +++ TETA  VW+SL E+LL M+KE E+ L   L
Subjt:  LWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQ----WTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETL

Query:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL
          +KKG+ ++DEYL++ K IC+     +KPV DL KVF +A+GLG+ Y  F+ AMLSK PYP+YN+FVL
Subjt:  LTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVL

A0A6J1DMG5 uncharacterized protein LOC1110213792.6e-5261.11Show/hide
Query:  RLGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKE----TNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKEN
        +L      LWKSQ LPLIR+LG+EHHL  E    +    KE       Q   W NNDGLL SWLLG I+ED+L ++E TETA++VW SLEE LLTM+KEN
Subjt:  RLGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKE----TNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKEN

Query:  EIHLNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVLLSK
        EIHLNE LLTLKKGSL++DEY++K K++C+     KKP+DDLTKVFH+ARGLG  Y+ F+TAMLSKAPYP+YN+FVL  K
Subjt:  EIHLNETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVLLSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-1025.21Show/hide
Query:  LGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQWTNNDGLLVSWLLGTISEDIL-AMIESTETAQQVWSSLEEQLLTMSKENEIHL
        L K    +W+     L  S G+  H+   +     T  +        W   DGL+  W+ GTI++ +L  +I+   TA+ +W SLE       +   +  
Subjt:  LGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQWTNNDGLLVSWLLGTISEDIL-AMIESTETAQQVWSSLEEQLLTMSKENEIHL

Query:  NETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEF--------VLLSKDMKSSTS----PNFIN
           L T     L+V EY +K+KS+ +       P+ D   V H+  GL   Y      +  K+P+P++ E           LS   KSS S    P+  N
Subjt:  NETLLTLKKGSLTVDEYLKKIKSICE-----KKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEF--------VLLSKDMKSSTS----PNFIN

Query:  IPTTVDTIPSARVTDFKRQNTKINQDQGLEEWKN
        +  TV         ++   N+  N  +G  + KN
Subjt:  IPTTVDTIPSARVTDFKRQNTKINQDQGLEEWKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCCACGACTTGGAAAGGGGGAAAATCAATTGTGGAAATCACAAACTCTTCCACTGATAAGAAGTCTGGGGATTGAGCACCATTTGAAATCTGAAAATAT
TTATGAAGAGTTCACCATCGACAAAGAGACAAATCCACAGTTCACTCAATGGACGAACAATGATGGACTTCTTGTCTCTTGGCTTCTTGGCACAATATCTGAAGATATCC
TTGCAATGATAGAAAGTACTGAAACAGCGCAACAAGTTTGGTCATCCTTGGAAGAGCAATTGCTCACCATGAGCAAAGAAAATGAAATTCACCTAAATGAAACTTTACTC
ACCTTGAAAAAGGGCAGCCTCACCGTGGATGAATACCTAAAGAAGATCAAGAGCATATGCGAGAAGAAACCAGTAGACGATCTTACAAAGGTGTTCCACATTGCTAGAGG
ACTTGGGAGCAATTATCAAGGATTCAAAACAGCCATGTTATCCAAAGCTCCATACCCCACCTACAATGAGTTTGTTCTACTCTCAAAGGACATGAAATCATCAACATCAC
CCAATTTCATAAACATCCCTACTACAGTGGACACCATCCCTTCTGCTAGAGTTACTGATTTCAAGAGGCAGAACACTAAAATCAACCAAGATCAAGGACTAGAAGAGTGG
AAGAATCCCACAAAAATAGTAGAAGACACCACGAAGACAGCAAGCTTCACAATTCATCCTAATGCAGAGATTAATAAAGTCACAACGGCACCTTTGAGAAGAAATTTTAT
GTATGAAAACGAGGAGTTTCTGGAGGAAGGACGAAACAATACCCCAAATGTTGATCAAGACCAACACGATTTTGAATTGGGGACTAATAATGACTCAATACAACATTCTC
TGGGTGAAAACCTACAACCAACTCAAGTTGAGGAAAGAAGTCAAGTTACTGATAGTCACTCCAACATGTTCCAGCAACCTCAAATTGAGGACAGACACCTCAATGGTCTG
AATGAAATTACAACAATTGACTTGCACCAAGAACAAGCAGATATTGAAATCTCAACTGAATCGCCAATTGTCATGGAAAATATACAAAGAAGTGAATGCAGGAATCCACA
ACAGTCCAAGTCACAGGGAGATTATGAAAATTCTATGCCTCTTAACTTTAATGAACTACCTGATATTAGCAATGAGTTGTCTATTGAATTAAATATTTCTTCTGCAGGGA
ATATCCGAGACAACAACGAATGTAGAAACCTTACAGAAACTCAGTCATCACATCCTATGGTTACAAGACAAAAACTCAATAAGGACCCCTCCATTGACCCAATCTTGCAT
CAAGATGTTCAACAACTAAAGACTACCCATCGAAAAGTAACCTACCTAACACTTAACCCAGCTGTAGAACCAAAGGGAGTGTTAGCACCCTACCTCCTAGCTGTCCTTTA
TACGGGACGTTTAGGAGCACTCGACCCTGTTGTTGGTGCGCTTGATTGCTTTTCATTGCGTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCCACGACTTGGAAAGGGGGAAAATCAATTGTGGAAATCACAAACTCTTCCACTGATAAGAAGTCTGGGGATTGAGCACCATTTGAAATCTGAAAATAT
TTATGAAGAGTTCACCATCGACAAAGAGACAAATCCACAGTTCACTCAATGGACGAACAATGATGGACTTCTTGTCTCTTGGCTTCTTGGCACAATATCTGAAGATATCC
TTGCAATGATAGAAAGTACTGAAACAGCGCAACAAGTTTGGTCATCCTTGGAAGAGCAATTGCTCACCATGAGCAAAGAAAATGAAATTCACCTAAATGAAACTTTACTC
ACCTTGAAAAAGGGCAGCCTCACCGTGGATGAATACCTAAAGAAGATCAAGAGCATATGCGAGAAGAAACCAGTAGACGATCTTACAAAGGTGTTCCACATTGCTAGAGG
ACTTGGGAGCAATTATCAAGGATTCAAAACAGCCATGTTATCCAAAGCTCCATACCCCACCTACAATGAGTTTGTTCTACTCTCAAAGGACATGAAATCATCAACATCAC
CCAATTTCATAAACATCCCTACTACAGTGGACACCATCCCTTCTGCTAGAGTTACTGATTTCAAGAGGCAGAACACTAAAATCAACCAAGATCAAGGACTAGAAGAGTGG
AAGAATCCCACAAAAATAGTAGAAGACACCACGAAGACAGCAAGCTTCACAATTCATCCTAATGCAGAGATTAATAAAGTCACAACGGCACCTTTGAGAAGAAATTTTAT
GTATGAAAACGAGGAGTTTCTGGAGGAAGGACGAAACAATACCCCAAATGTTGATCAAGACCAACACGATTTTGAATTGGGGACTAATAATGACTCAATACAACATTCTC
TGGGTGAAAACCTACAACCAACTCAAGTTGAGGAAAGAAGTCAAGTTACTGATAGTCACTCCAACATGTTCCAGCAACCTCAAATTGAGGACAGACACCTCAATGGTCTG
AATGAAATTACAACAATTGACTTGCACCAAGAACAAGCAGATATTGAAATCTCAACTGAATCGCCAATTGTCATGGAAAATATACAAAGAAGTGAATGCAGGAATCCACA
ACAGTCCAAGTCACAGGGAGATTATGAAAATTCTATGCCTCTTAACTTTAATGAACTACCTGATATTAGCAATGAGTTGTCTATTGAATTAAATATTTCTTCTGCAGGGA
ATATCCGAGACAACAACGAATGTAGAAACCTTACAGAAACTCAGTCATCACATCCTATGGTTACAAGACAAAAACTCAATAAGGACCCCTCCATTGACCCAATCTTGCAT
CAAGATGTTCAACAACTAAAGACTACCCATCGAAAAGTAACCTACCTAACACTTAACCCAGCTGTAGAACCAAAGGGAGTGTTAGCACCCTACCTCCTAGCTGTCCTTTA
TACGGGACGTTTAGGAGCACTCGACCCTGTTGTTGGTGCGCTTGATTGCTTTTCATTGCGTATATGA
Protein sequenceShow/hide protein sequence
MVSEPRLGKGENQLWKSQTLPLIRSLGIEHHLKSENIYEEFTIDKETNPQFTQWTNNDGLLVSWLLGTISEDILAMIESTETAQQVWSSLEEQLLTMSKENEIHLNETLL
TLKKGSLTVDEYLKKIKSICEKKPVDDLTKVFHIARGLGSNYQGFKTAMLSKAPYPTYNEFVLLSKDMKSSTSPNFINIPTTVDTIPSARVTDFKRQNTKINQDQGLEEW
KNPTKIVEDTTKTASFTIHPNAEINKVTTAPLRRNFMYENEEFLEEGRNNTPNVDQDQHDFELGTNNDSIQHSLGENLQPTQVEERSQVTDSHSNMFQQPQIEDRHLNGL
NEITTIDLHQEQADIEISTESPIVMENIQRSECRNPQQSKSQGDYENSMPLNFNELPDISNELSIELNISSAGNIRDNNECRNLTETQSSHPMVTRQKLNKDPSIDPILH
QDVQQLKTTHRKVTYLTLNPAVEPKGVLAPYLLAVLYTGRLGALDPVVGALDCFSLRI