; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g05340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g05340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:4481078..4488029
RNA-Seq ExpressionMoc07g05340
SyntenyMoc07g05340
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG49237.1 hypothetical protein EZV62_025112 [Acer yangbiense]7.8e-5349.15Show/hide
Query:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR
        +F+++KF G GDF +W+ K+KA+L QQK  KAI+ P KLP S+  E+K+ M E+A GT+ILNLSD+VLR+I D KTA +VW KLE+++ +K L NK+YL+
Subjt:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR

Query:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------
        E+ F +KMD +K    NLDDFKKM+ E  N G  EK+ DENEA ILLNSLP ++++VK A+KYGR S++ +  ISA+K++EL+L                
Subjt:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------

Query:  ----ITD---LWHKRLSHISTKGLQELEKQGVLPQD
            I+D   LWH RL H+S +G+ EL K+ +L  D
Subjt:  ----ITD---LWHKRLSHISTKGLQELEKQGVLPQD

XP_038880370.1 uncharacterized protein LOC120072018 [Benincasa hispida]1.1e-5161.05Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        MA  R+E+EKF GK DFEL K KIKAVLGQQK   AI DPTK P+++   +KET+E  AYGT+ILN++DS+LRQI+D  TAY +W KL  I+ +KDLPNK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITT
         + RE+FFTYK+D  KS +DNL++FK++SSEF+++ + I +ENEAFILLNSLP+++++VK  LKYGRE ITT
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITT

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]6.8e-5761.83Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        M   R+E+EKF  K DFELWK KIK VL +QKA  AI DP K P+ +   EKET+E  AYGT++LN+ DSVLRQI+D  TAY +W KL  I+ +KDLPNK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL
         +LRE+FFTYKMD  KS +DNL++FK +SS+F+++G+ IG+ENEAFILLNSLP+ +++VK ALKYGRE ITT AIISAV  +EL+L
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida]6.8e-5758.42Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        MA  ++E+EKF  K DFEL K KIKAVLGQQKA  AI DP+K P+++   EKET+E  AYGT+ILN++DSVLRQI+D  T Y +W KL  I+ +KD PNK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITDLWHKRLSHIST
         +LRE+FFTYKMD TKS +DNL++FK++SSEF+++G+ IG+ENEAFIL NSLP+ +++VK ALKY R+ IT DAIISAV+ +EL+L   + ++ +   S 
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITDLWHKRLSHIST

Query:  KG
        KG
Subjt:  KG

XP_038890043.1 uncharacterized protein LOC120079747 [Benincasa hispida]4.0e-5757.97Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        MA  R+E+EKF  K DFELWKVKIK VLGQQKA  AI DP K P+++   EKET+E  A GT++LN++D+VLRQ+I+  TAY +W KL  I+ +KDL NK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITDLWHKRLSHIST
         +LRE+FFTYKMD  KS +D L++FK++SSEF+++G  IG+ENEAFILLNSLP+++++ K A+KYGRE ITT+AIISAV+ REL+L           +S 
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITDLWHKRLSHIST

Query:  KGLQELE
        KG QE E
Subjt:  KGLQELE

TrEMBL top hitse value%identityAlignment
A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-5049.74Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        M   +FE+EKF G GDF LW  +I A+LG QKA KA++DP +LP ++   E+ET+EE+AY TLI+N++D+VLRQ+I+  TA+  W KL++++  KDLPNK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITD
        ++++EK F++K +  K+  +NLD+FKK+++     GEK+G ENEA IL+NS+   Y+EVK  LKYGRE+IT +++I+ +K++EL+L T+
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITD

A0A5C7GXL9 Sucrose-phosphate phosphatase3.8e-5349.15Show/hide
Query:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR
        +F+++KF G GDF +W+ K+KA+L QQK  KAI+ P KLP S+  E+K+ M E+A GT+ILNLSD+VLR+I D KTA +VW KLE+++ +K L NK+YL+
Subjt:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR

Query:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------
        E+ F +KMD +K    NLDDFKKM+ E  N G  EK+ DENEA ILLNSLP ++++VK A+KYGR S++ +  ISA+K++EL+L                
Subjt:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------

Query:  ----ITD---LWHKRLSHISTKGLQELEKQGVLPQD
            I+D   LWH RL H+S +G+ EL K+ +L  D
Subjt:  ----ITD---LWHKRLSHISTKGLQELEKQGVLPQD

A0A5C7HB65 gag_pre-integrs domain-containing protein8.7e-5043.89Show/hide
Query:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR
        +F+++KF G GDF +W+ K+KA+L QQK  KAI+ P KLP S+  E+K+ M E+A GT+ILNLSD+VLR+I D KTA +VW KLE+++ +K L NK+YL+
Subjt:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR

Query:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------
        E+ F +KMD +K    NLDDFKKM+ +  N G  EK+ DENEA ILLNSLP ++++VK A+KYGR S++ +  ISA+K++EL+L                
Subjt:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL----------------

Query:  ------------------------------ITD---LWHKRLSHISTKGLQELEKQGVLPQD
                                      I+D   LWH RL H+S +G+ EL K+ +L +D
Subjt:  ------------------------------ITD---LWHKRLSHISTKGLQELEKQGVLPQD

A0A5C7I661 Uncharacterized protein1.2e-4854.89Show/hide
Query:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR
        +F+++KF G GDF +W+ K+KA+L QQK  KAI+ P KLP S+  E+K+ M E+A GT+ILNLSD+VLR+I D KTA +VW KLE+++ +K L NK+YL+
Subjt:  RFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLR

Query:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL
        E+ F +KMD +K    NLDDFKKM+ E  N G  EK+ DENEA ILLNSLP ++++VK A+KYGR S++ +  ISA+K++EL+L
Subjt:  EKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLG--EKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL

A0A5D3DNU1 Putative gag-pol polyprotein6.2e-4852.69Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        MA  RFE+ KF G GDF LW+ KI+A+L Q K +K I D  +LP +I   EK  M+E+AY T++L LSD VLR + +  T  E+W KLE+++ +K LPNK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL
        +Y++EKFF YKMD +KS  +NLD+F+K+  +  N+GEK+ DEN+A ILLNSLP+ YREVK A+KYGR+S+T   ++ A+KTR L++
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-2533.88Show/hide
Query:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK
        M+  ++E+ KF G   F  W+ +++ +L QQ   K +   +K P ++KAE+   ++E A   + L+LSD V+  IID  TA  +WT+LE+++ SK L NK
Subjt:  MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNK

Query:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRE
        +YL+++ +   M    +F  +L+ F  + ++  NLG KI +E++A +LLNSLP +Y  +   + +G+ +I    + SA+   E
Subjt:  VYLREKFFTYKMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTAGCAAGATTTGAAATGGAGAAGTTCGTTGGAAAGGGCGACTTTGAATTGTGGAAAGTCAAAATTAAAGCTGTACTTGGACAGCAAAAGGCTTCGAAGGCAAT
TCAAGATCCTACCAAGTTACCCCAGTCGATCAAAGCAGAGGAAAAGGAAACAATGGAGGAAATCGCATATGGAACTTTAATTCTGAATTTAAGTGACAGTGTTTTAAGGC
AAATTATAGATTTGAAGACAGCATATGAAGTCTGGACAAAGCTAGAAACAATTTTTTCTTCTAAAGATCTCCCAAATAAAGTATATCTCAGGGAGAAATTCTTTACCTAC
AAGATGGATAATACAAAATCATTCTCCGATAATCTTGATGACTTCAAGAAGATGTCATCAGAATTCAAGAACCTAGGAGAAAAGATTGGGGATGAAAATGAAGCCTTCAT
TCTCTTAAACTCTCTACCAAAAGCCTACAGAGAAGTCAAGGTAGCATTAAAATATGGCAGAGAGTCAATAACAACGGATGCAATCATATCTGCTGTCAAGACTAGAGAAC
TCAAGCTAATTACAGATTTGTGGCACAAGCGGTTGTCTCACATCAGCACAAAAGGACTACAAGAACTTGAAAAACAAGGAGTTCTACCTCAAGATTACACTATCAGACAC
TTGTGGCACTTTCTAACGAACTCCTTAGAGTCTGATCCAAGGTTGGACAGTAATAGCCTTGGTGGACCATCTTCACTGCCATGGATCGAGTTCGTGGACTTCTCTCATGA
CATAGTTCACCTTGTCCAGGTCAAGGCATCTGAGAAGTGGGAGGGAAATCCTCGCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGTAGCAAGATTTGAAATGGAGAAGTTCGTTGGAAAGGGCGACTTTGAATTGTGGAAAGTCAAAATTAAAGCTGTACTTGGACAGCAAAAGGCTTCGAAGGCAAT
TCAAGATCCTACCAAGTTACCCCAGTCGATCAAAGCAGAGGAAAAGGAAACAATGGAGGAAATCGCATATGGAACTTTAATTCTGAATTTAAGTGACAGTGTTTTAAGGC
AAATTATAGATTTGAAGACAGCATATGAAGTCTGGACAAAGCTAGAAACAATTTTTTCTTCTAAAGATCTCCCAAATAAAGTATATCTCAGGGAGAAATTCTTTACCTAC
AAGATGGATAATACAAAATCATTCTCCGATAATCTTGATGACTTCAAGAAGATGTCATCAGAATTCAAGAACCTAGGAGAAAAGATTGGGGATGAAAATGAAGCCTTCAT
TCTCTTAAACTCTCTACCAAAAGCCTACAGAGAAGTCAAGGTAGCATTAAAATATGGCAGAGAGTCAATAACAACGGATGCAATCATATCTGCTGTCAAGACTAGAGAAC
TCAAGCTAATTACAGATTTGTGGCACAAGCGGTTGTCTCACATCAGCACAAAAGGACTACAAGAACTTGAAAAACAAGGAGTTCTACCTCAAGATTACACTATCAGACAC
TTGTGGCACTTTCTAACGAACTCCTTAGAGTCTGATCCAAGGTTGGACAGTAATAGCCTTGGTGGACCATCTTCACTGCCATGGATCGAGTTCGTGGACTTCTCTCATGA
CATAGTTCACCTTGTCCAGGTCAAGGCATCTGAGAAGTGGGAGGGAAATCCTCGCTTGTAA
Protein sequenceShow/hide protein sequence
MAVARFEMEKFVGKGDFELWKVKIKAVLGQQKASKAIQDPTKLPQSIKAEEKETMEEIAYGTLILNLSDSVLRQIIDLKTAYEVWTKLETIFSSKDLPNKVYLREKFFTY
KMDNTKSFSDNLDDFKKMSSEFKNLGEKIGDENEAFILLNSLPKAYREVKVALKYGRESITTDAIISAVKTRELKLITDLWHKRLSHISTKGLQELEKQGVLPQDYTIRH
LWHFLTNSLESDPRLDSNSLGGPSSLPWIEFVDFSHDIVHLVQVKASEKWEGNPRL