CuGenDBv2

Moc09g03620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g03620
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:2857331..2862871
RNA-Seq Expression: Moc09g03620
Synteny: Moc09g03620
Gene Ontology terms:
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e-value | % identity (alignment follows each hit)
KAA0040138.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 5.7e-37 | 58.87
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        M E +K+ M+E  Y  LILN++D+VLRQV++  T YEI  KL  L+  KDLP+K Y+REK F++KM+ SKTL+ENLD+FKKL++EFN LGEK+ AE+EA 
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQ
        I +NSL ++Y+EVK  LKYGRES+  D +I+A+K+KELELQ
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQ

XP_038875093.1 uncharacterized protein LOC120067620 [Benincasa hispida] | 1.2e-37 | 56
Query:  LNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFILLNSLPESYREVKVALK
        L+ +D    + +D  TAY +W KL+ ++L+KDLPNKA+LRE++FTYKM ++K+L++NL++ K+LS EF S+ + IG ENEAFILLNSL ES+++VK A+K
Subjt:  LNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFILLNSLPESYREVKVALK

Query:  YGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKN
        YGRE ITT+AIISAVK +ELELQ  K++    +  F KGK KNNGK ++N
Subjt:  YGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKN

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida] | 3.6e-47 | 61.25
Query:  EKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFILLN
        EKE ++  AYGT++LN+ DSVLRQ++D  TAY +W KL+ ++L+KDLPNKA+LRE++FTYKMD +K+L++NL++FK LSS+F S+G+ IG ENEAFILLN
Subjt:  EKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFILLN

Query:  SLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK
        SLPE++++VK ALKYGRE ITT AIISAV  KELELQ  K++    EG F KG  K  G+
Subjt:  SLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida] | 5.3e-43 | 56.71
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        + E EKE ++  AYGT+ILN++DSVLRQ+MD  T Y +W KL+ ++L+KD PNK +LRE++FTYKMD +K+L++NL++FK+LSSEF S+G+ IG ENEAF
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK
        IL NSLPE++++VK ALKY R+ IT DAIISAV+ KELELQ     N   +   VKG+ +  G+
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK

XP_038890043.1 uncharacterized protein LOC120079747 [Benincasa hispida] | 5.7e-45 | 58.54
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        + E EKE ++  A GT++LN++D+VLRQV++  TAY +W KL+ ++L+KDL NKA+LRE++FTYKMD++K+L++ L++FK+LSSEF S+G  IG ENEAF
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK
        ILLNSLPES+++ K A+KYGRE ITT+AIISAV+ +ELELQ  K+     EG F KGK KN+ K
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGK

TrEMBL top hits | e-value | % identity (alignment follows each hit)
A0A5A7TAZ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.8e-37 | 58.87
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        M E +K+ M+E  Y  LILN++D+VLRQV++  T YEI  KL  L+  KDLP+K Y+REK F++KM+ SKTL+ENLD+FKKL++EFN LGEK+ AE+EA 
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQ
        I +NSL ++Y+EVK  LKYGRES+  D +I+A+K+KELELQ
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQ

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.8e-37 | 46.86
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        + + E+E ++E+AY TLI+N++D+VLRQV++  TA+  W KL +L+  KDLPNK +++EK F++K + +K L ENLD+FKKL++  N  GEK+GAENEA 
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKNQYEDRGK
        IL+NS+ ++Y+EVK  LKYGRE+IT +++I+ +K+KELEL+   + ++ AEG       KN  +D+  +Y   G+
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKNQYEDRGK

A0A5C7GXL9 Sucrose-phosphate phosphatase | 2.3e-36 | 54.84
Query:  EEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLG--EKIGAENEAFI
        E+K+ M EMA GT+ILNLSD+VLR++ D KTA ++W KL++L+L+K L NK YL+E+ F +KMD+SK L +NLDDFKK++ E  + G  EK+  ENEA I
Subjt:  EEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLG--EKIGAENEAFI

Query:  LLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVK
        LLNSLP+S+++VK A+KYGR S++ +  ISA+K+KELEL+  K++  N E  FV+
Subjt:  LLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVK

A0A5C7HB65 gag_pre-integrs domain-containing protein | 4.7e-37 | 51.14
Query:  EEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLG--EKIGAENEAFI
        E+K+ M EMA GT+ILNLSD+VLR++ D KTA ++W KL++L+L+K L NK YL+E+ F +KMD+SK L +NLDDFKK++ +  + G  EK+  ENEA I
Subjt:  EEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLG--EKIGAENEAFI

Query:  LLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGK--FKNNGKDQKNQYEDRGK
        LLNSLP+S+++VK A+KYGR S++ +  ISA+K+KELEL+  K++  N E  FV+G+   KN+  +  N+ + R K
Subjt:  LLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGK--FKNNGKDQKNQYEDRGK

A0A5D3DNU1 Putative gag-pol polyprotein | 2.8e-37 | 48.88
Query:  EEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFIL
        E EK  M EMAY T++L LSD VLR V +  T  E+W KL++L+L+K LPNK Y++EK+F YKMD SK+L ENLD+F+K+  + N++GEK+  EN+A IL
Subjt:  EEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFIL

Query:  LNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKN---NGKDQKNQYEDRGKIR
        LNSLPE+YREVK A+KYGR+S+T   ++ A+KT+ LE+   K+E  + E    +G+ +     GK++  + + +GK R
Subjt:  LNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKN---NGKDQKNQYEDRGKIR

SwissProt top hits | e-value | % identity (alignment follows each hit)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.3e-15 | 28.24
Query:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF
        MK E+   + E A   + L+LSD V+  ++D  TA  IWT+L++L++SK L NK YL+++ +   M        +L+ F  L ++  +LG KI  E++A 
Subjt:  MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAF

Query:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKNQYED--RGKIRPLDLSAI-------------T
        +LLNSLP SY  +   + +G+ +I    + SA+   E   +  K+  +  +    +G+ ++  +   N      RGK +    S +              
Subjt:  ILLNSLPESYREVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKNQYED--RGKIRPLDLSAI-------------T

Query:  DTDNPQKMLSKATGLK
        D  NP+K   + +G K
Subjt:  DTDNPQKMLSKATGLK

Arabidopsis top hits | e-value | % identity
No hits found

Sequences
CDS sequence
ATGAAAGAGGAAGAAAAGGAAGTTATGCAAGAAATGGCATATGGAACTCTGATATTGAATTTGAGTGATAGCGTTCTAAGGCAGGTTATGGATTTAAAAACAGCCTATGA
AATATGGACAAAACTAGATACTCTTTTTTTGTCAAAAGATTTACCGAACAAAGCCTATTTACGGGAAAAATACTTCACGTACAAGATGGATAGCTCTAAAACACTGAGTG
AGAACTTAGATGACTTCAAGAAACTCTCATCAGAATTTAATAGTCTTGGAGAAAAGATAGGTGCTGAGAATGAAGCATTTATTCTTCTGAATTCACTACCGGAATCTTAT
AGAGAAGTAAAAGTTGCTCTAAAGTACGGTAGGGAATCCATAACCACGGATGCGATTATTTCTGCAGTCAAGACAAAGGAGCTAGAATTGCAGGCTGGGAAAAGAGAAAA
CTCAAATGCAGAGGGACATTTTGTAAAAGGAAAATTCAAGAACAATGGAAAAGATCAAAAGAATCAGTACGAAGATAGAGGAAAAATCAGACCGCTAGATTTATCAGCTA
TAACTGATACGGACAATCCGCAGAAAATGCTAAGCAAAGCTACTGGCCTGAAGTGGGGTTGA
mRNA sequence
ATGAAAGAGGAAGAAAAGGAAGTTATGCAAGAAATGGCATATGGAACTCTGATATTGAATTTGAGTGATAGCGTTCTAAGGCAGGTTATGGATTTAAAAACAGCCTATGA
AATATGGACAAAACTAGATACTCTTTTTTTGTCAAAAGATTTACCGAACAAAGCCTATTTACGGGAAAAATACTTCACGTACAAGATGGATAGCTCTAAAACACTGAGTG
AGAACTTAGATGACTTCAAGAAACTCTCATCAGAATTTAATAGTCTTGGAGAAAAGATAGGTGCTGAGAATGAAGCATTTATTCTTCTGAATTCACTACCGGAATCTTAT
AGAGAAGTAAAAGTTGCTCTAAAGTACGGTAGGGAATCCATAACCACGGATGCGATTATTTCTGCAGTCAAGACAAAGGAGCTAGAATTGCAGGCTGGGAAAAGAGAAAA
CTCAAATGCAGAGGGACATTTTGTAAAAGGAAAATTCAAGAACAATGGAAAAGATCAAAAGAATCAGTACGAAGATAGAGGAAAAATCAGACCGCTAGATTTATCAGCTA
TAACTGATACGGACAATCCGCAGAAAATGCTAAGCAAAGCTACTGGCCTGAAGTGGGGTTGA
Protein sequence
MKEEEKEVMQEMAYGTLILNLSDSVLRQVMDLKTAYEIWTKLDTLFLSKDLPNKAYLREKYFTYKMDSSKTLSENLDDFKKLSSEFNSLGEKIGAENEAFILLNSLPESY
REVKVALKYGRESITTDAIISAVKTKELELQAGKRENSNAEGHFVKGKFKNNGKDQKNQYEDRGKIRPLDLSAITDTDNPQKMLSKATGLKWG
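
Note on checking the sequences: the protein sequence above appears to be the direct translation of the CDS under the standard genetic code. The following is a minimal sketch of that check, not part of the database record. The function name `translate` and the two short fragments (the first 48 and the last 15 nucleotides of the CDS above) are chosen here purely for illustration; the codon table is the standard one and is an assumption about how the record was produced.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (an assumption: the record does not state which table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Illustrative fragments copied from the CDS above (both start on a codon boundary).
cds_start = "ATGAAAGAGGAAGAAAAGGAAGTTATGCAAGAAATGGCATATGGAACT"
cds_end = "CTGAAGTGGGGTTGA"

print(translate(cds_start))  # MKEEEKEVMQEMAYGT
print(translate(cds_end))    # LKWG
```

Passing the full CDS (with its line breaks) to `translate` should reproduce the protein sequence in the same way, provided the sequence is pasted without gaps.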