; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g43580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g43580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:33523184..33524313
RNA-Seq ExpressionMoc08g43580
SyntenyMoc08g43580
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65728.1 hypothetical protein VITISV_015033 [Vitis vinifera]2.6e-2539.27Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   VVKE    +L+K L   YEKPSAN K+ L  K FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM
         +L SLP+SWE M+ AVSNS G+  LK++ I D +L+EE R +         G  +G  S L  + KGKG  K   K   +NN ++  N++
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM

CAN73240.1 hypothetical protein VITISV_035336 [Vitis vinifera]1.7e-2439.68Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   VVKE    +L+K L   YEKPSAN K+ L  K FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR
         +L SLP+SWE M+ AVSNS G+  LK++ I D +L+EE RR+         G  +G  S L  + +G+G  K +  G+   RN+  NR
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR

CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera]1.3e-2439.68Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   VVKE    +L+K L   YEKPSAN K+ L TK FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR
         +L SLP+SWE M+ AVSNS G+  LK++ I D +L+EE RR+         G  +G  S L  + +G+G  + +  G+   RN+  NR
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]7.4e-2539.52Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W+ +D QV+  IR+ L   V   V KE   + L+K L D YEKPSAN K+FL  K F++ MEEG  V +H+NE   I+N+L  +  + ++EV+A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARR-KLGKMS-ATTLGAEN-GIESVLVAQFKGKGKMKYNGKQEHRNNRE-NRWNLMRRSDI
         LL SLP+SWE M+ AVSNS G   LKF  + D +L EE RR   G+ S ++    EN G +     Q +G+ K + NGK + ++ +    WN  +    
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARR-KLGKMS-ATTLGAEN-GIESVLVAQFKGKGKMKYNGKQEHRNNRE-NRWNLMRRSDI

Query:  VVVGHRKASM
          V   KAS+
Subjt:  VVVGHRKASM

RVX10218.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.7e-2538.22Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   +VKE    +L+K L D YEKPSAN K+ L  K FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM
         +L SLP+SWE+M+  VSNS G+   K++ I D +L+EE RRK         G  +G  S L  + +G+G  K   K   +NN ++  N++
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM

TrEMBL top hitse value%identityAlignment
A0A0D3AEM1 CCHC-type domain-containing protein1.2e-2542.47Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M    W  +D QV+  IR+ L   V   VVKE   + L+K L D YEKPSAN+K+FL  K F++ MEEG  V +HINE   I+N+L  +  + E+EV+A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNREN
         LL SLP+SWE M+ AVSNS G   LKF+ + D +L+EE RR +    A+T  A N               ++  G+   RNNR N
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNREN

A0A0D3CS45 Uncharacterized protein3.6e-2541.94Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M    W  +D QV+  IR+ L   V   V KE I + L+K L D YEKPSAN K+FL  K F++ MEEG  V +H+NE   I+N+L  +  + E+EV+A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNREN
         LL SLP+SWE M+ AVSNS G   LKF+ + D +L+EE RR +    A+T  A N               ++  G+   RNNR N
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNREN

A0A438JMM5 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-2538.22Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   +VKE    +L+K L D YEKPSAN K+ L  K FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM
         +L SLP+SWE+M+  VSNS G+   K++ I D +L+EE RRK         G  +G  S L  + +G+G  K   K   +NN ++  N++
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM

A5BGX3 Uncharacterized protein6.1e-2539.68Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   VVKE    +L+K L   YEKPSAN K+ L TK FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR
         +L SLP+SWE M+ AVSNS G+  LK++ I D +L+EE RR+         G  +G  S L  + +G+G  + +  G+   RN+  NR
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYN--GKQEHRNNRENR

A5BJK9 Integrase catalytic domain-containing protein1.2e-2539.27Show/hide
Query:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W  +D QV+  IR+ L   V   VVKE    +L+K L   YEKPSAN K+ L  K FN+ M E  SV  H+NE   I N+L  +    ++E++A+
Subjt:  MTDKTWNEIDEQVVANIRMAL--GVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM
         +L SLP+SWE M+ AVSNS G+  LK++ I D +L+EE R +         G  +G  S L  + KGKG  K   K   +NN ++  N++
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLM

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.8e-0422.7Show/hide
Query:  DKTWNEIDEQVVANI--RMALGVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAMRL
        D +W + +    + I   ++         +  A+++L+ L   YE+ S  +++ L  +  ++ +    S+ SH +   +++++L   G KIEE  K   L
Subjt:  DKTWNEIDEQVVANI--RMALGVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAMRL

Query:  LTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRK
        L +LP  ++ + TA+     EN L  + + + +L +E + K
Subjt:  LTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-1328.34Show/hide
Query:  MTDKTWNEIDEQVVANIRMALG--VFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM
        M  + W ++DE+  + IR+ L   V   ++ E  A+ +   L+  Y   +   K++L  + + +HM EGT+  SH+N    ++ +L  +G KIEEE KA+
Subjt:  MTDKTWNEIDEQVVANIRMALG--VFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAM

Query:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTL-----GAENGIESVLVAQFKGKGKMKYNGKQEHRN
         LL SLP S++ + T + +  G+ +++   +  A+L  E  RK  +     L     G      S    +   +GK K   K   RN
Subjt:  RLLTSLPDSWEMMKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTL-----GAENGIESVLVAQFKGKGKMKYNGKQEHRN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGACAAAACTTGGAATGAGATAGATGAGCAGGTCGTTGCAAATATCAGAATGGCACTGGGGGTATTCCGTCTCGTGGTGAAAGAGACGATTGCAAAAGAATTGTT
GAAGGGCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATATTTTTATGGACAAAGTATTTTAACATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACA
TTAATGAGCTCACCGATATCTTGAACAAATTAGAAAAGATGGGTGACAAGATTGAGGAAGAGGTGAAGGCTATGAGGTTGTTGACATCTTTGCCTGATAGTTGGGAGATG
ATGAAGACCGCTGTGTCGAATTCGCGAGGAGAAAATAGCTTGAAATTTTCAGCTATTTGTGATGCCGTCTTATCTGAAGAAGCCCGTAGAAAATTAGGGAAAATGTCTGC
AACTACTTTAGGGGCAGAGAACGGAATTGAATCAGTTTTGGTAGCTCAATTTAAGGGGAAGGGCAAGATGAAGTACAACGGGAAGCAGGAACATAGGAATAACAGGGAGA
ATAGATGGAACCTCATGAGGCGATCCGACATAGTGGTTGTTGGCCACAGAAAAGCTTCAATGTATGTGTTGAGGTTTGGTGTTGCTAGAGGATTAGGGAGACAGGTCATG
CACAGGGCTGCAGATAGTTCAGGGGAGACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGACAAAACTTGGAATGAGATAGATGAGCAGGTCGTTGCAAATATCAGAATGGCACTGGGGGTATTCCGTCTCGTGGTGAAAGAGACGATTGCAAAAGAATTGTT
GAAGGGCTTGCAAGACAGGTATGAAAAACCTTCTGCCAATACAAAAATATTTTTATGGACAAAGTATTTTAACATCCACATGGAGGAGGGAACCTCGGTGAATTCCCACA
TTAATGAGCTCACCGATATCTTGAACAAATTAGAAAAGATGGGTGACAAGATTGAGGAAGAGGTGAAGGCTATGAGGTTGTTGACATCTTTGCCTGATAGTTGGGAGATG
ATGAAGACCGCTGTGTCGAATTCGCGAGGAGAAAATAGCTTGAAATTTTCAGCTATTTGTGATGCCGTCTTATCTGAAGAAGCCCGTAGAAAATTAGGGAAAATGTCTGC
AACTACTTTAGGGGCAGAGAACGGAATTGAATCAGTTTTGGTAGCTCAATTTAAGGGGAAGGGCAAGATGAAGTACAACGGGAAGCAGGAACATAGGAATAACAGGGAGA
ATAGATGGAACCTCATGAGGCGATCCGACATAGTGGTTGTTGGCCACAGAAAAGCTTCAATGTATGTGTTGAGGTTTGGTGTTGCTAGAGGATTAGGGAGACAGGTCATG
CACAGGGCTGCAGATAGTTCAGGGGAGACTTGA
Protein sequenceShow/hide protein sequence
MTDKTWNEIDEQVVANIRMALGVFRLVVKETIAKELLKGLQDRYEKPSANTKIFLWTKYFNIHMEEGTSVNSHINELTDILNKLEKMGDKIEEEVKAMRLLTSLPDSWEM
MKTAVSNSRGENSLKFSAICDAVLSEEARRKLGKMSATTLGAENGIESVLVAQFKGKGKMKYNGKQEHRNNRENRWNLMRRSDIVVVGHRKASMYVLRFGVARGLGRQVM
HRAADSSGET