; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g33050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g33050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:25052233..25053410
RNA-Seq ExpressionMoc09g33050
SyntenyMoc09g33050
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5755123.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.2e-1335.29Show/hide
Query:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK
        KE  KA  + K K + +K D   +        EEF  C + D        S WVVD+  + HVTS R +FSS+  G++G V+M N  +SK+ G+ DVCLK
Subjt:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK

Query:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                                 G LDDDGY S F  G WKL R S +VA   + S +YM+   ++ D
Subjt:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

KAF5774771.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.2e-1335.29Show/hide
Query:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK
        KE  KA  + K K + +K D   +        EEF  C + D        S WVVD+  + HVTS R +FSS+  G++G V+M N  +SK+ G+ DVCLK
Subjt:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK

Query:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                                 G LDDDGY S F  G WKL R S +VA   + S +YM+   ++ D
Subjt:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

KAF5795403.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.2e-1335.29Show/hide
Query:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK
        KE  KA  + K K + +K D   +        EEF  C + D        S WVVD+  + HVTS R +FSS+  G++G V+M N  +SK+ G+ DVCLK
Subjt:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK

Query:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                                 G LDDDGY S F  G WKL R S +VA   + S +YM+   ++ D
Subjt:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

KAF5811941.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.2e-1335.29Show/hide
Query:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK
        KE  KA  + K K + +K D   +        EEF  C + D        S WVVD+  + HVTS R +FSS+  G++G V+M N  +SK+ G+ DVCLK
Subjt:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK

Query:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                                 G LDDDGY S F  G WKL R S +VA   + S +YM+   ++ D
Subjt:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

VVB04180.1 unnamed protein product [Arabis nemorensis]1.6e-1533.9Show/hide
Query:  KETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIE
        K+   K+  K  +D K  +  E  ++ V V       ++FL  +E D      H + WVVD+  +TH TS R +F+++  G+YGSV+M N++++KV GI 
Subjt:  KETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIE

Query:  DVCLK------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKDLL
        D+CL+                        TGKLDD+G++S F  G WKL R S V+A   K S  Y     V+KD++
Subjt:  DVCLK------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKDLL

TrEMBL top hitse value%identityAlignment
A0A251SV86 Putative zinc finger, CCHC-type, Ribonuclease H-like domain, GAG-pre-integrase domain protein5.7e-1435.29Show/hide
Query:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK
        KE  KA  + K K + +K D   +        EEF  C + D        S WVVD+  + HVTS R +FSS+  G++G V+M N  +SK+ G+ DVCLK
Subjt:  KELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK

Query:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                                 G LDDDGY S F  G WKL R S +VA   + S +YM+   ++ D
Subjt:  ------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

A0A371FS05 gag_pre-integrs domain-containing protein (Fragment)6.3e-1332.24Show/hide
Query:  EKRDNYVNVVIGKEQIEEFLACVERDTYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKTG----------------
        EK D+ V    G + +   L   E   +V + S+W++D+ T+ HVT  + +F+S+  G++G ++M N+ ++KV G+ DVCL+T                 
Subjt:  EKRDNYVNVVIGKEQIEEFLACVERDTYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKTG----------------

Query:  --------KLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                 LDD GY + F  G WKL + + VVA   K S +Y ++  VAKD
Subjt:  --------KLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

A0A565BS43 Uncharacterized protein7.9e-1633.9Show/hide
Query:  KETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIE
        K+   K+  K  +D K  +  E  ++ V V       ++FL  +E D      H + WVVD+  +TH TS R +F+++  G+YGSV+M N++++KV GI 
Subjt:  KETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIE

Query:  DVCLK------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKDLL
        D+CL+                        TGKLDD+G++S F  G WKL R S V+A   K S  Y     V+KD++
Subjt:  DVCLK------------------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKDLL

A0A5N6NVJ4 Uncharacterized protein6.3e-1330.38Show/hide
Query:  KEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK------------
        K +  K D   ++ +     +EF  C + D        S WVVD+  + HVTS R +++S+  G++G  +M N  +SK+ G+ D+CLK            
Subjt:  KEDFEKRDNYVNVVIGKEQIEEFLACVERD--TYVDHSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLK------------

Query:  ------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD
                     G LD+DGY + F  G WKL   S +VA   + S +YM+   ++KD
Subjt:  ------------TGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKD

A5AJI7 Integrase catalytic domain-containing protein1.6e-1328.14Show/hide
Query:  QVKDLLTCKKIHKSL---GDRPTEITEKDWKLIYKQAVANIRMSLSMGVCSLVTKETTAKEL---LKALQDRKFKE----------DFEKRDNYVNVVIG
        ++KD+L CK++ + +   G +P    E +WK + ++ V  IR  +   V   V KE  A  L   L++L +RK  +          + + +D ++     
Subjt:  QVKDLLTCKKIHKSL---GDRPTEITEKDWKLIYKQAVANIRMSLSMGVCSLVTKETTAKEL---LKALQDRKFKE----------DFEKRDNYVNVVIG

Query:  KEQIEEFLACVER---------------------DTYVD---HSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKT-----
        K + E F    E                      DT V+     S WVVDTATS H+T+ R +FSS+  G++G V M NE+  ++ G+ DV L+T     
Subjt:  KEQIEEFLACVER---------------------DTYVD---HSSVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKT-----

Query:  -------------------GKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAK
                           GKLDD+GY S    G WK+ + S V+    K + +Y  E  + K
Subjt:  -------------------GKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.6e-1136.04Show/hide
Query:  SVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKTG------------------------KLDDDGYSSEFVWGCWKLRRESRV
        S WVVDTA S H T  R  F  +  G++G+V+M N S SK+ GI D+C+KT                          LD DGY S F    W+L + S V
Subjt:  SVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKTG------------------------KLDDDGYSSEFVWGCWKLRRESRV

Query:  VATSYKRSFVY
        +A    R  +Y
Subjt:  VATSYKRSFVY

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.1e-0630.68Show/hide
Query:  MQVKDLLTCKKIHKSLGDRPTEITEKDWKLIYKQAVANIRMSLSMGVCSLVTKETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQ
        M+++D L  KK+H+ LG +   +++ DW ++Y+Q +  IR+++S  +   V KE +   L+K L       D  K+ +  N VI  E+
Subjt:  MQVKDLLTCKKIHKSLGDRPTEITEKDWKLIYKQAVANIRMSLSMGVCSLVTKETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTAAAAGATCTTCTTACGTGCAAGAAGATACACAAGAGTTTGGGGGATAGACCAACGGAGATTACCGAAAAGGATTGGAAATTGATATATAAGCAGGCC
GTTGCAAACATCAGAATGTCTTTATCGATGGGGGTATGCAGTCTGGTGACAAAAGAGACAACAGCGAAAGAACTGTTGAAGGCCTTGCAAGACAGAAAGTTCAAA
GAAGATTTTGAGAAGAGGGACAATTATGTAAATGTTGTAATAGGTAAAGAACAGATTGAAGAGTTTCTAGCTTGTGTTGAGAGAGACACATATGTAGATCATTCA
TCAGTGTGGGTAGTGGACACTGCAACATCAACACATGTTACTTCAGGCAGACATTGGTTCTCATCTTTTGCTGTAGGTAATTATGGCTCAGTGAGGATGAGGAAT
GAGAGTATCTCCAAGGTGAGAGGAATTGAAGATGTTTGTTTGAAGACAGGAAAGCTAGACGATGATGGCTATAGCAGTGAGTTTGTTTGGGGTTGCTGGAAGCTC
AGGAGGGAATCTAGAGTAGTGGCGACAAGCTACAAGAGATCTTTTGTTTATATGTCAGAGTTTGGGGTTGCGAAGGATTTACTAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTAAAAGATCTTCTTACGTGCAAGAAGATACACAAGAGTTTGGGGGATAGACCAACGGAGATTACCGAAAAGGATTGGAAATTGATATATAAGCAGGCC
GTTGCAAACATCAGAATGTCTTTATCGATGGGGGTATGCAGTCTGGTGACAAAAGAGACAACAGCGAAAGAACTGTTGAAGGCCTTGCAAGACAGAAAGTTCAAA
GAAGATTTTGAGAAGAGGGACAATTATGTAAATGTTGTAATAGGTAAAGAACAGATTGAAGAGTTTCTAGCTTGTGTTGAGAGAGACACATATGTAGATCATTCA
TCAGTGTGGGTAGTGGACACTGCAACATCAACACATGTTACTTCAGGCAGACATTGGTTCTCATCTTTTGCTGTAGGTAATTATGGCTCAGTGAGGATGAGGAAT
GAGAGTATCTCCAAGGTGAGAGGAATTGAAGATGTTTGTTTGAAGACAGGAAAGCTAGACGATGATGGCTATAGCAGTGAGTTTGTTTGGGGTTGCTGGAAGCTC
AGGAGGGAATCTAGAGTAGTGGCGACAAGCTACAAGAGATCTTTTGTTTATATGTCAGAGTTTGGGGTTGCGAAGGATTTACTAAAATAG
Protein sequenceShow/hide protein sequence
MQVKDLLTCKKIHKSLGDRPTEITEKDWKLIYKQAVANIRMSLSMGVCSLVTKETTAKELLKALQDRKFKEDFEKRDNYVNVVIGKEQIEEFLACVERDTYVDHS
SVWVVDTATSTHVTSGRHWFSSFAVGNYGSVRMRNESISKVRGIEDVCLKTGKLDDDGYSSEFVWGCWKLRRESRVVATSYKRSFVYMSEFGVAKDLLK