; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:10073427..10074074
RNA-Seq ExpressionMoc04g12970
SyntenyMoc04g12970
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67821.1 hypothetical protein VITISV_025855, partial [Vitis vinifera]6.2e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

CAN81238.1 hypothetical protein VITISV_031073 [Vitis vinifera]8.1e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

ONK57710.1 uncharacterized protein A4U43_C09F3290 [Asparagus officinalis]7.8e-6064.35Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        M LMIIK G+ EAFRG+ SE IT AKDFLA+I KRFAKNDK ETS LL RLI MKYKGKGNIREYIMEMSH+  KLKAL+LDL D LLVH VL+SLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSK-KNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASL
        NQFK                      E+RLKQD TES H A  SK K  KRK   EAAKGP  KKQ +D +GCF C KPGH KK+ TKYHAWR  KG  L
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSK-KNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASL

Query:  TLVCSEVNMTSVPRNT
         LVCSEVN+ SVPRNT
Subjt:  TLVCSEVNMTSVPRNT

RVW15788.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.1e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

RWR91636.1 hypothetical protein CKAN_02080200 [Cinnamomum micranthum f. kanehirae]4.1e-6165.12Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        M LMIIK GI EAFRG VSE++T AK+FLA+I KRF KNDK ETS LLQ LI MKY GKGNIREYIM MSH+  KLKAL L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASLT
        +QFK                      EERLKQD TES H ASTSK   K++KK EAAKGPA KKQQ   +GCF C KPGH KK+ TKYHAWR KKG  L 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASLT

Query:  LVCSEVNMTSVPRNT
        LVCSEVN+ SVPRNT
Subjt:  LVCSEVNMTSVPRNT

TrEMBL top hitse value%identityAlignment
A0A438BXW8 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

A0A443PLH5 Uncharacterized protein2.0e-6165.12Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        M LMIIK GI EAFRG VSE++T AK+FLA+I KRF KNDK ETS LLQ LI MKY GKGNIREYIM MSH+  KLKAL L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASLT
        +QFK                      EERLKQD TES H ASTSK   K++KK EAAKGPA KKQQ   +GCF C KPGH KK+ TKYHAWR KKG  L 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASLT

Query:  LVCSEVNMTSVPRNT
        LVCSEVN+ SVPRNT
Subjt:  LVCSEVNMTSVPRNT

A0A5P1E579 CCHC-type domain-containing protein3.8e-6064.35Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        M LMIIK G+ EAFRG+ SE IT AKDFLA+I KRFAKNDK ETS LL RLI MKYKGKGNIREYIMEMSH+  KLKAL+LDL D LLVH VL+SLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSK-KNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASL
        NQFK                      E+RLKQD TES H A  SK K  KRK   EAAKGP  KKQ +D +GCF C KPGH KK+ TKYHAWR  KG  L
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSK-KNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASL

Query:  TLVCSEVNMTSVPRNT
         LVCSEVN+ SVPRNT
Subjt:  TLVCSEVNMTSVPRNT

A5BE46 Uncharacterized protein (Fragment)3.0e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

A5BKR0 Uncharacterized protein3.9e-5762.39Show/hide
Query:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS
        + LMI+K GI EAFRG V++E+TNA DFLA+I KRFAKNDK ETS LL  LI MKYKGKGN+REYIMEMSHL  KLKAL+L+LSD LLVH VLISLP   
Subjt:  MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHS

Query:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA
        NQFK                      EERLKQD TES H ASTSK   KRK    K  A+ GP  KKQ+ ++  CF C KPGH+KKE TKY AWR KKG 
Subjt:  NQFK----------------------EERLKQDSTESIHFASTSKKNVKRK---KKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGA

Query:  SLTLVCSEVNMTSVPRNT
         LTLVCSEVN+ SV RNT
Subjt:  SLTLVCSEVNMTSVPRNT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53670.1 unknown protein4.6e-1039.8Show/hide
Query:  LMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLD---LSDGLLVHPVLISLP
        +MI+K  I + FRGVV +++T AKDFLA +   FAKN++ E S +      M Y    N+RE IM M  L  K K L ++    +D +L H  +  LP
Subjt:  LMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLD---LSDGLLVHPVLISLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCTAATGATCATAAAGTGTGGCATTCTAGAGGCATTTAGAGGTGTGGTATCCGAAGAAATAACTAATGCCAAAGATTTCCTTGCTAAAATAGTAAAGCGT
TTTGCCAAAAACGATAAACCTGAAACAAGCATGCTCTTACAACGCTTGATTTGTATGAAATATAAGGGCAAAGGAAATATTAGGGAGTACATTATGGAAATGTCT
CATCTAGTATTAAAACTAAAAGCGCTTCAGCTTGACCTATCTGATGGTTTGCTTGTGCATCCAGTCTTGATCTCTCTTCCTACACACTCTAATCAATTTAAGGAA
GAGAGATTAAAGCAAGATAGTACAGAAAGTATTCACTTTGCAAGCACCTCTAAGAAAAATGTCAAAAGGAAAAAGAAATATGAAGCTGCTAAGGGTCCAGCTCCT
AAGAAACAACAACAAGATATCAAAGGTTGTTTCTCTTGTGGGAAACCTGGACATTCAAAGAAAGAACGTACCAAGTATCATGCTTGGCGTAAAAAGAAAGGTGCA
TCTCTTACTTTGGTCTGTTCTGAAGTAAATATGACTTCAGTGCCTAGAAACACATAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTCTAATGATCATAAAGTGTGGCATTCTAGAGGCATTTAGAGGTGTGGTATCCGAAGAAATAACTAATGCCAAAGATTTCCTTGCTAAAATAGTAAAGCGT
TTTGCCAAAAACGATAAACCTGAAACAAGCATGCTCTTACAACGCTTGATTTGTATGAAATATAAGGGCAAAGGAAATATTAGGGAGTACATTATGGAAATGTCT
CATCTAGTATTAAAACTAAAAGCGCTTCAGCTTGACCTATCTGATGGTTTGCTTGTGCATCCAGTCTTGATCTCTCTTCCTACACACTCTAATCAATTTAAGGAA
GAGAGATTAAAGCAAGATAGTACAGAAAGTATTCACTTTGCAAGCACCTCTAAGAAAAATGTCAAAAGGAAAAAGAAATATGAAGCTGCTAAGGGTCCAGCTCCT
AAGAAACAACAACAAGATATCAAAGGTTGTTTCTCTTGTGGGAAACCTGGACATTCAAAGAAAGAACGTACCAAGTATCATGCTTGGCGTAAAAAGAAAGGTGCA
TCTCTTACTTTGGTCTGTTCTGAAGTAAATATGACTTCAGTGCCTAGAAACACATAG
Protein sequenceShow/hide protein sequence
MCLMIIKCGILEAFRGVVSEEITNAKDFLAKIVKRFAKNDKPETSMLLQRLICMKYKGKGNIREYIMEMSHLVLKLKALQLDLSDGLLVHPVLISLPTHSNQFKE
ERLKQDSTESIHFASTSKKNVKRKKKYEAAKGPAPKKQQQDIKGCFSCGKPGHSKKERTKYHAWRKKKGASLTLVCSEVNMTSVPRNT