; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g15570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g15570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:12292005..12296223
RNA-Seq ExpressionMoc06g15570
SyntenyMoc06g15570
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65867.1 hypothetical protein VITISV_034935 [Vitis vinifera]1.3e-1837.25Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL---------------------------------------------VGNPFELLFRIL---CGKEKLK
        DYL+ ++L LPL G KP+ M  +EW  LDR+VL                                             + N +E +   +    GKEKLK
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL---------------------------------------------VGNPFELLFRIL---CGKEKLK

Query:  FEDVRDAALAEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHD
        + D+RD  LAE+IR++D+G  S SGS LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   V E++ D
Subjt:  FEDVRDAALAEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHD

Query:  ALVL
        AL+L
Subjt:  ALVL

RVW30183.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.7e-2143.5Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-----CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSV
        DYL+ ++L LPL G KP+ M  +EW  LDR+VL      L                L + L      GKEKLK+ D+RD  LAE+IR++D+G  S SGS 
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-----CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSV

Query:  LNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL
        LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   V E++ DAL+L
Subjt:  LNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL

RVX02202.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.2e-1838.58Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-------------------------CGKEKLKFEDVRDA
        DYL+ ++L LPL G KP+ M  KEW  LDR+VL      L                L + L                          GKEKLK+ D+RD 
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-------------------------CGKEKLKFEDVRDA

Query:  ALAEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL
         LAE+IR++D+   S SGS LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   + E++ DAL+L
Subjt:  ALAEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL

RVX04667.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.5e-1938.46Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL---CGKEKLKFEDVRDAAL
        DYL+ ++L LPL G KP+ M  +EW  LDR+VL                                      N   ++   +    GKEKLK+ D+RD  L
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL---CGKEKLKFEDVRDAAL

Query:  AEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL
        AE+IR++D+G  S SGS LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   V E++ DAL+L
Subjt:  AEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]1.3e-5053.23Show/hide
Query:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVL--------------------------VGNPFE----------------------------------
        VLDYLHSKELE PLEGKPDDMGEKEWKKLDRKVL                          + N +E                                  
Subjt:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVL--------------------------VGNPFE----------------------------------

Query:  -------------------LLFRIL--------------CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR
                           LL R L              C KEKLKFEDVRDAALAE+IR+KDSGIA TSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR
Subjt:  -------------------LLFRIL--------------CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR

Query:  NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVLQLRA
        N+ FECWNCGK GHLK N KAPKKNEGNEA A+VAEQIHDALV+ + +
Subjt:  NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVLQLRA

TrEMBL top hitse value%identityAlignment
A0A0D2ZVN0 Uncharacterized protein7.3e-2044.87Show/hide
Query:  DYLHSKELELPLEGKPDDMGEKEWKKLDRK---VLVGNPFELLFRILCGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRN---NNRGYGNR
        DYL+ K+L  PL  KP+ M + EW+ LDR+   V+   P  +        +KLKF DVRD  L E++R+ DSG ASTS +    +RGRN   NNR  G R
Subjt:  DYLHSKELELPLEGKPDDMGEKEWKKLDRK---VLVGNPFELLFRILCGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRN---NNRGYGNR

Query:  GKSKNNRSRSR-NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGA--DVAEQIHDAL
         KS+N   +S+     ECWNCGK GH+K+N++AP K E N  G    V  +I DAL
Subjt:  GKSKNNRSRSR-NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGA--DVAEQIHDAL

A0A2N9IKQ5 Uncharacterized protein2.8e-1934.76Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL------------------C
        DYL+ K+L LPL G KP+DM + EW  LDR+VL                                      N   L+ +++                   
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL------------------C

Query:  GKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVD-RGRNNNRGYGNRGKSKNNRSRSRN---NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAE
        GK KLK+ D+RD  L E++R++D+G  S+SGS LN++ RGR  +R Y NRG+SK+ + RS++      ECWNCGK GH+++N    KK   N++   V E
Subjt:  GKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVD-RGRNNNRGYGNRGKSKNNRSRSRN---NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAE

Query:  QIHDALVLQL
        ++HDAL+L +
Subjt:  QIHDALVLQL

A0A438D407 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-2143.5Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-----CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSV
        DYL+ ++L LPL G KP+ M  +EW  LDR+VL      L                L + L      GKEKLK+ D+RD  LAE+IR++D+G  S SGS 
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVLVGNPFEL----------------LFRIL-----CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSV

Query:  LNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL
        LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   V E++ DAL+L
Subjt:  LNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL

A0A438J6T4 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-1938.46Show/hide
Query:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL---CGKEKLKFEDVRDAAL
        DYL+ ++L LPL G KP+ M  +EW  LDR+VL                                      N   ++   +    GKEKLK+ D+RD  L
Subjt:  DYLHSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------VGNPFELLFRIL---CGKEKLKFEDVRDAAL

Query:  AEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL
        AE+IR++D+G  S SGS LN++ RGR NNR    G  N   S  NRS+SR+    +CWNCGK GH KR  K+PKK   +++   V E++ DAL+L
Subjt:  AEKIRKKDSGIASTSGSVLNVD-RGRNNNR----GYGNRGKSKNNRSRSRN-NMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVL

A0A6J1DF43 uncharacterized protein LOC1110204696.1e-5153.23Show/hide
Query:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVL--------------------------VGNPFE----------------------------------
        VLDYLHSKELE PLEGKPDDMGEKEWKKLDRKVL                          + N +E                                  
Subjt:  VLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVL--------------------------VGNPFE----------------------------------

Query:  -------------------LLFRIL--------------CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR
                           LL R L              C KEKLKFEDVRDAALAE+IR+KDSGIA TSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR
Subjt:  -------------------LLFRIL--------------CGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSR

Query:  NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVLQLRA
        N+ FECWNCGK GHLK N KAPKKNEGNEA A+VAEQIHDALV+ + +
Subjt:  NNMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVLQLRA

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0438.78Show/hide
Query:  ILCGKEKLKFEDVRDAALA-EKIRKKDSGIAS---TSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSRNNMFECWNCGKNGHLKRNYKAPKKNEGNEAG
        IL GK  ++ +DV  A L  EK+RKK         T G   +  R  NN    G RGKSKN   RS++ +  C+NC + GH KR+   P+K +G  +G
Subjt:  ILCGKEKLKFEDVRDAALA-EKIRKKDSGIAS---TSGSVLNVDRGRNNNRGYGNRGKSKNNRSRSRNNMFECWNCGKNGHLKRNYKAPKKNEGNEAG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCGGTCTTCCGGTTTTCCTCCTCGCACCTACCAAATTCGTTGATCCGGGACTAATACCTGCAAAACCAAATCGAGCTAAACCGCGTGAAGTTGTTCCAGAACC
TTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCGAAATCGTCGATCGAAGATGGGGGATCACTGGCTGGGCCGTGGGCCGAGCACGGCTCGGGACCGAGCCCC
GGGTCGGGGCCGAGCATCGGGTCGAGACCGAGCACCGGGCCGGGGCGGAGTACCGGGTCGGGACCGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCCGGTCGCTTTCCTCG
TTCTCTGCCCCGACGTTAGGTCCTTCTTTGGGCCGGCCCGTATTAGATTATTTGCACTCGAAAGAGTTGGAACTGCCATTAGAAGGAAAGCCGGATGATATGGGAGAAAA
AGAATGGAAGAAGTTGGACAGGAAAGTGTTAGTTGGGAATCCATTCGAGCTGCTATTTCGAATTCTTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAG
CTCTTGCAGAAAAAATTCGCAAGAAGGACTCTGGTATCGCTTCTACTTCTGGTTCAGTATTGAATGTGGACAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGC
AAGTCGAAAAACAATAGAAGCAGATCGAGAAACAACATGTTTGAGTGTTGGAATTGTGGTAAGAATGGACACTTGAAGAGGAATTACAAGGCCCCGAAAAAAAATGAAGG
GAACGAAGCCGGTGCTGATGTTGCTGAGCAGATACATGATGCTTTGGTTCTGCAGTTGAGAGCGCTCATGACACATGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCGGTCTTCCGGTTTTCCTCCTCGCACCTACCAAATTCGTTGATCCGGGACTAATACCTGCAAAACCAAATCGAGCTAAACCGCGTGAAGTTGTTCCAGAACC
TTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCGAAATCGTCGATCGAAGATGGGGGATCACTGGCTGGGCCGTGGGCCGAGCACGGCTCGGGACCGAGCCCC
GGGTCGGGGCCGAGCATCGGGTCGAGACCGAGCACCGGGCCGGGGCGGAGTACCGGGTCGGGACCGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCCGGTCGCTTTCCTCG
TTCTCTGCCCCGACGTTAGGTCCTTCTTTGGGCCGGCCCGTATTAGATTATTTGCACTCGAAAGAGTTGGAACTGCCATTAGAAGGAAAGCCGGATGATATGGGAGAAAA
AGAATGGAAGAAGTTGGACAGGAAAGTGTTAGTTGGGAATCCATTCGAGCTGCTATTTCGAATTCTTTGTGGGAAAGAGAAATTGAAATTTGAAGATGTTAGAGATGCAG
CTCTTGCAGAAAAAATTCGCAAGAAGGACTCTGGTATCGCTTCTACTTCTGGTTCAGTATTGAATGTGGACAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGC
AAGTCGAAAAACAATAGAAGCAGATCGAGAAACAACATGTTTGAGTGTTGGAATTGTGGTAAGAATGGACACTTGAAGAGGAATTACAAGGCCCCGAAAAAAAATGAAGG
GAACGAAGCCGGTGCTGATGTTGCTGAGCAGATACATGATGCTTTGGTTCTGCAGTTGAGAGCGCTCATGACACATGGGTGA
Protein sequenceShow/hide protein sequence
MDFGLPVFLLAPTKFVDPGLIPAKPNRAKPREVVPEPSVANPNSPLQGLGEIVDRRWGITGWAVGRARLGTEPRVGAEHRVETEHRAGAEYRVGTEPFVAQWASLRSLSS
FSAPTLGPSLGRPVLDYLHSKELELPLEGKPDDMGEKEWKKLDRKVLVGNPFELLFRILCGKEKLKFEDVRDAALAEKIRKKDSGIASTSGSVLNVDRGRNNNRGYGNRG
KSKNNRSRSRNNMFECWNCGKNGHLKRNYKAPKKNEGNEAGADVAEQIHDALVLQLRALMTHG