CuGenDBv2

Moc03g15940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g15940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr3:10603621..10605786
RNA-Seq Expression: Moc03g15940
Synteny: Moc03g15940
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
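The genome location listed above uses a `chromosome:start..end` layout. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:10603621..10605786")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 2166 bp; with a 0-based, half-open convention it would be one base shorter.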


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032016.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]; e value: 1.5e-52; % identity: 44.12
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------
        MDSL+  FGQP  S+ H+AIKY+Y  RMKEG+SV+EHVL++M+HF++AEVNG  ++E +                                         
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------

Query:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESK------
               + EANV TT+K KF +GSSS SKSGPS   + I+KK K     K P   KG++   K KC+HC ENGH  RNCP+YL +K+  K+ K      
Subjt:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESK------

Query:  --------DEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETK
                + KV VS NA FLEEDHI++H+  SKLVLEEISK+ATD+ +        ST+V ++    G +HP QE+ EPRRSGRV+RQPDRY+ L E +
Subjt:  --------DEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETK

Query:  VTIPDD
        + IPDD
Subjt:  VTIPDD

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 7.9e-41; % identity: 33.64
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------
        MDSL+  FGQ S  I HDA+ Y+YN RM EG+SV+EHVLNMMVHFNVAE+NGAV++E S                                         
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------

Query:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKA---PAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----
                 EANVA TS RKFH+G + G+KS PS       KKKK     KA    A T  + K AK  CFHCN+ GHWKRNCP+YL +K+  K+     
Subjt:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKA---PAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEM
                                      KD KV VS NATFLEEDHI++HKPCSK+VL ++SK+ T+ STRVV++    TRV +  S+   +H PQ +
Subjt:  ------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEM

Query:  REPRRSGRVLRQPDRYMSLIETKVTIPD
        REPRRSGRV   P RYMSL ET   I D
Subjt:  REPRRSGRVLRQPDRYMSLIETKVTIPD

KAA0056663.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 4.2e-42; % identity: 35.62
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG
        MD+L+A FGQP  S+ H+AIKY+   RMKEG+SV+EHVL+MM+H N+AEVNG V++E +           EANV TT KRKF +GSSS +K GPS  K  
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG

Query:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------
        I+KK+K     + P   KG++   K KC+HC++ GHW RNCP+YL KK+AEKE                                               
Subjt:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------

Query:  ------------------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDH
                                                                                      ++ KV VS NAT L+EDHI++H
Subjt:  ------------------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDH

Query:  KPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYM
        +P SKLVL EISK A DK +        ST+V ++    G +HP QE+REPRRSGRV+ QPDRY+
Subjt:  KPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYM

KAA0066192.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.7e-41; % identity: 43.73
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEISSDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDN
        MD LR  FGQ S  I  +AIKYVYN  MKE  +V+EHVL+M+V+FN              + EAN A    R+F   SS GSK         IQ +K   
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEISSDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDN

Query:  VKVKAPAAT-KGEEKIA-KEKCFHCNENGHWKRNCPEYLVKKR----------------------------------------AEKESKD--------EK
         K    AA  KG+ K+A K KCFHCN +GHWKRNCP+YL KK+                                        A KE++D         K
Subjt:  VKVKAPAAT-KGEEKIA-KEKCFHCNENGHWKRNCPEYLVKKR----------------------------------------AEKESKD--------EK

Query:  VVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD
        V VS NATFL+E+H+ DHKP SKLVL +    A D+ST+VV++ GPS+RVD   +T G SHP Q +R PRRSGR++ QP+RY+ L ET+V IPDD
Subjt:  VVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD

TYJ98102.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.1e-42; % identity: 35.5
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG
        MDSL+  FGQP  S+  + IKY+Y  RMK+G+SV+EHVL+MM+HFN+AEVNG V++E +         + EANVATT KRKF + S S SK+GPS   + 
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG

Query:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------
        I+KKKK     K P     ++   K KC+H  ENGHW RNCP++L  K A+KE+                                              
Subjt:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------

Query:  ----------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVL
                                                                              ++ KV VS NATFLEE+HI++H+  SKLVL
Subjt:  ----------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVL

Query:  EEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD
        EEISK+ TD+ +        ST++ ++    G +HP Q+ REPRRSGRV+RQPDRY+ L E ++ IPDD
Subjt:  EEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U676 Gag/pol protein; e value: 3.8e-41; % identity: 33.64
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------
        MDSL+  FGQ S  I HDA+ Y+YN RM EG+SV+EHVLNMMVHFNVAE+NGAV++E S                                         
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------

Query:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKA---PAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----
                 EANVA TS RKFH+G + G+KS PS       KKKK     KA    A T  + K AK  CFHCN+ GHWKRNCP+YL +K+  K+     
Subjt:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKA---PAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEM
                                      KD KV VS NATFLEEDHI++HKPCSK+VL ++SK+ T+ STRVV++    TRV +  S+   +H PQ +
Subjt:  ------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEM

Query:  REPRRSGRVLRQPDRYMSLIETKVTIPD
        REPRRSGRV   P RYMSL ET   I D
Subjt:  REPRRSGRVLRQPDRYMSLIETKVTIPD

A0A5A7UL81 Gag/pol protein; e value: 2.0e-42; % identity: 35.62
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG
        MD+L+A FGQP  S+ H+AIKY+   RMKEG+SV+EHVL+MM+H N+AEVNG V++E +           EANV TT KRKF +GSSS +K GPS  K  
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG

Query:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------
        I+KK+K     + P   KG++   K KC+HC++ GHW RNCP+YL KK+AEKE                                               
Subjt:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------

Query:  ------------------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDH
                                                                                      ++ KV VS NAT L+EDHI++H
Subjt:  ------------------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDH

Query:  KPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYM
        +P SKLVL EISK A DK +        ST+V ++    G +HP QE+REPRRSGRV+ QPDRY+
Subjt:  KPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYM

A0A5D3BEB1 Gag/pol protein; e value: 5.4e-43; % identity: 35.5
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG
        MDSL+  FGQP  S+  + IKY+Y  RMK+G+SV+EHVL+MM+HFN+AEVNG V++E +         + EANVATT KRKF + S S SK+GPS   + 
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS--------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKG

Query:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------
        I+KKKK     K P     ++   K KC+H  ENGHW RNCP++L  K A+KE+                                              
Subjt:  IQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKES----------------------------------------------

Query:  ----------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVL
                                                                              ++ KV VS NATFLEE+HI++H+  SKLVL
Subjt:  ----------------------------------------------------------------------KDEKVVVSINATFLEEDHIQDHKPCSKLVL

Query:  EEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD
        EEISK+ TD+ +        ST++ ++    G +HP Q+ REPRRSGRV+RQPDRY+ L E ++ IPDD
Subjt:  EEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD

A0A5D3BR24 Gag/pol protein; e value: 1.3e-41; % identity: 43.73
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEISSDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDN
        MD LR  FGQ S  I  +AIKYVYN  MKE  +V+EHVL+M+V+FN              + EAN A    R+F   SS GSK         IQ +K   
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEISSDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDN

Query:  VKVKAPAAT-KGEEKIA-KEKCFHCNENGHWKRNCPEYLVKKR----------------------------------------AEKESKD--------EK
         K    AA  KG+ K+A K KCFHCN +GHWKRNCP+YL KK+                                        A KE++D         K
Subjt:  VKVKAPAAT-KGEEKIA-KEKCFHCNENGHWKRNCPEYLVKKR----------------------------------------AEKESKD--------EK

Query:  VVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD
        V VS NATFL+E+H+ DHKP SKLVL +    A D+ST+VV++ GPS+RVD   +T G SHP Q +R PRRSGR++ QP+RY+ L ET+V IPDD
Subjt:  VVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETKVTIPDD

A0A5D3CYG9 Retrovirus-related pol polyprotein from transposon tnt 1-94; e value: 7.5e-53; % identity: 44.12
Query:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------
        MDSL+  FGQP  S+ H+AIKY+Y  RMKEG+SV+EHVL++M+HF++AEVNG  ++E +                                         
Subjt:  MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEIS-----------------------------------------

Query:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESK------
               + EANV TT+K KF +GSSS SKSGPS   + I+KK K     K P   KG++   K KC+HC ENGH  RNCP+YL +K+  K+ K      
Subjt:  ------SDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKAPAATKGEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESK------

Query:  --------DEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETK
                + KV VS NA FLEEDHI++H+  SKLVLEEISK+ATD+ +        ST+V ++    G +HP QE+ EPRRSGRV+RQPDRY+ L E +
Subjt:  --------DEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPRRSGRVLRQPDRYMSLIETK

Query:  VTIPDD
        + IPDD
Subjt:  VTIPDD

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGGACTCTCTACGAGCCCGGTTTGGACAACCATCAACATCTATCATGCATGATGCAATTAAGTATGTGTACAACTGTAGAATGAAGGAAGGATCTTCCGTAAAGGAGCA
TGTTTTGAACATGATGGTTCACTTCAATGTTGCAGAAGTGAACGGTGCAGTCATGAATGAAATAAGTTCTGATGCTGAGGCAAATGTCGCTACCACCTCAAAAAGGAAAT
TCCACAAGGGATCTTCCTCTGGGAGTAAATCTGGACCTTCTTATCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGATAATGTGAAGGTGAAGGCTCCGGCTGCGACAAAA
GGCGAGGAAAAGATTGCAAAAGAAAAATGTTTCCATTGCAATGAAAATGGGCACTGGAAAAGAAATTGCCCAGAATACCTCGTCAAGAAAAGAGCTGAGAAGGAAAGCAA
GGATGAAAAGGTAGTTGTATCGATAAACGCCACATTCCTAGAGGAAGACCACATACAAGATCATAAACCCTGCAGCAAACTAGTATTAGAAGAGATTTCAAAAGATGCTA
CAGATAAATCAACAAGAGTTGTTGATCAGGCTGGTCCATCAACAAGAGTTGATAATAGACCAAGCACATGTGGTCCGTCACATCCTCCTCAAGAGATGAGAGAGCCTCGA
CGTAGTGGCAGAGTTTTGAGACAACCTGACCGCTATATGAGTTTAATTGAAACCAAAGTCACCATACCTGATGATGTCTGA
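The CDS above can be checked against the protein shown in the alignments by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper written for illustration. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGACTCTCTACGAGCCCGGTTTGGACAA"))  # MDSLRARFGQ
```

The result matches the first ten residues of the query sequence in the alignments. Passing the full 741-nucleotide CDS would give the complete protein, with the final `TGA` read as the stop codon.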
mRNA sequence
ATGGACTCTCTACGAGCCCGGTTTGGACAACCATCAACATCTATCATGCATGATGCAATTAAGTATGTGTACAACTGTAGAATGAAGGAAGGATCTTCCGTAAAGGAGCA
TGTTTTGAACATGATGGTTCACTTCAATGTTGCAGAAGTGAACGGTGCAGTCATGAATGAAATAAGTTCTGATGCTGAGGCAAATGTCGCTACCACCTCAAAAAGGAAAT
TCCACAAGGGATCTTCCTCTGGGAGTAAATCTGGACCTTCTTATCAGAAGAAAGGAATTCAGAAGAAGAAGAAGGATAATGTGAAGGTGAAGGCTCCGGCTGCGACAAAA
GGCGAGGAAAAGATTGCAAAAGAAAAATGTTTCCATTGCAATGAAAATGGGCACTGGAAAAGAAATTGCCCAGAATACCTCGTCAAGAAAAGAGCTGAGAAGGAAAGCAA
GGATGAAAAGGTAGTTGTATCGATAAACGCCACATTCCTAGAGGAAGACCACATACAAGATCATAAACCCTGCAGCAAACTAGTATTAGAAGAGATTTCAAAAGATGCTA
CAGATAAATCAACAAGAGTTGTTGATCAGGCTGGTCCATCAACAAGAGTTGATAATAGACCAAGCACATGTGGTCCGTCACATCCTCCTCAAGAGATGAGAGAGCCTCGA
CGTAGTGGCAGAGTTTTGAGACAACCTGACCGCTATATGAGTTTAATTGAAACCAAAGTCACCATACCTGATGATGTCTGA
Protein sequence
MDSLRARFGQPSTSIMHDAIKYVYNCRMKEGSSVKEHVLNMMVHFNVAEVNGAVMNEISSDAEANVATTSKRKFHKGSSSGSKSGPSYQKKGIQKKKKDNVKVKAPAATK
GEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESKDEKVVVSINATFLEEDHIQDHKPCSKLVLEEISKDATDKSTRVVDQAGPSTRVDNRPSTCGPSHPPQEMREPR
RSGRVLRQPDRYMSLIETKVTIPDDV
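The InterPro annotation for this gene is a CCHC-type zinc finger, whose commonly cited consensus is C-X2-C-X4-H-X4-C. As a rough illustration, the sketch below searches a fragment of the protein sequence above with a regular expression for that spacing. This is a simple pattern match, not the profile-based method InterPro uses, and the names `CCHC_PATTERN` and `find_cchc` are hypothetical.

```python
import re

# C-X2-C-X4-H-X4-C spacing of the CCHC-type zinc finger consensus.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc(protein):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in CCHC_PATTERN.finditer(protein)]


# Fragment taken from the protein sequence listed above.
fragment = "GEEKIAKEKCFHCNENGHWKRNCPEYLVKKRAEKESKDEK"
print(find_cchc(fragment))  # [(9, 'CFHCNENGHWKRNC')]
```

The matched stretch, `CFHCNENGHWKRNC`, lies in the region that is conserved across the GenBank and TrEMBL hits in the alignments.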