; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g20840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g20840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr4:15171348..15174896
RNA-Seq ExpressionMoc04g20840
SyntenyMoc04g20840
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-4944.9Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA R  R+ Y+RW KAN+KA+ YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q +H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK
        VHFNVAEMNGAVIDE SQ                                             GQ+GEANVATS R FHRGSTSGTK MPS   +  +KK
Subjt:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK

Query:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM
        KK G+G K    A+ TT  AK                         K KAK                                    GISS R+L+  EM
Subjt:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM

Query:  TLKVGTGDVVSAVA
        T++VGTG VVSA+A
Subjt:  TLKVGTGDVVSAVA

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-4944.9Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA R  R+ Y+RW KAN+KA+ YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q +H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK
        VHFNVAEMNGAVIDE SQ                                             GQ+GEANVATS R FHRGSTSGTK MPS   +  +KK
Subjt:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK

Query:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM
        KK G+G K    A+ TT  AK                         K KAK                                    GISS R+L+  EM
Subjt:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM

Query:  TLKVGTGDVVSAVA
        T++VGTG VVSA+A
Subjt:  TLKVGTGDVVSAVA

KAA0060254.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-5365.75Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA +  R+ Y+RW K N+K + YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q  H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI
        VHFNVAEMNGAVIDE SQGQ+GEANVATS R FHRGSTSGTK MPS   +  +KKKK G+G KA+       KK KA KGI
Subjt:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-5365.75Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA +  R+ Y+RW K N+K + YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q  H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI
        VHFNVAEMNGAVIDE SQGQ+GEANVATS R FHRGSTSGTK MPS   +  +KKKK G+G KA+       KK KA KGI
Subjt:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]1.7e-5299.09Show/hide
Query:  QAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM
        +APLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM
Subjt:  QAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM

Query:  NGAVIDEGSQ
        NGAVIDEGSQ
Subjt:  NGAVIDEGSQ

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.1e-5044.9Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA R  R+ Y+RW KAN+KA+ YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q +H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK
        VHFNVAEMNGAVIDE SQ                                             GQ+GEANVATS R FHRGSTSGTK MPS   +  +KK
Subjt:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK

Query:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM
        KK G+G K    A+ TT  AK                         K KAK                                    GISS R+L+  EM
Subjt:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM

Query:  TLKVGTGDVVSAVA
        T++VGTG VVSA+A
Subjt:  TLKVGTGDVVSAVA

A0A5A7TWB9 Gag/pol protein5.1e-5044.9Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA R  R+ Y+RW KAN+KA+ YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q +H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK
        VHFNVAEMNGAVIDE SQ                                             GQ+GEANVATS R FHRGSTSGTK MPS   +  +KK
Subjt:  VHFNVAEMNGAVIDEGSQ---------------------------------------------GQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKK

Query:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM
        KK G+G K    A+ TT  AK                         K KAK                                    GISS R+L+  EM
Subjt:  KKNGKGIK----ASPTTVAAK-------------------------KGKAK------------------------------------GISSSRKLDVEEM

Query:  TLKVGTGDVVSAVA
        T++VGTG VVSA+A
Subjt:  TLKVGTGDVVSAVA

A0A5A7UYX7 Gag/pol protein1.3e-5365.75Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA +  R+ Y+RW K N+K + YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q  H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI
        VHFNVAEMNGAVIDE SQGQ+GEANVATS R FHRGSTSGTK MPS   +  +KKKK G+G KA+       KK KA KGI
Subjt:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI

A0A5D3BHG7 Gag/pol protein1.3e-5365.75Show/hide
Query:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM
        FVL EECPQ P  NA +  R+ Y+RW K N+K + YILAS+S+VLAKKHE+M+ A+EIMDSL++MFGQ S Q  H+ALK+IYN+RM EG S++EHVLN+M
Subjt:  FVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLM

Query:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI
        VHFNVAEMNGAVIDE SQGQ+GEANVATS R FHRGSTSGTK MPS   +  +KKKK G+G KA+       KK KA KGI
Subjt:  VHFNVAEMNGAVIDEGSQGQEGEANVATSNR-FHRGSTSGTKFMPSFR-SMTFKKKKNGKGIKAS-PTTVAAKKGKA-KGI

A0A6J1DRZ2 uncharacterized protein LOC1110229748.4e-5399.09Show/hide
Query:  QAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM
        +APLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM
Subjt:  QAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQEIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEM

Query:  NGAVIDEGSQ
        NGAVIDEGSQ
Subjt:  NGAVIDEGSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGCAGCATTTCAGTAGCATTCGAGCTTACCCACATCCGTTCGGAGCTCGATTTAGGCTACCCACACCTTGGTGAGCTAGATCTAGTGCCTAGGCTAGGGGTTGA
TGTGAAGTTGGTCATGACAGGGAACGTTGGATTCTTGGGGAAAGTGAATGGGTTTGTCTTAACTGAGGAGTGTCCTCAGGCTCCCTTGCCTAATGCAGCCCGAGCAAGTC
GGGATGCCTATGATAGGTGGATCAAGGCCAATGATAAGGCTAAGGTCTACATCTTAGCGAGCATATCTGATGTGCTAGCCAAGAAGCACGAGAACATGGTCAACGCACAG
GAGATCATGGACTCGTTACGGGACATGTTTGGACAGCCGTCCATTCAAGCCCGACACGAAGCCCTCAAGTTCATTTACAATTCCCGCATGAAAGAAGGAACATCAATGCA
AGAACATGTTCTCAATCTGATGGTCCACTTTAATGTGGCTGAAATGAATGGGGCTGTCATAGACGAGGGAAGCCAGGGACAAGAAGGGGAGGCAAACGTTGCTACCTCAA
ATCGGTTCCATCGAGGTTCGACCTCAGGAACAAAATTTATGCCTTCTTTTAGAAGTATGACTTTCAAGAAGAAGAAAAATGGTAAGGGTATCAAAGCTAGCCCTACTACT
GTTGCTGCCAAGAAGGGCAAGGCCAAAGGAATTAGTTCCTCGAGGAAGCTTGATGTTGAGGAGATGACTCTCAAAGTTGGAACAGGAGATGTCGTCTCAGCAGTCGCATG
TTCATTTCACGAAATGGTGTTAACATTTGTTCTACAAAACTTGAAAACAGCTTATACGCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGCAGCATTTCAGTAGCATTCGAGCTTACCCACATCCGTTCGGAGCTCGATTTAGGCTACCCACACCTTGGTGAGCTAGATCTAGTGCCTAGGCTAGGGGTTGA
TGTGAAGTTGGTCATGACAGGGAACGTTGGATTCTTGGGGAAAGTGAATGGGTTTGTCTTAACTGAGGAGTGTCCTCAGGCTCCCTTGCCTAATGCAGCCCGAGCAAGTC
GGGATGCCTATGATAGGTGGATCAAGGCCAATGATAAGGCTAAGGTCTACATCTTAGCGAGCATATCTGATGTGCTAGCCAAGAAGCACGAGAACATGGTCAACGCACAG
GAGATCATGGACTCGTTACGGGACATGTTTGGACAGCCGTCCATTCAAGCCCGACACGAAGCCCTCAAGTTCATTTACAATTCCCGCATGAAAGAAGGAACATCAATGCA
AGAACATGTTCTCAATCTGATGGTCCACTTTAATGTGGCTGAAATGAATGGGGCTGTCATAGACGAGGGAAGCCAGGGACAAGAAGGGGAGGCAAACGTTGCTACCTCAA
ATCGGTTCCATCGAGGTTCGACCTCAGGAACAAAATTTATGCCTTCTTTTAGAAGTATGACTTTCAAGAAGAAGAAAAATGGTAAGGGTATCAAAGCTAGCCCTACTACT
GTTGCTGCCAAGAAGGGCAAGGCCAAAGGAATTAGTTCCTCGAGGAAGCTTGATGTTGAGGAGATGACTCTCAAAGTTGGAACAGGAGATGTCGTCTCAGCAGTCGCATG
TTCATTTCACGAAATGGTGTTAACATTTGTTCTACAAAACTTGAAAACAGCTTATACGCGTTAA
Protein sequenceShow/hide protein sequence
MDSSISVAFELTHIRSELDLGYPHLGELDLVPRLGVDVKLVMTGNVGFLGKVNGFVLTEECPQAPLPNAARASRDAYDRWIKANDKAKVYILASISDVLAKKHENMVNAQ
EIMDSLRDMFGQPSIQARHEALKFIYNSRMKEGTSMQEHVLNLMVHFNVAEMNGAVIDEGSQGQEGEANVATSNRFHRGSTSGTKFMPSFRSMTFKKKKNGKGIKASPTT
VAAKKGKAKGISSSRKLDVEEMTLKVGTGDVVSAVACSFHEMVLTFVLQNLKTAYTR