CuGenDBv2

Moc07g21140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g21140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:15403240..15404289
RNA-Seq Expression: Moc07g21140
Synteny: Moc07g21140
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005162 - Retrotransposon gag domain
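
As a quick sanity check on the genome location above, the length of the annotated span can be computed from the coordinate string. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes both endpoints are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location: str) -> int:
    """Number of bases covered by the location."""
    # Assumption: both endpoints are 1-based and inclusive.
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr7:15403240..15404289"))  # 1050
```

Under that assumption the span covers 1050 bases, a figure that can be compared with the length of the CDS sequence listed under Sequences.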


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] (e value: 5.7e-101; % identity: 78.8)
Query:  MRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNP
        MRTQM +ME+MY+EM+ AAGA SRSENRV R  + EQRG HLGPV++ HPE  + E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQ ESS NP
Subjt:  MRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNP

Query:  ATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
         TP  VITREEFDQL+ + DAQVEALKAKCE+KE   +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  ATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        IALTGS RLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIR
Subjt:  IALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia] (e value: 9.7e-85; % identity: 93.3)
Query:  KRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQ ESS NPATPA VITREEFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT S RLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIR
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia] (e value: 6.5e-81; % identity: 79.4)
Query:  DNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV
        + + E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQ ESS NP TP RVITREEFDQL+ + DAQVEALKA CE+KE   +DGDLGE PFT D+
Subjt:  DNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGS RLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIR
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia] (e value: 3.9e-86; % identity: 69.17)
Query:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESS
        MEAMRTQMR+MEEMYN+M+  AGA SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQ ESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESS

Query:  RNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P  VITREEF+QL+ + DAQVEALK +CE+KE   +DGDLGESPFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        AFQIALTGS RLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIR
Subjt:  AFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] (e value: 6.7e-118; % identity: 68.48)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPA
        RTQM +MEEMYNEM+ A GAGSRSE+R  R                             +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQ ESS NP 
Subjt:  RTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPA

Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
         P  VITREEFDQL+ + DAQVE LKA+CE K    +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQI
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        ALT S RLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIR
Subjt:  ALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DDS5 uncharacterized protein LOC111019842 (e value: 4.7e-85; % identity: 93.3)
Query:  KRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQ ESS NPATPA VITREEFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT S RLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIR
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

A0A6J1DDW5 uncharacterized protein LOC111019634 (e value: 2.7e-101; % identity: 78.8)
Query:  MRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNP
        MRTQM +ME+MY+EM+ AAGA SRSENRV R  + EQRG HLGPV++ HPE  + E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQ ESS NP
Subjt:  MRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNP

Query:  ATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
         TP  VITREEFDQL+ + DAQVEALKAKCE+KE   +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  ATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        IALTGS RLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIR
Subjt:  IALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

A0A6J1DM55 uncharacterized protein LOC111022267 (e value: 3.2e-81; % identity: 79.4)
Query:  DNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV
        + + E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQ ESS NP TP RVITREEFDQL+ + DAQVEALKA CE+KE   +DGDLGE PFT D+
Subjt:  DNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGS RLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIR
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

A0A6J1DPN4 uncharacterized protein LOC111023060 (e value: 1.9e-86; % identity: 69.17)
Query:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESS
        MEAMRTQMR+MEEMYN+M+  AGA SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQ ESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESS

Query:  RNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P  VITREEF+QL+ + DAQVEALK +CE+KE   +DGDLGESPFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        AFQIALTGS RLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIR
Subjt:  AFQIALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

A0A6J1DZJ1 uncharacterized protein LOC111025738 (e value: 3.2e-118; % identity: 68.48)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPA
        RTQM +MEEMYNEM+ A GAGSRSE+R  R                             +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQ ESS NP 
Subjt:  RTQMRSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPA

Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI
         P  VITREEFDQL+ + DAQVE LKA+CE K    +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQI
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQI

Query:  ALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR
        ALT S RLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIR
Subjt:  ALTGSTRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGAGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAGGACAACAAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGAAGCTCCAACCAGCAGGTTGAATCCTCTCGCAACCCAGCAACTCCTGCACGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGTGCCTTTCAGATCGCGCTTACTGGCAGCACGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGACATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGATAG
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGAGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAGGACAACAAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGAAGCTCCAACCAGCAGGTTGAATCCTCTCGCAACCCAGCAACTCCTGCACGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGTGCCTTTCAGATCGCGCTTACTGGCAGCACGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGACATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGATAG
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTQMRSMEEM
YNEMILAAGAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNKSEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQVESSRNPATPARVITREEFDQLRGQLDA
QVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSTRLWYRRLPARSISTYSQLRREFLA
QFSSRHYDKKTATHLATIR
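
The CDS and protein sequences above can be cross-checked by translation. The sketch below is an illustration rather than anything provided by the database: it assumes the standard nuclear genetic code, the helper name `translate` is hypothetical, and it only translates the final line of the CDS, on the assumption that this line starts on a codon boundary (true if each of the nine preceding lines holds 110 nucleotides).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Final line of the CDS sequence as listed in this record.
cds_tail = "CAGTTCTCTTCTCGACATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGATAG"
print(translate(cds_tail))  # QFSSRHYDKKTATHLATIR*
```

The output reproduces the final line of the protein sequence, followed by `*` for the terminal TAG stop codon.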