; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g36580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g36580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:27462271..27463352
RNA-Seq ExpressionMoc04g36580
SyntenyMoc04g36580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.0e-7295.3Show/hide
Query:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        VITR+EFDQLRGQLDAQVEALKAKCEQKEGPLN GDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTG
Subjt:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEG
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]4.8e-7395.97Show/hide
Query:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        +ITR+EFDQLRGQLDAQ EALKAKCEQKEGPLN GDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.1e-7755.59Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMILAAGAGSPLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQ
                                                              E+S N  T           VITR+EFDQL+ + DAQVEALKA+CE+
Subjt:  RTKMRSMEEMYNEMILAAGAGSPLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE   + GDLGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++Q
Subjt:  KEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEG
        FSSRHYD+KT THLATIRQKEG
Subjt:  FSSRHYDKKTATHLATIRQKEG

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]5.9e-7187.8Show/hide
Query:  NTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAA
        +TS+ R   FS +  +ITR+EFDQLRG+LDAQVEALKAKCEQK+  LN GDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAA
Subjt:  NTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAA

Query:  SDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SDAIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
Subjt:  SDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.0e-9964.92Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMILAAGAGS----PLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKA
        RT+M +MEEMYNEM+ A GAGS      A   +G      SR+      + R      + S N   E   +    + VITR+EFDQL+ + DAQVE LKA
Subjt:  RTKMRSMEEMYNEMILAAGAGS----PLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKA

Query:  KCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRE
        +CE K    + GDLGES FTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPARSISTYSQLR+E
Subjt:  KCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRE

Query:  FLAQFSSRHYDKKTATHLATIRQKE
        F +QFSSRHY++KTATHLATIRQKE
Subjt:  FLAQFSSRHYDKKTATHLATIRQKE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.4e-7295.3Show/hide
Query:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        VITR+EFDQLRGQLDAQVEALKAKCEQKEGPLN GDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTG
Subjt:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEG
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

A0A6J1CKB3 uncharacterized protein LOC1110120812.3e-7395.97Show/hide
Query:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        +ITR+EFDQLRGQLDAQ EALKAKCEQKEGPLN GDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  VITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

A0A6J1DHB3 uncharacterized protein LOC1110204795.4e-7855.59Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAH P+ SKA                                  
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMILAAGAGSPLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQ
                                                              E+S N  T           VITR+EFDQL+ + DAQVEALKA+CE+
Subjt:  RTKMRSMEEMYNEMILAAGAGSPLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE   + GDLGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++Q
Subjt:  KEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEG
        FSSRHYD+KT THLATIRQKEG
Subjt:  FSSRHYDKKTATHLATIRQKEG

A0A6J1DS95 uncharacterized protein LOC1110234212.8e-7187.8Show/hide
Query:  NTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAA
        +TS+ R   FS +  +ITR+EFDQLRG+LDAQVEALKAKCEQK+  LN GDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAA
Subjt:  NTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAA

Query:  SDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
        SDAIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG
Subjt:  SDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEG

A0A6J1DZJ1 uncharacterized protein LOC1110257381.4e-9964.92Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSMEEMYNEMILAAGAGS----PLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKA
        RT+M +MEEMYNEM+ A GAGS      A   +G      SR+      + R      + S N   E   +    + VITR+EFDQL+ + DAQVE LKA
Subjt:  RTKMRSMEEMYNEMILAAGAGS----PLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKA

Query:  KCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRE
        +CE K    + GDLGES FTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPARSISTYSQLR+E
Subjt:  KCEQKEGPLNGGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRRE

Query:  FLAQFSSRHYDKKTATHLATIRQKE
        F +QFSSRHY++KTATHLATIRQKE
Subjt:  FLAQFSSRHYDKKTATHLATIRQKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCCTTTGGCATACGCGGGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAATGAGAGC
GAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATTTCTCCGAAAAGGACAGAGTGATTACAAGGGACGAGTTCGACCAGCTGA
GGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAGGAAGGTCCACTGAACGGTGGCGACTTGGGAGAATCGTCGTTCACCTCGGACGTTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGA
GAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGACATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGGAGACGCTGCGAGAATATGTCACCA
GATTCCAGGAGGAGCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCCTTTGGCATACGCGGGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAATGAGAGC
GAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATTTCTCCGAAAAGGACAGAGTGATTACAAGGGACGAGTTCGACCAGCTGA
GGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAGGAAGGTCCACTGAACGGTGGCGACTTGGGAGAATCGTCGTTCACCTCGGACGTTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGC
GGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGA
GAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGACATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGGAGACGCTGCGAGAATATGTCACCA
GATTCCAGGAGGAGCAATTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEM
YNEMILAAGAGSPLAYAGKGVPTSAQSRRNIPKTMRARDTLAREETSVNTSTEREAHFSEKDRVITRDEFDQLRGQLDAQVEALKAKCEQKEGPLNGGDLGESSFTSDVL
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGRRCENMSP
DSRRSN