; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g03310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g03310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:2523710..2524747
RNA-Seq ExpressionMoc03g03310
SyntenyMoc03g03310
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]6.8e-9977.2Show/hide
Query:  MRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN-
        MRTQM +ME+MY+EM+ AAGA SRSENRV R D+ EQ G HLGP ++  PE  E E YT Q GDLREHLNRKR SSLRKGQSPSCSHR+SN+QAESS+N 
Subjt:  MRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN-

Query:  --PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  TALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
         ALTGSARLWYRRLPARSISTYSQLR+EF+ QF SRHYD+KTATHL TIR
Subjt:  TALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]3.5e-7988.83Show/hide
Query:  KRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGT
        +RGSSLRKGQSPS SHRSSN+QAESSHN   P G+ITREEFDQLR +LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        +DPKDYVEVFEGLMDFQAASD IKCRAFQ ALT SARLWYRRLPARSISTYSQLRREFLAQF SRHYDK+TATHLATIR
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]3.7e-7677.16Show/hide
Query:  ESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLE
        E E YT Q GDLREHLNRKR SSLRKGQSPS SHR+SN+QAESS+N   P  +ITREEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+LE
Subjt:  ESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLE

Query:  APIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        API  KFK PT+KPYDG+K+PKDYV+VFEGLM+FQAA+DAIKCRAFQ A TGSARLWYRRLPARSISTYSQLR+EF++QF SR+YD+KTATHLATIR
Subjt:  APIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]1.9e-8568.38Show/hide
Query:  MEAMRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESS
        MEAMRTQM +MEEMYN+M+  AGA SRS ++V   DV EQG  H  P +EE             GGDLR+HLNRKR SS R  ++ +  H++SN+QAESS
Subjt:  MEAMRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESS

Query:  HN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCR
        +N   P G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        AFQ ALTGSARLWYRRLPARSISTYSQLR+EF++QFFSRHYD+KT THLATIR
Subjt:  AFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]8.6e-11868.39Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGASKKGARNPAPAPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGGAS++     APAP+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGASKKGARNPAPAPTSENFDALKREMEAMR

Query:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHNPT-
        TQM +MEEMYNEM+ A GAGSRSE+R  R               +ER             GDLR+HL+RKR SSLRKG+SPSCSH++SN+QAESS+NP  
Subjt:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHNPT-

Query:  --GIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTA
          G+ITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IPSKFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQ A
Subjt:  --GIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        LT SARLWYRRLPARSISTYSQLR+EF +QF SRHY++KTATHLATIR
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

TrEMBL top hitse value%identityAlignment
A0A6J1DDS5 uncharacterized protein LOC1110198421.7e-7988.83Show/hide
Query:  KRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGT
        +RGSSLRKGQSPS SHRSSN+QAESSHN   P G+ITREEFDQLR +LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        +DPKDYVEVFEGLMDFQAASD IKCRAFQ ALT SARLWYRRLPARSISTYSQLRREFLAQF SRHYDK+TATHLATIR
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

A0A6J1DDW5 uncharacterized protein LOC1110196343.3e-9977.2Show/hide
Query:  MRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN-
        MRTQM +ME+MY+EM+ AAGA SRSENRV R D+ EQ G HLGP ++  PE  E E YT Q GDLREHLNRKR SSLRKGQSPSCSHR+SN+QAESS+N 
Subjt:  MRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN-

Query:  --PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  TALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
         ALTGSARLWYRRLPARSISTYSQLR+EF+ QF SRHYD+KTATHL TIR
Subjt:  TALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

A0A6J1DM55 uncharacterized protein LOC1110222671.8e-7677.16Show/hide
Query:  ESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLE
        E E YT Q GDLREHLNRKR SSLRKGQSPS SHR+SN+QAESS+N   P  +ITREEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+LE
Subjt:  ESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLE

Query:  APIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        API  KFK PT+KPYDG+K+PKDYV+VFEGLM+FQAA+DAIKCRAFQ A TGSARLWYRRLPARSISTYSQLR+EF++QF SR+YD+KTATHLATIR
Subjt:  APIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

A0A6J1DPN4 uncharacterized protein LOC1110230609.4e-8668.38Show/hide
Query:  MEAMRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESS
        MEAMRTQM +MEEMYN+M+  AGA SRS ++V   DV EQG  H  P +EE             GGDLR+HLNRKR SS R  ++ +  H++SN+QAESS
Subjt:  MEAMRTQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESS

Query:  HN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCR
        +N   P G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---PTGIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        AFQ ALTGSARLWYRRLPARSISTYSQLR+EF++QFFSRHYD+KT THLATIR
Subjt:  AFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

A0A6J1DZJ1 uncharacterized protein LOC1110257384.2e-11868.39Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGASKKGARNPAPAPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGGAS++     APAP+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGASKKGARNPAPAPTSENFDALKREMEAMR

Query:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHNPT-
        TQM +MEEMYNEM+ A GAGSRSE+R  R               +ER             GDLR+HL+RKR SSLRKG+SPSCSH++SN+QAESS+NP  
Subjt:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHNPT-

Query:  --GIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTA
          G+ITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IPSKFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQ A
Subjt:  --GIITREEFDQLRRELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR
        LT SARLWYRRLPARSISTYSQLR+EF +QF SRHY++KTATHLATIR
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACAACAGACCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGGCCTCTA
AGAAGGGCGCCCGGAATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCACTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAGGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAAACAACGAGAGCGAGGGGTACACTTGCCAGGGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCTGCT
CACACAGGAGCTCCAACCGGCAGGCTGAATCCTCTCACAATCCCACAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGAAGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTTCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGACCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTTTTCT
CGACACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACAACAGACCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGGCCTCTA
AGAAGGGCGCCCGGAATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCACTCCATGGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAGGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAAACAACGAGAGCGAGGGGTACACTTGCCAGGGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCTGCT
CACACAGGAGCTCCAACCGGCAGGCTGAATCCTCTCACAATCCCACAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGAAGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTTCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGACCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTTTTCT
CGACACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGTAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGASKKGARNPAPAPTSENFDALKREMEAMRTQMHSMEEMY
NEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNESEGYTCQGGDLREHLNRKRGSSLRKGQSPSCSHRSSNRQAESSHNPTGIITREEFDQLRRELDAQVEA
LKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPSKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQTALTGSARLWYRRLPARSISTYSQLRREFLAQFFS
RHYDKKTATHLATIR