; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g01000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g01000
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr2:812195..813559
RNA-Seq ExpressionMoc02g01000
SyntenyMoc02g01000
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148135.1 uncharacterized protein LOC111016888 [Momordica charantia]3.1e-5296.23Show/hide
Query:  MGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCN
        MGCDGLTGQPNDEKLQ MVQSSGTNDVKEGEVFD KKELSLRMHLVAMRLNFQFK+KKSTPELYILRCVDTSCTWRLRATKLRDCN+FKIKKYYSIHTCN
Subjt:  MGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCN

Query:  GGVLKQ
        GGVLKQ
Subjt:  GGVLKQ

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]3.2e-7093.24Show/hide
Query:  EEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKK
        EEG Y+AEFVNDDYDDALDEESEPDVEQVHAEI RDEAAVQQMGCDGLTGQ N E LQL+VQSSGTNDVKEGEVFDTKKELSLRMHLV MRLNFQFK+KK
Subjt:  EEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKK

Query:  STPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
        STPELYIL CVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
Subjt:  STPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia]2.3e-9286.36Show/hide
Query:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG
        GHDIAGLTPLESDVVPC LGDDRVC WN+PGL NDNQDESDESYD LG+SEEG Y+AEF+NDDYDDA DE+ EPDVEQV  EIRRDE  V QMGCDGL G
Subjt:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG

Query:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
        QPNDEKLQL+VQSSGTNDVKEG+VFDTKKELSLR HLVAM LNFQFK+KKSTPELYILRCVD+SCTWRLRA KL DCNLFKIKKYYSIHTCNG VLKQ
Subjt:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.4e-5263.48Show/hide
Query:  PGRPKRVLELYAHSLYSPIPETHRFGHDIAGLT----PLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEE
        PG  K   +L  H  +      H    +IA         + +VVPC LGDDRVCDW+VPG+ NDN+DES ESYDPL ES+EGH  AE+ N+++DDALD+E
Subjt:  PGRPKRVLELYAHSLYSPIPETHRFGHDIAGLT----PLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEE

Query:  SEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIK
         EPDVEQVH EIRRDE AV+  GC+GLTG PNDEKLQL+VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFKI+
Subjt:  SEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIK

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia]1.8e-8176.14Show/hide
Query:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG
        GHD+ GLTPL SDVVPC LGDDRVCDW+VPG+ NDN+DES ESYDPL  SEEGH  AE+ N+++DDALD+E E DVEQVH EIRRDE AV+  GC+GLTG
Subjt:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG

Query:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLK
         PNDEKLQL+VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFK+KKSTP+LYILRCV   CTWRLRATKL++C LFKIKKY + HTC GG LK
Subjt:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLK

TrEMBL top hitse value%identityAlignment
A0A6J1D234 uncharacterized protein LOC1110168881.5e-5296.23Show/hide
Query:  MGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCN
        MGCDGLTGQPNDEKLQ MVQSSGTNDVKEGEVFD KKELSLRMHLVAMRLNFQFK+KKSTPELYILRCVDTSCTWRLRATKLRDCN+FKIKKYYSIHTCN
Subjt:  MGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCN

Query:  GGVLKQ
        GGVLKQ
Subjt:  GGVLKQ

A0A6J1DJT1 uncharacterized protein LOC1110207151.6e-7093.24Show/hide
Query:  EEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKK
        EEG Y+AEFVNDDYDDALDEESEPDVEQVHAEI RDEAAVQQMGCDGLTGQ N E LQL+VQSSGTNDVKEGEVFDTKKELSLRMHLV MRLNFQFK+KK
Subjt:  EEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKK

Query:  STPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
        STPELYIL CVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
Subjt:  STPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ

A0A6J1DP00 uncharacterized protein LOC1110229541.1e-9286.36Show/hide
Query:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG
        GHDIAGLTPLESDVVPC LGDDRVC WN+PGL NDNQDESDESYD LG+SEEG Y+AEF+NDDYDDA DE+ EPDVEQV  EIRRDE  V QMGCDGL G
Subjt:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG

Query:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ
        QPNDEKLQL+VQSSGTNDVKEG+VFDTKKELSLR HLVAM LNFQFK+KKSTPELYILRCVD+SCTWRLRA KL DCNLFKIKKYYSIHTCNG VLKQ
Subjt:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQ

A0A6J1DQB9 Reverse transcriptase6.6e-5363.48Show/hide
Query:  PGRPKRVLELYAHSLYSPIPETHRFGHDIAGLT----PLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEE
        PG  K   +L  H  +      H    +IA         + +VVPC LGDDRVCDW+VPG+ NDN+DES ESYDPL ES+EGH  AE+ N+++DDALD+E
Subjt:  PGRPKRVLELYAHSLYSPIPETHRFGHDIAGLT----PLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEE

Query:  SEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIK
         EPDVEQVH EIRRDE AV+  GC+GLTG PNDEKLQL+VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFKI+
Subjt:  SEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIK

A0A6J1DTG5 uncharacterized protein LOC1110238438.9e-8276.14Show/hide
Query:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG
        GHD+ GLTPL SDVVPC LGDDRVCDW+VPG+ NDN+DES ESYDPL  SEEGH  AE+ N+++DDALD+E E DVEQVH EIRRDE AV+  GC+GLTG
Subjt:  GHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDPLGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTG

Query:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLK
         PNDEKLQL+VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFK+KKSTP+LYILRCV   CTWRLRATKL++C LFKIKKY + HTC GG LK
Subjt:  QPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKSTPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAATCCCGGATCGAAAGATGGTTCGGGAGTGAACCGGGATACATCCCGGACCAATAGAAGCACTGTGGTCCGGGATGTGTCTCGGACCAAGACTGGTCCG
GGACGGCCAAAACGTGTGCTCGAGTTGTATGCCCACAGCTTGTATTCCCCCATTCCAGAGACCCACAGATTCGGTCATGATATAGCTGGTTTAACACCATTGGAA
TCAGATGTTGTTCCATGTAAGCTAGGAGATGACAGGGTATGTGATTGGAATGTGCCGGGATTATTGAATGATAATCAAGATGAAAGTGATGAATCATATGACCCG
TTGGGAGAGTCAGAAGAAGGACACTATGATGCGGAATTTGTGAATGATGACTATGACGATGCACTTGATGAAGAGTCTGAGCCCGATGTGGAACAGGTACATGCT
GAGATTCGTAGGGATGAAGCAGCCGTTCAACAAATGGGGTGTGATGGTCTCACTGGGCAGCCTAATGATGAGAAGTTGCAACTCATGGTACAGTCTTCTGGAACA
AATGATGTTAAGGAGGGCGAAGTATTTGATACGAAGAAGGAGTTGAGTTTGAGAATGCATTTAGTTGCAATGCGGCTGAATTTTCAGTTTAAAATAAAAAAGTCG
ACACCGGAACTATATATACTACGCTGCGTTGATACTAGTTGCACCTGGAGACTTCGAGCTACAAAGTTGAGGGACTGCAATCTGTTCAAGATAAAAAAATACTAT
AGCATCCATACATGCAATGGTGGAGTTTTGAAACAGATCATAGGCAAGCCAAAAGTTGGGTGGTCGGACATCTTGTCCAAGCGAAGTTTACAGACGTCTCCCGCA
CGTATAGACCGAAGGACATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAATCCCGGATCGAAAGATGGTTCGGGAGTGAACCGGGATACATCCCGGACCAATAGAAGCACTGTGGTCCGGGATGTGTCTCGGACCAAGACTGGTCCG
GGACGGCCAAAACGTGTGCTCGAGTTGTATGCCCACAGCTTGTATTCCCCCATTCCAGAGACCCACAGATTCGGTCATGATATAGCTGGTTTAACACCATTGGAA
TCAGATGTTGTTCCATGTAAGCTAGGAGATGACAGGGTATGTGATTGGAATGTGCCGGGATTATTGAATGATAATCAAGATGAAAGTGATGAATCATATGACCCG
TTGGGAGAGTCAGAAGAAGGACACTATGATGCGGAATTTGTGAATGATGACTATGACGATGCACTTGATGAAGAGTCTGAGCCCGATGTGGAACAGGTACATGCT
GAGATTCGTAGGGATGAAGCAGCCGTTCAACAAATGGGGTGTGATGGTCTCACTGGGCAGCCTAATGATGAGAAGTTGCAACTCATGGTACAGTCTTCTGGAACA
AATGATGTTAAGGAGGGCGAAGTATTTGATACGAAGAAGGAGTTGAGTTTGAGAATGCATTTAGTTGCAATGCGGCTGAATTTTCAGTTTAAAATAAAAAAGTCG
ACACCGGAACTATATATACTACGCTGCGTTGATACTAGTTGCACCTGGAGACTTCGAGCTACAAAGTTGAGGGACTGCAATCTGTTCAAGATAAAAAAATACTAT
AGCATCCATACATGCAATGGTGGAGTTTTGAAACAGATCATAGGCAAGCCAAAAGTTGGGTGGTCGGACATCTTGTCCAAGCGAAGTTTACAGACGTCTCCCGCA
CGTATAGACCGAAGGACATAA
Protein sequenceShow/hide protein sequence
MFNPGSKDGSGVNRDTSRTNRSTVVRDVSRTKTGPGRPKRVLELYAHSLYSPIPETHRFGHDIAGLTPLESDVVPCKLGDDRVCDWNVPGLLNDNQDESDESYDP
LGESEEGHYDAEFVNDDYDDALDEESEPDVEQVHAEIRRDEAAVQQMGCDGLTGQPNDEKLQLMVQSSGTNDVKEGEVFDTKKELSLRMHLVAMRLNFQFKIKKS
TPELYILRCVDTSCTWRLRATKLRDCNLFKIKKYYSIHTCNGGVLKQIIGKPKVGWSDILSKRSLQTSPARIDRRT