CuGenDBv2

Moc09g30560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g30560
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr9:22991774..22994550
RNA-Seq Expression: Moc09g30560
Synteny: Moc09g30560
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049700.1 T4.5 [Cucumis melo var. makuwa] (e value: 6.9e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] (e value: 6.9e-52, % identity: 53.11)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKLFGFVDG+  CP      TS S++ST P   NP + DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] (e value: 6.9e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] (e value: 6.9e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia] (e value: 7.3e-54, % identity: 56.31)
Query:  SDSSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINAT
        +DSS    +LSSPIF LSNICNL+S+RLD+SNFVLWKFQLTAILKAHKL+GF+DGS   P   L S  + SSS  P A NPA S+WIAKDHALMT++NA 
Subjt:  SDSSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINAT

Query:  LSQTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTR
        LS +ALAY+VGC++S+                                        QR+KELKDKLANV V++D+EDL+IYTLN L  ++N FRTSM TR
Subjt:  LSQTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTR

Query:  AQPVSFKELHVLLTSEESAIEK
        +Q VSF+ELHVLL SEE+AI+K
Subjt:  AQPVSFKELHVLLTSEESAIEK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 (e value: 3.3e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 (e value: 3.3e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 (e value: 3.3e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

A0A5D3CLI6 T4.5 (e value: 3.3e-52, % identity: 52.28)
Query:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS
        SS    +  SPIF LSNICNLIS+RLD++NFVLWKFQLTAILKAHKL+GF+DG+  CPP    + + SS+ST P   NP++ DWIAKD ALMTVINATLS
Subjt:  SSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLS

Query:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ
          ALAY+VG  +SK                                        +R+KE+KDKLANVS  I++EDL+IY LNGL  +YN FRTSM TR+Q
Subjt:  QTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQ

Query:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS
        PV+F+ELHVLL +EESA+ KQ      Y+    V + S QS
Subjt:  PVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQS

A0A6J1E049 uncharacterized protein LOC111025150 (e value: 3.6e-54, % identity: 56.31)
Query:  SDSSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINAT
        +DSS    +LSSPIF LSNICNL+S+RLD+SNFVLWKFQLTAILKAHKL+GF+DGS   P   L S  + SSS  P A NPA S+WIAKDHALMT++NA 
Subjt:  SDSSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINAT

Query:  LSQTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTR
        LS +ALAY+VGC++S+                                        QR+KELKDKLANV V++D+EDL+IYTLN L  ++N FRTSM TR
Subjt:  LSQTALAYIVGCETSKS---------------------------------------QRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTR

Query:  AQPVSFKELHVLLTSEESAIEK
        +Q VSF+ELHVLL SEE+AI+K
Subjt:  AQPVSFKELHVLLTSEESAIEK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGGCATCTGATTCTTCTGGTGTTGCGACCGAATTGTCTTCTCCAATTTTCTGTTTGTCAAATATCTGCAACCTAATTTCAATTCGTCTTGATGCGTCAAACTTTGT
TCTATGGAAATTTCAGTTGACTGCAATCTTGAAGGCTCATAAGCTATTCGGATTTGTCGATGGCTCAGTGGTTTGTCCTCCTCTTCATCTATCATCTACTTCGGAGTCCT
CTTCCTCTACCACTCCTCTTGCGGTCAATCCAGCGCACAGTGATTGGATCGCCAAAGATCATGCCTTAATGACAGTAATCAATGCAACACTTTCTCAGACTGCTCTCGCT
TACATCGTTGGATGTGAAACCTCTAAAAGTCAAAGGGTCAAGGAACTTAAGGACAAGTTAGCGAATGTTTCAGTCGTCATAGATGATGAAGATTTGATCATATACACCTT
GAATGGTCTCTCTGCTGATTATAATATCTTTAGAACCTCCATGTGCACACGTGCTCAACCTGTTTCATTTAAAGAACTTCATGTTCTTCTAACTTCCGAGGAGTCTGCTA
TCGAGAAACAGATTACTGCTCCAACTGAATACTCTGGGGATGAAAATGTTGGTGTTGGAAGTGGTCAGTCCTCACCTATTGCTCATACAGATCCTATGCTTTCTACTAGT
CCTTCATCTCAATCCCCTATTTTTCATGCCATTTCCTATCCTAACCCAAGTGATATCCCAATTATCTCACAAAATTTGACTAATATTCCTCCCCCTGATTCTACACACCA
TCCTGATATGAGCAATAATACCACTGCCACTTCTGTCCTCAATTCTCATCCCATGCAGACCAGAAGCAAATCCGGCATATTCAAGAAGAAACTTCTCTAG
mRNA sequence
ATGGCGGCATCTGATTCTTCTGGTGTTGCGACCGAATTGTCTTCTCCAATTTTCTGTTTGTCAAATATCTGCAACCTAATTTCAATTCGTCTTGATGCGTCAAACTTTGT
TCTATGGAAATTTCAGTTGACTGCAATCTTGAAGGCTCATAAGCTATTCGGATTTGTCGATGGCTCAGTGGTTTGTCCTCCTCTTCATCTATCATCTACTTCGGAGTCCT
CTTCCTCTACCACTCCTCTTGCGGTCAATCCAGCGCACAGTGATTGGATCGCCAAAGATCATGCCTTAATGACAGTAATCAATGCAACACTTTCTCAGACTGCTCTCGCT
TACATCGTTGGATGTGAAACCTCTAAAAGTCAAAGGGTCAAGGAACTTAAGGACAAGTTAGCGAATGTTTCAGTCGTCATAGATGATGAAGATTTGATCATATACACCTT
GAATGGTCTCTCTGCTGATTATAATATCTTTAGAACCTCCATGTGCACACGTGCTCAACCTGTTTCATTTAAAGAACTTCATGTTCTTCTAACTTCCGAGGAGTCTGCTA
TCGAGAAACAGATTACTGCTCCAACTGAATACTCTGGGGATGAAAATGTTGGTGTTGGAAGTGGTCAGTCCTCACCTATTGCTCATACAGATCCTATGCTTTCTACTAGT
CCTTCATCTCAATCCCCTATTTTTCATGCCATTTCCTATCCTAACCCAAGTGATATCCCAATTATCTCACAAAATTTGACTAATATTCCTCCCCCTGATTCTACACACCA
TCCTGATATGAGCAATAATACCACTGCCACTTCTGTCCTCAATTCTCATCCCATGCAGACCAGAAGCAAATCCGGCATATTCAAGAAGAAACTTCTCTAG
Protein sequence
MAASDSSGVATELSSPIFCLSNICNLISIRLDASNFVLWKFQLTAILKAHKLFGFVDGSVVCPPLHLSSTSESSSSTTPLAVNPAHSDWIAKDHALMTVINATLSQTALA
YIVGCETSKSQRVKELKDKLANVSVVIDDEDLIIYTLNGLSADYNIFRTSMCTRAQPVSFKELHVLLTSEESAIEKQITAPTEYSGDENVGVGSGQSSPIAHTDPMLSTS
PSSQSPIFHAISYPNPSDIPIISQNLTNIPPPDSTHHPDMSNNTTATSVLNSHPMQTRSKSGIFKKKLL
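
The protein sequence above should correspond to a translation of the CDS sequence. A minimal Python sketch for checking this is shown below. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the `translate` function and the variable names are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove whitespace and line breaks, normalise to upper case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 66 nucleotides of the CDS sequence listed above.
cds_prefix = "ATGGCGGCATCTGATTCTTCTGGTGTTGCGACCGAATTGTCTTCTCCAATTTTCTGTTTGTCAAAT"
print(translate(cds_prefix))  # MAASDSSGVATELSSPIFCLSN
```

Passing the full 870-nucleotide CDS to `translate` and comparing the result with the protein sequence listed above is a quick consistency check on the record.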