; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g15540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g15540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr1:9880440..9884876
RNA-Seq ExpressionMoc01g15540
SyntenyMoc01g15540
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]1.3e-3750.54Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPPNSTYDEWIAKDHGLMTVINATLSPAALA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK F F+DG+ P P      ++T  P  N  Y++WIAKD  LMTVINATLSP ALA
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPPNSTYDEWIAKDHGLMTVINATLSPAALA

Query:  NV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
         V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  NV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.2e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]2.2e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]1.3e-3750.54Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPPNSTYDEWIAKDHGLMTVINATLSPAALA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK F F+DG+ P P      ++T  P  N  Y++WIAKD  LMTVINATLSP ALA
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPPNSTYDEWIAKDHGLMTVINATLSPAALA

Query:  NV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
         V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  NV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]7.5e-3851.3Show/hide
Query:  MAASSS--KDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLL----ELSTTEAPPPNSTYDEWIAKDHGLMTVINAT
        MA SS   KDL+S IFLLSNICNLVS+RLDSSNFVLWKFQL +ILKAHK + FIDGS P+P+  L    + S++  P  N  + EWIAKDH LMT++NA 
Subjt:  MAASSS--KDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLL----ELSTTEAPPPNSTYDEWIAKDHGLMTVINAT

Query:  LSPAALANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LS +ALA V                                 SI++KP  SI+ Y+QR+KELK KL+NV V +D+EDLLIYTLN L  +++ F
Subjt:  LSPAALANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.1e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.1e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.1e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

A0A5D3CLI6 T4.51.1e-3751.06Show/hide
Query:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA
        ++S+ KD  S IFLLSNICNL+S+RLDS+NFVLWKFQL +ILKAHK + FIDG+ P P      S+T   PP  N +Y++WIAKD  LMTVINATLSP A
Subjt:  AASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPP--NSTYDEWIAKDHGLMTVINATLSPAA

Query:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LA V                                 +I +KP ESI+ YI+R+KE+K KL+NVS  I++EDLLIY LNGL ++Y+ F
Subjt:  LANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

A0A6J1E049 uncharacterized protein LOC1110251503.6e-3851.3Show/hide
Query:  MAASSS--KDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLL----ELSTTEAPPPNSTYDEWIAKDHGLMTVINAT
        MA SS   KDL+S IFLLSNICNLVS+RLDSSNFVLWKFQL +ILKAHK + FIDGS P+P+  L    + S++  P  N  + EWIAKDH LMT++NA 
Subjt:  MAASSS--KDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLL----ELSTTEAPPPNSTYDEWIAKDHGLMTVINAT

Query:  LSPAALANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF
        LS +ALA V                                 SI++KP  SI+ Y+QR+KELK KL+NV V +D+EDLLIYTLN L  +++ F
Subjt:  LSPAALANV---------------------------------SITRKPSESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.6e-0444.74Show/hide
Query:  DSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLEL
        D  N+V WK +  S L+  K+F FIDG++P+P P   L
Subjt:  DSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCATCCTCCTCGAAGGATCTTACATCTCAAATCTTTCTTTTATCAAATATCTGTAACCTAGTATCCATTCGACTGGATTCCTCGAATTTCGTTCTATGGAAGTT
CCAATTGAACTCCATCTTGAAAGCTCATAAACAGTTCGACTTTATTGATGGCTCAATTCCTCGACCATCTCCTCTTCTGGAATTATCTACTACGGAAGCTCCTCCACCAA
ATTCGACGTATGATGAATGGATTGCCAAAGATCATGGTCTCATGACTGTTATTAATGCTACACTTTCACCTGCTGCTCTTGCGAATGTCTCTATTACCAGGAAGCCTTCA
GAATCCATTAATCAGTACATTCAGCGTGTTAAGGAACTTAAGTACAAATTATCGAATGTTTCAGTTCAAATTGATGATGAAGATTTGCTCATCTATACATTGAATGGTCT
TACTTCTGATTATAGCATCTTCGTACATCAATGCATACTTGATCACAGCTGGTTAGCTTTGAAGAACTTCATATTCTCCTTGTCTCAAAAGAAGCCACTCCTGACAAACA
GTTCCTCTTATGGTGTACCTGGTCGCGACCGTGGATCTTTTGGCACGATTCTCTTCCGCGACCCTAGCATCAACGGGTTATATCCCATTCCTTCCATTGCTCGTGTCTCG
TCTTCCTCAACTTCTACTCCTGCTCTTGCTCATGTTGCAACACCCGTGTCCTCTATTGTCTGGCATAATTGCATAGGACATCCCAGTAACTCCACTCTCAACTCTGTTCT
TCAGCTTTTACATTTTCCTTCTTGTAAATCTTCTGCTTGTAGTTGTAAACATTGTAAAGGATCTTCTTACGTACAAGAAGATACACAAGACTTTGAGGGTAGACCAGCAG
AGATGACAAACAAGGATTTGAATGAGATGGATGAGCAGGCCGTTGTGAACGTCAGAATGTCGTTGTCGATGAATGTTTGTAGTCTGGTGGCGAAAGAGACTATAACAAAG
AAATTGTTAAAGGTCTTGCAAGACAGGCCACAAGAGATCTTCTGTTATGTGTCTGAGTTTGAGGTTGCTAGGGGATTTGAGAGACATAGGATGCATAGAGTAGCTGCAGA
TGGTTCAGGGCGAGACTTGAAAGAATCAGCATCATTGACAACCAGGACAGATAAGAAGAATATGCCATCAGTTCAAGTACAACAGCTGGGAAGTAGAGGAAAGGGTAAGG
AGAACAACTCAGCGAGGTGTTCAACAGGTTGTCGGTATAATACCCCAATTGTCAGACGAATGAGCGAGCTGATGAAGTCGCACAGGCATAGTGCATTGAAGGAGAAAACT
ATAGTTGGTGCTGAAGTCAAGGGTAATGTCTCTAGAAAGGCAACAAACTTGGTTGAGAGCGCCAAGTCATCAAGGGAATCTTCCTTCAGAGGTCGTTGGCGTCTGGGGAA
CAAACCACGTAGGATTGCTCAGTCTCAGGGCAATCACAGACAGAGCTTGGGTAAGGCCGGGCAGTGTAGATCAGTTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCATCCTCCTCGAAGGATCTTACATCTCAAATCTTTCTTTTATCAAATATCTGTAACCTAGTATCCATTCGACTGGATTCCTCGAATTTCGTTCTATGGAAGTT
CCAATTGAACTCCATCTTGAAAGCTCATAAACAGTTCGACTTTATTGATGGCTCAATTCCTCGACCATCTCCTCTTCTGGAATTATCTACTACGGAAGCTCCTCCACCAA
ATTCGACGTATGATGAATGGATTGCCAAAGATCATGGTCTCATGACTGTTATTAATGCTACACTTTCACCTGCTGCTCTTGCGAATGTCTCTATTACCAGGAAGCCTTCA
GAATCCATTAATCAGTACATTCAGCGTGTTAAGGAACTTAAGTACAAATTATCGAATGTTTCAGTTCAAATTGATGATGAAGATTTGCTCATCTATACATTGAATGGTCT
TACTTCTGATTATAGCATCTTCGTACATCAATGCATACTTGATCACAGCTGGTTAGCTTTGAAGAACTTCATATTCTCCTTGTCTCAAAAGAAGCCACTCCTGACAAACA
GTTCCTCTTATGGTGTACCTGGTCGCGACCGTGGATCTTTTGGCACGATTCTCTTCCGCGACCCTAGCATCAACGGGTTATATCCCATTCCTTCCATTGCTCGTGTCTCG
TCTTCCTCAACTTCTACTCCTGCTCTTGCTCATGTTGCAACACCCGTGTCCTCTATTGTCTGGCATAATTGCATAGGACATCCCAGTAACTCCACTCTCAACTCTGTTCT
TCAGCTTTTACATTTTCCTTCTTGTAAATCTTCTGCTTGTAGTTGTAAACATTGTAAAGGATCTTCTTACGTACAAGAAGATACACAAGACTTTGAGGGTAGACCAGCAG
AGATGACAAACAAGGATTTGAATGAGATGGATGAGCAGGCCGTTGTGAACGTCAGAATGTCGTTGTCGATGAATGTTTGTAGTCTGGTGGCGAAAGAGACTATAACAAAG
AAATTGTTAAAGGTCTTGCAAGACAGGCCACAAGAGATCTTCTGTTATGTGTCTGAGTTTGAGGTTGCTAGGGGATTTGAGAGACATAGGATGCATAGAGTAGCTGCAGA
TGGTTCAGGGCGAGACTTGAAAGAATCAGCATCATTGACAACCAGGACAGATAAGAAGAATATGCCATCAGTTCAAGTACAACAGCTGGGAAGTAGAGGAAAGGGTAAGG
AGAACAACTCAGCGAGGTGTTCAACAGGTTGTCGGTATAATACCCCAATTGTCAGACGAATGAGCGAGCTGATGAAGTCGCACAGGCATAGTGCATTGAAGGAGAAAACT
ATAGTTGGTGCTGAAGTCAAGGGTAATGTCTCTAGAAAGGCAACAAACTTGGTTGAGAGCGCCAAGTCATCAAGGGAATCTTCCTTCAGAGGTCGTTGGCGTCTGGGGAA
CAAACCACGTAGGATTGCTCAGTCTCAGGGCAATCACAGACAGAGCTTGGGTAAGGCCGGGCAGTGTAGATCAGTTCGTTGA
Protein sequenceShow/hide protein sequence
MAASSSKDLTSQIFLLSNICNLVSIRLDSSNFVLWKFQLNSILKAHKQFDFIDGSIPRPSPLLELSTTEAPPPNSTYDEWIAKDHGLMTVINATLSPAALANVSITRKPS
ESINQYIQRVKELKYKLSNVSVQIDDEDLLIYTLNGLTSDYSIFVHQCILDHSWLALKNFIFSLSQKKPLLTNSSSYGVPGRDRGSFGTILFRDPSINGLYPIPSIARVS
SSSTSTPALAHVATPVSSIVWHNCIGHPSNSTLNSVLQLLHFPSCKSSACSCKHCKGSSYVQEDTQDFEGRPAEMTNKDLNEMDEQAVVNVRMSLSMNVCSLVAKETITK
KLLKVLQDRPQEIFCYVSEFEVARGFERHRMHRVAADGSGRDLKESASLTTRTDKKNMPSVQVQQLGSRGKGKENNSARCSTGCRYNTPIVRRMSELMKSHRHSALKEKT
IVGAEVKGNVSRKATNLVESAKSSRESSFRGRWRLGNKPRRIAQSQGNHRQSLGKAGQCRSVR