CuGenDBv2

Moc04g15100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g15100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr4:11468718..11469503
RNA-Seq Expression: Moc04g15100
Synteny: Moc04g15100
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]; e value 1.1e-43; identity 55.19%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNSINTWAE
        MNR  QDPPPPQNPPVNGD+AGEGAANRAGEIPN ILL DNR+VAMRNYVT A HNLNS INN LPQAAQ ELKPVMF M              + T  +
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDIN
                                F    NE      + F E+   F   G+ E  +  +    L+ SSRMM NT ANGSLLEKS+NEIVDILNKM DIN
Subjt:  LTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDIN

Query:  DQGEIGRSLPKK
        DQGE GRSL KK
Subjt:  DQGEIGRSLPKK

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia]; e value 1.5e-32; identity 38.5%
Query:  IPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM------------------------------------DGVRTR---------
        +PNPI + D ++ AMR+Y    L +LNS + NPLP  AQFE KP+M QM                                    D +R           
Subjt:  IPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM------------------------------------DGVRTR---------

Query:  ----LNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLL
            LNA    +I TW+++ +KFL KY   T NA++ E+I+SFRQKENEAV    EHFK+L+   P+ G+P CV IE F+R  +  ++MM N  ANG   
Subjt:  ----LNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLL

Query:  EKSINEIVDILNKMTDINDQGEIGRS
         KS NEIV+IL+++++ NDQ    RS
Subjt:  EKSINEIVDILNKMTDINDQGEIGRS

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]; e value 1.2e-56; identity 61.01%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNS-INTWA
        MN   QDPP P NPPV+GD AGEGAANRAGE+PNPILL DNR+VA+RNYVTHA HNLNS + +                 DG   R +  +P S + ++ 
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNS-INTWA

Query:  ELTEKFLAKYHT-----LTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILN
        E+   F     +     L MNA+L EDIVSFRQKENEAVQE  E FKELL R  SHGLP CV IEQFYR L+  SRMM NT AN SL EKSI+EI+DILN
Subjt:  ELTEKFLAKYHT-----LTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILN

Query:  KMTDINDQGEIGRSLPKK
        KMTD NDQGEIGRSLPKK
Subjt:  KMTDINDQGEIGRSLPKK

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]; e value 2.2e-87; identity 68.2%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM--------------------
        MNR  QDPPPPQNPPVNGD+AGE AANR GEIPN ILL DNR+VAMRNYVTHA HNLNS INNPLPQAAQFELKPVMFQ+                    
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM--------------------

Query:  -----------------------------DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHG
                                     DG RT +NALEPNSINTWAELT+KFLAKYHTLT NA+L EDIVSFRQKENEAVQE  E FKELL R PSHG
Subjt:  -----------------------------DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHG

Query:  LPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK
        LP CV IEQFYR L+ SS+MM NT ANGSLLEKS+NEIVD+LNKMTDINDQGE+GRSLPKK
Subjt:  LPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]; e value 6.6e-44; identity 66.23%
Query:  LPQAAQFELKPVMFQM---DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQF
        LP  ++  L+  MF     DG  T +N LE N I TWAELT+KFLAKYHTLT NA+L EDIVSFRQ+E+EAVQE  E FKELL R  SHGLP CV I+QF
Subjt:  LPQAAQFELKPVMFQM---DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQF

Query:  YRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK
        YR L+   RMM +T AN SLLEKS+NEI+DILNKM DINDQ E+GRSLPKK
Subjt:  YRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007; e value 5.4e-44; identity 55.19%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNSINTWAE
        MNR  QDPPPPQNPPVNGD+AGEGAANRAGEIPN ILL DNR+VAMRNYVT A HNLNS INN LPQAAQ ELKPVMF M              + T  +
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDIN
                                F    NE      + F E+   F   G+ E  +  +    L+ SSRMM NT ANGSLLEKS+NEIVDILNKM DIN
Subjt:  LTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDIN

Query:  DQGEIGRSLPKK
        DQGE GRSL KK
Subjt:  DQGEIGRSLPKK

A0A6J1DSZ5 uncharacterized protein LOC111024107; e value 7.4e-33; identity 38.5%
Query:  IPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM------------------------------------DGVRTR---------
        +PNPI + D ++ AMR+Y    L +LNS + NPLP  AQFE KP+M QM                                    D +R           
Subjt:  IPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM------------------------------------DGVRTR---------

Query:  ----LNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLL
            LNA    +I TW+++ +KFL KY   T NA++ E+I+SFRQKENEAV    EHFK+L+   P+ G+P CV IE F+R  +  ++MM N  ANG   
Subjt:  ----LNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLL

Query:  EKSINEIVDILNKMTDINDQGEIGRS
         KS NEIV+IL+++++ NDQ    RS
Subjt:  EKSINEIVDILNKMTDINDQGEIGRS

A0A6J1DYY9 uncharacterized protein LOC111025557; e value 2.4e-44; identity 66.89%
Query:  LPQAAQFELKPVMFQM---DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQF
        LP  ++  L+  MF     DG  T LN LE N I TWAELT+KFLAKYHTLT NA+L EDIVSFRQ+E+EAVQE  E FKELL R  SHGLP CV I+QF
Subjt:  LPQAAQFELKPVMFQM---DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQF

Query:  YRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK
        YR L+   RMM +T AN SLLEKS+NEI+DILNKM DINDQ E+GRSLPKK
Subjt:  YRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK

A0A6J1DZ19 uncharacterized protein LOC111024824; e value 5.6e-57; identity 61.01%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNS-INTWA
        MN   QDPP P NPPV+GD AGEGAANRAGE+PNPILL DNR+VA+RNYVTHA HNLNS + +                 DG   R +  +P S + ++ 
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNS-INTWA

Query:  ELTEKFLAKYHT-----LTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILN
        E+   F     +     L MNA+L EDIVSFRQKENEAVQE  E FKELL R  SHGLP CV IEQFYR L+  SRMM NT AN SL EKSI+EI+DILN
Subjt:  ELTEKFLAKYHT-----LTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILN

Query:  KMTDINDQGEIGRSLPKK
        KMTD NDQGEIGRSLPKK
Subjt:  KMTDINDQGEIGRSLPKK

A0A6J1E251 uncharacterized protein LOC111025302; e value 1.0e-87; identity 68.2%
Query:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM--------------------
        MNR  QDPPPPQNPPVNGD+AGE AANR GEIPN ILL DNR+VAMRNYVTHA HNLNS INNPLPQAAQFELKPVMFQ+                    
Subjt:  MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQM--------------------

Query:  -----------------------------DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHG
                                     DG RT +NALEPNSINTWAELT+KFLAKYHTLT NA+L EDIVSFRQKENEAVQE  E FKELL R PSHG
Subjt:  -----------------------------DGVRTRLNALEPNSINTWAELTEKFLAKYHTLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHG

Query:  LPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK
        LP CV IEQFYR L+ SS+MM NT ANGSLLEKS+NEIVD+LNKMTDINDQGE+GRSLPKK
Subjt:  LPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATAGAAAACAACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATGTGGCAGGTGAAGGAGCAGCAAACCGAGCAGGAGAAATTCCCAATCCAATCCT
TCTAGTAGATAATCGAAATGTAGCCATGCGGAATTATGTCACTCATGCGCTCCACAACCTAAATTCAAGGATAAATAATCCTTTACCCCAAGCCGCACAGTTCGAGCTCA
AGCCAGTCATGTTCCAGATGGATGGTGTAAGGACTAGGCTAAATGCGTTAGAACCAAATTCTATCAACACATGGGCGGAATTGACAGAGAAATTTTTGGCAAAGTACCAC
ACTTTGACCATGAATGCAAACCTTCATGAAGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGTTTGTGAGCATTTTAAGGAGTTACTTATAAGATT
CCCGAGCCATGGATTGCCCGAATGTGTGCATATTGAACAATTCTATAGAGAATTGAATCCTTCATCAAGGATGATGTCAAACACTCCAGCCAATGGCTCGTTGTTAGAGA
AGTCGATAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGGGAAATTGGAAGGTCATTACCAAAGAAGTAA
mRNA sequence
ATGAATAGAAAACAACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATGTGGCAGGTGAAGGAGCAGCAAACCGAGCAGGAGAAATTCCCAATCCAATCCT
TCTAGTAGATAATCGAAATGTAGCCATGCGGAATTATGTCACTCATGCGCTCCACAACCTAAATTCAAGGATAAATAATCCTTTACCCCAAGCCGCACAGTTCGAGCTCA
AGCCAGTCATGTTCCAGATGGATGGTGTAAGGACTAGGCTAAATGCGTTAGAACCAAATTCTATCAACACATGGGCGGAATTGACAGAGAAATTTTTGGCAAAGTACCAC
ACTTTGACCATGAATGCAAACCTTCATGAAGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGTTTGTGAGCATTTTAAGGAGTTACTTATAAGATT
CCCGAGCCATGGATTGCCCGAATGTGTGCATATTGAACAATTCTATAGAGAATTGAATCCTTCATCAAGGATGATGTCAAACACTCCAGCCAATGGCTCGTTGTTAGAGA
AGTCGATAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGGGAAATTGGAAGGTCATTACCAAAGAAGTAA
Protein sequence
MNRKQQDPPPPQNPPVNGDVAGEGAANRAGEIPNPILLVDNRNVAMRNYVTHALHNLNSRINNPLPQAAQFELKPVMFQMDGVRTRLNALEPNSINTWAELTEKFLAKYH
TLTMNANLHEDIVSFRQKENEAVQEVCEHFKELLIRFPSHGLPECVHIEQFYRELNPSSRMMSNTPANGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKK
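
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the helper constants are illustrative, and it assumes an unambiguous A/C/G/T sequence whose length is a multiple of three. The example input is the first 42 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and whitespace so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Ambiguous bases such as N are not handled and raise KeyError.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 42 nucleotides of the CDS sequence of Moc04g15100.
print(translate("ATGAATAGAAAACAACAAGATCCTCCACCGCCACAAAATCCA"))  # MNRKQQDPPPPQNP
```

Passing the full CDS (with or without its line breaks) should give a string that can be compared directly against the protein sequence once its own line breaks are removed.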