CuGenDBv2

Moc03g14840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g14840
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr3:9983780..9984406
RNA-Seq Expression: Moc03g14840
Synteny: Moc03g14840
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia] (e value: 5.1e-34; % identity: 51.69)
Query:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGI--SNPLPQVAQLELKPVMFQMLQMMDALRLKMFPFSLRD
        MN NPQDP  P NPPV+GD   EG ANRAGEVPNPILL  NRDVA+RNYVTHAFHNLNS +    P+ +    +    +   +++ +A +L        D
Subjt:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGI--SNPLPQVAQLELKPVMFQMLQMMDALRLKMFPFSLRD

Query:  GVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
         +R                           L  NADLREDIVSFRQKEN+AVQE WE  KELLRRCLSHGL  CVQIE
Subjt:  GVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia] (e value: 5.8e-70; % identity: 68.75)
Query:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------
        MNRN QDP PPQNPPVNGDM  E  ANR GE+PN ILLA NRDVAMRNYVTHAFHNLNSGI+NPLPQ AQ ELKPVMFQ+LQ M                
Subjt:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------

Query:  ----------------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHG
                        DALRLKMFPFSLRDG RTW+N LEPN I TW EL     AKYHTLT+NADLREDIVSFRQKEN+AVQEAWE  KELLRRC SHG
Subjt:  ----------------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHG

Query:  LAACVQIE
        L +CVQIE
Subjt:  LAACVQIE

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia] (e value: 8.2e-32; % identity: 76.09)
Query:  DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
        DALRLKMFPFSLRDG  TW+NVLE N ITTW EL     AKYHTLTRNADL+EDIVSFRQ+E++AVQEAWE  KELL+RC SHGL  CVQI+
Subjt:  DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

XP_030490806.1 uncharacterized protein LOC115707099 [Cannabis sativa] (e value: 1.3e-32; % identity: 47.4)
Query:  RPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM-----DALRLKMFPFSLRDGVRTW
        R  Q      +MVD      A    NPI LA +R  A+R Y    F+ LN GI  P  Q    ELKPVMFQMLQ +     +ALRLK+FPFSLRD  R W
Subjt:  RPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM-----DALRLKMFPFSLRDGVRTW

Query:  LNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
        LN L P+ +T W EL +    KY   TRNA  R +I+SF+Q E++   +AWE  KELLR+CL HG+  C+Q+E
Subjt:  LNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

XP_030502440.1 uncharacterized protein LOC115717596 [Cannabis sativa] (e value: 8.2e-32; % identity: 47.53)
Query:  MVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM-----DALRLKMFPFSLRDGVRTWLNVLEPNCITT
        M D      A    NPI LA +R  A+R Y T  F+ LN GI  P  Q    ELKPV+FQMLQ +     +AL+LK+FPFSLRD  R WLN L PN +T 
Subjt:  MVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM-----DALRLKMFPFSLRDGVRTWLNVLEPNCITT

Query:  WVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
        W +L +    KY   TRNA  R +I+SF+Q E++   +AWE  KELLR+C  HG+  C+Q+E
Subjt:  WVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DSZ5 uncharacterized protein LOC111024107 (e value: 4.4e-31; % identity: 44.07)
Query:  VPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM--------------------------------DALRLKMFPFSLRDG
        VPNPI +A  +D AMR+Y      +LNS + NPLP  AQ E KP+M QML ++                                DALRL +FPFSL   
Subjt:  VPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM--------------------------------DALRLKMFPFSLRDG

Query:  VRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
           WLN      ITTW +++     KY   TRNAD+RE+I+SFRQKEN+AV  AWEH K+L+R C + G+ ACVQIE
Subjt:  VRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

A0A6J1DW02 uncharacterized protein LOC111024897 (e value: 1.5e-31; % identity: 43.07)
Query:  PRPPQNPP-VNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------------
        PR P +PP VNG+M D    +   +  N I +A NRDVAMR Y   AF N +SGI NP+P     ELKP+MFQMLQ +                      
Subjt:  PRPPQNPP-VNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------------

Query:  ----------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQ
                  DA  L +FPFSL+D  R  LN      ITTW  L++    K+   TR+AD+RE+I+SFRQ + + V EAWE  KEL+R+C +HGL AC Q
Subjt:  ----------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQ

Query:  IE
        IE
Subjt:  IE

A0A6J1DYY9 uncharacterized protein LOC111025557 (e value: 3.0e-32; % identity: 77.17)
Query:  DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
        DALRLKMFPFSLRDG  TWLNVLE N ITTW EL     AKYHTLTRNADL+EDIVSFRQ+E++AVQEAWE  KELL+RC SHGL  CVQI+
Subjt:  DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

A0A6J1DZ19 uncharacterized protein LOC111024824 (e value: 2.5e-34; % identity: 51.69)
Query:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGI--SNPLPQVAQLELKPVMFQMLQMMDALRLKMFPFSLRD
        MN NPQDP  P NPPV+GD   EG ANRAGEVPNPILL  NRDVA+RNYVTHAFHNLNS +    P+ +    +    +   +++ +A +L        D
Subjt:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGI--SNPLPQVAQLELKPVMFQMLQMMDALRLKMFPFSLRD

Query:  GVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE
         +R                           L  NADLREDIVSFRQKEN+AVQE WE  KELLRRCLSHGL  CVQIE
Subjt:  GVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE

A0A6J1E251 uncharacterized protein LOC111025302 (e value: 2.8e-70; % identity: 68.75)
Query:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------
        MNRN QDP PPQNPPVNGDM  E  ANR GE+PN ILLA NRDVAMRNYVTHAFHNLNSGI+NPLPQ AQ ELKPVMFQ+LQ M                
Subjt:  MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMM----------------

Query:  ----------------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHG
                        DALRLKMFPFSLRDG RTW+N LEPN I TW EL     AKYHTLT+NADLREDIVSFRQKEN+AVQEAWE  KELLRRC SHG
Subjt:  ----------------DALRLKMFPFSLRDGVRTWLNVLEPNCITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHG

Query:  LAACVQIE
        L +CVQIE
Subjt:  LAACVQIE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACAGAAATCCACAAGATCCTCGACCTCCACAAAATCCACCTGTGAACGGAGATATGGTGGATGAAGGAGTCGCAAACCGAGCCGGAGAAGTGCCTAATCCGATCCT
TTTAGCTCGCAACCGAGATGTAGCCATGCGAAACTATGTCACTCATGCGTTCCATAACTTAAATTCAGGGATAAGTAATCCTTTACCTCAAGTCGCACAGCTCGAGCTTA
AGCCAGTCATGTTCCAGATGTTGCAGATGATGGATGCACTAAGACTAAAAATGTTTCCTTTTTCTCTCAGAGACGGTGTAAGGACCTGGCTAAACGTGCTAGAACCAAAT
TGTATCACCACGTGGGTGGAACTAATGAAGAATGTTTTTGCAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAAGACATTGTGTCTTTTCGACAAAAGGAGAA
CAAAGCAGTTCAAGAAGCTTGGGAACATGTCAAGGAGTTACTCAGAAGATGCCTGAGCCATGGATTGGCTGCATGTGTGCAGATTGAGTAG
mRNA sequence
ATGAACAGAAATCCACAAGATCCTCGACCTCCACAAAATCCACCTGTGAACGGAGATATGGTGGATGAAGGAGTCGCAAACCGAGCCGGAGAAGTGCCTAATCCGATCCT
TTTAGCTCGCAACCGAGATGTAGCCATGCGAAACTATGTCACTCATGCGTTCCATAACTTAAATTCAGGGATAAGTAATCCTTTACCTCAAGTCGCACAGCTCGAGCTTA
AGCCAGTCATGTTCCAGATGTTGCAGATGATGGATGCACTAAGACTAAAAATGTTTCCTTTTTCTCTCAGAGACGGTGTAAGGACCTGGCTAAACGTGCTAGAACCAAAT
TGTATCACCACGTGGGTGGAACTAATGAAGAATGTTTTTGCAAAGTACCACACTTTGACCAGGAACGCAGACCTTCGAGAAGACATTGTGTCTTTTCGACAAAAGGAGAA
CAAAGCAGTTCAAGAAGCTTGGGAACATGTCAAGGAGTTACTCAGAAGATGCCTGAGCCATGGATTGGCTGCATGTGTGCAGATTGAGTAG
Protein sequence
MNRNPQDPRPPQNPPVNGDMVDEGVANRAGEVPNPILLARNRDVAMRNYVTHAFHNLNSGISNPLPQVAQLELKPVMFQMLQMMDALRLKMFPFSLRDGVRTWLNVLEPN
CITTWVELMKNVFAKYHTLTRNADLREDIVSFRQKENKAVQEAWEHVKELLRRCLSHGLAACVQIE