CuGenDBv2

Moc07g03640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g03640
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:3176777..3177547
RNA-Seq Expression: Moc07g03640
Synteny: Moc07g03640
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia]; e value 8.8e-59; identity 49.61%
Query:  IPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQ-TGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDG
        +PNPI +AD +D AMR+Y   +  +LNS + NPLP   QF+ K +M QML    QFGGL +EDP SH KSFI+ AN  +L G+  DALRL +FPFSL   
Subjt:  IPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQ-TGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWA--------KYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLL
        A  WLNA    +I TW+        KY   +RN ++RE+I+SFRQKENE V  AWEHFK L+R C + G+PACVQIE F+RG D  ++MMLN   NG   
Subjt:  ARTWLNALEPNSINTWA--------KYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLL

Query:  EKSVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATM
         KS NEIV+I +++ + ND+   E  R+  K+   AGV  LD + SMQ Q+ T+
Subjt:  EKSVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATM

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]; e value 2.1e-73; identity 61.42%
Query:  GKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLK
        G+GA +RAGE+PNPILL DNRDVA+RNY TH FHNLNS                    ++  G      NEDPYSH KSFIE ANAFQL GV +DALRLK
Subjt:  GKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLK

Query:  MFPFSLRDGARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSL
        M                                N +LREDIVSFRQKENE VQE WE FK+LLRRC SHGLP CVQIEQFYRGLDR SRMMLNT  N SL
Subjt:  MFPFSLRDGARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSL

Query:  LEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
         EKS++EI+DI NKM D ND+GE+GRSLPKKQVSA VFELDTVASMQAQMAT+N
Subjt:  LEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]; e value 2.2e-62; identity 51.78%
Query:  NPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGAR
        N I +ADNRDVAMR Y    F N +S I NP+P    F+LK +MFQMLQT G FGG  +EDP+ H KSFI+ ANAF+L G+  DA  L +FPFSL+D AR
Subjt:  NPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGAR

Query:  TWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEK
          LNA    SI TW         K+   +R+ ++RE+I+SFRQ + E V EAWE FK+L+R+C +HGLPAC QIE F+RGLD  ++MMLN   NG+  +K
Subjt:  TWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEK

Query:  SVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        + NEIVDI N +   N+    +  R+ PKKQ  AGV  LD   SMQ +M TMN
Subjt:  SVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]; e value 1.3e-115; identity 81.13%
Query:  MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDAL
        M G+ A +R GEIPN ILLADNRDVAMRNY TH FHNLNS INNPLPQA QF+LK VMFQ+LQT GQFGGLTNEDPYSH KSFIE ANAFQL G  +DAL
Subjt:  MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDAL

Query:  RLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSR
        RLKMFPFSLRDGARTW+NALEPNSINTW        AKYHTL++N +LREDIVSFRQKENE VQEAWE FK+LLRRC SHGLP+CVQIEQFYRGLDRSS+
Subjt:  RLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSR

Query:  MMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        MMLNT+ NGSLLEKSVNEIVD+ NKM DIND+GE+GRSLPKKQVS G+FELDTVASMQAQMA MN
Subjt:  MMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]; e value 1.1e-74; identity 72.36%
Query:  QFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEA
        QFGG TNEDPYSH KSFI+ ANAFQL GV +DALRLKMFPFSLRDGA TW+N LE N I TW        AKYHTL+RN +L+EDIVSFRQ+E+E VQEA
Subjt:  QFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEA

Query:  WEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        WE FK+LL+RC SHGLP CVQI+QFYRGLD   RMM +T  N SLLEKSVNEI+DI NKMIDIND+ E+GRSLPKKQ SAG+FELDTV S+QAQ++ M+
Subjt:  WEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DSZ5 uncharacterized protein LOC111024107; e value 4.2e-59; identity 49.61%
Query:  IPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQ-TGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDG
        +PNPI +AD +D AMR+Y   +  +LNS + NPLP   QF+ K +M QML    QFGGL +EDP SH KSFI+ AN  +L G+  DALRL +FPFSL   
Subjt:  IPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQ-TGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWA--------KYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLL
        A  WLNA    +I TW+        KY   +RN ++RE+I+SFRQKENE V  AWEHFK L+R C + G+PACVQIE F+RG D  ++MMLN   NG   
Subjt:  ARTWLNALEPNSINTWA--------KYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLL

Query:  EKSVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATM
         KS NEIV+I +++ + ND+   E  R+  K+   AGV  LD + SMQ Q+ T+
Subjt:  EKSVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATM

A0A6J1DW02 uncharacterized protein LOC111024897; e value 1.1e-62; identity 51.78%
Query:  NPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGAR
        N I +ADNRDVAMR Y    F N +S I NP+P    F+LK +MFQMLQT G FGG  +EDP+ H KSFI+ ANAF+L G+  DA  L +FPFSL+D AR
Subjt:  NPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGAR

Query:  TWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEK
          LNA    SI TW         K+   +R+ ++RE+I+SFRQ + E V EAWE FK+L+R+C +HGLPAC QIE F+RGLD  ++MMLN   NG+  +K
Subjt:  TWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEK

Query:  SVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        + NEIVDI N +   N+    +  R+ PKKQ  AGV  LD   SMQ +M TMN
Subjt:  SVNEIVDISNKMIDINDR--GEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

A0A6J1DYY9 uncharacterized protein LOC111025557; e value 4.2e-75; identity 72.86%
Query:  QFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEA
        QFGG TNEDPYSH KSFI+ ANAFQL GV +DALRLKMFPFSLRDGA TWLN LE N I TW        AKYHTL+RN +L+EDIVSFRQ+E+E VQEA
Subjt:  QFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEA

Query:  WEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        WE FK+LL+RC SHGLP CVQI+QFYRGLD   RMM +T  N SLLEKSVNEI+DI NKMIDIND+ E+GRSLPKKQ SAG+FELDTV S+QAQ++ M+
Subjt:  WEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

A0A6J1DZ19 uncharacterized protein LOC111024824; e value 1.0e-73; identity 61.42%
Query:  GKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLK
        G+GA +RAGE+PNPILL DNRDVA+RNY TH FHNLNS                    ++  G      NEDPYSH KSFIE ANAFQL GV +DALRLK
Subjt:  GKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLK

Query:  MFPFSLRDGARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSL
        M                                N +LREDIVSFRQKENE VQE WE FK+LLRRC SHGLP CVQIEQFYRGLDR SRMMLNT  N SL
Subjt:  MFPFSLRDGARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSL

Query:  LEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
         EKS++EI+DI NKM D ND+GE+GRSLPKKQVSA VFELDTVASMQAQMAT+N
Subjt:  LEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

A0A6J1E251 uncharacterized protein LOC111025302; e value 6.4e-116; identity 81.13%
Query:  MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDAL
        M G+ A +R GEIPN ILLADNRDVAMRNY TH FHNLNS INNPLPQA QF+LK VMFQ+LQT GQFGGLTNEDPYSH KSFIE ANAFQL G  +DAL
Subjt:  MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQT-GQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDAL

Query:  RLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSR
        RLKMFPFSLRDGARTW+NALEPNSINTW        AKYHTL++N +LREDIVSFRQKENE VQEAWE FK+LLRRC SHGLP+CVQIEQFYRGLDRSS+
Subjt:  RLKMFPFSLRDGARTWLNALEPNSINTW--------AKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSR

Query:  MMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
        MMLNT+ NGSLLEKSVNEIVD+ NKM DIND+GE+GRSLPKKQVS G+FELDTVASMQAQMA MN
Subjt:  MMLNTVDNGSLLEKSVNEIVDISNKMIDINDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTAGGTAAAGGAGCAACAGACCGAGCAGGAGAAATTCCTAATCCGATCCTTCTAGCAGATAACCGAGATGTAGCCATGCGGAACTATGAAACTCATGTGTTCCACAA
CCTAAATTCAAGGATAAATAATCCTTTACCCCAAGCCACACAGTTCAAGCTTAAGACAGTCATGTTCCAGATGTTGCAGACTGGCCAGTTCGGAGGATTGACTAACGAAG
ATCCTTACTCCCATTTCAAATCTTTTATTGAAACTGCTAATGCATTTCAACTTCTTGGTGTCTTTAAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGAT
GGTGCAAGGACTTGGCTAAATGCATTAGAACCAAATTCTATCAACACTTGGGCGAAGTACCATACTTTGTCTAGGAACACAAACCTTCGAGAAGACATTGTGTCTTTTCG
ACAAAAGGAGAATGAAGTAGTTCAAGAAGCTTGGGAGCATTTTAAGAAGTTACTGAGAAGATGCTCGAGCCATGGATTGCCTGCATGTGTACAGATTGAACAATTCTATA
GAGGATTGGATCGTTCATCAAGGATGATGTTGAACACCGTAGACAATGGCTCATTGTTAGAAAAGTCGGTAAATGAGATCGTTGACATTTCGAATAAGATGATAGACATT
AATGACCGAGGTGAAGTAGGAAGGTCGCTGCCAAAGAAGCAAGTATCAGCCGGAGTCTTTGAGTTGGACACAGTAGCTTCAATGCAAGCCCAAATGGCGACTATGAACTA
G
mRNA sequence
ATGGTAGGTAAAGGAGCAACAGACCGAGCAGGAGAAATTCCTAATCCGATCCTTCTAGCAGATAACCGAGATGTAGCCATGCGGAACTATGAAACTCATGTGTTCCACAA
CCTAAATTCAAGGATAAATAATCCTTTACCCCAAGCCACACAGTTCAAGCTTAAGACAGTCATGTTCCAGATGTTGCAGACTGGCCAGTTCGGAGGATTGACTAACGAAG
ATCCTTACTCCCATTTCAAATCTTTTATTGAAACTGCTAATGCATTTCAACTTCTTGGTGTCTTTAAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGAT
GGTGCAAGGACTTGGCTAAATGCATTAGAACCAAATTCTATCAACACTTGGGCGAAGTACCATACTTTGTCTAGGAACACAAACCTTCGAGAAGACATTGTGTCTTTTCG
ACAAAAGGAGAATGAAGTAGTTCAAGAAGCTTGGGAGCATTTTAAGAAGTTACTGAGAAGATGCTCGAGCCATGGATTGCCTGCATGTGTACAGATTGAACAATTCTATA
GAGGATTGGATCGTTCATCAAGGATGATGTTGAACACCGTAGACAATGGCTCATTGTTAGAAAAGTCGGTAAATGAGATCGTTGACATTTCGAATAAGATGATAGACATT
AATGACCGAGGTGAAGTAGGAAGGTCGCTGCCAAAGAAGCAAGTATCAGCCGGAGTCTTTGAGTTGGACACAGTAGCTTCAATGCAAGCCCAAATGGCGACTATGAACTA
G
Protein sequence
MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRD
GARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDI
NDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN
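
The genome location and the protein sequence in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: the helper names span_length and expected_cds_length are hypothetical, and the check assumes an inclusive start..end coordinate convention and a CDS made of one codon per residue plus a single stop codon.

```python
import re


def span_length(location: str) -> int:
    """Return the inclusive length in bp of a 'chrom:start..end' string."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return abs(end - start) + 1


def expected_cds_length(protein: str) -> int:
    """Return the CDS length implied by a protein: 3 bp per residue plus a stop codon."""
    return 3 * (len(protein) + 1)


# Protein sequence copied from the record above.
PROTEIN = (
    "MVGKGATDRAGEIPNPILLADNRDVAMRNYETHVFHNLNSRINNPLPQATQFKLKTVMFQMLQTGQFGGLTNEDPYSHFKSFIETANAFQLLGVFKDALRLKMFPFSLRD"
    "GARTWLNALEPNSINTWAKYHTLSRNTNLREDIVSFRQKENEVVQEAWEHFKKLLRRCSSHGLPACVQIEQFYRGLDRSSRMMLNTVDNGSLLEKSVNEIVDISNKMIDI"
    "NDRGEVGRSLPKKQVSAGVFELDTVASMQAQMATMN"
)

locus_bp = span_length("chr7:3176777..3177547")
cds_bp = expected_cds_length(PROTEIN)
print(locus_bp, cds_bp, locus_bp == cds_bp)
```

If the two lengths agree, the locus is exactly as long as the coding sequence, which would be consistent with the CDS and mRNA sequences listed above being identical (no introns or annotated UTRs in this gene model).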