CuGenDBv2

Moc01g04100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g04100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr1:2673134..2676814
RNA-Seq Expression: Moc01g04100
Synteny: Moc01g04100
Gene Ontology terms: NA
InterPro domains: NA
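The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr1:2673134..2676814")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```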


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia] (e value: 5.1e-50; % identity: 53.36)
Query:  VPIATDPEVVVPPLNVVLLADDIDREIKAYAAPTFYKFNPVITEPEIAVPKFELK------------PLMFQMLQTVGQ--FHEHPTED-----------
        VP+AT+ EV+VP LNVVLLA  IDREI+AYAAPTFY FNPVITE EI  PKFELK             L    L+   +      P+E            
Subjt:  VPIATDPEVVVPPLNVVLLADDIDREIKAYAAPTFYKFNPVITEPEIAVPKFELK------------PLMFQMLQTVGQ--FHEHPTED-----------

Query:  ------PHSHLKFFMGLCN--SFKDEGCNKEVLRLK-------------CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSD
              P  + K+   + N   F  E   +     K             CI IE YYNGLDDAT LV   S NE LLAKPY EAFNILE+ISSN HS SD
Subjt:  ------PHSHLKFFMGLCN--SFKDEGCNKEVLRLK-------------CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSD

Query:  PRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSH
         RAIQGRG+KRLNES+SYST NSKI NV DLV RSMTQQSTVGA  GKAN SH
Subjt:  PRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSH

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia] (e value: 1.3e-37; % identity: 80.19)
Query:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK
        +CIQIE YY GLDDAT LVIDAS N  LL KPY EAFNILE+ISSNNHSWSDPRAIQGRG K LNESESY  LNSK+ N+T+LVMRSMTQQ+TVGA  GK
Subjt:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK

Query:  ANVSHI
        ANVSHI
Subjt:  ANVSHI

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia] (e value: 5.9e-38; % identity: 52.49)
Query:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLKCI------QIETYYNGLDDATCLVIDASA-----------------NEVLLAKP
        MFQMLQTVG+FH H TEDPH HLKF MG+CNSFKDEG +K+V+RLK        +  T+   L   +    D  A                 NE+   + 
Subjt:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLKCI------QIETYYNGLDDATCLVIDASA-----------------NEVLLAKP

Query:  YD-------EAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSHI
        +D       EAFNILE+ISSNNHSW DP+A+QG+ SK L ESESY+TLNSKI N+TDLVMRS+TQQS  GA  G  NV+ I
Subjt:  YD-------EAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSHI

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia] (e value: 9.4e-36; % identity: 49.48)
Query:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLK------------------------------------------------------
        MFQM+  VGQFH H TE PH HLKFFMG+ NSFKDEG +K VLRLK                                                      
Subjt:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLK------------------------------------------------------

Query:  CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQST
         IQIETYY GLD+AT LVIDAS N  LL KPY +A NILE+ISS+NHSWSD RAI+G+ SK L ESESY+TLNSKI  +TDL  R+ +  +T
Subjt:  CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQST

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia] (e value: 5.9e-38; % identity: 81.13)
Query:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK
        +CIQI+TYYNGLDDAT LVIDASAN  LLAKPY EAFNILE+ISSNN SWSDPRAI G+GSK  NESES++ LN KI N+TDLVMRSMT QSTVGA AGK
Subjt:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK

Query:  ANVSHI
        ANVSHI
Subjt:  ANVSHI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1DAK9 uncharacterized protein LOC111018910 (e value: 2.5e-50; % identity: 53.36)
Query:  VPIATDPEVVVPPLNVVLLADDIDREIKAYAAPTFYKFNPVITEPEIAVPKFELK------------PLMFQMLQTVGQ--FHEHPTED-----------
        VP+AT+ EV+VP LNVVLLA  IDREI+AYAAPTFY FNPVITE EI  PKFELK             L    L+   +      P+E            
Subjt:  VPIATDPEVVVPPLNVVLLADDIDREIKAYAAPTFYKFNPVITEPEIAVPKFELK------------PLMFQMLQTVGQ--FHEHPTED-----------

Query:  ------PHSHLKFFMGLCN--SFKDEGCNKEVLRLK-------------CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSD
              P  + K+   + N   F  E   +     K             CI IE YYNGLDDAT LV   S NE LLAKPY EAFNILE+ISSN HS SD
Subjt:  ------PHSHLKFFMGLCN--SFKDEGCNKEVLRLK-------------CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSD

Query:  PRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSH
         RAIQGRG+KRLNES+SYST NSKI NV DLV RSMTQQSTVGA  GKAN SH
Subjt:  PRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSH

A0A6J1DRG1 uncharacterized protein LOC111023669 (e value: 6.3e-38; % identity: 80.19)
Query:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK
        +CIQIE YY GLDDAT LVIDAS N  LL KPY EAFNILE+ISSNNHSWSDPRAIQGRG K LNESESY  LNSK+ N+T+LVMRSMTQQ+TVGA  GK
Subjt:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK

Query:  ANVSHI
        ANVSHI
Subjt:  ANVSHI

A0A6J1DTD1 uncharacterized protein LOC111024136 (e value: 2.8e-38; % identity: 52.49)
Query:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLKCI------QIETYYNGLDDATCLVIDASA-----------------NEVLLAKP
        MFQMLQTVG+FH H TEDPH HLKF MG+CNSFKDEG +K+V+RLK        +  T+   L   +    D  A                 NE+   + 
Subjt:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLKCI------QIETYYNGLDDATCLVIDASA-----------------NEVLLAKP

Query:  YD-------EAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSHI
        +D       EAFNILE+ISSNNHSW DP+A+QG+ SK L ESESY+TLNSKI N+TDLVMRS+TQQS  GA  G  NV+ I
Subjt:  YD-------EAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSHI

A0A6J1DWK1 uncharacterized protein LOC111025053 (e value: 4.5e-36; % identity: 49.48)
Query:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLK------------------------------------------------------
        MFQM+  VGQFH H TE PH HLKFFMG+ NSFKDEG +K VLRLK                                                      
Subjt:  MFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLK------------------------------------------------------

Query:  CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQST
         IQIETYY GLD+AT LVIDAS N  LL KPY +A NILE+ISS+NHSWSD RAI+G+ SK L ESESY+TLNSKI  +TDL  R+ +  +T
Subjt:  CIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQST

A0A6J1DXK5 uncharacterized protein LOC111025500 (e value: 2.8e-38; % identity: 81.13)
Query:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK
        +CIQI+TYYNGLDDAT LVIDASAN  LLAKPY EAFNILE+ISSNN SWSDPRAI G+GSK  NESES++ LN KI N+TDLVMRSMT QSTVGA AGK
Subjt:  KCIQIETYYNGLDDATCLVIDASANEVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGK

Query:  ANVSHI
        ANVSHI
Subjt:  ANVSHI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCGCTGTCACTAACGGAGCTGTCGCAGCTGCTGCACGCCGCTGTCACCAACGGGCAATTTCCTGTTGTCACCGCCGCACGCCGCTGTCACGAACAGAAACGCCACCC
GCTGCACGCTGCTGTCCCGAACAGACTGCCGCACCTGCTGTTGTGGGTTGGGGTCGCCGCTGCACGCCGCTGTCACCAACGGAGCGCTGCCCTGCTGCACGCTACTGCCC
CGAACAGAACCGTCGCACCTGCTGCCATGGGCTGGGTCGCCGTGTCGTCGCTAAATATGGTTCCTATTGCTACTGACCCTGAGGTAGTAGTGCCCCCTCTCAATGTTGTA
TTACTAGCAGATGACATCGACAGAGAGATCAAGGCATATGCAGCTCCGACATTTTATAAATTCAACCCAGTAATCACGGAGCCTGAAATTGCAGTCCCAAAGTTTGAACT
CAAGCCGTTAATGTTTCAGATGCTCCAGACAGTGGGCCAGTTTCACGAACATCCTACAGAGGACCCACATTCGCATCTGAAGTTTTTTATGGGACTATGCAATTCGTTTA
AGGATGAAGGATGCAACAAAGAAGTGTTGCGGCTTAAATGCATCCAGATCGAAACGTATTACAATGGTTTGGATGATGCTACATGCTTAGTAATTGATGCGTCAGCAAAT
GAGGTTTTGCTAGCGAAACCTTATGATGAAGCATTCAACATCTTGGAAAAGATATCATCCAACAATCATTCATGGTCTGACCCTAGAGCTATTCAAGGTAGAGGAAGCAA
GAGACTTAACGAATCTGAGTCATACTCTACTCTAAACTCGAAGATTGGGAACGTGACAGACTTAGTGATGAGAAGTATGACACAACAAAGTACAGTGGGAGCATTTGCTG
GCAAAGCAAATGTTAGCCACATCTAA
mRNA sequence
ATGTCGCTGTCACTAACGGAGCTGTCGCAGCTGCTGCACGCCGCTGTCACCAACGGGCAATTTCCTGTTGTCACCGCCGCACGCCGCTGTCACGAACAGAAACGCCACCC
GCTGCACGCTGCTGTCCCGAACAGACTGCCGCACCTGCTGTTGTGGGTTGGGGTCGCCGCTGCACGCCGCTGTCACCAACGGAGCGCTGCCCTGCTGCACGCTACTGCCC
CGAACAGAACCGTCGCACCTGCTGCCATGGGCTGGGTCGCCGTGTCGTCGCTAAATATGGTTCCTATTGCTACTGACCCTGAGGTAGTAGTGCCCCCTCTCAATGTTGTA
TTACTAGCAGATGACATCGACAGAGAGATCAAGGCATATGCAGCTCCGACATTTTATAAATTCAACCCAGTAATCACGGAGCCTGAAATTGCAGTCCCAAAGTTTGAACT
CAAGCCGTTAATGTTTCAGATGCTCCAGACAGTGGGCCAGTTTCACGAACATCCTACAGAGGACCCACATTCGCATCTGAAGTTTTTTATGGGACTATGCAATTCGTTTA
AGGATGAAGGATGCAACAAAGAAGTGTTGCGGCTTAAATGCATCCAGATCGAAACGTATTACAATGGTTTGGATGATGCTACATGCTTAGTAATTGATGCGTCAGCAAAT
GAGGTTTTGCTAGCGAAACCTTATGATGAAGCATTCAACATCTTGGAAAAGATATCATCCAACAATCATTCATGGTCTGACCCTAGAGCTATTCAAGGTAGAGGAAGCAA
GAGACTTAACGAATCTGAGTCATACTCTACTCTAAACTCGAAGATTGGGAACGTGACAGACTTAGTGATGAGAAGTATGACACAACAAAGTACAGTGGGAGCATTTGCTG
GCAAAGCAAATGTTAGCCACATCTAA
Protein sequence
MSLSLTELSQLLHAAVTNGQFPVVTAARRCHEQKRHPLHAAVPNRLPHLLLWVGVAAARRCHQRSAALLHATAPNRTVAPAAMGWVAVSSLNMVPIATDPEVVVPPLNVV
LLADDIDREIKAYAAPTFYKFNPVITEPEIAVPKFELKPLMFQMLQTVGQFHEHPTEDPHSHLKFFMGLCNSFKDEGCNKEVLRLKCIQIETYYNGLDDATCLVIDASAN
EVLLAKPYDEAFNILEKISSNNHSWSDPRAIQGRGSKRLNESESYSTLNSKIGNVTDLVMRSMTQQSTVGAFAGKANVSHI
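
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical and is not an interface provided by the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence for Moc01g04100.
print(translate("ATGTCGCTGTCACTAACGGAGCTGTCGCAG"))  # MSLSLTELSQ
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence is one way to confirm that the two fields of this record agree.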