; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g16010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g16010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotransposon gag protein
Genome locationchr4:11950418..11952483
RNA-Seq ExpressionMoc04g16010
SyntenyMoc04g16010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]3.8e-2949.18Show/hide
Query:  IETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA--------
        IE YYNGLDDATRLV   S N ALLAKPYAEAFNILERISSN HS  D RAIQGRG+K +NES+S+++ NSKIEN+ DLV RS+TQQ+T GA        
Subjt:  IETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA--------

Query:  --------------------------FVCRNSYTPIRTIQAGETIP---TYAGAKTREEIMLVHLMLQHISKKEVILQASQTK
                                  +   N+Y       +    P    +   + REEIMLVH MLQHIS+K VI Q  + K
Subjt:  --------------------------FVCRNSYTPIRTIQAGETIP---TYAGAKTREEIMLVHLMLQHISKKEVILQASQTK

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]2.6e-3381.72Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA
        +IE YY GLDDATRLVIDAS NGALL KPYAEAFNILERISSNNHSW D RAIQGRG KG+NESES+ +LNSK+ENL++LVMRS+TQQNT GA
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]9.8e-3347.09Show/hide
Query:  AEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCRNSYTPIRTI-----------------------Q
        AEAFNILERISSNNHSWFD +A+QG+ SK + ESES+T+LNSKIENL+DLVMRS+TQQ+ +GA V   +   I+ I                       Q
Subjt:  AEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCRNSYTPIRTI-----------------------Q

Query:  AGETIPTYAGAKTREE-------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQ
         G    T      +++         L  LM Q+++  +  +   + + + LRNLE+QVG LATDL SRP  A PS+TEV KRDG EQC ALTL +G    
Subjt:  AGETIPTYAGAKTREE-------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQ

Query:  IKALPP
         KALPP
Subjt:  IKALPP

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]4.0e-4251.09Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNT----------
        +IETYY GLD+ATRLVIDAS NGALL KPYA+A NILERISS+NHSW DHRAI+G+ SK + ESES+T+LNSKIE L+DL  R+ +  NT          
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNT----------

Query:  ---SGAFVCRNS-YTPIRTIQAGETIP---TYAGAKTREE------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNT
           SG     N+  +   T Q   + P    Y G              L ++M Q+++  +  +   Q++AA LRNLE+QVG LA DLKSRP  A PS+T
Subjt:  ---SGAFVCRNS-YTPIRTIQAGETIP---TYAGAKTREE------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNT

Query:  EVTKRDGNEQCNALTLWNGKTLQIKALPP
        EV KRD  EQCNALTL +G     KALPP
Subjt:  EVTKRDGNEQCNALTLWNGKTLQIKALPP

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]2.7e-5152.57Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCR---
        +I+TYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNN SW D RAI G+GSKG NESESFT+LN KIENL+DLVMRS+T Q+T GA   +   
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCR---

Query:  -------------------------------------NSYTPIRTIQAGETIPTYAGAKTREEIMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGY
                                             N+   IRT  AG TI T    + R+E      ML+++   +  +   Q++A  LRNLEMQVG 
Subjt:  -------------------------------------NSYTPIRTIQAGETIPTYAGAKTREEIMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGY

Query:  LATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQIKALPPKETPNCQ
        LATDLKS+P    PS+ +V KRDG EQCNALTL +GKTL      P   PN Q
Subjt:  LATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQIKALPPKETPNCQ

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189101.9e-2949.18Show/hide
Query:  IETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA--------
        IE YYNGLDDATRLV   S N ALLAKPYAEAFNILERISSN HS  D RAIQGRG+K +NES+S+++ NSKIEN+ DLV RS+TQQ+T GA        
Subjt:  IETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA--------

Query:  --------------------------FVCRNSYTPIRTIQAGETIP---TYAGAKTREEIMLVHLMLQHISKKEVILQASQTK
                                  +   N+Y       +    P    +   + REEIMLVH MLQHIS+K VI Q  + K
Subjt:  --------------------------FVCRNSYTPIRTIQAGETIP---TYAGAKTREEIMLVHLMLQHISKKEVILQASQTK

A0A6J1DRG1 uncharacterized protein LOC1110236691.2e-3381.72Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA
        +IE YY GLDDATRLVIDAS NGALL KPYAEAFNILERISSNNHSW D RAIQGRG KG+NESES+ +LNSK+ENL++LVMRS+TQQNT GA
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGA

A0A6J1DTD1 uncharacterized protein LOC1110241364.7e-3347.09Show/hide
Query:  AEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCRNSYTPIRTI-----------------------Q
        AEAFNILERISSNNHSWFD +A+QG+ SK + ESES+T+LNSKIENL+DLVMRS+TQQ+ +GA V   +   I+ I                       Q
Subjt:  AEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCRNSYTPIRTI-----------------------Q

Query:  AGETIPTYAGAKTREE-------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQ
         G    T      +++         L  LM Q+++  +  +   + + + LRNLE+QVG LATDL SRP  A PS+TEV KRDG EQC ALTL +G    
Subjt:  AGETIPTYAGAKTREE-------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQ

Query:  IKALPP
         KALPP
Subjt:  IKALPP

A0A6J1DWK1 uncharacterized protein LOC1110250531.9e-4251.09Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNT----------
        +IETYY GLD+ATRLVIDAS NGALL KPYA+A NILERISS+NHSW DHRAI+G+ SK + ESES+T+LNSKIE L+DL  R+ +  NT          
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNT----------

Query:  ---SGAFVCRNS-YTPIRTIQAGETIP---TYAGAKTREE------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNT
           SG     N+  +   T Q   + P    Y G              L ++M Q+++  +  +   Q++AA LRNLE+QVG LA DLKSRP  A PS+T
Subjt:  ---SGAFVCRNS-YTPIRTIQAGETIP---TYAGAKTREE------IMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGYLATDLKSRPYRARPSNT

Query:  EVTKRDGNEQCNALTLWNGKTLQIKALPP
        EV KRD  EQCNALTL +G     KALPP
Subjt:  EVTKRDGNEQCNALTLWNGKTLQIKALPP

A0A6J1DXK5 uncharacterized protein LOC1110255001.3e-5152.57Show/hide
Query:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCR---
        +I+TYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNN SW D RAI G+GSKG NESESFT+LN KIENL+DLVMRS+T Q+T GA   +   
Subjt:  EIETYYNGLDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCR---

Query:  -------------------------------------NSYTPIRTIQAGETIPTYAGAKTREEIMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGY
                                             N+   IRT  AG TI T    + R+E      ML+++   +  +   Q++A  LRNLEMQVG 
Subjt:  -------------------------------------NSYTPIRTIQAGETIPTYAGAKTREEIMLVHLMLQHISKKEVILQASQTKAAKLRNLEMQVGY

Query:  LATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQIKALPPKETPNCQ
        LATDLKS+P    PS+ +V KRDG EQCNALTL +GKTL      P   PN Q
Subjt:  LATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQIKALPPKETPNCQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGGGTCGATACGAGGAGTCCTTTGGGAGGAAGACTATTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAA
TGGTCGAGGGCCGATGCAGCAGTCTCAGGGCTCTGGGTATAAATGTGAGGACTTGGTTGGAGTCACTCCCTTCAGAATCAAGTACAAGCCGGGACGACTTGGCGAGAAAG
TTTTTGATGAAATCGAAACATACTATAATGGATTGGACGATGCCACCCGTCTTGTCATCGATGCATCAGCAAATGGGGCTTTGCTAGCAAAACCTTATGCTGAAGCATTC
AATATCTTAGAGAGGATATCGTCGAACAATCATTCATGGTTTGACCATAGAGCCATCCAAGGAAGAGGAAGCAAGGGAATGAACGAATCAGAGTCTTTCACCTCCTTAAA
TTCAAAGATTGAGAATTTGTCAGATTTGGTTATGAGGAGTGTGACGCAGCAAAACACATCTGGAGCATTTGTTTGTAGGAACAGCTATACTCCAATACGTACAATCCAGG
CTGGAGAAACCATCCCAACTTATGCTGGAGCGAAAACCAGGGAGGAAATAATGCTGGTACATCTAATGCTCCAGCATATCAGCAAAAAGGAGGTTATCCTCCAGGCTTCT
CAAACCAAGGCAGCAAAGCTGAGGAATCTGGAAATGCAGGTGGGTTATTTGGCAACAGATCTGAAGAGTAGACCTTATAGAGCGCGACCTAGTAATACTGAGGTGACCAA
GAGGGATGGGAATGAGCAATGTAATGCTCTGACATTGTGGAATGGCAAGACATTGCAGATAAAAGCCCTTCCACCTAAGGAGACCCCTAATTGTCAATTTCAAAAATCTA
GATCAAATAGCACAATCCCAAGGAATGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGGGTCGATACGAGGAGTCCTTTGGGAGGAAGACTATTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAA
TGGTCGAGGGCCGATGCAGCAGTCTCAGGGCTCTGGGTATAAATGTGAGGACTTGGTTGGAGTCACTCCCTTCAGAATCAAGTACAAGCCGGGACGACTTGGCGAGAAAG
TTTTTGATGAAATCGAAACATACTATAATGGATTGGACGATGCCACCCGTCTTGTCATCGATGCATCAGCAAATGGGGCTTTGCTAGCAAAACCTTATGCTGAAGCATTC
AATATCTTAGAGAGGATATCGTCGAACAATCATTCATGGTTTGACCATAGAGCCATCCAAGGAAGAGGAAGCAAGGGAATGAACGAATCAGAGTCTTTCACCTCCTTAAA
TTCAAAGATTGAGAATTTGTCAGATTTGGTTATGAGGAGTGTGACGCAGCAAAACACATCTGGAGCATTTGTTTGTAGGAACAGCTATACTCCAATACGTACAATCCAGG
CTGGAGAAACCATCCCAACTTATGCTGGAGCGAAAACCAGGGAGGAAATAATGCTGGTACATCTAATGCTCCAGCATATCAGCAAAAAGGAGGTTATCCTCCAGGCTTCT
CAAACCAAGGCAGCAAAGCTGAGGAATCTGGAAATGCAGGTGGGTTATTTGGCAACAGATCTGAAGAGTAGACCTTATAGAGCGCGACCTAGTAATACTGAGGTGACCAA
GAGGGATGGGAATGAGCAATGTAATGCTCTGACATTGTGGAATGGCAAGACATTGCAGATAAAAGCCCTTCCACCTAAGGAGACCCCTAATTGTCAATTTCAAAAATCTA
GATCAAATAGCACAATCCCAAGGAATGATTAG
Protein sequenceShow/hide protein sequence
MVEGRYEESFGRKTIGALGINGQGPIDGEVIGASGINGRGPMQQSQGSGYKCEDLVGVTPFRIKYKPGRLGEKVFDEIETYYNGLDDATRLVIDASANGALLAKPYAEAF
NILERISSNNHSWFDHRAIQGRGSKGMNESESFTSLNSKIENLSDLVMRSVTQQNTSGAFVCRNSYTPIRTIQAGETIPTYAGAKTREEIMLVHLMLQHISKKEVILQAS
QTKAAKLRNLEMQVGYLATDLKSRPYRARPSNTEVTKRDGNEQCNALTLWNGKTLQIKALPPKETPNCQFQKSRSNSTIPRND