CuGenDBv2

Moc01g10370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10370
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:6432044..6433568
RNA-Seq Expression: Moc01g10370
Synteny: Moc01g10370
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
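
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; the input format is taken
    from the 'Genome location' field of this record.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


chrom, start, end = parse_location("chr1:6432044..6433568")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 1525 bp.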


Homology
GenBank top hits (e value, % identity, alignment)
XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia] (e value: 5.4e-37; % identity: 41.18)
Query:  LRDQFQKEIEDLKRQCRPVD-PHQPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL
        LR Q   + E LK +C   + P    +  E PF+  +L+API  +FKAPT+ PYD S+DP  Y EVFEG MDF ATSD +KCRAFQIAL GS RLWYR+L
Subjt:  LRDQFQKEIEDLKRQCRPVD-PHQPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL

Query:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY
          RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K                        + G   P++  E+L + ++ 
Subjt:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY

Query:  IDGLELWKANGARRRTVVKIG
        IDG EL +    R    +  G
Subjt:  IDGLELWKANGARRRTVVKIG

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] (e value: 4.7e-49; % identity: 75)
Query:  MDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVKG-------------
        MDFLA SD +KCRAFQIALEGSVRLWY+QLKPRSIDSYQQLRR FINQFSA QLLKLPPSHL TVKQRD+ESLTEYIA+ MDEHVK              
Subjt:  MDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVKG-------------

Query:  ----------EFGSRPPASLNEMLARARQYIDGLELWKANGARR
                  EFGSRPPASLN+MLARARQYIDGLELWKA GARR
Subjt:  ----------EFGSRPPASLNEMLARARQYIDGLELWKANGARR

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia] (e value: 2.1e-36; % identity: 38.62)
Query:  TPVRAEMRGHQDDEGWH---PKIGNLLRDQFQK-------EIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVF
        +P R+    +Q  E  H      G + R++F +       ++E LK +C   +      +  E PF+  +L+API  +FKAPT+ PYD S DP  Y EVF
Subjt:  TPVRAEMRGHQDDEGWH---PKIGNLLRDQFQK-------EIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVF

Query:  EGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------
        EG MDF A SDT+KCRAFQIAL  S RLWYR+L  RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K           
Subjt:  EGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------

Query:  ------------GEFGSRPPASLNEMLARARQYIDGLELWKANGAR
                     + G   PA+  E+L +A++ IDG EL +    R
Subjt:  ------------GEFGSRPPASLNEMLARARQYIDGLELWKANGAR

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia] (e value: 4.5e-52; % identity: 67.24)
Query:  MSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQF
        MS YD S DPISY EVFEGKMDFLA SD MKC AFQI LEGS RLWYRQLK RSIDSYQQLRR FINQFS  Q LKLP SHLGTVKQRD+ES T YIA+F
Subjt:  MSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQF

Query:  MDEHVKG-----------------------EFGSRPPASLNEMLARARQYIDGLELWKANGARRRTVVKIGTKS
        MDEHVK                        EFGS  PA LNEM ARARQYIDGLELW A+GA     V+ GT S
Subjt:  MDEHVKG-----------------------EFGSRPPASLNEMLARARQYIDGLELWKANGARRRTVVKIGTKS

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 1.6e-36; % identity: 41.78)
Query:  LRDQFQKEIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL
        LR +   ++E LK +C   D      +  E PF+  +L+API  +FKAPT+ PYD ++DP  Y EVFEG MDF A SD +KCRAFQIAL GS RLWYR+L
Subjt:  LRDQFQKEIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL

Query:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY
          RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K                        + G   PA+  E+L +A++ 
Subjt:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY

Query:  IDGLELWKANGAR
        IDG EL +    R
Subjt:  IDGLELWKANGAR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CKB3 uncharacterized protein LOC111012081 (e value: 2.6e-37; % identity: 41.18)
Query:  LRDQFQKEIEDLKRQCRPVD-PHQPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL
        LR Q   + E LK +C   + P    +  E PF+  +L+API  +FKAPT+ PYD S+DP  Y EVFEG MDF ATSD +KCRAFQIAL GS RLWYR+L
Subjt:  LRDQFQKEIEDLKRQCRPVD-PHQPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL

Query:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY
          RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K                        + G   P++  E+L + ++ 
Subjt:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY

Query:  IDGLELWKANGARRRTVVKIG
        IDG EL +    R    +  G
Subjt:  IDGLELWKANGARRRTVVKIG

A0A6J1D5T3 uncharacterized protein LOC111017548 (e value: 2.3e-49; % identity: 75)
Query:  MDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVKG-------------
        MDFLA SD +KCRAFQIALEGSVRLWY+QLKPRSIDSYQQLRR FINQFSA QLLKLPPSHL TVKQRD+ESLTEYIA+ MDEHVK              
Subjt:  MDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVKG-------------

Query:  ----------EFGSRPPASLNEMLARARQYIDGLELWKANGARR
                  EFGSRPPASLN+MLARARQYIDGLELWKA GARR
Subjt:  ----------EFGSRPPASLNEMLARARQYIDGLELWKANGARR

A0A6J1DDS5 uncharacterized protein LOC111019842 (e value: 9.9e-37; % identity: 38.62)
Query:  TPVRAEMRGHQDDEGWH---PKIGNLLRDQFQK-------EIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVF
        +P R+    +Q  E  H      G + R++F +       ++E LK +C   +      +  E PF+  +L+API  +FKAPT+ PYD S DP  Y EVF
Subjt:  TPVRAEMRGHQDDEGWH---PKIGNLLRDQFQK-------EIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVF

Query:  EGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------
        EG MDF A SDT+KCRAFQIAL  S RLWYR+L  RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K           
Subjt:  EGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------

Query:  ------------GEFGSRPPASLNEMLARARQYIDGLELWKANGAR
                     + G   PA+  E+L +A++ IDG EL +    R
Subjt:  ------------GEFGSRPPASLNEMLARARQYIDGLELWKANGAR

A0A6J1DIZ8 uncharacterized protein LOC111020475 (e value: 2.2e-52; % identity: 67.24)
Query:  MSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQF
        MS YD S DPISY EVFEGKMDFLA SD MKC AFQI LEGS RLWYRQLK RSIDSYQQLRR FINQFS  Q LKLP SHLGTVKQRD+ES T YIA+F
Subjt:  MSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQF

Query:  MDEHVKG-----------------------EFGSRPPASLNEMLARARQYIDGLELWKANGARRRTVVKIGTKS
        MDEHVK                        EFGS  PA LNEM ARARQYIDGLELW A+GA     V+ GT S
Subjt:  MDEHVKG-----------------------EFGSRPPASLNEMLARARQYIDGLELWKANGARRRTVVKIGTKS

A0A6J1DS95 uncharacterized protein LOC111023421 (e value: 7.6e-37; % identity: 41.78)
Query:  LRDQFQKEIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL
        LR +   ++E LK +C   D      +  E PF+  +L+API  +FKAPT+ PYD ++DP  Y EVFEG MDF A SD +KCRAFQIAL GS RLWYR+L
Subjt:  LRDQFQKEIEDLKRQCRPVDPH-QPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWYRQL

Query:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY
          RSI +Y QLRR F+ QFS+    K   +HL T++Q++ E+L EY+ +F +E +K                        + G   PA+  E+L +A++ 
Subjt:  KPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVK-----------------------GEFGSRPPASLNEMLARARQY

Query:  IDGLELWKANGAR
        IDG EL +    R
Subjt:  IDGLELWKANGAR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCCAAACCCGCTCCCAGCAGTCACGATCTCAACTTTCATCGATCCGCTCTCCAATGAGGGATGCTGCCACTTCCCGTGCCCATCCTAGCCTCACGTACTCT
CAGGTGGCCGGGACTCCTATCATCGAACGACGACCCCAGGCAGGCGTGGTTAAGAAAAATGGAGGTCGGCCTGCCACATCCGATCCCATAGCAGCCCAAGACTTC
CATCTCACCCCAGATCAGTTTCCGCCACTACAGCCTCAGAGGAACAGGTTGCCGCCCCGTGTTCCTCGTCTCCGCAGCTGGGGGAGCACAGGTGCGCGTTCCGGG
GCAAGTGCCAACATAGGCATGGACCCCGTCATGGTAGCTAACGTGATCGCCGAGCTTGCGGAAGTCAAGGCGAGGCTCGAAGCGGTCGAGAGAGGCAGCGAGATG
TCCGGCTTTTCCGTCTCCAGGGATCCCACTCGAGGAAAGGGGCCGATGCATCCGACCCAAAGAACGGAGTATAAGTACGCAGCTACGCTCAGCAGGACAACCAGG
TGGAAGACCAACGCCCGAGGGTCCGACCAATTCGGACCCCTCTGGCATCATTCGATAGCTACAACGCCCGTCAGGGCTGAGATGAGGGGCCACCAAGACGACGAG
GGGTGGCACCCGAAGATCGGGAACCTCCTCCGAGACCAATTTCAGAAGGAAATAGAAGATCTCAAGCGGCAGTGCAGACCTGTAGACCCACATCAGCCGGCTGAG
CAGGAAGAGCGGCCTTTCTCCCAAGCTATCCTGGACGCGCCCATTCTGCTGAGGTTCAAGGCTCCGACCATGAGTCCCTATGATGAATCCGAAGATCCGATCTCT
TATGCAGAGGTGTTCGAGGGAAAGATGGACTTCTTGGCAACAAGTGACACTATGAAGTGCCGAGCCTTCCAAATAGCCTTGGAAGGCTCGGTCAGGTTGTGGTAC
CGACAGTTGAAGCCCCGATCCATCGACAGTTATCAACAGTTGAGAAGGTTCTTCATCAACCAGTTCTCAGCTTGGCAGTTGTTGAAGTTGCCGCCCTCGCACCTC
GGAACAGTGAAGCAACGAGATAGTGAGTCCCTAACGGAGTACATCGCTCAATTCATGGACGAGCATGTCAAAGGTGAGTTTGGAAGCCGTCCGCCAGCCTCCCTG
AACGAGATGCTCGCTCGAGCCCGCCAGTACATTGACGGCTTGGAGTTGTGGAAAGCCAATGGAGCCCGGCGAAGAACCGTGGTAAAGATCGGGACCAAAAGTCCC
CTGACTGGGCGTTTATGCTGA
mRNA sequence
ATGGCCCAAACCCGCTCCCAGCAGTCACGATCTCAACTTTCATCGATCCGCTCTCCAATGAGGGATGCTGCCACTTCCCGTGCCCATCCTAGCCTCACGTACTCT
CAGGTGGCCGGGACTCCTATCATCGAACGACGACCCCAGGCAGGCGTGGTTAAGAAAAATGGAGGTCGGCCTGCCACATCCGATCCCATAGCAGCCCAAGACTTC
CATCTCACCCCAGATCAGTTTCCGCCACTACAGCCTCAGAGGAACAGGTTGCCGCCCCGTGTTCCTCGTCTCCGCAGCTGGGGGAGCACAGGTGCGCGTTCCGGG
GCAAGTGCCAACATAGGCATGGACCCCGTCATGGTAGCTAACGTGATCGCCGAGCTTGCGGAAGTCAAGGCGAGGCTCGAAGCGGTCGAGAGAGGCAGCGAGATG
TCCGGCTTTTCCGTCTCCAGGGATCCCACTCGAGGAAAGGGGCCGATGCATCCGACCCAAAGAACGGAGTATAAGTACGCAGCTACGCTCAGCAGGACAACCAGG
TGGAAGACCAACGCCCGAGGGTCCGACCAATTCGGACCCCTCTGGCATCATTCGATAGCTACAACGCCCGTCAGGGCTGAGATGAGGGGCCACCAAGACGACGAG
GGGTGGCACCCGAAGATCGGGAACCTCCTCCGAGACCAATTTCAGAAGGAAATAGAAGATCTCAAGCGGCAGTGCAGACCTGTAGACCCACATCAGCCGGCTGAG
CAGGAAGAGCGGCCTTTCTCCCAAGCTATCCTGGACGCGCCCATTCTGCTGAGGTTCAAGGCTCCGACCATGAGTCCCTATGATGAATCCGAAGATCCGATCTCT
TATGCAGAGGTGTTCGAGGGAAAGATGGACTTCTTGGCAACAAGTGACACTATGAAGTGCCGAGCCTTCCAAATAGCCTTGGAAGGCTCGGTCAGGTTGTGGTAC
CGACAGTTGAAGCCCCGATCCATCGACAGTTATCAACAGTTGAGAAGGTTCTTCATCAACCAGTTCTCAGCTTGGCAGTTGTTGAAGTTGCCGCCCTCGCACCTC
GGAACAGTGAAGCAACGAGATAGTGAGTCCCTAACGGAGTACATCGCTCAATTCATGGACGAGCATGTCAAAGGTGAGTTTGGAAGCCGTCCGCCAGCCTCCCTG
AACGAGATGCTCGCTCGAGCCCGCCAGTACATTGACGGCTTGGAGTTGTGGAAAGCCAATGGAGCCCGGCGAAGAACCGTGGTAAAGATCGGGACCAAAAGTCCC
CTGACTGGGCGTTTATGCTGA
Protein sequence
MAQTRSQQSRSQLSSIRSPMRDAATSRAHPSLTYSQVAGTPIIERRPQAGVVKKNGGRPATSDPIAAQDFHLTPDQFPPLQPQRNRLPPRVPRLRSWGSTGARSG
ASANIGMDPVMVANVIAELAEVKARLEAVERGSEMSGFSVSRDPTRGKGPMHPTQRTEYKYAATLSRTTRWKTNARGSDQFGPLWHHSIATTPVRAEMRGHQDDE
GWHPKIGNLLRDQFQKEIEDLKRQCRPVDPHQPAEQEERPFSQAILDAPILLRFKAPTMSPYDESEDPISYAEVFEGKMDFLATSDTMKCRAFQIALEGSVRLWY
RQLKPRSIDSYQQLRRFFINQFSAWQLLKLPPSHLGTVKQRDSESLTEYIAQFMDEHVKGEFGSRPPASLNEMLARARQYIDGLELWKANGARRRTVVKIGTKSP
LTGRLC
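
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The following is a minimal sketch, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), the function name `translate` is hypothetical, and only the first and last codons of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this gene uses table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and final seven codons of the CDS listed in this record.
first_codons = "ATGGCCCAAACCCGCTCC"
last_codons = "CTGACTGGGCGTTTATGCTGA"
print(translate(first_codons))  # MAQTRS
print(translate(last_codons))   # LTGRLC
```

Both fragments agree with the start (`MAQTRS`) and end (`LTGRLC`) of the protein sequence given in the record; passing the full CDS, with or without line breaks, would allow the same comparison over the whole sequence.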