CuGenDBv2

Moc04g10700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g10700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr4:7992752..7998359
RNA-Seq Expression: Moc04g10700
Synteny: Moc04g10700
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]; e value 8.8e-58; % identity 47.19
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV
        +L   SLRD+ARTWL SLP ESITSW+DLAE FLMKYF PSKNAKY +DINN QQF       + E FKRL+QKC HH IPRCI I+ YYNG DDA RLV
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV

Query:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLV-MSMTQQNAAGVSAAKANFSHIQGI-------YCS
           S N  LLAKPY EA NILERISSN H   D RAIQG+G K LNES+S++  NSKIEN+ +LV  SMTQQ+  G    KAN SH QG        Y  
Subjt:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLV-MSMTQQNAAGVSAAKANFSHIQGI-------YCS

Query:  FSNFSQIPEALSHLSNPLWTRNRIIGTLQVNKS---------AQSMLEHQMLKHHKWALEVLSYASKYRVLLIRDNPSVLSYSSHKPLESNRK-DGLTRK
        ++N    PE++  L N   +RN          S          + ML H ML+H      +      Y+VL I+    ++         SN + + + R 
Subjt:  FSNFSQIPEALSHLSNPLWTRNRIIGTLQVNKS---------AQSMLEHQMLKHHKWALEVLSYASKYRVLLIRDNPSVLSYSSHKPLESNRK-DGLTRK

Query:  AMVRGITFAVVTLNRQITHY
        A + G T   V+  R+ T +
Subjt:  AMVRGITFAVVTLNRQITHY

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]; e value 8.0e-59; % identity 68.13
Query:  MKYFLPSKNAKYINDINNVQQFAR-------ERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP
        MKYF P KNAKY ++I N QQ  R       ERFK+LLQKCPHHGIPRCIQI+ YY G DDA RLVIDASTNG LL KPY EA NILERISSNNH W DP
Subjt:  MKYFLPSKNAKYINDINNVQQFAR-------ERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP

Query:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN
        RAIQG+GGK LNESES+ ALNSK+ENLTNLVM SMTQQN  G S  KAN SHIQGI CSF       +N+   PE++ +L N
Subjt:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]; e value 4.2e-44; % identity 63.06
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNG
        +L   SLR +ARTWLESL  E ITSW+DL EKFLMKYFLPS                    KRL Q+CP+HGIP  IQI+TYY G D+A RLVIDAS NG
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNG

Query:  TLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNL
         LL KPY +ALNILERISS+NH W D RAI+GK  KEL ESES+T LNSKIE LT+L
Subjt:  TLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNL

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]; e value 1.8e-47; % identity 52.68
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARER-------FKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV
        +L   SLRD+ARTWLESLP ESITSW+DLAEKFLMKYF PSKNAKY ++INN QQFA E        FKRLLQ CPHHGIPRCIQI+TYY   +DA RL 
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARER-------FKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV

Query:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF------
                                        DPRA+QGK  K L ESES+T LNS IENLT LVM SM QQ++ G     AN + IQGI CSF      
Subjt:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF------

Query:  -SNFSQIPEALSHLSNPLWTRNRI
         +N    PE++ +L NP   RN +
Subjt:  -SNFSQIPEALSHLSNPLWTRNRI

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]; e value 1.2e-54; % identity 58.26
Query:  MKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP
        MKYF PSKNAKY ++INN QQF       + ERFKRL+QK  + GIPRCIQIKTYYNG DDA RLVIDAS NG LLAKPY EA NILERISSNN  W DP
Subjt:  MKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP

Query:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN------------PLWTRN
        RAI GKG K  NESESFTALN KIENLT+LVM SMT Q+  G SA KAN SHIQGI CSF       +N    PE++ +L N              W   
Subjt:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN------------PLWTRN

Query:  RIIGTLQVNKSAQSMLEH
         I+  ++V    ++ML++
Subjt:  RIIGTLQVNKSAQSMLEH

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DAK9 uncharacterized protein LOC111018910; e value 4.3e-58; % identity 47.19
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV
        +L   SLRD+ARTWL SLP ESITSW+DLAE FLMKYF PSKNAKY +DINN QQF       + E FKRL+QKC HH IPRCI I+ YYNG DDA RLV
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV

Query:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLV-MSMTQQNAAGVSAAKANFSHIQGI-------YCS
           S N  LLAKPY EA NILERISSN H   D RAIQG+G K LNES+S++  NSKIEN+ +LV  SMTQQ+  G    KAN SH QG        Y  
Subjt:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLV-MSMTQQNAAGVSAAKANFSHIQGI-------YCS

Query:  FSNFSQIPEALSHLSNPLWTRNRIIGTLQVNKS---------AQSMLEHQMLKHHKWALEVLSYASKYRVLLIRDNPSVLSYSSHKPLESNRK-DGLTRK
        ++N    PE++  L N   +RN          S          + ML H ML+H      +      Y+VL I+    ++         SN + + + R 
Subjt:  FSNFSQIPEALSHLSNPLWTRNRIIGTLQVNKS---------AQSMLEHQMLKHHKWALEVLSYASKYRVLLIRDNPSVLSYSSHKPLESNRK-DGLTRK

Query:  AMVRGITFAVVTLNRQITHY
        A + G T   V+  R+ T +
Subjt:  AMVRGITFAVVTLNRQITHY

A0A6J1DRG1 uncharacterized protein LOC111023669; e value 3.9e-59; % identity 68.13
Query:  MKYFLPSKNAKYINDINNVQQFAR-------ERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP
        MKYF P KNAKY ++I N QQ  R       ERFK+LLQKCPHHGIPRCIQI+ YY G DDA RLVIDASTNG LL KPY EA NILERISSNNH W DP
Subjt:  MKYFLPSKNAKYINDINNVQQFAR-------ERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP

Query:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN
        RAIQG+GGK LNESES+ ALNSK+ENLTNLVM SMTQQN  G S  KAN SHIQGI CSF       +N+   PE++ +L N
Subjt:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN

A0A6J1DWK1 uncharacterized protein LOC111025053; e value 2.1e-44; % identity 63.06
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNG
        +L   SLR +ARTWLESL  E ITSW+DL EKFLMKYFLPS                    KRL Q+CP+HGIP  IQI+TYY G D+A RLVIDAS NG
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNG

Query:  TLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNL
         LL KPY +ALNILERISS+NH W D RAI+GK  KEL ESES+T LNSKIE LT+L
Subjt:  TLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNL

A0A6J1DXK5 uncharacterized protein LOC111025500; e value 5.8e-55; % identity 58.26
Query:  MKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP
        MKYF PSKNAKY ++INN QQF       + ERFKRL+QK  + GIPRCIQIKTYYNG DDA RLVIDAS NG LLAKPY EA NILERISSNN  W DP
Subjt:  MKYFLPSKNAKYINDINNVQQF-------ARERFKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDP

Query:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN------------PLWTRN
        RAI GKG K  NESESFTALN KIENLT+LVM SMT Q+  G SA KAN SHIQGI CSF       +N    PE++ +L N              W   
Subjt:  RAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF-------SNFSQIPEALSHLSN------------PLWTRN

Query:  RIIGTLQVNKSAQSMLEH
         I+  ++V    ++ML++
Subjt:  RIIGTLQVNKSAQSMLEH

A0A6J1E1F3 uncharacterized protein LOC111025065; e value 8.9e-48; % identity 52.68
Query:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARER-------FKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV
        +L   SLRD+ARTWLESLP ESITSW+DLAEKFLMKYF PSKNAKY ++INN QQFA E        FKRLLQ CPHHGIPRCIQI+TYY   +DA RL 
Subjt:  RLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARER-------FKRLLQKCPHHGIPRCIQIKTYYNGFDDAMRLV

Query:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF------
                                        DPRA+QGK  K L ESES+T LNS IENLT LVM SM QQ++ G     AN + IQGI CSF      
Subjt:  IDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLVM-SMTQQNAAGVSAAKANFSHIQGIYCSF------

Query:  -SNFSQIPEALSHLSNPLWTRNRI
         +N    PE++ +L NP   RN +
Subjt:  -SNFSQIPEALSHLSNPLWTRNRI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGGATTATATCCCTTCTTCAGTCAAAGGAGTATCCGATCCCATGGATGCACCAAGAGATGCTGTGGAGTCCGCCTCGTCTTCTTGAAAAGTCGCTAAGAGATGATGC
AAGGACTTGGTTGGAGTCACTACCTTATGAGTCGATCACGAGCTGGAATGACTTGGCTGAGAAATTCCTGATGAAGTACTTCCTACCTAGCAAGAACGCCAAATACATAA
ATGATATCAACAACGTCCAACAATTTGCTAGGGAGCGGTTCAAGAGACTTTTGCAAAAATGCCCCCACCACGGGATCCCACGATGTATCCAGATCAAGACTTATTACAAT
GGATTTGATGACGCCATGCGCCTGGTCATTGATGCATCGACAAATGGAACATTGCTAGCAAAACCTTATGTTGAAGCACTCAACATCTTGGAGAGGATATCATCGAACAA
TCACCAGTGGTATGACCCTAGAGCCATTCAAGGGAAAGGAGGCAAGGAATTAAATGAATCAGAGTCTTTCACTGCTTTAAATTCAAAGATTGAGAATTTGACAAACTTGG
TTATGAGTATGACACAGCAAAACGCAGCAGGAGTTTCGGCTGCTAAAGCGAATTTTAGCCACATCCAAGGGATTTATTGTTCTTTTTCGAATTTTAGCCAAATCCCTGAG
GCTTTAAGCCACCTAAGCAATCCCTTATGGACGAGAAATCGAATTATAGGGACACTACAAGTAAATAAGTCAGCCCAATCAATGCTAGAGCATCAAATGCTGAAGCATCA
TAAGTGGGCGCTTGAGGTCCTCTCATATGCATCTAAGTATCGGGTTCTACTGATACGCGACAACCCAAGCGTATTGAGCTACTCGAGCCACAAGCCACTTGAATCAAATC
GTAAAGATGGTTTAACCCGAAAGGCTATGGTTAGAGGGATTACCTTTGCAGTCGTTACCCTGAACAGACAGATTACTCACTATGCTCGAGGCGCTGGTGTCAGTAGGGGC
GTTGGTGAGGCGCGACAGGGGAGCTGGCGATGCGCTACAAGGCGCTGGCGTTGGGCGGCAGAGGCGCTGGTGATGTGCAGCAGGGGCGCTGGTGAGCATGCGGCGTACAC
GGGCATGGGCGCTGGTGAGCATGCGGCGTACACGGGCATGGGCACTGGTGGCGGCATAAGTGCGCGGGTGGCGGCAGGGCGCGCAGGCCCTTGA
mRNA sequence
ATGAGGATTATATCCCTTCTTCAGTCAAAGGAGTATCCGATCCCATGGATGCACCAAGAGATGCTGTGGAGTCCGCCTCGTCTTCTTGAAAAGTCGCTAAGAGATGATGC
AAGGACTTGGTTGGAGTCACTACCTTATGAGTCGATCACGAGCTGGAATGACTTGGCTGAGAAATTCCTGATGAAGTACTTCCTACCTAGCAAGAACGCCAAATACATAA
ATGATATCAACAACGTCCAACAATTTGCTAGGGAGCGGTTCAAGAGACTTTTGCAAAAATGCCCCCACCACGGGATCCCACGATGTATCCAGATCAAGACTTATTACAAT
GGATTTGATGACGCCATGCGCCTGGTCATTGATGCATCGACAAATGGAACATTGCTAGCAAAACCTTATGTTGAAGCACTCAACATCTTGGAGAGGATATCATCGAACAA
TCACCAGTGGTATGACCCTAGAGCCATTCAAGGGAAAGGAGGCAAGGAATTAAATGAATCAGAGTCTTTCACTGCTTTAAATTCAAAGATTGAGAATTTGACAAACTTGG
TTATGAGTATGACACAGCAAAACGCAGCAGGAGTTTCGGCTGCTAAAGCGAATTTTAGCCACATCCAAGGGATTTATTGTTCTTTTTCGAATTTTAGCCAAATCCCTGAG
GCTTTAAGCCACCTAAGCAATCCCTTATGGACGAGAAATCGAATTATAGGGACACTACAAGTAAATAAGTCAGCCCAATCAATGCTAGAGCATCAAATGCTGAAGCATCA
TAAGTGGGCGCTTGAGGTCCTCTCATATGCATCTAAGTATCGGGTTCTACTGATACGCGACAACCCAAGCGTATTGAGCTACTCGAGCCACAAGCCACTTGAATCAAATC
GTAAAGATGGTTTAACCCGAAAGGCTATGGTTAGAGGGATTACCTTTGCAGTCGTTACCCTGAACAGACAGATTACTCACTATGCTCGAGGCGCTGGTGTCAGTAGGGGC
GTTGGTGAGGCGCGACAGGGGAGCTGGCGATGCGCTACAAGGCGCTGGCGTTGGGCGGCAGAGGCGCTGGTGATGTGCAGCAGGGGCGCTGGTGAGCATGCGGCGTACAC
GGGCATGGGCGCTGGTGAGCATGCGGCGTACACGGGCATGGGCACTGGTGGCGGCATAAGTGCGCGGGTGGCGGCAGGGCGCGCAGGCCCTTGA
Protein sequence
MRIISLLQSKEYPIPWMHQEMLWSPPRLLEKSLRDDARTWLESLPYESITSWNDLAEKFLMKYFLPSKNAKYINDINNVQQFARERFKRLLQKCPHHGIPRCIQIKTYYN
GFDDAMRLVIDASTNGTLLAKPYVEALNILERISSNNHQWYDPRAIQGKGGKELNESESFTALNSKIENLTNLVMSMTQQNAAGVSAAKANFSHIQGIYCSFSNFSQIPE
ALSHLSNPLWTRNRIIGTLQVNKSAQSMLEHQMLKHHKWALEVLSYASKYRVLLIRDNPSVLSYSSHKPLESNRKDGLTRKAMVRGITFAVVTLNRQITHYARGAGVSRG
VGEARQGSWRCATRRWRWAAEALVMCSRGAGEHAAYTGMGAGEHAAYTGMGTGGGISARVAAGRAGP
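
The protein sequence listed above appears to be the frame-0 translation of the CDS under the standard genetic code. The following is a minimal Python sketch for checking that relationship, not the pipeline the database itself uses. The function name `translate` and the `CODON_TABLE` dictionary are illustrative names, and the two test strings are the first five and last five codons of the CDS as printed on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped CDS block
    can be pasted in as-is. Codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS: matches the start of the protein (MRIIS...).
print(translate("ATGAGGATTATATCC"))  # MRIIS
# Last five codons of the CDS, ending in the TGA stop (...RAGP).
print(translate("CGCGCAGGCCCTTGA"))  # RAGP
```

Passing the full CDS block to `translate` and comparing the result with the protein block (after removing line breaks) would confirm the two records agree over their entire length.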