; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCACTA en-spm transposon protein
Genome locationchr4:17162020..17165687
RNA-Seq ExpressionMoc04g23760
SyntenyMoc04g23760
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143616.1 uncharacterized protein LOC111013476 [Momordica charantia]6.0e-2939.13Show/hide
Query:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE
        + KF VD+   H + YI YEIGTR+KDYR +L+R+Y+   D   AR + Y  I  E                K A+NKV++SKL FNHR GPK F  HRE
Subjt:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE

Query:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEAR
        D                                   E+M+ L      +  E +E++ METVL KRS+Y  GMGYGPKP   K  +  YS EYV +LEAR
Subjt:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEAR

Query:  LAKN-EELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNKE
        L K+ EE+ +T  +  QK  E Q QE+ R+   M+++   F  GGGSSS +K+
Subjt:  LAKN-EELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNKE

XP_022148911.1 uncharacterized protein LOC111017461 [Momordica charantia]7.3e-2739.02Show/hide
Query:  DLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHREDKKKED
        D+   H + YI YEIGTR+KDYR +L+R+Y+   D   AR + Y  I  E                K A+NKV++SKL FNHR GPK F  HRED     
Subjt:  DLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHREDKKKED

Query:  GTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEARLAKN-E
                                      E+M+ L      +  E +E++ METVL KRS+Y  GMGYGPKP   K  +  YS EYV +LEARL K+ E
Subjt:  GTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEARLAKN-E

Query:  ELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNK
        E+ +T  +  QK  E Q QE+ R+   M+++   F  GGGSSS +K
Subjt:  ELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNK

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]5.8e-3244.86Show/hide
Query:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE
        +N+F+VD+   H   YI YEIG R+KDYR  L+++Y+ + D   AR   Y   T +                K ARNKVS+SK+ FNH  G KSFLS R 
Subjt:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE

Query:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQA
        DK KEDGTY+S  ++F+ THC  +KGW D+AA+ A+E M+ L    ++ +K  ++++ +  VL KRSSY  G GYGPKPP +K+A
Subjt:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQA

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]5.4e-2245.59Show/hide
Query:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE
        +N+F+VD+   H   YI YEIG R+KDYR  L+++Y+ + D   AR   Y   T +                K ARNKVS+SK+ FNH  G KSFLS R 
Subjt:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE

Query:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAH
        DK KEDGTY+S  ++F+ THC  +KGW D+AA+ A+
Subjt:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAH

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]5.4e-2245.59Show/hide
Query:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE
        +N+F+VD+   H   YI YEIG R+KDYR  L+++Y+ + D   AR   Y   T +                K ARNKVS+SK+ FNH  G KSFLS R 
Subjt:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE

Query:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAH
        DK KEDGTY+S  ++F+ THC  +KGW D+AA+ A+
Subjt:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAH

TrEMBL top hitse value%identityAlignment
A0A438CMH8 Uncharacterized protein2.1e-1628.12Show/hide
Query:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED
        KF +DL  +H    ++ ++  R++++R  LH++++ F     A+   +  ++ +                 + A N V++SK+PF+H+ G +SF+ H   
Subjt:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED

Query:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL
           E+G  +   +LF   H     GW ++ AR  +EKM+EL  +   E    ++E +  E VL ++S Y KG+G+GPKP S  ++   S E+   LE RL
Subjt:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL

Query:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS
         + + L++TQ Q            + L + Q Q++ ++F   EE+ R      GSS
Subjt:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS

A0A438EUC9 Uncharacterized protein2.1e-1628.52Show/hide
Query:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED
        KF +DL  +H    ++ ++  R++++R  LH++++ F     A+   +  ++ +                 + A N V++SK+PF+HR G +SF+ H   
Subjt:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED

Query:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL
           E+G  +   +LF   H     GW ++ AR  +EKM+EL  +   E    ++E +  E VL ++S Y KG+G+GPKP S  ++   S E    LE RL
Subjt:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL

Query:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS
         + + L++TQ Q            + L + Q Q++ ++F   EE+ R      GSS
Subjt:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS

A0A438J796 Uncharacterized protein7.4e-1728.52Show/hide
Query:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED
        KF +DL  +H    ++ ++  R++++R  LH++++ F     A+   +  ++ +                 + A N V++SK+PF+HR G +SF+ H   
Subjt:  KFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE----------------AKLARNKVSQSKLPFNHRAGPKSFLSHRED

Query:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL
           E+G  +   +LF   H     GW ++ AR  +EKM+EL  +   E    ++E +  E VL ++S Y KG+G+GPKP S  ++   S E+   LE RL
Subjt:  KKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDK-ELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARL

Query:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS
         + + L++TQ Q            + L + Q Q++ ++F   EE+ R      GSS
Subjt:  AKNEELLQTQHQ----------GTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSS

A0A6J1CQT5 uncharacterized protein LOC1110134762.9e-2939.13Show/hide
Query:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE
        + KF VD+   H + YI YEIGTR+KDYR +L+R+Y+   D   AR + Y  I  E                K A+NKV++SKL FNHR GPK F  HRE
Subjt:  MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHRE

Query:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEAR
        D                                   E+M+ L      +  E +E++ METVL KRS+Y  GMGYGPKP   K  +  YS EYV +LEAR
Subjt:  DKKKEDGTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEAR

Query:  LAKN-EELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNKE
        L K+ EE+ +T  +  QK  E Q QE+ R+   M+++   F  GGGSSS +K+
Subjt:  LAKN-EELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNKE

A0A6J1D6S9 uncharacterized protein LOC1110174613.5e-2739.02Show/hide
Query:  DLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHREDKKKED
        D+   H + YI YEIGTR+KDYR +L+R+Y+   D   AR + Y  I  E                K A+NKV++SKL FNHR GPK F  HRED     
Subjt:  DLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPE---------------AKLARNKVSQSKLPFNHRAGPKSFLSHREDKKKED

Query:  GTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEARLAKN-E
                                      E+M+ L      +  E +E++ METVL KRS+Y  GMGYGPKP   K  +  YS EYV +LEARL K+ E
Subjt:  GTYLSPFDLFFTTHCHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQK-QAGGYSQEYVHALEARLAKN-E

Query:  ELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNK
        E+ +T  +  QK  E Q QE+ R+   M+++   F  GGGSSS +K
Subjt:  ELLQTQHQGTQKLFEMQRQEYERRFDSMEELFRRFTEGGGSSSLNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAATTTGAGGTGGACCTCCATGTTAAACATCACTATGATTATATCAAGTACGAGATCGGAACCCGTTACAAAGATTATCGACACCGCTTACATAGGTAC
TATCGTGATTTCGAAGATGCTGAAACGGCTCGACATCGACTATATGGACAAATTACACCAGAAGCAAAGTTAGCGAGAAACAAAGTTAGTCAGAGTAAACTACCC
TTTAATCATCGCGCTGGCCCGAAATCATTTTTATCCCATCGCGAAGATAAGAAAAAGGAAGATGGGACGTATTTGAGCCCTTTCGATTTGTTCTTCACAACACAC
TGCCATCCGACGAAGGGTTGGACCGACGAAGCCGCTCGTATTGCACATGAAAAAATGGTGGAGTTGGTAGAAAAAACGAAGGAAGAAGATAAAGAATTAAGTGAG
CAAGATGCGATGGAAACAGTGCTTAGAAAGCGATCATCATACACGAAAGGGATGGGGTACGGTCCAAAGCCACCAAGTCAGAAACAAGCAGGAGGATACTCACAG
GAATATGTTCATGCTTTGGAGGCTAGACTGGCAAAAAATGAAGAGTTATTGCAAACTCAACACCAGGGAACCCAGAAGTTGTTTGAAATGCAACGTCAAGAGTAT
GAAAGAAGGTTTGACAGTATGGAAGAACTTTTTCGAAGATTTACTGAAGGAGGAGGAAGTTCATCGTTGAACAAGGAGGGTGCATTGATGTGGGAGATTGAAATG
TGGGGACAAAATGACGTTCCGTCGCCGCGACGAAACAAAATTTTCGTCGCAAAACATTGCGACGCGCGCAATTTCGTCGCTGATGTCCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAAATTTGAGGTGGACCTCCATGTTAAACATCACTATGATTATATCAAGTACGAGATCGGAACCCGTTACAAAGATTATCGACACCGCTTACATAGGTAC
TATCGTGATTTCGAAGATGCTGAAACGGCTCGACATCGACTATATGGACAAATTACACCAGAAGCAAAGTTAGCGAGAAACAAAGTTAGTCAGAGTAAACTACCC
TTTAATCATCGCGCTGGCCCGAAATCATTTTTATCCCATCGCGAAGATAAGAAAAAGGAAGATGGGACGTATTTGAGCCCTTTCGATTTGTTCTTCACAACACAC
TGCCATCCGACGAAGGGTTGGACCGACGAAGCCGCTCGTATTGCACATGAAAAAATGGTGGAGTTGGTAGAAAAAACGAAGGAAGAAGATAAAGAATTAAGTGAG
CAAGATGCGATGGAAACAGTGCTTAGAAAGCGATCATCATACACGAAAGGGATGGGGTACGGTCCAAAGCCACCAAGTCAGAAACAAGCAGGAGGATACTCACAG
GAATATGTTCATGCTTTGGAGGCTAGACTGGCAAAAAATGAAGAGTTATTGCAAACTCAACACCAGGGAACCCAGAAGTTGTTTGAAATGCAACGTCAAGAGTAT
GAAAGAAGGTTTGACAGTATGGAAGAACTTTTTCGAAGATTTACTGAAGGAGGAGGAAGTTCATCGTTGAACAAGGAGGGTGCATTGATGTGGGAGATTGAAATG
TGGGGACAAAATGACGTTCCGTCGCCGCGACGAAACAAAATTTTCGTCGCAAAACATTGCGACGCGCGCAATTTCGTCGCTGATGTCCAGTAG
Protein sequenceShow/hide protein sequence
MNKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAETARHRLYGQITPEAKLARNKVSQSKLPFNHRAGPKSFLSHREDKKKEDGTYLSPFDLFFTTH
CHPTKGWTDEAARIAHEKMVELVEKTKEEDKELSEQDAMETVLRKRSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEARLAKNEELLQTQHQGTQKLFEMQRQEY
ERRFDSMEELFRRFTEGGGSSSLNKEGALMWEIEMWGQNDVPSPRRNKIFVAKHCDARNFVADVQ