CuGenDBv2

MC01g0003 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0003
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: CACTA en-spm transposon protein
Genome location: MC01:260590..261287
RNA-Seq Expression: MC01g0003
Synteny: MC01g0003
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology
GenBank top hits | e value | % identity | Alignment
XP_022157107.1 uncharacterized protein LOC111023906 isoform X2 [Momordica charantia] | 1.10e-108 | 100
Query:  MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP
        MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP
Subjt:  MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP

Query:  HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
        HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
Subjt:  HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] | 3.54e-39 | 42.39
Query:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHT
        S  +  + RG +R ++LD+ V   G + I +DE   KP+C NAT+ SNAI +I R N  P     W DV  EVRD    ++++ F  ++ +  ++KYV  
Subjt:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHT

Query:  ALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
         ++ TFKE+R++L+ HY+    P  AR  P  RIT   DW+ LC+RWETP+WK+K+   K+SR+K+PY HR GSKSF ++Q E+
Subjt:  ALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] | 4.66e-34 | 38.39
Query:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------
        S  +  + RG +R ++LD+ V   G + I +DE   KP+C NAT+ SNAI +I R N  P     W DV  EVRD    Q +V L               
Subjt:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------

Query:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG
                   F  ++ +  ++KYV   ++ TFKE+R++L+ HY+    P  AR  P  RIT   DW+ LC+RWETP+WK+K+   K+SR+K+PY HR G
Subjt:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG

Query:  SKSFGRLQHEL
        SKSF ++Q E+
Subjt:  SKSFGRLQHEL

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida] | 4.48e-34 | 38.39
Query:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------
        S  +  + RG +R ++LD+ V   G + I +DE   KP+C NAT+ SNAI +I R N  P     W DV  EVRD    Q +V L               
Subjt:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------

Query:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG
                   F  ++ +  ++KYV   ++ TFKE+R++L+ HY+    P  AR  P  RIT   DW+ LC+RWETP+WK+K+   K+SR+K+PY HR G
Subjt:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG

Query:  SKSFGRLQHEL
        SKSF ++Q E+
Subjt:  SKSFGRLQHEL

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] | 3.90e-34 | 38.39
Query:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------
        S  +  + RG +R ++LD+ V   G + I +DE   KP+C NAT+ SNAI +I R N  P     W DV  EVRD    Q +V L               
Subjt:  SATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRD-AAKQRIVNL---------------

Query:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG
                   F  ++ +  ++KYV   ++ TFKE+R++L+ HY+    P  AR  P  RIT   DW+ LC+RWETP+WK+K+   K+SR+K+PY HR G
Subjt:  -----------FKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAG

Query:  SKSFGRLQHEL
        SKSF ++Q E+
Subjt:  SKSFGRLQHEL

TrEMBL top hits | e value | % identity | Alignment
A0A5A7THM5 CACTA en-spm transposon protein | 7.50e-29 | 36.41
Query:  ATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTA
        AT + + R  +R ++L++ V  +G + + +   ++KPI  +A R S AI   VR+ F P    +W DV  E  +  K  +  LF V+ +   + ++V   
Subjt:  ATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTA

Query:  LRTTFKEFRAELHAHYKENGPPNVAREKP-HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
        + TTFKEFRA+ H H+K+   P  AR  P +A + + +DWH LCD + +  ++E+S   K +R K PYNH +GSKSF + Q+EL
Subjt:  LRTTFKEFRAELHAHYKENGPPNVAREKP-HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

A0A5A7TRX4 DUF4216 domain-containing protein | 6.41e-28 | 38.07
Query:  RGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKE
        RG  R ++LDK V K G + I ++E   KP+   A +++  I + VR N  P     W  VPM VR+     +   F+ + +  ++RKY+   ++ TF+E
Subjt:  RGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKE

Query:  FRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
        FRA+LH +Y +      AR  P  RIT  +DW+ +CDRWET  WK+K    KRS + + +NH  G+KSF +++HEL
Subjt:  FRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

A0A5D3C6Z8 CACTA en-spm transposon protein | 7.03e-28 | 35.52
Query:  TQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTAL
        T + + R  +R ++L++ V  +G + + +   ++KPI  +A R S AI   VR+ F P    +W DV  E  +  K  +   F ++ +   + ++V   +
Subjt:  TQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTAL

Query:  RTTFKEFRAELHAHYKENGPPNVAREKP-HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
         TTFKEFRA+ H H+K+   P  AR  P +A + + +DWH LCD + +  ++E+S   K +R K PYNH +GSKSF + QHEL
Subjt:  RTTFKEFRAELHAHYKENGPPNVAREKP-HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

A0A6J1DTP1 uncharacterized protein LOC111023906 isoform X2 | 5.34e-109 | 100
Query:  MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP
        MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP
Subjt:  MDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKP

Query:  HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
        HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL
Subjt:  HARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL

A0A6J1DUH3 uncharacterized protein LOC111023212 | 8.00e-29 | 44.63
Query:  SRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSR
        +RWL + + + +         F+V++S+ ++ K++   ++ +FK++R++LH +Y E   P  AR  P  R+T  +DW+ LCDRWETP+WKE + K K++R
Subjt:  SRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEFRAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSR

Query:  AKLPYNHRAGSKSFGRLQHEL
        AKLP+NHRAGSKSF +LQHEL
Subjt:  AKLPYNHRAGSKSFGRLQHEL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
GGTAGTGCTACTCAGTCTCATAAGGGACGGGGAGTAACTCGGTGTGTACAATTGGACAAGGAGGTGGGAAAAGATGGGCCTGTCTCTATTGTAATGGACGAGTTCTCACA
GAAGCCAATATGCAAGAATGCCACACGCATGTCAAATGCCATAAGATCCATTGTTCGAGAGAATTTCGACCCAGCACTGTACTCCCGTTGGTTGGACGTACCGATGGAAG
TTAGGGATGCAGCAAAACAACGCATAGTGAATTTGTTCAAGGTGAACATGTCACAGGCGATAATAAGGAAGTATGTACACACAGCACTACGAACAACATTTAAGGAGTTT
AGGGCAGAATTGCATGCCCACTACAAAGAAAATGGACCTCCCAATGTAGCTCGGGAGAAACCTCATGCTAGAATCACGAAGTTGCAGGATTGGCACAAGTTGTGCGACAG
ATGGGAGACTCCAGATTGGAAGGAAAAATCAGGAAAAGCCAAGAGAAGTAGAGCGAAGTTACCATACAACCATCGGGCTGGGTCAAAATCTTTCGGTCGCCTGCAACACG
AATTG
mRNA sequence
GGTAGTGCTACTCAGTCTCATAAGGGACGGGGAGTAACTCGGTGTGTACAATTGGACAAGGAGGTGGGAAAAGATGGGCCTGTCTCTATTGTAATGGACGAGTTCTCACA
GAAGCCAATATGCAAGAATGCCACACGCATGTCAAATGCCATAAGATCCATTGTTCGAGAGAATTTCGACCCAGCACTGTACTCCCGTTGGTTGGACGTACCGATGGAAG
TTAGGGATGCAGCAAAACAACGCATAGTGAATTTGTTCAAGGTGAACATGTCACAGGCGATAATAAGGAAGTATGTACACACAGCACTACGAACAACATTTAAGGAGTTT
AGGGCAGAATTGCATGCCCACTACAAAGAAAATGGACCTCCCAATGTAGCTCGGGAGAAACCTCATGCTAGAATCACGAAGTTGCAGGATTGGCACAAGTTGTGCGACAG
ATGGGAGACTCCAGATTGGAAGGAAAAATCAGGAAAAGCCAAGAGAAGTAGAGCGAAGTTACCATACAACCATCGGGCTGGGTCAAAATCTTTCGGTCGCCTGCAACACG
AATTG
Protein sequence
GSATQSHKGRGVTRCVQLDKEVGKDGPVSIVMDEFSQKPICKNATRMSNAIRSIVRENFDPALYSRWLDVPMEVRDAAKQRIVNLFKVNMSQAIIRKYVHTALRTTFKEF
RAELHAHYKENGPPNVAREKPHARITKLQDWHKLCDRWETPDWKEKSGKAKRSRAKLPYNHRAGSKSFGRLQHEL