; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026300 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026300
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBED-type domain-containing protein
Genome locationscaffold38:418901..420082
RNA-Seq ExpressionMS026300
SyntenyMS026300
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152799.1 uncharacterized protein LOC111020429 [Momordica charantia]7.1e-9198.75Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV
        MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV

Query:  FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTE ESSAPVLDDS LDNLPLECRGSP
Subjt:  FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]6.7e-8189.44Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVN QGALGTDFAILGRT+N PGDWWSGYGYEIP+LQRAAIRILSQPCSSYGC RWNW TFE+LHSKKR+  EQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHI WTRD KYKPVVFDDIDVSLEWPTELESSA VLDDS LDNLPLEC GSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

XP_023007736.1 uncharacterized protein LOC111500259 [Cucurbita maxima]3.9e-8189.44Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVN QGALGTDFAILGRT+N PGDWWSGYGYEIP+LQR AIRILSQPCSSYGC RWNW TFE+LHSKKR+R EQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHI WTRD KYKPVVFDDIDVSLEWPTELESSA VLDDS LDNLPLEC GSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]9.3e-8390.68Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVNGQGALGTDFAILGRT+NAPGDWWSGYGYEIP+LQRAAIRIL+QPCSSYGC RWNW TFE+LHSKKR+RAEQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHIC TRD KYKPVVFDDIDVSLEWPTE E+SA VLDDS LDNLPLECRGSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]9.3e-8390.68Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVNGQGALGTDFAILGRT+NAPGDWWSGYGYEIP+LQRAAIRIL+QPCSSYGC RWNW TFE+LHSKKR+RAEQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHIC TRD KYKPVVFDDIDVSLEWPTE E+SA VLDDS LDNLPLECRGSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

TrEMBL top hitse value%identityAlignment
A0A1S3BLP8 uncharacterized protein LOC1034909277.2e-8188.89Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVNGQGALGTDFAILGRT+N+PGDWWSGYGYEIP+LQRAA+RILSQPCSSYGC RWNW TFE+LHSKKR+RAEQEKL DL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLD-NLPLECRGSP
        VFVQCNLWLQHIC TRDSKYKP+VFDDIDVSLEWP+ELE SA VLDDS LD NLPLECRGSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLD-NLPLECRGSP

A0A5D3D7G5 HAT transposon superfamily7.2e-8188.89Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVNGQGALGTDFAILGRT+N+PGDWWSGYGYEIP+LQRAA+RILSQPCSSYGC RWNW TFE+LHSKKR+RAEQEKL DL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLD-NLPLECRGSP
        VFVQCNLWLQHIC TRDSKYKP+VFDDIDVSLEWP+ELE SA VLDDS LD NLPLECRGSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLD-NLPLECRGSP

A0A6J1DH28 uncharacterized protein LOC1110204293.4e-9198.75Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV
        MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLV

Query:  FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTE ESSAPVLDDS LDNLPLECRGSP
Subjt:  FVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311323.2e-8189.44Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVN QGALGTDFAILGRT+N PGDWWSGYGYEIP+LQRAAIRILSQPCSSYGC RWNW TFE+LHSKKR+  EQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHI WTRD KYKPVVFDDIDVSLEWPTELESSA VLDDS LDNLPLEC GSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002591.9e-8189.44Show/hide
Query:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL
        MLKMA TDKDK+EITREHPAYVN QGALGTDFAILGRT+N PGDWWSGYGYEIP+LQR AIRILSQPCSSYGC RWNW TFE+LHSKKR+R EQEKLNDL
Subjt:  MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGC-RWNWVTFESLHSKKRNRAEQEKLNDL

Query:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP
        VFVQCNLWLQHI WTRD KYKPVVFDDIDVSLEWPTELESSA VLDDS LDNLPLEC GSP
Subjt:  VFVQCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily4.4e-2235.71Show/hide
Query:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFV
        K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR AIRILSQ CS Y     W TF+ +H ++RN+ ++E LN L +V
Subjt:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFV

Query:  QCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAP
          NL L  +         P+  +DID+  EW  E E+ +P
Subjt:  QCNLWLQHICWTRDSKYKPVVFDDIDVSLEWPTELESSAP

AT3G22220.1 hAT transposon superfamily1.8e-1536.59Show/hide
Query:  ITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFVQCNLWLQHIC
        + ++  +Y N  G  G + AI  R    P +WWS YG    +L R AIRILSQ C SS G   N  +   ++  K N  E+++LNDLVFVQ N+ L+ I 
Subjt:  ITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFVQCNLWLQHIC

Query:  --WTRDSKYKPVVFDDIDVSLEW
           + D    P+   +++V  +W
Subjt:  --WTRDSKYKPVVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily4.5e-1939.1Show/hide
Query:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVF
        ++   DK + +I +E  +Y    G  G + AI  R    P +WWS YG    +L R AIRILSQ C SS  CR N +  E ++  K N  EQ++L+DLVF
Subjt:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVF

Query:  VQCNLWLQHI-CWTRDSKYKPVVFDDIDVSLEW
        VQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  VQCNLWLQHI-CWTRDSKYKPVVFDDIDVSLEW

AT4G15020.2 hAT transposon superfamily4.5e-1939.1Show/hide
Query:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVF
        ++   DK + +I +E  +Y    G  G + AI  R    P +WWS YG    +L R AIRILSQ C SS  CR N +  E ++  K N  EQ++L+DLVF
Subjt:  KMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPC-SSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVF

Query:  VQCNLWLQHI-CWTRDSKYKPVVFDDIDVSLEW
        VQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  VQCNLWLQHI-CWTRDSKYKPVVFDDIDVSLEW

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related7.2e-2540.14Show/hide
Query:  EITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFVQCNLWLQHIC
        +I  E  A+    G  G   AI  RT  +P +WWS YG   P+LQ  AI++LS  CS+ GC  NW  F+ LH+K+RNR  Q +LND++FV+ N  LQ   
Subjt:  EITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFVQCNLWLQHIC

Query:  WTRDSKYKPVVFDDIDVSLEWPT---ELESSAPVLDDSLLDN
        + R+  + P++ ++ID   EW T   E  SS    DD + +N
Subjt:  WTRDSKYKPVVFDDIDVSLEWPT---ELESSAPVLDDSLLDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAAGATGGCGATGACGGATAAAGATAAATTGGAGATCACCAGAGAACATCCTGCATATGTAAATGGACAAGGTGCTCTTGGTACTGACTTTGCAATCTTGGGGAG
AACTGTAAATGCCCCAGGTGATTGGTGGTCTGGGTATGGTTACGAAATCCCCTCGCTCCAAAGAGCGGCAATACGAATTCTTAGCCAACCCTGTAGTTCTTATGGGTGTA
GATGGAACTGGGTCACGTTCGAAAGTTTGCACTCGAAGAAGCGTAATAGAGCTGAACAGGAAAAGTTGAACGATCTGGTGTTTGTACAGTGCAATCTTTGGTTGCAACAC
ATTTGTTGGACTCGGGACAGTAAATATAAACCCGTTGTATTCGACGATATAGATGTAAGTTTAGAATGGCCAACGGAACTCGAATCCTCGGCTCCTGTTTTAGATGATTC
ATTGTTGGATAATCTACCTCTTGAATGTAGAGGGAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAAGATGGCGATGACGGATAAAGATAAATTGGAGATCACCAGAGAACATCCTGCATATGTAAATGGACAAGGTGCTCTTGGTACTGACTTTGCAATCTTGGGGAG
AACTGTAAATGCCCCAGGTGATTGGTGGTCTGGGTATGGTTACGAAATCCCCTCGCTCCAAAGAGCGGCAATACGAATTCTTAGCCAACCCTGTAGTTCTTATGGGTGTA
GATGGAACTGGGTCACGTTCGAAAGTTTGCACTCGAAGAAGCGTAATAGAGCTGAACAGGAAAAGTTGAACGATCTGGTGTTTGTACAGTGCAATCTTTGGTTGCAACAC
ATTTGTTGGACTCGGGACAGTAAATATAAACCCGTTGTATTCGACGATATAGATGTAAGTTTAGAATGGCCAACGGAACTCGAATCCTCGGCTCCTGTTTTAGATGATTC
ATTGTTGGATAATCTACCTCTTGAATGTAGAGGGAGCCCTTAA
Protein sequenceShow/hide protein sequence
MLKMAMTDKDKLEITREHPAYVNGQGALGTDFAILGRTVNAPGDWWSGYGYEIPSLQRAAIRILSQPCSSYGCRWNWVTFESLHSKKRNRAEQEKLNDLVFVQCNLWLQH
ICWTRDSKYKPVVFDDIDVSLEWPTELESSAPVLDDSLLDNLPLECRGSP