; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0221 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0221
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTOX high mobility group box family member 4-A, putative isoform 4
Genome locationMC03:5322438..5322983
RNA-Seq ExpressionMC03g0221
SyntenyMC03g0221
Gene Ontology termsNA
InterPro domainsIPR012862 - Protein of unknown function DUF1635


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593425.1 hypothetical protein SDJN03_12901, partial [Cucurbita argyrosperma subsp. sororia]1.87e-7669.43Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++E+K+KLL   IELES KMEANQEM+ NKE++KNL+NLLQMAY+ERDEAK+QL K+LN+ ML S+FQAESPV KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  --GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING
          GSPVVDSFFDAVSS D  +       N+DP  LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQP  IPPVL+ G
Subjt:  --GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING

XP_022158099.1 uncharacterized protein LOC111024665 [Momordica charantia]1.92e-118100Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA
        SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA

XP_022949510.1 uncharacterized protein LOC111452838 [Cucurbita moschata]1.24e-7670.16Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++E+K+KLL   IELES KMEANQEM+ NKE++KNL+NLLQMAYKERDEAK+QL K+LN+ ML S+FQAESPV KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING
        GSPVVDSFFDAVSS D  +       N+DP  LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLP+WRNPPPLQP  IPPVL+ G
Subjt:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING

XP_022998708.1 uncharacterized protein LOC111493292 [Cucurbita maxima]7.82e-7669.63Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++E+K+KLL   IELES KMEANQEM+ NKE++KNL+NLLQMAY+ERDEAK+QL K+LN+ ML S+FQAESPV KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING
        GSPVVDSFFDAVSS D  +       N+DP  LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLP+WRNPPPLQP  IPPVL+ G
Subjt:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING

XP_038894989.1 uncharacterized protein LOC120083342 isoform X1 [Benincasa hispida]2.44e-7571.73Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++ELK+KLL T IELESVKMEANQEMI NKEN+KNLLNLLQ+AYKERDEAK+QLHK+LNK      +Q ESP+VKANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  GSPVVDSFFDAVSSPD-------SGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLIN
        GSPVVDSFFDAVSSPD         +N+D  +LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQP  + PPVLIN
Subjt:  GSPVVDSFFDAVSSPD-------SGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLIN

TrEMBL top hitse value%identityAlignment
A0A0A0M0C6 Uncharacterized protein6.39e-7569.9Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++ELK+KLLYT IELESVKMEANQEMI NKEN+KNLLNLLQ+AYKERDEA++QLHK+LNK     NFQ ESP++KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  -GSPVVDSFFDA--VSSPDS--------GNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLING
         GSPVV+SFFDA  VSSP           +N+D  +LVIESIVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQP+ + PPVLIN 
Subjt:  -GSPVVDSFFDA--VSSPDS--------GNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLING

A0A5A7STR2 TPRXL protein1.58e-7570.41Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++EL++KLLYT IELESVKMEANQEMI NKEN+KNLLNLLQ+AYKERDEAK+QLHK+LNK     NFQ ESP++KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  -GSPVVDSFFDA--VSSPDS--------GNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLING
         GSPVVDSFFDA  VSSP           +N+D  +LVIESIVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQP+ + PPVLIN 
Subjt:  -GSPVVDSFFDA--VSSPDS--------GNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNI-PPVLING

A0A6J1DYE5 uncharacterized protein LOC1110246659.27e-119100Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA
        SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA

A0A6J1GCY4 uncharacterized protein LOC1114528385.98e-7770.16Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++E+K+KLL   IELES KMEANQEM+ NKE++KNL+NLLQMAYKERDEAK+QL K+LN+ ML S+FQAESPV KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING
        GSPVVDSFFDAVSS D  +       N+DP  LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLP+WRNPPPLQP  IPPVL+ G
Subjt:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING

A0A6J1KHI5 uncharacterized protein LOC1114932923.79e-7669.63Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------
        ++E+K+KLL   IELES KMEANQEM+ NKE++KNL+NLLQMAY+ERDEAK+QL K+LN+ ML S+FQAESPV KANSSITES+SLS             
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLS-------------

Query:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING
        GSPVVDSFFDAVSS D  +       N+DP  LVIE+IVKG++LPEKGRLLQSVMEAGPLLQTLLVAGPLP+WRNPPPLQP  IPPVL+ G
Subjt:  GSPVVDSFFDAVSSPDSGN-------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLING

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G28140.1 Protein of unknown function (DUF1635)3.0e-1937.06Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        +EEL+  LL+T +ELE  +M A++E+I   + +  L +LL  A KE+DEA+++  ++L    L  NF            +   +     P ++ F  +  
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP-LQPLNIPPVLI
             ++ +P  + IE       LPEKG+LL++V++AGPLLQTLL+AG LP+WR+PPP L+   IPPV+I
Subjt:  SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP-LQPLNIPPVLI

AT2G28690.1 Protein of unknown function (DUF1635)3.2e-3749.22Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS
        M+EL++KL Y++ ELE+VK +AN+E   ++E +KNLL+LL++A +ERDEAKDQL KLL               +K NSSITES+S   SP VDSFF+ VS
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVS

Query:  SPDSGN-------------------------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPL-QPLNIPPV
        S +  N                          +DP   +++ I+KG+ LPEKG+LLQ+VME+GPLLQTLLVAGPLPRWRNPPPL Q   +PP+
Subjt:  SPDSGN-------------------------NMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPL-QPLNIPPV

AT3G44940.1 Protein of unknown function (DUF1635)1.4e-2137.63Show/hide
Query:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQL-HKLLNKFMLPSNFQAES---------PVVKANSSITESSSLSGSPV
        EEL++ L+YT +ELE  K+ A++E+    E L +L ++L    KERDEA ++  H LLN  +L    Q            P+  A+S I +       P 
Subjt:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQL-HKLLNKFMLPSNFQAES---------PVVKANSSITESSSLSGSPV

Query:  VDSFFDAVSSPDSGNNMDPSAL---------------VIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI
        ++S     SS    + M PS +               ++ +++  + LPEKG+LLQ+V++AGPLLQTLL+AGPLP+WR+ PPPL+   IPPV I
Subjt:  VDSFFDAVSSPDSGNNMDPSAL---------------VIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI

AT5G22930.1 Protein of unknown function (DUF1635)2.1e-2035.33Show/hide
Query:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL-NKFMLPSNFQAESPVVKANS---------SITESSSLSGSPV
        EE+++ LLYT +EL+  KM A +E+    E L +L ++L    KERDEA ++  +L+ +   L        P+  A+S          +  + S S S  
Subjt:  EELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL-NKFMLPSNFQAESPVVKANS---------SITESSSLSGSPV

Query:  VDSFFDAVS-----SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI
         +S            P     +  + ++++ +++ + LPEKG+LLQ+V++AGPLLQTLL+AGPLP+WR+ PPPL+   IPPV +
Subjt:  VDSFFDAVS-----SPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRN-PPPLQPLNIPPVLI

AT5G59760.1 Protein of unknown function (DUF1635)6.5e-2235.68Show/hide
Query:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL--------NKFMLPSNFQAESPVVKANSSITESSSLSGSPVV
        ++E+++ L  T  ELE++KMEAN++   ++E +  LLNLL+   +ERDEA+ QL + +        ++ +  SN  + S  V ++SS   S+ L+  P  
Subjt:  MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLL--------NKFMLPSNFQAESPVVKANSSITESSSLSGSPVV

Query:  DSFFDAVSSPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP---LQPLNIPPVLINGHDM
         +  +  ++ +    +DP    ++ +V G+  PE G+LL++V+EAGPLL+TLL+AGPLP+W NPPP    Q   +P +   G D+
Subjt:  DSFFDAVSSPDSGNNMDPSALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPP---LQPLNIPPVLINGHDM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGCTGAAAGAGAAGCTCTTATACACAGCGATTGAGCTGGAATCAGTGAAAATGGAAGCAAATCAAGAGATGATAAACAACAAAGAGAACTTAAAGAATCTGCT
GAATCTTCTTCAGATGGCATACAAAGAACGAGATGAAGCAAAGGACCAGCTGCATAAGCTGCTCAACAAATTCATGTTGCCATCCAATTTCCAAGCAGAGAGCCCTGTTG
TCAAAGCAAATTCAAGCATTACAGAATCTAGCAGCCTCTCCGGCTCCCCCGTCGTCGATTCCTTCTTTGACGCGGTTTCGTCGCCCGATTCGGGCAACAATATGGATCCA
AGCGCATTGGTGATTGAGAGCATTGTGAAGGGGAGGAGGCTGCCGGAGAAGGGGAGGCTGCTGCAGTCTGTGATGGAGGCAGGGCCTTTACTGCAGACGCTTCTCGTCGC
CGGGCCGCTCCCTCGGTGGCGCAATCCTCCGCCGCTGCAGCCCTTAAACATCCCACCAGTTCTCATCAATGGACACGACATGCCGGATGCTGAACAGAAACCAGCA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGCTGAAAGAGAAGCTCTTATACACAGCGATTGAGCTGGAATCAGTGAAAATGGAAGCAAATCAAGAGATGATAAACAACAAAGAGAACTTAAAGAATCTGCT
GAATCTTCTTCAGATGGCATACAAAGAACGAGATGAAGCAAAGGACCAGCTGCATAAGCTGCTCAACAAATTCATGTTGCCATCCAATTTCCAAGCAGAGAGCCCTGTTG
TCAAAGCAAATTCAAGCATTACAGAATCTAGCAGCCTCTCCGGCTCCCCCGTCGTCGATTCCTTCTTTGACGCGGTTTCGTCGCCCGATTCGGGCAACAATATGGATCCA
AGCGCATTGGTGATTGAGAGCATTGTGAAGGGGAGGAGGCTGCCGGAGAAGGGGAGGCTGCTGCAGTCTGTGATGGAGGCAGGGCCTTTACTGCAGACGCTTCTCGTCGC
CGGGCCGCTCCCTCGGTGGCGCAATCCTCCGCCGCTGCAGCCCTTAAACATCCCACCAGTTCTCATCAATGGACACGACATGCCGGATGCTGAACAGAAACCAGCA
Protein sequenceShow/hide protein sequence
MEELKEKLLYTAIELESVKMEANQEMINNKENLKNLLNLLQMAYKERDEAKDQLHKLLNKFMLPSNFQAESPVVKANSSITESSSLSGSPVVDSFFDAVSSPDSGNNMDP
SALVIESIVKGRRLPEKGRLLQSVMEAGPLLQTLLVAGPLPRWRNPPPLQPLNIPPVLINGHDMPDAEQKPA