; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016420 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016420
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPatatin
Genome locationscaffold1486:77451..77816
RNA-Seq ExpressionMS016420
SyntenyMS016420
Gene Ontology termsGO:0000162 - tryptophan biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584077.1 hypothetical protein SDJN03_20009, partial [Cucurbita argyrosperma subsp. sororia]6.5e-3673.91Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKIC-SAFDVSGSNRLNKISG
        MAATA VAIGTRGT+GSL+KKEIDYFAKIE ERCS          DMA+S   SSSPPTFWH+VMSWRRKKKR  +RF+ KIC SAFDVS SN++NKISG
Subjt:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKIC-SAFDVSGSNRLNKISG

Query:  FNYTILQNDFNSLHM
        FNYTILQN+F+SLHM
Subjt:  FNYTILQNDFNSLHM

KAG7019678.1 hypothetical protein SDJN02_18641, partial [Cucurbita argyrosperma subsp. argyrosperma]5.5e-4373.28Show/hide
Query:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKIC-
        MQH+KPSTTFCK  E+MAATA VAIGTRGT+GSL+KKEIDYFAKIE ERCS          DMA+S   SS PPTFWH+VMSWRRKKKR  +RF+ KIC 
Subjt:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKIC-

Query:  SAFDVSGSNRLNKISGFNYTILQNDFNSLHM
        SAFDVS SN++NKISGFNYTILQN+F+SLHM
Subjt:  SAFDVSGSNRLNKISGFNYTILQNDFNSLHM

KGN64543.2 hypothetical protein Csa_013063 [Cucumis sativus]4.7e-3471.43Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY
        MAA APVAIGTRGT+GSLVKKEIDYFAKIE E      R  G +MASS  R SSPPTFW ++MSWRRK K   NRF+TK+CS FD S SNR+NKISG +Y
Subjt:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY

Query:  TILQNDFNSLHM
        TILQNDF+SLHM
Subjt:  TILQNDFNSLHM

XP_016898893.1 PREDICTED: uncharacterized protein LOC103485409 [Cucumis melo]3.2e-3572.32Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY
        MAATAPVAIGTRGT+GSL+KKEIDYFAKIE E      R  G +MASS  R SSPPTFW ++MSWRRKKK   NRFITK+CS FD S SNR+NKISG +Y
Subjt:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY

Query:  TILQNDFNSLHM
        TILQNDF+SLH+
Subjt:  TILQNDFNSLHM

XP_022140128.1 uncharacterized protein LOC111010862 [Momordica charantia]6.2e-63100Show/hide
Query:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN
        MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN
Subjt:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN

Query:  RLNKISGFNYTILQNDFNSLHM
        RLNKISGFNYTILQNDFNSLHM
Subjt:  RLNKISGFNYTILQNDFNSLHM

TrEMBL top hitse value%identityAlignment
A0A0A0LX43 Uncharacterized protein8.8e-3968.75Show/hide
Query:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAF
        MQH+KP+ TF K    MAA APVAIGTRGT+GSLVKKEIDYFAKIE E      R  G +MASS  R SSPPTFW ++MSWRRK K   NRF+TK+CS F
Subjt:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAF

Query:  DVSGSNRLNKISGFNYTILQNDFNSLHM
        D S SNR+NKISG +YTILQNDF+SLHM
Subjt:  DVSGSNRLNKISGFNYTILQNDFNSLHM

A0A1S4DSE9 uncharacterized protein LOC1034854091.6e-3572.32Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY
        MAATAPVAIGTRGT+GSL+KKEIDYFAKIE E      R  G +MASS  R SSPPTFW ++MSWRRKKK   NRFITK+CS FD S SNR+NKISG +Y
Subjt:  MAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY

Query:  TILQNDFNSLHM
        TILQNDF+SLH+
Subjt:  TILQNDFNSLHM

A0A5A7UTL1 Uncharacterized protein7.2e-3369.03Show/hide
Query:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAF
        MQH+KP+ TF K    MAATAPVAIGTRGT+GSL+KKEIDYFAKIE E      R  G +MASS  R SSPPTFW ++MSWRRKKK   NRFITK+CS F
Subjt:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFE------RCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAF

Query:  DVSGSNRLNKISG
        D S SNR+NKISG
Subjt:  DVSGSNRLNKISG

A0A5N6QB68 Uncharacterized protein7.0e-2854.92Show/hide
Query:  KPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDV
        K S T  KF E+M A APVAIGTRGTVGSLV+KEI+YF KIE +RC  +        D+AS+S RS+S P+FW  +M+W+RKK+R  +  +  ICSA +V
Subjt:  KPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGN--------DMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDV

Query:  SGSNRLNKISGFNYTILQNDFN
        + SNRLN+I G+NY IL+ND +
Subjt:  SGSNRLNKISGFNYTILQNDFN

A0A6J1CHB1 uncharacterized protein LOC1110108623.0e-63100Show/hide
Query:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN
        MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN
Subjt:  MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSN

Query:  RLNKISGFNYTILQNDFNSLHM
        RLNKISGFNYTILQNDFNSLHM
Subjt:  RLNKISGFNYTILQNDFNSLHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G21780.1 unknown protein2.1e-1644.04Show/hide
Query:  APVAIGTRGTVGSLVKKEIDYFAKI-----EFERCSGNDMASSSR-----RSSSPPTFWHTVMSWRRKKKRI---GNRFITKICSAFDVSGSNRLNKISG
        AP+AIGTRGT+GSLV+KEIDYF        +F+   GN   + +      RSSS    W +   WR+KK++    G +F   +CSA +VSG NR   + G
Subjt:  APVAIGTRGTVGSLVKKEIDYFAKI-----EFERCSGNDMASSSR-----RSSSPPTFWHTVMSWRRKKKRI---GNRFITKICSAFDVSGSNRLNKISG

Query:  FNYTILQND
        FNY IL++D
Subjt:  FNYTILQND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACATAGCAAACCCAGCACCACATTCTGCAAATTCCTTGAAGAAATGGCTGCAACTGCTCCTGTGGCCATAGGAACTAGAGGCACTGTGGGCTCACTAGTCAAGAA
GGAAATTGACTATTTCGCCAAAATTGAGTTTGAAAGATGCAGCGGCAATGACATGGCTTCTTCAAGCCGCCGCAGCAGTTCCCCGCCAACTTTCTGGCATACAGTAATGT
CATGGCGAAGGAAGAAGAAGAGAATCGGCAATCGGTTCATCACGAAGATTTGCTCGGCTTTCGATGTCTCGGGAAGCAATCGGCTGAATAAGATTTCTGGGTTCAATTAT
ACGATCCTTCAGAATGATTTCAACAGCTTGCATATG
mRNA sequenceShow/hide mRNA sequence
ATGCAACATAGCAAACCCAGCACCACATTCTGCAAATTCCTTGAAGAAATGGCTGCAACTGCTCCTGTGGCCATAGGAACTAGAGGCACTGTGGGCTCACTAGTCAAGAA
GGAAATTGACTATTTCGCCAAAATTGAGTTTGAAAGATGCAGCGGCAATGACATGGCTTCTTCAAGCCGCCGCAGCAGTTCCCCGCCAACTTTCTGGCATACAGTAATGT
CATGGCGAAGGAAGAAGAAGAGAATCGGCAATCGGTTCATCACGAAGATTTGCTCGGCTTTCGATGTCTCGGGAAGCAATCGGCTGAATAAGATTTCTGGGTTCAATTAT
ACGATCCTTCAGAATGATTTCAACAGCTTGCATATG
Protein sequenceShow/hide protein sequence
MQHSKPSTTFCKFLEEMAATAPVAIGTRGTVGSLVKKEIDYFAKIEFERCSGNDMASSSRRSSSPPTFWHTVMSWRRKKKRIGNRFITKICSAFDVSGSNRLNKISGFNY
TILQNDFNSLHM