CuGenDBv2

Tan0007849 (gene) of Snake gourd v1 genome

Gene ID: Tan0007849
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Genome location: LG07:13189990..13192588
RNA-Seq Expression: Tan0007849
Synteny: Tan0007849
Gene Ontology terms:
GO:0000462 - maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0005730 - nucleolus (cellular component)
InterPro domains: IPR027973 - Protein of unknown function DUF4602
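The genome location field can be read as a sequence ID followed by a start..end range. The sketch below parses that string and computes the span of the gene on the linkage group. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("LG07:13189990..13192588")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2599 bp on LG07. If the coordinates were 0-based and half-open instead, the span would be one base shorter.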


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010308.1 hypothetical protein SDJN02_27101 [Cucurbita argyrosperma subsp. argyrosperma] (e value 1.4e-63, 80.66% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+S DMEP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE++VLKSSEGFFK GVLDVKHLLR AP+RNS     DFGNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

XP_008449630.1 PREDICTED: uncharacterized protein LOC103491457 [Cucumis melo] (e value 3.7e-64, 78.72% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S
        MHSRDK AKAGSSR STDME +M    IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH    QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLR------RAPTRNSD-FRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +++ RKSS KR+PE++VLKSSEGFFK GVLDVKHLLR        P+RNSD FR+ DFGNEM GKGRRKGGKKKNNK+KKKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLR------RAPTRNSD-FRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

XP_022944773.1 uncharacterized protein LOC111449125 [Cucurbita moschata] (e value 2.8e-64, 81.22% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+STDMEP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE++VLKSSEGFFK GVLDVKHLLR AP+RNS     DFGNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

XP_022985670.1 uncharacterized protein LOC111483659 [Cucurbita maxima] (e value 6.9e-63, 80.11% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+STDMEP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE +VLKSSEGFFK GVLDVKHLLR AP+RNS     D GNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

XP_023511857.1 uncharacterized protein LOC111776750 [Cucurbita pepo subsp. pepo] (e value 3.1e-63, 80.11% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+STD+EP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE++VLKSSEGFFK GVLDV HLLR AP+RNS     DFGNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KEY3 Uncharacterized protein (e value 9.7e-63, 76.6% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S
        MHSRDK AKAGSSR STDME +M    IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH    QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNS-------DFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +++ RKSS KR+PE++VLKSSEGFFK GVLDVKHLLR + +RN+        FR+ DFGNEM G GRRKGGKKKNNK+KKKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNS-------DFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

A0A1S3BNE0 uncharacterized protein LOC103491457 (e value 1.8e-64, 78.72% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S
        MHSRDK AKAGSSR STDME +M    IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH    QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLR------RAPTRNSD-FRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +++ RKSS KR+PE++VLKSSEGFFK GVLDVKHLLR        P+RNSD FR+ DFGNEM GKGRRKGGKKKNNK+KKKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLR------RAPTRNSD-FRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

A0A6J1DHM3 uncharacterized protein LOC111020577 (e value 1.9e-58, 72.78% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHFEQFGGSNNSR
        MH RDK+ + GSS  STD+EP+M+   I KEI+FLTSSHMSWKDKKEIENRK+VSLGGKPQKKQ+LPLSVARPIMKKQKEREQKM++E       S N+ 
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHFEQFGGSNNSR

Query:  RKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDF---RSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        R+S  KRKPED+VL+ SEG F+ GVLDVKHLL RAPTRN+DF   R+ DFGN+M GKGRRKGGKKK NK+KKKGGGKKRH
Subjt:  RKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDF---RSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

A0A6J1FZ06 uncharacterized protein LOC111449125 (e value 1.4e-64, 81.22% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+STDMEP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE++VLKSSEGFFK GVLDVKHLLR AP+RNS     DFGNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

A0A6J1JBZ2 uncharacterized protein LOC111483659 (e value 3.4e-63, 80.11% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S
        MHSRDK+AKAGSSR+STDMEP M+   IKKEI+FLTSSHMSWKDKKEIE+RK+VSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQEH+   QFGG  S
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHF--EQFGG--S

Query:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
        +N  R S  KRKPE +VLKSSEGFFK GVLDVKHLLR AP+RNS     D GNEMAGKGRRKGGKKKNNK+ KKGGGKKRH
Subjt:  NNSRRKSSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G44820.1 unknown protein (e value 2.2e-30, 47.15% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGGSNN
        ++ R    K      ST ++  + +  I K++    SSHM+WKDKK +E++KV +LGGK QK  RLPLSVAR  MKKQK+RE+KM++++    +FGG  +
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGGSNN

Query:  SRRK-SSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSD-----------FRSIDFGNEMAG---KGRRKGGKKKNNKNKKKGGGKKR
        S RK +  KR PE+ VLKS+ G FK GVLDVKHLLR  P+ +SD            +++  G +  G   KG++KGGK K NK KKKGGGKKR
Subjt:  SRRK-SSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSD-----------FRSIDFGNEMAG---KGRRKGGKKKNNKNKKKGGGKKR

AT2G44820.2 unknown protein (e value 2.2e-30, 47.15% identity)
Query:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGGSNN
        ++ R    K      ST ++  + +  I K++    SSHM+WKDKK +E++KV +LGGK QK  RLPLSVAR  MKKQK+RE+KM++++    +FGG  +
Subjt:  MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEH--FEQFGGSNN

Query:  SRRK-SSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSD-----------FRSIDFGNEMAG---KGRRKGGKKKNNKNKKKGGGKKR
        S RK +  KR PE+ VLKS+ G FK GVLDVKHLLR  P+ +SD            +++  G +  G   KG++KGGK K NK KKKGGGKKR
Subjt:  SRRK-SSWKRKPEDEVLKSSEGFFKKGVLDVKHLLRRAPTRNSD-----------FRSIDFGNEMAG---KGRRKGGKKKNNKNKKKGGGKKR


Sequences
CDS sequence
ATGCATTCCAGAGATAAAAACGCAAAAGCGGGATCATCTAGGAGCTCGACGGATATGGAACCTGAAATGGATTACATAAAGATCAAGAAAGAAATTGATTTCCTGACCTC
CTCTCATATGTCATGGAAAGACAAAAAGGAGATTGAGAACAGAAAAGTTGTTTCTCTGGGTGGAAAGCCTCAAAAGAAACAAAGACTGCCTCTAAGTGTAGCACGACCAA
TTATGAAGAAACAGAAGGAAAGAGAACAAAAGATGGTACAAGAGCATTTTGAACAATTTGGGGGTAGCAACAATTCTAGAAGAAAATCCTCATGGAAGCGCAAGCCCGAG
GACGAGGTTCTTAAGTCGAGCGAAGGCTTTTTTAAAAAAGGAGTGCTTGATGTCAAGCATCTACTAAGACGAGCTCCTACCAGGAATAGTGACTTTAGGAGTATTGACTT
TGGAAACGAAATGGCCGGTAAAGGTAGAAGAAAGGGGGGAAAAAAGAAGAATAATAAGAACAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGA
mRNA sequence
AAAAAAACAAAAACAAATTGAGCGCCACTCTATAAAATTAAATAAAAGATGAATAAAAGATGAAATCCCTAAACGGCTCCACGGCACGTCTTCTTCTTCACCAGCTCTCC
TACGTTTCAGCAGCTTGGTGCCGCCGTCGCCGCCTCCTCTCTCCTGTTCACTGCCGCCGCTGGCGCCTGTCTTCGTTTGTGGCATCGCAGCCTCCTCCCTCCTAGGGTTT
TTGCATTGATTTTGTACAGAATTTTGTACTCCGCTGTTATCACAGGCGAATAACAACGAAAATAAAAACAGCATTGCGGTAAATTGGGACGTGGTTTCTGTTCGGTGTTC
TGCTGCCTCTCAGAGTTGTGTTTTAGTGTACACGAACTTCAACAGGCCTCGTAGCTACTGCAACCTTTTCTTCCATTGCTGCTTATCTCACTTCTTTTGTGTATCTGAAC
ACTCTTGAGAAGATGCATTCCAGAGATAAAAACGCAAAAGCGGGATCATCTAGGAGCTCGACGGATATGGAACCTGAAATGGATTACATAAAGATCAAGAAAGAAATTGA
TTTCCTGACCTCCTCTCATATGTCATGGAAAGACAAAAAGGAGATTGAGAACAGAAAAGTTGTTTCTCTGGGTGGAAAGCCTCAAAAGAAACAAAGACTGCCTCTAAGTG
TAGCACGACCAATTATGAAGAAACAGAAGGAAAGAGAACAAAAGATGGTACAAGAGCATTTTGAACAATTTGGGGGTAGCAACAATTCTAGAAGAAAATCCTCATGGAAG
CGCAAGCCCGAGGACGAGGTTCTTAAGTCGAGCGAAGGCTTTTTTAAAAAAGGAGTGCTTGATGTCAAGCATCTACTAAGACGAGCTCCTACCAGGAATAGTGACTTTAG
GAGTATTGACTTTGGAAACGAAATGGCCGGTAAAGGTAGAAGAAAGGGGGGAAAAAAGAAGAATAATAAGAACAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGAAATC
ATTCAACTCCAAGAACATTGCTGTTTTACAATGGAATATGAATATCTTTTTTGTCATATCAGCTGCAATTCAATGAAATCTTGCATATTCTTGTTTCACGTATAGCTTTA
AGACATCCAACTATGGATTGTGGTTTTACTTGTAGCATTTTTATTCACTTCTGGCTCGTTCAATGATAAAGTAAAGTGGCTCAGGAGTTGAGAATTCAACCTGGGGATGT
GGAAGTTATTCAGGATGATAGAGG
Protein sequence
MHSRDKNAKAGSSRSSTDMEPEMDYIKIKKEIDFLTSSHMSWKDKKEIENRKVVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEHFEQFGGSNNSRRKSSWKRKPE
DEVLKSSEGFFKKGVLDVKHLLRRAPTRNSDFRSIDFGNEMAGKGRRKGGKKKNNKNKKKGGGKKRH
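
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for this illustration, not a function provided by the database. It stops at the first stop codon and raises `KeyError` on ambiguous bases such as N.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0 up to the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 and last 18 nucleotides of the CDS listed above.
print(translate("ATGCATTCCAGAGATAAAAACGCAAAAGCG"))  # MHSRDKNAKA
print(translate("GGTAAGAAACGCCATTGA"))              # GKKRH
```

Both fragments agree with the start (MHSRDKNAKA) and end (GKKRH) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the whole protein to be compared in the same way.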