; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019209 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019209
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionthymocyte nuclear protein 1
Genome locationChr04:18703439..18704014
RNA-Seq ExpressionHG10019209
SyntenyHG10019209
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0007166 - cell surface receptor signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR002740 - EVE domain
IPR015947 - PUA-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044320.1 thymocyte nuclear protein 1 [Cucumis melo var. makuwa]4.4e-7992.95Show/hide
Query:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP
        MAN VAGD++QYWLLKTEPAEWSWADQAAN+GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS+V+++VVVDVEAVGEMREP
Subjt:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP

Query:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGS+
Subjt:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

KAG7030934.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-7878.68Show/hide
Query:  LGGNIFTIFPPEKSPTSLSLLLPLNRHKRT-------QKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFY
        +G N+  IFPP KSPTSLS LLPL    +T        K MA+A AGD+KQ+WLLKTEPAEWSWADQAAN GRTKWDGVKNKQAQK+LKSMKLGDLCFFY
Subjt:  LGGNIFTIFPPEKSPTSLSLLLPLNRHKRT-------QKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFY

Query:  HSGAKARRVVGVVAVAREWYSAVD-EDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        HSGAKARR+VGV++VAREWYS  D  D VVDVEAVGEMRE VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIW+KICELGGGFEGDGT+  HGSE
Subjt:  HSGAKARRVVGVVAVAREWYSAVD-EDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

XP_004150262.2 thymocyte nuclear protein 1 [Cucumis sativus]2.0e-8785.35Show/hide
Query:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK------RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCF
        M LGGNI TIFP E  P+  SLL LPLN HK      R++KTMANAVAGD++QYWLLKTEPAEWSWADQAAN+GRT WDGVKNKQAQKHLKSMKLGD CF
Subjt:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK------RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCF

Query:  FYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        FYHSGAKARRVVGVVAVAREWYS+VD++VVVDVEAVGEMREPVDLKEMKK MEGMKNFALFRQPRLSVVPV KEIWDKICELGGGFEGDGTEGG GSE
Subjt:  FYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

XP_016901490.1 PREDICTED: thymocyte nuclear protein 1 [Cucumis melo]7.7e-9287.31Show/hide
Query:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF
        +NLG NI TIFPP  SPTS SLL LPLNRHK     R++KTMAN VAGD++QYWLLKTEPAEWSWADQAAN+GR+KWDGVKNKQAQKHLKSMKLGD CFF
Subjt:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF

Query:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        YHSGAKARRVVGVVAVAREWYS+V+++VVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGS+
Subjt:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

XP_038905606.1 thymocyte nuclear protein 1 [Benincasa hispida]6.1e-8196.79Show/hide
Query:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP
        MA  VAGDRKQYWLLKTEPAEWSWADQAAN+GRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSA D+DVVVDVEAVGEMREP
Subjt:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP

Query:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
Subjt:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

TrEMBL top hitse value%identityAlignment
A0A0A0KWC0 EVE domain-containing protein9.4e-8885.35Show/hide
Query:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK------RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCF
        M LGGNI TIFP E  P+  SLL LPLN HK      R++KTMANAVAGD++QYWLLKTEPAEWSWADQAAN+GRT WDGVKNKQAQKHLKSMKLGD CF
Subjt:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK------RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCF

Query:  FYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        FYHSGAKARRVVGVVAVAREWYS+VD++VVVDVEAVGEMREPVDLKEMKK MEGMKNFALFRQPRLSVVPV KEIWDKICELGGGFEGDGTEGG GSE
Subjt:  FYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

A0A1S4E0I6 thymocyte nuclear protein 13.7e-9287.31Show/hide
Query:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF
        +NLG NI TIFPP  SPTS SLL LPLNRHK     R++KTMAN VAGD++QYWLLKTEPAEWSWADQAAN+GR+KWDGVKNKQAQKHLKSMKLGD CFF
Subjt:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF

Query:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        YHSGAKARRVVGVVAVAREWYS+V+++VVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGS+
Subjt:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

A0A5D3E131 Thymocyte nuclear protein 12.1e-7992.95Show/hide
Query:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP
        MAN VAGD++QYWLLKTEPAEWSWADQAAN+GR+KWDGVKNKQAQKHLKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS+V+++VVVDVEAVGEMREP
Subjt:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREP

Query:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGS+
Subjt:  VDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

A0A6J1FML1 thymocyte nuclear protein 18.0e-7186.62Show/hide
Query:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVD-EDVVVDVEAVGEMRE
        MA+A AGD+KQ+WLLKTEPAEWSWADQAAN GRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWY   D  D VVDVEAVGEMRE
Subjt:  MANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSAVD-EDVVVDVEAVGEMRE

Query:  PVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
         VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIW+KICELGGGFEGDGT+  HGSE
Subjt:  PVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

E5GBH2 EVE domain-containing protein3.7e-9287.31Show/hide
Query:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF
        +NLG NI TIFPP  SPTS SLL LPLNRHK     R++KTMAN VAGD++QYWLLKTEPAEWSWADQAAN+GR+KWDGVKNKQAQKHLKSMKLGD CFF
Subjt:  MNLGGNIFTIFPPEKSPTSLSLL-LPLNRHK-----RTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFF

Query:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        YHSGAKARRVVGVVAVAREWYS+V+++VVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGS+
Subjt:  YHSGAKARRVVGVVAVAREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE

SwissProt top hitse value%identityAlignment
Q6P3E0 Thymocyte nuclear protein 13.7e-1231.12Show/hide
Query:  KSPTSLSLLLPLNRHKRTQKTMANAVAGDRKQYWLLKTEP---------AEWSWAD-QAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRV
        K+  S S  + L      + T       +   YWL+K+EP          ++S  D QA  +  T WDGV+N QA   L++MKL D  FFYHS  K   +
Subjt:  KSPTSLSLLLPLNRHKRTQKTMANAVAGDRKQYWLLKTEP---------AEWSWAD-QAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRV

Query:  VGVVAVAREWY---------------SAVDED---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL
        VG++ + +E Y               S+ +++    +VDV+ V  M+  + L E+K      K   G +K+  LF + RLSV P+ +E +D I  L
Subjt:  VGVVAVAREWY---------------SAVDED---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL

Q6PFL8 Thymocyte nuclear protein 13.2e-0829.45Show/hide
Query:  WLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY----------------SAVDED
        WL+K+EP          ++   D  A   +T  WDGV+N QA+  ++ MK+G   FFYHS  K   + G++ + +E Y                S  D  
Subjt:  WLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY----------------SAVDED

Query:  V--VVDVEAVGEMREPVDLKEMKK-----GMEG--MKNFALFRQPRLSVVPVAKEIWDKICEL
           +VDV+    ++  + L E+KK      ++G  +K+ ALF + RLSV P+  E ++ +  L
Subjt:  V--VVDVEAVGEMREPVDLKEMKK-----GMEG--MKNFALFRQPRLSVVPVAKEIWDKICEL

Q90679 Thymocyte nuclear protein 12.4e-1132.93Show/hide
Query:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED
        +WLLK+EP          ++S  D  A   +T  W+GV+N QA+  L++MKLG   FFYHS  K   +VG+V + +E Y               S+  E+
Subjt:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED

Query:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL
            +VDV+ V   +  + L E+K      K   G +KN  LF + RLS+ P+ +E +D +  L
Subjt:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL

Q91YJ3 Thymocyte nuclear protein 11.6e-1234.15Show/hide
Query:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED
        YWL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKL D  FFYHS  K   +VG++ + +E Y               S+ ++D
Subjt:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED

Query:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL
            +VDV+ V  M+  + L+E+K      K   G +K+  LF + RLSV P+ +E +D I  L
Subjt:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL

Q9P016 Thymocyte nuclear protein 11.4e-1131.71Show/hide
Query:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED
        +WL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKLG+  FFYHS  K   + G++ + +E Y               S+ +++
Subjt:  YWLLKTEP---------AEWSWADQAANEGRTK-WDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---------------SAVDED

Query:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL
            +VDV+ V  M+  + L E+K      K   G +KN  LF + RLS+ P+ +E +D +  L
Subjt:  ---VVVDVEAVGEMREPVDLKEMK------KGMEG-MKNFALFRQPRLSVVPVAKEIWDKICEL

Arabidopsis top hitse value%identityAlignment
AT2G14660.1 unknown protein5.4e-5165.38Show/hide
Query:  GDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS----AVDEDVVVDVEAVGEMREPVD
        G  K+YWLLKTEP EWSW+DQ +N G +KWDGVKNKQAQK+LKSM LGDLCFFYHSG K+R VVGVV V+REWY+     V+ +  VDV+A+GEMR+ VD
Subjt:  GDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS----AVDEDVVVDVEAVGEMREPVD

Query:  LKEMKKGMEGM--KNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE
        LKEM KG +G+  K F LFRQPRLSVVPV +++W+ ICELG GF GDG E    S+
Subjt:  LKEMKKGMEGM--KNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTAGGCGGCAACATATTCACGATTTTCCCGCCGGAAAAATCACCGACGTCACTATCTCTTCTTCTCCCACTTAACCGCCACAAGCGAACGCAGAAAACAATGGC
CAACGCCGTCGCCGGAGACAGAAAACAGTATTGGCTTCTGAAGACGGAGCCGGCAGAGTGGTCGTGGGCGGACCAAGCCGCCAACGAGGGACGAACAAAGTGGGACGGCG
TCAAAAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTCGGCGTGGTCGCCGTC
GCACGCGAGTGGTACTCGGCAGTCGACGAAGATGTCGTCGTGGACGTGGAGGCGGTCGGAGAGATGAGGGAGCCGGTGGATTTGAAAGAGATGAAGAAGGGGATGGAAGG
GATGAAGAATTTTGCTCTGTTTCGGCAACCAAGGCTGTCGGTTGTGCCAGTTGCGAAGGAGATTTGGGATAAGATCTGTGAATTGGGAGGCGGATTTGAAGGAGATGGAA
CAGAGGGCGGCCATGGGAGTGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTAGGCGGCAACATATTCACGATTTTCCCGCCGGAAAAATCACCGACGTCACTATCTCTTCTTCTCCCACTTAACCGCCACAAGCGAACGCAGAAAACAATGGC
CAACGCCGTCGCCGGAGACAGAAAACAGTATTGGCTTCTGAAGACGGAGCCGGCAGAGTGGTCGTGGGCGGACCAAGCCGCCAACGAGGGACGAACAAAGTGGGACGGCG
TCAAAAACAAGCAAGCTCAGAAGCATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTCGGCGTGGTCGCCGTC
GCACGCGAGTGGTACTCGGCAGTCGACGAAGATGTCGTCGTGGACGTGGAGGCGGTCGGAGAGATGAGGGAGCCGGTGGATTTGAAAGAGATGAAGAAGGGGATGGAAGG
GATGAAGAATTTTGCTCTGTTTCGGCAACCAAGGCTGTCGGTTGTGCCAGTTGCGAAGGAGATTTGGGATAAGATCTGTGAATTGGGAGGCGGATTTGAAGGAGATGGAA
CAGAGGGCGGCCATGGGAGTGAGTGA
Protein sequenceShow/hide protein sequence
MNLGGNIFTIFPPEKSPTSLSLLLPLNRHKRTQKTMANAVAGDRKQYWLLKTEPAEWSWADQAANEGRTKWDGVKNKQAQKHLKSMKLGDLCFFYHSGAKARRVVGVVAV
AREWYSAVDEDVVVDVEAVGEMREPVDLKEMKKGMEGMKNFALFRQPRLSVVPVAKEIWDKICELGGGFEGDGTEGGHGSE