; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004847 (gene) of Snake gourd v1 genome

Gene IDTan0004847
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEVE domain-containing protein
Genome locationLG01:5594347..5595318
RNA-Seq ExpressionTan0004847
SyntenyTan0004847
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0007166 - cell surface receptor signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR002740 - EVE domain
IPR015947 - PUA-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600275.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.4e-7489.17Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA AAAG KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWYSE +GGDAVVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
         VDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

KAG7030934.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-7581.11Show/hide
Query:  LSLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAR
        LSLLP  T   +      +  K MA AAAG KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK LKSMKLGDLCFFYHSGAKARR+VGV++VAR
Subjt:  LSLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAR

Query:  EWYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        EWYSE +GGDAVVDVEAVGEMRE VDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  EWYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

XP_022941981.1 thymocyte nuclear protein 1 [Cucurbita moschata]7.0e-7488.54Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA AAAG KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWY E +GGDAVVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
         VDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

XP_023530564.1 thymocyte nuclear protein 1 [Cucurbita pepo subsp. pepo]2.2e-7589.17Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA AA G KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWYSE +GGDAVVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        PVDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

XP_038905606.1 thymocyte nuclear protein 1 [Benincasa hispida]2.7e-7388.61Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA   AG +KQYWLLKTEP+EWSWADQ AN GRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS  +  D VVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSEG
        PVDLKEMK+GMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEG HGSEG
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSEG

TrEMBL top hitse value%identityAlignment
A0A1S4E0I6 thymocyte nuclear protein 11.1e-7279.33Show/hide
Query:  SLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVARE
        SLLP      +  A ++   K MA+  AG K+QYWLLKTEP+EWSWADQ AN GR+KWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARRVVGVVAVARE
Subjt:  SLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVARE

Query:  WYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        WYS  E  + VVDVEAVGEMREPVDLKEMK+GMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEG HGS+
Subjt:  WYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

A0A5D3E131 Thymocyte nuclear protein 17.1e-7286.62Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA+  AG K+QYWLLKTEP+EWSWADQ AN GR+KWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARRVVGVVAVAREWYS  E  + VVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        PVDLKEMK+GMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEG HGS+
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

A0A6J1FML1 thymocyte nuclear protein 13.4e-7488.54Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA AAAG KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK LKSMKLGDLCFFYHSGAKARR+VGVV+VAREWY E +GGDAVVDVEAVGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
         VDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HGSE
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

A0A6J1JS35 thymocyte nuclear protein 11.7e-7386.62Show/hide
Query:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE
        MA AA G KKQ+WLLKTEP+EWSWADQ ANGGRTKWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARR+VGVV+VAREWYSE +GGD VVDVE VGEMRE
Subjt:  MASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDAVVDVEAVGEMRE

Query:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        PVDLKEMK+G+EGMK+FALFRQPRLSVVPVAKEIWEKICELGGGFEGDGT+  HG+E
Subjt:  PVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

E5GBH2 EVE domain-containing protein1.1e-7279.33Show/hide
Query:  SLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVARE
        SLLP      +  A ++   K MA+  AG K+QYWLLKTEP+EWSWADQ AN GR+KWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARRVVGVVAVARE
Subjt:  SLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVARE

Query:  WYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        WYS  E  + VVDVEAVGEMREPVDLKEMK+GMEGMKNFALFRQPRLSVVPVAKEIW+KICELGGGFEGDGTEG HGS+
Subjt:  WYSETEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE

SwissProt top hitse value%identityAlignment
Q6P3E0 Thymocyte nuclear protein 11.4e-1132.32Show/hide
Query:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---SETEGGD-----------
        YWL+K+EP          ++S  D  A   +T  WDGV+N QA   L++MKL D  FFYHS  K   +VG++ + +E Y   ++ E  D           
Subjt:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY---SETEGGD-----------

Query:  ---AVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L E+K   +        +K+  LF + RLSV P+ +E ++ I  L
Subjt:  ---AVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL

Q6PFL8 Thymocyte nuclear protein 18.2e-0927.98Show/hide
Query:  WLLKTEPSEWSWADQVANGGRTK---------------WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDA---------
        WL+K+EP       ++ NG   K               WDGV+N QA+  ++ MK+G   FFYHS  K   + G++ + +E Y +    D          
Subjt:  WLLKTEPSEWSWADQVANGGRTK---------------WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSETEGGDA---------

Query:  --------VVDVEAVGEMREPVDLKEMKQ-----GMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL
                +VDV+    ++  + L E+K+      ++G  +K+ ALF + RLSV P+  E +E +  L
Subjt:  --------VVDVEAVGEMREPVDLKEMKQ-----GMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL

Q90679 Thymocyte nuclear protein 11.6e-1230.15Show/hide
Query:  QATKPP--QLQAYIVNPGKAMASAAAGAKKQ----YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKA
        ++TKPP    ++ + N  KA  S + G + +    +WLLK+EP          ++S  D  A   +T  W+GV+N QA+  L++MKLG   FFYHS  K 
Subjt:  QATKPP--QLQAYIVNPGKAMASAAAGAKKQ----YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKA

Query:  RRVVGVVAVAREWYSETEGGD-----------------AVVDVEAVGEMREPVDLKEMK-----QGMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL
          +VG+V + +E Y +    D                 ++VDV+ V   +  + L E+K        +G  +KN  LF + RLS+ P+ +E ++ +  L
Subjt:  RRVVGVVAVAREWYSETEGGD-----------------AVVDVEAVGEMREPVDLKEMK-----QGMEG--MKNFALFRQPRLSVVPVAKEIWEKICEL

Q91YJ3 Thymocyte nuclear protein 16.1e-1231.71Show/hide
Query:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SETE
        YWL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKL D  FFYHS  K   +VG++ + +E Y                  E +
Subjt:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SETE

Query:  GGDAVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L+E+K   +        +K+  LF + RLSV P+ +E ++ I  L
Subjt:  GGDAVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL

Q9P016 Thymocyte nuclear protein 12.3e-1129.88Show/hide
Query:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SETE
        +WL+K+EP          ++S  D  A   +T  WDGV+N QA+  L++MKLG+  FFYHS  K   + G++ + +E Y                  E  
Subjt:  YWLLKTEPS---------EWSWADQVANGGRTK-WDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWY-----------------SETE

Query:  GGDAVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L E+K   +        +KN  LF + RLS+ P+ +E ++ +  L
Subjt:  GGDAVVDVEAVGEMREPVDLKEMKQGMEG-------MKNFALFRQPRLSVVPVAKEIWEKICEL

Arabidopsis top hitse value%identityAlignment
AT2G14660.1 unknown protein3.9e-5467.95Show/hide
Query:  GAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS-ETEG--GDAVVDVEAVGEMREPVD
        G  K+YWLLKTEP+EWSW+DQ +NGG +KWDGVKNKQAQKNLKSM LGDLCFFYHSG K+R VVGVV V+REWY+ + EG  G+  VDV+A+GEMR+ VD
Subjt:  GAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYS-ETEG--GDAVVDVEAVGEMREPVD

Query:  LKEMKQGMEGM--KNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE
        LKEMK G +G+  K F LFRQPRLSVVPV +++W  ICELG GF GDG E    S+
Subjt:  LKEMKQGMEGM--KNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACTTTCTTTTCTGAGTCTTCTCCCTCAAGCCACCAAACCACCACAACTACAAGCGTACATAGTTAATCCGGGAAAGGCAATGGCCAGCGCCGCCGCCGGAGCCAA
AAAACAGTACTGGCTTCTGAAGACGGAACCGTCGGAGTGGTCGTGGGCCGACCAAGTCGCGAACGGTGGCCGAACCAAGTGGGACGGCGTCAAGAACAAGCAAGCTCAGA
AGAATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTCGGCGTGGTCGCTGTCGCACGGGAGTGGTACTCGGAG
ACCGAAGGCGGCGACGCCGTCGTCGACGTTGAGGCGGTCGGGGAAATGAGGGAACCGGTGGATTTGAAAGAGATGAAGCAGGGGATGGAAGGGATGAAGAATTTCGCGCT
GTTTCGGCAACCGAGGCTGTCGGTTGTGCCGGTAGCGAAGGAGATTTGGGAGAAGATCTGTGAATTGGGAGGGGGATTTGAAGGGGATGGAACAGAGGGCGACCATGGGA
GTGAGGGATAG
mRNA sequenceShow/hide mRNA sequence
ATAAGAACAATTTTCCCGCTGGAAAAATCACCGATGCCACTTTCTTTTCTGAGTCTTCTCCCTCAAGCCACCAAACCACCACAACTACAAGCGTACATAGTTAATCCGGG
AAAGGCAATGGCCAGCGCCGCCGCCGGAGCCAAAAAACAGTACTGGCTTCTGAAGACGGAACCGTCGGAGTGGTCGTGGGCCGACCAAGTCGCGAACGGTGGCCGAACCA
AGTGGGACGGCGTCAAGAACAAGCAAGCTCAGAAGAATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCGTCGTCGGC
GTGGTCGCTGTCGCACGGGAGTGGTACTCGGAGACCGAAGGCGGCGACGCCGTCGTCGACGTTGAGGCGGTCGGGGAAATGAGGGAACCGGTGGATTTGAAAGAGATGAA
GCAGGGGATGGAAGGGATGAAGAATTTCGCGCTGTTTCGGCAACCGAGGCTGTCGGTTGTGCCGGTAGCGAAGGAGATTTGGGAGAAGATCTGTGAATTGGGAGGGGGAT
TTGAAGGGGATGGAACAGAGGGCGACCATGGGAGTGAGGGATAGAGCTTGAATTTGAATGCACGCAAGGTGTTTGTGAATATGCCTAAGAGAATTTTGTTGAGCCTCTTT
TGATCGAACATGATGTTTTGTGAATGGAAATGGAATTGTGATGGATTTCTGGTTGAAATTCTGAAGAAAAATTTTGAGGGGCTTGGATGTAGAAAGCAAATTTGTTTTGA
ACTTTGTGAAGAAGACGATCCAGAAAGTGAAGGCTCAGGTATGTTCTGTCTGTTGTATAGATGGTAGTTTATGGAATTTCTTTAGTTCTTTCATCAATGGCAGAACTAGA
GTTTGTTTTCACCTTTTCTTAGCCTTTAATATTTAGGGCTGTTAAGGGCACATGTTAATAAACTGTGAATACTCTACACCCCGAACTCCGAC
Protein sequenceShow/hide protein sequence
MPLSFLSLLPQATKPPQLQAYIVNPGKAMASAAAGAKKQYWLLKTEPSEWSWADQVANGGRTKWDGVKNKQAQKNLKSMKLGDLCFFYHSGAKARRVVGVVAVAREWYSE
TEGGDAVVDVEAVGEMREPVDLKEMKQGMEGMKNFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTEGDHGSEG