; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08110 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08110
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionthymocyte nuclear protein 1
Genome locationCarg_Chr04:2329811..2331316
RNA-Seq ExpressionCarg08110
SyntenyCarg08110
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR002740 - EVE domain
IPR015947 - PUA-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600275.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. sororia]3.4e-8699.37Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGV+SVAREWYSEGDGGDAVVDVEAVGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
        AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

KAG7030934.1 Thymocyte nuclear protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-169100Show/hide
Query:  MAPFAMKFQNCPSQCLYIHLHSPSSSPSLSMASKLVTSIFTFLFLFFFLFSSTTSVFAQPIPGSTNTSPRTTRDGFKVVDDVLEEESCNYKYWKEKLFIF
        MAPFAMKFQNCPSQCLYIHLHSPSSSPSLSMASKLVTSIFTFLFLFFFLFSSTTSVFAQPIPGSTNTSPRTTRDGFKVVDDVLEEESCNYKYWKEKLFIF
Subjt:  MAPFAMKFQNCPSQCLYIHLHSPSSSPSLSMASKLVTSIFTFLFLFFFLFSSTTSVFAQPIPGSTNTSPRTTRDGFKVVDDVLEEESCNYKYWKEKLFIF

Query:  IILATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGD
        IILATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGD
Subjt:  IILATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGD

Query:  LCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHG
        LCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHG
Subjt:  LCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHG

Query:  SEE
        SEE
Subjt:  SEE

XP_022941981.1 thymocyte nuclear protein 1 [Cucurbita moschata]1.7e-8598.73Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGV+SVAREWY EGDGGDAVVDVEAVGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
        AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

XP_022993182.1 thymocyte nuclear protein 1 [Cucurbita maxima]2.3e-8294.94Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MADAA GDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARRIVGV+SVAREWYSEGDGGD VVDVE VGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
         VDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHG+EE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

XP_023530564.1 thymocyte nuclear protein 1 [Cucurbita pepo subsp. pepo]9.3e-8496.84Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MA AA GDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGDLCFFYHSGAKARRIVGV+SVAREWYSEGDGGDAVVDVEAVGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
         VDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

TrEMBL top hitse value%identityAlignment
A0A0A0KWC0 EVE domain-containing protein3.2e-7473.13Show/hide
Query:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITST-SSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLC
        A  +G N++ IFP    P+  SLLPLP +    I++   S K MA+A AGDK+Q+WLLKTEPAEWSWADQAAN GRT WDGVKNKQAQK+LKSMKLGD C
Subjt:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITST-SSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLC

Query:  FFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE
        FFYHSGAKARR+VGV++VAREWYS  D  + VVDVEAVGEMRE VDLKEMKK +EGMK+FALFRQPRLSVVPV KEIW+KICELGGGFEGDGT+   GSE
Subjt:  FFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE

Query:  E
        E
Subjt:  E

A0A1S4E0I6 thymocyte nuclear protein 14.4e-7975.38Show/hide
Query:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCF
        A  +GAN++ IFPP  SPTS SLLPLP +         S K MA+  AGDK+Q+WLLKTEPAEWSWADQAAN GR+KWDGVKNKQAQK+LKSMKLGD CF
Subjt:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCF

Query:  FYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE
        FYHSGAKARR+VGV++VAREWYS  +  + VVDVEAVGEMRE VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIW+KICELGGGFEGDGT+  HGS+
Subjt:  FYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE

A0A6J1FML1 thymocyte nuclear protein 18.2e-8698.73Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGV+SVAREWY EGDGGDAVVDVEAVGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
        AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

A0A6J1JS35 thymocyte nuclear protein 11.1e-8294.94Show/hide
Query:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE
        MADAA GDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQK+LKSMKLGD CFFYHSGAKARRIVGV+SVAREWYSEGDGGD VVDVE VGEMRE
Subjt:  MADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMRE

Query:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE
         VDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHG+EE
Subjt:  AVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE

E5GBH2 EVE domain-containing protein4.4e-7975.38Show/hide
Query:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCF
        A  +GAN++ IFPP  SPTS SLLPLP +         S K MA+  AGDK+Q+WLLKTEPAEWSWADQAAN GR+KWDGVKNKQAQK+LKSMKLGD CF
Subjt:  ATEIGANVSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCF

Query:  FYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE
        FYHSGAKARR+VGV++VAREWYS  +  + VVDVEAVGEMRE VDLKEMKKG+EGMK+FALFRQPRLSVVPVAKEIW+KICELGGGFEGDGT+  HGS+
Subjt:  FYHSGAKARRIVGVISVAREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSE

SwissProt top hitse value%identityAlignment
Q6P3E0 Thymocyte nuclear protein 14.4e-1233.54Show/hide
Query:  FWLLKTEP---------AEWSWAD-QAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY-----------------SEGD
        +WL+K+EP          ++S  D QA     T WDGV+N QA  +L++MKL D  FFYHS  K   IVG++ + +E Y                  E +
Subjt:  FWLLKTEP---------AEWSWAD-QAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY-----------------SEGD

Query:  GGDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL
           ++VDV+ V  M+  + L E+K      K   G +K   LF + RLSV P+ +E ++ I  L
Subjt:  GGDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL

Q6PFL8 Thymocyte nuclear protein 16.4e-1131.29Show/hide
Query:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY---SEGDGGDA-----------
        WL+K+EP          ++   D  A   +T  WDGV+N QA+ +++ MK+G   FFYHS  K   I G++ + +E Y   ++ D  D            
Subjt:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY---SEGDGGDA-----------

Query:  ---VVDVEAVGEMREAVDLKEMKK-----GIEG--MKDFALFRQPRLSVVPVAKEIWEKICEL
           +VDV+    ++  + L E+KK      ++G  +KD ALF + RLSV P+  E +E +  L
Subjt:  ---VVDVEAVGEMREAVDLKEMKK-----GIEG--MKDFALFRQPRLSVVPVAKEIWEKICEL

Q90679 Thymocyte nuclear protein 17.6e-1231.29Show/hide
Query:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGD---------------
        WLLK+EP          ++S  D  A   +T  W+GV+N QA+ +L++MKLG   FFYHS  K   IVG++ + +E Y +    D               
Subjt:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWYSEGDGGD---------------

Query:  --AVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL
          ++VDV+ V   +  + L E+K      K   G +K+  LF + RLS+ P+ +E ++ +  L
Subjt:  --AVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL

Q91YJ3 Thymocyte nuclear protein 12.6e-1231.77Show/hide
Query:  STKTTITSTSSSKIMADAAAGDKK---QFWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVI
        S  T++   +SS  M        K    +WL+K+EP          ++S  D  A   +T  WDGV+N QA+ +L++MKL D  FFYHS  K   IVG++
Subjt:  STKTTITSTSSSKIMADAAAGDKK---QFWLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVI

Query:  SVAREWY-----------------SEGDGGDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL
         + +E Y                  E D   ++VDV+ V  M+  + L+E+K      K   G +K   LF + RLSV P+ +E ++ I  L
Subjt:  SVAREWY-----------------SEGDGGDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL

Q9P016 Thymocyte nuclear protein 17.6e-1231.29Show/hide
Query:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY-----------------SEGDG
        WL+K+EP          ++S  D  A   +T  WDGV+N QA+ +L++MKLG+  FFYHS  K   I G++ + +E Y                  E + 
Subjt:  WLLKTEP---------AEWSWADQAANGGRTK-WDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY-----------------SEGDG

Query:  GDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL
          ++VDV+ V  M+  + L E+K      K   G +K+  LF + RLS+ P+ +E ++ +  L
Subjt:  GDAVVDVEAVGEMREAVDLKEMK------KGIEG-MKDFALFRQPRLSVVPVAKEIWEKICEL

Arabidopsis top hitse value%identityAlignment
AT2G14660.1 unknown protein6.3e-5465.82Show/hide
Query:  GDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY---SEGDGGDAVVDVEAVGEMREAVD
        G  K++WLLKTEP EWSW+DQ +NGG +KWDGVKNKQAQK LKSM LGDLCFFYHSG K+R +VGV+ V+REWY   +EG  G+  VDV+A+GEMR+ VD
Subjt:  GDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISVAREWY---SEGDGGDAVVDVEAVGEMREAVD

Query:  LKEMK--KGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDG-TDCRHGSEE
        LKEMK  KGI   K F LFRQPRLSVVPV +++W  ICELG GF GDG  DC    +E
Subjt:  LKEMK--KGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDG-TDCRHGSEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCTTTTGCCATGAAGTTCCAAAATTGCCCTTCACAATGTCTTTATATTCATCTTCACAGTCCCTCCTCTTCCCCATCTCTCTCAATGGCTTCCAAGCTTGTGAC
CTCCATCTTCACGTTTCTCTTCCTTTTCTTCTTTTTATTCTCCTCGACGACCTCCGTTTTTGCTCAGCCCATTCCCGGCTCTACAAACACGTCTCCTAGGACTACTCGAG
ACGGGTTTAAGGTTGTGGATGATGTTTTGGAAGAGGAGAGCTGTAATTACAAATATTGGAAAGAAAAATTATTTATATTTATAATTTTAGCCACAGAAATAGGCGCCAAT
GTAAGCGCAATTTTCCCGCCGGGAAAATCACCGACTTCACTTTCTCTTCTCCCTCTCCCTACTTCTACCAAAACCACCATAACCTCAACCAGTTCATCGAAGATAATGGC
TGACGCGGCCGCCGGAGACAAAAAACAGTTCTGGCTTCTGAAGACGGAACCAGCGGAGTGGTCGTGGGCCGACCAAGCCGCGAACGGCGGCCGAACAAAGTGGGACGGCG
TCAAGAACAAGCAAGCGCAGAAGTATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCATCGTCGGCGTGATTTCCGTC
GCACGGGAGTGGTATTCGGAGGGCGATGGCGGCGATGCTGTCGTCGACGTCGAGGCGGTCGGGGAAATGAGAGAAGCGGTGGATTTGAAAGAGATGAAGAAGGGGATCGA
AGGGATGAAGGATTTCGCGCTGTTTCGGCAACCGAGACTGTCTGTTGTGCCGGTAGCGAAGGAGATTTGGGAGAAGATCTGCGAATTGGGAGGCGGATTTGAAGGGGATG
GAACAGATTGCCGCCATGGGAGTGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCTTTTGCCATGAAGTTCCAAAATTGCCCTTCACAATGTCTTTATATTCATCTTCACAGTCCCTCCTCTTCCCCATCTCTCTCAATGGCTTCCAAGCTTGTGAC
CTCCATCTTCACGTTTCTCTTCCTTTTCTTCTTTTTATTCTCCTCGACGACCTCCGTTTTTGCTCAGCCCATTCCCGGCTCTACAAACACGTCTCCTAGGACTACTCGAG
ACGGGTTTAAGGTTGTGGATGATGTTTTGGAAGAGGAGAGCTGTAATTACAAATATTGGAAAGAAAAATTATTTATATTTATAATTTTAGCCACAGAAATAGGCGCCAAT
GTAAGCGCAATTTTCCCGCCGGGAAAATCACCGACTTCACTTTCTCTTCTCCCTCTCCCTACTTCTACCAAAACCACCATAACCTCAACCAGTTCATCGAAGATAATGGC
TGACGCGGCCGCCGGAGACAAAAAACAGTTCTGGCTTCTGAAGACGGAACCAGCGGAGTGGTCGTGGGCCGACCAAGCCGCGAACGGCGGCCGAACAAAGTGGGACGGCG
TCAAGAACAAGCAAGCGCAGAAGTATCTCAAGTCCATGAAACTCGGCGACCTCTGTTTCTTCTACCACTCCGGCGCCAAGGCCCGCCGCATCGTCGGCGTGATTTCCGTC
GCACGGGAGTGGTATTCGGAGGGCGATGGCGGCGATGCTGTCGTCGACGTCGAGGCGGTCGGGGAAATGAGAGAAGCGGTGGATTTGAAAGAGATGAAGAAGGGGATCGA
AGGGATGAAGGATTTCGCGCTGTTTCGGCAACCGAGACTGTCTGTTGTGCCGGTAGCGAAGGAGATTTGGGAGAAGATCTGCGAATTGGGAGGCGGATTTGAAGGGGATG
GAACAGATTGCCGCCATGGGAGTGAGGAATAG
Protein sequenceShow/hide protein sequence
MAPFAMKFQNCPSQCLYIHLHSPSSSPSLSMASKLVTSIFTFLFLFFFLFSSTTSVFAQPIPGSTNTSPRTTRDGFKVVDDVLEEESCNYKYWKEKLFIFIILATEIGAN
VSAIFPPGKSPTSLSLLPLPTSTKTTITSTSSSKIMADAAAGDKKQFWLLKTEPAEWSWADQAANGGRTKWDGVKNKQAQKYLKSMKLGDLCFFYHSGAKARRIVGVISV
AREWYSEGDGGDAVVDVEAVGEMREAVDLKEMKKGIEGMKDFALFRQPRLSVVPVAKEIWEKICELGGGFEGDGTDCRHGSEE