; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019598 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019598
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionkinetochore protein NUF2 homolog
Genome locationscaffold5:35846940..35857408
RNA-Seq ExpressionSpg019598
SyntenySpg019598
Gene Ontology termsGO:0007052 - mitotic spindle organization (biological process)
GO:0045132 - meiotic chromosome segregation (biological process)
GO:0051301 - cell division (biological process)
GO:0051315 - attachment of mitotic spindle microtubules to kinetochore (biological process)
GO:0051383 - kinetochore organization (biological process)
GO:0031262 - Ndc80 complex (cellular component)
GO:0044877 - protein-containing complex binding (molecular function)
InterPro domainsIPR005549 - Kinetochore protein Nuf2, N-terminal
IPR025558 - Domain of unknown function DUF4283
IPR038275 - Nuf2, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588803.1 Kinetochore protein NUF2-like protein, partial [Cucurbita argyrosperma subsp. sororia]7.4e-10589.95Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

XP_022927739.1 probable kinetochore protein NUF2 [Cucurbita moschata]7.4e-10589.95Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

XP_022989268.1 probable kinetochore protein NUF2 [Cucurbita maxima]2.2e-10489.5Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADI LLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

XP_023529466.1 probable kinetochore protein NUF2 [Cucurbita pepo subsp. pepo]7.4e-10589.95Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

XP_038877491.1 kinetochore protein NUF2 homolog [Benincasa hispida]1.4e-10390.87Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKY+FPRLPRADIALLLA  QIAAVTERDLLHPS DLVSDLYTRLMIYLD LHEEDQG VEFAALDQLENPDL MDSV  MKL N+IKHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKPE DRT IFLSAMLNFCIHKDAKI LH PVMDEL+T GDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

TrEMBL top hitse value%identityAlignment
A0A0A0KNL7 Uncharacterized protein4.7e-9785.39Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADI LLLA  QIAAVTERDLL PSPDLVSDLYT LMIYLDLLHEEDQG +EFAALDQLENPDL M SV  MKLHN+IKH IASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKPE DRT IFLSA+LNF IHKDAK+  H PVM+EL T  DQQREWEVK SQLNAEI EYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQ+S
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

A0A1S4E5E4 probable kinetochore protein NUF25.5e-9885.84Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADIALLLA  QIAAVTERDLL PSPDLVSDLYT LMIYLDLLHEEDQG +EFAALDQLENPDL M SV  MKLHN+IKH IASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKPE DRT IFLSA+LNF IHKDAK+  H PVM+EL T  DQQREWEVKISQLNAEI EYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQ+S
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRA+IRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

A0A6J1DWL1 probable kinetochore protein NUF22.8e-9785.39Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKY   RLP ADIALLLAQ QIAAVTE DLLHPSPDLVSDLYTRL+IYLD LHEEDQG VEFAALDQLENPDL MDSVRIMKL+ ++KHVIASLDC KK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLS MLNFCIHKDAKI LH P+++E++TLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQ+IDAKVKEL Q IGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+K GEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

A0A6J1EIG6 probable kinetochore protein NUF23.6e-10589.95Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

A0A6J1JNW1 probable kinetochore protein NUF21.0e-10489.5Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MSKYEFPRLPRADI LLLAQCQIAAVTERDLLHP+PDLVSDLYTRLMIYLD LHEEDQG VEFAAL+Q ENPDL MDSV+ MKL+NR+KHVIASLDCPKK
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
        FTLKDLIKP+ DRT IFLSAMLNFCIHKDAKI LH PVM+EL+TL DQQREWEVKISQLNAEIAEYNEARERE+PFVQEIDAKVKEL QTIGGLNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKV
        LRASIRKLK+KAGEMDEK+
Subjt:  LRASIRKLKDKAGEMDEKV

SwissProt top hitse value%identityAlignment
Q10173 Kinetochore protein nuf23.1e-0525.27Show/hide
Query:  KYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLM-IYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKKF
        K+ FP L RA+I   +    I   T ++L  P+   V  LY   + +++ L  +  +  V  +  D +EN +++ +S+R    +  +   + ++ C   F
Subjt:  KYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLM-IYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKKF

Query:  TLKDLIKPEGDRTGIFLSAMLNFCIHK-------DAKITLHTPVMDELSTLGDQQREWEVKI-----SQLNAE-IAEYNEAREREI
        T++DL+KP+ +R  + LSA++NF   +       D  I     +++  + L  Q+++ E K+      +L +E I + NE R  E+
Subjt:  TLKDLIKPEGDRTGIFLSAMLNFCIHK-------DAKITLHTPVMDELSTLGDQQREWEVKI-----SQLNAE-IAEYNEAREREI

Q8RXJ0 Kinetochore protein NUF2 homolog1.9e-5848.68Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MS YE+PRL R+DI   L   QIA+VTE DL  P+ D VS+LYTR++IYLD L EE++G V+F AL+QLENPD    S++ MKL+ ++K ++  LDCP  
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
         + KDL++PE  RT  F+SA+LN+ ++KD+K+ L  P  +EL  L +Q+++ E K++QLNAEI E++EA ER++PFVQE++A +++L + I  LNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKVILGQVNTFE
        LRA+ +K+++K+ +MD ++   + +  E
Subjt:  LRASIRKLKDKAGEMDEKVILGQVNTFE

Arabidopsis top hitse value%identityAlignment
AT1G61000.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitosis; LOCATED IN: plasma membrane; EXPRESSED IN: cultured cell; CONTAINS InterPro DOMAIN/s: Kinetochore protein Nuf2 (InterPro:IPR005549); Has 50972 Blast hits to 29793 proteins in 2070 species: Archae - 902; Bacteria - 7404; Metazoa - 23628; Fungi - 4156; Plants - 2312; Viruses - 158; Other Eukaryotes - 12412 (source: NCBI BLink).1.3e-5948.68Show/hide
Query:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK
        MS YE+PRL R+DI   L   QIA+VTE DL  P+ D VS+LYTR++IYLD L EE++G V+F AL+QLENPD    S++ MKL+ ++K ++  LDCP  
Subjt:  MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKK

Query:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS
         + KDL++PE  RT  F+SA+LN+ ++KD+K+ L  P  +EL  L +Q+++ E K++QLNAEI E++EA ER++PFVQE++A +++L + I  LNNQQMS
Subjt:  FTLKDLIKPEGDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMS

Query:  LRASIRKLKDKAGEMDEKVILGQVNTFE
        LRA+ +K+++K+ +MD ++   + +  E
Subjt:  LRASIRKLKDKAGEMDEKVILGQVNTFE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAAGTATGAATTTCCGAGGCTTCCACGGGCAGATATTGCATTGCTACTAGCACAGTGTCAAATTGCTGCTGTAACAGAGCGGGACCTTCTTCACCCGTCACCAGA
TTTGGTTTCAGATCTTTACACACGTTTAATGATCTACCTTGACTTGCTCCATGAGGAAGACCAAGGACTAGTTGAGTTTGCTGCCTTGGATCAGCTTGAGAACCCCGATC
TGCTTATGGATTCAGTTCGGATAATGAAGTTGCACAACAGAATAAAGCATGTCATTGCTTCTCTTGACTGTCCAAAAAAGTTTACATTGAAAGATTTGATAAAACCTGAA
GGAGATAGGACTGGAATTTTCCTTAGTGCAATGTTGAACTTTTGCATTCACAAAGATGCCAAAATCACCCTCCATACTCCAGTTATGGATGAGCTGAGTACTCTTGGTGA
CCAGCAAAGGGAATGGGAGGTTAAGATTTCTCAGTTGAATGCAGAGATTGCAGAGTACAATGAAGCAAGAGAAAGGGAAATACCTTTCGTTCAAGAGATTGATGCTAAAG
TGAAAGAACTACAACAGACTATTGGAGGACTTAACAATCAGCAGATGTCATTGCGTGCTTCTATCCGAAAGTTGAAAGACAAAGCTGGAGAAATGGATGAGAAGGTCATC
CTGGGTCAAGTTAATACTTTTGAGAGGCTTTCAAGAGTGATGCCCCTGTGGTTGGCCCTTTTTGTTGCATCCTTTGTCGGAAAGCAGAGAGAGATCTTGATCATTTGTTG
TGGGATTGTTGTTTTGCTCACTCTGTCTAGAGCTTCTTCTTTGAGGTTTTTGGGTTTCAAACTGCAGGCCAACGCAGTTGAGTTATTGCAGAATCCTGTTTCTTCTTTCT
TTCATCAGAAAATTAAGGAAGACTTTGGAGTCATTAGGTTGATTAAGTTCTTTTCGGATAAAGAATGGTTCTTTGAATGCGCTGTTTGGCCTTCCACGGGTGGGAGGAGG
ACTATTCAAGTTCCAGCGGGCTTGAATAAGAAAGGTTGGTATGTATTTTGGGAAATGATTAGGGATTTCTCTCTCAAAATTCACTCTTATGAGAATCAGTCAAATCGGTT
ATTTTTTAACAATTTGGAGGGTCTTCCTGCTTTAGTTAATGACTCAGAAGGTCAAGTCTTTCCTAACTCCTATGCTGAAGTGGTTAAGCGAGGTGGCTCCATGAAGAATT
CATTCTCCTTGGAAGATTCAGCTAGAAATGTTAAGATTGTTACTGAAGAAGCTTACTGGGTTCGTAAGAATTGGGATGTGCTGGAAATAGACTTGGAAAGTTCACTCGTT
GTTTCTAGATTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAATCTTGATCAACCCCTTTATGGATGATAAAGCCTTGAT
TCAGGTGGCAGATGGTTGTTTGGATATTTCTATGAATGGCAAGTGGAAGAAATTTGGGAACCTTCACTTGAAATTGGAATTGTGGTCCTCTGAAATCCATTCACAGCCAA
AATTAATAAAAAGCTACGGAGGATGGATTGCAATCAGAAATCTACCTTTGAATTTGTGGCATCGTGATTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGT
ATTTCATCCAATACGCTTAATTTATTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATTTTTGTGGCTTTATCCCTGCTGATATTAATGTTAAGATTGGTAATAA
GTTTGAATTCTCTTTAAGATTTGGTGATATTAATGCATTAGAGGACAAAAATTTGAAGTTTGATTCAATTAGAAAGTTAAGTGTCAATGACTTTTCAAATTCCCTGGATG
AAATTAGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACCTTGTTAATGAAGGGGTAAGGTTGAATGAATCGTCATTTATTTCTTGTTATCAGGAGGAATTTAATGCG
GCAATGGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCATATTAATTACACAGGCTGTAATGAATCTCCCTCCAAGATGATTAATGACAGCAATTACAATTTGAAAGA
TGATATACAGCAGCGTTCGTCAAATATCAATTCTGAAAATGGGTTAATTCAGTCCAAAGAATTTGGGTTGGTTGAATTTTCAAAGAAGAAAAGTGCTTTGTTGCTGGAGA
AGAATTTTAATGCCAACGGTAAGGTTTATAATGCCATCTGTTCAGATTTTAATGGAGCATTTTCTGATGGTGTTGTGCATGAGTCCCAAGATTTATTATCCACGCCTTTT
CATGACCTTCCTTTGGGTTTGAATTGCTGTAATGTAGTAGAAGATGAATCGATTGTTCCTAAGGTTTTACTTTTAGAAAAGTCGGTTGAGTTACCAAAGGAAAAGGCTGA
GATTTCACGTTCCAAGGAAGCATTAATGGAAGAAGTTAGTGTTAATTTCATTGGGCCGGTTGAGTTTGCAAAGGAGAAAAGTGCTTTGTTGCTGGAGAATGATTTTAATG
CCAACGGTAAGGTTTTTAATGGCATCAATTCAGAGATTTGCAAAGCATTTACTGATGGTGCTTTGCATGAGTCCCAGGTTTTATTATTCTCGCCTATTCAAGACATTCCT
TCGGGTTTGAAGTGCTGTAATGCAGTGGGCTTGGAATCAAATGAACCGTTTGTTCCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAA
ATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAATTCCAATTATAACCTTGATGTTATTGAAGAATCTTGTTCTCAATCTTTGCTCCCTGCTTTGAATCAGTCTA
GATGCTGCCAAACTAATCTTAATGAGTTATCAAATTCCACATCATCCAATCAGTATATTCTTTCAAACATTCAATCTGACCCTTCTTTAACAAAGGGAGTTTTTATTCCT
TCATCCAAAGTTGAAAACAAAGTTGATCAATCATATTCATCTCCTATTGATTCTGATAATGATTCAGTGGTGAGTATTAGTAGTGTAGAGGCTGAAAATCAGTATTTGAA
TGATGAAAACAATGAATTGTTGGAGGAAGATTCTTTTGCACTGGCTTTTAATCGGATTTTCCAGAATAATGAAGATATTTCTGAAGTTCAGTTGAATGATTGTGATGTTT
CAGCAACACCCTTAGTATCTGTTCCAAGTAAATTTTCATCTCTACTAGAAGATTGTGACATTCAGTTGAAGGAAATTCAGCCCTTTTTACCCCCTGAGCAATCTGAAAAT
TGTGGAATTTCTTCAAGATTTCTGATTTCCATGGAATGGGATGTGATGTTTGATAAATCTAGAGTCTCCAAACAGTTTTCTTGTGAGAGAGATCAATTTAATGAGGTGTT
GGGCTCTCCAAAAGGTGCTTCATTGCATGAAAAGGCCCTCATGAAGTCCAGCCGTTCAGGCCAAGTGGAGATAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAAGTATGAATTTCCGAGGCTTCCACGGGCAGATATTGCATTGCTACTAGCACAGTGTCAAATTGCTGCTGTAACAGAGCGGGACCTTCTTCACCCGTCACCAGA
TTTGGTTTCAGATCTTTACACACGTTTAATGATCTACCTTGACTTGCTCCATGAGGAAGACCAAGGACTAGTTGAGTTTGCTGCCTTGGATCAGCTTGAGAACCCCGATC
TGCTTATGGATTCAGTTCGGATAATGAAGTTGCACAACAGAATAAAGCATGTCATTGCTTCTCTTGACTGTCCAAAAAAGTTTACATTGAAAGATTTGATAAAACCTGAA
GGAGATAGGACTGGAATTTTCCTTAGTGCAATGTTGAACTTTTGCATTCACAAAGATGCCAAAATCACCCTCCATACTCCAGTTATGGATGAGCTGAGTACTCTTGGTGA
CCAGCAAAGGGAATGGGAGGTTAAGATTTCTCAGTTGAATGCAGAGATTGCAGAGTACAATGAAGCAAGAGAAAGGGAAATACCTTTCGTTCAAGAGATTGATGCTAAAG
TGAAAGAACTACAACAGACTATTGGAGGACTTAACAATCAGCAGATGTCATTGCGTGCTTCTATCCGAAAGTTGAAAGACAAAGCTGGAGAAATGGATGAGAAGGTCATC
CTGGGTCAAGTTAATACTTTTGAGAGGCTTTCAAGAGTGATGCCCCTGTGGTTGGCCCTTTTTGTTGCATCCTTTGTCGGAAAGCAGAGAGAGATCTTGATCATTTGTTG
TGGGATTGTTGTTTTGCTCACTCTGTCTAGAGCTTCTTCTTTGAGGTTTTTGGGTTTCAAACTGCAGGCCAACGCAGTTGAGTTATTGCAGAATCCTGTTTCTTCTTTCT
TTCATCAGAAAATTAAGGAAGACTTTGGAGTCATTAGGTTGATTAAGTTCTTTTCGGATAAAGAATGGTTCTTTGAATGCGCTGTTTGGCCTTCCACGGGTGGGAGGAGG
ACTATTCAAGTTCCAGCGGGCTTGAATAAGAAAGGTTGGTATGTATTTTGGGAAATGATTAGGGATTTCTCTCTCAAAATTCACTCTTATGAGAATCAGTCAAATCGGTT
ATTTTTTAACAATTTGGAGGGTCTTCCTGCTTTAGTTAATGACTCAGAAGGTCAAGTCTTTCCTAACTCCTATGCTGAAGTGGTTAAGCGAGGTGGCTCCATGAAGAATT
CATTCTCCTTGGAAGATTCAGCTAGAAATGTTAAGATTGTTACTGAAGAAGCTTACTGGGTTCGTAAGAATTGGGATGTGCTGGAAATAGACTTGGAAAGTTCACTCGTT
GTTTCTAGATTGATGGCCCATTATTCTTGGAAGAATGTTAAGATTGCCCTTGAGGATTTCTTTAAATCTTCAATCTTGATCAACCCCTTTATGGATGATAAAGCCTTGAT
TCAGGTGGCAGATGGTTGTTTGGATATTTCTATGAATGGCAAGTGGAAGAAATTTGGGAACCTTCACTTGAAATTGGAATTGTGGTCCTCTGAAATCCATTCACAGCCAA
AATTAATAAAAAGCTACGGAGGATGGATTGCAATCAGAAATCTACCTTTGAATTTGTGGCATCGTGATTCCTTTGAAGCTATTGGAAAGAACCTTGGAGGGTTGGTTAGT
ATTTCATCCAATACGCTTAATTTATTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATTTTTGTGGCTTTATCCCTGCTGATATTAATGTTAAGATTGGTAATAA
GTTTGAATTCTCTTTAAGATTTGGTGATATTAATGCATTAGAGGACAAAAATTTGAAGTTTGATTCAATTAGAAAGTTAAGTGTCAATGACTTTTCAAATTCCCTGGATG
AAATTAGGGTTAAGCAAGTGATATTGGATGAAGAAGTGGACCTTGTTAATGAAGGGGTAAGGTTGAATGAATCGTCATTTATTTCTTGTTATCAGGAGGAATTTAATGCG
GCAATGGGTTCTCCAAAAGTTGCTTCAATGCATGATGAGCATATTAATTACACAGGCTGTAATGAATCTCCCTCCAAGATGATTAATGACAGCAATTACAATTTGAAAGA
TGATATACAGCAGCGTTCGTCAAATATCAATTCTGAAAATGGGTTAATTCAGTCCAAAGAATTTGGGTTGGTTGAATTTTCAAAGAAGAAAAGTGCTTTGTTGCTGGAGA
AGAATTTTAATGCCAACGGTAAGGTTTATAATGCCATCTGTTCAGATTTTAATGGAGCATTTTCTGATGGTGTTGTGCATGAGTCCCAAGATTTATTATCCACGCCTTTT
CATGACCTTCCTTTGGGTTTGAATTGCTGTAATGTAGTAGAAGATGAATCGATTGTTCCTAAGGTTTTACTTTTAGAAAAGTCGGTTGAGTTACCAAAGGAAAAGGCTGA
GATTTCACGTTCCAAGGAAGCATTAATGGAAGAAGTTAGTGTTAATTTCATTGGGCCGGTTGAGTTTGCAAAGGAGAAAAGTGCTTTGTTGCTGGAGAATGATTTTAATG
CCAACGGTAAGGTTTTTAATGGCATCAATTCAGAGATTTGCAAAGCATTTACTGATGGTGCTTTGCATGAGTCCCAGGTTTTATTATTCTCGCCTATTCAAGACATTCCT
TCGGGTTTGAAGTGCTGTAATGCAGTGGGCTTGGAATCAAATGAACCGTTTGTTCCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAA
ATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAATTCCAATTATAACCTTGATGTTATTGAAGAATCTTGTTCTCAATCTTTGCTCCCTGCTTTGAATCAGTCTA
GATGCTGCCAAACTAATCTTAATGAGTTATCAAATTCCACATCATCCAATCAGTATATTCTTTCAAACATTCAATCTGACCCTTCTTTAACAAAGGGAGTTTTTATTCCT
TCATCCAAAGTTGAAAACAAAGTTGATCAATCATATTCATCTCCTATTGATTCTGATAATGATTCAGTGGTGAGTATTAGTAGTGTAGAGGCTGAAAATCAGTATTTGAA
TGATGAAAACAATGAATTGTTGGAGGAAGATTCTTTTGCACTGGCTTTTAATCGGATTTTCCAGAATAATGAAGATATTTCTGAAGTTCAGTTGAATGATTGTGATGTTT
CAGCAACACCCTTAGTATCTGTTCCAAGTAAATTTTCATCTCTACTAGAAGATTGTGACATTCAGTTGAAGGAAATTCAGCCCTTTTTACCCCCTGAGCAATCTGAAAAT
TGTGGAATTTCTTCAAGATTTCTGATTTCCATGGAATGGGATGTGATGTTTGATAAATCTAGAGTCTCCAAACAGTTTTCTTGTGAGAGAGATCAATTTAATGAGGTGTT
GGGCTCTCCAAAAGGTGCTTCATTGCATGAAAAGGCCCTCATGAAGTCCAGCCGTTCAGGCCAAGTGGAGATAGTTTAG
Protein sequenceShow/hide protein sequence
MSKYEFPRLPRADIALLLAQCQIAAVTERDLLHPSPDLVSDLYTRLMIYLDLLHEEDQGLVEFAALDQLENPDLLMDSVRIMKLHNRIKHVIASLDCPKKFTLKDLIKPE
GDRTGIFLSAMLNFCIHKDAKITLHTPVMDELSTLGDQQREWEVKISQLNAEIAEYNEAREREIPFVQEIDAKVKELQQTIGGLNNQQMSLRASIRKLKDKAGEMDEKVI
LGQVNTFERLSRVMPLWLALFVASFVGKQREILIICCGIVVLLTLSRASSLRFLGFKLQANAVELLQNPVSSFFHQKIKEDFGVIRLIKFFSDKEWFFECAVWPSTGGRR
TIQVPAGLNKKGWYVFWEMIRDFSLKIHSYENQSNRLFFNNLEGLPALVNDSEGQVFPNSYAEVVKRGGSMKNSFSLEDSARNVKIVTEEAYWVRKNWDVLEIDLESSLV
VSRLMAHYSWKNVKIALEDFFKSSILINPFMDDKALIQVADGCLDISMNGKWKKFGNLHLKLELWSSEIHSQPKLIKSYGGWIAIRNLPLNLWHRDSFEAIGKNLGGLVS
ISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKFEFSLRFGDINALEDKNLKFDSIRKLSVNDFSNSLDEIRVKQVILDEEVDLVNEGVRLNESSFISCYQEEFNA
AMGSPKVASMHDEHINYTGCNESPSKMINDSNYNLKDDIQQRSSNINSENGLIQSKEFGLVEFSKKKSALLLEKNFNANGKVYNAICSDFNGAFSDGVVHESQDLLSTPF
HDLPLGLNCCNVVEDESIVPKVLLLEKSVELPKEKAEISRSKEALMEEVSVNFIGPVEFAKEKSALLLENDFNANGKVFNGINSEICKAFTDGALHESQVLLFSPIQDIP
SGLKCCNAVGLESNEPFVPKALKKKYESFPLHYSRRKYEKSEILDSIPINSNYNLDVIEESCSQSLLPALNQSRCCQTNLNELSNSTSSNQYILSNIQSDPSLTKGVFIP
SSKVENKVDQSYSSPIDSDNDSVVSISSVEAENQYLNDENNELLEEDSFALAFNRIFQNNEDISEVQLNDCDVSATPLVSVPSKFSSLLEDCDIQLKEIQPFLPPEQSEN
CGISSRFLISMEWDVMFDKSRVSKQFSCERDQFNEVLGSPKGASLHEKALMKSSRSGQVEIV