CuGenDBv2

Tan0010184 (gene) of Snake gourd v1 genome

Gene ID: Tan0010184
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein MIS12 homolog
Genome location: LG09:70590161..70593150
RNA-Seq Expression: Tan0010184
Synteny: Tan0010184
Gene Ontology terms:
GO:0000070 - mitotic sister chromatid segregation (biological process)
GO:0034501 - protein localization to kinetochore (biological process)
GO:0051301 - cell division (biological process)
GO:0051382 - kinetochore assembly (biological process)
GO:0000818 - nuclear MIS12/MIND complex (cellular component)
InterPro domains: IPR008685 - Centromere protein Mis12
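
The genome location in the summary above has the form `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the field can be parsed and the span length computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'LG09:70590161..70593150' style string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seq_id, start, end = parse_location("LG09:70590161..70593150")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene model spans 2,990 bp on LG09.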


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589575.1 Protein MIS12-like protein, partial [Cucurbita argyrosperma subsp. sororia] | 5.5e-112 | 81.85
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR KLSEVRKENVVLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTNGDISHHHKG SN KLDDIQE LADL  L
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

KAG7023263.1 Protein MIS12-like protein [Cucurbita argyrosperma subsp. argyrosperma] | 6.1e-111 | 81.11
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR KLSEVRKENVVLNQELQALERQ ASSNSQVN FNEALQLYE+N
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTNGDISHHHKG SN KLDDIQE LADL  L
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

XP_022988478.1 protein MIS12 homolog isoform X2 [Cucurbita maxima] | 8.0e-111 | 81.11
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR  LSEVRKEN+VLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTN DISHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

XP_023515498.1 protein MIS12 homolog isoform X1 [Cucurbita pepo subsp. pepo] | 1.8e-110 | 81.55
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDL +GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR KLSEVRKENVVLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQ-EMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQ EMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTNGDISHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQ-EMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

XP_023515499.1 protein MIS12 homolog isoform X2 [Cucurbita pepo subsp. pepo] | 7.2e-112 | 81.85
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDL +GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR KLSEVRKENVVLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTNGDISHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C2Q6 protein MIS12 homolog | 1.2e-107 | 78.52
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGEAVFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ                              QASA LKTEG D+SQDLIMGISRVR+ V S
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
         LDKRLAMWEKYCLNHCF+VPEGFSLPSNDESPADTSISHD LYDV LDTELDSLR +LSEVR+ENVVLNQELQALERQTASSNSQ+N+FNEAL+LYE+ 
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNE FQE+M TASELRAKIGKLKKRR+EESKLTKVEKVHTNGD+SHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

A0A6J1E0A1 protein MIS12 homolog | 1.5e-110 | 80.37
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR KLSEVRKEN+VLNQELQALERQ ASSNSQVN FNEALQLYE+N
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTNGD SHHHKG SN KLDDIQE LADL  L
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

A0A6J1EZI4 protein MIS12 homolog isoform X1 | 4.1e-105 | 77.99
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSK + VFDSLNL+PQLFINEALNVVDDLV DA DFYQS                             QAS ALK+EGSDRSQDLIMGISRVRTL QS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPE FSLPSNDESPADTSIS+  LYDVDLDTELDSLR KLSEVRKENV LNQE QALER+TASSNSQ NYFNEALQLYEQ+
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLN
        SVNEMFQEMMGTAS+LRA+IGKLK+R+ME+SKLT V+KVHTNGDISHHHKG SN KLDDIQE LA LN
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLN

A0A6J1JD43 protein MIS12 homolog isoform X1 | 9.5e-110 | 80.81
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR  LSEVRKEN+VLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQ-EMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQ EMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTN DISHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQ-EMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

A0A6J1JME3 protein MIS12 homolog isoform X2 | 3.9e-111 | 81.11
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS
        MEGSKGE VFDSLNLNPQLFINEALN+VDDLVDDAFDFYQ+                             QASA+LKTEGSDRSQDLI+GISRVRT VQS
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQS

Query:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN
        GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESP  TS+S D LYDVDLDTELD LR  LSEVRKEN+VLNQELQALERQTASSNSQVN FNEALQLYEQN
Subjt:  GLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQN

Query:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEK+HTN DISHHHKG SN KLDDIQE LADL TL
Subjt:  SVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

SwissProt top hits | e value | % identity | Alignment
Q2V0Z5 Protein MIS12 homolog | 2.3e-52 | 45.59
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSD--RSQDLIMGISRVRTLV
        MEGSK EAVFDS+NLNPQ+FINEA+N V+D VD AFDFY                              R AS +LK +GSD  +SQ L  GI+RVR L+
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSD--RSQDLIMGISRVRTLV

Query:  QSGLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYE
         S +D RL +WE Y L  CF+VP+GF LP ++ES   +S+  D LYD++LD ELDSLR KL+ V K +V L+ ELQALER + S    +   NEAL+LY+
Subjt:  QSGLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYE

Query:  QNSVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        ++S++E+F+EM   ASELRA + +LK RRM+ S+  KV+++  +G          + KL+D+++  A+L  +
Subjt:  QNSVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL

Arabidopsis top hits | e value | % identity | Alignment
AT5G35520.1 minichromosome instability 12 (mis12)-like | 1.6e-53 | 45.59
Query:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSD--RSQDLIMGISRVRTLV
        MEGSK EAVFDS+NLNPQ+FINEA+N V+D VD AFDFY                              R AS +LK +GSD  +SQ L  GI+RVR L+
Subjt:  MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSD--RSQDLIMGISRVRTLV

Query:  QSGLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYE
         S +D RL +WE Y L  CF+VP+GF LP ++ES   +S+  D LYD++LD ELDSLR KL+ V K +V L+ ELQALER + S    +   NEAL+LY+
Subjt:  QSGLDKRLAMWEKYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYE

Query:  QNSVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
        ++S++E+F+EM   ASELRA + +LK RRM+ S+  KV+++  +G          + KL+D+++  A+L  +
Subjt:  QNSVNEMFQEMMGTASELRAKIGKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL


Sequences
CDS sequence
ATGGAAGGAAGTAAGGGTGAGGCGGTATTCGATTCACTGAATCTGAATCCTCAGCTCTTCATTAATGAAGCTCTCAATGTTGTTGACGATTTGGTTGATGACGCTTTCGA
TTTTTATCAATCGTATGGTTCTCTCTGTCTCATCTCTTCAAGTTGGGTTTACAATTTTGTTTGGTTAATTTTGTTCCTGTCAATTATTTTCCTCTTTAGACAAGCCTCAG
CCGCCTTGAAGACCGAGGGCTCGGATAGATCCCAGGATCTTATCATGGGAATATCGCGGGTCCGAACTCTCGTCCAATCGGGTCTGGATAAGCGGTTGGCCATGTGGGAG
AAATACTGTCTCAATCACTGTTTTTCAGTCCCTGAAGGGTTTTCTTTACCTTCAAATGATGAATCACCTGCTGACACCTCAATAAGTCATGACTACCTCTATGATGTGGA
TCTGGATACAGAGTTAGATTCTCTTAGGACTAAGCTCTCCGAGGTCAGAAAAGAGAATGTTGTATTGAACCAAGAACTACAAGCTTTAGAAAGGCAAACTGCTTCCAGTA
ACTCCCAAGTCAATTATTTCAACGAAGCATTACAATTATATGAGCAAAATTCTGTGAATGAGATGTTCCAGGAAATGATGGGAACTGCATCAGAGCTGCGAGCGAAAATA
GGAAAACTGAAGAAAAGGAGGATGGAGGAATCCAAGCTTACTAAAGTAGAGAAGGTTCATACAAACGGAGACATCTCTCACCATCACAAGGGTCTCTCTAATGTTAAGCT
GGATGACATTCAAGAACTTTTAGCTGATTTGAACACCCTTTGA
mRNA sequence
GTTCAGAGTCTTCAGACTATAAGCCCGCTCCGAGTCCCCATCACTCACAGTCACCACTACCATTTTGGCGGCAAAATTCCTCCATTTGTGCGAAAGCTTGAGAGCTTCCA
TCTGTTTCTCGATCTAAATTTCTCGTCTAGTTTTTCATTGGGAAAAAAGGCTTTTTTAGGGGGAAGAGAAAGGAAAATTTCGATGGAAGGAAGTAAGGGTGAGGCGGTAT
TCGATTCACTGAATCTGAATCCTCAGCTCTTCATTAATGAAGCTCTCAATGTTGTTGACGATTTGGTTGATGACGCTTTCGATTTTTATCAATCGTATGGTTCTCTCTGT
CTCATCTCTTCAAGTTGGGTTTACAATTTTGTTTGGTTAATTTTGTTCCTGTCAATTATTTTCCTCTTTAGACAAGCCTCAGCCGCCTTGAAGACCGAGGGCTCGGATAG
ATCCCAGGATCTTATCATGGGAATATCGCGGGTCCGAACTCTCGTCCAATCGGGTCTGGATAAGCGGTTGGCCATGTGGGAGAAATACTGTCTCAATCACTGTTTTTCAG
TCCCTGAAGGGTTTTCTTTACCTTCAAATGATGAATCACCTGCTGACACCTCAATAAGTCATGACTACCTCTATGATGTGGATCTGGATACAGAGTTAGATTCTCTTAGG
ACTAAGCTCTCCGAGGTCAGAAAAGAGAATGTTGTATTGAACCAAGAACTACAAGCTTTAGAAAGGCAAACTGCTTCCAGTAACTCCCAAGTCAATTATTTCAACGAAGC
ATTACAATTATATGAGCAAAATTCTGTGAATGAGATGTTCCAGGAAATGATGGGAACTGCATCAGAGCTGCGAGCGAAAATAGGAAAACTGAAGAAAAGGAGGATGGAGG
AATCCAAGCTTACTAAAGTAGAGAAGGTTCATACAAACGGAGACATCTCTCACCATCACAAGGGTCTCTCTAATGTTAAGCTGGATGACATTCAAGAACTTTTAGCTGAT
TTGAACACCCTTTGAACTTATTTCATTATGACATCAACCTTACTGTATATGATGCTTGCTGTGCAACTTGTCACAATGAGTTCTTCAATGTTTTAACTTGATTTGGCTGT
GAGATATTCTAAACCAAATTTGTATAATATATCATCTAGATTGATGACACTTCTTAGCAGTTTCCTTAGAGAGTTTACAGATGGGCCATGAATAGGTTAGGTGTTGGCTC
AATTCTTCCCATGTATCAATTATTGCGATTTGCATCATCAGGTTAGCCTGATGGTTTCAGAAAGTTTCTTTGCTGGTTTGGCACTTAAATTCAGCTGATTTTGATGGTA
Protein sequence
MEGSKGEAVFDSLNLNPQLFINEALNVVDDLVDDAFDFYQSYGSLCLISSSWVYNFVWLILFLSIIFLFRQASAALKTEGSDRSQDLIMGISRVRTLVQSGLDKRLAMWE
KYCLNHCFSVPEGFSLPSNDESPADTSISHDYLYDVDLDTELDSLRTKLSEVRKENVVLNQELQALERQTASSNSQVNYFNEALQLYEQNSVNEMFQEMMGTASELRAKI
GKLKKRRMEESKLTKVEKVHTNGDISHHHKGLSNVKLDDIQELLADLNTL
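
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nucleotides of the CDS listed above.
print(translate("ATGGAAGGAAGTAAGGGTGAGGCG"))  # MEGSKGEA
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result against the protein sequence is a quick consistency check for the record.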