; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G004440 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G004440
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBeta-N-acetylhexosaminidase
Genome locationCmo_Chr19:5428264..5434997
RNA-Seq ExpressionCmoCh19G004440
SyntenyCmoCh19G004440
Gene Ontology termsGO:0008757 - S-adenosylmethionine-dependent methyltransferase activity (molecular function)
InterPro domainsIPR044995 - Thiocyanate methyltransferase/thiol methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571820.1 hypothetical protein SDJN03_28548, partial [Cucurbita argyrosperma subsp. sororia]3.3e-14197.74Show/hide
Query:  TSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVMV
        ++A++SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRG+KDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQ QSIVEGSGSVMV
Subjt:  TSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVMV

Query:  SEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELL
        SEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELL
Subjt:  SEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELL

Query:  SKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        SKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  SKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

XP_022952960.1 uncharacterized protein LOC111455482 isoform X1 [Cucurbita moschata]2.4e-144100Show/hide
Query:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
        MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
Subjt:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM

Query:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

XP_022971993.1 uncharacterized protein LOC111470645 [Cucurbita maxima]7.8e-14398.5Show/hide
Query:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
        MTSA++SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQ QSIVEGSGSVM
Subjt:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM

Query:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        LSKVKNVIEKPYNDHLPL+EASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

XP_023554674.1 uncharacterized protein LOC111811869 isoform X1 [Cucurbita pepo subsp. pepo]4.3e-14198.14Show/hide
Query:  MTSAAA--SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGS
        MTSAAA  SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRP LNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQ QSIVEGSGS
Subjt:  MTSAAA--SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGS

Query:  VMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQ
        VMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQ
Subjt:  VMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQ

Query:  ELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        ELLSKVKNVIEKPYNDHLPLIEASRLCNMDIIS+VQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  ELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

XP_038888204.1 uncharacterized protein LOC120078074 isoform X1 [Benincasa hispida]1.7e-12687Show/hide
Query:  MTSAAASSSSSL----------SLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQ
        MTS+++SSS SL          SLR+IN  N FFSP PNFPLFT  S RRPSLN++L IQGH RGDKDGD S+PRKK+TTQM+GFGSNDE+GTQ+PTQ Q
Subjt:  MTSAAASSSSSL----------SLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQ

Query:  SIVEGSGSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSL
        SIVEGSGSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSL
Subjt:  SIVEGSGSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSL

Query:  KKQPPESQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        KKQPPESQELLSKVKNVIEKP+NDHL LIEASRLCNMDIISHVQQVICFAFHDSRLLM+TCQEAKNLRKIVTLFYLD
Subjt:  KKQPPESQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

TrEMBL top hitse value%identityAlignment
A0A1S3BHE5 uncharacterized protein LOC103489683 isoform X13.0e-12490.04Show/hide
Query:  SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVMVSEFKP
        SSSSS SLRFIN  NPFFSP+PNFPL    S RR SLN++L I+GH RGD DG+ SVP+KKNTT+M+GFGSNDE+GTQIPTQ QSIVEGSGSVMVSEFKP
Subjt:  SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVMVSEFKP

Query:  VPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVKN
        VPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVKN
Subjt:  VPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVKN

Query:  VIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        VIEKP+NDHL LIEASRLCNMDIISHVQQVICFAFHDSRLLM+TCQEAKNLRKIVTLFYLD
Subjt:  VIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

A0A6J1GLU9 uncharacterized protein LOC111455482 isoform X21.2e-117100Show/hide
Query:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
        MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
Subjt:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM

Query:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVKNVIEKPYNDHLPLIEASR
        LSKVKNVIEKPYNDHLPLIEASR
Subjt:  LSKVKNVIEKPYNDHLPLIEASR

A0A6J1GNC5 uncharacterized protein LOC111455482 isoform X11.2e-144100Show/hide
Query:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
        MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
Subjt:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM

Query:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

A0A6J1I3X4 uncharacterized protein LOC111468939 isoform X11.4e-11686.69Show/hide
Query:  SSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRP--SLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDE-SGTQIPTQVQSIVEGSGSVMVSEF
        SSSS SLRFIN  NPFF P+PN         RRP  SL++ L IQGH +GDKDGD  V + KNTTQM+GFG NDE +GTQIPTQ QSIVEGSGSVMV+EF
Subjt:  SSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRP--SLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDE-SGTQIPTQVQSIVEGSGSVMVSEF

Query:  KPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKV
        KPVPDVDYLQELLAIQQQGPR+IGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKV
Subjt:  KPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKV

Query:  KNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        KNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLM+TCQEAKNLRKIVTLFYLD
Subjt:  KNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

A0A6J1I8K9 uncharacterized protein LOC1114706453.8e-14398.5Show/hide
Query:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM
        MTSA++SSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQ QSIVEGSGSVM
Subjt:  MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVM

Query:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  VSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        LSKVKNVIEKPYNDHLPL+EASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
Subjt:  LSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G43945.1 unknown protein1.6e-9388.48Show/hide
Query:  SNDESGTQIPTQVQSIVEGSGSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRA
        S+++SG +IPTQ Q+IVEGSGSV VSE KP  DVDY+QELLAIQQQGPR+IGFFGTRNMGF+HQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRA
Subjt:  SNDESGTQIPTQVQSIVEGSGSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRA

Query:  EKPELLTVILPQSLKKQPPESQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        E+PELLTVILPQSLKKQPPESQELLSKV+NV+EKP+NDHLPL+EASRLCNMDIIS VQQVICFAFHDS+LLM+TCQEAKNLRKIVTLFYLD
Subjt:  EKPELLTVILPQSLKKQPPESQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD

AT3G59870.1 unknown protein1.9e-9167.53Show/hide
Query:  PNPFFSPTPNF---------------PLFTHFSTRRPSLNST-----LLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDE---SGTQIPTQVQSIVEGS
        P+PF S  PNF                 F  FS RR    S+     +      R   D D  V ++ N T    F S+++   +G +IPTQ Q+IVEG 
Subjt:  PNPFFSPTPNF---------------PLFTHFSTRRPSLNST-----LLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDE---SGTQIPTQVQSIVEGS

Query:  GSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPE
        GS+ VSE + VPDVDY+QELLAIQQQGPR IGFFGTRNMGF+HQELI+ILSYAMVITKNHIYTSGA+GTNAAVIRGALRAE+PELLTVILPQSLKKQPPE
Subjt:  GSVMVSEFKPVPDVDYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPE

Query:  SQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD
        SQELLSKV+NVIEKP+NDHLPL+EASRLCNMDIIS VQQ+ICFAFHDS+LLM+TCQEA+NLRKIVTLFYLD
Subjt:  SQELLSKVKNVIEKPYNDHLPLIEASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCTGCTGCTGCTTCTTCTTCTTCCTCTCTATCTCTTCGATTCATTAACACTCCCAATCCCTTCTTCTCCCCAACCCCTAATTTCCCCCTTTTCACTCATTTCTC
CACGCGCCGCCCTTCTCTCAATTCAACTTTGTTGATTCAAGGGCATAGAAGAGGAGATAAAGATGGAGATGGTAGCGTGCCTCGGAAGAAGAATACAACACAGATGTATG
GATTTGGGTCTAATGATGAGTCAGGCACTCAAATTCCAACCCAAGTCCAATCCATTGTGGAAGGATCAGGGTCAGTTATGGTGTCTGAGTTCAAACCAGTTCCTGATGTT
GATTATCTACAGGAGTTATTGGCTATTCAACAACAAGGTCCAAGAGCCATTGGTTTCTTTGGAACTCGAAATATGGGTTTCTTGCATCAAGAACTCATTGAGATTCTTAG
CTATGCAATGGTTATAACGAAGAATCACATCTATACTTCAGGAGCATCTGGAACCAATGCAGCAGTTATCAGAGGTGCTTTGAGGGCTGAGAAACCAGAACTTCTTACTG
TCATTTTGCCACAAAGTTTGAAAAAACAACCCCCTGAGAGCCAGGAATTATTATCCAAAGTTAAGAACGTGATAGAGAAGCCCTACAACGATCACCTACCTTTAATAGAA
GCTAGCAGGTTATGTAATATGGACATTATTTCTCATGTACAGCAAGTCATTTGCTTTGCATTTCATGATAGTAGGTTGCTCATGGATACTTGCCAAGAGGCAAAAAATCT
TCGAAAAATCGTTACACTTTTTTATCTCGACTAA
mRNA sequenceShow/hide mRNA sequence
TTGGTTAGCTCCTCATATATTATTCGTTTACAGAAAAAAGGGGAAGAAAAGTATGAAAGAGAGAAGAGAAAATCAGAGGAGAGAGCAAAAGGAAGGGTGCAACGCGCAAT
GACTTCTGCTGCTGCTTCTTCTTCTTCCTCTCTATCTCTTCGATTCATTAACACTCCCAATCCCTTCTTCTCCCCAACCCCTAATTTCCCCCTTTTCACTCATTTCTCCA
CGCGCCGCCCTTCTCTCAATTCAACTTTGTTGATTCAAGGGCATAGAAGAGGAGATAAAGATGGAGATGGTAGCGTGCCTCGGAAGAAGAATACAACACAGATGTATGGA
TTTGGGTCTAATGATGAGTCAGGCACTCAAATTCCAACCCAAGTCCAATCCATTGTGGAAGGATCAGGGTCAGTTATGGTGTCTGAGTTCAAACCAGTTCCTGATGTTGA
TTATCTACAGGAGTTATTGGCTATTCAACAACAAGGTCCAAGAGCCATTGGTTTCTTTGGAACTCGAAATATGGGTTTCTTGCATCAAGAACTCATTGAGATTCTTAGCT
ATGCAATGGTTATAACGAAGAATCACATCTATACTTCAGGAGCATCTGGAACCAATGCAGCAGTTATCAGAGGTGCTTTGAGGGCTGAGAAACCAGAACTTCTTACTGTC
ATTTTGCCACAAAGTTTGAAAAAACAACCCCCTGAGAGCCAGGAATTATTATCCAAAGTTAAGAACGTGATAGAGAAGCCCTACAACGATCACCTACCTTTAATAGAAGC
TAGCAGGTTATGTAATATGGACATTATTTCTCATGTACAGCAAGTCATTTGCTTTGCATTTCATGATAGTAGGTTGCTCATGGATACTTGCCAAGAGGCAAAAAATCTTC
GAAAAATCGTTACACTTTTTTATCTCGACTAAGTTCCTTGCTTATAATTCTGTAAATACATTGTTTAACAGAGCTTATTGCATATTCTGCCTAATTTTGCTTGTAAAGTT
GTTATTACAACTAAAACTTTCAAAAGAGAAAAACAACCAAAACAATTGGGAATCAGAACTATAAGAAATTTTCAGTCAAAAATAAGGATATGAATTGGCGCAACAAGGAA
GAAAACAAAACTAGCGTTCATTAGAAGAATGAAGAACACAAAACTGTTGCTGTACGTATGAAACGAACCCCGAAAAAAAGCTCACCGATTTGCCTTCGCAACTCTCT
Protein sequenceShow/hide protein sequence
MTSAAASSSSSLSLRFINTPNPFFSPTPNFPLFTHFSTRRPSLNSTLLIQGHRRGDKDGDGSVPRKKNTTQMYGFGSNDESGTQIPTQVQSIVEGSGSVMVSEFKPVPDV
DYLQELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVKNVIEKPYNDHLPLIE
ASRLCNMDIISHVQQVICFAFHDSRLLMDTCQEAKNLRKIVTLFYLD