; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011124 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011124
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionLEA_2 domain-containing protein
Genome locationChr01:2642613..2646026
RNA-Seq ExpressionHG10011124
SyntenyHG10011124
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039116.1 proline-rich receptor-like protein kinase PERK10 [Cucumis melo var. makuwa]2.7e-11886.69Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N R+TQ PLPPPPSR P NN+HRPLPPPPSRAP N+H+  RSP  P + PN ++ NTRYPSPPSPP+SRRQHFGYG A    SSS SFR
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AI+LVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

KAG6600338.1 hypothetical protein SDJN03_05571, partial [Cucurbita argyrosperma subsp. sororia]1.5e-11686.46Show/hide
Query:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS
        MEEMTSRP+V NSRN   QRPLPPPPSR    P NNNHRPLPPPPSRAP NV H+THRS  PS+PPSRPNSDS N R+PSPP SPP SRRQHFGY T  +
Subjt:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS

Query:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV
        SSSSSASFRGCCCCLCLLF FIALLALAIVLV+VLAVKPKKPQFDLQRVGVQYMGIT PNLFSLSS+D++T  T T T+TSA+LSLNIRLLFTAVNPNKV
Subjt:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV

Query:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEA IAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVL+FDSPGVQ
Subjt:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

XP_004140800.1 NDR1/HIN1-like protein 2 [Cucumis sativus]2.5e-11686.33Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N RNTQ PLPPPPSR PDNN+  PLPPPPSRAP N+    RSP  P + PNS++ NTRYPSPPSPP+SRRQHFGYG A    SSS S R
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD ETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

XP_008456109.1 PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis melo]2.7e-11886.69Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N R+TQ PLPPPPSR P NN+HRPLPPPPSRAP N+H+  RSP  P + PN ++ NTRYPSPPSPP+SRRQHFGYG A    SSS SFR
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AI+LVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

XP_038880209.1 proline-rich receptor-like protein kinase PERK10 [Benincasa hispida]8.3e-12891.7Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDN-NNHRPLPPPPSRAPLNVHNTHRS-PSLPPSRPNSDSHNTRYP---SPPSPPTSRRQHFGYGTASSSSSS
        MEEMTSRPHV N RNTQRPLPPPPSR  DN N+HRPLPPPPSRAP NVHNTHRS P  PPSR NSD+HNTRYP   SPPSPPTSRRQHFGYGTASSSSSS
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDN-NNHRPLPPPPSRAPLNVHNTHRS-PSLPPSRPNSDSHNTRYP---SPPSPPTSRRQHFGYGTASSSSSS

Query:  SASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTAT------STSASLSLNIRLLFTAVNPNK
        SASFRGCCCCLCLLF FIALLALAIVLVI+LAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTAT      +TSASLSLNIRLLFTAVNPNK
Subjt:  SASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTAT------STSASLSLNIRLLFTAVNPNK

Query:  VGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

TrEMBL top hitse value%identityAlignment
A0A0A0L9B0 LEA_2 domain-containing protein1.2e-11686.33Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N RNTQ PLPPPPSR PDNN+  PLPPPPSRAP N+    RSP  P + PNS++ NTRYPSPPSPP+SRRQHFGYG A    SSS S R
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD ETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

A0A1S3C355 proline-rich receptor-like protein kinase PERK101.3e-11886.69Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N R+TQ PLPPPPSR P NN+HRPLPPPPSRAP N+H+  RSP  P + PN ++ NTRYPSPPSPP+SRRQHFGYG A    SSS SFR
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AI+LVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

A0A5A7TAP5 Proline-rich receptor-like protein kinase PERK101.3e-11886.69Show/hide
Query:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR
        MEEMTSRP  +N R+TQ PLPPPPSR P NN+HRPLPPPPSRAP N+H+  RSP  P + PN ++ NTRYPSPPSPP+SRRQHFGYG A    SSS SFR
Subjt:  MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFR

Query:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT
        GCCCCLCLLF FIALLA+AI+LVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATT+T TSASLSLNIRLLFTAVNPNKVGIKYG+SRFT
Subjt:  GCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFT

Query:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        VMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
Subjt:  VMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

A0A6J1FQG4 uncharacterized protein LOC1114478542.1e-11686.11Show/hide
Query:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS
        MEEMTSRP+V NSRN   QRPLPPPPSR    P NNNHRPLPPPPSRAP NV H+THRS  PS+PPSRPNSDS N R+PSPP SPP SR+QHFGY T  +
Subjt:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS

Query:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV
        SSSSSASFRGCCCCLCLLF FIALLALAIVLV+VLAVKPKKPQFDLQRVGVQYMGIT PNLFSLSS+D++T  T T T+TSA+LSLNIRLLFTAVNPNKV
Subjt:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV

Query:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEA IAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVL+FDSPGVQ
Subjt:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

A0A6J1J1C4 proline-rich receptor-like protein kinase PERK103.0e-11585.42Show/hide
Query:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS
        MEEMTSRP+V NSRN   QRPLPPPPSR    P NNNHRPLPPPPSRAP NV H+ H S  PS+PPSRPNS+S N R+PSPP SPP SRRQHFGY T  +
Subjt:  MEEMTSRPHVMNSRNT--QRPLPPPPSRT---PDNNNHRPLPPPPSRAPLNV-HNTHRS--PSLPPSRPNSDSHNTRYPSPP-SPPTSRRQHFGYGTASS

Query:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV
        SSSSSASFRGCCCCLCLLF FIALLALAIVLV+VLAVKPKKPQFDLQRVGVQYMGIT PNLFSLSS+D++T  T T T+TSA+LSLNIRLLFTAVNPNKV
Subjt:  SSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAAT-TATSTSASLSLNIRLLFTAVNPNKV

Query:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ
        GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEA IAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVL+FDSPGVQ
Subjt:  GIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.5e-6363.72Show/hide
Query:  PPSPPTSRRQHFGYGTA---------SSSSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETA
        PP P +SR    G   A         S SSSSSAS +GCCCCL LLF F+ALL LA+VL+++LAVKPKKPQFDLQ+V V YMGI+ P             
Subjt:  PPSPPTSRRQHFGYGTA---------SSSSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETA

Query:  ATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEV
        +     T+ASLSL IR+LFTAVNPNKVGI+YG S FTVMY+G+PLG+A VPGFYQ+AHS + VEATI+VDRVNL+QA AADL+RDASLNDRVEL V G+V
Subjt:  ATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEV

Query:  GARIRVLDFDSPGVQ
        GA+IRV++FDSPGVQ
Subjt:  GARIRVLDFDSPGVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAATGACGTCACGACCTCATGTCATGAACTCTCGCAACACGCAACGTCCTCTTCCACCGCCACCGTCAAGAACACCCGACAACAACAATCACCGTCCTCTTCC
GCCTCCACCTTCAAGAGCTCCGCTCAATGTTCACAACACTCACCGTTCTCCTTCATTGCCACCGTCCAGGCCTAATTCCGACTCTCACAACACTCGCTATCCATCTCCGC
CGTCGCCGCCTACCTCTCGTCGCCAACATTTTGGTTACGGCACGGCATCATCCTCGTCTTCCTCATCGGCTTCCTTCCGAGGCTGCTGCTGCTGCCTCTGCCTCCTCTTC
TTCTTCATCGCTCTCCTCGCTCTCGCTATCGTCCTCGTCATTGTTCTCGCCGTCAAACCTAAAAAGCCTCAATTCGATCTCCAGCGAGTCGGCGTTCAATACATGGGGAT
AACCGCTCCAAATCTCTTCTCATTGTCTTCCTCTGACGCCGAGACCGCTGCGACAACGGCGACGTCCACCTCCGCATCGTTATCGCTTAACATTCGATTGCTGTTCACGG
CGGTGAATCCTAACAAAGTCGGAATAAAGTACGGGAATTCGAGGTTCACAGTGATGTACCGAGGGATTCCGTTAGGGAAAGCGATAGTTCCTGGATTTTACCAAGAGGCA
CACAGTCAGAGAGAGGTGGAGGCGACGATCGCGGTGGATCGAGTGAATTTGCTTCAGGCGGACGCCGCCGATCTCATCAGAGACGCTTCGTTGAACGATCGAGTAGAACT
GAGGGTTTTGGGGGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCCGGCGTTCAGGACGAAGAAGAAACTGAAAATGATCGATGTATGACAGGTGTCGGTC
GATTGCTCAATAGTGATAAGTCCAAGGAATCAATCTTTGACTTCCAAGCAATGTGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAATGACGTCACGACCTCATGTCATGAACTCTCGCAACACGCAACGTCCTCTTCCACCGCCACCGTCAAGAACACCCGACAACAACAATCACCGTCCTCTTCC
GCCTCCACCTTCAAGAGCTCCGCTCAATGTTCACAACACTCACCGTTCTCCTTCATTGCCACCGTCCAGGCCTAATTCCGACTCTCACAACACTCGCTATCCATCTCCGC
CGTCGCCGCCTACCTCTCGTCGCCAACATTTTGGTTACGGCACGGCATCATCCTCGTCTTCCTCATCGGCTTCCTTCCGAGGCTGCTGCTGCTGCCTCTGCCTCCTCTTC
TTCTTCATCGCTCTCCTCGCTCTCGCTATCGTCCTCGTCATTGTTCTCGCCGTCAAACCTAAAAAGCCTCAATTCGATCTCCAGCGAGTCGGCGTTCAATACATGGGGAT
AACCGCTCCAAATCTCTTCTCATTGTCTTCCTCTGACGCCGAGACCGCTGCGACAACGGCGACGTCCACCTCCGCATCGTTATCGCTTAACATTCGATTGCTGTTCACGG
CGGTGAATCCTAACAAAGTCGGAATAAAGTACGGGAATTCGAGGTTCACAGTGATGTACCGAGGGATTCCGTTAGGGAAAGCGATAGTTCCTGGATTTTACCAAGAGGCA
CACAGTCAGAGAGAGGTGGAGGCGACGATCGCGGTGGATCGAGTGAATTTGCTTCAGGCGGACGCCGCCGATCTCATCAGAGACGCTTCGTTGAACGATCGAGTAGAACT
GAGGGTTTTGGGGGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCCGGCGTTCAGGACGAAGAAGAAACTGAAAATGATCGATGTATGACAGGTGTCGGTC
GATTGCTCAATAGTGATAAGTCCAAGGAATCAATCTTTGACTTCCAAGCAATGTGGATTTGA
Protein sequenceShow/hide protein sequence
MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLF
FFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEA
HSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQDEEETENDRCMTGVGRLLNSDKSKESIFDFQAMWI